Wikipedia Population Table Data Extraction

This project uses Selenium WebDriver to scrape an HTML table from a Wikipedia page, namely the list of countries and territories by total population. It serves as a practical example of browser automation and data extraction with Selenium.

Features

- Automates navigation to a Wikipedia page on country populations.
- Extracts data from an HTML table, including:
  - Location (country or territory)
  - Population
  - Percentage of world population
  - Date of population data
  - Source of the data
  - Notes
- Processes table rows dynamically to handle updates to the table structure or content.
- Uses JavaScript for smooth scrolling to the target table.
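
The row-processing step can be sketched in plain Java, independent of the browser. The example below is a minimal illustration, not the project's actual code: `PopulationRow` and `PopulationRowParser` are hypothetical names, and the cell order is assumed to follow the field list above. In a Selenium run, the cell strings would typically come from calling `getText()` on each `td` element of a row. The sketch sticks to Java 8 syntax to match the stated JDK requirement.

```java
import java.util.Arrays;
import java.util.List;

/** One table row; fields mirror the columns listed under Features. */
final class PopulationRow {
    final String location;
    final long population;
    final String percentOfWorld;
    final String date;
    final String source;
    final String notes;

    PopulationRow(String location, long population, String percentOfWorld,
                  String date, String source, String notes) {
        this.location = location;
        this.population = population;
        this.percentOfWorld = percentOfWorld;
        this.date = date;
        this.source = source;
        this.notes = notes;
    }
}

/** Hypothetical helper: converts raw cell text into a PopulationRow. */
final class PopulationRowParser {

    /** Returns null for rows without a numeric population (e.g. header rows). */
    static PopulationRow parse(List<String> cells) {
        // Drop footnote markers such as "[a]", then keep only the digits.
        String digits = cell(cells, 1)
                .replaceAll("\\[[^\\]]*\\]", "")
                .replaceAll("[^0-9]", "");
        if (digits.isEmpty()) {
            return null;
        }
        return new PopulationRow(cell(cells, 0), Long.parseLong(digits),
                cell(cells, 2), cell(cells, 3), cell(cells, 4), cell(cells, 5));
    }

    /** Missing trailing cells become empty strings instead of throwing. */
    private static String cell(List<String> cells, int index) {
        return index < cells.size() ? cells.get(index).trim() : "";
    }

    public static void main(String[] args) {
        // Placeholder values, not real statistics.
        List<String> cells = Arrays.asList(
                "Exampleland", "1,234,567[a]", "0.02%", "1 Jan 2024", "National estimate");
        PopulationRow row = parse(cells);
        System.out.println(row.location + " -> " + row.population);
        // prints: Exampleland -> 1234567
    }
}
```

Reading cells by index with a fallback to an empty string means a row with fewer columns than expected (here, no Notes cell) still parses, which is one simple way to tolerate small changes in the table.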

Prerequisites

Ensure you have the following before running the project:

- Java Development Kit (JDK): version 8 or above.
- Google Chrome: latest stable version.
- ChromeDriver: version compatible with your Chrome browser.
- Selenium WebDriver: included in the project dependencies.
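
A common source of start-up failures is a ChromeDriver build whose major version differs from the installed Chrome. The sketch below shows one way to compare the two, assuming you have captured the text printed by `chromedriver --version` and by your Chrome binary's `--version` flag. `DriverVersionCheck` is a hypothetical helper, not part of this project or of Selenium, and the version strings in `main` are illustrative only.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: compares Chrome and ChromeDriver major versions. */
final class DriverVersionCheck {

    // Matches a four-part version such as 126.0.6478.127 and captures the major part.
    private static final Pattern VERSION =
            Pattern.compile("(\\d+)\\.\\d+\\.\\d+\\.\\d+");

    /** Returns the major version found in the text, or -1 if none is present. */
    static int majorVersion(String versionOutput) {
        Matcher m = VERSION.matcher(versionOutput);
        return m.find() ? Integer.parseInt(m.group(1)) : -1;
    }

    /** True only when both strings contain a version and the major parts agree. */
    static boolean sameMajor(String chromeOutput, String driverOutput) {
        int chrome = majorVersion(chromeOutput);
        return chrome != -1 && chrome == majorVersion(driverOutput);
    }

    public static void main(String[] args) {
        // Illustrative strings; substitute the output from your own machine.
        String chrome = "Google Chrome 126.0.6478.127";
        String driver = "ChromeDriver 126.0.6478.126 (build details)";
        System.out.println(sameMajor(chrome, driver));
        // prints: true
    }
}
```

Matching on the major version alone is a simplification. Treat a `false` result as a prompt to download a different ChromeDriver build rather than as a definitive compatibility test.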

Technologies Used

- Java: the programming language for the project.
- Selenium WebDriver: web element interaction and automation.
- Google Chrome and ChromeDriver: browser-based automation.
- JavaScript Executor: advanced browser interactions such as scrolling.
