Skip to content

Latest commit

 

History

History
28 lines (24 loc) · 1.4 KB

File metadata and controls

28 lines (24 loc) · 1.4 KB

Wikipedia Population Table Data Extraction

This project demonstrates web scraping using Selenium WebDriver to extract data from an HTML table on a Wikipedia page. Specifically, it retrieves the list of countries and territories by total population, presenting a real-world example of data extraction and automation using Selenium.

Features

  • Automates navigation to a Wikipedia page on country populations.
  • Extracts data from an HTML table, including:
    • Location (Country or territory)
    • Population
    • Percentage of world population
    • Date of population data
    • Source of the data
    • Notes
  • Processes table rows dynamically to handle updates to the table structure or content.
  • Uses JavaScript for smooth scrolling to the target table.

Prerequisites

Ensure you have the following before running the project:

  1. Java Development Kit (JDK) - Version 8 or above.
  2. Google Chrome - Latest stable version.
  3. ChromeDriver - Version compatible with your Chrome browser.
  4. Selenium WebDriver - Included in the project dependencies.

Technologies Used

  • Java - The programming language for the project.
  • Selenium WebDriver - For web element interaction and automation.
  • Google Chrome & ChromeDriver - For browser-based automation.
  • JavaScript Executor - For advanced browser interactions like scrolling.