The Wikipedia Table Scraper is a command-line tool that extracts tables from Wikipedia pages and saves them as CSV files. It is intended for anyone who wants to analyze structured data from Wikipedia.
- Easy to Use: Simple command-line interface.
- Fast: Efficient scraping with minimal dependencies.
- Flexible: Handles multiple tables on a single page.
- Portable: Outputs tables in CSV format for easy analysis.
- Python 3.6 or higher
- pip (Python package installer)
- Clone the repository:
git clone https://github.com/yourusername/wikipedia-table-scraper.git
cd wikipedia-table-scraper
- Install the required dependencies:
pip install -r requirements.txt
To scrape tables from a specific Wikipedia page, run the following command:
python scraper.py "https://en.wikipedia.org/wiki/List_of_countries_by_population_(United_Nations)"
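For readers curious about what a run like the one above involves, here is a minimal sketch of the general technique: parse every table on a page and write each one to its own CSV file. It is not the code in scraper.py. The names `TableParser`, `save_tables`, `fetch_html`, the `table_N.csv` file naming, and the User-Agent string are all assumptions made for illustration. The sketch uses only the Python standard library and assumes the page markup closes its `<tr>`, `<td>`, and `<th>` tags explicitly.

```python
import csv
from html.parser import HTMLParser
from pathlib import Path
from urllib.request import Request, urlopen


class TableParser(HTMLParser):
    """Collect every top-level <table> as a list of rows of cell text.

    Hypothetical helper, not part of the project. Text inside nested
    tables is folded into the enclosing cell of the outer table.
    """

    def __init__(self):
        super().__init__()
        self.tables = []    # finished tables, each a list of rows
        self._table = None  # rows of the table currently being read
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the current cell
        self._depth = 0     # nesting depth of <table> elements

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._depth += 1
            if self._depth == 1:
                self._table = []
        elif self._depth == 1:
            if tag == "tr":
                self._row = []
            elif tag in ("td", "th") and self._row is not None:
                self._cell = []

    def handle_endtag(self, tag):
        if tag == "table" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                self.tables.append(self._table)
                self._table = None
        elif self._depth == 1:
            if tag in ("td", "th"):
                if self._cell is not None and self._row is not None:
                    # Join fragments and collapse runs of whitespace.
                    text = " ".join("".join(self._cell).split())
                    self._row.append(text)
                self._cell = None
            elif tag == "tr" and self._row is not None:
                self._table.append(self._row)
                self._row = None
                self._cell = None

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)


def fetch_html(url):
    """Download a page as text (the User-Agent value is an assumption)."""
    request = Request(url, headers={"User-Agent": "table-scraper-example/0.1"})
    with urlopen(request) as response:
        return response.read().decode("utf-8")


def save_tables(html, out_dir):
    """Write each table found in html to out_dir/table_N.csv."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for index, rows in enumerate(parser.tables, start=1):
        path = out_dir / f"table_{index}.csv"
        with path.open("w", newline="", encoding="utf-8") as handle:
            csv.writer(handle).writerows(rows)
        paths.append(path)
    return paths
```

With these hypothetical helpers, `save_tables(fetch_html(url), "output")` would write one CSV file per table on the page. Cells that span several rows or columns are not expanded in this sketch, so tables that use `rowspan` or `colspan` may come out with ragged rows.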