scrape2gether is a web scraping tool that collects data from websites through ordinary human browsing. By relying on human traffic instead of automated bots, scrape2gether sidesteps limitations that apply to traditional web scrapers, such as robots.txt rules, and promotes a user-driven approach to web data collection.
This extension is currently in development for Google Chrome.
Follow these steps to install scrape2gether on your Chrome browser:
- Clone the repository to your local machine:
git clone https://github.com/tommi-fish/scrape2gether-extension
- Open Chrome and navigate to the Extensions page (chrome://extensions).
- Enable Developer Mode (toggle in the top-right corner of the page).
- Click the Load Unpacked button.
- Select the folder where you cloned the repository (specifically the scrape2gether-extension folder).
Your scrape2gether extension is now ready to use!
- Open the scrape2gether popup by clicking on its icon in the Chrome toolbar.
- Use the interface to add CSS selectors for the elements you want to scrape.
- Provide a name and a valid CSS selector for each target.
- View and manage your saved selectors in the popup.
- Scrape live data from web pages and download or use the collected data as needed.
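The usage steps above pair a name with a CSS selector for each target. As a rough illustration of how such targets might be applied to a page, here is a minimal TypeScript sketch. The names `SelectorTarget`, `QueryRoot`, and `scrapeTargets` are hypothetical and are not taken from the scrape2gether source, so the extension's actual implementation may well differ.

```typescript
// Hypothetical shape of one saved target: a label plus a CSS selector.
interface SelectorTarget {
  name: string;
  selector: string;
}

// Minimal structural type for anything that can be queried with a selector.
// A DOM Document or Element should satisfy this shape.
interface QueryRoot {
  querySelectorAll(selector: string): ArrayLike<{ textContent: string | null }>;
}

// Collect the trimmed text of every element matched by each target.
function scrapeTargets(
  root: QueryRoot,
  targets: SelectorTarget[]
): Record<string, string[]> {
  const result: Record<string, string[]> = {};
  for (const { name, selector } of targets) {
    try {
      const matches = Array.from(root.querySelectorAll(selector));
      result[name] = matches.map((el) => (el.textContent ?? "").trim());
    } catch {
      // The DOM's querySelectorAll throws on an invalid selector;
      // this sketch records an empty result instead of failing.
      result[name] = [];
    }
  }
  return result;
}
```

In a content script, something like `scrapeTargets(document, savedTargets)` could then produce an object keyed by target name. Returning an empty list for an invalid selector is a design choice made for this sketch only; rejecting the selector when it is first entered would be an equally reasonable approach.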
- Customizable Selectors: Add and manage custom CSS selectors for any webpage elements.
- Live Data Scraping: Collect data dynamically from the elements you specify.
- Visual Feedback: Highlight scraped elements on the page for easy verification.
- Persistent Storage: Save your selectors locally to re-use them across sessions.
- JSON Export: View and export collected data in a structured JSON format.
- Community Sharing: Share and use pre-configured selectors created by other scrape2gether users.
- Browser Compatibility: Extend support to Firefox and other Chromium-based browsers.
- Advanced Export Options: Export data to CSV, Excel, or Google Sheets.
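To make the export features listed above more concrete, the following TypeScript sketch shows one way scraped data could be serialized to JSON and to CSV. It assumes the data is an object mapping each selector name to a list of strings; the `ScrapedData` type and the `toJson`, `csvEscape`, and `toCsv` functions are hypothetical and not part of the scrape2gether codebase.

```typescript
// Assumed data shape: selector name -> list of scraped text values.
type ScrapedData = Record<string, string[]>;

// Pretty-print the data as JSON with two-space indentation.
function toJson(data: ScrapedData): string {
  return JSON.stringify(data, null, 2);
}

// Quote a field if it contains a comma, a double quote, or a line break,
// doubling any embedded double quotes.
function csvEscape(field: string): string {
  return /[",\r\n]/.test(field) ? `"${field.replace(/"/g, '""')}"` : field;
}

// Emit one "name,value" row per scraped value, preceded by a header row.
function toCsv(data: ScrapedData): string {
  const lines: string[] = ["name,value"];
  for (const name of Object.keys(data)) {
    for (const value of data[name]) {
      lines.push(`${csvEscape(name)},${csvEscape(value)}`);
    }
  }
  return lines.join("\r\n");
}
```

The long "name,value" layout is only one possible CSV shape. It was chosen here because each selector can match a different number of elements, which makes a one-column-per-selector layout awkward.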
To contribute:
- Fork the repository.
- Create a new branch:
git checkout -b feature-name
- Make your changes and commit them:
git commit -m "Add description of changes"
- Push to your fork and submit a pull request:
git push origin feature-name
For any questions or support, feel free to reach out via GitHub Issues.
- Thanks to the open-source community for inspiration and support.
- Built with love and a mission to democratize web scraping.