## What is this Python project? #495

Open
Exceltuxil opened this issue Oct 19, 2022 · 0 comments

@Exceltuxil

What is this Python project?

It's a framework for scraping HTML sites and aggregating data from multiple sites in the same category (e.g. banking sites, news sites, video sites).
It ships with ready-made modules for popular websites and ready-made applications to interact with them.
Think youtube-dl, applied to domains other than video!
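
To make the "aggregate sites from the same category" idea concrete, here is a minimal sketch in plain Python. It is not the project's actual API: `Video`, `VideoBackend`, `SiteABackend`, `SiteBBackend`, and `search_all` are hypothetical names, and the backends return canned data instead of scraping anything. It only shows how one shared interface per category lets a single application query several sites at once.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class Video:
    """Common result type for every site in the (hypothetical) video category."""
    title: str
    url: str
    source: str


class VideoBackend(ABC):
    """Interface that every video-site backend is assumed to implement."""
    name: str

    @abstractmethod
    def search(self, query: str) -> list[Video]:
        """Return the videos on this site whose title matches the query."""


class _CannedBackend(VideoBackend):
    """Helper for this sketch: filters a hard-coded catalog instead of scraping."""
    catalog: list[tuple[str, str]] = []

    def search(self, query: str) -> list[Video]:
        q = query.lower()
        return [Video(t, u, self.name) for t, u in self.catalog if q in t.lower()]


class SiteABackend(_CannedBackend):
    name = "site-a"
    catalog = [
        ("Cat compilation", "https://site-a.example/1"),
        ("Dog tricks", "https://site-a.example/2"),
    ]


class SiteBBackend(_CannedBackend):
    name = "site-b"
    catalog = [("Cats of the internet", "https://site-b.example/9")]


def search_all(backends: list[VideoBackend], query: str) -> list[Video]:
    """Aggregate results from every backend in the same category."""
    results: list[Video] = []
    for backend in backends:
        results.extend(backend.search(query))
    return results


if __name__ == "__main__":
    for video in search_all([SiteABackend(), SiteBBackend()], "cat"):
        print(f"[{video.source}] {video.title} -> {video.url}")
```

Because the application only depends on `VideoBackend`, supporting another site in this sketch means adding one more backend class, with no change to `search_all`.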

What's the difference between this Python project and similar ones?

  • New websites can be scraped by writing declarative-style extraction rules
  • It provides a standardized API per category of site, each dedicated to a task (e.g. banking, web forums, video sites, news sites, music lyrics sites)
    • Scraped websites are grouped into those categories
  • The project comes with many existing backends for real-life websites
  • It has an internal upgrade system
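
As a rough illustration of what "declarative-style extraction rules" can look like, the sketch below uses only the standard library's `html.parser`. It is an assumption-laden toy, not the project's own rule syntax: `RULES`, `RuleExtractor`, `extract`, and the sample `PAGE` are all hypothetical. The point is that the site-specific part is a small table of rules, while the parsing logic stays generic.

```python
from html.parser import HTMLParser

# Hypothetical rules: field name -> (tag, CSS class) whose text should be captured.
RULES = {
    "title": ("h1", "headline"),
    "author": ("span", "byline"),
}


class RuleExtractor(HTMLParser):
    """Generic extractor driven entirely by a rules table."""

    def __init__(self, rules):
        super().__init__()
        self.rules = rules
        self.result = {}
        self._current = None  # field currently being captured, if any

    def handle_starttag(self, tag, attrs):
        if self._current is not None:
            return  # already inside a captured element
        classes = (dict(attrs).get("class") or "").split()
        for field, (want_tag, want_class) in self.rules.items():
            if tag == want_tag and want_class in classes and field not in self.result:
                self._current = field
                self.result[field] = ""
                break

    def handle_data(self, data):
        if self._current is not None:
            self.result[self._current] += data

    def handle_endtag(self, tag):
        # Simplification: assumes the captured element does not nest a tag of the same name.
        if self._current is not None and tag == self.rules[self._current][0]:
            self.result[self._current] = self.result[self._current].strip()
            self._current = None


def extract(html, rules):
    """Apply the rules to an HTML string and return a dict of extracted fields."""
    parser = RuleExtractor(rules)
    parser.feed(html)
    parser.close()
    return parser.result


PAGE = (
    '<html><body><h1 class="headline"> Example story </h1>'
    '<span class="byline">A. Writer</span></body></html>'
)

if __name__ == "__main__":
    print(extract(PAGE, RULES))
```

Supporting a new page layout in this sketch means editing `RULES` only, which is the appeal of the declarative approach.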

Originally posted by @hydrargyrum in vinta/awesome-python#1441
