Skip to content

Latest commit

 

History

History
29 lines (25 loc) · 1.12 KB

readme.md

File metadata and controls

29 lines (25 loc) · 1.12 KB

Some Demo scripts 1_scrapy_selenium_scraper_clean:


Scrapes HD images from google.com This is a pretty simple scraper, but it demonstrates combined use of scrapy with selenium, that, and I cleaned it up a bit an wrote some comments.

If you couldn't be bothered running it I have a youtube video which demonstrates it. Its not that fancy, but it saved me a lot of time. (I work also doing MLS Listings, they need lots of stock images)

The Video: https://www.youtube.com/watch?v=cVXg5elLmu0

To Run: scrapy crawl my-house-spider


2_realtor_ca_script - pure scrapy script that scrapes a real estate site


3_selenese_scripts - this is an example of some of the scripts with which I make most of my money. Some javascript is involved, but it is I admit pretty basic


4_kijiji - Another scrapy real estate scraper


5_craigs_list_scraper - This script is actually kind of interesting, it uses a proxy. It "flags" craigslist entries (blocks them basically)


6_misc_scrapers - google.com scraper - very unorganised


7_hh_autoworker - nicely commented script that applies for jobs automatically on hh.ru, it logs in to the site