My approach to books.toscrape.com with Scrapy CrawlSpider and ItemLoaders
- python3
- pip3
- itemadapter==0.6.0
- itemloaders==1.0.4
- Scrapy==2.6.1
Clone the repo and install dependencies, dependencies are available at requirements.txt
git clone https://github.com/egarcia2506/books-to-scrape
cd books-to-scrape
pip3 install -r requirements.txt
scrapy crawl book -o books.json