UNDER DEVELOPMENT
Node runner is a scrapy command designed to manage scrapy spiders via api.
This tool aims to:
- easily launch multiple scrapy spiders on the node
- expose api wich can be used by external scheduler
- support graceful shutdown processes to ensure data integrity and minimal disruption
- synchronizing its configuration with ZooKeeper, allowing seamless integration and accessibility by external scheduler
This command is supposed to used with scrapy-node-operator
component which is under development now.
- start docker compose
- install deps with
poetry install
- go into scrapy project with
cd example
- start scrapy node with
scrapy node
- send
{"id": "uniq-id-1", "spider": "quotes"}
tohttp://localhost:8000/start
Note: This document is subject to further updates.