If a page contains forms with user specific tokens, it needs to be excluded from page caching. This spider crawls a website for such forms.
Only detects YOOEssentials Forms and page caching using W3 Total Cache or WP Optimize.
- install Python 3.*
- install Scrapy
scrapy crawl form_spider -a url=https://www.example.com/
scrapy crawl form_spider -a url=https://www.example.com/ -a display=tree # displays results as tree
scrapy crawl form_spider -a url=https://www.example.com/ -a display=combined # displays results as list and tree
On Windows, the Anaconda Shell needs to be used.
- Move repo to e.g. ~/Applications/form-spider
- Add
export PATH="$HOME/Applications/form-spider"
to~/.bashrc
- Execute using
form-spider.sh https://www.example.com/ tree