This script is written in bash shell script, and this reads standard XML sitemap file line by line, extract url and make CURL based HTTP request to web host. The advantage of this scrpit is on CMS based websites having thousands of web pages, where most of the pages remain unchanged. In enviornments where memcache or varnish cache is used to build snapshot of web pages, this crawler reads every webpage and builds internal cache. Helps in performance of CMS driven websites.
-
Notifications
You must be signed in to change notification settings - Fork 0
jindalatul/bash-crawler
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published