bash-crawler

This script is written in bash shell script, and this reads standard XML sitemap file line by line, extract url and make CURL based HTTP request to web host. The advantage of this scrpit is on CMS based websites having thousands of web pages, where most of the pages remain unchanged. In enviornments where memcache or varnish cache is used to build snapshot of web pages, this crawler reads every webpage and builds internal cache. Helps in performance of CMS driven websites.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
crawl.sh		crawl.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bash-crawler

About

Releases

Packages

Languages

jindalatul/bash-crawler

Folders and files

Latest commit

History

Repository files navigation

bash-crawler

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages