Skip to content

asktoby.php Version 2.0.0

Compare
Choose a tag to compare
@topherPedersen topherPedersen released this 23 May 03:46
· 38 commits to master since this release

"Done is Better Than Perfect"

I completely rewrote the tobycrawler bot to increase its performance. The old crawler ran in series, crawling one site before crawling the next. Version 2.0.0 has been rewritten to crawl 20 sites at a time in parallel. This has resulted in a 10X performance increase. However, additional performance increases will likely need to be made through increasing server bandwidth, as bandwidth appears to be the limiting factor in crawl speed, much more than computing power. Also, please note that this code was committed to github as it was when I first ran the new crawler successfully, and has not been refactored at all. Likewise, much of the new code was written on a quirky text editor on my Chromebook which has a tendency to throw off all of my indentation. But hey, it works!