Skip to content

spsu/forager

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Forager

Forager is a webcrawler for the spsu.edu domain that finds and generates web reports on missing media files and hyperlinks throughout the website.

This is the readme file for the project. We will update it with more documentation as it becomes available.

Required Python Libraries

  • flask -- excellent, lightweight web platform
  • lxml -- provides HTML parsing
  • requests -- performs HTTP

These can be installed on Linux or Windows with pip, the Python package manager, eg:

pip install flask pip install lxml pip install requests

To install lxml on Windows, you'll need the libxml2 and libxslt. The easiest way to fix this is to simply use the precompiled lxml found here:

Releases

No releases published

Packages

No packages published

Languages