reddit_crawler

A python web crawler that I wrote from scratch.

It starts at www.reddit.com and searches for href tags. Subreddits: creates a file called subs.txt and appends all patterns matching /r/* to this file with no repetition. Users: creates a file called users.txt and appends all patterns matching /user/* to this file with no repetition. Topics: creates a file called topics.txt and appends all patterns matching /t/* to this file with no repetition.

To run:

Install the requirements in requirements.txt
python3 main.py

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

reddit_crawler

About

Releases

Packages

Languages

colincron/reddit_crawler

Folders and files

Latest commit

History

Repository files navigation

reddit_crawler

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages