A simple Python script that scrapes web pages for PDF files and downloads them to a local directory.
- Clone this repository.
- Install Python.
- Install Pip.
- Install the required packages by running `pip install -r requirements.txt` in your terminal.
- Set the web page URL and output folder location in the `main.py` file here:
# Define your URL
url = "https://yourWebsiteURL"
# By default, the script will download PDF files to the downloads folder.
# You can change the folder location by updating the folder_location variable.
# Example: folder_location = r'/Users/yourname/Documents'
folder_location = r'./downloads'
- Run the script with `python main.py`.
- PDF files will be downloaded to the folder you specified (`./downloads` by default).
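For reference, the core of a script like this looks roughly like the sketch below. This is a minimal sketch, not the exact contents of `main.py`, and it assumes `requests` and `BeautifulSoup` (from the `beautifulsoup4` package) are the dependencies listed in `requirements.txt`:

import os
from urllib.parse import urljoin

import requests
from bs4 import BeautifulSoup

# Same variables as in main.py
url = "https://yourWebsiteURL"
folder_location = r'./downloads'

# Create the output folder if it does not already exist
os.makedirs(folder_location, exist_ok=True)

# Fetch the page and collect every link that ends in .pdf
response = requests.get(url)
soup = BeautifulSoup(response.text, "html.parser")

for link in soup.select("a[href$='.pdf']"):
    # Resolve relative links against the page URL
    pdf_url = urljoin(url, link["href"])
    filename = os.path.join(folder_location, os.path.basename(pdf_url))

    # Download each PDF and write it to the output folder
    with open(filename, "wb") as f:
        f.write(requests.get(pdf_url).content)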
Important
This tool is not intended to break copyright laws and is for personal use only. It merely automates the retrieval of publicly available data using standard web scraping techniques. The copyright of the data retrieved belongs to its respective owners, and I am not responsible for any illegal redistribution or misuse of data obtained using this tool.
Caution
Use of this tool is at your own risk. By using this tool, you agree that you are solely responsible for any legal issues that may arise from your use of this tool.
This project is released under the terms of The Unlicense, which allows you to use, modify, and distribute the code as you see fit.
- The Unlicense removes traditional copyright restrictions, giving you the freedom to use the code in any way you choose.
- For more details, see the LICENSE file in this repository.
Author: Scott Grivner
Email: scott.grivner@gmail.com
Website: scottgrivner.dev
Reference: Main Branch