This project demonstrates the loading, preprocessing, and summarization of text from PDF files using Streamlit and LLM APIs like (Deepseek r1,o3-mini). The application allows users to upload a PDF, extract its text, and generate a concise summary. The summary can be downloaded as a .txt
file for offline use.
- PDF Summarization
- Users can upload a PDF file (up to 200MB) and get the summary of the document.
- Upload a PDF:
- Users upload a PDF file using the file uploader in the app.
- Extract Text:
- The app extracts text from the PDF using the
PyPDF2
library.
- The app extracts text from the PDF using the
- Generate Summary:
- The extracted text is sent to the LLM API (Deepseek r1, o3-mini), which generates a summary.
- Display and Download:
- The summary is displayed on the app, and users can download it as a
.txt
file.
- The summary is displayed on the app, and users can download it as a
- Python 3.8+
- Libraries:
streamlit
,PyPDF2
,openai
- Hardware: CPU (GPU not required)
This project is licensed under the MIT License. See the LICENSE file for details.