Skip to content

Latest commit

 

History

History
94 lines (59 loc) · 3.16 KB

README.md

File metadata and controls

94 lines (59 loc) · 3.16 KB

Newspaper Advertisement Analyzer (Server)

version

Newspaper Advertisement Analyzer Logo or Screenshot (if available)

Table of Contents

Introduction

Welcome to the Newspaper Advertisement Analyzer (Server) repository! This server component is an essential part of our comprehensive application designed to extract key information from newspaper advertisements using Natural Language Processing (NLP) techniques. The server processes requests from the frontend, analyzes advertisements, and sends relevant information back to the client.

Features

  1. Advertisement Analysis: Utilizes NLTK, pytesseract, Specy, and other libraries to extract vital details from newspaper advertisements.

  2. Flask API: Built on the Flask web framework, providing RESTful API endpoints for communication with the frontend.

  3. MongoDB Integration: Stores extracted information in a MongoDB database hosted on MongoDB Atlas using the pymongo driver.

  4. Frontend Communication: Serves as the backend for the frontend application, facilitating the exchange of data and results.

Technologies Used

The Newspaper Advertisement Analyzer (Server) relies on a modern technology stack to accomplish its objectives:

  • Flask: The primary web framework used to create the server.

  • NLTK (Natural Language Toolkit): Used for natural language processing tasks.

  • pytesseract: Enables optical character recognition (OCR) to extract text from images.

  • Specy: A library for parsing and analyzing structured documents.

  • Newspaper: Used for scraping articles from news websites.

  • Beautiful Soup: A library for web scraping and parsing HTML and XML documents.

  • MongoDB: The NoSQL database for storing and managing extracted data.

  • pymongo: The MongoDB driver for Python to interact with the database.

Environment Setup

To set up your development environment, follow these steps:

Create a Python virtual environment:

python -m venv venv

Activate the virtual environment:

source venv/bin/activate   # On macOS and Linux
venv\Scripts\activate      # On Windows

Install Flask and other dependencies from the requirements.txt file:

pip install -r requirements.txt

Usage

Create a Python virtual environment:

python main.py

Access the API at http://localhost:5000.

Database Configuration

Configure your application's database settings in the environment variables. Make sure to provide the connection string for your MongoDB Atlas instance.

# Example .env.py
MONGODB_URI = "your_mongodb_uri_here"

Contributors