Skip to content

EishaMazhar/IR_Assignment2-VSM-tfidf

Repository files navigation

To Run the code

  1. To run and open GUI CODE, the COMMANDS that you need to run in ANACONDA PROMPT are : code TFIDF_GUIcode.py (to open code in VScode) and python TFIDF_GUIcode.py (to run the code)

  2. You can run code file with '.ipynb' extension easily in the jupyter notebook (it doesn't have GUI)


Data Set

Data Set is a collection of Trump Speeches (File name: Trump Speeches 56 files) for implementing inverted index and positional index. A single file contains a single speech from All of Trump's Speeches from June 2015 to November 9, 2016.

Total unstructured text Documents: 52

Provided Files

Files Provided:

  1. TrumpSpeeches
  2. Stop-words list as a single file
  3. Queries in a single file.

Results

GUI - Fetched Documents and Query Score Based on Query and cut-off value

Result Visualization

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published