Skip to content

Latest commit

 

History

History
16 lines (7 loc) · 306 Bytes

README.md

File metadata and controls

16 lines (7 loc) · 306 Bytes

Install the required modules

pip install -r requirements.txt

Create the (title, vector) from the pdf

python create_vector.py pdf1 pdf2

To check the similarity of documents

python similarity.py pdf1 pdf2 pdf3 [pdf4 pdf5 ...]

The result will show how similar pdf1 is to the rest of pdfs