Skip to content

funginstitute/noveltymeasure

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Novelty Measure

novelty.py expects two files: firstly, a document matrix following [i,j,val] nomenclature: X[i,j] is the number of times term j appears in document i. Secondly, appdates.csv, which has 3 columns: document identifier, date of publication/filing, and transformed date. The transformed date should look like YYYYMMDD so that it is a sortable integer.

This will create 2 files: after.csv and before.csv, which give the number of overlapping words with each document 5 years before/after the focal document.

plot.py will create some helpful graphs from the above output files.


sample.7z is a 7zipped file containing the matrix and application dates for 20000 patents to use as a sample.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages