PerceptAnalyze

This Repo focusses on Data Extraction and Network Analysis using Google 2 gram Dataset

Google N-gram dataset link http://storage.googleapis.com/books/ngrams/books/datasetsv2.html

Requirements

NOTE (Your files and code should be in the same folder or place)

Python Installed on PC preferably(Python 3.5 or Python 3)

Two .csv perceptual data files for example here they are flavornetPercepts.csv & superscentPercepts.csv

For Python 2.7

Change urllib.request to urllib everywhere in the program (As it was shifted in Python 3)

Remove encoding="utf8" everywhere in the program

For running script.py (Data extraction)

Open Command Prompt

cd to the loaction of code and flavornetPercepts.csv , superscentPercepts.csv files

type "python script.py"

THE CODE WILL START

The errors and information will be logged in sample.log file which will be automatically created in the same folder

The final output in JSON format for flavornetPercepts will be in json.txt and for superscentPercepts will be in newjson.txt

For running graph.py (Data Analysis)

Run this after running script.py

Additional Requirements/Dependecies

pip install networkx

Two .csv perceptualEdges files i.e in this case edgesFlav.csv and edgesSuperSc.csv

Also Added newjson.txt and json.txt for Reference

Open Command Prompt

cd to the loaction of code and PerceptualEdges files

type "python graph.py"

THE CODE WILL START

The final output similarity comparing Superscent and Flavournet will be printed on screen

Final Results Obtained were

Similarity[0,inf) in Flavournet Graphs is 23903599.9347 Similarity[0,inf) in Superscent Graphs is 38827186.0992

For running newscript.py (Data extraction for Stem Words)

You Will need word.csv file

STEM WORDS are words whose all kind of occurances we want to find like taint* will include taint,tainted,tainting......

Open Command Prompt

cd to the loaction of code and word.csv file

type "python newscript.py"

THE CODE WILL START

The errors and information will be logged in log.out file which will be automatically created in the same folder

The final output in JSON format for word will be in newjson.txt

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Figure_1.png		Figure_1.png
Figure_2.png		Figure_2.png
Figure_3.png		Figure_3.png
Figure_4.png		Figure_4.png
GoodScentsPercepts.csv		GoodScentsPercepts.csv
LJPercepts.csv		LJPercepts.csv
README.md		README.md
edgesAll.csv		edgesAll.csv
edgesFlav.csv		edgesFlav.csv
edgesGoodSc.csv		edgesGoodSc.csv
edgesLJ.csv		edgesLJ.csv
edgesSA.csv		edgesSA.csv
edgesSuperSc.csv		edgesSuperSc.csv
flavornetPercepts.csv		flavornetPercepts.csv
graph.py		graph.py
json.txt		json.txt
newjson.txt		newjson.txt
newscript.py		newscript.py
percept.csv		percept.csv
script.py		script.py
sigma_aldrichPercepts.csv		sigma_aldrichPercepts.csv
superscentPercepts.csv		superscentPercepts.csv
word.csv		word.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PerceptAnalyze

Requirements

NOTE (Your files and code should be in the same folder or place)

For Python 2.7

For running script.py (Data extraction)

For running graph.py (Data Analysis)

Final Results Obtained were

For running newscript.py (Data extraction for Stem Words)

STEM WORDS are words whose all kind of occurances we want to find like taint* will include taint,tainted,tainting......

About

Releases

Packages

Languages

rijuldhir/PerceptAnalyze

Folders and files

Latest commit

History

Repository files navigation

PerceptAnalyze

Requirements

NOTE (Your files and code should be in the same folder or place)

For Python 2.7

For running script.py (Data extraction)

For running graph.py (Data Analysis)

Final Results Obtained were

For running newscript.py (Data extraction for Stem Words)

STEM WORDS are words whose all kind of occurances we want to find like taint* will include taint,tainted,tainting......

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages