J. Hugh Wright
Vignesh Muthukumar
Gabriel Silva de Oliviera
This project used the following datasets collected from DataShop:
- https://pslcdatashop.web.cmu.edu/DatasetInfo?datasetId=92
- https://pslcdatashop.web.cmu.edu/DatasetInfo?datasetId=120
- https://pslcdatashop.web.cmu.edu/DatasetInfo?datasetId=339
Sadly, these datasets are too large to be included within this repository. However, we do include CSV files containing the data we extracted from them.
The following is a list of the files included in this repository and what they were used for.
- baseline_model.py: A Python script for training and testing our non-neural-network predictive models (sketched below).
- CorrelationCalculation.ipynb: A notebook for calculating the Pearson correlation coefficients and p-values of our data (sketched below).
- dataHotEncoding.csv: A CSV file containing the final form of our data, with all knowledge components (KCs) one-hot encoded. Used to train the predictive models.
- Graphs.xlsx: An Excel file that we used to create our data visualizations.
- HotEncoding.ipynb: A notebook for one-hot encoding our knowledge components (sketched below). Produced the "dataHotEncoding.csv" file.
- language_features.py: A Python script for extracting language features and adding them to our dataset (sketched below). Produced the "Language_Processed.csv" file in the ProcessedData folder.
- LowFrequencyCalculation.ipynb: A notebook for detecting low-frequency words. Due to time constraints, we ultimately did not use this data in our analysis.
- neural_networks.py: A Python script for training and testing our neural network models (sketched below).
- preprocess.py: A simple Python script to extract information about individual questions from our three original datasets (sketched below). Produced the "More_Processed_Data.csv" file in the ProcessedData folder.
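
The sketches below illustrate the main pipeline steps in rough order of execution. They are approximations, not the scripts themselves. First, preprocess.py: this is a minimal sketch assuming a standard tab-delimited DataShop transaction export; the export filename, the "Problem Name" and "Outcome" columns, and the aggregated fields are assumptions that may not match the actual datasets.

```python
import pandas as pd

# DataShop transaction exports are tab-delimited; the filename and column
# names here are assumptions and may not match the actual datasets.
tx = pd.read_csv("ds92_tx.txt", sep="\t")  # hypothetical export filename

# Aggregate transactions into one row per question.
per_question = (
    tx.groupby("Problem Name")
      .agg(attempts=("Outcome", "size"),
           percent_correct=("Outcome", lambda s: (s == "CORRECT").mean()))
      .reset_index()
)
per_question.to_csv("ProcessedData/More_Processed_Data.csv", index=False)
```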
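
For language_features.py, the sketch below shows one plausible shape for the step. The feature set and the "question_text" column are assumptions, not the script's actual features.

```python
import pandas as pd

def language_features(text: str) -> dict:
    """Hypothetical surface features; the real script's feature set may differ."""
    words = text.split()
    return {
        "word_count": len(words),
        "char_count": len(text),
        "avg_word_length": sum(len(w) for w in words) / max(len(words), 1),
    }

df = pd.read_csv("ProcessedData/More_Processed_Data.csv")
# "question_text" is a hypothetical column name.
features = df["question_text"].astype(str).apply(language_features).apply(pd.Series)
pd.concat([df, features], axis=1).to_csv("ProcessedData/Language_Processed.csv", index=False)
```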
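
The one-hot encoding performed by HotEncoding.ipynb can be sketched with pandas.get_dummies; the "KC" column name is an assumption about the intermediate schema.

```python
import pandas as pd

df = pd.read_csv("ProcessedData/Language_Processed.csv")

# get_dummies expands a categorical column into one 0/1 column per KC value.
kc_dummies = pd.get_dummies(df["KC"], prefix="KC")  # "KC" is a hypothetical column name
encoded = pd.concat([df.drop(columns=["KC"]), kc_dummies], axis=1)
encoded.to_csv("dataHotEncoding.csv", index=False)
```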
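
The correlation analysis in CorrelationCalculation.ipynb amounts to calls like the one below; the feature and target column names are hypothetical.

```python
import pandas as pd
from scipy.stats import pearsonr

df = pd.read_csv("dataHotEncoding.csv")

# Hypothetical column names; pearsonr returns the coefficient and its p-value.
r, p = pearsonr(df["word_count"], df["percent_correct"])
print(f"Pearson r = {r:.3f}, p = {p:.3g}")
```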
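
For baseline_model.py, here is a minimal sketch of training and evaluating one non-neural-network model, assuming a "percent_correct" target column; the actual script's models and metrics may differ.

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression

df = pd.read_csv("dataHotEncoding.csv")
# "percent_correct" is a hypothetical target column; keep numeric features only.
X = df.drop(columns=["percent_correct"]).select_dtypes(include="number")
y = df["percent_correct"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model = LinearRegression().fit(X_train, y_train)
print("Held-out R^2:", model.score(X_test, y_test))
```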
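
The framework used by neural_networks.py is not shown here; as a stand-in, the sketch below uses scikit-learn's MLPRegressor on the same assumed columns.

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

df = pd.read_csv("dataHotEncoding.csv")
# Same hypothetical target column and numeric-feature assumption as above.
X = df.drop(columns=["percent_correct"]).select_dtypes(include="number")
y = df["percent_correct"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
mlp = MLPRegressor(hidden_layer_sizes=(64, 32), max_iter=1000, random_state=0)
mlp.fit(X_train, y_train)
print("Held-out R^2:", mlp.score(X_test, y_test))
```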