Skip to content

Latest commit

 

History

History
36 lines (24 loc) · 379 Bytes

README.md

File metadata and controls

36 lines (24 loc) · 379 Bytes

Scripts for generating grams from large datasets

Dowloading texts

make get_data

Cleaning data

make clean_txt_files

Create 1,2 grams:

make compile
make run

Grams are created as data/gram*.csv

Prepare data for prediction

make prepare_data_simple_prediction:

Calulace accuracy

make generate_tests
make accuracy