Language detection model

This model uses an LSTM for character-level language detection. A sentence is fed into the model one character at a time, and the output after the final character determines which of five languages the sentence was written in.
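The idea can be sketched in plain Julia without any deep learning library. This is only an illustration, not the code in model.jl: the alphabet, the placeholder language labels, the hidden size, and the helper names (`onehot`, `init_cell`, `lstm_step`, `classify`) are all assumptions, and the weights are random, so the prediction itself is meaningless until the model is trained.

```julia
using Random

# Hypothetical alphabet; '_' stands in for any character outside it.
const ALPHABET = vcat(collect('a':'z'), [' ', '_'])
# Placeholder labels for the five languages (the real ones are defined by the scripts).
const LANGS = ["lang1", "lang2", "lang3", "lang4", "lang5"]

# One-hot encode a character; unknown characters map to the last slot.
function onehot(c::Char)
    v = zeros(Float32, length(ALPHABET))
    i = findfirst(==(c), ALPHABET)
    v[i === nothing ? length(ALPHABET) : i] = 1f0
    return v
end

sigmoid(x) = 1 / (1 + exp(-x))

# Numerically stable softmax over a vector of scores.
function softmax(v)
    e = exp.(v .- maximum(v))
    return e ./ sum(e)
end

# Parameters of a single LSTM cell; the four gates are stacked row-wise.
struct LSTMCell
    W::Matrix{Float32}  # input weights, size (4 * hidden, input)
    U::Matrix{Float32}  # recurrent weights, size (4 * hidden, hidden)
    b::Vector{Float32}  # biases, length 4 * hidden
end

# Small random initialisation (an arbitrary choice for this sketch).
function init_cell(nin::Int, nh::Int, rng::AbstractRNG)
    return LSTMCell(0.1f0 .* randn(rng, Float32, 4nh, nin),
                    0.1f0 .* randn(rng, Float32, 4nh, nh),
                    zeros(Float32, 4nh))
end

# One LSTM time step: consume input x, update hidden state h and cell state c.
function lstm_step(cell::LSTMCell, h, c, x)
    nh = length(h)
    z = cell.W * x .+ cell.U * h .+ cell.b
    i = sigmoid.(z[1:nh])          # input gate
    f = sigmoid.(z[nh+1:2nh])      # forget gate
    o = sigmoid.(z[2nh+1:3nh])     # output gate
    g = tanh.(z[3nh+1:4nh])        # candidate cell state
    c_new = f .* c .+ i .* g
    h_new = o .* tanh.(c_new)
    return h_new, c_new
end

# Feed the sentence character by character; only the final hidden state is classified.
function classify(sentence::AbstractString, cell::LSTMCell, Wout::Matrix{Float32})
    nh = size(cell.U, 2)
    h = zeros(Float32, nh)
    c = zeros(Float32, nh)
    for ch in lowercase(sentence)
        h, c = lstm_step(cell, h, c, onehot(ch))
    end
    return softmax(Wout * h)
end

rng = MersenneTwister(0)
cell = init_cell(length(ALPHABET), 16, rng)
Wout = 0.1f0 .* randn(rng, Float32, length(LANGS), 16)
probs = classify("hello world", cell, Wout)
println(LANGS[argmax(probs)], " ", probs)
```

The point of the sketch is the data flow: intermediate hidden states are discarded, and a single probability vector over the five languages is produced from the state reached after the last character.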

First, run scrape.jl to download a Wikipedia data set. model.jl contains the model definition and the training code.

Training

cd text/lang-detection
julia scrape.jl
julia --project model.jl

References