Releases: hsci-r/las
Releases · hsci-r/las
Fix for using both guessing and OCR noise correction at the same time
Modular release
This release marks the first modular release of LAS, where you can trade functionality for smaller file sizes.
The options are:
- las: complete package including all support for all languages, but almost 600 megabytes
- las-fi: complete functionality for (only) Finnish, including edit distance fuzzy analysis for noisy data as well as guessed word segmentation for words not in the lexicon
- las-fi-small: basic functionality for (only) Finnish without fuzzy analysis or segmentation for guessed words, but a much smaller file size
- las-small: support all languages, but provide only the basic functionality for Finnish
- las-non-fi: support all languages apart from Finnish
v1.5.9
Bugfix release
v1.5.5 bump version
JOSS version
Version accepted to Journal of Open Source Software
Lexical Analysis Command-Line Tool for lemmatizing, lexical analysis, inflected form generation and language identification of multiple languages
Updates for Finnish disambiguation rules, proper documentation
Support for frequency scoring as a last measure
v1.4.5 bump version
add option to process by line
v1.4.4 fix option to process by line
v1.4.1
Add support for edit distance
v1.4.0 honor pretty printing also for interactive operation