ERRANT v2.1.0
v2.1.0 (09-01-20)
- The character-level cost in the sentence alignment function is now computed by the much faster python-Levenshtein library instead of Python's native difflib.SequenceMatcher. This makes ERRANT 3x faster!
- Various minor updates:
- Updated the English wordlist.
- Fixed a broken rule for classifying contraction errors.
- Changed a condition in the calculation of transposition errors to be more intuitive.
- Partially updated the ERRANT POS tag map to match the updated Universal POS tag map. Specifically, EX now maps to PRON rather than ADV, LS maps to X rather than PUNCT, and CONJ has been renamed CCONJ. I did not change the mapping of RP from PART to ADP yet because this breaks several rules involving phrasal verbs.
- Added an errant.__version__ attribute.
- Added a warning about using ERRANT with spaCy 2.
- Tidied some code in the classifier.
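
The switch from difflib.SequenceMatcher to python-Levenshtein noted above can be pictured with a small sketch. This is not ERRANT's own code: the function names and the length-based normalisation are assumptions made for illustration, and the edit distance is written in pure Python so the block runs without extra dependencies. The speed-up reported above comes from python-Levenshtein being a compiled extension, which this sketch does not reproduce.

```python
from difflib import SequenceMatcher


def difflib_char_cost(a: str, b: str) -> float:
    """Character-level cost in [0, 1] from difflib's similarity ratio."""
    return 1.0 - SequenceMatcher(None, a, b).ratio()


def edit_distance(a: str, b: str) -> int:
    """Classic Levenshtein distance (insert, delete, substitute all cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, free if equal
            ))
        prev = curr
    return prev[-1]


def levenshtein_char_cost(a: str, b: str) -> float:
    """Edit distance scaled by the longer string (assumed normalisation)."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


if __name__ == "__main__":
    for pair in [("color", "colour"), ("their", "there"), ("cat", "cat")]:
        print(pair, difflib_char_cost(*pair), levenshtein_char_cost(*pair))
```

Both functions return 0 for identical tokens and grow as the tokens diverge, so either can serve as a character-level cost. They are not numerically identical, though, so alignments could in principle differ slightly between the two.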
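
The POS tag map changes listed above can be summarised as a small before/after sketch. The dictionary below is a hypothetical four-entry excerpt, not ERRANT's actual map file, and the CC entry is an assumption added only to show the CONJ to CCONJ rename. RP is deliberately left as PART, matching the note above.

```python
# Hypothetical excerpt of a Penn Treebank -> Universal POS map before v2.1.0.
OLD_TAG_MAP = {
    "CC": "CONJ",   # assumed entry, used to illustrate the rename
    "EX": "ADV",
    "LS": "PUNCT",
    "RP": "PART",
}


def apply_v210_changes(tag_map: dict) -> dict:
    """Return a copy of tag_map with the changes described in the notes."""
    updated = dict(tag_map)
    updated["EX"] = "PRON"  # EX now maps to PRON rather than ADV
    updated["LS"] = "X"     # LS now maps to X rather than PUNCT
    # CONJ has been renamed CCONJ wherever it appears as a target label.
    return {
        tag: ("CCONJ" if upos == "CONJ" else upos)
        for tag, upos in updated.items()
    }


NEW_TAG_MAP = apply_v210_changes(OLD_TAG_MAP)

if __name__ == "__main__":
    for tag in sorted(OLD_TAG_MAP):
        print(f"{tag}: {OLD_TAG_MAP[tag]} -> {NEW_TAG_MAP[tag]}")
```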