Skip to content

Latest commit

 

History

History
35 lines (24 loc) · 1.7 KB

README.md

File metadata and controls

35 lines (24 loc) · 1.7 KB

Oxford Translator

Parse and translate Oxford English Dictionary vocab lists into target foreign language

Background

Vocab aquisition is an important component of learning a foreign language. Obviously studying more frequently used words is beneficial, however it is often difficult to find reliable frequency counts because they either

  1. Include too many "helper" words (the, and, one, a, etc.) that you already know by heart
  2. Are biased towards a particular domain
  3. Were published by a source with questionable expertise in the target language
  4. Don't include enough words for the CEPR level you're trying to reach

Oxford Dictionary publishes excellent lists that resolve these issues

but sadly these are only in English. However, as an English speaker learning another language I think you can assume the most frequent English words are going to still be roughly the ones you should target in the foreign language. Given that, why not just translate the Oxford lists instead? Even better, programmatically translate them to save a bunch of manual typing

Usage

German is the most developed Language so far, but other subclasses can be added

python translate.py --pdf_path oxford_3000.pdf --dst de

Testing

  • From command line: pytest tests/
  • Through PyCharm gutters: under Settings > Tools > Python Integrated Tools, choose pytest as default test runner