Lemma column needed in conllu input? #48

Open
jowagner opened this issue Feb 15, 2019 · 0 comments

The example in the readme shows CoNLL-U input with the ID, form and lemma columns populated. However, none of the read_conll_*corpus functions in __main__.py reads the lemma column. Is the lemma column read anywhere else, or do you plan to use it in the near future? Do I need a lemmatiser?

Of course, udpipe users can get the lemmatisation from udpipe, and one will want to use udpipe with the offered models anyway, since udpipe's tokenisation differs from that of other popular tokenisers. A required lemma column would, however, be an obstacle if I use my own tokeniser and train my own models (as described in the readme).
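
To make the question concrete, here is a minimal sketch of what I would do if no lemmatiser is needed: write my own tokeniser's output as CoNLL-U with the lemma column (the third of the ten tab-separated fields) set to the `_` placeholder. The helper name `blank_lemma_column` is hypothetical and not part of this project, and whether the parser accepts such input is exactly what I am asking.

```python
def blank_lemma_column(conllu_text: str) -> str:
    """Replace the LEMMA field (third column) of every token line with '_'.

    Hypothetical helper for illustration only; it is not part of this project.
    """
    out_lines = []
    for line in conllu_text.split("\n"):
        # Comment lines and blank sentence separators pass through unchanged.
        if not line or line.startswith("#"):
            out_lines.append(line)
            continue
        fields = line.split("\t")
        # A CoNLL-U token line has exactly ten tab-separated fields:
        # ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
        if len(fields) == 10:
            fields[2] = "_"  # LEMMA column set to "unspecified"
        out_lines.append("\t".join(fields))
    return "\n".join(out_lines)


if __name__ == "__main__":
    sample = (
        "# text = Dogs bark\n"
        "1\tDogs\tdog\t_\t_\t_\t_\t_\t_\t_\n"
        "2\tbark\tbark\t_\t_\t_\t_\t_\t_\t_\n"
        "\n"
    )
    print(blank_lemma_column(sample))
```

If the lemma column turns out to be used somewhere, this shortcut would not be enough and a real lemmatiser would be required.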
