Grouping identical analyses into a single entry #15

Open
gfranzini opened this issue Dec 21, 2018 · 1 comment

@gfranzini
Contributor

Create a filter/function to group identical analyses into a single entry. For example, analyses 18 and 19 of forma (Du Cange) are identical:

============================ANALYSIS 18==================================

SEGMENTATION:	form -a

---------------------morphological feats 1 ----------------------------
--bfs--

Case:	Ablative
Gender:	Feminine
Number:	Singular
---------------------morphological feats 2 ----------------------------
--nfs--

Case:	Nominative
Gender:	Feminine
Number:	Singular
---------------------morphological feats 3 ----------------------------
--vfs--

Case:	Vocative
Gender:	Feminine
Number:	Singular
	============================LEMMA =================================
	forma                         N1   D68HA f
	-----------------------morphological feats-------------------------
	NcA

	PoS:	Noun
	Type:	Common
	Inflexional Category:	I decl
	-----------------------derivational info---------------------------
	IS DERIVED: NO

============================ANALYSIS 19==================================

SEGMENTATION:	form -a

---------------------morphological feats 1 ----------------------------
--bfs--

Case:	Ablative
Gender:	Feminine
Number:	Singular
---------------------morphological feats 2 ----------------------------
--nfs--

Case:	Nominative
Gender:	Feminine
Number:	Singular
---------------------morphological feats 3 ----------------------------
--vfs--

Case:	Vocative
Gender:	Feminine
Number:	Singular
	============================LEMMA =================================
	forma                         N1   D68HB f
	-----------------------morphological feats-------------------------
	NcA

	PoS:	Noun
	Type:	Common
	Inflexional Category:	I decl
	-----------------------derivational info---------------------------
	IS DERIVED: NO
@gersh0m
Member

gersh0m commented Jun 10, 2019

The LemLat application does indeed have a grouping problem, but it arises with word-forms that have both an 'ordinary' and an exceptional lemmatization.

That is NOT the case in your example, where you actually have two lemmas (note the different ids, D68HA and D68HB).

What could be argued, in cases like that, is the choice of using two different entries in the support database.
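
For illustration only, here is a minimal Python sketch of the kind of filter requested above. It is not LemLat's actual code or data model. It assumes each analysis has already been parsed into a flat dict, and every field name (`segmentation`, `feats`, `lemma`, `pos_code`, `lemma_id`) and the function `group_analyses` are hypothetical. Analyses that match on everything except the lemma id are merged into one entry, and the distinct ids are kept so that the two underlying lemmas remain traceable.

```python
def group_analyses(analyses, id_field="lemma_id"):
    """Merge analyses that are identical apart from `id_field`.

    `analyses` is a list of flat dicts with hashable values
    (hypothetical structure, not LemLat's real one).
    """
    groups = {}  # dicts preserve insertion order in Python 3.7+
    for analysis in analyses:
        # Everything except the lemma id defines what "identical" means here.
        rest = {k: v for k, v in analysis.items() if k != id_field}
        key = tuple(sorted(rest.items()))
        if key not in groups:
            groups[key] = dict(rest, lemma_ids=[])
        groups[key]["lemma_ids"].append(analysis[id_field])
    return list(groups.values())


# The two analyses of "forma" from this issue, in the assumed dict form.
analysis_18 = {
    "segmentation": "form -a",
    "feats": ("bfs", "nfs", "vfs"),
    "lemma": "forma",
    "pos_code": "NcA",
    "lemma_id": "D68HA",
}
analysis_19 = dict(analysis_18, lemma_id="D68HB")

grouped = group_analyses([analysis_18, analysis_19])
print(len(grouped), grouped[0]["lemma_ids"])
```

Keeping the ids in a list rather than discarding them is a deliberate choice in this sketch: the output shows a single entry while the support database still distinguishes the two lemmas.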
