-
Notifications
You must be signed in to change notification settings - Fork 252
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
problem with annotation of "sadece" in UD_Turkish-BOUN #1016
Comments
@iambusra, do you have thoughts on this? |
Does this pass validation, when you have an |
Yep, the validator does not look at subtypes except a couple of the semi-mandatory ones, and |
I guess my question was more about |
I would make it a dependent of time. There is a policy (that I only noticed recently) distinguishing "pure function words" that do not accept modifiers other than negation: these include case markers/adpositions and articles, and potentially other words. For English I would assume that demonstrative determiners should be considered pure function words. (As opposed to demonstrative pronouns, which can take relative clause modifiers, for instance.) Looking at treebanks there are some quantifier determiners with modifiers like "yet another", "nearly all", "almost every". But I don't see any demonstrative determiners with modifiers. |
If something is an Since it is annotated as a function word, I also agree that its relation is with the head of the noun phrase, so dakika or time in current examples. But it might also be otherwise: in something like nearly all persons, I would propend towards |
The question wasn't about the deprel but the head.
This is my inclination as well, but @nschneid suggests this isn't attested in (English?) treebanks. Is there at least consensus that this is a reasonable annotation of such structures?
I don't fully get why this should be different from nearly all persons. |
The pattern I am seeing in English treebanks is that prenominal "nearly all" is treated as a unit, but not prenominal "only this". This may have to do with the fact that "only" is a focusing modifier that applies to nominals in general, whereas "nearly" is a degree modifier of a quantity.
But then we have to deal with the less common case where the modified quantity is not a det or nummod:
The quoted ones are both treebank examples, and "nearly" modifies the head of the nominal ("everyone", "century"). So it is not a categorical rule that "nearly" never modifies the head nominal. At the end of the day I don't think English UD has all the answers on determiners and determiner-related dependents in nominals, especially when it comes to various constructions for expressing quantity and measurement. So, take this precedent with a grain of salt. |
I was referring to this in the original post:
With regard to the head
I would also give an explanation like the one by @nschneid , but the more I think about it, the less obvious it is 🤔 But it might as well be that I was making a blunder. Looking at these examples
the position of nearly "outside" the determiner seems to point to it modifying the whole phrase, and so attaching to the head. Now we might talk about how much lexical or functional terms like entire (maybe The reason for not being able of saying nearly books is that this expression is not quantified somehow. But I would no longer say that this means that if a quantifier is present, nearly attaches to it. So personally I would correct my previous statement and lean towards attaching elements like nearly, only, etc. always to the head. In some cases, this might indirectly lead to reconsidering annotation of some elements (entire, quarter...) |
In UD_Turkish-BOUN, the word sadece "only" is often treated as a dependent on a verb, even when it's serving to restrict something else.
For example:
Here sadece 30 dakika sürmüştü means something like "continued for only 30 minutes". Surely sadece should be dependent on 30, and not the verb?
But that raises what kind of relation it should be. It's not quite an
advmod
because it can be a dependent on pretty much any part of speech (like only and just in English). This seems to me to be whatadvmod:emph
is for?The text was updated successfully, but these errors were encountered: