Missing Abbr=Yes annotations #35
Comments
Where do things such as middle names fit in? Adnan Z. Amin
What did you use for the list of abbreviations? I tried a regex which wound up capturing [...] Similarly, you did capture [...]
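A regex pass of the kind mentioned above could look roughly like the following. This is only a sketch: the pattern `ABBR_LIKE` and the helper names `parse_feats` and `missing_abbr` are hypothetical and are not the regex or list actually used in this thread. It assumes standard 10-column CoNLL-U input and flags tokens whose form looks like a dotted abbreviation (for example a middle initial such as Z.) but whose FEATS column lacks Abbr=Yes.

```python
import re

# Hypothetical pattern: one or more "letter + period" units, e.g. "Z." or "U.S."
ABBR_LIKE = re.compile(r"^(?:[A-Za-z]\.)+$")


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Abbr=Yes|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def missing_abbr(conllu_text):
    """Return (token id, form) pairs that look abbreviated but lack Abbr=Yes."""
    hits = []
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges and empty nodes
        form, feats = cols[1], parse_feats(cols[5])
        if ABBR_LIKE.match(form) and feats.get("Abbr") != "Yes":
            hits.append((cols[0], form))
    return hits
```

A pattern this narrow will miss undotted abbreviations and initialisms, so any hits are candidates for manual review rather than a complete list.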
I suppose it is technically an abbreviation, but it's very specialized: not the sort of abbreviation that can be expanded based on general knowledge of language & culture. If the goal is to expand abbreviations, this will only be possible for public figures.
It could still technically get an Abbr tag, unless the idea is to only do that for known abbreviations.
On Thu, Dec 7, 2023, 11:41 AM Nathan Schneider wrote:
Where do things such as middle names fit in? Adnan Z. Amin
I suppose it is technically an abbreviation, but it's very specialized—not
the sort of abbreviation that can be expanded based on general knowledge of
language & culture. If the goal is to expand abbreviations, this will only
be possible for public figures.
Sure. I guess I would say [...] Some of these abbreviations, but not all, would be pronounced in a different way from their spelled-out and standard equivalents.
I put [...] My checker is only checking for internal consistency, so it is currently using normalization rules like: [...]
It also selects the lemmatization rules like that (e.g. [...])
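To illustrate what an internal-consistency check of this kind might involve, here is a minimal sketch. It is not the checker described above, and the elided normalization rules are not reconstructed. The function name `inconsistent_abbr` and the choice to normalize forms by lowercasing are assumptions made for illustration. The idea is to group tokens by normalized form and report any form that carries Abbr=Yes in some occurrences but not in others.

```python
from collections import defaultdict


def inconsistent_abbr(rows):
    """Report forms annotated with Abbr=Yes only some of the time.

    rows: iterable of (form, feats) pairs, with feats in CoNLL-U FEATS
    syntax, e.g. "Abbr=Yes|Number=Sing" or "_".
    """
    seen = defaultdict(set)
    for form, feats in rows:
        has_abbr = "Abbr=Yes" in feats.split("|")
        # Assumed normalization: compare forms case-insensitively.
        seen[form.lower()].add(has_abbr)
    # A form is inconsistent when both True and False were observed.
    return sorted(form for form, flags in seen.items() if len(flags) == 2)
```

Because the check only compares a treebank against itself, it cannot tell which of the two annotations is correct; it only points to forms worth a second look.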
Got it. I made most of these updates, I think, so please let me know if the issue can be closed or if there is still work to be done.
The following lemmas are for abbreviated forms. EWT adds Abbr=Yes to these:

Initialisms

The following initialisms should have Abbr=Yes to indicate that the lemma should be in uppercase, as is annotated in GUM: