Skip to content

Commit

Permalink
remove keywords where they are identical with tasks
Browse files Browse the repository at this point in the history
  • Loading branch information
anne17 committed Nov 18, 2024
1 parent f826d1b commit 54a741e
Show file tree
Hide file tree
Showing 10 changed files with 14 additions and 67 deletions.
1 change: 0 additions & 1 deletion sparv/modules/conll_export/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,6 @@ short_description:
task: export
keywords:
- conll-u
- export
sparv_handler: conll_export:conllu
example_output: |-
```
Expand Down
3 changes: 1 addition & 2 deletions sparv/modules/geo/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,7 @@ abstract: true
task: geotagging
language_codes:
- swe
keywords:
- geotagging
keywords: []
standard_reference: ''
other_references: []
tool: ''
Expand Down
15 changes: 5 additions & 10 deletions sparv/modules/hunpos/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -19,8 +19,7 @@ short_description:
swe: Annotering av SUC-ordklasser med Hunpos för svenska
eng: Swedish part-of-speech annotation with SUC tags by Hunpos
task: part-of-speech tagging
keywords:
- pos-tagging
keywords: []
annotations:
- <token>:hunpos.pos
example_output: |-
Expand Down Expand Up @@ -52,8 +51,7 @@ short_description:
swe: Annotering av morfosyntaktiska deskriptorer (SUC) med Hunpos för svenska
eng: Annotation of morphological features (SUC) by Hunpos for Swedish
task: morphosyntactic tagging
keywords:
- msd
keywords: []
annotations:
- <token>:hunpos.msd
example_output: |-
Expand Down Expand Up @@ -85,8 +83,7 @@ short_description:
swe: Annotering av SUC-ordklasser med Hunpos för 1800-talssvenska
eng: Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's
task: part-of-speech tagging
keywords:
- pos-tagging
keywords: []
annotations:
- <token>:hunpos.pos
example_output: |-
Expand Down Expand Up @@ -129,8 +126,7 @@ short_description:
swe: Annotering av SUC-ordklasser med Hunpos för 1800-talssvenska
eng: Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's
task: part-of-speech tagging
keywords:
- pos-tagging
keywords: []
annotations:
- <token>:hunpos.pos
example_output: |-
Expand Down Expand Up @@ -173,8 +169,7 @@ short_description:
swe: Annotering av morfosyntaktiska deskriptorer (SUC) med Hunpos för 1800-talssvenska
eng: Annotation of morphological features (SUC) by Hunpos for Swedish from the 1800's
task: morphosyntactic tagging
keywords:
- msd
keywords: []
annotations:
- <token>:hunpos.msd
example_output: |-
Expand Down
3 changes: 1 addition & 2 deletions sparv/modules/malt/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,7 @@ short_description:
task: dependency parsing
language_codes:
- swe
keywords:
- dependency parsing
keywords: []
annotations:
- <token>:malt.ref
- <token>:malt.dephead_ref
Expand Down
3 changes: 1 addition & 2 deletions sparv/modules/readability/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,7 @@ abstract: true
task: readability measures
language_codes:
- swe
keywords:
- readability measures
keywords: []
other_references: []
tool: ''
model: ''
Expand Down
6 changes: 0 additions & 6 deletions sparv/modules/saldo/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,6 @@ short_description:
eng: Full-form lookup for SALDO citation forms (lemmas)
task: lemmatization
keywords:
- lemmatization
- saldo
annotations:
- <token>:saldo.baseform
Expand Down Expand Up @@ -55,7 +54,6 @@ short_description:
eng: Lookup for SALDO lemgrams
task: lexical lookup
keywords:
- lexical lookup
- saldo
annotations:
- <token>:saldo.lemgram
Expand Down Expand Up @@ -89,7 +87,6 @@ short_description:
eng: Lookup for SALDO identifiers
task: lexical lookup
keywords:
- lexical lookup
- saldo
annotations:
- <token>:saldo.sense
Expand Down Expand Up @@ -120,7 +117,6 @@ short_description:
eng: Analysis of SALDO lemgram compounds including a probability ranking
task: compound analysis
keywords:
- compound analysis
- saldo
annotations:
- <token>:saldo.complemgram
Expand Down Expand Up @@ -162,7 +158,6 @@ short_description:
eng: Analysis of SALDO wordform compounds
task: compound analysis
keywords:
- compound analysis
- saldo
annotations:
- <token>:saldo.compwf
Expand Down Expand Up @@ -204,7 +199,6 @@ short_description:
eng: Full-form lookup for SALDO citation forms (lemmas) plus analysis of compounds made up of SALDO entries
task: lemmatization
keywords:
- lemmatization
- saldo
annotations:
- <token>:saldo.baseform2
Expand Down
3 changes: 1 addition & 2 deletions sparv/modules/sensaldo/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,7 @@ short_description:
task: sentiment analysis
language_codes:
- swe
keywords:
- sentiment analysis
keywords: []
annotations:
- <token>:sensaldo.sentiment_label
- <token>:sensaldo.sentiment_score
Expand Down
43 changes: 4 additions & 39 deletions sparv/modules/stanza/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,8 @@ id: stanza-parent-swe
abstract: true
language_codes:
- swe
keywords:
- stanza
tool: "Stanza"
trained_on: "[SUC3](https://spraakbanken.gu.se/resurser/suc3), [TalbankenSBX](https://spraakbanken.gu.se/resurser/talbanken), [SIC2](https://spraakbanken.gu.se/resurser/sic2)"
other_references:
Expand Down Expand Up @@ -34,9 +36,6 @@ short_description:
swe: Annotering av SUC-ordklasser med Stanza för svenska
eng: Swedish part-of-speech annotation with SUC tags by Stanza
task: part-of-speech tagging
keywords:
- pos-tagging
- stanza
standard_reference: 'https://aclanthology.org/2021.nodalida-main.20/'
annotations:
- <token>:stanza.pos
Expand Down Expand Up @@ -65,9 +64,6 @@ short_description:
swe: Annotering av morfosyntaktiska deskriptorer (SUC) med Stanza för svenska
eng: Annotation of morphological features (SUC) by Stanza for Swedish
task: morphosyntactic tagging
keywords:
- msd
- stanza
standard_reference: 'https://aclanthology.org/2021.nodalida-main.20/'
annotations:
- <token>:stanza.msd
Expand Down Expand Up @@ -99,9 +95,6 @@ short_description:
swe: Morfologisk analys för svenska med universal features (UD) baserad på Stanza
eng: Stanza-based morphological analysis for Swedish, using universal features (UD)
task: morphosyntactic tagging
keywords:
- msd
- stanza
annotations:
- <token>:stanza.ufeats
example_output: |-
Expand Down Expand Up @@ -136,9 +129,6 @@ short_description:
swe: Annotering av grundformer (lemman) med Stanza för svenska tränat på SUC3
eng: Swedish citation form analysis (base forms, lemmas) by Stanza, trained on SUC3
task: lemmatization
keywords:
- lemmatization
- stanza
annotations:
- <token>:stanza.baseform
example_output: |-
Expand Down Expand Up @@ -173,9 +163,6 @@ short_description:
swe: Svensk dependensparsning tränad på Svensk trädbank med Stanza
eng: Swedish dependency parsing with Stanza trained on Sweedish treebank
task: dependency parsing
keywords:
- dependency parsing
- stanza
annotations:
- <token>:stanza.dephead_ref
- <token>:stanza.deprel
Expand Down Expand Up @@ -204,6 +191,8 @@ id: stanza-parent-eng
abstract: true
language_codes:
- eng
keywords:
- stanza
standard_reference: ''
tool: "Stanza"
trained_on: ''
Expand All @@ -224,9 +213,6 @@ short_description:
swe: Annotering av ordklasser (Penn Treebank-taggar) med Stanzas standardmodell för engelska
eng: Part-of-speech annotation with Penn Treebank tags with Stanza's standard model for English
task: part-of-speech tagging
keywords:
- pos-tagging
- stanza
tagset: "[Penn Treebank tagset](https://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.html)"
annotations:
- <token>:stanza.pos
Expand All @@ -248,9 +234,6 @@ short_description:
swe: Meningssegmentering med Stanzas standardmodell för engelska
eng: Sentence segmentation with Stanza's standard model for English
task: sentence segmentation
keywords:
- sentence segmentation
- stanza
annotations:
- stanza.sentence
example_output: |-
Expand Down Expand Up @@ -287,9 +270,6 @@ short_description:
swe: Tokenisering med Stanzas standardmodell för engelska
eng: Tokenization with Stanza's standard model for English
task: tokenization
keywords:
- tokenization
- stanza
annotations:
- stanza.token
example_output: |-
Expand All @@ -310,9 +290,6 @@ short_description:
swe: Lemmatisering med Stanzas standardmodell för engelska
eng: Lemmatization with Stanza's standard model for English
task: lemmatization
keywords:
- lemmatization
- stanza
annotations:
- <token>:stanza.baseform
example_output: |-
Expand All @@ -336,9 +313,6 @@ short_description:
swe: Dependensparsning med Stanzas standardmodell för engelska
eng: Dependency parsing with Stanza's standard model for English
task: dependency parsing
keywords:
- dependency parsing
- stanza
tagset: "[UD](https://universaldependencies.org/en/dep/)"
annotations:
- <token>:stanza.ref
Expand All @@ -365,9 +339,6 @@ short_description:
swe: Namnigenkänning (NER) med Stanzas standardmodell för engelska
eng: Named entity recognition with Stanza's standard model for English
task: named entity recognition
keywords:
- ner
- stanza
annotations:
- stanza.ne
- stanza.ne:stanza.ne_type
Expand Down Expand Up @@ -414,9 +385,6 @@ short_description:
swe: Annotering av UD-ordklasser (universal dependencies) med Stanzas standardmodell för engelska
eng: Part-of-speech annotation with UD (universal dependency) tags with Stanza's standard model for English
task: part-of-speech tagging
keywords:
- pos-tagging
- stanza
tagset: "[UD](https://universaldependencies.org/u/pos/)"
annotations:
- <token>:stanza.upos
Expand All @@ -438,9 +406,6 @@ short_description:
swe: Morfologisk analys för engelska med universal features (UD) baserad på Stanza
eng: Stanza-based morphological analysis for English, using universal features (UD)
task: morphosyntactic tagging
keywords:
- msd
- stanza
tagset: "[UD](https://universaldependencies.org/u/feat/index.html)"
annotations:
- <token>:stanza.ufeats
Expand Down
3 changes: 1 addition & 2 deletions sparv/modules/swener/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,7 @@ short_description:
task: named entity recognition
language_codes:
- swe
keywords:
- ner
keywords: []
annotations:
- swener.ne
- swener.ne:swener.name
Expand Down
1 change: 0 additions & 1 deletion sparv/modules/wsd/metadata.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,6 @@ task: sense disambiguation
language_codes:
- swe
keywords:
- sense disambiguation
- saldo
annotations:
- <token>:wsd.sense
Expand Down

0 comments on commit 54a741e

Please sign in to comment.