Corrected some howlers discovered by Martin Popel's search for non-clausal deprels with subjects. Fixes #35.
colinbatchelor committed Nov 16, 2024
1 parent 7c9274d commit 0932478
Showing 3 changed files with 78 additions and 74 deletions.
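
The commit message refers to a search for subjects attached to heads that carry a non-clausal deprel. The exact query used is not given here, so the following is only a minimal sketch of what such a check might look like. The `NON_CLAUSAL` set, the function names and the `SAMPLE` fragment (built from three rows of the pre-fix dev file) are illustrative assumptions, not the original tool.

```python
# Sketch only: NON_CLAUSAL is an assumed, illustrative set of relations,
# not the list used in the search mentioned in the commit message.
NON_CLAUSAL = {"nsubj", "obj", "iobj", "obl", "nmod", "amod", "appos",
               "nummod", "vocative", "det", "case", "advmod"}
SUBJECT = {"nsubj", "csubj"}


def base(deprel):
    """Strip a subtype such as ':relcl' or ':tmod' from a relation label."""
    return deprel.split(":", 1)[0]


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows, skipping comments."""
    rows = []
    for line in conllu_text.splitlines():
        line = line.strip()
        if not line:
            if rows:
                yield rows
            rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword-token ranges (e.g. 20-21) and empty nodes (e.g. 5.1).
            if len(cols) == 10 and cols[0].isdigit():
                rows.append(cols)
    if rows:
        yield rows


def subjects_under_non_clausal_heads(conllu_text):
    """Return (subject_id, subject_form, head_id, head_deprel) for suspect arcs."""
    hits = []
    for rows in sentences(conllu_text):
        deprel_of = {cols[0]: cols[7] for cols in rows}
        for cols in rows:
            head = cols[6]
            head_deprel = deprel_of.get(head, "root")  # head 0 or outside fragment
            if base(cols[7]) in SUBJECT and base(head_deprel) in NON_CLAUSAL:
                hits.append((cols[0], cols[1], head, head_deprel))
    return hits


# Hypothetical fragment: three rows from the dev file as they stood before the fix.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["16", "a", "a", "PART", "Uv", "PartType=Voc", "17", "case:voc", "_", "_"],
    ["17", "Rìgh", "rìgh", "NOUN", "Ncsmv",
     "Case=Voc|Gender=Masc|Number=Sing", "3", "vocative", "_", "_"],
    ["18", "Èireann", "Èireann", "PROPN", "Nt", "_", "17", "nsubj", "_",
     "SpaceAfter=No"],
])

print(subjects_under_non_clausal_heads(SAMPLE))
```

On this fragment the check reports token 18 (`Èireann`), an `nsubj` hanging off a `vocative` head. Relabelling it `nmod`, as this commit does, clears the hit.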
17 changes: 9 additions & 8 deletions gd_arcosg-ud-dev.conllu
@@ -261,7 +261,7 @@
# speaker = [1]
# text = carson nach gabh thu gearr thu am pie agad fhèin?
1 carson carson PRON Uq PronType=Int 0 root _ _
- 2 nach nach PART Qnr PartType=Vb|Polarity=Neg|PronType=Rel 3 obl _ _
+ 2 nach nach PART Qnr PartType=Vb|Polarity=Neg|PronType=Rel 3 mark:prt _ _
3 gabh gabh VERB V-f--d Mood=Ind|Tense=Fut|VerbForm=Fin 5 reparandum _ _
4 thu thu PRON Pp2s Number=Sing|Person=2|PronType=Prs 3 nsubj _ _
5 gearr gearr VERB V-f--d Mood=Ind|Tense=Fut|VerbForm=Fin 1 acl:relcl _ _
@@ -1606,7 +1606,7 @@
20-21 ann _ _ _ _ _ _ _ _
20 an an ADP Sp _ 21 case _ _
21 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 17 xcomp:pred _ _
- 22 a a PART Q-r PartType=Vb|PronType=Rel 24 obl _ _
+ 22 a a PART Q-r PartType=Vb|PronType=Rel 24 mark:prt _ _
23 b’ is AUX Ws Tense=Past 24 cop _ _
24 fhearr math ADJ Apc Degree=Cmp,Sup 18 acl:relcl _ _
25 a a PART Ug PartType=Inf 26 mark:prt _ _
@@ -2375,10 +2375,11 @@
11 bith bi ADJ Aq _ 10 fixed _ SpaceAfter=No
12 . . PUNCT Fe _ 3 punct _ _

+ # comment = node 2 must be an adjective, surely?
# sent_id = f08_004
# text = Dh’fhàs maoil an duine dearg, is fliuch, is bha cuislean ri bòcadh le cabhaig fala.
1 Dh’fhàs fàs VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 0 root _ _
- 2 maoil maoil NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 1 nsubj _ _
+ 2 maoil maoil NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 1 xcomp _ _
3 an an DET Tdsmg Case=Gen|Definite=Def|Gender=Masc|Number=Sing|PronType=Art 4 det _ _
4 duine duine NOUN Ncsmg Case=Gen|Gender=Masc|Number=Sing 2 nsubj _ _
5 dearg dearg ADJ Ap _ 1 xcomp:pred _ SpaceAfter=No
@@ -5467,7 +5468,7 @@
15 " " PUNCT Fq _ 17 punct _ SpaceAfter=No
16 a a PART Uv PartType=Voc 17 case:voc _ _
17 Rìgh rìgh NOUN Ncsmv Case=Voc|Gender=Masc|Number=Sing 3 vocative _ _
- 18 Èireann Èireann PROPN Nt _ 17 nsubj _ SpaceAfter=No
+ 18 Èireann Èireann PROPN Nt _ 17 nmod _ SpaceAfter=No
19 , , PUNCT Fi _ 21 punct _ _
20 's is AUX Wp-i Tense=Pres 21 cop _ _
21 fhada fada ADV Rt _ 3 parataxis _ _
@@ -5777,7 +5778,7 @@
11 “ “ PUNCT Fq _ 13 punct _ SpaceAfter=No
12 a a PART Uv PartType=Voc 13 case:voc _ _
13 Rìgh rìgh NOUN Ncsmv Case=Voc|Gender=Masc|Number=Sing 3 vocative _ _
- 14 Èirinn Èirinn PROPN Nt _ 13 nsubj _ SpaceAfter=No
+ 14 Èirinn Èirinn PROPN Nt _ 13 nmod _ SpaceAfter=No
15 . . PUNCT Fe _ 3 punct _ _

# sent_id = n02_016
@@ -6690,9 +6691,9 @@
5 air air ADP Sp _ 7 case _ _
6 an an DET Tdsm Definite=Def|Gender=Masc|Number=Sing|PronType=Art 7 det _ _
7 fhear fear NOUN Ncsmd Case=Dat|Gender=Masc|Number=Sing 1 obl _ _
- 8 a a PART Q-r PartType=Vb|PronType=Rel 10 nsubj _ _
+ 8 a a PART Q-r PartType=Vb|PronType=Rel 10 mark:prt _ _
9 b' is AUX Ws Tense=Past 10 cop _ _
- 10 fhaisge faisg ADJ Apc Degree=Cmp,Sup 7 acl:relcl _ _
+ 10 fhaisge faisg ADJ Apc Degree=Cmp,Sup 7 amod _ _
11-12 dha _ _ _ _ _ _ _ SpaceAfter=No
11 do do ADP Sp _ 12 case _ _
12 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 10 obl _ _
@@ -9780,7 +9781,7 @@
26 air air ADP Sp _ 27 case _ _
27 dòigh dòigh NOUN Ncsfd Case=Dat|Gender=Fem|Number=Sing 24 xcomp:pred _ _
28 gheibh faigh VERB V-f Mood=Ind|Tense=Fut|VerbForm=Fin 6 conj _ _
- 29 thu thu PRON Pp2s Number=Sing|Person=2|PronType=Prs 31 nsubj _ _
+ 29 thu thu PRON Pp2s Number=Sing|Person=2|PronType=Prs 28 nsubj _ _
30 do do DET Dp2s Number=Sing|Person=2|Poss=Yes|PronType=Prs 31 nmod:poss _ _
31 chàineadh càineadh NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 28 obj _ _

14 changes: 8 additions & 6 deletions gd_arcosg-ud-test.conllu
@@ -60,9 +60,9 @@
1 bha bi VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 0 root _ _
2 a' an DET Tdsf Definite=Def|Gender=Fem|Number=Sing|PronType=Art 3 det _ _
3 chuid cuid NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 1 nsubj _ _
- 4 a a PART Q-r PartType=Vb|PronType=Rel 6 nsubj _ _
+ 4 a a PART Q-r PartType=Vb|PronType=Rel 6 mark:prt _ _
5 b' is AUX Ws Tense=Past 6 cop _ _
- 6 fhearr math ADJ Apc Degree=Cmp,Sup 3 acl:relcl _ _
+ 6 fhearr math ADJ Apc Degree=Cmp,Sup 3 amod _ _
7 an an DET Tdsm Definite=Def|Gender=Masc|Number=Sing|PronType=Art 9 det _ _
8 dà dà NUM Mc NumForm=Word|NumType=Card 9 nummod _ _
9 tharbh tarbh NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 12 obj _ _
@@ -2729,11 +2729,12 @@
33 Mhór Mór PROPN Nt _ 32 flat _ SpaceAfter=No
34 . . PUNCT Fe _ 3 punct _ _

+ # comment = Not entirely sure what the right deprel is for node 3.
# sent_id = f03_034
# text = Sean a’ bhliadhna thàinig e dha 'n sgoil an seo, agus cha robh Màiri Anna ceart as a chionn, 's i cho sàmhach, fad-as 'na dòigh aig an àm a chaidh e dh’fhuireach thuca an toiseach.
1 Sean sean PRON Pd PronType=Dem 4 obl:tmod _ _
2 a’ an DET Tdsf Definite=Def|Gender=Fem|Number=Sing|PronType=Art 3 det _ _
- 3 bhliadhna bliadhna NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 1 nsubj _ _
+ 3 bhliadhna bliadhna NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 1 nmod _ _
4 thàinig thig VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 0 root _ _
5 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 4 nsubj _ _
6 dha do ADP Sp _ 8 case _ _
@@ -2862,7 +2863,7 @@

# sent_id = f03_038
# text = Ge b'e dé bu choireach, bha Uilleam trom air an deoch aig an àm.
- 1 Ge ge PRON Uq PronType=Int 5 nsubj _ _
+ 1 Ge ge PRON Uq PronType=Int 5 obl _ _
2 b'e b'e PRON Uq PronType=Int 1 fixed _ _
3 dé dé PRON Uq PronType=Int 1 fixed _ _
4 bu is AUX Ws Tense=Past 5 cop _ _
@@ -4922,7 +4923,7 @@
4 bhròinein bròinein NOUN Ncsmv Case=Voc|Gender=Masc|Number=Sing 12 vocative _ _
5 bhochd bochd ADJ Aq-smv Case=Voc|Gender=Masc|Number=Sing 4 amod _ SpaceAfter=No
6 " " PUNCT Fz _ 7 punct _ _
- 7 ars' arsa VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 4 dep _ _
+ 7 ars' arsa VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 4 parataxis _ _
8 ise i PRON Pp3sf-e Form=Emp|Gender=Fem|Number=Sing|Person=3|PronType=Prs 7 nsubj _ SpaceAfter=No
9 , , PUNCT Fi _ 12 punct _ _
10 " " PUNCT Fq _ 12 punct _ SpaceAfter=No
@@ -5207,7 +5208,7 @@
26 ri ri ADP Sp _ 28 case _ _
27 am an DET Tdsm Definite=Def|Gender=Masc|Number=Sing|PronType=Art 28 det _ _
28 fear fear NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 24 obl _ _
- 29 a a PART Q-r PartType=Vb|PronType=Rel 31 nsubj _ _
+ 29 a a PART Q-r PartType=Vb|PronType=Rel 31 mark:prt _ _
30 b' is AUX Ws Tense=Past 31 cop _ _
31 òige òg ADJ Apc Degree=Cmp,Sup 28 amod _ _
32 dhe de ADP Sp _ 34 case _ _
@@ -5810,6 +5811,7 @@
30 . . PUNCT Fe _ 1 punct _ _

# comment = what is going on with node 3?
+ # comment = Node 21 is a bit odd too but I think it's fine and not an error.
# sent_id = n01_035
# text = Cha robh ach, thug e a aghaidh air an doras agus chunnaic e an fhuil a's an dol seachad e agus chuir e fhèin a leithid eile ann.
1 Cha cha PART Qn PartType=Vb|Polarity=Neg 2 mark:prt _ _