eval.log
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_Czech-CLTT:
commit 347c7fa68fa8728bfa81bfb1d5098b7360acb01b
Merge: 07d2b40 edd86ec
Author: Dan Zeman <[email protected]>
Size: counted 36013 of 36013 words (nodes).
Size: max(0, log((N/1000)**2)) = 7.16775996876459.
Size: maximum value 13.815511 is for 1000000 words or more.
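The size figures above can be reproduced with a short sketch. It assumes that `log` is the natural logarithm, that the value is clamped at zero from below (the logged value is positive, so the clamp is inactive here), and that the size score is the logged value divided by the stated maximum. The function names are hypothetical and are not taken from the UD tools.

```python
import math

def raw_size_value(n_words: int) -> float:
    """Natural log of (N/1000)^2, never allowed to drop below zero (assumed)."""
    if n_words <= 0:
        return 0.0
    return max(0.0, math.log((n_words / 1000) ** 2))

def size_score(n_words: int, cap_words: int = 1_000_000) -> float:
    """Raw size value normalized by its value at cap_words, capped at 1 (assumed)."""
    cap = math.log((cap_words / 1000) ** 2)  # about 13.815511 for 1,000,000 words
    return min(raw_size_value(n_words), cap) / cap

print(raw_size_value(36013))  # roughly 7.1678, as in the log line above
print(size_score(36013))      # roughly 0.5188, the size score used later
```

Under these assumptions, 36013 words give about 52% of the maximum size credit, which agrees with the `score{size}` value in the weighted sum at the end of the log.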
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 15 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 26559 out of 36013 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 28 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
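The tag and relation scores that appear in the weighted sum at the end of this log are consistent with multiplying the inventory coverage by the annotation-source factor. The sketch below shows that arithmetic. The function name is hypothetical, and the formula is inferred from the logged numbers rather than taken from the evaluation script.

```python
def coverage_score(found: int, inventory: int, source_factor: float) -> float:
    """Fraction of the inventory seen in the corpus, scaled by the source factor (assumed)."""
    return found / inventory * source_factor

tags = coverage_score(15, 17, 0.8)      # universal POS tags
udeprels = coverage_score(28, 37, 0.8)  # universal relations
print(tags, udeprels)                   # roughly 0.7059 and 0.6054
```

The features and lemmas scores equal the bare factor 0.8 in this log, so they evidently follow a different rule that this sketch does not attempt to model.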
Udapi:
TOTAL 40
Udapi: found 40 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 36013 words.
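The Udapi score used later in the log matches a simple linear penalty: the number of bugs divided by the threshold of one bug per 10 words, subtracted from 1. The sketch below assumes that formula and a floor at zero. The function name is hypothetical.

```python
def udapi_score(bugs: int, n_words: int, words_per_bug: int = 10) -> float:
    """1 minus the ratio of found bugs to the worst expected number of bugs (assumed)."""
    worst_case = n_words / words_per_bug  # 3601.3 bugs for 36013 words
    return max(0.0, 1.0 - bugs / worst_case)

print(udapi_score(40, 36013))  # roughly 0.9889
```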
Genres: found 1 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang cs --max-err=10 UD_Czech-CLTT/cs_cltt-ud-dev.conllu
[Line 9902 Sent zakon.iso-002-p8s6 Node 7]: [L3 Warning leaf-det] 'det' not expected to have children (7:tomu:det --> 18:neslučitelné:acl)
[Line 9995 Sent zakon.iso-002-p8s8 Node 10]: [L3 Warning leaf-det] 'det' not expected to have children (10:tom:det --> 17:nastává:acl)
Warnings: 2
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang cs --max-err=10 UD_Czech-CLTT/cs_cltt-ud-test.conllu
[Line 3549 Sent zakon.iso-004-p25s8 Node 13]: [L3 Warning leaf-det] 'det' not expected to have children (13:tom:det --> 21:schváleny:amod)
[Line 4767 Sent zakon.iso-004-p29s6 Node 7]: [L3 Warning leaf-det] 'det' not expected to have children (7:některé:det --> 11:jednotek:nmod)
Warnings: 2
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang cs --max-err=10 UD_Czech-CLTT/cs_cltt-ud-train.conllu
[Line 8940 Sent vyhlaska.iso-004-p48s7 Node 14]: [L3 Warning leaf-det] 'det' not expected to have children (14:každé:det --> 17:jednotek:nmod)
Warnings: 1
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.518819767006915) = 0.133030709488952
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.705882352941177) = 0.0542986425339367
(weight=0.307692307692308) * (score{udapi}=0.988892899786188) = 0.30427473839575
(weight=0.0769230769230769) * (score{udeprels}=0.605405405405405) = 0.0465696465696466
(TOTAL score=0.717057598225088) * (availability=1) * (validity=1) = 0.717057598225088
STARS = 3.5
UD_Czech-CLTT 0.717057598225088 3.5
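The final score is a weighted sum of the eight component scores, multiplied by availability and validity. The sketch below recomputes it from the values printed in this log. The weights are written as fractions of 39, which is what the printed decimals correspond to. The half-star rounding rule is an assumption that happens to reproduce `STARS = 3.5`, and the variable and function names are hypothetical.

```python
# (weight, score) pairs copied from the log; weights expressed as n/39.
components = {
    "features": (3 / 39, 0.8),
    "genres":   (3 / 39, 0.0588235294117647),
    "lemmas":   (3 / 39, 0.8),
    "size":     (10 / 39, 0.518819767006915),
    "split":    (2 / 39, 1.0),
    "tags":     (3 / 39, 0.705882352941177),
    "udapi":    (12 / 39, 0.988892899786188),
    "udeprels": (3 / 39, 0.605405405405405),
}

def total_score(parts, availability=1.0, validity=1.0):
    """Weighted sum of component scores, gated by availability and validity."""
    return sum(w * s for w, s in parts.values()) * availability * validity

def stars(score):
    """Round score * 5 to the nearest half star (assumed rounding rule)."""
    return int(score * 10 + 0.5) / 2

total = total_score(components)
print(total)         # roughly 0.7171
print(stars(total))  # 3.5
```

Because validity is a multiplier, a treebank that failed validation would score zero regardless of the other components.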