forked from UniversalDependencies/UD_Turkish-PUD
eval.log
Running the following version of UD tools:
commit 13e6b709a8bc643c3f902800321a7beda46feb8d
Author: Dan Zeman <[email protected]>
Date: Sun Nov 13 22:03:41 2022 +0100
Evaluating the following revision of UD_Turkish-PUD:
commit d939ced3b6999e7aa0dc9f63d7d7a564659b47ce
Author: Dan Zeman <[email protected]>
Date: Sat May 14 15:33:14 2022 +0200
Size: counted 16882 of 16882 words (nodes).
Size: min(0, log((N/1000)**2)) = 5.65249593112306.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Did not find more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.4.
Universal POS tags: 16 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 12555 out of 16882 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 33 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 2376
Udapi: found 2376 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 16882 words.
Genres: found 2 out of 17 known.
validate.py --lang tr --max-err=10 UD_Turkish-PUD/tr_pud-ud-test.conllu
[Line 3872 Sent n01068029 Node 13]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [2, 6]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 12181 Sent w01072079 Node 13]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [1, 12]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 19215 Sent w02007008 Node 4]: [L3 Morpho goeswith-missing-typo] Since the treebank has morphological features, 'Typo=Yes' must be used with 'goeswith' heads.
[Line 19216 Sent w02007008 Node 5]: [L3 Morpho goeswith-feats] The morphological features of a 'goeswith'-connected word must be annotated only at the first part.
Morpho errors: 2
Syntax errors: 2
*** FAILED *** with 4 errors
Exit code: 1
Validity: 0.01
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.117647058823529) = 0.00904977375565611
(weight=0.0769230769230769) * (score{lemmas}=0.4) = 0.0307692307692308
(weight=0.256410256410256) * (score{size}=0.409141298644555) = 0.104908025293476
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=0.752941176470588) = 0.0579185520361991
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308
(weight=0.0769230769230769) * (score{udeprels}=0.891891891891892) = 0.0686070686070686
(TOTAL score=0.353303932512912) * (availability=1) * (validity=0.01) = 0.00353303932512912
STARS = 0
UD_Turkish-PUD 0.00353303932512912 0
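
The arithmetic in the score lines above can be re-checked with a short script. This is a minimal sketch, not the evaluation tool's own code: the weights and per-dimension scores are copied from the log, while the function names (`size_score`, `total_score`) are hypothetical. The Size line is printed as min(0, ...), yet the reported value 5.65 is positive, so the sketch assumes the raw size term is floored at zero and then divided by the stated maximum for 1000000 words.

```python
import math

# Weights and per-dimension scores copied verbatim from the log lines above.
WEIGHTS = {
    "features": 0.0769230769230769,
    "genres": 0.0769230769230769,
    "lemmas": 0.0769230769230769,
    "size": 0.256410256410256,
    "split": 0.0512820512820513,
    "tags": 0.0769230769230769,
    "udapi": 0.307692307692308,
    "udeprels": 0.0769230769230769,
}
SCORES = {
    "features": 0.8,
    "genres": 0.117647058823529,
    "lemmas": 0.4,
    "size": 0.409141298644555,
    "split": 0.34,
    "tags": 0.752941176470588,
    "udapi": 0.01,
    "udeprels": 0.891891891891892,
}


def size_score(n_words, cap_words=1_000_000):
    """Assumed size score: log((N/1000)**2), floored at 0, scaled to [0, 1]."""
    if n_words <= 0:
        return 0.0
    raw = max(0.0, math.log((n_words / 1000) ** 2))
    # The log states the maximum (13.815511) is reached at 1000000 words.
    return min(1.0, raw / math.log(cap_words))


def total_score(weights, scores):
    """Weighted sum over all scoring dimensions."""
    return sum(weights[name] * scores[name] for name in weights)


total = total_score(WEIGHTS, SCORES)
# Availability and validity are the multipliers shown on the TOTAL line.
final = total * 1 * 0.01

print(f"size score  = {size_score(16882):.6f}")
print(f"total score = {total:.6f}")
print(f"final score = {final:.8f}")
```

Because validity multiplies the whole total, the four validation errors reported by validate.py reduce the final score by a factor of 100, regardless of the other dimensions.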