Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NumType=Card tokens missing NumForm annotations #22

Open
rhdunn opened this issue Oct 25, 2023 · 19 comments
Open

NumType=Card tokens missing NumForm annotations #22

rhdunn opened this issue Oct 25, 2023 · 19 comments

Comments

@rhdunn
Copy link

rhdunn commented Oct 25, 2023

Validation issues:

ERROR: Sentence n01003007 token 2 -- NumType=Card should be paired with NumForm=Digit for form '5,000'
ERROR: Sentence n01004017 token 9 -- NumType=Card should be paired with NumForm=Digit for form '4'
ERROR: Sentence n01004017 token 17 -- NumType=Card should be paired with NumForm=Digit for form '8'
ERROR: Sentence n01005023 token 7 -- invalid NUM with NumType=Card form '103.7'
ERROR: Sentence n01005023 token 8 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n01005023 token 20 -- NumType=Card should be paired with NumForm=Digit for form '2004'
ERROR: Sentence n01005031 token 11 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence n01008017 token 12 -- NumType=Card should be paired with NumForm=Digit for form '11'
ERROR: Sentence n01008017 token 24 -- NumType=Card should be paired with NumForm=Digit for form '1996'
ERROR: Sentence n01011004 token 11 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01012003 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01012003 token 11 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01014003 token 23 -- NumType=Card should be paired with NumForm=Digit for form '2035'
ERROR: Sentence n01014012 token 22 -- NumType=Card should be paired with NumForm=Digit for form '2014'
ERROR: Sentence n01015036 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2007'
ERROR: Sentence n01015036 token 23 -- NumType=Card should be paired with NumForm=Digit for form '50'
ERROR: Sentence n01015036 token 27 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence n01016032 token 3 -- NumType=Card should be paired with NumForm=Digit for form '9'
ERROR: Sentence n01017005 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01019004 token 2 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01021011 token 13 -- NumType=Card should be paired with NumForm=Digit for form '21'
ERROR: Sentence n01022016 token 8 -- NumType=Card should be paired with NumForm=Digit for form '6'
ERROR: Sentence n01022016 token 11 -- NumType=Card should be paired with NumForm=Digit for form '2015'
ERROR: Sentence n01022016 token 14 -- invalid NUM with NumType=Card form '221bn'
ERROR: Sentence n01022027 token 20 -- invalid NUM with NumType=Card form '1.5'
ERROR: Sentence n01024010 token 3 -- NumType=Card should be paired with NumForm=Digit for form '70'
ERROR: Sentence n01024010 token 15 -- NumType=Card should be paired with NumForm=Digit for form '17'
ERROR: Sentence n01024010 token 31 -- NumType=Card should be paired with NumForm=Digit for form '66'
ERROR: Sentence n01024013 token 23 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence n01029006 token 17 -- NumType=Card should be paired with NumForm=Digit for form '2014'
ERROR: Sentence n01031005 token 12 -- NumType=Card should be paired with NumForm=Digit for form '20'
ERROR: Sentence n01031021 token 8 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01035025 token 17 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01036033 token 14 -- NumType=Card should be paired with NumForm=Digit for form '25,000'
ERROR: Sentence n01043005 token 23 -- invalid NUM with NumType=Card form '1.5'
ERROR: Sentence n01043005 token 24 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n01043005 token 32 -- NumType=Card should be paired with NumForm=Digit for form '15,000'
ERROR: Sentence n01043014 token 8 -- invalid NUM with NumType=Card form '1.4'
ERROR: Sentence n01043014 token 9 -- NumType=Card should be paired with NumForm=Word for form 'billion'
ERROR: Sentence n01043014 token 16 -- NumType=Card should be paired with NumForm=Digit for form '6,000'
ERROR: Sentence n01043025 token 24 -- NumType=Card should be paired with NumForm=Digit for form '2013'
ERROR: Sentence n01043025 token 26 -- NumType=Card should be paired with NumForm=Digit for form '2014'
ERROR: Sentence n01043025 token 29 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01043027 token 12 -- invalid NUM with NumType=Card form '1.5'
ERROR: Sentence n01043027 token 13 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n01043027 token 21 -- NumType=Card should be paired with NumForm=Digit for form '2015'
ERROR: Sentence n01043027 token 23 -- NumType=Card should be paired with NumForm=Digit for form '2016'
ERROR: Sentence n01044004 token 15 -- NumType=Card should be paired with NumForm=Digit for form '2004'
ERROR: Sentence n01044004 token 17 -- NumType=Card should be paired with NumForm=Digit for form '2006'
ERROR: Sentence n01044009 token 8 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01044009 token 13 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence n01050009 token 6 -- NumType=Card should be paired with NumForm=Word for form 'eight'
ERROR: Sentence n01051007 token 13 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01051007 token 16 -- NumType=Card should be paired with NumForm=Digit for form '35,000'
ERROR: Sentence n01052004 token 3 -- NumType=Card should be paired with NumForm=Digit for form '84'
ERROR: Sentence n01053008 token 9 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01053036 token 3 -- NumType=Card should be paired with NumForm=Digit for form '20'
ERROR: Sentence n01058052 token 12 -- NumType=Card should be paired with NumForm=Digit for form '2013'
ERROR: Sentence n01061041 token 6 -- NumType=Card should be paired with NumForm=Digit for form '2016'
ERROR: Sentence n01066045 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2010'
ERROR: Sentence n01068029 token 13 -- NumType=Card should be paired with NumForm=Digit for form '62'
ERROR: Sentence n01069004 token 21 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence n01069006 token 11 -- NumType=Card should be paired with NumForm=Digit for form '2010'
ERROR: Sentence n01070016 token 3 -- NumType=Card should be paired with NumForm=Digit for form '330'
ERROR: Sentence n01071009 token 19 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01072010 token 4 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence n01072021 token 1 -- NumType=Card should be paired with NumForm=Word for form 'One'
ERROR: Sentence n01074011 token 3 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01075028 token 14 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence n01079065 token 16 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01084008 token 7 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence n01084008 token 12 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence n01084008 token 16 -- NumType=Card should be paired with NumForm=Digit for form '12,000'
ERROR: Sentence n01084008 token 19 -- NumType=Card should be paired with NumForm=Digit for form '360'
ERROR: Sentence n01084023 token 8 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence n01084023 token 14 -- NumType=Card should be paired with NumForm=Digit for form '3,000'
ERROR: Sentence n01084023 token 17 -- NumType=Card should be paired with NumForm=Digit for form '5,000'
ERROR: Sentence n01084045 token 2 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence n01084045 token 15 -- NumType=Card should be paired with NumForm=Digit for form '15,001'
ERROR: Sentence n01084045 token 18 -- NumType=Card should be paired with NumForm=Digit for form '19,999'
ERROR: Sentence n01084045 token 24 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01084045 token 26 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence n01085008 token 4 -- NumType=Card should be paired with NumForm=Digit for form '2016'
ERROR: Sentence n01090004 token 12 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01090037 token 29 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01092025 token 4 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01093007 token 14 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01094014 token 2 -- NumType=Card should be paired with NumForm=Digit for form '50'
ERROR: Sentence n01095019 token 2 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence n01098041 token 5 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01099035 token 9 -- invalid NUM with NumType=Card form '6.30'
ERROR: Sentence n01099035 token 11 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01101003 token 14 -- NumType=Card should be paired with NumForm=Digit for form '1997'
ERROR: Sentence n01104011 token 18 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence n01104019 token 9 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence n01107006 token 22 -- invalid NUM with NumType=Card form '16bn'
ERROR: Sentence n01107010 token 8 -- NumType=Card should be paired with NumForm=Digit for form '2015'
ERROR: Sentence n01109012 token 8 -- NumType=Card should be paired with NumForm=Digit for form '760'
ERROR: Sentence n01109012 token 17 -- NumType=Card should be paired with NumForm=Digit for form '760'
ERROR: Sentence n01110006 token 8 -- NumType=Card should be paired with NumForm=Digit for form '10,000'
ERROR: Sentence n01110006 token 16 -- NumType=Card should be paired with NumForm=Digit for form '125'
ERROR: Sentence n01111021 token 21 -- invalid NUM with NumType=Card form '2bn'
ERROR: Sentence n01111021 token 24 -- invalid NUM with NumType=Card form '1.4bn'
ERROR: Sentence n01112014 token 15 -- NumType=Card should be paired with NumForm=Digit for form '2050'
ERROR: Sentence n01114025 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Four'
ERROR: Sentence n01114025 token 3 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01114025 token 13 -- NumType=Card should be paired with NumForm=Word for form 'nine'
ERROR: Sentence n01114025 token 15 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01115005 token 7 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence n01115013 token 19 -- NumType=Card should be paired with NumForm=Digit for form '2017'
ERROR: Sentence n01118010 token 14 -- NumType=Card should be paired with NumForm=Word for form 'eight'
ERROR: Sentence n01119012 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01121051 token 3 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01123012 token 6 -- NumType=Card should be paired with NumForm=Digit for form '31'
ERROR: Sentence n01123012 token 8 -- NumType=Card should be paired with NumForm=Digit for form '1832'
ERROR: Sentence n01127008 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Two'
ERROR: Sentence n01127008 token 8 -- NumType=Card should be paired with NumForm=Digit for form '31'
ERROR: Sentence n01127008 token 22 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01127008 token 27 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01127130 token 8 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01130003 token 5 -- NumType=Card should be paired with NumForm=Word for form 'zero'
ERROR: Sentence n01131007 token 3 -- invalid NUM with NumType=Card form '5.7'
ERROR: Sentence n01131007 token 4 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n01131007 token 13 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01131013 token 20 -- NumType=Card should be paired with NumForm=Digit for form '16,500'
ERROR: Sentence n01132013 token 6 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01133022 token 16 -- NumType=Card should be paired with NumForm=Digit for form '2017'
ERROR: Sentence n01133022 token 18 -- NumType=Card should be paired with NumForm=Digit for form '2020'
ERROR: Sentence n01134005 token 9 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence n01138007 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01138017 token 4 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01142007 token 14 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01142008 token 12 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence n01143003 token 16 -- NumType=Card should be paired with NumForm=Word for form 'Nine'
ERROR: Sentence n01143009 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Two'
ERROR: Sentence n01144021 token 10 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n01145015 token 5 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n01145028 token 4 -- NumType=Card should be paired with NumForm=Digit for form '1200'
ERROR: Sentence n01149002 token 2 -- NumType=Card should be paired with NumForm=Digit for form '100'
ERROR: Sentence n01149002 token 5 -- NumType=Card should be paired with NumForm=Digit for form '328'
ERROR: Sentence n01149002 token 14 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01002075 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2019'
ERROR: Sentence w01005022 token 5 -- NumType=Card should be paired with NumForm=Digit for form '512'
ERROR: Sentence w01005022 token 7 -- NumType=Card should be paired with NumForm=Digit for form '511'
ERROR: Sentence w01005022 token 14 -- NumType=Card should be paired with NumForm=Roman for form 'I'
ERROR: Sentence w01006027 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1912'
ERROR: Sentence w01006027 token 15 -- NumType=Card should be paired with NumForm=Digit for form '1916'
ERROR: Sentence w01006081 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2007'
ERROR: Sentence w01007004 token 3 -- NumType=Card should be paired with NumForm=Digit for form '1918'
ERROR: Sentence w01009017 token 26 -- NumType=Card should be paired with NumForm=Digit for form '1973'
ERROR: Sentence w01010046 token 2 -- NumType=Card should be paired with NumForm=Digit for form '833'
ERROR: Sentence w01010047 token 6 -- NumType=Card should be paired with NumForm=Roman for form 'I'
ERROR: Sentence w01010047 token 9 -- NumType=Card should be paired with NumForm=Digit for form '830'
ERROR: Sentence w01010047 token 11 -- NumType=Card should be paired with NumForm=Digit for form '846'
ERROR: Sentence w01010048 token 9 -- NumType=Card should be paired with NumForm=Roman for form 'I'
ERROR: Sentence w01013083 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1918'
ERROR: Sentence w01016028 token 12 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01016070 token 13 -- NumType=Card should be paired with NumForm=Digit for form '2008'
ERROR: Sentence w01017004 token 28 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01018029 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence w01018029 token 4 -- NumType=Card should be paired with NumForm=Digit for form '1961'
ERROR: Sentence w01018101 token 14 -- NumType=Card should be paired with NumForm=Digit for form '56'
ERROR: Sentence w01018101 token 19 -- NumType=Card should be paired with NumForm=Digit for form '5'
ERROR: Sentence w01018101 token 21 -- NumType=Card should be paired with NumForm=Digit for form '14'
ERROR: Sentence w01018101 token 28 -- NumType=Card should be paired with NumForm=Digit for form '53'
ERROR: Sentence w01018101 token 33 -- NumType=Card should be paired with NumForm=Digit for form '7'
ERROR: Sentence w01018101 token 35 -- NumType=Card should be paired with NumForm=Digit for form '14'
ERROR: Sentence w01019073 token 5 -- NumType=Card should be paired with NumForm=Digit for form '2010'
ERROR: Sentence w01020019 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Two'
ERROR: Sentence w01023120 token 14 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01026024 token 12 -- NumType=Card should be paired with NumForm=Digit for form '100'
ERROR: Sentence w01026024 token 13 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01026024 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1987'
ERROR: Sentence w01029015 token 15 -- invalid NUM with NumType=Card form 'yellowish'
ERROR: Sentence w01030093 token 7 -- NumType=Card should be paired with NumForm=Digit for form '11'
ERROR: Sentence w01030093 token 21 -- NumType=Card should be paired with NumForm=Digit for form '90'
ERROR: Sentence w01030094 token 7 -- NumType=Card should be paired with NumForm=Digit for form '80'
ERROR: Sentence w01030095 token 9 -- NumType=Card should be paired with NumForm=Digit for form '500'
ERROR: Sentence w01030095 token 18 -- NumType=Card should be paired with NumForm=Digit for form '2900'
ERROR: Sentence w01033025 token 8 -- NumType=Card should be paired with NumForm=Digit for form '1975'
ERROR: Sentence w01033067 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1000'
ERROR: Sentence w01035082 token 4 -- NumType=Card should be paired with NumForm=Digit for form '50'
ERROR: Sentence w01035082 token 17 -- NumType=Card should be paired with NumForm=Digit for form '40'
ERROR: Sentence w01037024 token 13 -- NumType=Card should be paired with NumForm=Word for form 'seven'
ERROR: Sentence w01037080 token 13 -- NumType=Card should be paired with NumForm=Digit for form '1492'
ERROR: Sentence w01038016 token 11 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01039065 token 1 -- NumType=Card should be paired with NumForm=Digit for form '1987'
ERROR: Sentence w01042055 token 12 -- NumType=Card should be paired with NumForm=Word for form 'seven'
ERROR: Sentence w01043027 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1340'
ERROR: Sentence w01043027 token 13 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01045002 token 11 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01045003 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1492'
ERROR: Sentence w01045006 token 13 -- NumType=Card should be paired with NumForm=Digit for form '1492'
ERROR: Sentence w01049066 token 9 -- NumType=Card should be paired with NumForm=Digit for form '1879'
ERROR: Sentence w01050067 token 9 -- NumType=Card should be paired with NumForm=Digit for form '1911'
ERROR: Sentence w01050070 token 28 -- NumType=Card should be paired with NumForm=Digit for form '1911'
ERROR: Sentence w01051032 token 9 -- NumType=Card should be paired with NumForm=Roman for form 'II'
ERROR: Sentence w01051080 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1903'
ERROR: Sentence w01052038 token 30 -- NumType=Card should be paired with NumForm=Roman for form 'III'
ERROR: Sentence w01052038 token 32 -- NumType=Card should be paired with NumForm=Digit for form '120'
ERROR: Sentence w01052046 token 19 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01053045 token 9 -- NumType=Card should be paired with NumForm=Word for form 'eighteen'
ERROR: Sentence w01053045 token 15 -- NumType=Card should be paired with NumForm=Word for form 'seventeen'
ERROR: Sentence w01053067 token 15 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01053067 token 16 -- NumType=Card should be paired with NumForm=Word for form 'thousand'
ERROR: Sentence w01058009 token 12 -- NumType=Card should be paired with NumForm=Digit for form '363'
ERROR: Sentence w01058013 token 12 -- NumType=Card should be paired with NumForm=Digit for form '393'
ERROR: Sentence w01060038 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1072'
ERROR: Sentence w01060040 token 23 -- NumType=Card should be paired with NumForm=Digit for form '1075'
ERROR: Sentence w01065018 token 11 -- NumType=Card should be paired with NumForm=Word for form 'ten'
ERROR: Sentence w01065018 token 30 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence w01065019 token 29 -- NumType=Card should be paired with NumForm=Digit for form '1600'
ERROR: Sentence w01065020 token 22 -- NumType=Card should be paired with NumForm=Roman for form 'III'
ERROR: Sentence w01065022 token 14 -- NumType=Card should be paired with NumForm=Word for form 'ten'
ERROR: Sentence w01066006 token 15 -- NumType=Card should be paired with NumForm=Digit for form '1992'
ERROR: Sentence w01067059 token 4 -- NumType=Card should be paired with NumForm=Word for form 'fifteen'
ERROR: Sentence w01068027 token 3 -- NumType=Card should be paired with NumForm=Digit for form '9'
ERROR: Sentence w01068027 token 5 -- NumType=Card should be paired with NumForm=Digit for form '2002'
ERROR: Sentence w01068056 token 5 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01069007 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1519'
ERROR: Sentence w01069007 token 21 -- NumType=Card should be paired with NumForm=Digit for form '1530'
ERROR: Sentence w01069056 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1566'
ERROR: Sentence w01069056 token 8 -- NumType=Card should be paired with NumForm=Digit for form '400'
ERROR: Sentence w01070031 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1900'
ERROR: Sentence w01070031 token 23 -- NumType=Card should be paired with NumForm=Digit for form '700'
ERROR: Sentence w01070033 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Three'
ERROR: Sentence w01070033 token 14 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01070034 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Four'
ERROR: Sentence w01070035 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Two'
ERROR: Sentence w01071036 token 5 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01072065 token 13 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01073067 token 6 -- NumType=Card should be paired with NumForm=Word for form 'thirty'
ERROR: Sentence w01073067 token 8 -- NumType=Card should be paired with NumForm=Word for form 'nine'
ERROR: Sentence w01073067 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1886'
ERROR: Sentence w01073067 token 19 -- NumType=Card should be paired with NumForm=Digit for form '1887'
ERROR: Sentence w01074088 token 7 -- NumType=Card should be paired with NumForm=Digit for form '1610'
ERROR: Sentence w01075037 token 8 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence w01075037 token 21 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence w01075037 token 28 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence w01075037 token 48 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence w01075038 token 5 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01076054 token 7 -- NumType=Card should be paired with NumForm=Digit for form '1916'
ERROR: Sentence w01076054 token 32 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01079049 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1538'
ERROR: Sentence w01079077 token 8 -- NumType=Card should be paired with NumForm=Digit for form '1492'
ERROR: Sentence w01080130 token 9 -- NumType=Card should be paired with NumForm=Digit for form '352'
ERROR: Sentence w01080131 token 16 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01086037 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1563'
ERROR: Sentence w01086037 token 12 -- NumType=Card should be paired with NumForm=Digit for form '168,000'
ERROR: Sentence w01089040 token 8 -- NumType=Card should be paired with NumForm=Word for form 'forty'
ERROR: Sentence w01091016 token 27 -- NumType=Card should be paired with NumForm=Digit for form '5,000'
ERROR: Sentence w01094066 token 22 -- NumType=Card should be paired with NumForm=Digit for form '1960'
ERROR: Sentence w01095091 token 11 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01096013 token 8 -- NumType=Card should be paired with NumForm=Digit for form '3'
ERROR: Sentence w01096013 token 9 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01096013 token 13 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence w01096013 token 14 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01096013 token 22 -- invalid NUM with NumType=Card form '7.5'
ERROR: Sentence w01096013 token 23 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01100046 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1839'
ERROR: Sentence w01100047 token 7 -- NumType=Card should be paired with NumForm=Digit for form '1842'
ERROR: Sentence w01100049 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1856'
ERROR: Sentence w01100049 token 23 -- NumType=Card should be paired with NumForm=Digit for form '1858'
ERROR: Sentence w01100049 token 29 -- NumType=Card should be paired with NumForm=Digit for form '1860'
ERROR: Sentence w01102020 token 26 -- NumType=Card should be paired with NumForm=Digit for form '1945'
ERROR: Sentence w01106052 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1997'
ERROR: Sentence w01107013 token 10 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01108063 token 3 -- NumType=Card should be paired with NumForm=Digit for form '10'
ERROR: Sentence w01108063 token 5 -- NumType=Card should be paired with NumForm=Digit for form '1896'
ERROR: Sentence w01109009 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01109120 token 2 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01111089 token 13 -- NumType=Card should be paired with NumForm=Digit for form '1979'
ERROR: Sentence w01111093 token 6 -- NumType=Card should be paired with NumForm=Digit for form '4'
ERROR: Sentence w01111093 token 8 -- NumType=Card should be paired with NumForm=Digit for form '1988'
ERROR: Sentence w01111093 token 12 -- NumType=Card should be paired with NumForm=Digit for form '19'
ERROR: Sentence w01111093 token 14 -- NumType=Card should be paired with NumForm=Digit for form '1993'
ERROR: Sentence w01112098 token 11 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01112098 token 16 -- NumType=Card should be paired with NumForm=Word for form 'seven'
ERROR: Sentence w01113046 token 6 -- NumType=Card should be paired with NumForm=Word for form 'five'
ERROR: Sentence w01113046 token 10 -- NumType=Card should be paired with NumForm=Digit for form '2013'
ERROR: Sentence w01113046 token 12 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence w01113046 token 16 -- NumType=Card should be paired with NumForm=Digit for form '2014'
ERROR: Sentence w01114053 token 9 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence w01115024 token 8 -- NumType=Card should be paired with NumForm=Digit for form '21'
ERROR: Sentence w01115024 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1882'
ERROR: Sentence w01116036 token 2 -- NumType=Card should be paired with NumForm=Digit for form '3000'
ERROR: Sentence w01116088 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2009'
ERROR: Sentence w01116088 token 22 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01116088 token 23 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01116088 token 25 -- NumType=Card should be paired with NumForm=Word for form 'five'
ERROR: Sentence w01116088 token 26 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence w01117009 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1991'
ERROR: Sentence w01117009 token 4 -- NumType=Card should be paired with NumForm=Digit for form '1997'
ERROR: Sentence w01119059 token 21 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01122031 token 13 -- NumType=Card should be paired with NumForm=Digit for form '50'
ERROR: Sentence w01122064 token 19 -- NumType=Card should be paired with NumForm=Digit for form '1984'
ERROR: Sentence w01124011 token 9 -- NumType=Card should be paired with NumForm=Digit for form '200'
ERROR: Sentence w01124011 token 12 -- NumType=Card should be paired with NumForm=Digit for form '96'
ERROR: Sentence w01124011 token 20 -- NumType=Card should be paired with NumForm=Digit for form '31'
ERROR: Sentence w01124011 token 31 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01125035 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1770'
ERROR: Sentence w01125038 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1770'
ERROR: Sentence w01127071 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1981'
ERROR: Sentence w01127071 token 7 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01128053 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2012'
ERROR: Sentence w01128059 token 6 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w01129053 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2003'
ERROR: Sentence w01129053 token 14 -- NumType=Card should be paired with NumForm=Digit for form '33'
ERROR: Sentence w01129053 token 16 -- NumType=Card should be paired with NumForm=Digit for form '36'
ERROR: Sentence w01129053 token 21 -- NumType=Card should be paired with NumForm=Digit for form '2003'
ERROR: Sentence w01130099 token 3 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01130100 token 8 -- NumType=Card should be paired with NumForm=Digit for form '1992'
ERROR: Sentence w01130101 token 3 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01130102 token 26 -- NumType=Card should be paired with NumForm=Digit for form '1994'
ERROR: Sentence w01130103 token 5 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01130103 token 16 -- NumType=Card should be paired with NumForm=Digit for form '1998'
ERROR: Sentence w01131060 token 3 -- NumType=Card should be paired with NumForm=Digit for form '330,000'
ERROR: Sentence w01131060 token 7 -- NumType=Card should be paired with NumForm=Digit for form '10,000'
ERROR: Sentence w01134062 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1955'
ERROR: Sentence w01134062 token 14 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01134062 token 24 -- NumType=Card should be paired with NumForm=Digit for form '15'
ERROR: Sentence w01135034 token 2 -- NumType=Card should be paired with NumForm=Digit for form '2011'
ERROR: Sentence w01135035 token 5 -- NumType=Card should be paired with NumForm=Digit for form '2011'
ERROR: Sentence w01135037 token 5 -- NumType=Card should be paired with NumForm=Digit for form '2012'
ERROR: Sentence w01135037 token 8 -- NumType=Card should be paired with NumForm=Word for form 'Five'
ERROR: Sentence w01135038 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2011'
ERROR: Sentence w01135038 token 37 -- NumType=Card should be paired with NumForm=Digit for form '2012'
ERROR: Sentence w01137087 token 8 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01140030 token 4 -- NumType=Card should be paired with NumForm=Digit for form '1926'
ERROR: Sentence w01141025 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2013'
ERROR: Sentence w01141137 token 5 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w01142013 token 22 -- NumType=Card should be paired with NumForm=Roman for form 'IV'
ERROR: Sentence w01142013 token 25 -- NumType=Card should be paired with NumForm=Roman for form 'III'
ERROR: Sentence w01142013 token 32 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w01142031 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1399'
ERROR: Sentence w01143037 token 17 -- NumType=Card should be paired with NumForm=Digit for form '1954'
ERROR: Sentence w01144056 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1980'
ERROR: Sentence w01144063 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1952'
ERROR: Sentence w01144063 token 14 -- NumType=Card should be paired with NumForm=Digit for form '34'
ERROR: Sentence w01144084 token 11 -- NumType=Card should be paired with NumForm=Digit for form '11'
ERROR: Sentence w01147010 token 11 -- NumType=Card should be paired with NumForm=Digit for form '1991'
ERROR: Sentence w01147018 token 14 -- NumType=Card should be paired with NumForm=Digit for form '16'
ERROR: Sentence w01147122 token 4 -- NumType=Card should be paired with NumForm=Digit for form '2015'
ERROR: Sentence w01150045 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1912'
ERROR: Sentence w01150045 token 15 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence w01150048 token 8 -- NumType=Card should be paired with NumForm=Digit for form '30'
ERROR: Sentence w01150048 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1913'
ERROR: Sentence n02002007 token 8 -- NumType=Card should be paired with NumForm=Digit for form '53'
ERROR: Sentence n02007010 token 2 -- NumType=Card should be paired with NumForm=Digit for form '71'
ERROR: Sentence n02007010 token 5 -- NumType=Card should be paired with NumForm=Digit for form '137'
ERROR: Sentence n02011003 token 14 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence n02016006 token 3 -- NumType=Card should be paired with NumForm=Digit for form '1'
ERROR: Sentence n02016006 token 15 -- NumType=Card should be paired with NumForm=Digit for form '100'
ERROR: Sentence n02024008 token 17 -- NumType=Card should be paired with NumForm=Digit for form '2'
ERROR: Sentence n02027021 token 34 -- NumType=Card should be paired with NumForm=Digit for form '2000'
ERROR: Sentence n02042005 token 4 -- NumType=Card should be paired with NumForm=Digit for form '30'
ERROR: Sentence n02042005 token 6 -- NumType=Card should be paired with NumForm=Digit for form '2015'
ERROR: Sentence n02042028 token 10 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence n02044009 token 10 -- NumType=Card should be paired with NumForm=Digit for form '28'
ERROR: Sentence n02066010 token 17 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence n02073024 token 15 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n02074009 token 34 -- NumType=Card should be paired with NumForm=Digit for form '20'
ERROR: Sentence n02074009 token 36 -- NumType=Card should be paired with NumForm=Digit for form '2001'
ERROR: Sentence n02079042 token 2 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n02083054 token 33 -- NumType=Card should be paired with NumForm=Digit for form '2005'
ERROR: Sentence n03001030 token 10 -- invalid NUM with NumType=Card form '23.45'
ERROR: Sentence n03002010 token 16 -- NumType=Card should be paired with NumForm=Word for form 'twenty'
ERROR: Sentence n03003036 token 13 -- NumType=Card should be paired with NumForm=Digit for form '367'
ERROR: Sentence n03003036 token 18 -- NumType=Card should be paired with NumForm=Digit for form '550'
ERROR: Sentence n03003036 token 22 -- NumType=Card should be paired with NumForm=Digit for form '330'
ERROR: Sentence n03004003 token 35 -- NumType=Card should be paired with NumForm=Digit for form '2014'
ERROR: Sentence n03007006 token 2 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence n03007011 token 15 -- NumType=Card should be paired with NumForm=Digit for form '45'
ERROR: Sentence n03010012 token 12 -- NumType=Card should be paired with NumForm=Digit for form '42'
ERROR: Sentence n03010012 token 13 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n03010012 token 18 -- invalid NUM with NumType=Card form '15.5'
ERROR: Sentence n03010012 token 19 -- NumType=Card should be paired with NumForm=Word for form 'million'
ERROR: Sentence n04001002 token 11 -- NumType=Card should be paired with NumForm=Digit for form '1927'
ERROR: Sentence n04005003 token 7 -- NumType=Card should be paired with NumForm=Digit for form '500'
ERROR: Sentence n04005003 token 19 -- NumType=Card should be paired with NumForm=Digit for form '2017'
ERROR: Sentence n04006004 token 20 -- NumType=Card should be paired with NumForm=Digit for form '2017'
ERROR: Sentence n04006014 token 20 -- NumType=Card should be paired with NumForm=Digit for form '1,335'
ERROR: Sentence n04006014 token 31 -- NumType=Card should be paired with NumForm=Digit for form '1,165'
ERROR: Sentence n04006016 token 23 -- NumType=Card should be paired with NumForm=Digit for form '1,365'
ERROR: Sentence n04009002 token 1 -- NumType=Card should be paired with NumForm=Word for form 'Four'
ERROR: Sentence n04009003 token 1 -- NumType=Card should be paired with NumForm=Word for form 'One'
ERROR: Sentence n04010006 token 8 -- NumType=Card should be paired with NumForm=Word for form 'billion'
ERROR: Sentence n05002004 token 8 -- NumType=Card should be paired with NumForm=Word for form 'six'
ERROR: Sentence n05002004 token 17 -- NumType=Card should be paired with NumForm=Digit for form '20'
ERROR: Sentence n05002004 token 24 -- NumType=Card should be paired with NumForm=Digit for form '600,000'
ERROR: Sentence n05003010 token 10 -- NumType=Card should be paired with NumForm=Digit for form '100'
ERROR: Sentence w02001041 token 3 -- NumType=Card should be paired with NumForm=Digit for form '1925'
ERROR: Sentence w02001069 token 8 -- NumType=Card should be paired with NumForm=Digit for form '30'
ERROR: Sentence w02001069 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1955'
ERROR: Sentence w02002032 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1882'
ERROR: Sentence w02002032 token 9 -- NumType=Card should be paired with NumForm=Digit for form '34'
ERROR: Sentence w02002093 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1992'
ERROR: Sentence w02002120 token 6 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w02003037 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1832'
ERROR: Sentence w02004021 token 17 -- NumType=Card should be paired with NumForm=Word for form 'Thirty'
ERROR: Sentence w02005026 token 8 -- NumType=Card should be paired with NumForm=Word for form 'four'
ERROR: Sentence w02006081 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2010'
ERROR: Sentence w02006081 token 15 -- NumType=Card should be paired with NumForm=Word for form 'seven'
ERROR: Sentence w02007032 token 3 -- NumType=Card should be paired with NumForm=Digit for form '2012'
ERROR: Sentence w02008038 token 21 -- NumType=Card should be paired with NumForm=Digit for form '3300'
ERROR: Sentence w02013076 token 11 -- NumType=Card should be paired with NumForm=Digit for form '1933'
ERROR: Sentence w02013093 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1917'
ERROR: Sentence w02014013 token 1 -- NumType=Card should be paired with NumForm=Word for form 'One'
ERROR: Sentence w02015086 token 8 -- NumType=Card should be paired with NumForm=Word for form 'Thirty'
ERROR: Sentence w02015086 token 13 -- NumType=Card should be paired with NumForm=Digit for form '1632'
ERROR: Sentence w02016015 token 27 -- NumType=Card should be paired with NumForm=Digit for form '2002'
ERROR: Sentence w02016060 token 6 -- NumType=Card should be paired with NumForm=Digit for form '2004'
ERROR: Sentence w02019085 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1977'
ERROR: Sentence w03002048 token 8 -- NumType=Card should be paired with NumForm=Word for form 'five'
ERROR: Sentence w03002048 token 11 -- NumType=Card should be paired with NumForm=Digit for form '100,000'
ERROR: Sentence w03002055 token 7 -- NumType=Card should be paired with NumForm=Digit for form '2008'
ERROR: Sentence w03002055 token 12 -- NumType=Card should be paired with NumForm=Word for form 'fifty'
ERROR: Sentence w03003023 token 10 -- NumType=Card should be paired with NumForm=Digit for form '1820'
ERROR: Sentence w03003023 token 12 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w03003023 token 20 -- NumType=Card should be paired with NumForm=Word for form 'ten'
ERROR: Sentence w03003023 token 28 -- NumType=Card should be paired with NumForm=Digit for form '1820'
ERROR: Sentence w03003039 token 6 -- NumType=Card should be paired with NumForm=Digit for form '1914'
ERROR: Sentence w03003039 token 22 -- NumType=Card should be paired with NumForm=Word for form 'twenty'
ERROR: Sentence w03003039 token 23 -- NumType=Card should be paired with NumForm=Word for form 'eight'
ERROR: Sentence w03008029 token 4 -- NumType=Card should be paired with NumForm=Roman for form 'III'
ERROR: Sentence w03008029 token 9 -- NumType=Card should be paired with NumForm=Digit for form '1794'
ERROR: Sentence w03009029 token 15 -- NumType=Card should be paired with NumForm=Digit for form '1994'
ERROR: Sentence w03009044 token 5 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w03010096 token 17 -- NumType=Card should be paired with NumForm=Digit for form '1947'
ERROR: Sentence w03010098 token 3 -- NumType=Card should be paired with NumForm=Digit for form '1948'
ERROR: Sentence w03010099 token 18 -- NumType=Card should be paired with NumForm=Digit for form '18'
ERROR: Sentence w03010099 token 26 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w04001027 token 8 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w04002048 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1928'
ERROR: Sentence w04002048 token 5 -- NumType=Card should be paired with NumForm=Digit for form '90'
ERROR: Sentence w04002055 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1976'
ERROR: Sentence w04002055 token 12 -- NumType=Card should be paired with NumForm=Digit for form '1990'
ERROR: Sentence w04003025 token 3 -- NumType=Card should be paired with NumForm=Digit for form '1969'
ERROR: Sentence w04004005 token 17 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w04009042 token 10 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w04010029 token 12 -- NumType=Card should be paired with NumForm=Word for form 'one'
ERROR: Sentence w04010030 token 14 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w04010031 token 13 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w05001026 token 7 -- NumType=Card should be paired with NumForm=Digit for form '1904'
ERROR: Sentence w05001026 token 9 -- NumType=Card should be paired with NumForm=Digit for form '1914'
ERROR: Sentence w05001026 token 19 -- NumType=Card should be paired with NumForm=Word for form 'ten'
ERROR: Sentence w05001036 token 3 -- NumType=Card should be paired with NumForm=Word for form 'three'
ERROR: Sentence w05003012 token 25 -- NumType=Card should be paired with NumForm=Digit for form '1879'
ERROR: Sentence w05005083 token 7 -- NumType=Card should be paired with NumForm=Digit for form '1777'
ERROR: Sentence w05007004 token 21 -- NumType=Card should be paired with NumForm=Roman for form 'IV'
ERROR: Sentence w05007080 token 11 -- NumType=Card should be paired with NumForm=Digit for form '1415'
ERROR: Sentence w05009044 token 2 -- NumType=Card should be paired with NumForm=Digit for form '1976'
ERROR: Sentence w05010023 token 5 -- NumType=Card should be paired with NumForm=Word for form 'two'
ERROR: Sentence w05010027 token 5 -- NumType=Card should be paired with NumForm=Digit for form '49'

Note: The numbers such as 7.5 should be NumType=Frac|NumForm=Digit to be consistent with the GUM treebank.

Note: Sentence n01111021 has a form 1.4bn. -- Other treebanks, such as EWT, treat 1.4 and bn as two separate tokens. The bn is NumType=Card|NumForm=Word in EWT.

@AngledLuffa
Copy link
Contributor

Note: Sentence n01111021 has a form 1.4bn. -- Other treebanks, such as EWT, treat 1.4 and bn as two separate tokens. The bn is NumType=Card|NumForm=Word in EWT

Any thought on what to make 16bn? Also split into two separate tokens? I'm not sure changing that tokenization is in our purview

I updated some here

#24

but have not done the Roman words yet

@dan-zeman
Copy link
Member

Any thought on what to make 16bn?

Fixed in 47847f6

@AngledLuffa
Copy link
Contributor

your validation script missed V and X as Roman numerals

@rhdunn
Copy link
Author

rhdunn commented Oct 26, 2023

My validation script does detect V and X. The isssue is that the ones my script didn't identify are PROPN+CD compared to NUM+CD. My script was going on the UPOS tags listed in https://universaldependencies.org/u/feat/NumType.html. I should adjust my check to detect the use of NumType on any UPOS other than PUNCT and SYM.

That does indicate that w05007004 has inconsistent UPOS for the roman numerals for token 15, 18, and 21. Token 21 should really be PROPN to be consistent with the PTB rules that the other treebanks like EWT use.

@AngledLuffa
Copy link
Contributor

Oh, I hadn't even noticed that. I wonder if those are still supposed to have NumForm and NumType when they are of this tag. @nschneid or @amir-zeldes any thoughts on labeling Roman numerals when used as PROPN?

@amir-zeldes
Copy link

Hm, that's another inconsistency between GUM and EWT then, in GUM roman numerals after monarchs, WWII etc. are CD+NUM, not PROPN (the rest of the name is PROPN)

@rhdunn
Copy link
Author

rhdunn commented Oct 26, 2023

Doing a search, it looks like EWT is consistent with GUM in using CD+NUM for these -- e.g. email-enronsent07_01-0045 -- so it makes sense to use that to be consistent. PRON+CD looks like it is only used in the PUD treebank.

@AngledLuffa
Copy link
Contributor

That's pretty easy to update as well. Added that to the previous Roman change:

#25

@AngledLuffa
Copy link
Contributor

Mind rerunning the script on the new dev branch now that we've merged multiple changes?

nschneid added a commit to UniversalDependencies/UD_English-EWT that referenced this issue Oct 26, 2023
@nschneid
Copy link
Contributor

^ fixed the stray EWT cases

@rhdunn
Copy link
Author

rhdunn commented Oct 27, 2023

@AngledLuffa I've published my script at https://github.com/rhdunn/conllu-en-validator.

I now get the following output:

$ ../conllu-en-validator/validate --language en --validator form en_pud-ud-test.conllu | grep -F "NumType=Card"
ERROR: Sentence n01005023 token 7 -- invalid NUM with NumType=Card|NumForm=Digit form '103.7'
ERROR: Sentence n01022027 token 20 -- invalid NUM with NumType=Card|NumForm=Digit form '1.5'
ERROR: Sentence n01043005 token 23 -- invalid NUM with NumType=Card|NumForm=Digit form '1.5'
ERROR: Sentence n01043014 token 8 -- invalid NUM with NumType=Card|NumForm=Digit form '1.4'
ERROR: Sentence n01043027 token 12 -- invalid NUM with NumType=Card|NumForm=Digit form '1.5'
ERROR: Sentence n01099035 token 9 -- invalid NUM with NumType=Card|NumForm=Digit form '6.30'
ERROR: Sentence n01111021 token 25 -- invalid NUM with NumType=Card|NumForm=Digit form '1.4'
ERROR: Sentence n01131007 token 3 -- invalid NUM with NumType=Card|NumForm=Digit form '5.7'
ERROR: Sentence w01029015 token 15 -- invalid NUM with NumType=Card|NumForm=Word form 'yellowish'
ERROR: Sentence w01096013 token 22 -- invalid NUM with NumType=Card|NumForm=Digit form '7.5'
ERROR: Sentence n03001030 token 10 -- invalid NUM with NumType=Card|NumForm=Digit form '23.45'
ERROR: Sentence n03010012 token 18 -- invalid NUM with NumType=Card|NumForm=Digit form '15.5'

Note: Aside from yellowish -- which is an error -- these are because my script is expecting 1.5, etc. to be annotated as NumType=Frac.

@AngledLuffa
Copy link
Contributor

I can change that. Anything other than NumType=Frac or is that the complete feature?

I can also update the tag on yellowish I suppose.

Hopefully my PI is okay with the idea that I spend quite a bit of time during one week once every six months around the next UD deadline @manning

@AngledLuffa
Copy link
Contributor

Quite a few are still tagged with the Card in EWT

29      4.5     4.5     NUM     CD      NumForm=Digit|NumType=Card      30      compound        30:compound     _
30      billion billion NUM     CD      NumForm=Word|NumType=Card       28      nummod  28:nummod       SpaceAfter=No

24      $       $       SYM     $       _       13      parataxis       13:parataxis    SpaceAfter=No
25      13.9    13.9    NUM     CD      NumForm=Digit|NumType=Card      26      compound        26:compound     SpaceAfter=No
26      M       million NUM     CD      Abbr=Yes|NumForm=Word|NumType=Card      24      nummod  24:nummod       _
27      from    from    ADP     IN      _       28      case    28:case _
28      $       $       SYM     $       _       24      nmod    24:nmod:from    SpaceAfter=No
29      11.5    11.5    NUM     CD      NumForm=Digit|NumType=Card      30      compound        30:compound     SpaceAfter=No
30      M       million NUM     CD      Abbr=Yes|NumForm=Word|NumType=Card      28      nummod  28:nummod       SpaceAfter=No

14      May     May     PROPN   NNP     Number=Sing     10      obl     10:obl:on       _
15      30th    30th    NOUN    NN      Number=Sing|NumType=Ord 14      nummod  14:nummod       _
16      @       @       ADP     IN      _       17      case    17:case SpaceAfter=No
17      2.975   2.975   NUM     CD      NumForm=Digit|NumType=Card      10      obl     10:obl  SpaceAfter=No

14      will    will    AUX     MD      VerbForm=Fin    15      aux     15:aux  _
15      last    last    VERB    VB      VerbForm=Inf    3       conj    3:conj:and      _
16      1.5     1.5     NUM     CD      NumForm=Digit|NumType=Card      17      nummod  17:nummod       _
17      hours   hour    NOUN    NNS     Number=Plur     15      obl:tmod        15:obl:tmod     _

and then there's phone numbers:

7       832.676.3177    832.676.3177    NUM     CD      NumForm=Digit|NumType=Card      5       appos   5:appos _

NumType=Frac appears to only occur on written fraction words: half, third, tenth, etc

Calling in the cavalry:

@nschneid @amir-zeldes

@dan-zeman
Copy link
Member

NumType=Frac appears to only occur on written fraction words: half, third, tenth, etc

This is definitely how NumType=Frac was originally meant but I'm not sure if the concensus of English treebank maintainers hasn't shifted towards including 1.5 and such. I'm pretty sure it has been discussed somewhere.

@AngledLuffa
Copy link
Contributor

I do recall that discussion as well. It also appears to be implemented that way in GUM, but not EWT or PUD

@nschneid
Copy link
Contributor

@rhdunn
Copy link
Author

rhdunn commented Oct 27, 2023

The 1.5 etc. forms would be NumType=Frac|NumForm=Digit according to the features, and how they are annotated in GUM.

@rhdunn
Copy link
Author

rhdunn commented Oct 27, 2023

I can easily modify my validation script so that 1.2, etc. are allowed on NumType=Card and to report errors if digit forms with NumType=Frac are used, or whatever the consensus is for these.

@nschneid
Copy link
Contributor

nschneid commented Oct 29, 2023

@AngledLuffa switched EWT to use Frac for decimals like "1.2" in UniversalDependencies/UD_English-EWT@2faee04. Is that the consensus?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants