Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

what are the │_ ? #14

Open
arademaker opened this issue Dec 12, 2022 · 1 comment
Open

what are the │_ ? #14

arademaker opened this issue Dec 12, 2022 · 1 comment

Comments

@arademaker
Copy link

(│_ _open_v_1│mood=INDICATIVE|perf=-|sf=PROP|tense=PAST ARG1-NEQ│_ (│_ _window_n_1│ind=+|num=SG|pers=3 RSTR-H-of│_ (│_ _the_q│_ )│_ )│_ )│_

Do you guys remember what those │_ symbols are? Fig 2 in https://arxiv.org/pdf/1904.11564.pdf does not show them. I didn't find any explanation in the paper about those extra symbols before the predicates. @goodmami ?

@goodmami
Copy link
Contributor

No, I don't. All I can find is this:

# Tokens with "│" were already featurized by combine_attributes. For anything else,
# add empty feature _ to fill requirement that all tokens have same number of features.
featurized_tokens = [token if u'│' in token else token + u'│_' for token in tokens]

Interestingly, that character is not a regular pipe character |, but a halfwidth vertical bar character for box drawing: https://codepoints.net/U+FFE8. I don't recall writing this code so I cannot provide any more info about its use other than the comments in the code above.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants