Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Words not delimited by newline in Parser.getKeywords() #9

Open
schmamps opened this issue May 7, 2018 · 1 comment
Open

Words not delimited by newline in Parser.getKeywords() #9

schmamps opened this issue May 7, 2018 · 1 comment

Comments

@schmamps
Copy link

schmamps commented May 7, 2018

Source text (snippet):

Nay, never play the brave man, else when you go back home, your own mother
won't know you. But, dear friends and allies, first let us lay our burdens down; 

I'm expecting [...{word: 'mother', count: 1}, {word: 'wont', count: 1}...] but get [...{word: 'motherwont', count: 1}...] instead.

There are possible side effects to fixing the culprit, but Parser.removePunctations() should filter on t.isAlnum() or t.isSpace(), not t.isAlnum() or t == ' '.

edit: having problems with Markdown

@schmamps
Copy link
Author

Tab is another problematic character.

schmamps added a commit to schmamps/OolongT that referenced this issue Jun 3, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant