We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WikipediaTokenizer incorrectly splits tokens that have italics/bold inside the token, for instance '''F'''oo is a bold Foo, not F, oo
Migrated from LUCENE-1141 by Grant Ingersoll (@gsingers), updated May 16 2011 Attachments: LUCENE-1141-test.patch
The text was updated successfully, but these errors were encountered:
Grant Ingersoll (@gsingers) (migrated from JIRA)
Here's a test case for the problem
Sorry, something went wrong.
Jens Muecke (@ryd) (migrated from JIRA)
Patch doesn't apply any more.
common.compile-test: [javac] Compiling 1 source file to /home/jens/projects/java/lucene-git/build/contrib/wikipedia/classes/test [javac] /home/jens/projects/java/lucene-git/contrib/wikipedia/src/test/org/apache/lucene/wikipedia/analysis/WikipediaTokenizerTest.java:232: cannot find symbol [javac] Token token = new Token(); [javac] ^ [javac] symbol: class Token [javac] location: class WikipediaTokenizerTest [javac] /home/jens/projects/java/lucene-git/contrib/wikipedia/src/test/org/apache/lucene/wikipedia/analysis/WikipediaTokenizerTest.java:232: cannot find symbol [javac] Token token = new Token(); [javac] ^ [javac] symbol: class Token [javac] location: class WikipediaTokenizerTest [javac] Note: /home/jens/projects/java/lucene-git/contrib/wikipedia/src/test/org/apache/lucene/wikipedia/analysis/WikipediaTokenizerTest.java uses unchecked or unsafe operations. [javac] Note: Recompile with -Xlint:unchecked for details. [javac] 2 errors
BUILD FAILED
It's not by adding the import for the Token class.
gsingers
No branches or pull requests
WikipediaTokenizer incorrectly splits tokens that have italics/bold inside the token, for instance '''F'''oo is a bold Foo, not F, oo
Migrated from LUCENE-1141 by Grant Ingersoll (@gsingers), updated May 16 2011
Attachments: LUCENE-1141-test.patch
The text was updated successfully, but these errors were encountered: