
Add sentence tokenization to process longer texts. #71

Open · wants to merge 1 commit into base: dev

Conversation

askonivala

BERT's supported sequence length is at most 512 tokens. Adding simple sentence tokenization to the API would enable users to process longer texts.
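
A minimal sketch of the idea, assuming NLTK's `sent_tokenize` for the splitting; `predict` below is a hypothetical stand-in for the repo's existing BERT-NER inference call, not an actual function in this codebase:

```python
import nltk

nltk.download("punkt", quiet=True)  # one-time download of the sentence tokenizer models

def ner_long_text(text, predict):
    """Run NER over a text of arbitrary length, one sentence at a time.

    `predict` is a hypothetical stand-in for the existing inference
    call: it takes a string and returns a list of entities.
    """
    entities = []
    for sentence in nltk.sent_tokenize(text):
        # Individual sentences almost always fall well within BERT's
        # 512-token limit, so each can be passed to the model unchanged.
        entities.extend(predict(sentence))
    return entities
```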

@tanmayag78

Is there any other way to handle longer texts? The time complexity is higher with this approach, so it will be inefficient on huge texts. For example, MITIE NER and Stanford NER are more efficient at handling longer texts, though not as accurate as BERT-NER.
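
One way to blunt the per-sentence overhead would be to batch the tokenized sentences so the model runs one forward pass per batch rather than one call per sentence. A rough sketch under that assumption, where `predict_batch` is a hypothetical function that runs the model on a list of sentences at once:

```python
import nltk

def ner_long_text_batched(text, predict_batch, batch_size=32):
    """Sketch of batched inference; `predict_batch` is a hypothetical
    function that runs the model on a list of sentences in one pass."""
    sentences = nltk.sent_tokenize(text)
    entities = []
    for i in range(0, len(sentences), batch_size):
        # One forward pass per batch amortizes the model overhead
        # across many sentences, instead of paying it per sentence.
        entities.extend(predict_batch(sentences[i:i + batch_size]))
    return entities
```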
