Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

✨ Bidirectional streaming for regex sentence splitting #346

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

evaline-ju
Copy link
Collaborator

The regex sentence splitter is not a very accurate sentence splitter but we would like to provide an initial implementation of aggregation and splitting for bidirectional streaming use, in the case of streaming text chunks/tokens needing to be aggregated to sentences for further sentence analysis.

For tracking purposes, output streamed sentences remain directly concatenable.

Closes: #345

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Initial bidirectional streaming tokenization on regex sentence splitter
1 participant