Example not working with Spacy version 3.1 and 3.0.6 #101

Atul997 · 2021-09-16T06:55:43Z

I have installed the current spaCy version (3.1) and am running the example with some modifications, but it keeps throwing this error:
ValueError: [E030] Sentence boundaries unset. You can add the 'sentencizer' component to the pipeline with: nlp.add_pipe('sentencizer'). Alternatively, add the dependency parser or sentence recognizer, or set sentence boundaries by setting doc[i].is_sent_start.

Below is the code that I am using:
    import spacy
    import pysbd
    from spacy.language import Language

    @Language.component("sbd")
    def pysbd_sentence_boundaries(doc):
        seg = pysbd.Segmenter(language="en", clean=False, char_span=True)
        sents_char_spans = seg.segment(doc.text)
        char_spans = [doc.char_span(sent_span.start, sent_span.end, alignment_mode='contract')
                      for sent_span in sents_char_spans]
        start_token_ids = [span[0].idx for span in char_spans if span is not None]
        for token in doc:
            token.is_sent_start = True if token.idx in start_token_ids else False
        return doc

    if __name__ == "__main__":
        text = "My name is Jonas E. Smith. Please turn to p.55."
        nlp = spacy.blank('en')
        doc = nlp(text)
        # add as a spacy pipeline
        nlp.add_pipe('sbd')
        print('sent_id', 'sentence', sep='\t|\t')
        for sent_id, sent in enumerate(doc.sents, start=1):
            print(sent_id, sent.text, sep='\t|\t')

gserapio · 2022-06-28T13:13:11Z

Facing the same issue

alexhamiltonRN · 2022-07-03T18:36:33Z

@gserapio - You might find this example from medspacy helpful https://github.com/medspacy/medspacy/blob/master/medspacy/sentence_splitting.py

gserapio · 2022-07-04T15:21:45Z

> @gserapio - You might find this example from medspacy helpful https://github.com/medspacy/medspacy/blob/master/medspacy/sentence_splitting.py

Thanks @alexhamiltonRN !

gserapio mentioned this issue Jun 28, 2022

pySBD - python Sentence Boundary Disambiguation: Example no longer runs explosion/spaCy#11045

Closed
