You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am exploring ways to extract raw text from Parsr output. Options i understood:
Extract entire raw text and then split content based on line break of 10 lines. I have observed that between pages, there are 10 new lines. The approach has advantage in terms of easy processing but not sure how this would behave if multiple contiguous page doesn't have any text content.
Extract json content and then based on json schema, create page based raw text. This approach should work, but involves processing json.
Kindly recommend which options should i choose? For option 2, if you have any documentation on how to convert json back to raw text. That would be great. I am new to library, let me know if i am missing any other obvious option.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hey,
Awesome library. Absolutely loving it.
I am exploring ways to extract raw text from Parsr output. Options i understood:
Kindly recommend which options should i choose? For option 2, if you have any documentation on how to convert json back to raw text. That would be great. I am new to library, let me know if i am missing any other obvious option.
Regards,
Suraj
Beta Was this translation helpful? Give feedback.
All reactions