-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot handle double byte characters #1
Comments
Thanks for taking the time to open a ticket. Could you please share the bogus As a side note the parsing errors aren't very helpful in |
Hi @brianfreud, do you have any updates on this issue? |
Getting similar issue with a gedcom file generated by heredis
|
Thanks, I'll look into that in a week (currently on vacation). |
@kefniark I'm not sure exactly what went wrong; I should definitely improve these error messages to include the detected encoding and other potentially relevant information. |
No worries, at the end I run into other issues and realize it would probably not work with this library. So at the end, I went another way
|
Thanks for the reply, I understand better. This package is known to work on large files. For instance the file This package was in fact designed to address some of the limitations of |
Using a .ged encoded in UTF8. The double byte character "�" (erroniously) was in the content of a couple of NOTE+CONT tags.
The parser threw an "Invalid format" error for the line and halted processing of the file.
The text was updated successfully, but these errors were encountered: