Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problems with stylized PGS subs on some files #72

Open
outlyer opened this issue Apr 10, 2024 · 1 comment
Open

Problems with stylized PGS subs on some files #72

outlyer opened this issue Apr 10, 2024 · 1 comment

Comments

@outlyer
Copy link

outlyer commented Apr 10, 2024

I've had very good results with most of the files I've converted, but I have noticed the OCR seems to be particularly bad in some situations. I've narrowed this down to the specific way that the PGS subtitles are styled and how they are processed.

With a particularly styled file, the OCR is really inaccurate. I used the --keep-temp-files to see the files, and it looks like the text is inverted and placed on a black background, but the way these particular subtitles are formatted, they show up as a mostly black file.

Here is a normal file:

english srt-1539-psm6-NEURAL-65

and here is an example of the issue:

Blue Collar example

The second example has a border around the font which seems to be the cause of the issues.

@ratoaq2
Copy link
Owner

ratoaq2 commented Jun 22, 2024

Subtitles could have so many different styles. I can't think of a solution that fits all, inclusive this case

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants