GOOGLE OCR: current situation #685

bropines · 2024-12-17T16:42:25Z

I @bropines wrote a plugin for Google ocr.

Lately it may not work, producing “text not found” or 303 errors.
In case of an error, the text was not found, try going to the google.com page and passing it an image to the search area. If you are moved to this (screenshot), then the problem is in the EU region. Use proxies of CIS countries or any other except EU.

ScreenShot

If your page is significantly different from my example, then there was an update in your region and Google changed the Endpoints. This happens 3 times. I will find a solution if suddenly the update becomes final.

If you have a 303 error, UPDATE PROGRAMM. How? Look this GOOGLE OCR: current situation #685 (comment)

I'm writing a plugin that connects to Google's paid API, and unfortunately, in many regions it's not even possible to register with it. The plugin will be for those who have somehow gained access and a key from google vision.

30.01.25 UPD:

I know that some people will not be able to use Lens OCR. Fixed #757

vanderalex · 2024-12-17T17:30:59Z

If you have a 303 error, increase delay to 1.5.

With delay 30!! I still have sometimes 303 error...

bropines · 2024-12-17T17:37:58Z

If you have a 303 error, increase delay to 1.5.

With 30!! I still have sometimes 303 error...

chrome-lens-py

Here is my library. Make a bunch of images (40 pieces) and run the folder through this library. If everything goes well with it, I will add updated COOKIES. A 303 error literally means that Google is sending you somewhere. And there are two reasons

Google is moving you to a new interface due to an update.
You live in the EU and Google forcibly asks you for “COOKIES”.

You can check with this command:

lens_scan <folder> full_text_default --debug=debug

vanderalex · 2024-12-17T17:44:52Z

I just want to say what is simple way to solve this problem 303 - rerun this OCRed picture again (and again if you got 303 in second time). I try to do it manually and it works.

bropines · 2024-12-17T17:47:07Z

I just want to say what is simple way to solve this problem 303 - rerun this OCRed picture again (and again if you got 303 in second time). I try to do it manually and it works.

Do the test as I described above. If it goes smoothly, I'll rewrite the plugin to make it work correctly. I don't yet understand what causes this error.

vanderalex · 2024-12-17T17:51:00Z

I just want to say what is simple way to solve this problem 303 - rerun this OCRed picture again (and again if you got 303 in second time). I try to do it manually and it works.

Do the test as I described above. If it goes smoothly, I'll rewrite the plugin to make it work correctly. I don't yet understand what causes this error.

Sorry, but it's too late in my country (KZ). Tomorrow I'm going to test it.

bropines · 2024-12-17T18:19:53Z

I just want to say what is simple way to solve this problem 303 - rerun this OCRed picture again (and again if you got 303 in second time). I try to do it manually and it works.

Do the test as I described above. If it goes smoothly, I'll rewrite the plugin to make it work correctly. I don't yet understand what causes this error.

Sorry, but it's too late in my country (KZ). Tomorrow I'm going to test it.

We could have spoken Russian

vanderalex · 2024-12-17T19:19:55Z

We could have spoken Russian

На 30 тестовых страницах из 30 комиксов вывел результаты для всех 30 страниц, но насколько он там все или не все распознал, или часть баллонов пропустил - я сверять не могу, у меня нет столько времени. Но по крайней мере все 30 страниц есть результат в виде текста
log:
https://pastebin.com/aZ7xMbmm

bropines · 2024-12-18T00:33:26Z

Offtop:

Во первых. Я тебя ща ушатаю за то что ты лог загрузил так. Лучше грузи такое длинное на pastebin или хотябы в код блок(перезалей, а то листать не удобно, мыж тут не одни в теме). На крайняк загрузи сюда https://gist.github.com/ и замени на ссылку. А как файл сделай debug.bash
Во вторых, я примерно понял почему. Завтра после универа сделаю фикс. Посмотрим. Если гугл не учудит что-то снова.

vanderalex · 2024-12-18T07:03:51Z

Завтра после универа сделаю фикс.

Offtop. 2 вопроса.

У меня гугл OCR часто (1-5 раз на страницу) распознает часть текста другим языком. То есть если страница например на английском, то иногда в предложении одно-два слова распознается другим языком, например на русском или на греческом. Естественно это рушит весь смысл и приходится руками править. Например OCR выдает : On va se changer...Ca c'est pour τοι... И тут последнее слово - это вообще греческие буквы, а должно быть toi. И тако происходит постоянно, прям среди предложения бумс и левое слово. По написанию то оно похоже, но с другого языка, другими символами. Нельзя ли при распознавании явно задать язык перевода, чтобы не было таких багов?
Дружищще, подскажи, мне часто нужен такой функционал - при распознавании очень часто теряются диакритические знаки у символов. Все эти á È ê Ç и так далее, зависит от исходного языка. Проблема в том, что без этих знаков часто становится совершенно другой смысл, и получается ты много времени проводишь в правке исходного текста, заменяя в некоторых случаях а на á, Е на È и так далее. И мне очень не хватает на правой кнопке мыши в поле текстового ввода минитабличку быстрого ввода определенных символов. То есть смысл такой - на правой кнопке есть новый пункт - добавить символ. В нем заранее определенные пара десятков символов. Появляется необходимость ввести какой-то нестандартный символ - нажал правую кнопку, выбрал, символ появился в тексте на позиции курсора. Надеюсь понятно объяснил. Возможно, тебе будет интересно добавить такой функционал?

bropines · 2024-12-18T10:47:57Z

Нельзя ли при распознавании явно задать язык перевода, чтобы не было таких багов?

Нет нельзя. Поясняю точнее. Текст детектится и распознается гуглом. У интерфейса GOOGLE lens тупо нет варианта выбора языка. Я как решение могу сделать заглушки. Что имеется в виду. Ты выбираешь бабл, ЯВНО указываешь язык в параметрах плагина. В этот момент мой плагин автоматически с каждой картинкой посылает заглушку гдето сверху над блоком с текстом на нужном языке. Да, костыль. Я могу попробовать это сделать. Так же, есть теория что язык выбирается от страны запроса, но я не понимаю как встроить в куки такое.
Второе решение á можно добавить в

В теории тоже поможет.

Eng:

Is it possible to explicitly set the translation language during recognition so that there are no such bugs?

No, you can't. I'm explaining it more precisely. The text is detected and recognized by Google. The GOOGLE lens interface stupidly has no language selection option. As a solution, I can make stubs. What is meant by that. You select the bubble, EXPLICITLY specify the language in the plugin parameters. At this point, my plugin automatically sends a stub with each picture somewhere above the block with the text in the desired language. Yes, a crutch. I can try to do it. Also, there is a theory that the language is selected from the country of the request, but I do not understand how to embed this in cookies.
The second solution á can be added to

In theory, it will also help.

vanderalex · 2024-12-18T11:38:43Z

Оба решения не помогут.
Потому что гугл то распознает язык бабла нормально. Но вот какое-то одно слово (или даже несколько символов) заменяет. Вряд ли заглушка поможет. Автозамена тоже не вариант, так как иногда заменяется одна буква а иногда друга, не говоря уже что у исходных букв могут быть диакритика... В общем ладно, лучше руками править, так хоть какой-то контроль.

bropines · 2024-12-18T13:58:57Z

Оба решения не помогут. Потому что гугл то распознает язык бабла нормально. Но вот какое-то одно слово (или даже несколько символов) заменяет. Вряд ли заглушка поможет. Автозамена тоже не вариант, так как иногда заменяется одна буква а иногда друга, не говоря уже что у исходных букв могут быть диакритика... В общем ладно, лучше руками править, так хоть какой-то контроль.

Кинь ка мне в тг страницы с аномалиями. Я хочу посмотреть че там гугл засылает. А еще кинь скрин страницы с аномалией. Так же в тг. @bropines

bropines · 2024-12-20T07:41:20Z

Оба решения не помогут. Потому что гугл то распознает язык бабла нормально. Но вот какое-то одно слово (или даже несколько символов) заменяет. Вряд ли заглушка поможет. Автозамена тоже не вариант, так как иногда заменяется одна буква а иногда друга, не говоря уже что у исходных букв могут быть диакритика... В общем ладно, лучше руками править, так хоть какой-то контроль.

303 ошибку починил(теоретически). Как DmMaze проснется, он зальет изменение. Ща сижу потею над фиксом "неверного" языка

I fixed the 303 error (theoretically). As DmMaze wakes up, he will flood the change. Right now I’m sitting here sweating over a fix for the “wrong” language

vanderalex · 2024-12-21T06:29:46Z

Problem 303 is apparently solved, it is no longer present on the tests. The problem with partial recognition of another language remains, if you still need - I can give you test pages, but I have this problem on almost any page of French comics (I mainly use translation from French to English), it's strange that no one else reported this problem. Do you still need me to send test pages to telegram?

bropines · 2024-12-21T07:13:30Z

Problem 303 is apparently solved, it is no longer present on the tests. The problem with partial recognition of another language remains, if you still need - I can give you test pages, but I have this problem on almost any page of French comics (I mainly use translation from French to English), it's strange that no one else reported this problem. Do you still need me to send test pages to telegram?

Yes. Let's. I'll see what's there

bropines · 2024-12-22T01:04:35Z

22.12.24

I'm developing a plugin that connects to Google's paid API. Unfortunately, in many regions, it’s not even possible to register for it. The plugin is intended for those who have somehow managed to get access and an API key for Google Vision.

Based on my tests, it seems the paid version of Cloud Vision is significantly worse compared to what Google Lens provides. My assumption is that the paid version is trained on much more specialized data, such as documents. While it handles English fairly well, it struggles with other languages. For example:

In Japanese, it often ignores the ー character.
In Chinese, it occasionally skips characters altogether.
Korean, however, is recognized reliably.

I initially thought the issues might be caused by specific data included in the API requests. However, the problem remains unresolved because the official documentation at Google Cloud Vision API Reference has been down for several days. This has prevented me from experimenting further, although my current implementation follows Google’s official guidelines.

When it comes to detecting text and text blocks (such as speech bubbles), Google Vision performs on par with CTD, and in some minor cases, even exceeds it. My plan is to integrate Google Vision as a text detection option, but only experimentally. The primary issue is the cost: both detection and recognition are charged under the same pricing model. This means the free monthly limit of 1,000 units will be exhausted quickly. Unfortunately, I’m not yet sure if the plugin can be designed to handle detection and recognition in a single request while outputting the identified text blocks. @dmMaze – is this possible, or would this require modifications to the plugin?

If your page is significantly different from my example, there may have been a regional update where Google changed the endpoints. This has happened three times before. I’ll look for a solution if the update becomes permanent.

From what I’ve seen on forums, Google seems to have rolled back the recent changes—possibly due to user complaints or because they broke something. After the update, Google browsers suddenly stopped utilizing Lens features entirely. I don’t know how long the current method will remain functional, but I’ll provide updates here as soon as something changes.

bropines · 2025-01-09T13:34:30Z

Anyone who has encountered this problem (#712). UPGRADE TO THE LATEST VERSION OF BT.

For google drive users from dmmaze, if everything didn't update automatically at startup, open the console in the program folder and write

.\PortableGit\bin\git.exe pull

For users https://github.com/bropines/Ballon-translator-portable You don't need to do that. In theory, it always checks for updates.

bropines mentioned this issue Dec 17, 2024

Possible problem with GOOGLE OCR #682

Open

dmMaze pinned this issue Dec 17, 2024

bropines mentioned this issue Dec 20, 2024

Update ocr_google_lens.py #689

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GOOGLE OCR: current situation #685

GOOGLE OCR: current situation #685

bropines commented Dec 17, 2024 •

edited

Loading

vanderalex commented Dec 17, 2024 •

edited

Loading

bropines commented Dec 17, 2024 •

edited

Loading

vanderalex commented Dec 17, 2024

bropines commented Dec 17, 2024

vanderalex commented Dec 17, 2024

bropines commented Dec 17, 2024

vanderalex commented Dec 17, 2024 •

edited

Loading

bropines commented Dec 18, 2024 •

edited

Loading

vanderalex commented Dec 18, 2024 •

edited

Loading

bropines commented Dec 18, 2024

vanderalex commented Dec 18, 2024

bropines commented Dec 18, 2024

bropines commented Dec 20, 2024 •

edited

Loading

vanderalex commented Dec 21, 2024

bropines commented Dec 21, 2024

bropines commented Dec 22, 2024

bropines commented Jan 9, 2025

GOOGLE OCR: current situation #685

GOOGLE OCR: current situation #685

Comments

bropines commented Dec 17, 2024 • edited Loading

30.01.25 UPD:

vanderalex commented Dec 17, 2024 • edited Loading

bropines commented Dec 17, 2024 • edited Loading

vanderalex commented Dec 17, 2024

bropines commented Dec 17, 2024

vanderalex commented Dec 17, 2024

bropines commented Dec 17, 2024

vanderalex commented Dec 17, 2024 • edited Loading

bropines commented Dec 18, 2024 • edited Loading

vanderalex commented Dec 18, 2024 • edited Loading

bropines commented Dec 18, 2024

vanderalex commented Dec 18, 2024

bropines commented Dec 18, 2024

bropines commented Dec 20, 2024 • edited Loading

vanderalex commented Dec 21, 2024

bropines commented Dec 21, 2024

bropines commented Dec 22, 2024

bropines commented Jan 9, 2025

bropines commented Dec 17, 2024 •

edited

Loading

vanderalex commented Dec 17, 2024 •

edited

Loading

bropines commented Dec 17, 2024 •

edited

Loading

vanderalex commented Dec 17, 2024 •

edited

Loading

bropines commented Dec 18, 2024 •

edited

Loading

vanderalex commented Dec 18, 2024 •

edited

Loading

bropines commented Dec 20, 2024 •

edited

Loading