Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Erroneous translation DE->en-GB and en-US #4

Open
boeryepes69 opened this issue Jul 24, 2023 · 3 comments
Open

Erroneous translation DE->en-GB and en-US #4

boeryepes69 opened this issue Jul 24, 2023 · 3 comments
Labels
bug Something isn't working

Comments

@boeryepes69
Copy link

boeryepes69 commented Jul 24, 2023

Describe the bug
A German contract document (PDF) contains the words AUFTRAGGEBER (Client) and AUFTRAGNEHMER (provider). These 2 German words are both translated to CONTRACTOR or both to CUSTOMER, effectively corrupting the resulting document.

Adding the correct translation to a glossary does not change the behaviour.
Testing a empty Word document with only these 2 words and a glossary does translate correctly.

To Reproduce
Steps to reproduce the behavior:
Have a PDF document with these 2 words in amongst many other text
Post it to the API with or without a glossary
The result is as described above

Expected behavior
Both words should not be translated the same AND the glossary should enforce the right translation.

Screenshots or sample code
Original image
Translated image

** Additional context **
Selecting the one sentence and translating with the desktop app gives the exact same bad result.

@daniel-jones-deepl
Copy link
Member

Hi @boeryepes69, thank you for reporting the issue.

I could reproduce it, and you are right: this behavior is unexpected. I'll forward it to the relevant team and ask if there is a solution.

@daniel-jones-deepl daniel-jones-deepl added the bug Something isn't working label Jul 25, 2023
@JanEbbing
Copy link
Member

Thanks for the report! In case you can change the document, avoiding the upper casing fixes this for me:

"Jegliche Verlagerung von Betriebsaktivitäten des Auftragnehmers im Bezug auf Services erfolgt anhand eines vom Auftragnehmer erstellten und Auftraggeber genehmigten, formellen Migrationsplans für die ordnungsgemäße TRANSITION und SERVICE-MIGRATION sowie ununterbrochenen Service."

gets translated to

"Any relocation of the Contractor's operations with respect to Services shall be based on a formal migration plan for proper TRANSITION and SERVICE MIGRATION and uninterrupted service prepared by the Contractor and approved by the Client."

We'll check why glossaries don't work for this and if we can fix the translation problem itself.

@boeryepes69
Copy link
Author

i confirm JanEbbing's observation that without capitals, the translation works. It's really weird that capitals make such a difference.

The glossaries not working is interesting and I believe issue DeepLcom/deepl-dotnet#16 is related (it's the same me but I lost access to the other email when I switched providers ...).

I can also confirm that when I pass the text to the DeepL API, the correct GlossaryId is included in the TextTranslateOptions object.

@JanEbbing JanEbbing transferred this issue from DeepLcom/deepl-dotnet Sep 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants