Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dealing with words with multiple pronunciations #193

Open
ivancarapinha opened this issue Jul 6, 2020 · 2 comments
Open

Dealing with words with multiple pronunciations #193

ivancarapinha opened this issue Jul 6, 2020 · 2 comments

Comments

@ivancarapinha
Copy link

Hello,

Since this g2p transformer performs phonetic transcription word by word, how does it select the correct pronunciation for a word that has several possible pronunciations? This is very common for many nouns and verbs, for example, the noun "content" and the verb "to content" (to satisfy).

Thank you

@nshmyrev
Copy link
Contributor

nshmyrev commented Jul 6, 2020

It supports n-best output in theory. As for using part of speech as input feature for training, it is also possible, but requires work on model architecture, and, correspondingly, code.

@widdiot
Copy link

widdiot commented Jul 1, 2021

Does that mean , as of now, for training the g2p model, input dictionary should only have 1-best pronunciations?
If not, how to handle multiple pronunciations in the training dictionary?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

3 participants