Replies: 3 comments 5 replies
-
It's probably hallucinations from the xtts model, It is a transformer based model so I suppose you could like lower its temperature on inference? But I've never dived that deep into using xtts to be honest. Also you could try using STYLETTS2 model instead, it's not quite as good as XTTS but I don't think it has the hallucination issues |
Beta Was this translation helpful? Give feedback.
-
I've seen this too. xtts works beautifully with non fiction books since there's so little quotes, but in fiction hallucinations happen at the end of most quotes. I kind of wonder if xtts is struggling to pivot between what it interprets as different speakers and the narrator. Drew what you've done here is awesome. Really appreciate the tool and I use it quite a lot. Thank you for your work! |
Beta Was this translation helpful? Give feedback.
-
Actually I think I've solved this in my ebook2audiobookxtts repo:
It'll be adding To all lines that have this
I'll probs add this into the new gradio version of VoxNovel I'm working on (once I get it up and working lol)More info can be found here ⬇️ |
Beta Was this translation helpful? Give feedback.
-
Drew, this is working really well thus far. I do, however, get some odd sounds at a few end quotes. I've looked, and tried to remove extra spaces, and make sure all the quotes are proper end quotes. Not sure why it addes in these odd noises from the reader voice in these places. I've seen this with both .txt and .epub files. Any ideas?
Beta Was this translation helpful? Give feedback.
All reactions