Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Voice Conversion' paper candidate 2501.10256 #683

Open
github-actions bot opened this issue Jan 20, 2025 · 0 comments
Open

'Voice Conversion' paper candidate 2501.10256 #683

github-actions bot opened this issue Jan 20, 2025 · 0 comments

Comments

@github-actions
Copy link
Contributor

Please check whether this paper is about 'Voice Conversion' or not.

article info.

  • title: Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR

  • summary: Automatic speech recognition (ASR) systems are well known to perform poorly
    on dysarthric speech. Previous works have addressed this by speaking rate
    modification to reduce the mismatch with typical speech. Unfortunately, these
    approaches rely on transcribed speech data to estimate speaking rates and
    phoneme durations, which might not be available for unseen speakers. Therefore,
    we combine unsupervised rhythm and voice conversion methods based on
    self-supervised speech representations to map dysarthric to typical speech. We
    evaluate the outputs with a large ASR model pre-trained on healthy speech
    without further fine-tuning and find that the proposed rhythm conversion
    especially improves performance for speakers of the Torgo corpus with more
    severe cases of dysarthria. Code and audio samples are available at
    https://idiap.github.io/RnV .

  • id: http://arxiv.org/abs/2501.10256v1

judge

Write [vclab::confirmed] or [vclab::excluded] in comment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants