Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Voice Conversion' paper candidate 2501.04416 #677

Open
github-actions bot opened this issue Jan 9, 2025 · 0 comments
Open

'Voice Conversion' paper candidate 2501.04416 #677

github-actions bot opened this issue Jan 9, 2025 · 0 comments

Comments

@github-actions
Copy link
Contributor

github-actions bot commented Jan 9, 2025

Please check whether this paper is about 'Voice Conversion' or not.

article info.

  • title: ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training

  • summary: Style voice conversion aims to transform the speaking style of source speech
    into a desired style while keeping the original speaker's identity. However,
    previous style voice conversion approaches primarily focus on well-defined
    domains such as emotional aspects, limiting their practical applications. In
    this study, we present ZSVC, a novel Zero-shot Style Voice Conversion approach
    that utilizes a speech codec and a latent diffusion model with speech prompting
    mechanism to facilitate in-context learning for speaking style conversion. To
    disentangle speaking style and speaker timbre, we introduce information
    bottleneck to filter speaking style in the source speech and employ Uncertainty
    Modeling Adaptive Instance Normalization (UMAdaIN) to perturb the speaker
    timbre in the style prompt. Moreover, we propose a novel adversarial training
    strategy to enhance in-context learning and improve style similarity.
    Experiments conducted on 44,000 hours of speech data demonstrate the superior
    performance of ZSVC in generating speech with diverse speaking styles in
    zero-shot scenarios.

  • id: http://arxiv.org/abs/2501.04416v1

judge

Write [vclab::confirmed] or [vclab::excluded] in comment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants