Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'Voice Conversion' paper candidate 2501.01674 #675

Open
github-actions bot opened this issue Jan 6, 2025 · 0 comments
Open

'Voice Conversion' paper candidate 2501.01674 #675

github-actions bot opened this issue Jan 6, 2025 · 0 comments

Comments

@github-actions
Copy link
Contributor

github-actions bot commented Jan 6, 2025

Please check whether this paper is about 'Voice Conversion' or not.

article info.

  • title: Controlling your Attributes in Voice

  • summary: Attribute control in generative tasks aims to modify personal attributes,
    such as age and gender while preserving the identity information in the source
    sample. Although significant progress has been made in controlling facial
    attributes in image generation, similar approaches for speech generation remain
    largely unexplored. This letter proposes a novel method for controlling speaker
    attributes in speech without parallel data. Our approach consists of two main
    components: a GAN-based speaker representation variational autoencoder that
    extracts speaker identity and attributes from speaker vector, and a two-stage
    voice conversion model that captures the natural expression of speaker
    attributes in speech. Experimental results show that our proposed method not
    only achieves attribute control at the speaker representation level but also
    enables manipulation of the speaker age and gender at the speech level while
    preserving speech quality and speaker identity.

  • id: http://arxiv.org/abs/2501.01674v1

judge

Write [vclab::confirmed] or [vclab::excluded] in comment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

0 participants