Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SSAST for embedding model #34

Open
Ironmomo opened this issue May 7, 2024 · 0 comments
Open

SSAST for embedding model #34

Ironmomo opened this issue May 7, 2024 · 0 comments

Comments

@Ironmomo
Copy link

Ironmomo commented May 7, 2024

Hi Yuan,
We're reaching out again regarding our Bachelor Thesis on Speaker Recognition. We're facing a challenge in implementing the SSAST as an embedding model trained on Contrastive Loss, such as Triplet Loss. Since speaker recognition poses an open set problem, where the number of speaker classes isn't predetermined, we need to determine a suitable dimension for the embedding. Additionally, we consider to make adjustments to the multilayer perceptron (MLP) head to accommodate this. During your studies on SSAST do u came up with any insights that could maybe lead to any recommendations for us?
Thanks, Andrin

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant