I recently came across this paper on arXiv and then found the implementation here. Great work! Could you give any guidance on hyperparameter selection, especially for the alpha parameter? I don't see it mentioned in the original paper, and it seems to act like a regularization strength: small values tend to force all mixture components to be the same, while larger values allow for different means/variances. Do you have any further insights into how to set that parameter?