Microphone style transfer (MicAugment) #88
Comments
I am curious: how do you plan on using micaugment? Is the idea that we can ship a trained micaugment model as part of the transform?
I haven't thought it through, but yeah, we should ideally have a pretrained model that is ready to be used. The model can be uploaded as a binary in a GitHub release, then downloaded and stored locally on demand (the first time the transform gets used). This approach is inspired by how Keras handles its pretrained ImageNet models. Would https://github.com/akashrajkn/micaugment be suitable?
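To make the download-on-first-use idea concrete, here is a minimal sketch using only the Python standard library. Everything named here is an assumption for illustration: the function name `fetch_pretrained`, the cache directory, and the URL and filename passed in are hypothetical, and nothing like this is confirmed to exist in torch-audiomentations.

```python
import os
import tempfile
import urllib.request
from pathlib import Path


def fetch_pretrained(url, filename, cache_dir=None,
                     downloader=urllib.request.urlretrieve):
    """Return a local path to the model file, downloading it only if missing.

    `url` and `filename` are placeholders for a release asset; the default
    cache location below is an arbitrary choice for this sketch.
    """
    cache_dir = Path(cache_dir or Path.home() / ".cache" / "micaugment_sketch")
    cache_dir.mkdir(parents=True, exist_ok=True)
    target = cache_dir / filename

    if not target.exists():
        # Download to a temporary name first, so an interrupted download
        # never leaves a truncated file at the final path.
        fd, tmp_name = tempfile.mkstemp(dir=cache_dir, suffix=".part")
        os.close(fd)
        try:
            downloader(url, tmp_name)
            os.replace(tmp_name, target)
        except BaseException:
            if os.path.exists(tmp_name):
                os.remove(tmp_name)
            raise

    return target
```

A transform could call this lazily the first time it is applied, so that users who never use the transform never pay for the download. A real version would probably also verify a checksum of the downloaded file.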
I think it is suitable. However, I still have to update the repo with a pretrained model.
It would be awesome if you could make that happen 🤩 But I guess the pretrained model would depend on a specific sample rate, right? Ideally, torch-audiomentations should be compatible with a wide range of sample rates 🤔 Maybe the transform could resample the input to match the sample rate the model was trained on.
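A rough sketch of that resampling idea, assuming a mono NumPy signal: resample the input to the model's rate, run the model, then resample back to the original rate. The value of `MODEL_SAMPLE_RATE` and the names `resample_linear` and `apply_at_model_rate` are hypothetical (the thread does not state which rate the model uses). Linear interpolation is used only to keep the example dependency-light; it is not anti-aliased, so a real implementation would want a proper band-limited resampler.

```python
import numpy as np

# Hypothetical value; the actual rate of the pretrained model is not given.
MODEL_SAMPLE_RATE = 16000


def resample_linear(samples, orig_sr, target_sr):
    """Resample a 1-D signal by linear interpolation (no anti-aliasing)."""
    if orig_sr == target_sr:
        return samples
    n_out = int(round(len(samples) * target_sr / orig_sr))
    old_t = np.arange(len(samples)) / orig_sr
    new_t = np.arange(n_out) / target_sr
    return np.interp(new_t, old_t, samples).astype(samples.dtype)


def apply_at_model_rate(samples, sample_rate, model_fn,
                        model_sr=MODEL_SAMPLE_RATE):
    """Run `model_fn` at the model's sample rate, then return to the input rate."""
    at_model_rate = resample_linear(samples, sample_rate, model_sr)
    processed = model_fn(at_model_rate)
    restored = resample_linear(processed, model_sr, sample_rate)

    # Rounding can shift the length by a sample, so trim or zero-pad
    # to match the input length exactly.
    out = np.zeros(len(samples), dtype=samples.dtype)
    n = min(len(samples), len(restored))
    out[:n] = restored[:n]
    return out
```

One trade-off of this design: any content above half the model's sample rate is lost in the round trip, so 44.1 kHz input processed by a 16 kHz model comes back band-limited.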
À la https://arxiv.org/abs/2010.09658