I have noticed that the speech enhancement model speechbrain/sepformer-wham16k-enhancement, hosted on Hugging Face, breaks with audio clips longer than 3 s. More experiments are required. A workaround may be to split the audio into segments and concatenate their enhanced versions.
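A minimal sketch of that workaround, not tested against the model itself. The helper names `chunk_bounds` and `enhance_in_chunks` are hypothetical, `enhance_fn` stands in for whatever call runs the model on one segment, and the 48000-sample segment length assumes 16 kHz audio cut into 3 s pieces:

```python
from typing import Callable, List, Sequence, Tuple


def chunk_bounds(num_samples: int, chunk_len: int) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices that cover the clip without overlap."""
    if chunk_len <= 0:
        raise ValueError("chunk_len must be positive")
    return [
        (start, min(start + chunk_len, num_samples))
        for start in range(0, num_samples, chunk_len)
    ]


def enhance_in_chunks(
    samples: Sequence[float],
    enhance_fn: Callable[[Sequence[float]], Sequence[float]],
    chunk_len: int = 48000,  # assumption: 3 s at 16 kHz
) -> List[float]:
    """Enhance each segment separately and concatenate the results in order."""
    enhanced: List[float] = []
    for start, end in chunk_bounds(len(samples), chunk_len):
        # enhance_fn is a placeholder for the model call on one segment
        enhanced.extend(enhance_fn(samples[start:end]))
    return enhanced


if __name__ == "__main__":
    # Dummy "enhancer" that halves the amplitude, just to exercise the helper.
    clip = [1.0] * 100000
    out = enhance_in_chunks(clip, lambda seg: [x * 0.5 for x in seg])
    print(len(out), chunk_bounds(len(clip), 48000))
```

With the real model, `enhance_fn` would wrap the inference call on a tensor segment, and the pieces would more naturally be joined with `torch.cat` than with a Python list. Hard cuts at segment boundaries may produce audible discontinuities, so overlapping segments with a cross-fade might be worth trying; I have not verified either.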
Steps to Reproduce
Simply try the speech enhancement tutorial with a long audio clip.
Expected Results
The enhanced audio.
Actual Results
RuntimeError: The size of tensor a (2942) must match the size of tensor b (2500) at non-singleton dimension 1
Additional Notes
No response