You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
Separable convolutions is a trick described in the paper of QuartzNet. Shortly, it uses less parameters achieving pretty same results (so it makes the model smaller and faster for on-device inference)
As far as I remember, it can be unclear in the paper about the blocks where sepconvs are used. But we have tried to fully reproduce the paper and the number of the parameters of the model is known. If I remember correctly we tried using sepconvs everywhere to get the same number of the parameters as described in the paper and it worked.
Thank you for sharing your great work.
I noticed you have used
sepconv_bn
in C1 and C2 instead ofconv_bn_act
.Is it on purpose? Does it give better results?
QuartzNet-ASR-pytorch/model.py
Line 49 in ec6073e
The text was updated successfully, but these errors were encountered: