Confusion between the format of the VGGish features in the TGZ and in the VGGish features created by `featurefy.py` #29

simondurand · 2018-09-18T16:52:34Z

The VGGish features in the TGZ currently have the format:

{"time_points": [...], "features": [[...]]}

The VGGish features obtained with featurefy.py currently have the format:

{time, features, features_z}

However, the "features" key in the TGZ VGGish features does not correspond to the "features" key in the featurefy.py VGGish features, but to its "features_z" key.

This might create confusion when trying to apply a trained model to new audio.
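
To make the mismatch concrete, here is a minimal sketch of a helper that pulls the comparable values out of either layout. It assumes both formats load as plain Python dicts with exactly the keys listed above; the function name `matching_features` is hypothetical and is not part of the repository.

```python
def matching_features(record):
    """Return (time_points, features) with "features" in the TGZ sense.

    Assumes `record` is a dict using one of the two key layouts
    described in this issue.
    """
    if "features_z" in record:
        # featurefy.py layout: the TGZ-equivalent values live under "features_z"
        return record["time"], record["features_z"]
    if "time_points" in record:
        # TGZ layout: "features" already holds the values in question
        return record["time_points"], record["features"]
    raise KeyError("unrecognized VGGish feature layout: %s" % sorted(record))
```

Reading `record["features"]` directly would silently return different quantities depending on where the record came from, which is the confusion described here.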

simondurand · 2018-09-19T10:08:36Z

@ejhumphrey I suggest two possible solutions:

1. We modify the keys in either the TGZ or the featurefy.py function so that the same things are named the same way.
2. We use a different name in the TGZ that can't be confused with what is produced by featurefy.py.

I prefer 1): someone joining the project who wants to train a model on the TGZ data and then apply it to new audio files with featurefy.py will find it easier if the same things share the same key.
I would even suggest that the output of featurefy.py be {time_points, features}, with "time_points" corresponding to what is currently "time" and "features" corresponding to what is currently "features_z". We probably do not need 3 fields, and this way we only have a 1-2 line code change.
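
A rough sketch of that renaming, assuming the current featurefy.py output is available as a dict with the "time", "features" and "features_z" keys. The function name `to_proposed_layout` is hypothetical, and the actual change inside featurefy.py may look different.

```python
def to_proposed_layout(featurefy_output):
    """Map the current three-field output onto the proposed two-field layout."""
    return {
        # "time" is renamed to match the TGZ key
        "time_points": featurefy_output["time"],
        # "features_z" becomes "features"; the old "features" field is dropped
        "features": featurefy_output["features_z"],
    }
```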

simondurand assigned ejhumphrey Sep 18, 2018

ejhumphrey mentioned this issue Sep 19, 2018

added a little helper function to vggish to get features from waveforms #31

Merged
