diff --git a/README.md b/README.md index bfea4e2..23acf3f 100644 --- a/README.md +++ b/README.md @@ -37,4 +37,15 @@ embed, time_stamps = get_timestamp_embeddings(audio, model) print(embed.shape) embed = get_scene_embeddings(audio, model) print(embed.shape) -``` \ No newline at end of file +``` + +# Getting the Loggits/Class labels + +You can get the logits (before the sigmoid activation) for the 527 classes of audioset: +```pyton +from hear21passt.base import load_model + +model = load_model(mode="logits").cuda() +logits = model(wave_signal) +``` +The class labels indices can be found [here](https://github.com/qiuqiangkong/audioset_tagging_cnn/blob/master/metadata/class_labels_indices.csv)