From ab3440c996616fee1f4a9934fe14215e8a02be19 Mon Sep 17 00:00:00 2001
From: kkoutini <51880718+kkoutini@users.noreply.github.com>
Date: Tue, 7 Jun 2022 15:15:13 +0200
Subject: [PATCH] update readme #1

---
 README.md | 13 ++++++++++++-
 1 file changed, 12 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index bfea4e2..23acf3f 100644
--- a/README.md
+++ b/README.md
@@ -37,4 +37,15 @@ embed, time_stamps = get_timestamp_embeddings(audio, model)
 print(embed.shape)
 embed = get_scene_embeddings(audio, model)
 print(embed.shape)
-```
\ No newline at end of file
+```
+
+# Getting the Logits/Class labels
+
+You can get the logits (before the sigmoid activation) for the 527 classes of AudioSet:
+```python
+from hear21passt.base import load_model
+
+model = load_model(mode="logits").cuda()
+logits = model(wave_signal)
+```
+The class label indices can be found [here](https://github.com/qiuqiangkong/audioset_tagging_cnn/blob/master/metadata/class_labels_indices.csv)
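
Note on the added README section: since the model returns logits before the sigmoid, a reader will usually want to turn them into per-class probabilities and look up the class names. The following is a minimal sketch of that step, not part of the patch. It assumes the linked CSV has the columns `index,mid,display_name`; `SAMPLE_CSV`, `load_labels`, `top_k`, and the logit values are hypothetical stand-ins so the example runs without the model or the real file.

```python
import csv
import io
import math

# Assumed layout of class_labels_indices.csv (index,mid,display_name).
# Only three illustrative rows are included here; the real file has 527.
SAMPLE_CSV = '''index,mid,display_name
0,/m/09x0r,"Speech"
1,/m/05zppz,"Male speech, man speaking"
2,/m/02zsn,"Female speech, woman speaking"
'''


def load_labels(csv_text):
    """Return display names in row order, so position i matches logit i."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row["display_name"] for row in reader]


def sigmoid(x):
    """Map a single logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def top_k(logits, labels, k=2):
    """Pair each label with its probability and keep the k most likely."""
    probs = [sigmoid(v) for v in logits]
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


labels = load_labels(SAMPLE_CSV)
example_logits = [2.0, -1.0, 0.0]  # made-up logits for one clip, one per class
result = top_k(example_logits, labels)
for name, prob in result:
    print(f"{name}: {prob:.3f}")
```

The classes are treated independently (one sigmoid per class rather than a softmax over all of them), which matches the README's description of the logits as pre-sigmoid values. With the real model, the same step would presumably be applied to each row of the returned tensor, for example by calling `.sigmoid()` on it if it is a PyTorch tensor.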