Add tutorial for multilabel training #254

PariaValizadeh · 2024-07-24T14:07:43Z

This PR adds a tutorial notebook that guides practitioners through training a ResNet model for multilabel classification on the HSN dataset.

Add notebook
merge main
check that it is working

raphaelschwinger

Hey @PariaValizadeh,
Thanks for your work. To get the most out of it for our readers, I would suggest the following changes:

Load the data with our BirdSetDataModule without any preprocessing so that it outputs batched waveforms
Pick the first sample of the batch from both the train and the test set (this will show the difference between focal recordings and soundscapes)
Make the sounds audible
Now add preprocessing to train an EfficientNet (EfficientNet because we can upload a pretrained model for it, and it is already included in BirdSet). Make sure to use the same preprocessing config as in configs/experiment/birdset_neurips24/HSN/DT/efficientnet.yaml
Visualize the first sample (now a spectrogram)
Train the model (with same configs) and add the option to download the pretrained model
Run test with trainer.test()
Run model on previously selected test sample
Print the predicted classes and the target class.
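
For the last step, a minimal sketch of how the printout could look, assuming the model returns one logit per class and the target is a multi-hot vector. The class names, the logits, and the 0.5 threshold are made up for illustration and are not taken from the notebook or from BirdSet:

```python
import math


def sigmoid(x: float) -> float:
    """Map a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_multilabel(logits, class_names, threshold=0.5):
    """Return the names of all classes whose probability reaches the threshold.

    In multilabel classification every class is scored independently,
    so zero, one, or several classes can be predicted for one sample.
    """
    probs = [sigmoid(z) for z in logits]
    return [name for name, p in zip(class_names, probs) if p >= threshold]


def decode_targets(multi_hot, class_names):
    """Return the names of all classes marked with 1 in a multi-hot target."""
    return [name for name, t in zip(class_names, multi_hot) if t == 1]


# Hypothetical values standing in for one test sample.
class_names = ["species_a", "species_b", "species_c"]
logits = [2.0, -1.5, 0.3]
targets = [1, 0, 0]

predicted = decode_multilabel(logits, class_names)
expected = decode_targets(targets, class_names)
print("predicted:", predicted)
print("target:   ", expected)
```

In the notebook the logits would come from the trained model and the class names from the dataset's label mapping. The threshold is a free parameter that may need tuning.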

raphaelschwinger · 2024-07-25T09:42:38Z

notebooks/tutorials/additional_tutorials/Multilabel_ResNet_on_HSN.ipynb

+   "source": [
+    "## load the test dataset\n",
+    "from datasets import load_dataset\n",
+    "hsn_test = load_dataset(\"DBD-research-group/BirdSet\",\"HSN\", split=\"test\")"


@PariaValizadeh This will not use the cached version and therefore requires a complete redownload.
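
One possible way to avoid the redownload is to pass an explicit `cache_dir` to `load_dataset`, pointing at the directory that already holds the data. This is only a sketch: the path below is a placeholder, and whether the cache is actually reused depends on the rest of the notebook having written to the same directory with the same dataset configuration.

```python
from pathlib import Path

# Assumption: a placeholder cache location. Replace it with the directory
# the rest of the notebook already uses for the BirdSet data.
CACHE_DIR = Path("data_birdset") / "HSN"


def hsn_test_kwargs(cache_dir=CACHE_DIR):
    """Build the load_dataset arguments, including an explicit cache_dir."""
    return {
        "path": "DBD-research-group/BirdSet",
        "name": "HSN",
        "split": "test",
        "cache_dir": str(cache_dir),
    }


def load_hsn_test(cache_dir=CACHE_DIR):
    """Load the HSN test split, looking in cache_dir before downloading."""
    # Imported lazily so the helper above can be inspected without the library.
    from datasets import load_dataset

    return load_dataset(**hsn_test_kwargs(cache_dir))
```

Calling `load_hsn_test()` then looks in `CACHE_DIR` first instead of the default Hugging Face cache location.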

raphaelschwinger · 2024-07-25T10:27:58Z

notebooks/tutorials/additional_tutorials/Multilabel_ResNet_on_HSN.ipynb

+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Step 6: Visualization"


@PariaValizadeh Could you move the visualization before the model training, so that the reader sees how the data gets preprocessed by the BirdSet pipeline?

PariaValizadeh added 3 commits July 24, 2024 14:02

Add tutorial notebook

eaf2971

rename notebook

41e022c

checkcodeworking

fffa134

PariaValizadeh marked this pull request as ready for review July 25, 2024 06:53

PariaValizadeh changed the title ~~WIP: Add tutorial for multilabel training~~ Add tutorial for multilabel training Jul 25, 2024

PariaValizadeh marked this pull request as draft July 25, 2024 06:54

PariaValizadeh requested a review from raphaelschwinger July 25, 2024 06:54

PariaValizadeh marked this pull request as ready for review July 25, 2024 06:54

set cache directory + small language fixes

854f328

raphaelschwinger requested changes Jul 25, 2024
