Test baseline on audio stream #107

HeChengHui · 2024-07-11T09:42:44Z

is it possible to run sed baseline in causal mode? i would like to use it on an audio stream to detect certain audio cues in a noisy environment.

popcornell · 2024-07-11T11:41:46Z

Yes but it is trained on 10 seconds chunks and the model is not causal.
You would need to use 10 seconds windows and advance by a certain stride each time.
Latency of the system will depends on the post processing you will do. E.g. if you do overlap add it will still be 10 seconds, if you only take for granted the prediction on the new stride region then it is equal to the stride.

HeChengHui · 2024-07-11T12:37:08Z

@popcornell
i see. is there any code to reference to build this pipeline?

popcornell · 2024-07-11T13:38:02Z

Not really, but in

DESED_task/recipes/dcase2024_task4_baseline/local/sed_trainer_pretrained.py

Line 1457 in c6bcb45

def _get_segment_scores(scores_df, clip_length, segment_length=1.0):

we reconstruct long-form predictions from windowed predictions

HeChengHui · 2024-07-12T02:49:25Z

@popcornell
thank you. can i check if SED is the correct task to look into for online detection of audio cues in a noisy environment?

popcornell · 2024-07-12T10:02:29Z

What do you mean by audio cues ?

HeChengHui · 2024-07-12T10:11:16Z

Like alarms

popcornell · 2024-07-13T10:52:36Z

Then yeah

HeChengHui · 2024-07-17T05:58:45Z

If i want to train my own model based on the 2024 task, looks like i can use the pretrained baseline and pre-compute embeddings of my dataset as base.
Then if i want to inference on a video clip or 10s audio, am i supposed to also use this? :

DESED_task/recipes/dcase2024_task4_baseline/local/sed_trainer_pretrained.py

Line 1457 in c6bcb45

def _get_segment_scores(scores_df, clip_length, segment_length=1.0):

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test baseline on audio stream #107

Test baseline on audio stream #107

HeChengHui commented Jul 11, 2024

popcornell commented Jul 11, 2024

HeChengHui commented Jul 11, 2024

popcornell commented Jul 11, 2024

HeChengHui commented Jul 12, 2024

popcornell commented Jul 12, 2024

HeChengHui commented Jul 12, 2024

popcornell commented Jul 13, 2024

HeChengHui commented Jul 17, 2024

Test baseline on audio stream #107

Test baseline on audio stream #107

Comments

HeChengHui commented Jul 11, 2024

popcornell commented Jul 11, 2024

HeChengHui commented Jul 11, 2024

popcornell commented Jul 11, 2024

HeChengHui commented Jul 12, 2024

popcornell commented Jul 12, 2024

HeChengHui commented Jul 12, 2024

popcornell commented Jul 13, 2024

HeChengHui commented Jul 17, 2024