Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test baseline on audio stream #107

Open
HeChengHui opened this issue Jul 11, 2024 · 8 comments
Open

Test baseline on audio stream #107

HeChengHui opened this issue Jul 11, 2024 · 8 comments

Comments

@HeChengHui
Copy link

is it possible to run sed baseline in causal mode? i would like to use it on an audio stream to detect certain audio cues in a noisy environment.

@popcornell
Copy link
Collaborator

Yes but it is trained on 10 seconds chunks and the model is not causal.
You would need to use 10 seconds windows and advance by a certain stride each time.
Latency of the system will depends on the post processing you will do. E.g. if you do overlap add it will still be 10 seconds, if you only take for granted the prediction on the new stride region then it is equal to the stride.

@HeChengHui
Copy link
Author

@popcornell
i see. is there any code to reference to build this pipeline?

@popcornell
Copy link
Collaborator

Not really, but in

def _get_segment_scores(scores_df, clip_length, segment_length=1.0):
we reconstruct long-form predictions from windowed predictions

@HeChengHui
Copy link
Author

@popcornell
thank you. can i check if SED is the correct task to look into for online detection of audio cues in a noisy environment?

@popcornell
Copy link
Collaborator

What do you mean by audio cues ?

@HeChengHui
Copy link
Author

Like alarms

@popcornell
Copy link
Collaborator

Then yeah

@HeChengHui
Copy link
Author

If i want to train my own model based on the 2024 task, looks like i can use the pretrained baseline and pre-compute embeddings of my dataset as base.
Then if i want to inference on a video clip or 10s audio, am i supposed to also use this? :

def _get_segment_scores(scores_df, clip_length, segment_length=1.0):

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants