Skip `OpenAIWhisperParser` extremely small audio chunks to avoid api error #11450

leodiegues · 2023-10-05T20:19:21Z

Description
This PR addresses a rare issue in OpenAIWhisperParser that causes it to crash when processing an audio file with a duration very close to the class's chunk size threshold of 20 minutes.
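To illustrate how a file just over the 20-minute boundary ends up with a tiny trailing chunk, here is a minimal sketch. It is not the parser's code: `chunk_durations_ms` is a hypothetical helper, and the fixed-stride splitting is an assumption based on the `chunk_duration_ms` variable in the diff.

```python
from typing import List

CHUNK_DURATION_MS = 20 * 60 * 1000  # 20-minute chunks, as in the diff


def chunk_durations_ms(total_ms: int, chunk_ms: int = CHUNK_DURATION_MS) -> List[int]:
    """Return each chunk's duration when audio is split every chunk_ms (hypothetical helper)."""
    return [
        min(chunk_ms, total_ms - start)  # the last chunk holds whatever remains
        for start in range(0, total_ms, chunk_ms)
    ]


# A file 50 ms longer than 20 minutes leaves a 50 ms trailing chunk,
# shorter than the 0.1 s minimum quoted in the review thread.
durations = chunk_durations_ms(CHUNK_DURATION_MS + 50)
```

Under these assumptions the second chunk is the one the API would reject.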

Issue
#11449

Dependencies
None

Tag maintainer
@agola11 @eyurtsev

Twitter handle
leonardodiegues

leodiegues · 2023-10-09T13:52:36Z

@agola11 @eyurtsev

baskaryan · 2024-02-14T21:13:53Z

Apologies for the slow review! The PR has some merge conflicts; happy to re-review if you'd like to resolve them.

leodiegues · 2024-02-15T13:51:17Z

> Apologies for the slow review! The PR has some merge conflicts; happy to re-review if you'd like to resolve them.

No problem! Will resolve conflicts this weekend

leodiegues · 2024-02-17T20:37:21Z

@baskaryan, I believe everything is in order now.

baskaryan · 2024-02-19T17:38:22Z

libs/community/langchain_community/document_loaders/parsers/audio.py

@@ -52,11 +52,15 @@ def lazy_parse(self, blob: Blob) -> Iterator[Document]:
        # Need to meet 25MB size limit for Whisper API
        chunk_duration = 20
        chunk_duration_ms = chunk_duration * 60 * 1000
+        chunk_duration_threshold = 0.1


Can we make this a configurable param with a default value of None, so that the default behavior doesn't change?

leodiegues · Feb 19, 2024

Does it make sense to make this param configurable? chunk_duration_threshold only reflects the OpenAI API's minimum audio duration, which is 0.1s:

Attempt 1 failed. Exception: Audio file is too short. Minimum audio length is 0.1 seconds.

baskaryan · Feb 19, 2024

Ah, I see. Perhaps it's still worth making configurable (with a default value of 0.1) for future-proofing, i.e. in case the OpenAI API is updated some time in the future to accept shorter lengths?

baskaryan · Feb 19, 2024

Also, could we add a comment somewhere explaining why the threshold is needed?

leodiegues · Feb 20, 2024

I made the changes you suggested and thought more about the future-proofing idea; you're right about that. I also added the comments you mentioned to the class docstring. If you'd rather have them inside the code, just let me know and I'll fix it.
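The outcome of the thread above can be sketched roughly as follows. This is an illustration under stated assumptions, not the merged implementation: `iter_transcribable_chunks` is a hypothetical function, it works on millisecond offsets instead of real audio, and the strict `<` comparison is an assumption drawn from the error message saying the minimum length is 0.1 seconds.

```python
from typing import Iterator, Tuple


def iter_transcribable_chunks(
    total_ms: int,
    chunk_duration_ms: int = 20 * 60 * 1000,
    chunk_duration_threshold: float = 0.1,  # seconds; configurable, default from the API error
) -> Iterator[Tuple[int, int]]:
    """Yield (start_ms, end_ms) for every chunk long enough to send (hypothetical helper)."""
    for start in range(0, total_ms, chunk_duration_ms):
        end = min(start + chunk_duration_ms, total_ms)
        duration_s = (end - start) / 1000
        if duration_s < chunk_duration_threshold:
            # Too short for the API: skip the chunk instead of letting the request fail.
            continue
        yield start, end


# 20 minutes plus 50 ms: the 50 ms remainder is skipped.
near_limit = list(iter_transcribable_chunks(20 * 60 * 1000 + 50))
# 20 minutes plus 5 s: the remainder is long enough and is kept.
longer = list(iter_transcribable_chunks(20 * 60 * 1000 + 5000))
```

Exposing the threshold as a parameter with a default of 0.1 keeps the default behavior tied to the current API limit while letting callers lower it if the limit ever changes.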

dosubot bot added Ɑ: doc loader Related to document loader module (not documentation) 🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature labels Oct 5, 2023

leodiegues changed the title ~~Skip OpenAIWhisperParser extremely small audio chunks to avoid api …~~ Skip OpenAIWhisperParser extremely small audio chunks to avoid api error Oct 5, 2023

hwchase17 closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

leodiegues closed this Feb 17, 2024

leodiegues force-pushed the fix-whisper-chunk-error branch from 373456c to d7c26c8 Compare February 17, 2024 20:28

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Feb 17, 2024

Add skip for short audio chunks in OpenAIWhisperParser

26cb778

leodiegues reopened this Feb 17, 2024

baskaryan reviewed Feb 19, 2024

Leo Diegues and others added 2 commits February 20, 2024 06:13

Merge branch 'master' into fix-whisper-chunk-error

9713b0b

Refactor OpenAIWhisperParser constructor to include chunk_duration_threshold parameter

2443592

dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Feb 20, 2024

Leo Diegues and others added 2 commits February 20, 2024 13:53

Merge branch 'master' into fix-whisper-chunk-error

2be85d8

fmt

15be07f

baskaryan approved these changes Feb 20, 2024

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Feb 20, 2024

fmt

a6c2d10

baskaryan merged commit b15fccb into langchain-ai:master Feb 23, 2024
58 checks passed

leodiegues deleted the fix-whisper-chunk-error branch February 23, 2024 18:07

leodiegues mentioned this pull request Feb 23, 2024

OpenAIWhisperParser raises error if audio has duration too close to the chunk limit #11449

Closed

14 tasks
