Community: added audio-parser RemoteFasterWhisperParser in audio.py #26638

lfenzo · 2024-09-18T22:44:19Z

Description

This PR introduces a new audio parser to LangChain that allows interaction with Faster Whisper running on a remote server. It addresses a significant issue with the current Faster Whisper integration, which requires the LangChain process to have all necessary GPU drivers and dependencies installed locally. In microservice-based architectures, this puts heavy and redundant dependencies into the LangChain container, making deployment more complex and resource-intensive.

Solution

This PR solves this issue by introducing a remote audio parser that communicates with faster-whisper-server running in a dedicated container. LangChain can then delegate audio transcription to this remote server, which eliminates the need for GPU dependencies within the LangChain container itself and reduces its dependency footprint.

The implementation relies on Python's built-in subprocess module to handle communication with the remote server, so no extra dependencies are introduced in LangChain.
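For readers without access to the diff, the following is a minimal sketch of what a subprocess-based call to a remote transcription server could look like. It is not the code from this PR. It assumes that `curl` is available on the host, that the server exposes an OpenAI-style `/v1/audio/transcriptions` endpoint, and that the response is JSON with a `text` field. The function names, parameters, and URL are hypothetical.

```python
import json
import subprocess
from typing import List


def build_transcription_command(audio_path: str, base_url: str, model: str) -> List[str]:
    """Build a curl argument list that uploads an audio file for transcription.

    Hypothetical helper: the endpoint path and form field names are assumptions.
    The command is a list, so it is executed without a shell.
    """
    return [
        "curl",
        "--silent",
        "--show-error",
        "--fail",  # make curl exit non-zero on HTTP errors
        f"{base_url.rstrip('/')}/v1/audio/transcriptions",
        "-F", f"file=@{audio_path}",  # multipart upload of the audio file
        "-F", f"model={model}",
    ]


def transcribe_remote(
    audio_path: str, base_url: str, model: str, timeout: float = 300.0
) -> str:
    """Send the file to the remote server and return the transcribed text.

    Raises:
        RuntimeError: If the transcription process fails or times out.
    """
    command = build_transcription_command(audio_path, base_url, model)
    process = subprocess.Popen(
        command,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        text=True,
    )
    try:
        stdout, stderr = process.communicate(timeout=timeout)
    except subprocess.TimeoutExpired:
        process.kill()
        process.communicate()  # reap the killed process
        raise RuntimeError("Transcription timed out")
    if process.returncode != 0:
        raise RuntimeError(f"Transcription failed: {stderr.strip()}")
    # Assumed response shape: {"text": "..."}
    return json.loads(stdout)["text"]
```

A call might look like `transcribe_remote("meeting.wav", "http://localhost:8000", "some-model")`, where the file name, URL, and model name are placeholders. Passing the command as a list rather than a shell string avoids shell interpretation of the arguments, but the approach still spawns an external process from library code.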

cc.: @eyurtsev

lfenzo · 2024-09-23T14:11:03Z

@baskaryan

lfenzo · 2024-10-01T12:12:10Z

@efriis

efriis · 2024-10-31T02:42:20Z

Marking as stale and still in need of support, per the review process

efriis · 2024-12-11T22:42:31Z

libs/community/langchain_community/document_loaders/parsers/audio.py

+            RuntimeError: If the transcription process fails.
+        """
+
+        process = subprocess.Popen(


Unfortunately, this isn't acceptable for security reasons, so I will close this!

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. community Related to langchain-community labels Sep 18, 2024

lfenzo changed the title ~~feat(parser, audio): added audio-parser RemoteFasterWhisperPerser in audio.py~~ feat(parser, audio): added audio-parser RemoteFasterWhisperParser in audio.py Sep 18, 2024

lfenzo force-pushed the remote-faster-whisper branch from 58d08f2 to 362ea98 Compare September 19, 2024 12:05

feat(parser, audio): added audio-parser RemoteFasterWhisperPerser in …

581115a

…audio.py

lfenzo force-pushed the remote-faster-whisper branch from 362ea98 to 581115a Compare September 19, 2024 12:19

Merge branch 'master' into remote-faster-whisper

e0a09de

Merge branch 'master' into remote-faster-whisper

4ae8542

lfenzo force-pushed the remote-faster-whisper branch 3 times, most recently from adb3287 to 78576cf Compare November 10, 2024 00:47

feat(parser, audio): fixed formatting for remote faster whisper server

9ff7392

lfenzo force-pushed the remote-faster-whisper branch from 78576cf to 9ff7392 Compare November 10, 2024 00:51

lfenzo changed the title ~~feat(parser, audio): added audio-parser RemoteFasterWhisperParser in audio.py~~ Community: added audio-parser RemoteFasterWhisperParser in audio.py Nov 10, 2024

Merge branch 'master' into remote-faster-whisper

a7b7a34

efriis reviewed Dec 11, 2024

efriis closed this Dec 11, 2024
