Audio file loader implemented with Azure speech service #13988

kzmain · 2023-11-28T21:50:20Z

No description provided.

vercel · 2023-11-28T21:50:37Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Mar 4, 2024 6:08am

eyurtsev · 2023-12-02T02:25:02Z

libs/langchain/langchain/document_loaders/audio.py

+from langchain.document_loaders.parsers.audio import AzureSpeechServiceParser
+
+
+class AzureSpeechServiceLoader(GenericLoader):


Made a PR here to help with the pattern for Generic Loader: #14004 -- we don't want any file logic to live outside of file system blob loader.

@kzmain met know if you want to complete the refactor or if you want me to do it

Dear eyurtsev,

I found I miss used the GenericLoader, and updated the AzureSpeechServiceLoader class to inherit the BaseLoader class. I pushed it in the commit 7b8372d. Look forward to seeing feedback from you. Thanks.

Kind Regards,
Kai Zhang

@kzmain

I updated the service loader to use generic loader (see Improve file system blob loader and generic loader #14004)

I expanded the kwargs so the parameters are easy to discover and refactored some of the resolving of values

Would you be able to test that the service still works for you? I don't have credentials set up so testing help will be helpful

Could you run the parser with 2 audio files at once using from_filesystem to confirm that transcribing more than one file at a time works?

eyurtsev · 2023-12-02T02:30:02Z

libs/langchain/langchain/document_loaders/parsers/audio.py

+class AzureSpeechServiceParser(BaseBlobParser):
+    """Loads an Audio with azure.cognitiveservices.speech."""
+
+    def __init__(self, **kwargs: Any) -> None:


Could you add a link to the relevant pages on azure that explain the API?

Could you expand the kwargs and document the variable meanings?

Could we add a parameter to control the polling interval (i.e., sleep is hard-coded to 0.5 seconds right now, we should expose this to the user)

nit: It's common to use logger rather than print statements in production code, you could consider swapping to using logger.info / logger.error as appropriate. (not a requirement to merge since we have some other parsers that are using print right now)

Dear eyurtsev,

I fixed all the issues in the commit 7b8372d:

I Added Azure Speech service official documents and code pages.
I expanded and documented the kwargs meanings.
I added a parameter to control the transcribe job's polling interval.

Furthermore, I did all the unit tests in the test_audio.py file and here's the evidence:

Look forward to seeing feedback from you. Thanks.

1. Fix misuse of GenericLoader in the AzureSpeechServiceLoader class 2.1. Add Azure Speech service official documents and code pages. 2.2. Expand and document the kwargs meanings. 2.3. Add a parameter to control the transcribe job's polling interval

eyurtsev · 2023-12-05T19:47:18Z

libs/langchain/langchain/document_loaders/audio.py

+from langchain.document_loaders.parsers.audio import AzureSpeechServiceParser
+
+
+class AzureSpeechServiceLoader(GenericLoader):


@kzmain

I updated the service loader to use generic loader (see Improve file system blob loader and generic loader #14004)

I expanded the kwargs so the parameters are easy to discover and refactored some of the resolving of values

Would you be able to test that the service still works for you? I don't have credentials set up so testing help will be helpful

Could you run the parser with 2 audio files at once using from_filesystem to confirm that transcribing more than one file at a time works?

eyurtsev · 2023-12-05T19:48:23Z

libs/langchain/tests/integration_tests/test_audio.py

+SPEECH_SERVICE_KEY = ""
+
+
+def _get_csv_file_path() -> str:


Could you fix the testing code here?

eyurtsev · 2023-12-05T19:48:58Z

libs/langchain/tests/integration_tests/test_audio.py

+
+
+def test_azure_speech_load_key_region_auto_detect_languages() -> None:
+    loader = AzureSpeechServiceLoader(


loader = AzureSpeechServiceLoader.from_filesystem(...)

Dear @eyurtsev ,

AzureSpeechServiceLoader doesn't inherit the GenericLoader but the BaseLoader just like the CSVLoader and the PDFLoader. In this case, it does not use the from_filesystem() function but load the file with class initializer like: AzureSpeechServiceLoader(file_path: str) format.

I uploaded the unit tests success screenshot in previous conversation. I am wondering if I upload my unit test result with console result with console and the test code (without my credential), will help your code review?

Dear @eyurtsev ,

First appreciate you soooo much for your updated my parser neat and well-structured, I found it until I got home. Here's what I updated today:

the class AzureSpeechServiceParser __init__ function kwargs' type

changed the loader from a GenericLoader to BaseLoader, and added the lazy_load function

updated the unit tests

…oader to BaseLoader

kzmain · 2023-12-25T14:20:20Z

Dear @eyurtsev Can you help me to check if my commit is available to merge?

eyurtsev · 2024-02-28T21:05:07Z

libs/community/tests/unit_tests/document_loaders/test_audio.py

+from langchain_community.document_loaders import AzureAISpeechLoader
+
+SPEECH_SERVICE_REGION = "eastasia"
+SPEECH_SERVICE_KEY = "c77dcf2aa5c04dd6b6613f77d9d9161d"


@kzmain if this is a correct service key assume that it's been compromised since it's available here in plain text

eyurtsev · 2024-12-11T23:00:41Z

Hi @kzmain sorry for failing to follow on this PR. I'm going to close this since this is a year old and still hasn't been merged. I appreciate you contributing to the project and really apologize that it's taken so long to get a resolution.

Audio file loader implemented with Azure speech service

e80074a

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. Ɑ: doc loader Related to document loader module (not documentation) 🤖:enhancement A large net-new component, integration, or chain. Use sparingly. The largest features labels Nov 28, 2023

Merge branch 'master' into feature_audio_loader_auzre_speech

c3a9f6f

vercel bot had a problem deploying to Preview November 28, 2023 22:00 Failure

baskaryan self-assigned this Nov 28, 2023

baskaryan added 2 commits November 28, 2023 14:51

Merge branch 'master' into feature_audio_loader_auzre_speech

3e2c139

cr

add9346

baskaryan assigned eyurtsev Nov 28, 2023

eyurtsev self-requested a review November 29, 2023 02:49

Merge branch 'master' into feature_audio_loader_auzre_speech

f1c7eef

eyurtsev reviewed Dec 2, 2023

View reviewed changes

kzmain and others added 8 commits December 3, 2023 03:26

Merge branch 'master' into feature_audio_loader_auzre_speech

7b8372d

Merge branch 'master' into feature_audio_loader_auzre_speech

6c7ca55

Merge branch 'master' into feature_audio_loader_auzre_speech

ba1d82d

Fix lint issue

f407c47

Merge branch 'master' into feature_audio_loader_auzre_speech

b1f7f3c

x

ec5bdb6

x

4c639b0

eyurtsev reviewed Dec 5, 2023

View reviewed changes

kzmain and others added 6 commits December 6, 2023 20:14

Update unit test, parser kwarg's type and change loader from GenericL…

1c41d34

…oader to BaseLoader

Merge branch 'master' into feature_audio_loader_auzre_speech

46b5ef3

Merge branch 'master' into feature_audio_loader_auzre_speech

61786f7

Merge branch 'master' into feature_audio_loader_auzre_speech

a99f350

Merge branch 'master' into feature_audio_loader_auzre_speech

edbb795

move loader to the langchain_community

a6b49b2

Linting and formatting

5b830b3

kzmain added 3 commits December 28, 2023 17:58

Merge branch 'master' into feature_audio_loader_auzre_speech

b4c6272

Merge branch 'master' into feature_audio_loader_auzre_speech

ae2d85b

Merge branch 'master' into feature_audio_loader_auzre_speech

f84fcef

hwchase17 closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

kzmain and others added 2 commits February 1, 2024 01:31

Merge branch 'master' into feature_audio_loader_auzre_speech

6d58293

Merge branch 'master' into feature_audio_loader_auzre_speech

fd3a3a0

eyurtsev mentioned this pull request Feb 28, 2024

community[minor]: Audio file loader implemented with Azure speech service #18284

Closed

kzmain added 4 commits February 29, 2024 16:46

Merge branch 'master' into feature_audio_loader_auzre_speech

9746cb9

Merge branch 'master' into feature_audio_loader_auzre_speech

24c0786

Merge branch 'master' into feature_audio_loader_auzre_speech

f534e3d

Merge branch 'master' into feature_audio_loader_auzre_speech

7d1ad03

ccurme added the community Related to langchain-community label Jun 18, 2024

eyurtsev reviewed Dec 11, 2024

View reviewed changes

eyurtsev closed this Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio file loader implemented with Azure speech service #13988

Audio file loader implemented with Azure speech service #13988

kzmain commented Nov 28, 2023

vercel bot commented Nov 28, 2023 •

edited

Loading

eyurtsev Dec 2, 2023

kzmain Dec 2, 2023

eyurtsev Dec 5, 2023

eyurtsev Dec 2, 2023

kzmain Dec 2, 2023

eyurtsev Dec 5, 2023

eyurtsev Dec 5, 2023

eyurtsev Dec 5, 2023

kzmain Dec 6, 2023

kzmain Dec 6, 2023 •

edited

Loading

kzmain commented Dec 25, 2023

eyurtsev Feb 28, 2024

eyurtsev commented Dec 11, 2024

		from langchain.document_loaders.parsers.audio import AzureSpeechServiceParser


		class AzureSpeechServiceLoader(GenericLoader):



		def test_azure_speech_load_key_region_auto_detect_languages() -> None:
		loader = AzureSpeechServiceLoader(

Audio file loader implemented with Azure speech service #13988

Audio file loader implemented with Azure speech service #13988

Conversation

kzmain commented Nov 28, 2023

vercel bot commented Nov 28, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kzmain Dec 6, 2023 • edited Loading

Choose a reason for hiding this comment

kzmain commented Dec 25, 2023

Choose a reason for hiding this comment

eyurtsev commented Dec 11, 2024

vercel bot commented Nov 28, 2023 •

edited

Loading

kzmain Dec 6, 2023 •

edited

Loading