Add Funcodec #3

indiejoseph · 2024-11-15T21:16:28Z

This pull request includes several changes to the codec_bpe/audio_to_codes.py file to enhance the functionality and readability of the code. The main changes involve reformatting argument parsing, adding support for a new model, and updating encoding logic.

Argument Parsing Enhancements:

Reformatting of the argument parsing section for better readability.

New Model Support:

Added support for the Funcodec model, including new arguments and handling logic. [1] [2]

Encoding Logic Updates:

Updated the sample rate logic to include the Funcodec model.
Modified the encoding logic to handle the Funcodec model.

AbrahamSanders · 2024-11-16T21:21:18Z

Hey @indiejoseph thanks for this contribution! I'll review it soon.

AbrahamSanders · 2024-11-17T23:34:39Z

@indiejoseph in testing it out I get this error:

  File "/home/codec-bpe/codec_bpe/audio_to_codes.py", line 115, in <module>
    model = Speech2Token(config_file, model_pth, device=device)
  File "/home/anaconda3/envs/dev/lib/python3.9/site-packages/funcodec/bin/codec_inference.py", line 68, in __init__
    with open(config_file, "rt", encoding="utf-8") as f:
FileNotFoundError: [Errno 2] No such file or directory: 'facebook/funcodec-base-8bit/config.yaml'

I'm not familiar with how downloading works with ModelScope - ideally it should work like in transformers, where the model is automatically downloaded from the huggingface hub if it does not exist. Alternatively to implementing this directly in audio_to_codes, instructions to download the model in the readme would be sufficient.

indiejoseph · 2024-11-18T09:12:16Z

Oh sorry, coz the it required to download the model in the working folder, and I have not add the instruction or other information into the README.md, this is my fault. I will update the PR. And it doesnt work like how transformers does, that have to download manually.
https://huggingface.co/alibaba-damo/audio_codec-encodec-zh_en-general-16k-nq32ds640-pytorch

AbrahamSanders · 2024-11-21T06:04:02Z

@indiejoseph thanks - please add a short instruction to the Readme on how to download the model. I'll review once that's in.

Add FunCodec usage

indiejoseph · 2024-11-21T10:48:35Z

I've added a section into README.md, please check

indiejoseph added 2 commits November 15, 2024 21:15

add funcodec

539b2f5

funcodec with n_quantizers arg

7edf5ec

Update requirements.txt

0f55a7c

Update README.md

152fd43

Add FunCodec usage

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Funcodec #3

Add Funcodec #3

indiejoseph commented Nov 15, 2024

AbrahamSanders commented Nov 16, 2024

AbrahamSanders commented Nov 17, 2024

indiejoseph commented Nov 18, 2024

AbrahamSanders commented Nov 21, 2024

indiejoseph commented Nov 21, 2024

Add Funcodec #3

Are you sure you want to change the base?

Add Funcodec #3

Conversation

indiejoseph commented Nov 15, 2024

Argument Parsing Enhancements:

New Model Support:

Encoding Logic Updates:

AbrahamSanders commented Nov 16, 2024

AbrahamSanders commented Nov 17, 2024

indiejoseph commented Nov 18, 2024

AbrahamSanders commented Nov 21, 2024

indiejoseph commented Nov 21, 2024