Technology | Description |
---|---|
PyTorch | A machine learning framework based on the Torch library, used for applications such as computer vision and natural language processing, originally developed by Meta AI and now part of the Linux Foundation umbrella. |
Python | A high-level, general-purpose programming language |
Transformers | It provides APIs and tools to easily download and train state-of-the-art pretrained models |
openai/whisper-large-v3 | A pre-trained model for automatic speech recognition (ASR) and speech translation.a pre-trained model for automatic speech recognition (ASR) and speech translation. |
Accelerate | A library that enables the same PyTorch code to be run across any distributed configuration |
librosa | A python package for music and audio analysis. |
FFmpeg | A suite of libraries and programs for handling video, audio, and other multimedia files and streams |