Albanian Transcriber using Machine learning | DibraSpeaks.

This project is an AI-based transcription tool for the Albanian language. The tool is designed to automatically transcribe Albanian speech to text using Python.

Features

Automatic speech recognition for Albanian language with our best trained model
User-friendly interface to label and validate speech datas
Dataset with Albanian Speech datas.

Installation

This project is made up of three main parts: an API that serves all the files to the UI, a section that is used to label and validate Albanian speech data, and the third part which is the model test. In the web folder are all the files needed to run the UI, in the api folder are the files needed to run the faastAPI, and in the automation folder are some scripts used for many things, including generating datasets. The folder training includes all the notebooks used to train the model.

Docker is required to run the project locally. To run the project locally run:

docker-compose up --build -d

After all the containers are up and running you should find at:

localhost:140 -> API

localhost:120 -> WEB

Domains

Production

The PRODUCTION WEB interface: dibraspeaks.uneduashqiperine.com

The PRODUCTION API: api.uneduashqiperine.com

Dataset amd Pre-trained Models Links

https://www.kaggle.com/flooxperia/

Contributing

We welcome all contributions to this project. If you have any ideas for new features or improvements, please feel free to create an branch, and test it by creating a pull request to merge your ticket to dev. we will try to approve the request as soon as possible and the change will be for some time in dev. After the feature or the improvement have been tested on dev than we can merge it to production.

Writing about the project

finalyearproject.pdf

Screenshots

Our model Architecture VS Deepspeech on our dataset

Authors

@florijanqosja

Buy Me A Coffee

https://www.buymeacoffee.com/florijanqosja https://github.com/sponsors/florijanqosja

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 142 Commits
.github		.github
.vscode		.vscode
api		api
automation		automation
notebooks		notebooks
old_web		old_web
web		web
.env		.env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
project_paper.pdf		project_paper.pdf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Albanian Transcriber using Machine learning | DibraSpeaks.

Features

Installation

Domains

Production

Dataset amd Pre-trained Models Links

Contributing

Writing about the project

Screenshots

Authors

Buy Me A Coffee

License

About

Releases

Sponsor this project

Packages

Languages

License

florijanqosja/Albanian-ASR

Folders and files

Latest commit

History

Repository files navigation

Albanian Transcriber using Machine learning | DibraSpeaks.

Features

Installation

Domains

Production

Dataset amd Pre-trained Models Links

Contributing

Writing about the project

Screenshots

Authors

Buy Me A Coffee

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Languages

Packages