PSTT- Python Speech to Text

A web application built using Flask.

Features

Speech to Text Transcription:
- Upload option- a button to upload audio file & perform transcription along with displaying it on the same page.
- Microphone option- a button to record audio & perform transcription along with displaying it on the same page.
Number of filler words displayed as a count from the transcribed text.
Energy graph of the audio file.

Implementation

PyAudio and SpeechRecognition are the two main APIs used for implementation. WebAudioRecorder.js is a JavaScript library that is used here to record audio input (Web Audio API AudioNode object) and encode to audio file image (Blob object).

Sample audio files are present in wavFiles

After installing the requirements, in the terminal use: python -m flask run to run the application.

The web application looks like this:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PSTT- Python Speech to Text

Features

Implementation

About

Languages

License

iamarchisha/PSTT

Folders and files

Latest commit

History

Repository files navigation

PSTT- Python Speech to Text

Features

Implementation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages