This is a small tool could help you generate the transcript from the Google ASR output json files.
The code is inspired and modified from tscribe. tscribe
is a wonderful tool to help you produce transcriptions using the ASR files from AWS.
The input is expected to be a json
file from Google ASR, and the output is the txt file.
Tested with Python 3.7 via virtual environment.
Clone the repo, go to the repo folder, setup the virtual environment, and install the required packages:
$ python3 -m venv venv
$ source venv/bin/activate
$ pip install -r requirements.txt
import gtranscribe
gtranscribe.write("sample.json")
Start loading sample.json
Start decoding sample.json
Start generating the output file sample.txt
Finish!