Skip to content

Commit

Permalink
Image sizes
Browse files Browse the repository at this point in the history
  • Loading branch information
ookgezellig committed Nov 5, 2024
1 parent 09a406b commit 91d6e59
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 5 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,9 +37,9 @@ Using this toolkit, an mp4-video has been converted into the following products:
- [Closed captions / subtitles](https://commons.wikimedia.org/wiki/TimedText:Wikidata_Workshop_-_Theoretical_part_-_Maastricht_University_-_15_October_2024.webm.en.srt) in English were also generated. This was done using Whisper with ChatGPT post-corrections.

## Articles
* [How to create high-quality offline video transcriptions and subtitles using Whisper and Python](stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md)
* [How to create high-quality offline video transcriptions and subtitles using Whisper and Python](stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md), 5 November 2024

## Info
* Latest update: 22 October 2024
* ## Info
* Latest update: 5 November 2024
* Author: Olaf Janssen (ookgezellig) - Supported by ChatGPT
* License: Creative Commons CC0 - http://creativecommons.org/publicdomain/zero/1.0
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# How to create high-quality offline video transcriptions and subtitles using Whisper and Python

<image src="media/afbeelding1.png" width="300" hspace="10" align="right"/>
<image src="media/afbeelding1.png" width="400" hspace="10" align="right"/>

I always thought that 'doing things with AI' was equivalant to smoking data centers, overheated servers, and massive cloud computing power.

Expand Down Expand Up @@ -30,7 +30,7 @@ As I work with ChatGPT regularly, I had heard of [Whisper, OpenAI’s speech-to-

After some research to see if this could suit my ASR (Automatic Speech Recognition) needs, I found out that [this model excels in Dutch](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). But it also performs well in English.

<image src="media/afbeelding2.png" width="300" hspace="10" align="right"/>
<image src="media/afbeelding2.png" width="400" hspace="10" align="right"/>

OK, that already sounds promising. But Whisper doesn’t have a user-friendly front end (as far as I know), so I had to work with the API and Python. Fortunately, I found [this short blog post](https://nicobytes.com/blog/en/how-to-use-whisper/) to help me get started, and, combined with the [documentation](https://platform.openai.com/docs/guides/speech-to-text), it was straightforward to set up.

Expand Down

0 comments on commit 91d6e59

Please sign in to comment.