diff --git a/README.md b/README.md index 14b0d39..24a0658 100644 --- a/README.md +++ b/README.md @@ -37,9 +37,9 @@ Using this toolkit, an mp4-video has been converted into the following products: - [Closed captions / subtitles](https://commons.wikimedia.org/wiki/TimedText:Wikidata_Workshop_-_Theoretical_part_-_Maastricht_University_-_15_October_2024.webm.en.srt) in English were also generated. This was done using Whisper with ChatGPT post-corrections. ## Articles -* [How to create high-quality offline video transcriptions and subtitles using Whisper and Python](stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md) +* [How to create high-quality offline video transcriptions and subtitles using Whisper and Python](stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md), 5 November 2024 -## Info -* Latest update: 22 October 2024 +* ## Info +* Latest update: 5 November 2024 * Author: Olaf Janssen (ookgezellig) - Supported by ChatGPT * License: Creative Commons CC0 - http://creativecommons.org/publicdomain/zero/1.0 diff --git a/stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md b/stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md index a83c3d5..17e9ac9 100644 --- a/stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md +++ b/stories/How to create high-quality offline video transcriptions and subtitles using Whisper and Python.md @@ -1,6 +1,6 @@ # How to create high-quality offline video transcriptions and subtitles using Whisper and Python - + I always thought that 'doing things with AI' was equivalant to smoking data centers, overheated servers, and massive cloud computing power. @@ -30,7 +30,7 @@ As I work with ChatGPT regularly, I had heard of [Whisper, OpenAI’s speech-to- After some research to see if this could suit my ASR (Automatic Speech Recognition) needs, I found out that [this model excels in Dutch](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages). But it also performs well in English. - + OK, that already sounds promising. But Whisper doesn’t have a user-friendly front end (as far as I know), so I had to work with the API and Python. Fortunately, I found [this short blog post](https://nicobytes.com/blog/en/how-to-use-whisper/) to help me get started, and, combined with the [documentation](https://platform.openai.com/docs/guides/speech-to-text), it was straightforward to set up.