✨Update (01/2023): I'm working on this project again, but things are going to be a little messy while I get a new process set up.
✨Update (01/2022): This project is now inactive. 😦 If I find renewed time or interest, I'll start it back up again.
✨Update (09/2021): I'm looking for help! See the contribution guide for more info.
This project hosts transcripts for Metamuse, a podcast about tools for thought, product design, and how to have good ideas, from the team behind Muse. You can see a sample hosted at https://ppkn.github.io/metamuse-transcripts/
- original/ has the transcripts as they came out from Rev. I don't expect these to be useful to anyone; I just thought they would be interesting to see.
- captions/ has WebVTT files for use with an interactive transcript player like Able Player.
- The files in plaintext/ are the updated output of the originals. The text is split into paragraphs headed by the name of the speaker and the timestamp.
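If you want to work with the WebVTT files in captions/ outside of a player, here is a minimal sketch of reading the cues with plain Python. It only assumes the standard WebVTT layout (a `WEBVTT` header, then blank-line-separated cues with a `start --> end` timing line); the `Cue` type and the `parse_vtt` name are made up for this example and are not part of this repo.

```python
import re
from typing import List, NamedTuple


class Cue(NamedTuple):
    """One caption cue (hypothetical helper type, not part of this repo)."""
    start: float  # seconds from the start of the episode
    end: float
    text: str


# Timing line, e.g. "00:01:02.250 --> 00:01:05.000" (the hours part is optional).
_TIMING = re.compile(
    r"^((?:\d+:)?\d{2}:\d{2}\.\d{3})\s+-->\s+((?:\d+:)?\d{2}:\d{2}\.\d{3})"
)


def _to_seconds(stamp: str) -> float:
    """Convert "hh:mm:ss.ttt" or "mm:ss.ttt" to seconds."""
    parts = stamp.split(":")
    seconds = float(parts[-1])
    minutes = int(parts[-2])
    hours = int(parts[0]) if len(parts) == 3 else 0
    return hours * 3600 + minutes * 60 + seconds


def parse_vtt(content: str) -> List[Cue]:
    """Return the cues found in the text of a WebVTT file."""
    cues = []
    normalized = content.replace("\r\n", "\n").strip()
    # Blocks (header, notes, cues) are separated by blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        for i, line in enumerate(lines):
            match = _TIMING.match(line.strip())
            if match:
                # Everything after the timing line is the cue text.
                text = " ".join(l.strip() for l in lines[i + 1:] if l.strip())
                cues.append(
                    Cue(_to_seconds(match.group(1)), _to_seconds(match.group(2)), text)
                )
                break  # blocks without a timing line (header, notes) are skipped
    return cues
```

Calling `parse_vtt(open(path, encoding="utf-8").read())` on one of the caption files should give a list of cues you can search or re-time. It ignores cue settings and styling, so treat it as a starting point rather than a full WebVTT parser.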
After trying a couple of homegrown approaches, I landed on using automated transcription from Rev to get the original transcripts and I'm using their online editor to fix things. I'm keeping files up here so people can open pull requests to fix any errors they find in the transcripts. I got this idea from Design Details. It's a great podcast and you should listen to it.
TL;DR: there is an open issue to correct the auto-generated transcript for each episode. Copy the file from original/ to plaintext/, fix the errors, and open a pull request.
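If you'd rather script the copy step, something like the sketch below would do it. The `start_correction` name and the example file name are hypothetical; only the original/ and plaintext/ folder names come from this repo. It refuses to overwrite a file that already exists in plaintext/, so you don't clobber someone else's corrections.

```python
import shutil
from pathlib import Path


def start_correction(repo_root: str, filename: str) -> Path:
    """Copy original/<filename> to plaintext/<filename> and return the new path.

    Hypothetical helper for illustration; not a script shipped with this repo.
    """
    root = Path(repo_root)
    source = root / "original" / filename
    target = root / "plaintext" / filename

    if not source.is_file():
        raise FileNotFoundError(f"No original transcript at {source}")
    if target.exists():
        # Someone may already be correcting this episode; don't overwrite it.
        raise FileExistsError(f"{target} already exists")

    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(source, target)
    return target
```

For example, `start_correction(".", "some-episode.txt")` run from the repo root would create plaintext/some-episode.txt (the file name here is made up). From there, fix the errors in the copy and commit it.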
Detailed info for contributing can be found in the contribution guide.
Reduct has very generously donated their resources to make the machine transcription happen. Show them some love by attributing them if you use this in a project for something outside of personal use.
This work is licensed under a Creative Commons Attribution 4.0 International License.