Skip to content

ppkn/metamuse-transcripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Metamuse Transcripts

Update (01/2023) I'm working on this project again, but things are going to be a little messy while I get a new process set up.
Update (01/2022) This project is now inactive. 😦. If I gain renewed time/interest, I'll start it back up again.
Update (09/2021): I'm looking for help! See the contribution guide for more info.

This project hosts transcripts for Metamuse, a podcast about tools for thought, product design, and how to have good ideas, from the team behind Muse. You can see an sample hosted at https://ppkn.github.io/metamuse-transcripts/

Structure

  • original/ has the transcripts as they came out from Rev. I don't expect these to be useful to anyone, I just thought they would be interesting to see.
  • captions/ has WebVTT files for use with an interactive transcript player like Able Player
  • The files in plaintext/ are the updated output of the originals. The text is split into paragraphs headed by the name of the speaker and the timestamp.

Methodology

After trying a couple of homegrown approaches, I landed on using automated transcription from Rev to get the original transcripts and I'm using their online editor to fix things. I'm keeping files up here so people can open pull requests to fix any errors they find in the transcripts. I got this idea from Design Details. It's a great podcast and you should listen to it.

Contributing

TL;DR there is an open issue to correct the auto-generated transcript for each episode. Copy the file from original/ to plaintext/, fix the errors, and open an MR.

Detailed info for contributing can be found here

License

Reduct has very generously donated their resources to make the machine translation happen. Show them some love by attributing them if you use this in a project for something outside of personal use.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Releases

No releases published

Packages

No packages published

Languages