Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project Planning - Proof of Concept #1

Open
victormaricato opened this issue Jul 27, 2019 · 0 comments
Open

Project Planning - Proof of Concept #1

victormaricato opened this issue Jul 27, 2019 · 0 comments

Comments

@victormaricato
Copy link
Owner

victormaricato commented Jul 27, 2019

  • Specify projects scope and tasks

  • Core High-level Functions

    • Start Transcribing Robot
    • Start Filter and break-down Transcribing robot output
    • Start Natural Language Understanding Robot
    • Start Image Gathering Robot
    • Start Video Robot
    • Start to Export Video
  • Transcribing Robot

    • Ask input file (.raw format)
      • Get select filepath
      • Check if file exists
      • Data structure (Key-Value)
    • Make Google Speech-To-Text API return a transcript of a simple audio
      • Make it returns with Timestamp
      • Get the timestamps and phrases data structure and logic working
    • Make Google Speech-To-Text API return transcript of a complex audio
    • Break-down sentences in Speech-To-Text output.
      • Define an approach for splitting sentences. (Check if Google NLU can do this)
    • Save output (sentences) on data structure
  • NLU Robot

    • Load data structure
    • Send sentences programatically for Google NLU API.
    • Retrieve keywords and entities output from API.
    • Save on data structure.
  • Image Gathering Robot

    • Get Images from Keywords using Google Image API
      • Check Google Image API parameters for getting only Copyright-free images
      • Images and Sentences should be ordered according to transcription order of appearance.
    • Use ImageMagick for resizing and preparing downloaded images.
  • Video robot

    • Images should be added according to order.
    • Define how to match transcription/audio keyword with image
      • Possible approach is to check Word2Vec of keywords and checking if they're semantically different from each other
      • Another possible approach is to define images timeouts. Or define Word "Countouts" (for each 10 words, change the image to another that corresponds to the keyword of that snippet).
    • Render video
    • Save/Export video as .mp4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant