description |
---|
AI-powered lip-sync in seconds! |
There are use cases for many companies to use Audio files instead of Text-to-Speech. This is especially useful if you:
- Already have a large content library of high quality voice overs
- Want a specific voice that AI can't produce currently
- Have a higher need for a realistic voice for your brand loyalty
Audio File with Human Generated Speech | AI Generated Speech |
---|---|
PROS | PROS |
Realistic and accurate voice | No production/recording cost |
Brand consistency is high | Faster iteration for brand if changes are needed |
CONS | CONS |
High production cost (voice actor, recording and mastering) | Robotic Voice, can be inconsistent |
Slower turn around time | Harder to add intonation. E.g. stressing on certain words or saying things with a particular emotion |
Try it here
{% embed url="https://gooey.ai/Lipsync/" %}
{% embed url="https://www.youtube.com/watch?v=EJdtC0USujM" %}
Prep your avatar video or photograph. Here are some pointers when choosing your image:
- Make sure the media is high-resolution
- Ensure it clearly shows all the features of your talking head
- The image must be cropped up till bust height
- Use only human faces
For this example, we have used Alfred Hitchcock! 🐦
Upload your audio file. This can be in .wav/.mp3 format.
{% hint style="info" %} Note: Use shorter pieces of audio, to ensure high quality lipsync with low-latency and minimum distortion. {% endhint %}
Our workflow allows for multilingual lip-sync. Try our hindi example below:
{% embed url="https://gooey.ai/Lipsync/?example_id=eu8o3GshpBQ" %}
Hit “Submit” ☄️🚀
{% embed url="https://storage.googleapis.com/dara-c1b52.appspot.com/daras_ai/media/d25e4d82-4b73-11ee-9484-02420a0001b2/gooey.ai%20lipsync.mp4" %}
Try it here:
{% embed url="https://gooey.ai/Lipsync/" %}
You can use the “Face Padding” settings to improve the accuracy of the detected face in the image/video. This ensures that the Lip Sync video looks more realistic.