GAN movie maker

This program create a video clip from an audio file chosen by the user. The generated clip adapt himself to the frequencies and the rythm of the chosen track.

Requirements

Python version : 3.6 or 3.7 (you can test with others, it should work).

Python packages : tensorflow, tensorflow-hub, numpy, open-cv, librosa, pandas, scikit-learn, Pillow, scipy, ffmpeg and their dependencies.

FFmpeg should be installed :

linux : sudo apt install ffmpeg

mac : brew install ffmpeg

windows:

1 download ffmped buil here : https://ffmpeg.zeranoe.com/builds/

2 unzip the downloaded folder

3 rename it "FFmpeg"

4 paste it in your local disk

5 add a new environment variable for ffmpeg

more information here : https://www.wikihow.com/Install-FFmpeg-on-Windows

If you want to run the program using a GPU, here are some information : https://www.tensorflow.org/install/gpu

How to use it

Launch "main.py".

You have two main choices : creates images or create video. You have two create images first (the created images will be stored in the folder "images") and then create the video.

1-Create images :

Select an audio track :

You can add your own audio track by adding it in the "audioFiles" folder. When you launch the code, you can choose your audio file among those in the "audioFiles" folder.

Select a frame rate :

You can choose the number of images generated for each second of the chosen audio file. We recommend 60 and it can not work above 65. The results are bad for a frame rate below 20.

Number of images :

The image structures used to create new images from the audio come from the website Artbreeder (https://www.artbreeder.com). We collected different types of image from this website and they are stored in the script "classes.py". We ask the user the number of images he wants to use to make the video. The user can add his own images in "classes.py".

Transitions images :

You will be asked to choose the number of transition images. The transition images are created to smooth the video.

If you don't want transition images then choose 0.

The choice of the number of transition images will depend on the framerate. For a good compromise between a dynamic and a fluid video 20 images of transition for a framerate of 60 is a good choice (For a framerate of 30, it will be 10 transition images, you have to keep the same ratio).

At this step make sure you remember the audio and the frame rate you chose, you will need it to create the video.

2-Create video :

Resolution :

You can choose the resolution of the final video, the original video is in 512x512, you can upscale or downscale it. The original 512x512 video will be kept anyway.

Frame rate :

Keep the same framerate as the one you selected in "Create images".

Sound selection :

Keep the same audio as the one you selected in "Create images".

How the code work

Fourier transform :

Firstly we use the Fourier transform to compute the frequency response of the audio. The Fourier transform is applied on each parts of the audio track in function of the framerate (for exemple if the audio is 1 seconds long and the framerate is 24, we will compute the Fourier transform 24 times.

Exponential moving average :

In order to smooth the frequency response and obtain a fluid video, we use the exponential moving average that will smooth the signal.

Normalization :

The GAN function that we will use later take value between -1 and 1. If lower or greater values are used, the resulting images are strange. Therefore, we normalize the data between these two values.

Image type attribution :

We use K-mean clustering to give an image type (a class) to each part of the audio file. For example, if the user chose to use 4 different type of images, the clustering will be compute with 4 clusters.

Image creation :

To create the images we inject our data in the GAN function. Three parameters are given to the GAN function: the "latent space" defined by the audio smoothed and normalized frequencies, the "class" of the image define by the clustering and another parameter called "truncation" that is different for each classes.

The programm will run as following : if the class of N is the same as the class of N + 1, we create the image. If not, we generate transition images between N and N + the number of transitions chosen by the user.

Video creation : Finally we put together all the created images in one video and we add the chosen audio track and we add it on the video.

Function of the different scripts

add_audio : add the audio track on the created video

biggan : the GAN function, it generates images

change_video_resolution : upscale the video with a video resolution defined by the user

classes : a collection of image classes from the website Artbreeder, we can add or delete some classes depending of what kind of image we want in our video

cli : extract transition image data from JSON files in order to generate these transition images

create_images : create images

create_transitions : create transition images

create_video : create the video from the create images in the "images" folder

fourier : perform the audio processing, Fourier transform, exponetial moving average, normalization and clustering

ganbreeder :

image_utils : define the image properties

latent_space : interpolation function to create transition images

main : launch the program, ask the user abut the properties of the video that he wants to create

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
audioFiles		audioFiles
jsonStore		jsonStore
old		old
README.md		README.md
add_audio.py		add_audio.py
biggan.py		biggan.py
change_video_resolution.py		change_video_resolution.py
classes.py		classes.py
cli.py		cli.py
create_images.py		create_images.py
create_transitions.py		create_transitions.py
create_video.py		create_video.py
fourier.py		fourier.py
ganbreeder.py		ganbreeder.py
image_utils.py		image_utils.py
latent_space.py		latent_space.py
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GAN movie maker

Requirements

How to use it

How the code work

Function of the different scripts

About

Releases

Packages

Contributors 3

Languages

FABB2011/GAN_PI-2

Folders and files

Latest commit

History

Repository files navigation

GAN movie maker

Requirements

How to use it

How the code work

Function of the different scripts

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages