Skip to content
This repository has been archived by the owner on Feb 1, 2024. It is now read-only.

What is input in the def call()? #59

Open
Megan8821 opened this issue Sep 16, 2022 · 4 comments
Open

What is input in the def call()? #59

Megan8821 opened this issue Sep 16, 2022 · 4 comments

Comments

@Megan8821
Copy link

Hi, I am looking inside the code. I've seen a lot of methods about def call(self, inputs) in your code, especially looking at this one.

  def call(self, inputs):
    synth_params = self.get_synth_params(inputs)

However, I couldn't find out what's the calculation of inputs, there are some clues I've found. In those codes, inputs is respond to the data in get_fake_data_synthesis_generator, then what are the data and units you input to get_fake_data_synthesis_generator? Frames? Amplitude or anything else?

Thanks!

@lukewys
Copy link
Contributor

lukewys commented Sep 16, 2022

Hi, if you are referring to the input to the Synthesis Generator and DDSP Inference, the input format is the one used in training data. Please see #52 for more details.

@Megan8821
Copy link
Author

Megan8821 commented Sep 17, 2022

Thanks for replying. So If I want to input my own data, I need to transfer the format of data to tf.record?

Or let's say what's the way you calculate the attack feature? Do you use DDSP noise magnitude Inference?

@lukewys
Copy link
Contributor

lukewys commented Sep 18, 2022

So If I want to input my own data, I need to transfer the format of data to tf.record? Yes.

Or let's say what's the way you calculate the attack feature? Do you use DDSP noise magnitude Inference? This is not related to input format. We calculate the attack feature by taking the average of first 10 frames of noise magnitude used for DDSP synthesis. “Do you use DDSP noise magnitude Inference?” if you are referring to the fact that noise magnitude is estimated from the input, just like DDSP does, then yes, that is correct.

@Megan8821
Copy link
Author

Got it!
So if I use the way as DDSP has done, compute the noise magnitude calculation in DDSP, input the audio with tfrecord format, taking the average of first 10 frames, then we can get the attack values(But still need to figure out how to compute the noise magnitude, seems a little bit complicated for me.).)
In another case, I need to compute the attack in another way, then put it in the MIDIDDSP model as the attack parameter. But do you suggest computing attack without using DDSP inference?

Thanks for replying, I appreciate it.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants