Can I test with custom data? #51
Comments
Huge thanks for your interest in our work! First, sorry that training on custom datasets is still hacky. For your question:
Hope this helps. I will probably work on updating this codebase to support arbitrary datasets, but I don't know exactly when I will have the time.
Thank you for the detailed reply!
Hi! I think "note_active_frame_indices" is actually used in model training because this feature is used to calculate |
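Hi, note_active_frame_indices is a binary tensor containing the onset information. The tensor has shape [num_frame, 128], and if note_active_frame_indices[i,j] is 1, it means that at the i-th frame, pitch j is on. Applying argmax along the -1 dimension turns note_active_frame_indices into data['midi']. data['midi'] is a 1D integer tensor of shape [num_frames] where each item is the MIDI pitch number (integer) at that frame. Thus, MIDI-DDSP relies on data['midi'] for the MIDI input and to get the note boundaries. It is crucial for training the model. Hope this helps. Best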
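To make the shapes concrete, here is a minimal sketch of the relationship described in this thread: a [num_frame, 128] activity tensor is reduced with argmax over the last dimension to a 1D pitch-per-frame sequence like data['midi']. It uses plain Python lists instead of TensorFlow tensors so it runs without dependencies. The helper names (`one_hot`, `to_midi`, `note_boundaries`) are hypothetical, and the boundary logic is only one plausible way to read note boundaries off data['midi'], not necessarily what MIDI-DDSP does internally.

```python
NUM_PITCHES = 128  # one column per MIDI pitch number


def one_hot(pitch, value=1):
    """Build one frame: all zeros except a single active pitch."""
    row = [0] * NUM_PITCHES
    row[pitch] = value
    return row


def argmax(row):
    """Index of the largest entry (first index wins on ties)."""
    return max(range(len(row)), key=row.__getitem__)


def to_midi(note_active):
    """[num_frame, 128] activity -> [num_frame] MIDI pitch per frame."""
    return [argmax(row) for row in note_active]


def note_boundaries(midi):
    """Split the pitch sequence into (start, end_exclusive, pitch) runs.

    Assumption: a note boundary is any frame where the pitch changes.
    """
    notes = []
    start = 0
    for i in range(1, len(midi) + 1):
        if i == len(midi) or midi[i] != midi[start]:
            notes.append((start, i, midi[start]))
            start = i
    return notes


# Toy data: 3 frames of C4 (60), 2 frames of D4 (62), 1 frame of E4 (64).
frames = [one_hot(60)] * 3 + [one_hot(62)] * 2 + [one_hot(64)]
midi = to_midi(frames)
print(midi)
print(note_boundaries(midi))
```

Note that with this simple rule, two consecutive notes of the same pitch merge into one run, so separate onset information would still be needed to tell them apart.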
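Hi! Thanks a lot for your reply. This makes sense then. So the difference between note_active_frame_indices and note_onsets is that note_active_frame_indices indicates that the note "is being played" while note_onsets indicates that the note "starts". Did I get this correctly? Just a little follow-up question about the content of note_active_frame_indices: From the provided urmp tfrecords, I checked the tensor values of this feature before it was reshaped into [num_frame]. I found that for each frame, the 128-d array contains 127 zeros and an integer value (something like 86, 87, 427, 428..), instead of 127 zeros and a "1". I don't think this affects the feature data['midi'], but I was wondering why those integer values instead of a simple "1" :)) Maybe you could explain if you know why? Thanks!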
I don't remember very clearly as a long time has passed. I think the value
you are referring to is the velocity of that frame / note.
Best,
Yusong
adagio715 ***@***.***> wrote on Saturday, December 17, 2022 at 22:57:
… Hi, note_active_frame_indices is a binary tensor containing the onset
information. The tensor is at shape [num_frame, 128], and if
note_active_frame_indices[i,j] is 1, it means at i-th frame, the pitch j
is on. By applying argmax to the -1 dimension, the
note_active_frame_indices becomes data['midi'].
data['midi'] is a 1D integer tensor of shape [num_frames] where each item
is the MIDI pitch number (integer) at that frame. Thus, MIDI-DDSP relies on
data['midi'] for the MIDI input and to get the note boundary. It is crucial
to train the model.
Hope this helps.
Best
Hi! Thanks a lot for your reply. This makes sense then. So the difference
between note_active_frame_indices and note_onsets is that
note_active_frame_indices indicates that the note "is being played" while
note_onsets indicates that the note "starts". Did I get this correctly?
Just a little follow-up question about the content of
note_active_frame_indices: From the provided urmp tfrecords, I checked
the tensor values of this feature before it was reshaped into [num_frame].
I found that for each frame, the 128-d array contains 127 zeros and an
integer value (something like 86, 87, 427, 428..), instead of 127 zeros and
a "1". I don't think this affects the feature data['midi'], but I was
wondering why those integer values instead of a simple "1" :)) Maybe you
could explain if you know why?
Thanks!
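The observation that data['midi'] is unaffected can be checked with a small sketch: argmax only looks at where the largest value sits, so a frame with a single positive entry gives the same pitch whether that entry is 1, 86, or 427. The second half illustrates the "is being played" versus "starts" reading of the two features, which is the poster's interpretation and is not confirmed in this thread. All helper names are hypothetical, and plain Python lists stand in for the real tensors.

```python
NUM_PITCHES = 128


def make_row(pitch, value):
    """One frame with a single non-zero entry at the given pitch."""
    row = [0] * NUM_PITCHES
    row[pitch] = value
    return row


def argmax(row):
    """Index of the largest entry (first index wins on ties)."""
    return max(range(len(row)), key=row.__getitem__)


# Same pitches, stored once as 0/1 and once with larger positive integers.
binary = [make_row(60, 1), make_row(60, 1), make_row(67, 1)]
scaled = [make_row(60, 86), make_row(60, 87), make_row(67, 427)]

midi_binary = [argmax(r) for r in binary]
midi_scaled = [argmax(r) for r in scaled]
print(midi_binary == midi_scaled)


def onset_frames(midi):
    """Frames where a new pitch begins (assumed meaning of 'starts')."""
    return [i for i, p in enumerate(midi) if i == 0 or p != midi[i - 1]]


# Every frame is "active", but only frames 0 and 2 begin a note.
print(onset_frames(midi_scaled))
```

This only holds while the single non-zero entry is positive; it says nothing about what the stored integers actually encode.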
Hi, I am interested in this exciting project and am trying to test it with our custom dataset by reproducing the format of the original data. However, I have run into some difficulties and have the questions below.
Thank you for reading!