-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is it possible calculate inter DTW path matrices of a list of sequences in parallel? #12
Comments
Hi Percy, I'm not 100% sure that I understand what this API call should look like from your description, but are you looking to only run DTW, not all of DBA? And run it in parallel? The DTW code should be reentrant if you provide the optional cudaStream argument. If you can provide a concrete example of what the tensorflow wrapper should look like, I may be able to help more specifically. Cheers, Paul |
Hi, Paul Generally, the problem roots in the way that how tensorflow manages gpu/cpu resources. In particular, there are three issues.
The following is a minimal example of tensorflow warpper for dtw GPU OP. Some notes are included in the comment lines.
Best regards, |
Hi, Thanks for the wrapper info.
|
As for memory management in tensorflow, I think you could refer to this material at the first glimpse: https://github.com/miglopst/cs263_spring2018/wiki/Memory-management-for-tensorflow. Generally, all the tensor creator inherits from an abstract base class as the "allocator" (https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/framework/allocator.h). Then, if required something in detail, you could find various other allocator by searching "public: Allocator" in tensorflow src folder. I think the comments of Googlers are fairly enough to fully understand the whole tensorflow project.
That sounds great!
I am not 100% understand how to achieve this. But I am agree with that pitched memory allocation is quite critical. Very glad to see further cooperation. Best, |
Hi, Paul
Very excellent implementation of dtw.
I am trying to integrate the DTW sub-module into well-known tensorflow as an gpu operator. For being compatible to the tf APIs, it is better to have an operator calculating the dtw costs as well as the dtw path matrices between a sequence x and a list of sequences, respectively. However, the implementation of approximateMedoidIndices function seems to suggest that cuda kernel DTWDistance does not support this task. Any advices would be greatly appreciating.
Best,
Percy
The text was updated successfully, but these errors were encountered: