-
Notifications
You must be signed in to change notification settings - Fork 209
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem in reorg layer #49
Comments
Are you sure the error is in the reorg? |
I used the debug folder and checked the output of all the layers. Everything is correct until the reorg layer. This output is similar to darknet so the dimensions must be correct:
|
I found the problem. reorgForward(srcData, dstData, output_dim.n, output_dim.c, output_dim.h, output_dim.w, stride); Similarly void configure(const Dims* inputDims, int nbInputs, const Dims* outputDims, int nbOutputs, int maxBatchSize) override {
c = outputDims[0].d[0];
h = outputDims[0].d[1];
w = outputDims[0].d[2];
} Pull request #47, now contains this fix. |
Reorg Is a layer of the old version of YOLOs, in that time I was referring to the original darknet implementation, not the Alexey one. This layer in the Alexey impl is called reorg_old_layer: I have to check if this layer modification works even for the old YOLOs and then I will approve your changes. |
You are right. I didn't noticed this. Actually my configuration contains a |
Or we can add an option to the reorg, since the code is pretty much the same |
Ok. I did the fix. |
Using this network configuration the output of reorg layer using CUDNN and TensorRT is different from the exported outputs of darknet using CPU:
I have checked the code and noticed that it is the exact code in the darknet project for reorg layer on GPU. Could it be a bug in darknet code? Or there is something I am missing?
It should be noted that I have tested the provided network in darknet using both CPU and GPU many times and the final results are similar. But I have not compared all the elements of the output of reorg layer between CPU and GPU as in tkDNN.
P.S. I used my fork of this repo which is mentioned in #47.
The text was updated successfully, but these errors were encountered: