test speed #3
Comments
`auto output_tensor = CPU(kByte).tensorFromBlob(data, {output_height, output_width, 3});` takes an abnormally long time.
Sorry for the late reply, @jjn037. This piece of code is slow because it transfers the data from the GPU to the CPU. It would be useful to compare the timing of the C++ line with the equivalent PyTorch one:
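For a comparison like this, a small timing harness makes the numbers on the C++ side easier to trust. The sketch below is only an illustration under stated assumptions: `mean_ms_per_call` and `time_host_copy_ms` are hypothetical names, not part of ATen or this repository, and the workload is a plain host-side copy of a 224x224x3 byte buffer standing in for the real line. Because CUDA work is queued asynchronously, a host timer around a GPU-to-CPU copy may also absorb earlier queued work, so a real measurement would probably need a device synchronization before the timer starts and stops.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper (not part of ATen or this repository): calls `fn`
// `warmup` times untimed, then `iters` times timed, and returns the mean
// wall-clock time per call in milliseconds.
template <typename Fn>
double mean_ms_per_call(Fn&& fn, int warmup, int iters) {
    assert(warmup >= 0 && iters > 0);
    for (int i = 0; i < warmup; ++i) fn();  // warm caches and lazy initialization

    const auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < iters; ++i) fn();
    const auto stop = std::chrono::steady_clock::now();

    const std::chrono::duration<double, std::milli> elapsed = stop - start;
    return elapsed.count() / iters;
}

// Stand-in workload (an assumption): copy a 224x224x3 byte image between
// two host buffers. In a real comparison the lambda would wrap the line
// under test instead of std::memcpy.
inline double time_host_copy_ms() {
    std::vector<std::uint8_t> src(224 * 224 * 3, 7);
    std::vector<std::uint8_t> dst(src.size());
    return mean_ms_per_call(
        [&] { std::memcpy(dst.data(), src.data(), src.size()); },
        /*warmup=*/10, /*iters=*/100);
}
```

Averaging over many calls after a warm-up keeps one-off costs such as allocation or context creation out of the per-image figure, which is what a number like 30 ms/img is meant to describe.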
FYI, I have just added a file with a speed benchmark:
Have you tested the speed? I get a lower speed (30 ms/img) with resnet18 at 224x224 and batch size 1.