Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QST]int4b_t dataptr question #1955

Closed
liuyt929 opened this issue Nov 20, 2024 · 1 comment
Closed

[QST]int4b_t dataptr question #1955

liuyt929 opened this issue Nov 20, 2024 · 1 comment

Comments

@liuyt929
Copy link

What is your question?
hi~
excuse me,I have some problems when using s4t_s4n_s4t_sm80 gemm, can I create a cutlass::int4_b* data_ptr ,and malloc elements_number/2 byte for it. And create a tensorref for it, then directly use it as the output_ref for the gemm op.
like this
cutlass::int4b_t* output_int4; cudaMalloc((void**)&output_int4, M*N/2); cutlass::TensorRef<cutlass::int4b_t,cutlass::layout::ColumnMajor> out_ref( output_int4,cutlass::layout::ColumnMajor::packed(cutlass::MatrixCoord(M,N)));

Can I get the right answer if I do so?
I have tried a lot, but the result is always wrong, I wonder how I can receive the result correctly.
Thanks!

@thakkarV
Copy link
Collaborator

We can't help you debug without proper repro steps

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants