[QST]int4b_t dataptr question #1955

liuyt929 · 2024-11-20T11:13:11Z

What is your question?
Hi,
Excuse me, I have a problem when using the s4t_s4n_s4t_sm80 GEMM. Can I create a cutlass::int4b_t* data pointer, allocate elements_number/2 bytes for it, create a TensorRef for it, and then use it directly as the output ref for the GEMM op?
Like this:
cutlass::int4b_t* output_int4;
cudaMalloc((void**)&output_int4, M*N/2);
cutlass::TensorRef<cutlass::int4b_t, cutlass::layout::ColumnMajor> out_ref(
    output_int4, cutlass::layout::ColumnMajor::packed(cutlass::MatrixCoord(M, N)));

Will I get the correct result if I do this?
I have tried many variations, but the result is always wrong. How can I read the result back correctly?
Thanks!
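
For readers following the thread, the packing arithmetic behind the question can be sketched on the host without CUTLASS. This is a minimal illustration, not CUTLASS's own code: the helper names (`packed_int4_bytes`, `column_major_index`, `store_int4`, `load_int4`) are hypothetical, and the nibble order (even-indexed element in the low 4 bits of each byte) is an assumption that should be checked against the CUTLASS version in use.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Bytes needed to hold `count` packed 4-bit elements (two per byte, rounded up).
inline std::size_t packed_int4_bytes(std::size_t count) {
    return (count + 1) / 2;
}

// Linear element index of (row, col) in a packed column-major matrix with
// M rows, i.e. leading dimension M.
inline std::size_t column_major_index(std::size_t row, std::size_t col, std::size_t M) {
    return row + col * M;
}

// Store a signed 4-bit value in [-8, 7] at element index `idx`.
// ASSUMPTION: even-indexed elements occupy the low nibble of each byte.
inline void store_int4(std::vector<std::uint8_t>& buf, std::size_t idx, int value) {
    assert(value >= -8 && value <= 7);
    assert(idx / 2 < buf.size());
    const unsigned shift = static_cast<unsigned>(idx % 2) * 4u;
    const unsigned nibble = static_cast<unsigned>(value) & 0x0Fu;
    const unsigned kept = buf[idx / 2] & ~(0x0Fu << shift);  // preserve the other nibble
    buf[idx / 2] = static_cast<std::uint8_t>(kept | (nibble << shift));
}

// Load the signed 4-bit value at element index `idx`, sign-extending to int.
inline int load_int4(const std::vector<std::uint8_t>& buf, std::size_t idx) {
    const unsigned shift = static_cast<unsigned>(idx % 2) * 4u;
    const int nibble = (buf[idx / 2] >> shift) & 0x0F;
    return nibble >= 8 ? nibble - 16 : nibble;
}
```

If the GEMM output is copied back from the device as raw bytes, it would need to be unpacked with the same index and nibble convention that the kernel used when writing it. Reading the buffer as one value per byte, or with row-major indexing, would produce results that look wrong even when the kernel itself is correct.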


thakkarV · 2024-11-20T12:50:43Z

We can't help you debug without proper repro steps.

liuyt929 added the "? - Needs Triage" and "question" labels on Nov 20, 2024

liuyt929 closed this as completed Nov 21, 2024
