
SYCL: Remove misleading ggml_sycl_op_flatten function #12387

Merged: 7 commits into ggml-org:master, Mar 31, 2025

Conversation

@qnixsynapse (Collaborator) commented Mar 14, 2025:

Original work of #11515. Tried to submit a smaller change this time.

@github-actions bot added the ggml (changes relating to the ggml tensor library for machine learning) and SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language) labels on Mar 14, 2025
// dd = data device
float * src0_ddf = (float *) src0->data;
float * src1_ddf = use_src1 ? (float *) src1->data : nullptr;
float * dst_ddf = (float *) dst->data;
@qnixsynapse (Collaborator, Author) commented on the lines above, Mar 14, 2025:

This function typecast the src0, src1, and dst tensors to float regardless of their actual types. I don't think we want that behaviour in the long run.
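For context, a minimal, self-contained sketch (hypothetical stub types, not the actual ggml-sycl code) of the pattern being removed: the shared wrapper casts every operand to float* before calling the op, so the ops never see the tensors' real types.

```cpp
enum type_stub { TYPE_F32, TYPE_F16 };           // stand-in for ggml's type enum

struct tensor_stub {                              // stand-in for ggml_tensor
    type_stub type;
    void *    data;
};

using flatten_op_t = void (*)(const tensor_stub *, const tensor_stub *, tensor_stub *,
                              const float *, const float *, float *);

static void op_flatten_sketch(const tensor_stub * src0, const tensor_stub * src1,
                              tensor_stub * dst, flatten_op_t op) {
    const bool use_src1 = src1 != nullptr;
    // dd = data device -- unconditional float casts, regardless of the tensor type
    float * src0_ddf = (float *) src0->data;
    float * src1_ddf = use_src1 ? (float *) src1->data : nullptr;
    float * dst_ddf  = (float *) dst->data;
    op(src0, src1, dst, src0_ddf, src1_ddf, dst_ddf);   // an F16 tensor would be misread here
}
```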

@acbits commented Mar 14, 2025:

Just curious: why don't we use auto? You could avoid the casting, and if the data type changes in the future you wouldn't have to rework it again.

Modern C++ code should use auto wherever possible.
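As a hedged illustration of this suggestion (stub type, not the real ggml_tensor): auto saves restating the element type on the left-hand side, though in C++ a cast away from the void* data pointer is still needed.

```cpp
struct tensor_stub { void * data; };   // hypothetical stand-in for ggml_tensor

static void copy_first_element(const tensor_stub * src0, tensor_stub * dst) {
    // auto deduces the pointer types from the casts; the casts themselves remain
    auto * src0_ddf = static_cast<const float *>(src0->data);
    auto * dst_ddf  = static_cast<float *>(dst->data);
    dst_ddf[0] = src0_ddf[0];
}
```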

@qnixsynapse (Collaborator, Author) replied:

Do we need to create a separate pointer variable when we are passing the ggml_tensor itself to the kernel OP function anyway? The tensor already contains everything needed to perform the OP.
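A hypothetical sketch (stub types, not the actual ggml-sycl signatures) of the idea in this question: pass the destination tensor itself to the op and let it pull sources, types, and shapes from the tensor, instead of receiving pre-cast float* arguments from a shared wrapper.

```cpp
#include <cassert>

enum type_stub { TYPE_F32, TYPE_F16 };

struct tensor_stub {
    type_stub     type;
    void *        data;
    tensor_stub * src[2];   // source operands, like ggml_tensor::src
    long          ne;       // number of elements (flattened here for brevity)
};

static void op_relu_sketch(tensor_stub * dst) {
    const tensor_stub * src0 = dst->src[0];
    assert(src0->type == TYPE_F32 && dst->type == TYPE_F32);  // the op decides what it supports
    const float * x = static_cast<const float *>(src0->data);
    float *       y = static_cast<float *>(dst->data);
    for (long i = 0; i < dst->ne; ++i) {
        y[i] = x[i] > 0.0f ? x[i] : 0.0f;
    }
}
```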


@qnixsynapse force-pushed the remove_op_flatten_fn branch from f42c7bb to 73f1849 on March 14, 2025 12:11
@qnixsynapse marked this pull request as draft on March 14, 2025 12:27
@qnixsynapse marked this pull request as ready for review on March 14, 2025 13:19

@arthw (Collaborator) commented Mar 15, 2025:

This PR includes many code changes, but no bug fix or feature work.
Because some CI UT cases are skipped manually to keep a 100% pass rate, I'm afraid the updated code can't be verified by the CI cases.

So it's high risk to update so much code without 100% UT coverage.

I suggest doing this refactoring as additional work alongside a bug fix or feature in the same source file or class, so that the code will be fully covered by tests.

@qnixsynapse (Collaborator, Author) replied:
> This PR includes many code changes, but no bug fix or feature work.
> Because some CI UT cases are skipped manually to keep a 100% pass rate, I'm afraid the updated code can't be verified by the CI cases.
>
> So it's high risk to update so much code without 100% UT coverage.
>
> I suggest doing this refactoring as additional work alongside a bug fix or feature in the same source file or class, so that the code will be fully covered by tests.

I usually test them locally (by running a model) before opening a PR here. If you want, we can update the CI to run the test cases on an available Nvidia CI/CD instance here.

@Rbiessy (Collaborator) left a comment:

That makes sense to me. We have to be careful when we merge this as it will break #12412 though.

@qnixsynapse (Collaborator, Author) replied:

@Rbiessy I don't think this will break the RWKV kernels because they don't depend on the ggml_sycl_op_flatten function. I will do a rebase and test.

Please note that this is a prerequisite for supporting F16 types in unary and eltwise operations, which I intend to do in future PRs.
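A hedged sketch of why removing the float-only wrapper matters for F16: once an op sees the real tensor types instead of pre-cast float*, an element-wise op can dispatch per element type. Stub types only; in the real backend the F16 branch would use a proper half type such as sycl::half.

```cpp
enum type_stub { TYPE_F32, TYPE_F16 };

struct tensor_stub {
    type_stub     type;
    void *        data;
    tensor_stub * src[2];
    long          ne;
};

template <typename T>
static void neg_kernel(const T * x, T * y, long n) {
    for (long i = 0; i < n; ++i) { y[i] = -x[i]; }   // illustrative element-wise op
}

static void op_neg_dispatch(tensor_stub * dst) {
    const tensor_stub * src0 = dst->src[0];
    switch (dst->type) {
        case TYPE_F32:
            neg_kernel(static_cast<const float *>(src0->data),
                       static_cast<float *>(dst->data), dst->ne);
            break;
        case TYPE_F16:
            // the same templated kernel would be instantiated for the half type here
            // (omitted to keep this sketch dependency-free)
            break;
    }
}
```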

@NeoZhangJianyu (Collaborator) commented:
> > This PR includes many code changes, but no bug fix or feature work.
> > Because some CI UT cases are skipped manually to keep a 100% pass rate, I'm afraid the updated code can't be verified by the CI cases.
> > So it's high risk to update so much code without 100% UT coverage.
> > I suggest doing this refactoring as additional work alongside a bug fix or feature in the same source file or class, so that the code will be fully covered by tests.
>
> I usually test them locally (by running a model) before opening a PR here. If you want, we can update the CI to run the test cases on an available Nvidia CI/CD instance here.

Testing with several models can't cover all the OPs; it's better to run the CI.

If your PC has an iGPU in an Intel Core CPU (11th gen or newer), you could run the CI on it too.

@Rbiessy (Collaborator) commented Mar 18, 2025:

> @Rbiessy I don't think this will break the RWKV kernels because they don't depend on the ggml_sycl_op_flatten function. I will do a rebase and test.
>
> Please note that this is a prerequisite for supporting F16 types in unary and eltwise operations, which I intend to do in future PRs.

I was referring to this line which adds a usage of ggml_sycl_op_flatten. It will be fine if you rebase it now.

@qnixsynapse (Collaborator, Author) replied:

> > @Rbiessy I don't think this will break the RWKV kernels because they don't depend on the ggml_sycl_op_flatten function. I will do a rebase and test.
> > Please note that this is a prerequisite for supporting F16 types in unary and eltwise operations, which I intend to do in future PRs.
>
> I was referring to this line which adds a usage of ggml_sycl_op_flatten. It will be fine if you rebase it now.

Ah okay. That can be fixed.

@qnixsynapse marked this pull request as draft on March 19, 2025 15:05
@qnixsynapse force-pushed the remove_op_flatten_fn branch 2 times, most recently from 1384273 to e99683f on March 22, 2025 06:43
@qnixsynapse force-pushed the remove_op_flatten_fn branch from e99683f to dc87ed7 on March 27, 2025 06:02
@qnixsynapse marked this pull request as ready for review on March 27, 2025 06:02
@qnixsynapse marked this pull request as draft on March 27, 2025 12:03
@qnixsynapse marked this pull request as ready for review on March 27, 2025 12:31
@qnixsynapse requested a review from Alcpz on March 27, 2025 15:04
@Alcpz (Collaborator) left a comment:

Overall this looks great to me, thanks for simplifying the code a bit!
Do you have an answer to the try/catch questions from @Rbiessy (#12387 (comment))?

I'm happy to give my approval, but if you intend to add a few more changes I'll wait until you've finished everything.

@qnixsynapse (Collaborator, Author) replied:

> Overall this looks great to me, thanks for simplifying the code a bit!

Thank you. Will add support for fp16 once this PR gets merged.

> Do you have an answer to the try/catch questions from @Rbiessy (#12387 (comment))?
>
> I'm happy to give my approval, but if you intend to add a few more changes I'll wait until you've finished everything.

If you look at my latest changes, I did exactly what @Rbiessy suggested, i.e. added the try/catch only to the compute_forward function and removed it from the elementwise op functions.
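A hedged sketch (simplified stub, not the actual upstream code) of the structure described here: exception handling lives once in the top-level compute_forward dispatcher instead of being repeated inside every element-wise op.

```cpp
#include <cstdio>
#include <exception>

struct tensor_stub { int op; };   // hypothetical stand-in for ggml_tensor

static void op_add_sketch(tensor_stub *)  { /* element-wise kernel; may throw */ }
static void op_relu_sketch(tensor_stub *) { /* element-wise kernel; may throw */ }

static bool compute_forward_sketch(tensor_stub * dst) try {
    switch (dst->op) {
        case 0: op_add_sketch(dst);  break;
        case 1: op_relu_sketch(dst); break;
        default: return false;
    }
    return true;
} catch (const std::exception & e) {
    // in the real backend this would catch sycl::exception raised by the SYCL queue
    std::fprintf(stderr, "compute_forward error: %s\n", e.what());
    return false;
}
```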

@Rbiessy (Collaborator) left a comment:

LGTM, great work!

@Alcpz (Collaborator) commented Mar 31, 2025:

> If you look at my latest changes, I did exactly what @Rbiessy suggested, i.e. added the try/catch only to the compute_forward function and removed it from the elementwise op functions.

I misunderstood the comment then. LGTM!

@Rbiessy merged commit 6c02a03 into ggml-org:master on Mar 31, 2025
48 checks passed
Labels: ggml (changes relating to the ggml tensor library for machine learning), SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language)
7 participants