
[AutoTVM] New rank-binary loss_type for the new xgboost >= 2.0.0 behaviour #14468

Merged: 1 commit merged into apache:main on Apr 11, 2023
Conversation

cbalint13 (Contributor) commented Apr 3, 2023

This PR fixes the cost model for the latest xgboost >= 2.0.0 behaviour, which requires binarized labels for the MAP metric.

This addresses only autotune (the autoscheduler turned out to be unaffected; see the update below).

Note:
The overall impact on the TVM tuner is unclear. We could introduce a more sophisticated way of measuring AP, such as the PASCAL evenly spaced one, but the advantages are unclear and would require extensive comparative tests.


The error caught during the TVM autotune process:

File "/usr/lib64/python3.11/site-packages/tvm/autotvm/tuner/xgboost_cost_model.py"
, line 538, in after_iteration
    bst_eval = model.eval_set(self.evals, epoch, feval)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/lib64/python3.11/site-packages/xgboost/core.py", line 1995, in eval_set
    _check_call(
File "/usr/lib64/python3.11/site-packages/xgboost/core.py", line 270, in _check_call
    raise XGBoostError(py_str(_LIB.XGBGetLastError()))
xgboost.core.XGBoostError: [15:59:59] /builddir/build/BUILD/xgboost/src/common/ranking_utils.h:378: 
Check failed: is_binary: MAP can only be used with binary labels.
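
For reference, a minimal sketch (random data, illustrative only, not TVM code) that should trigger the same check under xgboost >= 2.0.0:

    # Hedged repro sketch: rank:pairwise objective + MAP eval metric with
    # continuous labels, which xgboost >= 2.0.0 rejects at eval time.
    import numpy as np
    import xgboost as xgb

    x = np.random.rand(64, 10)
    y = np.random.rand(64)  # continuous scores, as the tuner assigns, not {0, 1}
    dtrain = xgb.DMatrix(x, label=y)

    params = {"objective": "rank:pairwise", "eval_metric": "map"}
    # Expected on xgboost >= 2.0.0:
    #   XGBoostError: ... Check failed: is_binary: MAP can only be used with binary labels.
    bst = xgb.train(params, dtrain, num_boost_round=2, evals=[(dtrain, "train")])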

Cc @Sunny-Island , @zxybazh , @junrushao , @vinx13 , please help with the review.

Thanks,
~Cristian.


Update:

  • The autoschedule (ansor) is not affected at all.
  • The autotune reg (reg:linear) loss_type is fine.
  • Only autotune with rank (rank:pairwise) loss_type is affected.

tvm-bot (Collaborator) commented Apr 3, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@cbalint13 cbalint13 changed the title Binarize labels for the new xgboost >= 1.7.5 behaviour [AutoScheduler][AutoTVM] Binarize labels for the new xgboost >= 1.7.5 behaviour Apr 3, 2023
tqchen (Member) commented Apr 4, 2023

Given that we are doing a cost model, I am not sure binarization is the best approach here. Can you dump out the labels and check the currently assigned behavior?

Likely we want to move away from the MAP metric and use another metric instead, either a regression metric or pair-wise ranking.

cbalint13 (Contributor, Author) commented Apr 4, 2023

@tqchen

Given that we are doing a cost model, I am not sure binarization is the best approach here.

Can you dump out the labels and check the currently assigned behavior?

  • Sure, attached is a small script + dmatrix dump: tvm-xgboost-dmatrix.zip with results.txt
  • This was captured from a real TVM autotuning process targeting an rk3399 OpenCL device.

Likely we want to move away from the MAP metric and use another metric instead, either a regression metric or pair-wise ranking.

  • Apparently this proposal works and tuning finds good kernels, but the real impact is hard to measure (on my side).

Another quick idea for now is to add a condition that binarizes only for xgboost >= 2.0.0, keeping the old behaviour otherwise.

cbalint13 (Contributor, Author) commented

@tqchen , in addition to my response in the previous message:

Likely we want to move away from the MAP metric and use another metric instead, either a regression metric or pair-wise ranking.

  • Apparently this proposal works and tuning finds good kernels, but the real impact is hard to measure (on my side).

Another quick idea for now is to add a condition that binarizes only for xgboost >= 1.7.5, keeping the old behaviour otherwise.

  • If binarization is too steep, a simple (but recursive) way would be the PASCAL trick to average/split into 11 levels, as sketched below:

[image: PASCAL 11-level averaged precision illustration]
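
A minimal sketch of that idea (illustrative only; the min-max normalization is an assumption, not PR code):

    import numpy as np

    def quantize_11_levels(y):
        # Normalize scores to [0, 1], then snap them to the 11 evenly
        # spaced levels 0.0, 0.1, ..., 1.0 used by PASCAL-style averaging.
        y = np.asarray(y, dtype=np.float64)
        y = (y - y.min()) / max(y.max() - y.min(), 1e-12)
        return np.round(y * 10.0) / 10.0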

tqchen (Member) commented Apr 4, 2023

I think in this case we should change the ranking loss to a regression loss and use logistic regression so the values can still be used; binarization causes too much information loss.

@cbalint13 cbalint13 changed the title [AutoScheduler][AutoTVM] Binarize labels for the new xgboost >= 1.7.5 behaviour [AutoTVM] Binarize labels for the new xgboost >= 2.0.0 behaviour Apr 4, 2023
cbalint13 (Contributor, Author) commented Apr 4, 2023

@tqchen ,

I think in this case we should change the ranking loss to a regression loss and use logistic regression so the values can still be used; binarization causes too much information loss.

Updates:

  1. The autoschedule (ansor) is not affected at all.
  2. The autotune reg (reg:linear) loss_type is not affected.
  3. Only autotune with the rank (rank:pairwise) loss_type is affected.
  4. Only xgboost >= 2.0.0-dev is affected (as it presents itself via the Python API).

I updated this PR code to do binarization only when both conditions hold (see the sketch after this list):

  • xgboost >= 2
  • loss_type uses rank:pairwise
I updated the code, the title, and the first comment here (striking out any erroneous info).
I could imagine leaving this in autotune as preparation for what xgboost >= 2.0.0 will be.

tqchen (Member) commented Apr 4, 2023

I see. I think we should report an error if binarization is needed, since the original intention was continuous prediction.

I know it might still work OK, but that was not the intention of the cost predictor.

It would be good to revisit the default choice; I think if ranking is not possible, reg:logistic would usually be another good choice.

@cbalint13 cbalint13 changed the title [AutoTVM] Binarize labels for the new xgboost >= 2.0.0 behaviour [AutoTVM] New rank-binary loss_type labels for the new xgboost >= 2.0.0 behaviour Apr 4, 2023
cbalint13 (Contributor, Author) commented

@tqchen ,

  • Introduced a new rank-binary loss_type alongside the current rank and reg.

I see. I think we should report an error if binarization is needed, since the original intention was continuous prediction.

  • If loss_type is rank and xgboost >= 2.0.0, TVM will report an error with an explicit suggestion to use rank-binary instead (see the sketch at the end of this comment).

I know it might still work OK, but that was not the intention of the cost predictor.

  • Everything remains as it was; it is the user's choice (informed by TVM) to switch over to rank-binary or downgrade xgboost.

It would be good to revisit the default choice; I think if ranking is not possible, reg:logistic would usually be another good choice.

  • I won't touch this; everything described above offers the user an informed alternative.

Let me know if this still needs changes or more polishing; I will stop here for now.
In the future I will keep track of xgboost's upcoming custom gain function; we may then restore the continuous rank behaviour.
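
A hedged sketch of the error-reporting guard described above (illustrative wording and names, not the exact PR code):

    import xgboost

    def check_loss_type(loss_type):
        # Fail fast with an actionable message instead of the opaque
        # XGBoostError raised later inside the eval step.
        major = int(xgboost.__version__.split(".")[0])
        if loss_type == "rank" and major >= 2:
            raise ValueError(
                "loss_type 'rank' uses the MAP eval metric, which xgboost >= 2.0.0 "
                "restricts to binary labels; use loss_type='rank-binary' instead, "
                "or downgrade xgboost."
            )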

@cbalint13 cbalint13 changed the title [AutoTVM] New rank-binary loss_type labels for the new xgboost >= 2.0.0 behaviour [AutoTVM] New rank-binary loss_type for the new xgboost >= 2.0.0 behaviour Apr 5, 2023
tqchen (Member) commented Apr 5, 2023

Thanks @cbalint13! We still need to make sure the default loss_type now updates to the regression loss, mainly because rank-binary does not fit into the customized loss models.

cbalint13 (Contributor, Author) commented

@tqchen ,

Thanks @cbalint13! We still need to make sure the default loss_type now updates to the regression loss, mainly because rank-binary does not fit into the customized loss models.

  • Changed to "reg" all class init occurrences.
  • Lowered test case rmse for the new "reg" default.

There are still tutorials / applications that use explicit "rank", would like to change all of them ?

tqchen (Member) commented Apr 5, 2023

@cbalint13 Yes, let us update and change all of them to reg. Cc @junrushao to double-check the cases in MetaSchedule.

It might be useful to use reg:logistic, if the output is scaled into [0, 1].
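
A minimal sketch of that scaling (assumption: min-max normalization of the scores, not PR code):

    import numpy as np

    def scale_for_logistic(scores):
        # reg:logistic expects labels in [0, 1]; min-max normalize them.
        scores = np.asarray(scores, dtype=np.float64)
        span = max(scores.max() - scores.min(), 1e-12)
        return (scores - scores.min()) / span

    params = {"objective": "reg:logistic"}  # labels must already be in [0, 1]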

cbalint13 (Contributor, Author) commented Apr 5, 2023

@tqchen ,

@cbalint13 Yes, let us update and change all of them to reg.

  • Updated all tutorials, apps, and tests from any explicit "rank" to "reg".

Cc @junrushao to double-check the cases in MetaSchedule.

  • A double check is welcome. As quick info, none of my MetaSchedule tests failed; xgboost would otherwise fail like:

xgboost.core.XGBoostError: [15:59:59] /builddir/build/BUILD/xgboost/src/common/ranking_utils.h:378:
Check failed: is_binary: MAP can only be used with binary labels.

  • Used xgboost 20230403, git hash 15e073ca.

It might be useful to use reg:logistic, if the output is scaled into [0, 1].

My thoughts on the newly introduced loss_type="rank-binary":

  • It is an alternative to the original rank; both have "objective": "rank:pairwise".
  • It binarizes the labels (within a cloned copy) only for the xgboost eval step, making it xgboost >= 2 compatible (sketched below).
  • The dynamics of TVM's further steps still work with the full continuous values.

I would leave it as described; if you would like, we can create another loss_type="reg-binary" with reg:logistic.
In the reg:logistic case the naming "rank-binary" loses its "rank"-ness, so it would be just another "reg" class type.
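
A hedged sketch of that eval-only binarization (illustrative names; the median threshold is an assumption, not the exact PR code):

    import numpy as np
    import xgboost as xgb

    def eval_matrix_binary(dmatrix):
        # Clone the eval-side DMatrix and binarize only its labels;
        # the training data keeps its full continuous values.
        deval = dmatrix.slice(np.arange(dmatrix.num_row()))  # cloned copy
        y = deval.get_label()
        deval.set_label((y > np.median(y)).astype(np.float32))
        return deval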

cbalint13 (Contributor, Author) commented

@tqchen , @junrushao

In continuation of the previous comment, I also attach some test results here.

A comparative test confirms that rank-binary (binarized only at the eval step) behaves identically to the original rank:

loss_type="rank-binary" (xgboost-2.0.0-dev 20230403 git hash 15e073ca)
[Task  1/54] (conv2d_nchw_spatial_pack.mali) {17.75 GFLOPS / #4912 records} SKIP
[Task  2/54] (conv2d_nchw_spatial_pack.mali) {40.74 GFLOPS / #1040 records} SKIP
[Task  3/54] (conv2d_nchw_spatial_pack.mali) {19.63 GFLOPS / #2032 records} SKIP

loss_type="rank" (xgboost-1.7.5 20230328 git hash 21d95f3d)
[Task  1/54] (conv2d_nchw_spatial_pack.mali) {11.71 GFLOPS / #1680 records} SKIP
[Task  2/54] (conv2d_nchw_spatial_pack.mali) {26.17 GFLOPS / #1024 records} SKIP
[Task  3/54] (conv2d_nchw_spatial_pack.mali) {13.15 GFLOPS / #1040 records} SKIP

Note:

  • In the rank-binary case there were more steps (it ran a bit longer; see the number of records), hence slightly better results.
  • The tuned network was the first three layers of yolov8s using float16, tuned for half a day on an rk3399 board (nanopc-t4).

zxybazh (Member) commented Apr 7, 2023

@tvm-bot rerun

@cbalint13 cbalint13 requested a review from FrozenGene April 10, 2023 14:37
@tqchen tqchen merged commit 515583c into apache:main Apr 11, 2023