Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-6723 - AdaBoost #15639

Merged
merged 34 commits into from
Sep 26, 2023
Merged

GH-6723 - AdaBoost #15639

merged 34 commits into from
Sep 26, 2023

Conversation

valenad1
Copy link
Collaborator

@valenad1 valenad1 commented Jul 17, 2023

#6723

Implementation of AdaBoost algorithm.

Algorithm input

adaboost_model = H2OAdaBoostEstimator(nlearners = 50, learn_rate = 0.5, weak_learner = "DRF", seed = 0xBEEF)
adaboost_model.train(training_frame=training_frame, y = response)

Whole algorithm output from training

Model Details
=============
H2OAdaBoostEstimator : AdaBoost
Model Key: AdaBoost_model_python_1695210886653_28170


Model Summary: 
    number_of_weak_learners    learn_rate    weak_learner    seed
--  -------------------------  ------------  --------------  ------
    50                         0.5           DRF             48879

Algorithm output from scoring

  predict        p0         p1
        0  0.932313  0.0676868
        1  0.372386  0.627614
        0  0.904304  0.0956963
        0  0.677733  0.322267
        0  0.970847  0.029153
        1  0.372386  0.627614
        1  0.182789  0.817211
        1  0.30957   0.69043
        0  0.918434  0.0815656
        0  0.931758  0.0682418

Computing time benchmark compared to H2O GBM on higgs dataset - training

image

Prediction performance on prostate dataset

Reported on validation dataset

image

@valenad1 valenad1 force-pushed the valenad-6723-adaboost branch 4 times, most recently from a2a2118 to d4c7d53 Compare August 30, 2023 16:13
@valenad1 valenad1 force-pushed the valenad-6723-adaboost branch 5 times, most recently from 6f024ec to df188df Compare September 13, 2023 11:55
@valenad1 valenad1 force-pushed the valenad-6723-adaboost branch 5 times, most recently from 18e679e to 719a356 Compare September 19, 2023 15:58
@maurever
Copy link
Contributor

Good Job @valenad1! I have just a few suggestions.

@wendycwong
Copy link
Contributor

Thank you Adam for the great implementation. I don't have any more suggestions. @maurever : Please approve when you are satisfied. Thanks, Wendy

maurever
maurever previously approved these changes Sep 25, 2023
Copy link
Contributor

@maurever maurever left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @valenad1, LGTM.

mn-mikke
mn-mikke previously approved these changes Sep 26, 2023
Copy link
Collaborator

@mn-mikke mn-mikke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, great change!

* GH-6723 - implement AdaBoost API for python and R

* Expose weights column

* Refactor learning_rate to learn_rate

* Add GBM to the weak_learners

* Refactor n_estimators to nlearners
@valenad1 valenad1 dismissed stale reviews from maurever and mn-mikke via 5b2b780 September 26, 2023 15:02
@valenad1 valenad1 merged commit 010554a into master Sep 26, 2023
2 checks passed
@valenad1 valenad1 deleted the valenad-6723-adaboost branch September 26, 2023 16:45
@valenad1 valenad1 changed the title GH-6723 - Adaboost GH-6723 - AdaBoost Sep 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants