[python-package] early stopping min_delta (fixes #2526) #4580

jmoralez · 2021-09-01T02:06:18Z

This aims to close #2526 by adding a min_delta argument to the early stopping callback, so that training stops when the score fails to improve by at least min_delta over stopping_rounds iterations. The default value is 0, so the current behavior is maintained.
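To make the rule concrete, here is a minimal pure-Python sketch of the stopping condition for a single metric. It is not the code in callback.py; the function name `best_iteration_with_min_delta` and its `higher_is_better` flag are hypothetical and used only for illustration.

```python
from typing import Optional, Sequence


def best_iteration_with_min_delta(
    scores: Sequence[float],
    stopping_rounds: int,
    min_delta: float = 0.0,
    higher_is_better: bool = False,
) -> Optional[int]:
    """Return the index of the best iteration if early stopping triggers, else None.

    Hypothetical helper: a score only counts as an improvement when it beats
    the best score so far by more than min_delta.
    """
    if not scores:
        return None
    best_score = scores[0]
    best_iter = 0
    for i in range(1, len(scores)):
        score = scores[i]
        if higher_is_better:
            improved = score > best_score + min_delta
        else:
            improved = score < best_score - min_delta
        if improved:
            best_score, best_iter = score, i
        elif i - best_iter >= stopping_rounds:
            # No sufficient improvement in the last stopping_rounds iterations.
            return best_iter
    return None


# Validation loss keeps shrinking, but by less than min_delta after iteration 1.
losses = [1.0, 0.9, 0.895, 0.893, 0.892, 0.891]
print(best_iteration_with_min_delta(losses, stopping_rounds=2, min_delta=0.01))
print(best_iteration_with_min_delta(losses, stopping_rounds=2, min_delta=0.0))
```

With min_delta=0.01 the sketch stops and reports iteration 1 as the best one; with the default of 0 every tiny gain resets the counter, so stopping never triggers on this sequence.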

With multiple metrics, the user can provide a single delta for all metrics or a specific delta for each metric. Handling all the cases (single metric, multiple metrics, multiple metrics considering only the first) can be a bit cumbersome, so I'm open to suggestions on how to handle them.

jameslamb

Thanks for working on this! I left a few preliminary comments.

jameslamb · 2021-09-01T04:07:53Z

@jmoralez by the way, since you just joined...we use a bot to create release notes automatically based on the set of pull requests merged between git tags.

That bot chooses which section of the release notes to put a change in based on the use of specific labels.

LightGBM/.github/release-drafter.yml

Lines 4 to 15 in 3942126

    - title: '💡 New Features'
      label: 'feature'
    - title: '🔨 Breaking'
      label: 'breaking'
    - title: '🚀 Efficiency Improvement'
      label: 'efficiency'
    - title: '🐛 Bug Fixes'
      label: 'fix'
    - title: '📖 Documentation'
      label: 'doc'
    - title: '🧰 Maintenance'
      label: 'maintenance'

So every PR needs to be assigned a label from that set. I'd call this one feature. If you're ever unsure, maintenance is a catch-all.

StrikerRUS · 2021-09-01T20:30:07Z

@jmoralez Wow, great idea to have per-metric thresholds, I really like it! Do you plan to replicate the same logic in the cpp code (PR for first_metric_only as a reference: #2172)?

jmoralez · 2021-09-01T21:21:51Z

I can give it a shot once this is done.

StrikerRUS · 2021-09-21T16:04:12Z

In general the proposed solution looks good to me. Thanks for the PR!

In dmlc/xgboost#7137 (comment) it is said that min_delta is a commonly used name for this type of parameter. Maybe it is better to use the name min_delta here as well, for consistency with other ML libraries?

Also, looking into APIs of early stopping callbacks from other frameworks (for example, pytorch-lightning and tensorflow), I see only one parameter min_delta and no tolerance parameter.

I believe all _log_warning() occurrences should be replaced with _log_info() because those messages look like debug-level info, not like something that is expected to be fixed at the user side.

Please enhance the new parameter description with some info about the semantics of the expected parameter types. It should help users avoid this error while writing their code, rather than only after they read this error message:

Must provide a single early stopping threshold or as many as metrics.

Something like the following:

    If float, this single value is used as the early stopping threshold for all metrics.
    If list, its length should match the total number of metrics, so that one early stopping threshold is used per metric.
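The float-or-list semantics described above could be validated along these lines. This is only a sketch under stated assumptions: `resolve_min_delta` and `n_metrics` are hypothetical names, the rejection of negative values is an assumption, and the length-mismatch message reuses the wording quoted earlier in this comment.

```python
from typing import List, Sequence, Union


def resolve_min_delta(
    min_delta: Union[float, Sequence[float]],
    n_metrics: int,
) -> List[float]:
    """Expand min_delta into one threshold per metric (hypothetical helper)."""
    if isinstance(min_delta, (int, float)):
        # A single value is shared by all metrics.
        deltas = [float(min_delta)] * n_metrics
    else:
        deltas = [float(d) for d in min_delta]
        if len(deltas) != n_metrics:
            raise ValueError(
                "Must provide a single early stopping threshold or as many as metrics."
            )
    # Assumption: a negative threshold is treated as a user error.
    if any(d < 0 for d in deltas):
        raise ValueError("Early stopping thresholds must be non-negative.")
    return deltas


print(resolve_min_delta(0.01, n_metrics=3))
print(resolve_min_delta([0.1, 0.0], n_metrics=2))
```

Normalizing to a list up front means the per-metric comparison code only has to handle one shape, whichever form the user passed.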

jmoralez · 2021-09-29T01:38:11Z

@StrikerRUS I've changed the name to min_delta and updated the description. Let me know what you think.

shiyu1994

The changes LGTM. Thank you! @jmoralez

StrikerRUS

Sorry for the delay!

Everything is great, thanks a lot!

I just think we should allow users to silence debugging messages. Moreover, we already have verbose argument for this callback.

StrikerRUS

Thank you very much for implementing this feature! LGTM!

Just noticed that two of my suggestions from the previous review were not so neat.

jmoralez · 2021-11-01T18:03:39Z

@StrikerRUS should I merge this?

StrikerRUS · 2021-11-01T19:08:13Z

@jmoralez

should I merge this?

TBH, I'm not sure. We decided to give some time (about 2 weeks or so) after the v3.3.1 release for reporting critical bugs in it and during that time not merge large PRs that contain new features.

I'd like to ask to not merge new large PRs with major features for one or two weeks after releasing v3.3.0. This will ease the process of creating v3.3.1 hotfix release if that will be needed in case of some critical bugs in v3.3.0 we might get reports about.
#4633 (comment)

I'd also like to propose that after the v3.3.1 release, we again wait 2 weeks to begin merging other large PRs, in case a v3.3.2 release is required.
#4715 (comment)

This PR touches existing code and may possibly change current logic. I'd better wait a little bit before merging it.
@jameslamb @shiyu1994 WDYT?

jameslamb · 2021-11-01T20:36:59Z

I'd better wait a little bit before merging it

I agree. v3.3.1 was only merged 6 days ago (#4715 (comment)), so I think we should wait another week for this PR.

shiyu1994 · 2021-11-03T05:33:12Z

I'd better wait a little bit before merging it.

I agree.

jameslamb · 2021-11-11T03:54:35Z

Great work on this, @jmoralez !!!

github-actions · 2023-08-23T14:40:51Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

jmoralez and others added 3 commits August 27, 2021 20:50

initial changes

8e31870

initial version

dc0e621

better handling of cases

6ff7ad3

jmoralez requested review from chivee, henry0312, jameslamb, shiyu1994 and StrikerRUS as code owners September 1, 2021 02:06

warn only with positive threshold

4b3aa35

jameslamb reviewed Sep 1, 2021

jameslamb added the feature label Sep 1, 2021

jmoralez and others added 7 commits September 4, 2021 21:42

remove early_stopping_threshold from high-level functions

830961b

Merge branch 'master' into feat/early-stopping-threshold

f97af78

remove remaining early_stopping_threshold

71d379f

update test to use callback

8db30b6

better handling of cases

9ec9891

Merge branch 'master' into feat/early-stopping-threshold

cf361d5

Merge branch 'master' into feat/early-stopping-threshold

98e4f83

jmoralez added 2 commits September 25, 2021 10:46

rename threshold to min_delta

b6d3432

enhance parameter description update tests

merge master

f40eb5a

shiyu1994 approved these changes Oct 4, 2021

StrikerRUS requested changes Oct 14, 2021

jmoralez and others added 2 commits October 14, 2021 10:47

Apply suggestions from code review

8b6e013

Co-authored-by: Nikita Titov <[email protected]>

reduce num_boost_round in tests

b73f3f2

StrikerRUS approved these changes Oct 14, 2021

Apply suggestions from code review

7fa8f5f

Co-authored-by: Nikita Titov <[email protected]>

jmoralez changed the title ~~[python-package] early stopping threshold~~ [python-package] early stopping min_delta (fixes #2526) Oct 15, 2021

trigger ci

972dba5

This was referenced Nov 3, 2021

[python][sklearn] respect objective aliases #4758

Merged

cmake: use object library to avoid duplicate compilation. #4489

Merged

StrikerRUS merged commit 99e0a4b into microsoft:master Nov 10, 2021

StrikerRUS mentioned this pull request Nov 10, 2021

Feature Request: customizable early_stopping_tolerance #2526

Closed

jmoralez deleted the feat/early-stopping-threshold branch November 10, 2021 15:16

StrikerRUS mentioned this pull request Jan 6, 2022

[DO NOT MERGE] Release 3.3.2 #4930

Closed

13 tasks

StrikerRUS mentioned this pull request Feb 15, 2022

Early stopping: overfit prevention #4996

Open

ummel mentioned this pull request Apr 13, 2022

Implement advanced early stopping in train() ummel/fusionModel#24

Open

jameslamb mentioned this pull request Oct 7, 2022

[DO NOT MERGE] Release v3.3.3 #5525

Closed

40 tasks

jameslamb mentioned this pull request Jun 1, 2023

[python-package] Early stopping not reproducible when nthreads>1 #5758

Closed

jameslamb mentioned this pull request Jun 27, 2023

[docs] add versionadded notes for v4.0.0 features #5948

Merged

github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023
