title

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

The sample complexity of ERMs in stochastic convex optimization

Stochastic convex optimization is one of the most well-studied models for learning in modern machine learning. Nevertheless, a central fundamental question in this setup remained unresolved: how many data points must be observed so that any empirical risk minimizer (ERM) shows good performance on the true population? This question was proposed by Feldman who proved that $\Omega(\frac{d}{\epsilon} + \frac{1}{\epsilon^2} )$ data points are necessary (where $d$ is the dimension and $\epsilon > 0$ the accuracy parameter). Proving an $\omega(\frac{d}{\epsilon} + \frac{1}{\epsilon^2})$ lower bound was left as an open problem. In this work we show that in fact $\tilde{O}(\frac{d}{\epsilon} + \frac{1}{\epsilon^2})$ data points are also sufficient. This settles the question and yields a new separation between ERMs and uniform convergence. This sample complexity holds for the classical setup of learning bounded convex Lipschitz functions over the Euclidean unit ball. We further generalize the result and show that a similar upper bound holds for all symmetric convex bodies. The general bound is composed of two terms: (i) a term of the form $\tilde{O}(\frac{d}{\epsilon})$ with an inverse-linear dependence on the accuracy parameter, and (ii) a term that depends on the statistical complexity of the class of linear functions (captured by the Rademacher complexity). The proof builds a mechanism for controlling the behavior of stochastic convex optimization problems.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

carmon24a

0

The sample complexity of {ERM}s in stochastic convex optimization

3799

3807

3799-3807

3799

false

Carmon, Daniel and Yehudayoff, Amir and Livni, Roi

given	family
Daniel	Carmon

given	family
Amir	Yehudayoff

given	family
Roi	Livni

2024-04-18

Proceedings of The 27th International Conference on Artificial Intelligence and Statistics

238

inproceedings

date-parts

2024

4

18

https://proceedings.mlr.press/v238/carmon24a/carmon24a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2024-04-18-carmon24a.md

2024-04-18-carmon24a.md

Files

2024-04-18-carmon24a.md

Latest commit

History

2024-04-18-carmon24a.md

File metadata and controls