title

section

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning

Original Papers

A pervasive phenomenon in machine learning applications is \emph{distribution shift}, where training and deployment conditions for a machine learning model differ. As distribution shift typically results in a degradation in performance, much attention has been devoted to algorithmic interventions that mitigate these detrimental effects. This paper studies the effect of distribution shift in the presence of model misspecification, specifically focusing on $L_{\infty}$-misspecified regression and \emph{adversarial covariate shift}, where the regression target remains fixed while the covariate distribution changes arbitrarily. We show that empirical risk minimization, or standard least squares regression, can result in undesirable \emph{misspecification amplification} where the error due to misspecification is amplified by the density ratio between the training and testing distributions. As our main result, we develop a new algorithm—inspired by robust optimization techniques—that avoids this undesirable behavior, resulting in no misspecification amplification while still obtaining optimal statistical rates. As applications, we use this regression procedure to obtain new guarantees in offline and online reinforcement learning with misspecification and establish new separations between previously studied structural conditions and notions of coverage.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

amortila24a

0

Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning

130

160

130-160

130

false

Amortila, Philip and Cao, Tongyi and Krishnamurthy, Akshay

given	family
Philip	Amortila

given	family
Tongyi	Cao

given	family
Akshay	Krishnamurthy

2024-06-30

Proceedings of Thirty Seventh Conference on Learning Theory

247

inproceedings

date-parts

2024

6

30

https://proceedings.mlr.press/v247/amortila24a/amortila24a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2024-06-30-amortila24a.md

2024-06-30-amortila24a.md

Files

2024-06-30-amortila24a.md

Latest commit

History

2024-06-30-amortila24a.md

File metadata and controls