title

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_author

author

date

address

container-title

volume

genre

issued

pdf

extras

On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

This paper studies the exponential stability of random matrix products driven by a general (possibly unbounded) state space Markov chain. It is a cornerstone in the analysis of stochastic algorithms in machine learning (e.g. for parameter tracking in online-learning or reinforcement learning). The existing results impose strong conditions such as uniform boundedness of the matrix-valued functions and uniform ergodicity of the Markov chains. Our main contribution is an exponential stability result for the p-th moment of random matrix product, provided that (i) the underlying Markov chain satisfies a super-Lyapunov drift condition, (ii) the growth of the matrix-valued functions is controlled by an appropriately defined function (related to the drift condition). Using this result, we give finite-time p-th moment bounds for constant and decreasing stepsize linear stochastic approximation schemes with Markovian noise on general state space. We illustrate these findings for linear value-function estimation in reinforcement learning. We provide finite-time p-th moment bound for various members of temporal difference (TD) family of algorithms.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

durmus21a

0

On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

1711

1752

1711-1752

1711

false

Durmus, Alain and Moulines, Eric and Naumov, Alexey and Samsonov, Sergey and Wai, Hoi-To

given	family
Alain	Durmus

given	family
Eric	Moulines

given	family
Alexey	Naumov

given	family
Sergey	Samsonov

given	family
Hoi-To	Wai

2021-07-21

Proceedings of Thirty Fourth Conference on Learning Theory

134

inproceedings

date-parts

2021

7

21

http://proceedings.mlr.press/v134/durmus21a/durmus21a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2021-07-21-durmus21a.md

2021-07-21-durmus21a.md

Files

2021-07-21-durmus21a.md

Latest commit

History

2021-07-21-durmus21a.md

File metadata and controls