TY - JOUR
T1 - Online learning for changing environments using coin betting
AU - Jun, Kwang Sung
AU - Orabona, Francesco
AU - Wright, Stephen
AU - Willett, Rebecca
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-25
PY - 2017/1/1
Y1 - 2017/1/1
N2 - A key challenge in online learning is that classical algorithms can be slow to adapt to changing environments. Recent studies have proposed “meta” algorithms that convert any online learning algorithm to one that is adaptive to changing environments, where the adaptivity is analyzed in a quantity called the strongly-adaptive regret. This paper describes a new meta algorithm that has a strongly-adaptive regret bound that is a factor of √log(T) better than other algorithms with the same time complexity, where TT is the time horizon. We also extend our algorithm to achieve a first-order (i. e., dependent on the observed losses) strongly-adaptive regret bound for the first time, to our knowledge. At its heart is a new parameter-free algorithm for the learning with expert advice (LEA) problem in which experts sometimes do not output advice for consecutive time steps (i. e., sleeping experts). This algorithm is derived by a reduction from optimal algorithms for the so-called coin betting problem. Empirical results show that our algorithm outperforms state-of-the-art methods in both learning with expert advice and metric learning scenarios.
AB - A key challenge in online learning is that classical algorithms can be slow to adapt to changing environments. Recent studies have proposed “meta” algorithms that convert any online learning algorithm to one that is adaptive to changing environments, where the adaptivity is analyzed in a quantity called the strongly-adaptive regret. This paper describes a new meta algorithm that has a strongly-adaptive regret bound that is a factor of √log(T) better than other algorithms with the same time complexity, where TT is the time horizon. We also extend our algorithm to achieve a first-order (i. e., dependent on the observed losses) strongly-adaptive regret bound for the first time, to our knowledge. At its heart is a new parameter-free algorithm for the learning with expert advice (LEA) problem in which experts sometimes do not output advice for consecutive time steps (i. e., sleeping experts). This algorithm is derived by a reduction from optimal algorithms for the so-called coin betting problem. Empirical results show that our algorithm outperforms state-of-the-art methods in both learning with expert advice and metric learning scenarios.
UR - https://projecteuclid.org/journals/electronic-journal-of-statistics/volume-11/issue-2/Online-learning-for-changing-environments-using-coin-betting/10.1214/17-EJS1379SI.full
UR - http://www.scopus.com/inward/record.url?scp=85038366694&partnerID=8YFLogxK
U2 - 10.1214/17-EJS1379SI
DO - 10.1214/17-EJS1379SI
M3 - Article
SN - 1935-7524
VL - 11
SP - 5282
EP - 5310
JO - Electronic Journal of Statistics
JF - Electronic Journal of Statistics
IS - 2
ER -