A STOCHASTIC DERIVATIVE FREE OPTIMIZATION METHOD WITH MOMENTUM

Eduard Gorbunov, Adel Bibi, Ozan Sener, Bergou El Houcine, Peter Richtárik

Research output: Contribution to conferencePaperpeer-review

5 Scopus citations

Abstract

We consider the problem of unconstrained minimization of a smooth objective function in Rd in setting where only function evaluations are possible. We propose and analyze stochastic zeroth-order method with heavy ball momentum. In particular, we propose, SMTP, a momentum version of the stochastic three-point method (STP) Bergou et al. (2019). We show new complexity results for non-convex, convex and strongly convex functions. We test our method on a collection of learning to continuous control tasks on several MuJoCo Todorov et al. (2012) environments with varying difficulty and compare against STP, other state-of-the-art derivative-free optimization algorithms and against policy gradient methods. SMTP significantly outperforms STP and all other methods that we considered in our numerical experiments. Our second contribution is SMTP with importance sampling which we call SMTP_IS. We provide convergence analysis of this method for non-convex, convex and strongly convex objectives.

Original languageEnglish (US)
StatePublished - 2020
Event8th International Conference on Learning Representations, ICLR 2020 - Addis Ababa, Ethiopia
Duration: Apr 30 2020 → …

Conference

Conference8th International Conference on Learning Representations, ICLR 2020
Country/TerritoryEthiopia
CityAddis Ababa
Period04/30/20 → …

Bibliographical note

Funding Information:
∗The research of Eduard Gorbunov was supported by RFBR, project number 18-31-20005 mol_a_ved

Publisher Copyright:
© 2020 8th International Conference on Learning Representations, ICLR 2020. All rights reserved.

ASJC Scopus subject areas

  • Education
  • Linguistics and Language
  • Language and Linguistics
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'A STOCHASTIC DERIVATIVE FREE OPTIMIZATION METHOD WITH MOMENTUM'. Together they form a unique fingerprint.

Cite this