Reinforcement learning in sparse-reward environments with hindsight policy gradients

Paulo Rauber, Avinash Ummadisingu, Filipe Mutz, Jürgen Schmidhuber

Research output: Contribution to journal › Article › peer-review

6 Scopus citations
