Alchemy: A structured task distribution for meta-reinforcement learning
Jane Wang,
Michael King,
Zeb Kurth-Nelson,
Nicolas Porcel,
Francis Song,
Peter Choy,
David Reichert,
Charlie Deck,
Malcolm Reynolds,
Demis Hassabis,
Mary Cassin,
Neil Rabinowitz,
Gavin Buttimore,
Loic Matthey,
Alexander Lerchner,
Matt Botvinick
arXiv
2021-02-08
Deep reinforcement learning