Deep Q-learning from Demonstrations
Todd Hester,
Mel Vecerik,
Olivier Pietquin *,
Tom Schaul,
Bilal Piot,
Dan Horgan,
John Quan,
A Sendonaris *,
Gabriel Dulac-Arnold *,
Ian Osband,
John Agapiou,
Joel Leibo,
Audrunas Gruslys
AAAI
2017-04-12
Deep reinforcement learning