Paper
Publication
Quinoa: A Q-function You Infer Normalized Over Actions
Paper
Publication
Task-Relevant Adversarial Imitation Learning
Paper
Publication
Deep Reinforcement Learning and the Deadly Triad