Paper
Publication
From Dirichlet to Rubin: Optimistic exploration in RL without bonuses
Paper
Publication
Active offline policy selection
Paper
Publication
An adaptive and efficient multi-goal exploration
Paper
Publication
Marginalized operators for off-policy reinforcement learning
Paper
Publication
Your Policy Regularizer is Secretly an Adversary
Paper
Publication
Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO
Paper
Publication
Online Apprenticeship Learning
Paper
Publication
Chaining Value Functions for Off-Policy Learning
Paper
Publication
Assessing Human Interaction in Virtual Reality with Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Paper
Publication
Model-Value Self-Inconsistency as a Signal for Epistemic Uncertainty
Paper
Publication
On the Expressivity of Markov Reward
Paper
Publication
The Difficulty of Passive Learning in Deep Reinforcement Learning
Paper
Publication
Importance of Representation Learning for Off-Policy Fitted Q-Evaluation
Paper
Publication
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Paper
Publication
When should agents explore?
Paper
Publication
Temporally Abstract Partial Models
Paper
Publication
Evaluating Strategic Structures in Multi-Agent Inverse Reinforcement Learning
Paper
Publication
Improved Chinese Sentence Segmentation with Reinforcement Learning
Paper
Publication
Revisiting Peng's Q(λ) for modern reinforcement learning
Paper
Publication
Taylor expansions of discount factors
Paper
Publication
Kernel-based reinforcement Learning: A finite-time analysis
Paper
Publication
PsiPhi: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Paper
Publication
Unifying gradient estimators for meta-reinforcement learning via off-policy evaluation
Paper
Publication
Learning in two-player zero-sum partially observable Markov games with perfect recall
Paper
Publication
Representation Matters: Improving Observations and Exploration for Robotics