Paper
Publication
Optimistic posterior sampling for reinforcement learning with few samples and tight guarantees
Paper
Publication
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Paper
Publication
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality
Paper
Publication
On Reward Binarisation and Bayesian Agents
Paper
Publication
Optimizing Industrial Cooling Systems with Hierarchical Reinforcement Learning
Paper
Publication
From Motor Control to Team Play in Simulated Humanoid Football
Paper
Publication
Reinforcement Learning with Information Theoretic Actuation
Paper
Publication
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Paper
Publication
Generalised Policy Improvement with Geometric Policy Composition
Paper
Publication
From Dirichlet to Rubin: Optimistic exploration in RL without bonuses
Paper
Publication
Expressing Non-Markov Reward to a Markov Agent
Paper
Publication
A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism
Paper
Publication
Uniqueness and Complexity of Inverse MDP Models
Paper
Publication
Active offline policy selection
Paper
Publication
An adaptive and efficient multi-goal exploration
Paper
Publication
Marginalized operators for off-policy reinforcement learning
Paper
Publication
Your Policy Regularizer is Secretly an Adversary
Paper
Publication
Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO
Paper
Publication
Online Apprenticeship Learning
Paper
Publication
Chaining Value Functions for Off-Policy Learning
Paper
Publication
Reward-Punishment Symmetric Universal Intelligence
Paper
Publication
Assessing Human Interaction in Virtual Reality with Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Paper
Publication
Model-Value Self-Inconsistency as a Signal for Epistemic Uncertainty
Paper
Publication
On the Expressivity of Markov Reward
Paper
Publication
The Difficulty of Passive Learning in Deep Reinforcement Learning