Paper
Publication
Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Generalised Policy Improvement with Geometric Policy Composition
From Dirichlet to Rubin: Optimistic exploration in RL without bonuses
Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games
Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning
An adaptive and efficient multi-goal exploration
Marginalized operators for off-policy reinforcement learning
Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO
NeuPL: Neural Population Learning
Learning more skills through optimistic exploration
Hidden Agenda
A Constrained Multi-Objective Reinforcement Learning Framework
The Difficulty of Passive Learning in Deep Reinforcement Learning
Statistical discrimination in learning agents
Collaborating with Humans without Human Data
Towards mental time travel: a hierarchical memory for reinforcement learning agents
On the role of population heterogeneity in emergent communication
Bootstrapped Meta-Learning
When should agents explore?
Emergent Communication at Scale
Open-Ended Learning Leads to Generally Capable Agents
Revisiting Peng's Q(λ) for modern reinforcement learning
Taylor expansions of discount factors
Unifying gradient estimators for meta-reinforcement learning via off-policy evaluation
Vector Quantized Models for Planning