Paper
Publication
Unsupervised Object-based Transition Models For Embodied Agents in 3D Partially Observable Environments
An Empirical Investigation of Learning from Biased Toxicity Labels
Neural rate control for video encoding using imitation learning
PonderNet
Revisiting Peng's Q(λ) for modern reinforcement learning
Taylor expansions of discount factors
Kernel-based reinforcement learning: A finite-time analysis
Counterfactual Credit Assignment
Online A-optimal design and active linear regression
PsiPhi: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Fast active learning for pure exploration in reinforcement learning
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Reasoning-Modulated Representations
CoBERL: Contrastive BERT for Reinforcement Learning
Imitation by Predicting Observations
Emphatic Algorithms for Deep Reinforcement Learning
A Closer Look at the Adversarial Robustness of Information Bottleneck Models
Vector Quantized Models for Planning
A Distribution-Dependent Analysis of Meta-Learning
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation
Robust Learning-Augmented Caching: An Experimental Study
Leveraging Non-uniformity in First-order Non-convex Optimization
Spectral normalisation for Deep Reinforcement Learning: an Optimisation Perspective