Paper
Publication
Online Apprenticeship Learning
Paper
Publication
Assessing Human Interaction in Virtual Reality with Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Paper
Publication
Asymptotically Best Casual Effect Identification with Multi-Armed Bandits
Paper
Publication
Bootstrapped Meta-Learning
Paper
Publication
Online A-optimal design and active linear regression
Paper
Publication
Fast active learning for pure exploration in reinforcement learning
Paper
Publication
Learning in two-player zero-sum partially observable Markov games with perfect recall
Paper
Publication
UCB Momentum Q-learning: Correcting the bias without forgetting
Paper
Publication
A kernel-based approach to non-stationary reinforcement learning in metric spaces
Paper
Publication
Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited
Paper
Publication
Adaptive reward-free exploration
Paper
Publication
Hindsight and Sequential Rationality of Correlated Play - arXiv
Paper
Publication
Fixed-Confidence Guarantees for Bayesian Best-Arm Identification
Paper
Publication
Adapting to Delays and Data in Adversarial Multi-Armed Bandits
Paper
Publication
Mirror Descent and the Information Ratio
Paper
Publication
Low-rank Tensor Bandits
Paper
Publication
A Simpler Approach to Accelerated Stochastic Optimization: Iterative Averaging Meets Optimism
Paper
Publication
Expected Eligibility Traces
Paper
Publication
Stochastic bandits with arm-dependent delays
Paper
Publication
A single algorithm for both restless and rested rotting bandits
Paper
Publication
Statistical efficiency of Thompson sampling for combinatorial semi-bandits
Paper
Publication
Gaussian Gated Linear Networks
Paper
Publication
Rapid Task-Solving in Novel Environments
Paper
Publication
Non-Stationary Bandits with Intermediate Observations
Paper
Publication
Covariance-adapting algorithm for semi-bandits with application to sparse outcomes