Paper
Publication
Online Apprenticeship Learning
Paper
Publication
Feature and Parameter Selection in Stochastic Linear Bandits
Paper
Publication
Assessing Human Interaction in Virtual Reality with Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study
Paper
Publication
Asymptotically Best Casual Effect Identification with Multi-Armed Bandits
Paper
Publication
Bootstrapped Meta-Learning
Paper
Publication
Online A-optimal design and active linear regression
Paper
Publication
Fast active learning for pure exploration in reinforcement learning
Paper
Publication
Learning in two-player zero-sum partially observable Markov games with perfect recall
Paper
Publication
UCB Momentum Q-learning: Correcting the bias without forgetting
Paper
Publication
A kernel-based approach to non-stationary reinforcement learning in metric spaces
Paper
Publication
Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited
Paper
Publication
Adaptive reward-free exploration
Paper
Publication
Hindsight and Sequential Rationality of Correlated Play - arXiv
Paper
Publication
Fixed-Confidence Guarantees for Bayesian Best-Arm Identification
Paper
Publication
Adapting to Delays and Data in Adversarial Multi-Armed Bandits
Paper
Publication
Mirror Descent and the Information Ratio
Paper
Publication
Low-rank Tensor Bandits
Paper
Publication
A Simpler Approach to Accelerated Stochastic Optimization: Iterative Averaging Meets Optimism
Paper
Publication
Expected Eligibility Traces
Paper
Publication
Stochastic bandits with arm-dependent delays
Paper
Publication
A single algorithm for both restless and rested rotting bandits
Paper
Publication
Statistical efficiency of Thompson sampling for combinatorial semi-bandits
Paper
Publication
Gaussian Gated Linear Networks
Paper
Publication
Rapid Task-Solving in Novel Environments
Paper
Publication
Non-Stationary Bandits with Intermediate Observations