Tags
Venues
No items found
Paper
Publication
Search-Improved Game-Theoretic Multiagent Reinforcement Learning in General and Negotiation Games
Paper
Publication
Is forgetting less a good inductive bias for forward transfer?
Paper
Publication
Meta-Learning Black-Box Optimization via Black-Box Optimization
Paper
Publication
Equilibrium-Invariant Embedding, Metric Space, and Fundamental Set of 2×2 Normal-Form Games
Paper
Publication
Fitting Autoregressive Graph Generative Models through Maximum Likelihood Estimation
Paper
Publication
Rethinking Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization
Paper
Publication
Three ways to improve feature alignment for open vocabulary detection
Paper
Publication
Fast exploration and learning of latent graphs with aliased observations
Paper
Publication
Evaluating Number Discrimination in Deep Neural Networks for Vision
Paper
Publication
Denoising diffusion samplers
Paper
Publication
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains
Paper
Publication
Graph schemas as abstractions for transfer learning, inference, and planning
Paper
Publication
Universal Agent Mixtures and the Geometry of Intelligence
Paper
Publication
Scaling Goal-based Exploration via Pruning Proto-goals
Paper
Publication
Equivariant MuZero
Paper
Publication
3D Neural Embedding Likelihood for Robust Sim-to-Real Transfer in Inverse Graphics
Paper
Publication
Exploration via Epistemic Value Estimation
Paper
Publication
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
Paper
Publication
Reinforcement Learning for Minimizing Age of Information over Wireless Links
Paper
Publication
PGMax: Factor Graphs for Discrete Probabilistic Graphical Models and Loopy Belief Propagation in JAX
Paper
Publication
Learning Noisy OR Bayesian Networks with Max-Product Belief Propagation
Paper
Publication
Dual Algorithmic Reasoning
Paper
Publication
Distilling Internet-Scale Vision-Language Models into Embodied Agents
Paper
Publication
Pragmatic Fairness: Developing Policies with Outcome Disparity Control
Paper
Publication
On a continuous time model of gradient descent dynamics and instability in deep learning