Introduction
Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts
Kernel-Based Reinforcement Learning
On Average Versus Discounted Reward Temporal-Difference Learning
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
Near-Optimal Reinforcement Learning in Polynomial Time
Technical Update: Least-Squares Temporal Difference Learning
Continuous-Action Q-Learning
Risk-Sensitive Reinforcement Learning
Variable Resolution Discretization in Optimal Control
Structure in the Space of Value Functions
The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information
A Simple Method for Generating Additive Clustering Models with Limited Complexity
Feature Generation Using General Constructor Functions