ISSN 0885-6125

Machine Learning (ML) - January 1996, issue 1-3: paper list

Papers in this issue
Editorial

Introduction

Efficient Reinforcement Learning through Symbiotic Evolution

Linear Least-Squares Algorithms for Temporal Difference Learning

Feature-Based Methods for Large Scale Dynamic Programming

On the Worst-Case Analysis of Temporal-Difference Learning Algorithms

Reinforcement Learning with Replacing Eligibility Traces

Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results

The Loss from Imperfect Value Functions in Expectation-Based and Minimax-Based Tasks

The Effect of Representation and Knowledge on Goal-Directed Exploration with Reinforcement-Learning Algorithms

Creating Advice-Taking Reinforcement Learners

Incremental Multi-Step Q-Learning
