to ISI Home Page
isd home
About ISD
education at isd
employment
environment
news
people
research
AI Seminars
div3admin

environment
Thomas Dietterich
donotspam.tgd@cs.orst.edu


"Sharing and Abstraction in Hierarchical Reinforcement Learning"

3/24/2000: [time not recorded]
[location not recorded]

Abstract: Reinforcement learning (RL) algorithms learn to solve sequential decision-making tasks by interacting with a Markovian environment. Existing methods treat the Markovian environment as a flat search space. While these methods have successfully solved many interesting problems, there are limits to the size of problem that can be successfully handled as a flat state space. In this talk, we will describe one approach to addressing this problem: Hierarchical reinforcement learning using the MAXQ value function decomposition. In this approach, the programmer defines a hierarchy of tasks and subtasks, and the value function (the key data structure in reinforcement learning) is decomposed hierarchically into value functions for each task and subtask. We will describe how the MAXQ decomposition permits the subtasks to ignore aspects of the state space, and hence, the subtasks become re-usable. This speeds search and learning with only minor losses in optimality.


Last updated: Mon Jun 19 17:44:06 2006

 

 

 

 

 
USC Home Page ISI Home Page