Chapter

Abstraction for Genetics-Based Reinforcement Learning

01/2008; ISBN: 978-3-902613-14-1 In book: Reinforcement Learning
Source: InTech

ABSTRACT Abstraction may appear a trivial task for humans and the positive results from this work intuitive, but abstraction has not been routinely used in genetics-based reinforcement learning. One reason is that the time each iteration requires is an important consideration and abstraction increases the time for each iteration. Typically XCS takes 20 minutes to play 1000 games (and remains constant), mXCS with abstraction takes 20 minutes for 100 games (although this can vary greatly depending on the choice of parameters) and the Q-Learning algorithm ranges from 5 minutes for 1000 games initially to 90 minutes for 1000 games after 100,000 games training. However, given a fixed amount of time to train all three algorithms mXCS with abstraction would perform the best, once the initial base rules were found. The Q-Learning algorithm has to visit every single state at least once in order to form a successful playing strategy. Whilst the Q-Learning system would ultimately play a very good game, weeks of computation failed to achieve the level of success the Abstraction algorithm had in a very short space of time (hours rather than weeks). Although better Q-learning algorithms (including generalization capabilities) exist (Sutton & Barto, 1998) this choice of benchmark algorithm showed the scale of the problem, which is difficult to calculate. The improvement in abstraction performance from standard XCS to the modified XCS was due to using simpler reinforcement learning. The Widrow-Hoff delta rule converges much faster, which for simpler domains that can be solved easily is beneficial. However, slower and more graceful learning may be required in complex domains when interacting with higher level features. The abstracted rules allow the system to play on states as a whole, including those that have not been encountered, where these states contain a known pattern. This is useful in data-mining, but with the inherent dangers of interpolation and extrapolation. The abstracted rule-base is also compact as an abstracted rule covers more states than either a generalized LCS rule or a Qlearning state. Unique states may still be covered by the base rules. Abstraction has been shown to give an improvement in a complex, but structured domain. It is anticipated that the Abstraction algorithm would be suited to other domains containing repeated patterns.

0 0
 · 
0 Bookmarks
 · 
32 Views

Full-text (2 Sources)

View
0 Downloads
Available from

Keywords

20 minutes
 
5 minutes
 
90 minutes
 
abstracted rule
 
abstracted rule-base
 
abstraction performance
 
fixed amount
 
generalized LCS rule
 
genetics-based reinforcement
 
initial base rules
 
known pattern
 
modified XCS
 
Q-Learning algorithm ranges
 
Q-Learning system
 
simpler reinforcement
 
standard XCS
 
three algorithms mXCS
 
Unique states
 
Widrow-Hoff delta rule converges
 
work intuitive