Sebastian Risi, Sandy D. Vanderbleek, Charles E. Hughes and Kenneth O. Stanley (2009)
How Novelty Search Escapes the Deceptive Trap of Learning to Learn
In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2009). New York, NY:ACM (8 pages)
Winner of the Best Paper Award in the Artificial Life, Evolutionary Robotics, Adaptive Behavior, Evolvable Hardware Track at GECCO-2009
A major goal for researchers in neuroevolution is to evolve artificial neural networks (ANNs) that can learn during their lifetime. Such networks can adapt to changes in their environment that evolution on its own cannot anticipate. However, a profound problem with evolving adaptive systems is that if the impact of learning on the fitness of the agent is only marginal, then evolution is likely to produce individuals that do not exhibit the desired adaptive behavior. Instead, because it is easier at first to improve fitness without evolving the ability to learn, they are likely to exploit domain-dependent static (i.e. non-adaptive) heuristics. This paper proposes a way to escape the deceptive trap of static policies based on the novelty search algorithm, which opens up a new avenue in the evolution of adaptive systems because it can exploit the behavioral difference between learning and non-learning individuals. The main idea in novelty search is to abandon objective-based fitness and instead simply search only for novel behavior, which avoids deception entirely and has shown prior promising results in other domains. This paper shows that novelty search significantly outperforms fitness-based search in a tunably deceptive T-Maze navigation domain because it fosters the emergence of adaptive behavior.