Abstract
Curriculum learning allows complex tasks to be mastered via incremental progression over 'stepping stone' goals towards a final desired behaviour. Typical implementations learn locomotion policies for challenging environments through gradual complexification of a terrain mesh generated through a parameterised noise function. To date, researchers have predominantly generated terrains from a limited range of noise functions, and the effect of the generator on the learning process is underrepresented in the literature. We compare popular noise-based terrain generators to two indirect encodings, CPPN and GAN. To allow direct comparison between both direct and indirect representations, we assess the impact of a range of representation-agnostic MAP-Elites feature descriptors that compute metrics directly from the generated terrain meshes. Next, performance and coverage are assessed when training a humanoid robot in a physics simulator using the PPO algorithm. Results describe key differences between the generators that inform their use in curriculum learning, and present a range of useful feature descriptors for uptake by the community.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2022 Genetic and Evolutionary Computation Conference |
Editors | Jonathan Fieldsend |
Place of Publication | New York NY USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 377-384 |
Number of pages | 8 |
ISBN (Electronic) | 9781450392372 |
DOIs | |
Publication status | Published - Jul 2022 |
Event | The Genetic and Evolutionary Computation Conference 2022 - Online, Boston, United States of America Duration: 9 Jul 2022 → 13 Jul 2022 https://dl.acm.org/doi/proceedings/10.1145/3520304 (Proceedings) https://gecco-2022.sigevo.org/HomePage (Website) |
Conference
Conference | The Genetic and Evolutionary Computation Conference 2022 |
---|---|
Abbreviated title | GECCO 2022 |
Country/Territory | United States of America |
City | Boston |
Period | 9/07/22 → 13/07/22 |
Internet address |
|
Keywords
- CPPN
- curriculum learning
- GAN
- procedural content generation
- quality-diversity
- reinforcement learning
- representations