Abstract
Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating prior knowledge. This results in prohibitively long training times for use on real-world robotic tasks. Existing algorithms capable of extracting task-level representations from high-dimensional inputs, e.g. object detection, often produce outputs of varying lengths, restricting their use in RL methods due to the need for neural networks to have fixed length inputs. In this work, we propose a framework that combines deep sets encoding, which allows for variable-length abstract representations, with modular RL that utilizes these representations, decoupling high-level decision making from low-level control. We successfully demonstrate our approach on the robot manipulation task of object sorting, showing that this method can learn effective policies within mere minutes of highly simplified simulation. The learned policies can be directly deployed on a robot without further training, and generalize to variations of the task unseen during training.
Original language | English |
---|---|
Title of host publication | Australasian Conference on Robotics and Automation |
Editors | David Harvey |
Place of Publication | Australia |
Publisher | Australian Robotics and Automation Association (ARAA) |
Number of pages | 10 |
Volume | 2019-December |
Publication status | Published - 2019 |
Externally published | Yes |
Event | Australasian Conference on Robotics and Automation 2019 - University of Adelaide, Adelaide, Australia Duration: 9 Dec 2019 → 11 Dec 2019 http://www.araa.asn.au/conferences/acra-2019 |
Publication series
Name | Australasian Conference on Robotics and Automation, ACRA |
---|---|
ISSN (Print) | 1448-2053 |
Conference
Conference | Australasian Conference on Robotics and Automation 2019 |
---|---|
Abbreviated title | ACRA 2019 |
Country/Territory | Australia |
City | Adelaide |
Period | 9/12/19 → 11/12/19 |
Internet address |