UAV control method combining Reptile meta-reinforcement learning and generative adversarial imitation learning

Shui Jiang, Yanning Ge, Xu Yang, Wencheng Yang, Hui Cui

Research output: Contribution to journal › Article › Research › peer-review

Abstract

Reinforcement learning (RL) is pivotal in empowering Unmanned Aerial Vehicles (UAVs) to navigate and make decisions efficiently and intelligently within complex and dynamic surroundings. Despite its significance, RL is hampered by inherent limitations such as low sample efficiency, restricted generalization capabilities, and a heavy reliance on the intricacies of reward function design. These challenges often render single-method RL approaches inadequate, particularly in the context of UAV operations, where the high costs and safety risks of real-world applications cannot be overlooked. To address these issues, this paper introduces a novel RL framework that integrates meta-learning and imitation learning. By leveraging the Reptile algorithm from meta-learning and Generative Adversarial Imitation Learning (GAIL), coupled with state normalization techniques for processing state data, this framework significantly enhances the model’s adaptability. It achieves this by identifying and leveraging commonalities across various tasks, allowing for swift adaptation to new challenges without the need for complex reward function designs. To ascertain the efficacy of this integrated approach, we conducted simulation experiments within two-dimensional environments. The empirical results indicate that our GAIL-enhanced Reptile method surpasses conventional single-method RL algorithms in terms of training efficiency. This evidence underscores the potential of combining meta-learning and imitation learning to surmount the traditional barriers faced by reinforcement learning in UAV trajectory planning and decision-making processes.
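
As a rough illustration of two ingredients named in the abstract, the sketch below shows the standard Reptile outer update, which moves the meta-parameters a fraction of the way toward task-adapted parameters, and one common form of state normalization. It is not the authors' implementation: the abstract does not say which normalization is used, so the running mean/variance scheme, the class name `RunningNormalizer`, the function name `reptile_step`, and the step size `epsilon` are all assumptions made for illustration.

```python
import math


def reptile_step(theta, phi, epsilon):
    """One Reptile outer update: theta <- theta + epsilon * (phi - theta).

    theta: meta-parameters before adapting to a task (list of floats).
    phi:   parameters after a few inner-loop steps on that task.
    epsilon: outer step size (hypothetical value chosen by the caller).
    """
    return [t + epsilon * (p - t) for t, p in zip(theta, phi)]


class RunningNormalizer:
    """Running mean/variance normalizer for state vectors (Welford's method).

    Assumed scheme for illustration; the abstract only says "state normalization".
    """

    def __init__(self, dim, eps=1e-8):
        self.count = 0
        self.mean = [0.0] * dim
        self.m2 = [0.0] * dim  # running sum of squared deviations
        self.eps = eps

    def update(self, state):
        """Fold one observed state into the running statistics."""
        self.count += 1
        for i, x in enumerate(state):
            delta = x - self.mean[i]
            self.mean[i] += delta / self.count
            self.m2[i] += delta * (x - self.mean[i])

    def normalize(self, state):
        """Return the state shifted by the running mean and scaled by the running std."""
        n = max(self.count, 1)
        return [
            (x - m) / math.sqrt(v / n + self.eps)
            for x, m, v in zip(state, self.mean, self.m2)
        ]
```

In a full training loop, `phi` would come from a few policy-gradient steps on one sampled task, with the reward supplied by a GAIL discriminator instead of a hand-designed reward function; that inner loop is omitted here.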

Original language: English
Article number: 105
Number of pages: 18
Journal: Future Internet
Volume: 16
Issue number: 3
DOIs
Publication status: Published - 20 Mar 2024

Keywords

  • generative adversarial imitation learning
  • meta-reinforcement learning
  • unmanned aerial vehicles (UAVs)
