Distributed trajectory design for cooperative internet of UAVs using deep reinforcement learning

Jingzhi Hu, Hongliang Zhang, Kaigui Bian, Lingyang Song, Zhu Han

Research output: Contribution to journalConference articlepeer-review

12 Citations (Scopus)

Abstract

In this paper, we consider a cellular Internet of UAVs, where UAVs execute multiple sensing tasks continuously and cooperatively through sensing and transmission with the objective to minimize the age of information (AoI). However, the cooperative sensing and transmission is coupled with the trajectories of the UAVs, which makes the trajectory design a challenging problem. To tackle this challenge, we first propose a distributed sense-and-send protocol to coordinate the UAVs. Based on this protocol, we formulate the trajectory design problem for AoI minimization and propose a deep reinforcement learning algorithm to solve it, which we refer to as the compound-action actor-critic (CA2C) algorithm. Simulation results show that the CA2C algorithm outperforms two baseline algorithms for AoI minimization.

Original languageEnglish
Article number9014214
JournalProceedings - IEEE Global Communications Conference, GLOBECOM
DOIs
Publication statusPublished - 2019
Event2019 IEEE Global Communications Conference, GLOBECOM 2019 - Waikoloa, United States
Duration: 9 Dec 201913 Dec 2019

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Fingerprint

Dive into the research topics of 'Distributed trajectory design for cooperative internet of UAVs using deep reinforcement learning'. Together they form a unique fingerprint.

Cite this