Maximizing Throughput of Aerial Base Stations via Resources-based Multi-Agent Proximal Policy Optimization: A Deep Reinforcement Learning Approach

Yu Min Park, Sheikh Salman Hassan, Choong Seon Hong

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Fifth-generation (5G) networks use millimeter-wave (mmWave) technology to process high-speed and capacity data services. However, wireless communication losses occur due to mmWave limitations, i.e., penetration, rain attenuation, and coverage range. Furthermore, many base stations (BSs) are needed to support stable wireless communications and overcome coverage distances in rural and suburban areas. Therefore, a new wireless communication platform that supports communication services at the aerial level is required. Furthermore, this aerial platform enables line-of-sight (LoS) communications rather than non-LoS (NLoS), which is advantageous in overcoming ground-level losses. Thus, an unmanned aerial vehicle (UAV) or an unmanned aerial platform (UAP) that can be rapidly and dynamically deployed at the point of interest is considered. Despite these benefits, UAV-BSs (also known as aerial BSs) still have optimization problems to solve, i.e., resource allocation and trajectory optimization. Thus, this study considered resource-based multi-agent deep reinforcement learning (MADRL) to solve the resource allocation and trajectory optimization problems of UAV-BSs at the same time. However, our proposed optimization problem is non-convex. Thus we proposed an algorithm based on multi-agent proximal policy optimization (MAPPO) DRL. The proposed algorithm treats each agent as a resource variable to perform optimization more effectively. As a result, the proposed algorithm achieved faster convergence and higher rewards than the baselines.

Original languageEnglish
Title of host publicationAPNOMS 2022 - 23rd Asia-Pacific Network Operations and Management Symposium
Subtitle of host publicationData-Driven Intelligent Management in the Era of beyond 5G
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9784885523397
DOIs
Publication statusPublished - 2022
Event23rd Asia-Pacific Network Operations and Management Symposium, APNOMS 2022 - Takamatsu, Japan
Duration: 28 Sept 202230 Sept 2022

Publication series

NameAPNOMS 2022 - 23rd Asia-Pacific Network Operations and Management Symposium: Data-Driven Intelligent Management in the Era of beyond 5G

Conference

Conference23rd Asia-Pacific Network Operations and Management Symposium, APNOMS 2022
Country/TerritoryJapan
CityTakamatsu
Period28/09/2230/09/22

Bibliographical note

Publisher Copyright:
© 2022 IEICE.

Keywords

  • Unmanned aerial vehicle
  • balanced k-means clustering
  • millimeter wave
  • multi-agent deep reinforcement learning
  • proximal policy optimization

Fingerprint

Dive into the research topics of 'Maximizing Throughput of Aerial Base Stations via Resources-based Multi-Agent Proximal Policy Optimization: A Deep Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this