Login
HomePublicationsJournal Paper

Data-driven Flight Control of Internet-of-Drones for Sensor Data Aggregation using Multi-agent Deep Reinforcement Learning
Ref: CISTER-TR-220502       Publication Date: 2022

Data-driven Flight Control of Internet-of-Drones for Sensor Data Aggregation using Multi-agent Deep Reinforcement Learning

Ref: CISTER-TR-220502       Publication Date: 2022

Abstract:
Energy-harvesting-powered sensors are increasingly deployed beyond the reach of terrestrial gateways, where there is often no persistent power supply. Making use of the internet of drones (IoD) for data aggregation in such environments is a promising paradigm to enhance network scalability and connectivity. The flexibility of IoD and favorable line-of-sight connections between the drones and ground nodes are exploited to improve data reception at the drones. In this article, we discuss the challenges of online flight control of IoD, where data-driven neural networks can be tailored to design the trajectories and patrol speeds of the drones and their communication schedules, preventing buffer overflows at the ground nodes. In a small-scale IoD, a multi-agent deep reinforcement learning can be developed with long short-term memory to train the continuous flight control of IoD and data aggregation scheduling, where a joint action is generated for IoD via sharing the flight control decisions among the drones. In a large-scale IoD, sharing the flight control decisions in real-time can result in communication overheads and interference. In this case, deep reinforcement learning can be trained with the second-hand visiting experiences, where the drones learn the actions of each other based on historical scheduling records maintained at the ground nodes.

Authors:
Kai Li
,
Wei Ni
,
Yousef Emami
,
Falko Dressler


Published in IEEE Wireless Communications Magazine (WCM) (WCM), IEEE.

Notes: This work was supported in part by the National Funds through FCT/MCTES (Portuguese Foundation for Science and Technology), within the CISTER Research Unit under Grant UIDP/UIDB/04234/2020, and in part by the National Funds through FCT, under CMU Portugal Partnership under Project CMU/TIC/0022/2019 (CRUAV).



Record Date: 6, May, 2022