Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/25934
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhang, C-
dc.contributor.authorLiang, S-
dc.contributor.authorHe, C-
dc.contributor.authorWang, K-
dc.date.accessioned2023-02-08T09:34:56Z-
dc.date.available2023-02-08T09:34:56Z-
dc.date.issued2021-02-16-
dc.identifierORCID iD: Kezhi Wang https://orcid.org/0000-0001-8602-0800-
dc.identifier.citationZhang, C. et al. (2022) 'Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning', Journal of Communications and Information Networks, 2022, 7 (2), pp. 192 - 201. doi: 10.23919/JCIN.2022.9815202en_US
dc.identifier.issn2096-1081-
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/25934-
dc.description.abstractIn this paper,multi-unmanned aerial vehicle (multi-UAV) and multi-user system are studied, where UAVs are served as aerial base stations (BS) for ground users in the same frequency band without knowing the locations and channel parameters for the users. We aim to maximize the total throughput for all the users and meet the fairness requirement by optimizing the UAVs’ trajectories and transmission power in a centralized way. This problem is non-convex and very difficult to solve,as the locations of the user are unknown to the UAVs. We propose a deep reinforcement learning(DRL)-based solution,i.e.,soft actor-critic(SAC)to address it via modeling the problem as a Markov decision process (MDP). We carefully design the reward function that combines sparse with non-sparse reward to achieve the balance between exploitation and exploration.The simulation results show that the proposed SAC has a very good performance in terms of both training and testing.en_US
dc.description.sponsorshipNational Natural Science Foundation of China under Grant 62101161; Shenzhen Basic Research Program under Grant 20200811192821001 and Grant JCYJ20190808122409660; Guangdong Basic Research Program under Grant 2019A1515110358, Grant 2021A1515012097, Grant 2020ZDZX1037, Grant 2020ZDZX1021; open research fund of National Mobile Communications Research Laboratory, Southeast University under Grant 2021D16 and Grant 2022D02.en_US
dc.format.extent192 - 201-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.publisherChina InfoCom Media Groupen_US
dc.subjectdeep reinforcement learningen_US
dc.subjectmobile edge computingen_US
dc.subjectunmanned aerial vehicle (UAV)en_US
dc.subjecttrajectory controlen_US
dc.subjectuser associationen_US
dc.titleMulti-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learningen_US
dc.typeArticleen_US
dc.identifier.doihttps://doi.org/10.23919/JCIN.2022.9815202-
dc.relation.isPartOfJournal of Communications and Information Networks-
pubs.issue2-
pubs.publication-statusPublished-
pubs.volume7-
dc.identifier.eissn2509-3312-
dc.rights.holderChina InfoCom Media Group-
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdf1.15 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.