SRAD: Autonomous Decision‐Making Method for UAV Based on Safety Reinforcement Learning.

Bibliographic Details
Title:	SRAD: Autonomous Decision‐Making Method for UAV Based on Safety Reinforcement Learning.
Authors:	Xiao, Wenwen¹ (AUTHOR), Luo, Xiangfeng¹ (AUTHOR) luoxf@shu.edu.cn, Xie, Shaorong¹ (AUTHOR)
Source:	Expert Systems. May 2025, Vol. 42 Issue 5, p1-18. 18p.
Subjects:	Image segmentation, Learning modules, Prior learning
Abstract:	Unmanned aerial vehicles (UAVs) are increasingly vital across numerous sectors, from logistics and rescue operations to military endeavours and beyond. However, ensuring safety in the decision‐making processes surrounding UAV operations in real‐world settings has become an urgent and complex challenge. At present, the main methods to minimise the risk of drone decision‐making include utilising pre‐established control rules, expert prior knowledge and regularisation constraints. However, these methodologies require UAVs to meet demanding prerequisites, including the acquisition of extensive decision‐making experience and the establishment of comprehensive rules. Regrettably, these strict requirements often lead to frequent UAV crashes in uncertain environments and subsequent mission failures. In order to tackle these issues, we propose a self‐decision‐making method for quadcopter UAVs based on safe reinforcement learning. Our method utilises a multilevel cascading feature semantic space for reinforcement learning, integrating depth images, greyscale images, semantic segmentation images and object detection results as inputs. This approach aims to facilitate safe autonomous learning. Moreover, we integrate real offline labelled data to enhance the safety policy. Depending on the varying levels of risk encountered during the UAV's decision‐making process, we dynamically select different safety policies. Through this iterative process, the UAV progressively eliminates extreme actions and reverts to the UAV learning policy module. Experimental results indicate that our method not only ensures safe decision‐making for UAVs in uncertain environments but also exhibits superior safety decision‐making efficacy compared to certain baseline methods. [ABSTRACT FROM AUTHOR]
	Copyright of Expert Systems is the property of Wiley-Blackwell and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database:	Engineering Source
Publication Type:	Academic Journal
Language:	English
ISSN:	0266-4720
DOI:	10.1111/exsy.70004
Accession Number:	184494874
Permalink:	https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=184494874