Multi-Level Attention and Scale-Aware Fusion for Remote Sensing Scene Object Detection.

Saved in:
Bibliographic Details
Title: Multi-Level Attention and Scale-Aware Fusion for Remote Sensing Scene Object Detection.
Authors: Li, Yanling1 liyanling@xynu.edu.cn, Li, Jiaman1 jiaman0813@163.com, Yang, Zhipeng1 yangzp@xynu.edu.cn, Chen, Chongyang1 cychen@xynu.edu.cn
Source: Engineering Letters. May2026, Vol. 34 Issue 5, p1506-1523. 18p.
Subjects: Remote sensing, Object recognition (Computer vision), Data fusion (Statistics), Image analysis, Computer vision, Deep learning
Abstract: Object detection plays a pivotal role in intelligent remote sensing image interpretation, with critical applications spanning national defense, security, and smart city development. However, two fundamental challenges persist: complex background interference and significant object scale variations, both severely degrading detection performance. A novel remote sensing object detection method, denoted as MASF, is proposed in this work. The framework consists of three core components: a backbone network, a neck network, and a detection head. To address background interference, we incorporated a Dynamic Bottleneck Module (DBM) into the backbone network. The DBM's core component is a Star Attention Block. This module significantly improves the model's target localization capability in complex scenes by modeling long-range dependencies across regions. A Multi-Kernel Feature Diffusion Pyramid Network is proposed in the neck network to handle multi-scale objects. This architecture utilizes hierarchical feature interactions and a dedicated FocusFeature module to adaptively aggregate features, thereby improving multi-scale detection accuracy. While preserving high-resolution details and semantic consistency, this module enhances the model's recognition accuracy for objects of varying scales by vertically diffusing information across resolutions and hierarchical levels. The detection head is responsible for final object classification and localization. Comprehensive evaluations conducted on the demanding remote sensing benchmark datasets, VisDrone2019-DET and NWPU VHR-10, confirm the efficacy of the proposed methodology. [ABSTRACT FROM AUTHOR]
Copyright of Engineering Letters is the property of International Association of Engineers (IAENG) and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Links:
  – Type: pdflink
Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 193453910
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Multi-Level Attention and Scale-Aware Fusion for Remote Sensing Scene Object Detection.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Li%2C+Yanling%22">Li, Yanling</searchLink><relatesTo>1</relatesTo><i> liyanling@xynu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Li%2C+Jiaman%22">Li, Jiaman</searchLink><relatesTo>1</relatesTo><i> jiaman0813@163.com</i><br /><searchLink fieldCode="AR" term="%22Yang%2C+Zhipeng%22">Yang, Zhipeng</searchLink><relatesTo>1</relatesTo><i> yangzp@xynu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Chen%2C+Chongyang%22">Chen, Chongyang</searchLink><relatesTo>1</relatesTo><i> cychen@xynu.edu.cn</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Engineering+Letters%22">Engineering Letters</searchLink>. May2026, Vol. 34 Issue 5, p1506-1523. 18p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Remote+sensing%22">Remote sensing</searchLink><br /><searchLink fieldCode="DE" term="%22Object+recognition+%28Computer+vision%29%22">Object recognition (Computer vision)</searchLink><br /><searchLink fieldCode="DE" term="%22Data+fusion+%28Statistics%29%22">Data fusion (Statistics)</searchLink><br /><searchLink fieldCode="DE" term="%22Image+analysis%22">Image analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+vision%22">Computer vision</searchLink><br /><searchLink fieldCode="DE" term="%22Deep+learning%22">Deep learning</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Object detection plays a pivotal role in intelligent remote sensing image interpretation, with critical applications spanning national defense, security, and smart city development. However, two fundamental challenges persist: complex background interference and significant object scale variations, both severely degrading detection performance. A novel remote sensing object detection method, denoted as MASF, is proposed in this work. The framework consists of three core components: a backbone network, a neck network, and a detection head. To address background interference, we incorporated a Dynamic Bottleneck Module (DBM) into the backbone network. The DBM's core component is a Star Attention Block. This module significantly improves the model's target localization capability in complex scenes by modeling long-range dependencies across regions. A Multi-Kernel Feature Diffusion Pyramid Network is proposed in the neck network to handle multi-scale objects. This architecture utilizes hierarchical feature interactions and a dedicated FocusFeature module to adaptively aggregate features, thereby improving multi-scale detection accuracy. While preserving high-resolution details and semantic consistency, this module enhances the model's recognition accuracy for objects of varying scales by vertically diffusing information across resolutions and hierarchical levels. The detection head is responsible for final object classification and localization. Comprehensive evaluations conducted on the demanding remote sensing benchmark datasets, VisDrone2019-DET and NWPU VHR-10, confirm the efficacy of the proposed methodology. [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of Engineering Letters is the property of International Association of Engineers (IAENG) and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=193453910
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 18
        StartPage: 1506
    Subjects:
      – SubjectFull: Remote sensing
        Type: general
      – SubjectFull: Object recognition (Computer vision)
        Type: general
      – SubjectFull: Data fusion (Statistics)
        Type: general
      – SubjectFull: Image analysis
        Type: general
      – SubjectFull: Computer vision
        Type: general
      – SubjectFull: Deep learning
        Type: general
    Titles:
      – TitleFull: Multi-Level Attention and Scale-Aware Fusion for Remote Sensing Scene Object Detection.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Li, Yanling
      – PersonEntity:
          Name:
            NameFull: Li, Jiaman
      – PersonEntity:
          Name:
            NameFull: Yang, Zhipeng
      – PersonEntity:
          Name:
            NameFull: Chen, Chongyang
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 05
              Text: May2026
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-print
              Value: 1816093X
          Numbering:
            – Type: volume
              Value: 34
            – Type: issue
              Value: 5
          Titles:
            – TitleFull: Engineering Letters
              Type: main
ResultId 1