Hierarchical Multiscale Fusion with Coordinate Attention for Lithologic Mapping from Remote Sensing.

Saved in:
Bibliographic Details
Title: Hierarchical Multiscale Fusion with Coordinate Attention for Lithologic Mapping from Remote Sensing.
Authors: Xie, Fuyuan1,2 (AUTHOR) db21010013b4yj@cumt.edu.cn, Yang, Yongguo1,2 (AUTHOR)
Source: Remote Sensing. Feb2026, Vol. 18 Issue 3, p413. 27p.
Subjects: Geological mapping, Image segmentation, Remote sensing, Remote-sensing images, Digital elevation models
Geographic Terms: China
Abstract: Highlights: What are the main findings? We propose SegNeXt-HFCA, a SegNeXt-based hierarchical multiscale fusion network with coordinate attention for lithologic mapping from Sentinel-2 and DEM data. Class-frequency-aware and boundary-weighted losses combined with seamless sliding-window inference and DenseCRF refinement significantly improve boundary fidelity and the recognition of long-tailed lithologic units, yielding 3–4% mIoU point gains over strong baselines on two structurally complex areas. What are the implications of the main findings? The proposed framework better preserves thin lithologic belts and clarifies ambiguous contacts, producing geologically more plausible lithologic maps in arid, structurally complex terranes. The methodology is transferable to other geological remote sensing applications using multispectral satellite imagery and auxiliary terrain data. Accurate lithologic maps derived from satellite imagery underpin structural interpretation, mineral exploration, and geohazard assessment. However, automated mapping in complex terranes remains challenging because spectrally similar units, narrow anisotropic bodies, and ambiguous contacts can degrade boundary fidelity. In this study, we propose SegNeXt-HFCA, a hierarchical multiscale fusion network with coordinate attention for lithologic segmentation from a Sentinel-2/DEM feature stack. The model builds on SegNeXt and introduces a hierarchical multiscale encoder with coordinate attention to jointly capture fine textures and scene-level structure. It further adopts a class-frequency-aware hybrid loss that combines boundary-weighted online hard-example mining cross-entropy with Lovász-Softmax to better handle long-tailed classes and ambiguous contacts. In addition, we employ a robust training and inference scheme, including entropy-guided patch sampling, exponential moving average of parameters, test-time augmentation, and a DenseCRF-based post-refinement. Two study areas in the Beishan orogen, northwestern China (Huitongshan and Xingxingxia), are used to evaluate the method with a unified 10-channel Sentinel-2/DEM feature stack. Compared with U-NetFormer, PSPNet, DeepLabV3+, DANet, LGMSFNet, SegFormer, BiSeNetV2, and the SegNeXt backbone, SegNeXt-HFCA improves mean intersection-over-union (mIoU) by about 3.8% in Huitongshan and 2.6% in Xingxingxia, respectively, and increases mean pixel accuracy by approximately 3–4%. Qualitative analyses show that the proposed framework better preserves thin-unit continuity, clarifies lithologic contacts, and reduces salt-and-pepper noise, yielding geologically more plausible maps. These results demonstrate that hierarchical multiscale fusion with coordinate attention, together with class- and boundary-aware optimization, provides a practical route to robust lithologic mapping in structurally complex regions. [ABSTRACT FROM AUTHOR]
Copyright of Remote Sensing is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Full text is not displayed to guests.
Description
Abstract:Highlights: What are the main findings? We propose SegNeXt-HFCA, a SegNeXt-based hierarchical multiscale fusion network with coordinate attention for lithologic mapping from Sentinel-2 and DEM data. Class-frequency-aware and boundary-weighted losses combined with seamless sliding-window inference and DenseCRF refinement significantly improve boundary fidelity and the recognition of long-tailed lithologic units, yielding 3–4% mIoU point gains over strong baselines on two structurally complex areas. What are the implications of the main findings? The proposed framework better preserves thin lithologic belts and clarifies ambiguous contacts, producing geologically more plausible lithologic maps in arid, structurally complex terranes. The methodology is transferable to other geological remote sensing applications using multispectral satellite imagery and auxiliary terrain data. Accurate lithologic maps derived from satellite imagery underpin structural interpretation, mineral exploration, and geohazard assessment. However, automated mapping in complex terranes remains challenging because spectrally similar units, narrow anisotropic bodies, and ambiguous contacts can degrade boundary fidelity. In this study, we propose SegNeXt-HFCA, a hierarchical multiscale fusion network with coordinate attention for lithologic segmentation from a Sentinel-2/DEM feature stack. The model builds on SegNeXt and introduces a hierarchical multiscale encoder with coordinate attention to jointly capture fine textures and scene-level structure. It further adopts a class-frequency-aware hybrid loss that combines boundary-weighted online hard-example mining cross-entropy with Lovász-Softmax to better handle long-tailed classes and ambiguous contacts. In addition, we employ a robust training and inference scheme, including entropy-guided patch sampling, exponential moving average of parameters, test-time augmentation, and a DenseCRF-based post-refinement. Two study areas in the Beishan orogen, northwestern China (Huitongshan and Xingxingxia), are used to evaluate the method with a unified 10-channel Sentinel-2/DEM feature stack. Compared with U-NetFormer, PSPNet, DeepLabV3+, DANet, LGMSFNet, SegFormer, BiSeNetV2, and the SegNeXt backbone, SegNeXt-HFCA improves mean intersection-over-union (mIoU) by about 3.8% in Huitongshan and 2.6% in Xingxingxia, respectively, and increases mean pixel accuracy by approximately 3–4%. Qualitative analyses show that the proposed framework better preserves thin-unit continuity, clarifies lithologic contacts, and reduces salt-and-pepper noise, yielding geologically more plausible maps. These results demonstrate that hierarchical multiscale fusion with coordinate attention, together with class- and boundary-aware optimization, provides a practical route to robust lithologic mapping in structurally complex regions. [ABSTRACT FROM AUTHOR]
ISSN:20724292
DOI:10.3390/rs18030413