Cross-View Localization Based on Few-Shot Learning for Mars Rover via MarsCVFP Guidance.

Saved in:
Bibliographic Details
Title: Cross-View Localization Based on Few-Shot Learning for Mars Rover via MarsCVFP Guidance.
Authors: Kou, Yuke1,2 (AUTHOR), Wan, Wenhui1,2 (AUTHOR) wanwh@aircas.ac.cn, Di, Kaichang1 (AUTHOR), Liu, Zhaoqin1,2 (AUTHOR), Peng, Man1 (AUTHOR), Wang, Yexin1 (AUTHOR), Xie, Bin1,2 (AUTHOR), Wang, Biao1,2 (AUTHOR), Liu, Waichung1 (AUTHOR)
Source: Remote Sensing. Feb2026, Vol. 18 Issue 4, p668. 30p.
Subjects: Mars rovers, Localization problems (Robotics), Planetary exploration, Machine learning, Feature extraction, Template matching (Digital image processing)
Abstract: Highlights: What are the main findings? We innovatively propose a two-stage cross-view localization framework for Mars rovers. The first stage incorporates a few-shot learning model, MarsCVFP, which can implicitly learn and extract cross-view invariant features on the Martian surface without relying on explicitly defined or specific learning targets. In the second stage, a modified template matching algorithm is employed to achieve robust global localization. We introduce a multi-scale feature pyramid structure (MSFPS) and a feature interaction module (FIM) to capture discriminative fine-grained features, especially in Martian environments characterized by weak textures and unstructured features. Alongside, we design a multi-resolution contrastive loss ( L C F ) within the MarsCVFP to alleviate the degradation in feature consistency extraction caused by the spatial resolution discrepancies between rover and orbiter imagery. What is the implication of the main finding? We validate our framework on 85 unit-planned sites and 20 panoramic sites traversed by the Zhurong rover. The proposed framework consistently outperforms both traditional approaches and representative learning-based methods across diverse terrains, including dunes, bedrock, craters, and flat plains. It achieves a localization success rate above 82% while maintaining an accuracy of better than 4 pixels, even under coarse prior position uncertainties spanning 40 m × 40 m. High-precision localization of Mars rovers is essential for safe path planning and efficient navigation toward scientific targets. As planetary rovers traverse the surface, their positional uncertainty accumulates, which can be corrected through global localization by registering rover images to orbital maps. To date, image-based solutions are widely adopted; however, substantial manual intervention is often required, which is time-consuming and limits the range of autonomous navigation. To address these challenges, we propose a two-stage localization framework, comprising the Mars cross-view few-shot training paradigm (MarsCVFP), Mars cross-view feature extraction network (MCVN) trained under MarsCVFP, and a robust template matching algorithm. Specifically, the MarsCVFP model can leverage implicit cross-view feature as guidance without relying on a large amount of high-precision location-level supervision and explicitly annotated, specific learning targets in the scene. MCVN can capture discriminative fine-grained features on the weakly textured and unstructured surface of Mars by constructing the multi-scale feature pyramid structure (MSFPS) and the feature interaction module (FIM). We validate our framework on 85 unit-planned sites and 20 panoramic sites, respectively, as traversed by the Zhurong rover. The experimental results demonstrate that our framework consistently outperforms both the traditional approaches and the representative learning-based methods across diverse terrains, including dunes, bedrock, craters, and flat plains, achieving a localization success rate above 82% while maintaining a localization accuracy of better than 4 pixels, even under coarse prior positions uncertainties spanning 40 m × 40 m. [ABSTRACT FROM AUTHOR]
Copyright of Remote Sensing is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
Full text is not displayed to guests.
FullText Links:
  – Type: pdflink
Text:
  Availability: 1
Header DbId: egs
DbLabel: Engineering Source
An: 191972753
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Cross-View Localization Based on Few-Shot Learning for Mars Rover via MarsCVFP Guidance.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Kou%2C+Yuke%22">Kou, Yuke</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Wan%2C+Wenhui%22">Wan, Wenhui</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> wanwh@aircas.ac.cn</i><br /><searchLink fieldCode="AR" term="%22Di%2C+Kaichang%22">Di, Kaichang</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Liu%2C+Zhaoqin%22">Liu, Zhaoqin</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Peng%2C+Man%22">Peng, Man</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Wang%2C+Yexin%22">Wang, Yexin</searchLink><relatesTo>1</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Xie%2C+Bin%22">Xie, Bin</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Wang%2C+Biao%22">Wang, Biao</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<br /><searchLink fieldCode="AR" term="%22Liu%2C+Waichung%22">Liu, Waichung</searchLink><relatesTo>1</relatesTo> (AUTHOR)
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Remote+Sensing%22">Remote Sensing</searchLink>. Feb2026, Vol. 18 Issue 4, p668. 30p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Mars+rovers%22">Mars rovers</searchLink><br /><searchLink fieldCode="DE" term="%22Localization+problems+%28Robotics%29%22">Localization problems (Robotics)</searchLink><br /><searchLink fieldCode="DE" term="%22Planetary+exploration%22">Planetary exploration</searchLink><br /><searchLink fieldCode="DE" term="%22Machine+learning%22">Machine learning</searchLink><br /><searchLink fieldCode="DE" term="%22Feature+extraction%22">Feature extraction</searchLink><br /><searchLink fieldCode="DE" term="%22Template+matching+%28Digital+image+processing%29%22">Template matching (Digital image processing)</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Highlights: What are the main findings? We innovatively propose a two-stage cross-view localization framework for Mars rovers. The first stage incorporates a few-shot learning model, MarsCVFP, which can implicitly learn and extract cross-view invariant features on the Martian surface without relying on explicitly defined or specific learning targets. In the second stage, a modified template matching algorithm is employed to achieve robust global localization. We introduce a multi-scale feature pyramid structure (MSFPS) and a feature interaction module (FIM) to capture discriminative fine-grained features, especially in Martian environments characterized by weak textures and unstructured features. Alongside, we design a multi-resolution contrastive loss ( L C F ) within the MarsCVFP to alleviate the degradation in feature consistency extraction caused by the spatial resolution discrepancies between rover and orbiter imagery. What is the implication of the main finding? We validate our framework on 85 unit-planned sites and 20 panoramic sites traversed by the Zhurong rover. The proposed framework consistently outperforms both traditional approaches and representative learning-based methods across diverse terrains, including dunes, bedrock, craters, and flat plains. It achieves a localization success rate above 82% while maintaining an accuracy of better than 4 pixels, even under coarse prior position uncertainties spanning 40 m × 40 m. High-precision localization of Mars rovers is essential for safe path planning and efficient navigation toward scientific targets. As planetary rovers traverse the surface, their positional uncertainty accumulates, which can be corrected through global localization by registering rover images to orbital maps. To date, image-based solutions are widely adopted; however, substantial manual intervention is often required, which is time-consuming and limits the range of autonomous navigation. To address these challenges, we propose a two-stage localization framework, comprising the Mars cross-view few-shot training paradigm (MarsCVFP), Mars cross-view feature extraction network (MCVN) trained under MarsCVFP, and a robust template matching algorithm. Specifically, the MarsCVFP model can leverage implicit cross-view feature as guidance without relying on a large amount of high-precision location-level supervision and explicitly annotated, specific learning targets in the scene. MCVN can capture discriminative fine-grained features on the weakly textured and unstructured surface of Mars by constructing the multi-scale feature pyramid structure (MSFPS) and the feature interaction module (FIM). We validate our framework on 85 unit-planned sites and 20 panoramic sites, respectively, as traversed by the Zhurong rover. The experimental results demonstrate that our framework consistently outperforms both the traditional approaches and the representative learning-based methods across diverse terrains, including dunes, bedrock, craters, and flat plains, achieving a localization success rate above 82% while maintaining a localization accuracy of better than 4 pixels, even under coarse prior positions uncertainties spanning 40 m × 40 m. [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of Remote Sensing is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=191972753
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.3390/rs18040668
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 30
        StartPage: 668
    Subjects:
      – SubjectFull: Mars rovers
        Type: general
      – SubjectFull: Localization problems (Robotics)
        Type: general
      – SubjectFull: Planetary exploration
        Type: general
      – SubjectFull: Machine learning
        Type: general
      – SubjectFull: Feature extraction
        Type: general
      – SubjectFull: Template matching (Digital image processing)
        Type: general
    Titles:
      – TitleFull: Cross-View Localization Based on Few-Shot Learning for Mars Rover via MarsCVFP Guidance.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Kou, Yuke
      – PersonEntity:
          Name:
            NameFull: Wan, Wenhui
      – PersonEntity:
          Name:
            NameFull: Di, Kaichang
      – PersonEntity:
          Name:
            NameFull: Liu, Zhaoqin
      – PersonEntity:
          Name:
            NameFull: Peng, Man
      – PersonEntity:
          Name:
            NameFull: Wang, Yexin
      – PersonEntity:
          Name:
            NameFull: Xie, Bin
      – PersonEntity:
          Name:
            NameFull: Wang, Biao
      – PersonEntity:
          Name:
            NameFull: Liu, Waichung
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 15
              M: 02
              Text: Feb2026
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-print
              Value: 20724292
          Numbering:
            – Type: volume
              Value: 18
            – Type: issue
              Value: 4
          Titles:
            – TitleFull: Remote Sensing
              Type: main
ResultId 1