Multi-scale channel enhanced transformer for rock thin sections identification and sequence consistency optimization.

Saved in:
Bibliographic Details
Title: Multi-scale channel enhanced transformer for rock thin sections identification and sequence consistency optimization.
Authors: Guo, Xiaoyao1,2 (AUTHOR) xiaoyao1206@foxmail.com, Chen, Yan1,2 (AUTHOR) carly_chen@126.com, He, Shipeng3 (AUTHOR) heshipeng@petrochina.com.cn, Zhang, Xingpeng1,2 (AUTHOR) xpzhang@swpu.edu.cn, Zhou, Jing1 (AUTHOR) shenwangxiaobai@163.com, Bao, Xucheng1,2 (AUTHOR) xucheng.bao@foxmail.com
Source: Computational Geosciences. Jun2025, Vol. 29 Issue 3, p1-19. 19p.
Subjects: Convolutional neural networks, Image recognition (Computer vision), Transformer models, Deep learning, Brewster's angle
Abstract: The identification of rock thin sections plays a pivotal role in geological exploration, as it provides critical insights into the fundamental properties and composition of rocks. However, the accurate identification of mineral particles presents significant challenges due to three primary factors: the inherent imbalance in data distribution, misclassification caused by feature similarity, and substantial feature variations observed under different cross-polarized angles. These complexities render conventional deep-learning models with single-structure architectures inadequate for precise mineral particle identification. Therefore, this study proposes a novel rock thin section image classification methodology that combines a Multi-Scale Channel Enhanced Transformer (MSCET) with a Sequence Consistency Optimization (SCO) strategy. This integrated approach is designed to effectively extract distinctive features of mineral particles while fully exploiting the influence of polarization angle variations. The MSCET architecture synergistically combines Convolutional Neural Networks (CNN), Squeeze-and-Excitation Networks (SENet), and Transformer mechanisms to enhance the network's feature representation capabilities. Specifically, it employs distinct convolutional operations to extract both coarse- and fine-grained features of mineral particles. The SENet and Transformer structures are then utilized to aggregate global information across both channel and spatial dimensions. Furthermore, we introduce the SCO strategy to refine low-confidence predictions, thereby mitigating the impact of feature variations in multi-angle cross-polarized images. Comprehensive experimental evaluations demonstrate the efficacy of our proposed method, achieving a classification accuracy of 92.35% on the test set. The method also shows significant improvements in key performance metrics, including recall, precision, and F1 score, substantiating its potential for robust rock thin section identification in geological applications. [ABSTRACT FROM AUTHOR]
Copyright of Computational Geosciences is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Links:
  – Type: pdflink
Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 184706291
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Multi-scale channel enhanced transformer for rock thin sections identification and sequence consistency optimization.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Guo%2C+Xiaoyao%22">Guo, Xiaoyao</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> xiaoyao1206@foxmail.com</i><br /><searchLink fieldCode="AR" term="%22Chen%2C+Yan%22">Chen, Yan</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> carly_chen@126.com</i><br /><searchLink fieldCode="AR" term="%22He%2C+Shipeng%22">He, Shipeng</searchLink><relatesTo>3</relatesTo> (AUTHOR)<i> heshipeng@petrochina.com.cn</i><br /><searchLink fieldCode="AR" term="%22Zhang%2C+Xingpeng%22">Zhang, Xingpeng</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> xpzhang@swpu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Zhou%2C+Jing%22">Zhou, Jing</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> shenwangxiaobai@163.com</i><br /><searchLink fieldCode="AR" term="%22Bao%2C+Xucheng%22">Bao, Xucheng</searchLink><relatesTo>1,2</relatesTo> (AUTHOR)<i> xucheng.bao@foxmail.com</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Computational+Geosciences%22">Computational Geosciences</searchLink>. Jun2025, Vol. 29 Issue 3, p1-19. 19p.
– Name: Subject
  Label: Subjects
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Convolutional+neural+networks%22">Convolutional neural networks</searchLink><br /><searchLink fieldCode="DE" term="%22Image+recognition+%28Computer+vision%29%22">Image recognition (Computer vision)</searchLink><br /><searchLink fieldCode="DE" term="%22Transformer+models%22">Transformer models</searchLink><br /><searchLink fieldCode="DE" term="%22Deep+learning%22">Deep learning</searchLink><br /><searchLink fieldCode="DE" term="%22Brewster's+angle%22">Brewster's angle</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: The identification of rock thin sections plays a pivotal role in geological exploration, as it provides critical insights into the fundamental properties and composition of rocks. However, the accurate identification of mineral particles presents significant challenges due to three primary factors: the inherent imbalance in data distribution, misclassification caused by feature similarity, and substantial feature variations observed under different cross-polarized angles. These complexities render conventional deep-learning models with single-structure architectures inadequate for precise mineral particle identification. Therefore, this study proposes a novel rock thin section image classification methodology that combines a Multi-Scale Channel Enhanced Transformer (MSCET) with a Sequence Consistency Optimization (SCO) strategy. This integrated approach is designed to effectively extract distinctive features of mineral particles while fully exploiting the influence of polarization angle variations. The MSCET architecture synergistically combines Convolutional Neural Networks (CNN), Squeeze-and-Excitation Networks (SENet), and Transformer mechanisms to enhance the network's feature representation capabilities. Specifically, it employs distinct convolutional operations to extract both coarse- and fine-grained features of mineral particles. The SENet and Transformer structures are then utilized to aggregate global information across both channel and spatial dimensions. Furthermore, we introduce the SCO strategy to refine low-confidence predictions, thereby mitigating the impact of feature variations in multi-angle cross-polarized images. Comprehensive experimental evaluations demonstrate the efficacy of our proposed method, achieving a classification accuracy of 92.35% on the test set. The method also shows significant improvements in key performance metrics, including recall, precision, and F1 score, substantiating its potential for robust rock thin section identification in geological applications. [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of Computational Geosciences is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=184706291
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1007/s10596-025-10356-8
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 19
        StartPage: 1
    Subjects:
      – SubjectFull: Convolutional neural networks
        Type: general
      – SubjectFull: Image recognition (Computer vision)
        Type: general
      – SubjectFull: Transformer models
        Type: general
      – SubjectFull: Deep learning
        Type: general
      – SubjectFull: Brewster's angle
        Type: general
    Titles:
      – TitleFull: Multi-scale channel enhanced transformer for rock thin sections identification and sequence consistency optimization.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Guo, Xiaoyao
      – PersonEntity:
          Name:
            NameFull: Chen, Yan
      – PersonEntity:
          Name:
            NameFull: He, Shipeng
      – PersonEntity:
          Name:
            NameFull: Zhang, Xingpeng
      – PersonEntity:
          Name:
            NameFull: Zhou, Jing
      – PersonEntity:
          Name:
            NameFull: Bao, Xucheng
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 06
              Text: Jun2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 14200597
          Numbering:
            – Type: volume
              Value: 29
            – Type: issue
              Value: 3
          Titles:
            – TitleFull: Computational Geosciences
              Type: main
ResultId 1