Enhanced radiology report generation via comprehensive sequence rearrangement and multi-scale cross-region attention.

Saved in:
Bibliographic Details
Title: Enhanced radiology report generation via comprehensive sequence rearrangement and multi-scale cross-region attention.
Authors: Deng, Yan1 (AUTHOR) 2404273956@qq.com, Qin, Qibing2 (AUTHOR) qinbing@wfu.edu.cn, Hu, Wei1 (AUTHOR) wei.workstation@gmail.com, Hu, Jianming3 (AUTHOR) hujianming@cqnu.edu.cn, Yan, Dengwei1 (AUTHOR) dwyan@cqnu.edu.cn, Zhang, Wenfeng1 (AUTHOR) itzhangwf@cqnu.edu.cn, Qiao, Jing4 (AUTHOR) 15114585538@163.com
Source: Visual Computer. Mar2026, Vol. 42 Issue 4, p1-15. 15p.
Abstract: In the medical domain, accurate and detailed radiology reports are pivotal for disease diagnosis and treatment. Despite existing methods showing promise, challenges persist in extracting effective features and focusing on critical regions. To address these issues, we introduce a radiology report generation model, CSR-LMCA, which integrates comprehensive sequence rearrangement with multi-scale cross-region attention. Our model enhances focus on disease-related areas through Saliency-guided Discriminative Attention Mapping (SDAM), significantly improving lesion region identification and background noise suppression. Additionally, the Sequence Rearrangement Mamba (SR-Mamba) module efficiently extracts discriminative features from rearranged long sequences. The Local Multi-scale Cross-region Attention (LMCA) mechanism models local attention relationships and performs cross-region information fusion, strengthening the model’s ability to capture global features and focus on key areas. Experiments on the IU X-ray and MIMIC-CXR datasets demonstrate that CSR-LMCA outperforms state-of-the-art methods, achieving BLEU-4 scores of 0.175 and 0.118, respectively, on these datasets. Here we show that our model not only generates informative and coherent radiology reports but also offers significant improvements in text completeness, coherence, and readability. The code and datasets are available at: . [ABSTRACT FROM AUTHOR]
Copyright of Visual Computer is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Engineering Source
FullText Text:
  Availability: 0
Header DbId: egs
DbLabel: Engineering Source
An: 192064716
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Enhanced radiology report generation via comprehensive sequence rearrangement and multi-scale cross-region attention.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Deng%2C+Yan%22">Deng, Yan</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> 2404273956@qq.com</i><br /><searchLink fieldCode="AR" term="%22Qin%2C+Qibing%22">Qin, Qibing</searchLink><relatesTo>2</relatesTo> (AUTHOR)<i> qinbing@wfu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Hu%2C+Wei%22">Hu, Wei</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> wei.workstation@gmail.com</i><br /><searchLink fieldCode="AR" term="%22Hu%2C+Jianming%22">Hu, Jianming</searchLink><relatesTo>3</relatesTo> (AUTHOR)<i> hujianming@cqnu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Yan%2C+Dengwei%22">Yan, Dengwei</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> dwyan@cqnu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Zhang%2C+Wenfeng%22">Zhang, Wenfeng</searchLink><relatesTo>1</relatesTo> (AUTHOR)<i> itzhangwf@cqnu.edu.cn</i><br /><searchLink fieldCode="AR" term="%22Qiao%2C+Jing%22">Qiao, Jing</searchLink><relatesTo>4</relatesTo> (AUTHOR)<i> 15114585538@163.com</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Visual+Computer%22">Visual Computer</searchLink>. Mar2026, Vol. 42 Issue 4, p1-15. 15p.
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: In the medical domain, accurate and detailed radiology reports are pivotal for disease diagnosis and treatment. Despite existing methods showing promise, challenges persist in extracting effective features and focusing on critical regions. To address these issues, we introduce a radiology report generation model, CSR-LMCA, which integrates comprehensive sequence rearrangement with multi-scale cross-region attention. Our model enhances focus on disease-related areas through Saliency-guided Discriminative Attention Mapping (SDAM), significantly improving lesion region identification and background noise suppression. Additionally, the Sequence Rearrangement Mamba (SR-Mamba) module efficiently extracts discriminative features from rearranged long sequences. The Local Multi-scale Cross-region Attention (LMCA) mechanism models local attention relationships and performs cross-region information fusion, strengthening the model’s ability to capture global features and focus on key areas. Experiments on the IU X-ray and MIMIC-CXR datasets demonstrate that CSR-LMCA outperforms state-of-the-art methods, achieving BLEU-4 scores of 0.175 and 0.118, respectively, on these datasets. Here we show that our model not only generates informative and coherent radiology reports but also offers significant improvements in text completeness, coherence, and readability. The code and datasets are available at: . [ABSTRACT FROM AUTHOR]
– Name: AbstractSuppliedCopyright
  Label:
  Group: Ab
  Data: <i>Copyright of Visual Computer is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=egs&AN=192064716
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1007/s00371-026-04384-3
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 15
        StartPage: 1
    Titles:
      – TitleFull: Enhanced radiology report generation via comprehensive sequence rearrangement and multi-scale cross-region attention.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Deng, Yan
      – PersonEntity:
          Name:
            NameFull: Qin, Qibing
      – PersonEntity:
          Name:
            NameFull: Hu, Wei
      – PersonEntity:
          Name:
            NameFull: Hu, Jianming
      – PersonEntity:
          Name:
            NameFull: Yan, Dengwei
      – PersonEntity:
          Name:
            NameFull: Zhang, Wenfeng
      – PersonEntity:
          Name:
            NameFull: Qiao, Jing
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 03
              Text: Mar2026
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-print
              Value: 01782789
          Numbering:
            – Type: volume
              Value: 42
            – Type: issue
              Value: 4
          Titles:
            – TitleFull: Visual Computer
              Type: main
ResultId 1