Benchmarking large language model-based agent systems for clinical decision tasks.

Saved in:
Bibliographic Details
Title: Benchmarking large language model-based agent systems for clinical decision tasks.
Authors: Liu Y; Department of Radiation Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Carrero ZI; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Jiang X; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; Department of Thoracic Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, University of Electronic Science and Technology of China (UESTC), Chengdu, China., Ferber D; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany., Wölflein G; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; School of Computer Science, University of St Andrews, St Andrews, UK., Zhang L; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Jayabalan S; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Lenz T; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Hui Z; Department of VIP Medical Services, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China. drhuizg@163.com., Kather JN; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany. kather.jn@tu-dresden.de.; Department of Medicine I, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University of Leeds, Leeds, UK. kather.jn@tu-dresden.de.
Source: NPJ digital medicine [NPJ Digit Med] 2026 Feb 18; Vol. 9 (1). Date of Electronic Publication: 2026 Feb 18.
Publication Type: Journal Article
Journal Info: Publisher: Nature Publishing Group Country of Publication: England NLM ID: 101731738 Publication Model: Electronic Cited Medium: Internet ISSN: 2398-6352 (Electronic) Linking ISSN: 23986352 NLM ISO Abbreviation: NPJ Digit Med Subsets: PubMed not MEDLINE
Database: MEDLINE Ultimate
FullText Text:
  Availability: 0
Header DbId: mdl
DbLabel: MEDLINE Ultimate
An: 41708802
AccessLevel: 2
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Benchmarking large language model-based agent systems for clinical decision tasks.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AU" term="%22Liu+Y%22">Liu Y</searchLink>; Department of Radiation Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.<br /><searchLink fieldCode="AU" term="%22Carrero+ZI%22">Carrero ZI</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.<br /><searchLink fieldCode="AU" term="%22Jiang+X%22">Jiang X</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; Department of Thoracic Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, University of Electronic Science and Technology of China (UESTC), Chengdu, China.<br /><searchLink fieldCode="AU" term="%22Ferber+D%22">Ferber D</searchLink>; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.<br /><searchLink fieldCode="AU" term="%22Wölflein+G%22">Wölflein G</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; School of Computer Science, University of St Andrews, St Andrews, UK.<br /><searchLink fieldCode="AU" term="%22Zhang+L%22">Zhang L</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.<br /><searchLink fieldCode="AU" term="%22Jayabalan+S%22">Jayabalan S</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.<br /><searchLink fieldCode="AU" term="%22Lenz+T%22">Lenz T</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.<br /><searchLink fieldCode="AU" term="%22Hui+Z%22">Hui Z</searchLink>; Department of VIP Medical Services, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China. drhuizg@163.com.<br /><searchLink fieldCode="AU" term="%22Kather+JN%22">Kather JN</searchLink>; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany. kather.jn@tu-dresden.de.; Department of Medicine I, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University of Leeds, Leeds, UK. kather.jn@tu-dresden.de.
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22101731738%22">NPJ digital medicine</searchLink> [NPJ Digit Med] 2026 Feb 18; Vol. 9 (1). <i>Date of Electronic Publication: </i>2026 Feb 18.
– Name: TypePub
  Label: Publication Type
  Group: TypPub
  Data: Journal Article
– Name: TitleSource
  Label: Journal Info
  Group: Src
  Data: <i>Publisher: </i><searchLink fieldCode="PB" term="%22Nature+Publishing+Group%22">Nature Publishing Group </searchLink><i>Country of Publication: </i>England <i>NLM ID: </i>101731738 <i>Publication Model: </i>Electronic <i>Cited Medium: </i>Internet <i>ISSN: </i>2398-6352 (Electronic) <i>Linking ISSN: </i><searchLink fieldCode="IS" term="%2223986352%22">23986352 </searchLink><i>NLM ISO Abbreviation: </i>NPJ Digit Med <i>Subsets: </i>PubMed not MEDLINE
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=mdl&AN=41708802
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1038/s41746-026-02443-6
    Languages:
      – Code: eng
        Text: English
    Titles:
      – TitleFull: Benchmarking large language model-based agent systems for clinical decision tasks.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Liu Y
      – PersonEntity:
          Name:
            NameFull: Carrero ZI
      – PersonEntity:
          Name:
            NameFull: Jiang X
      – PersonEntity:
          Name:
            NameFull: Ferber D
      – PersonEntity:
          Name:
            NameFull: Wölflein G
      – PersonEntity:
          Name:
            NameFull: Zhang L
      – PersonEntity:
          Name:
            NameFull: Jayabalan S
      – PersonEntity:
          Name:
            NameFull: Lenz T
      – PersonEntity:
          Name:
            NameFull: Hui Z
      – PersonEntity:
          Name:
            NameFull: Kather JN
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 18
              M: 02
              Text: 2026 Feb 18
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-electronic
              Value: 2398-6352
          Numbering:
            – Type: volume
              Value: 9
            – Type: issue
              Value: 1
          Titles:
            – TitleFull: NPJ digital medicine
              Type: main
ResultId 1