Benchmarking large language model-based agent systems for clinical decision tasks.
| Title: | Benchmarking large language model-based agent systems for clinical decision tasks. |
|---|---|
| Authors: | Liu Y; Department of Radiation Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Carrero ZI; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Jiang X; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; Department of Thoracic Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, University of Electronic Science and Technology of China (UESTC), Chengdu, China., Ferber D; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany., Wölflein G; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.; School of Computer Science, University of St Andrews, St Andrews, UK., Zhang L; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Jayabalan S; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Lenz T; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany., Hui Z; Department of VIP Medical Services, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China. drhuizg@163.com., Kather JN; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany. kather.jn@tu-dresden.de.; Department of Medicine I, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany. kather.jn@tu-dresden.de.; Pathology & Data Analytics, Leeds Institute of Medical Research at St James's University of Leeds, Leeds, UK. kather.jn@tu-dresden.de. |
| Source: | NPJ digital medicine [NPJ Digit Med] 2026 Feb 18; Vol. 9 (1). Date of Electronic Publication: 2026 Feb 18. |
| Publication Type: | Journal Article |
| Journal Info: | Publisher: Nature Publishing Group; Country of Publication: England; NLM ID: 101731738; Publication Model: Electronic; Cited Medium: Internet; ISSN: 2398-6352 (Electronic); Linking ISSN: 23986352; NLM ISO Abbreviation: NPJ Digit Med; Subsets: PubMed not MEDLINE |
| Database: | MEDLINE Ultimate |
| FullText | Text: Availability: 0 |
|---|---|
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=mdl&AN=41708802 |
| RecordInfo | DOI: 10.1038/s41746-026-02443-6; Language: English (eng); Title: Benchmarking large language model-based agent systems for clinical decision tasks; Contributors: Liu Y, Carrero ZI, Jiang X, Ferber D, Wölflein G, Zhang L, Jayabalan S, Lenz T, Hui Z, Kather JN; Part of: NPJ digital medicine, Vol. 9, Issue 1, published 2026 Feb 18; ISSN (electronic): 2398-6352 |