General scales unlock AI evaluation with explanatory and predictive power.
Saved in:
| Title: | General scales unlock AI evaluation with explanatory and predictive power. |
|---|---|
| Authors: | Zhou L; Princeton University, Princeton, NJ, USA. lz5066@princeton.edu.; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. lz5066@princeton.edu.; Microsoft Research Asia, Beijing, China. lz5066@princeton.edu.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. lz5066@princeton.edu., Pacchiardi L; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Martínez-Plumed F; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Collins KM; Department of Engineering, University of Cambridge, Cambridge, UK., Moros-Daval Y; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Zhang S; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK., Zhao Q; Microsoft Research Asia, Beijing, China., Huang Y; Microsoft Research Asia, Beijing, China., Sun L; The Psychometrics Centre, University of Cambridge, Cambridge, UK., Prunty JE; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Li Z; Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge, UK., Sánchez-García P; KU Leuven, Leuven, Belgium., Jiang-Chen K; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Casares PAM; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Zu J; Educational Testing Service, Princeton, NJ, USA., Burden J; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Mehrbakhsh B; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Stillwell D; The Psychometrics Centre, University of Cambridge, Cambridge, UK., Cebrian M; Center for Automation and Robotics (CAR), Spanish National Research Council (CSIC-UPM), Madrid, Spain., Wang J; William & Mary, Williamsburg, VA, USA., Henderson P; Princeton University, Princeton, NJ, USA., Wu ST; Carnegie Mellon University, Pittsburgh, PA, USA., Kyllonen PC; Educational Testing Service, Princeton, NJ, USA., Cheke L; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK., Xie X; Microsoft Research Asia, Beijing, China. xing.xie@microsoft.com., Hernández-Orallo J; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. josephorallo@gmail.com.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. josephorallo@gmail.com. |
| Source: | Nature [Nature] 2026 Apr; Vol. 652 (8108), pp. 58-67. Date of Electronic Publication: 2026 Apr 01. |
| Publication Type: | Journal Article; Research Support, Non-U.S. Gov't |
| Journal Info: | Publisher: Nature Publishing Group Country of Publication: England NLM ID: 0410462 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1476-4687 (Electronic) Linking ISSN: 00280836 NLM ISO Abbreviation: Nature Subsets: MEDLINE |
| Database: | MEDLINE Ultimate |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: mdl DbLabel: MEDLINE Ultimate An: 41922702 AccessLevel: 2 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: General scales unlock AI evaluation with explanatory and predictive power. – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AU" term="%22Zhou+L%22">Zhou L</searchLink>; Princeton University, Princeton, NJ, USA. lz5066@princeton.edu.; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. lz5066@princeton.edu.; Microsoft Research Asia, Beijing, China. lz5066@princeton.edu.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. lz5066@princeton.edu.<br /><searchLink fieldCode="AU" term="%22Pacchiardi+L%22">Pacchiardi L</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Martínez-Plumed+F%22">Martínez-Plumed F</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Collins+KM%22">Collins KM</searchLink>; Department of Engineering, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Moros-Daval+Y%22">Moros-Daval Y</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Zhang+S%22">Zhang S</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Zhao+Q%22">Zhao Q</searchLink>; Microsoft Research Asia, Beijing, China.<br /><searchLink fieldCode="AU" term="%22Huang+Y%22">Huang Y</searchLink>; Microsoft Research Asia, Beijing, China.<br /><searchLink fieldCode="AU" term="%22Sun+L%22">Sun L</searchLink>; The Psychometrics Centre, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Prunty+JE%22">Prunty JE</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Li+Z%22">Li Z</searchLink>; Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Sánchez-García+P%22">Sánchez-García P</searchLink>; KU Leuven, Leuven, Belgium.<br /><searchLink fieldCode="AU" term="%22Jiang-Chen+K%22">Jiang-Chen K</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Casares+PAM%22">Casares PAM</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Zu+J%22">Zu J</searchLink>; Educational Testing Service, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Burden+J%22">Burden J</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Mehrbakhsh+B%22">Mehrbakhsh B</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Stillwell+D%22">Stillwell D</searchLink>; The Psychometrics Centre, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Cebrian+M%22">Cebrian M</searchLink>; Center for Automation and Robotics (CAR), Spanish National Research Council (CSIC-UPM), Madrid, Spain.<br /><searchLink fieldCode="AU" term="%22Wang+J%22">Wang J</searchLink>; William & Mary, Williamsburg, VA, USA.<br /><searchLink fieldCode="AU" term="%22Henderson+P%22">Henderson P</searchLink>; Princeton University, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Wu+ST%22">Wu ST</searchLink>; Carnegie Mellon University, Pittsburgh, PA, USA.<br /><searchLink fieldCode="AU" term="%22Kyllonen+PC%22">Kyllonen PC</searchLink>; Educational Testing Service, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Cheke+L%22">Cheke L</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Xie+X%22">Xie X</searchLink>; Microsoft Research Asia, Beijing, China. xing.xie@microsoft.com.<br /><searchLink fieldCode="AU" term="%22Hernández-Orallo+J%22">Hernández-Orallo J</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. josephorallo@gmail.com.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. josephorallo@gmail.com. – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="JN" term="%220410462%22">Nature</searchLink> [Nature] 2026 Apr; Vol. 652 (8108), pp. 58-67. <i>Date of Electronic Publication: </i>2026 Apr 01. – Name: TypePub Label: Publication Type Group: TypPub Data: Journal Article; Research Support, Non-U.S. Gov't – Name: TitleSource Label: Journal Info Group: Src Data: <i>Publisher: </i><searchLink fieldCode="PB" term="%22Nature+Publishing+Group%22">Nature Publishing Group </searchLink><i>Country of Publication: </i>England <i>NLM ID: </i>0410462 <i>Publication Model: </i>Print-Electronic <i>Cited Medium: </i>Internet <i>ISSN: </i>1476-4687 (Electronic) <i>Linking ISSN: </i><searchLink fieldCode="IS" term="%2200280836%22">00280836 </searchLink><i>NLM ISO Abbreviation: </i>Nature <i>Subsets: </i>MEDLINE |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=mdl&AN=41922702 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1038/s41586-026-10303-2 Languages: – Code: eng Text: English PhysicalDescription: Pagination: StartPage: 58 Titles: – TitleFull: General scales unlock AI evaluation with explanatory and predictive power. Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Zhou L – PersonEntity: Name: NameFull: Pacchiardi L – PersonEntity: Name: NameFull: Martínez-Plumed F – PersonEntity: Name: NameFull: Collins KM – PersonEntity: Name: NameFull: Moros-Daval Y – PersonEntity: Name: NameFull: Zhang S – PersonEntity: Name: NameFull: Zhao Q – PersonEntity: Name: NameFull: Huang Y – PersonEntity: Name: NameFull: Sun L – PersonEntity: Name: NameFull: Prunty JE – PersonEntity: Name: NameFull: Li Z – PersonEntity: Name: NameFull: Sánchez-García P – PersonEntity: Name: NameFull: Jiang-Chen K – PersonEntity: Name: NameFull: Casares PAM – PersonEntity: Name: NameFull: Zu J – PersonEntity: Name: NameFull: Burden J – PersonEntity: Name: NameFull: Mehrbakhsh B – PersonEntity: Name: NameFull: Stillwell D – PersonEntity: Name: NameFull: Cebrian M – PersonEntity: Name: NameFull: Wang J – PersonEntity: Name: NameFull: Henderson P – PersonEntity: Name: NameFull: Wu ST – PersonEntity: Name: NameFull: Kyllonen PC – PersonEntity: Name: NameFull: Cheke L – PersonEntity: Name: NameFull: Xie X – PersonEntity: Name: NameFull: Hernández-Orallo J IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 04 Text: 2026 Apr Type: published Y: 2026 Identifiers: – Type: issn-electronic Value: 1476-4687 Numbering: – Type: volume Value: 652 – Type: issue Value: 8108 Titles: – TitleFull: Nature Type: main |
| ResultId | 1 |