General scales unlock AI evaluation with explanatory and predictive power.

Saved in:
Bibliographic Details
Title: General scales unlock AI evaluation with explanatory and predictive power.
Authors: Zhou L; Princeton University, Princeton, NJ, USA. lz5066@princeton.edu.; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. lz5066@princeton.edu.; Microsoft Research Asia, Beijing, China. lz5066@princeton.edu.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. lz5066@princeton.edu., Pacchiardi L; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Martínez-Plumed F; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Collins KM; Department of Engineering, University of Cambridge, Cambridge, UK., Moros-Daval Y; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Zhang S; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK., Zhao Q; Microsoft Research Asia, Beijing, China., Huang Y; Microsoft Research Asia, Beijing, China., Sun L; The Psychometrics Centre, University of Cambridge, Cambridge, UK., Prunty JE; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Li Z; Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge, UK., Sánchez-García P; KU Leuven, Leuven, Belgium., Jiang-Chen K; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Casares PAM; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Zu J; Educational Testing Service, Princeton, NJ, USA., Burden J; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK., Mehrbakhsh B; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain., Stillwell D; The Psychometrics Centre, University of Cambridge, Cambridge, UK., Cebrian M; Center for Automation and Robotics (CAR), Spanish National Research Council (CSIC-UPM), Madrid, Spain., Wang J; William & Mary, Williamsburg, VA, USA., Henderson P; Princeton University, Princeton, NJ, USA., Wu ST; Carnegie Mellon University, Pittsburgh, PA, USA., Kyllonen PC; Educational Testing Service, Princeton, NJ, USA., Cheke L; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK., Xie X; Microsoft Research Asia, Beijing, China. xing.xie@microsoft.com., Hernández-Orallo J; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. josephorallo@gmail.com.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. josephorallo@gmail.com.
Source: Nature [Nature] 2026 Apr; Vol. 652 (8108), pp. 58-67. Date of Electronic Publication: 2026 Apr 01.
Publication Type: Journal Article; Research Support, Non-U.S. Gov't
Journal Info: Publisher: Nature Publishing Group Country of Publication: England NLM ID: 0410462 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1476-4687 (Electronic) Linking ISSN: 00280836 NLM ISO Abbreviation: Nature Subsets: MEDLINE
Database: MEDLINE Ultimate
FullText Text:
  Availability: 0
Header DbId: mdl
DbLabel: MEDLINE Ultimate
An: 41922702
AccessLevel: 2
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: General scales unlock AI evaluation with explanatory and predictive power.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AU" term="%22Zhou+L%22">Zhou L</searchLink>; Princeton University, Princeton, NJ, USA. lz5066@princeton.edu.; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. lz5066@princeton.edu.; Microsoft Research Asia, Beijing, China. lz5066@princeton.edu.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. lz5066@princeton.edu.<br /><searchLink fieldCode="AU" term="%22Pacchiardi+L%22">Pacchiardi L</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Martínez-Plumed+F%22">Martínez-Plumed F</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Collins+KM%22">Collins KM</searchLink>; Department of Engineering, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Moros-Daval+Y%22">Moros-Daval Y</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Zhang+S%22">Zhang S</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Zhao+Q%22">Zhao Q</searchLink>; Microsoft Research Asia, Beijing, China.<br /><searchLink fieldCode="AU" term="%22Huang+Y%22">Huang Y</searchLink>; Microsoft Research Asia, Beijing, China.<br /><searchLink fieldCode="AU" term="%22Sun+L%22">Sun L</searchLink>; The Psychometrics Centre, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Prunty+JE%22">Prunty JE</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Li+Z%22">Li Z</searchLink>; Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Sánchez-García+P%22">Sánchez-García P</searchLink>; KU Leuven, Leuven, Belgium.<br /><searchLink fieldCode="AU" term="%22Jiang-Chen+K%22">Jiang-Chen K</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Casares+PAM%22">Casares PAM</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Zu+J%22">Zu J</searchLink>; Educational Testing Service, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Burden+J%22">Burden J</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Mehrbakhsh+B%22">Mehrbakhsh B</searchLink>; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain.<br /><searchLink fieldCode="AU" term="%22Stillwell+D%22">Stillwell D</searchLink>; The Psychometrics Centre, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Cebrian+M%22">Cebrian M</searchLink>; Center for Automation and Robotics (CAR), Spanish National Research Council (CSIC-UPM), Madrid, Spain.<br /><searchLink fieldCode="AU" term="%22Wang+J%22">Wang J</searchLink>; William & Mary, Williamsburg, VA, USA.<br /><searchLink fieldCode="AU" term="%22Henderson+P%22">Henderson P</searchLink>; Princeton University, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Wu+ST%22">Wu ST</searchLink>; Carnegie Mellon University, Pittsburgh, PA, USA.<br /><searchLink fieldCode="AU" term="%22Kyllonen+PC%22">Kyllonen PC</searchLink>; Educational Testing Service, Princeton, NJ, USA.<br /><searchLink fieldCode="AU" term="%22Cheke+L%22">Cheke L</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK.; Department of Psychology, University of Cambridge, Cambridge, UK.<br /><searchLink fieldCode="AU" term="%22Xie+X%22">Xie X</searchLink>; Microsoft Research Asia, Beijing, China. xing.xie@microsoft.com.<br /><searchLink fieldCode="AU" term="%22Hernández-Orallo+J%22">Hernández-Orallo J</searchLink>; Leverhulme Centre for the Future of Intelligence, University of Cambridge, Cambridge, UK. josephorallo@gmail.com.; Valencian Research Institute for Artificial Intelligence (VRAIN), Universitat Politècnica de València, València, Spain. josephorallo@gmail.com.
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%220410462%22">Nature</searchLink> [Nature] 2026 Apr; Vol. 652 (8108), pp. 58-67. <i>Date of Electronic Publication: </i>2026 Apr 01.
– Name: TypePub
  Label: Publication Type
  Group: TypPub
  Data: Journal Article; Research Support, Non-U.S. Gov't
– Name: TitleSource
  Label: Journal Info
  Group: Src
  Data: <i>Publisher: </i><searchLink fieldCode="PB" term="%22Nature+Publishing+Group%22">Nature Publishing Group </searchLink><i>Country of Publication: </i>England <i>NLM ID: </i>0410462 <i>Publication Model: </i>Print-Electronic <i>Cited Medium: </i>Internet <i>ISSN: </i>1476-4687 (Electronic) <i>Linking ISSN: </i><searchLink fieldCode="IS" term="%2200280836%22">00280836 </searchLink><i>NLM ISO Abbreviation: </i>Nature <i>Subsets: </i>MEDLINE
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=mdl&AN=41922702
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1038/s41586-026-10303-2
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        StartPage: 58
    Titles:
      – TitleFull: General scales unlock AI evaluation with explanatory and predictive power.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Zhou L
      – PersonEntity:
          Name:
            NameFull: Pacchiardi L
      – PersonEntity:
          Name:
            NameFull: Martínez-Plumed F
      – PersonEntity:
          Name:
            NameFull: Collins KM
      – PersonEntity:
          Name:
            NameFull: Moros-Daval Y
      – PersonEntity:
          Name:
            NameFull: Zhang S
      – PersonEntity:
          Name:
            NameFull: Zhao Q
      – PersonEntity:
          Name:
            NameFull: Huang Y
      – PersonEntity:
          Name:
            NameFull: Sun L
      – PersonEntity:
          Name:
            NameFull: Prunty JE
      – PersonEntity:
          Name:
            NameFull: Li Z
      – PersonEntity:
          Name:
            NameFull: Sánchez-García P
      – PersonEntity:
          Name:
            NameFull: Jiang-Chen K
      – PersonEntity:
          Name:
            NameFull: Casares PAM
      – PersonEntity:
          Name:
            NameFull: Zu J
      – PersonEntity:
          Name:
            NameFull: Burden J
      – PersonEntity:
          Name:
            NameFull: Mehrbakhsh B
      – PersonEntity:
          Name:
            NameFull: Stillwell D
      – PersonEntity:
          Name:
            NameFull: Cebrian M
      – PersonEntity:
          Name:
            NameFull: Wang J
      – PersonEntity:
          Name:
            NameFull: Henderson P
      – PersonEntity:
          Name:
            NameFull: Wu ST
      – PersonEntity:
          Name:
            NameFull: Kyllonen PC
      – PersonEntity:
          Name:
            NameFull: Cheke L
      – PersonEntity:
          Name:
            NameFull: Xie X
      – PersonEntity:
          Name:
            NameFull: Hernández-Orallo J
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 04
              Text: 2026 Apr
              Type: published
              Y: 2026
          Identifiers:
            – Type: issn-electronic
              Value: 1476-4687
          Numbering:
            – Type: volume
              Value: 652
            – Type: issue
              Value: 8108
          Titles:
            – TitleFull: Nature
              Type: main
ResultId 1