Do Test Scores Misrepresent Test Results? An Item-by-Item Analysis. Discussion Paper #2025.13
| Title: | Do Test Scores Misrepresent Test Results? An Item-by-Item Analysis. Discussion Paper #2025.13 |
|---|---|
| Language: | English |
| Authors: | Jesse Bruhn; Michael Gilraine; Jens Ludwig; Sendhil Mullainathan; Massachusetts Institute of Technology (MIT), Blueprint Labs; National Bureau of Economic Research (NBER) |
| Source: | Blueprint Labs. 2025. |
| Availability: | Blueprint Labs. 30 Wadsworth Street, Cambridge, MA 02142. e-mail: contact@mitblueprintlabs.org; Web site: https://blueprintlabs.mit.edu/ |
| Peer Reviewed: | No |
| Page Count: | 85 |
| Publication Date: | 2025 |
| Document Type: | Reports - Research |
| Descriptors: | Testing, Tests, Scores, Test Results, Response Style (Tests), Data Use, Testing Problems, Item Analysis, Item Response Theory, Data, Data Interpretation |
| Geographic Terms: | Texas |
| Abstract: | Much of the data collected in education is effectively thrown away. Students answer individual test questions, but administrators and researchers only see aggregate performance. All the item-level data are lost. Ex ante it is not clear this destroys much useful information, since the aggregate might be a sufficient statistic. Using data from Texas for 5 million students and 1.31 billion student-item responses, the researchers show that in fact aggregation does destroy a great deal of valuable information in education: (1) Even conditional on a summary test measure, there is additional information in the item-level data; (2) This additional information is relevant for the student outcomes that education decisions seek to optimize; and (3) This information can be made practically useful for schools. |
| Abstractor: | As Provided |
| Entry Date: | 2026 |
| Access URL: | https://blueprintlabs.mit.edu/research/do-test-scores-misrepresent-test-results-an-item-by-item-analysis/ |
| Accession Number: | ED678493 |
| Database: | ERIC |
| Permalink: | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=ED678493 |
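
The abstract's point that an aggregate score can hide item-level differences can be illustrated with a minimal sketch. The response matrix below is hypothetical and is not drawn from the Texas data. It shows only that summing is a many-to-one mapping. Whether the discarded patterns carry useful information is the empirical question the paper addresses.

```python
import numpy as np

# Hypothetical 0/1 responses (rows: students, columns: items).
# Illustrative values only; not data from the paper.
responses = np.array([
    [1, 1, 1, 0, 0, 0],  # student A: first three items correct
    [0, 0, 0, 1, 1, 1],  # student B: last three items correct
    [1, 0, 1, 0, 1, 0],  # student C: alternating pattern
])

# The aggregate that administrators typically see: one total per student.
total_scores = responses.sum(axis=1)

# Count how many distinct totals and distinct item patterns exist.
n_distinct_totals = len(set(total_scores.tolist()))
n_distinct_patterns = len({tuple(row) for row in responses.tolist()})

print(total_scores.tolist())   # [3, 3, 3]
print(n_distinct_totals)       # 1
print(n_distinct_patterns)     # 3
```

All three hypothetical students receive the same total, yet each answered a different set of items correctly, so the total alone cannot distinguish them.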