Rethinking Data Use in Large Language Models.

Saved in:
Bibliographic Details
Title: Rethinking Data Use in Large Language Models.
Authors: Min, Sewon1, sewon@cs.washington.edu, Hajishirzi, Hannaneh1, hannaneh@cs.washington.edu, Zettlemoyer, Luke1, lsz@cs.washington.edu
Source: Computational Linguistics; Dec2025, Vol. 51 Issue 4, p1033-1118, 86p
Database: Applied Science & Technology Source
Full text is not displayed to guests.
FullText Links:
  – Type: pdflink
Text:
  Availability: 1
Header DbId: aci
DbLabel: Applied Science & Technology Source
An: 190991888
AccessLevel: 2
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Rethinking Data Use in Large Language Models.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AU" term="%22Min%2C+Sewon%22">Min, Sewon</searchLink><relatesTo>1</relatesTo>, <i>sewon@cs.washington.edu</i><br /><searchLink fieldCode="AU" term="%22Hajishirzi%2C+Hannaneh%22">Hajishirzi, Hannaneh</searchLink><relatesTo>1</relatesTo>, <i>hannaneh@cs.washington.edu</i><br /><searchLink fieldCode="AU" term="%22Zettlemoyer%2C+Luke%22">Zettlemoyer, Luke</searchLink><relatesTo>1</relatesTo>, <i>lsz@cs.washington.edu</i>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="JN" term="%22Computational+Linguistics%22">Computational Linguistics</searchLink>; Dec2025, Vol. 51 Issue 4, p1033-1118, 86p
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=aci&AN=190991888
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1162/COLI.a.573
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 86
        StartPage: 1033
    Titles:
      – TitleFull: Rethinking Data Use in Large Language Models.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Min, Sewon
      – PersonEntity:
          Name:
            NameFull: Hajishirzi, Hannaneh
      – PersonEntity:
          Name:
            NameFull: Zettlemoyer, Luke
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 12
              Text: Dec2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 08912017
          Numbering:
            – Type: volume
              Value: 51
            – Type: issue
              Value: 4
          Titles:
            – TitleFull: Computational Linguistics
              Type: main
ResultId 1