Compiling the First Spoken Corpus for Turkish Youth Talk: Overview of the Corpus and Methodological Issues

Bibliographic Details
Title: Compiling the First Spoken Corpus for Turkish Youth Talk: Overview of the Corpus and Methodological Issues
Language: English
Authors: Esranur Efeoglu-Özcan (ORCID 0000-0001-9627-4628), Hale Isik-Güler (ORCID 0000-0002-6859-9377)
Source: Australian Review of Applied Linguistics. 2026 49(1):58-86.
Availability: John Benjamins Publishing Company. Klaprozenweg 105 Postbus 36224, NL-1020 ME Amsterdam, Netherlands. Tel: +31-20-6304747; Fax: +31-20-6739773; e-mail: subscription@benjamins.nl; Web site: https://www.benjamins.com
Peer Reviewed: Y
Page Count: 29
Publication Date: 2026
Document Type: Journal Articles; Reports - Research
Education Level: High Schools; Secondary Education
Descriptors: Foreign Countries, Turkish, Speech Communication, Interpersonal Communication, Adolescents, Audio Equipment, Interaction, Friendship, High School Students, Native Speakers, Socioeconomic Status, Individual Characteristics, Transcripts (Written Records)
Geographic Terms: Turkey
DOI: 10.1075/aral.25007.efe
ISSN: 0155-0640 (print); 1833-7139 (electronic)
Abstract: This paper addresses issues related to the design and compilation of the first spoken corpus of youth talk in an under-represented language in corpus linguistics, Turkish. Designed to offer a maximally representative sample of Turkish youth talk, the Corpus of Turkish Youth Language (CoTY) is a 168,748-token specialised corpus within the single register of informal, naturally occurring and spontaneous interaction exclusively among friends. The speakers are Turkish-speaking youth aged 14 to 18 from diverse socio-economic backgrounds in Türkiye. In this paper, the issues that surfaced during corpus design and construction are presented, with a discussion and justification of the methodological choices in relation to the long-term project objectives. The corpus contributes to the field as a valuable resource and tool for cross-linguistic youth language research. As an overarching fundamental goal, the project also aims to expand on the cumulative linguistic and methodological knowledge in spoken corpus design and construction.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1497611
Database: ERIC
Permalink: https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=EJ1497611