Compiling the First Spoken Corpus for Turkish Youth Talk: Overview of the Corpus and Methodological Issues

Saved in:
Bibliographic Details
Title: Compiling the First Spoken Corpus for Turkish Youth Talk: Overview of the Corpus and Methodological Issues
Language: English
Authors: Esranur Efeoglu-Özcan (ORCID 0000-0001-9627-4628), Hale Isik-Güler (ORCID 0000-0002-6859-9377)
Source: Australian Review of Applied Linguistics. 2026 49(1):58-86.
Availability: John Benjamins Publishing Company. Klaprozenweg 105 Postbus 36224, NL-1020 ME Amsterdam, Netherlands. Tel: +31-20-6304747; Fax: +31-20-6739773; e-mail: subscription@benjamins.nl; Web site: https://www.benjamins.com
Peer Reviewed: Y
Page Count: 29
Publication Date: 2026
Document Type: Journal Articles
Reports - Research
Education Level: High Schools
Secondary Education
Descriptors: Foreign Countries, Turkish, Speech Communication, Interpersonal Communication, Adolescents, Audio Equipment, Interaction, Friendship, High School Students, Native Speakers, Socioeconomic Status, Individual Characteristics, Transcripts (Written Records)
Geographic Terms: Turkey
DOI: 10.1075/aral.25007.efe
ISSN: 0155-0640
1833-7139
Abstract: This paper addresses issues related to the design and compilation of the first spoken corpus of youth talk in an under-represented language in corpus linguistics, Turkish. Designed to offer a maximally representative sample of Turkish youth talk, the Corpus of Turkish Youth Language (CoTY) is a 168,748-token specialised corpus within the single register of informal, naturally occurring and spontaneous interaction exclusively among friends. The speakers are Turkish-speaking youth aged 14 to 18 from diverse socio-economic backgrounds in Türkiye. In this paper, the issues that surfaced during corpus design and construction are presented, with a discussion and justification of the methodological choices in relation to the long-term project objectives. The corpus contributes to the field as a valuable resource and tool for cross-linguistic youth language research. As an overarching fundamental goal, the project also aims to expand on the cumulative linguistic and methodological knowledge in spoken corpus design and construction.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1497611
Database: ERIC
Description
Abstract:This paper addresses issues related to the design and compilation of the first spoken corpus of youth talk in an under-represented language in corpus linguistics, Turkish. Designed to offer a maximally representative sample of Turkish youth talk, the Corpus of Turkish Youth Language (CoTY) is a 168,748-token specialised corpus within the single register of informal, naturally occurring and spontaneous interaction exclusively among friends. The speakers are Turkish-speaking youth aged 14 to 18 from diverse socio-economic backgrounds in Türkiye. In this paper, the issues that surfaced during corpus design and construction are presented, with a discussion and justification of the methodological choices in relation to the long-term project objectives. The corpus contributes to the field as a valuable resource and tool for cross-linguistic youth language research. As an overarching fundamental goal, the project also aims to expand on the cumulative linguistic and methodological knowledge in spoken corpus design and construction.
ISSN:0155-0640
1833-7139
DOI:10.1075/aral.25007.efe