Evaluating the Language ENvironment Analysis System for Korean

Saved in:
Bibliographic Details
Title: Evaluating the Language ENvironment Analysis System for Korean
Language: English
Authors: McDonald, Margarethe (ORCID 0000-0002-9620-8556), Kwon, Taeahn, Kim, Hyunji, Lee, Youngki, Ko, Eon-Suk (ORCID 0000-0003-3963-4492)
Source: Journal of Speech, Language, and Hearing Research. Mar 2021 64(3):792-808.
Availability: American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
Peer Reviewed: Y
Page Count: 17
Publication Date: 2021
Document Type: Journal Articles
Reports - Research
Descriptors: Computational Linguistics, Korean, Audio Equipment, Accuracy, Error Patterns, Classification, Measures (Individuals), Infants, Foreign Countries, Interrater Reliability, Recall (Psychology), Speech Evaluation, Databases, Correlation, Language Acquisition
Geographic Terms: South Korea
DOI: 10.1044/2020_JSLHR-20-00489
ISSN: 1092-4388
Abstract: Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key children aged 7-18 months from a larger data set. We establish the identification error rate, precision, and recall of LENA classification compared to human coders. We then examine the correlation between standard LENA measures of adult word count, child vocalization count, and conversational turn count and human counts of the same measures. Results: Our identification error rate (64% or 67%), including false alarm, confusion, and misses, was similar to the rate found in Cristia, Lavechin, et al. (2020). The correlation between LENA and human counts for adult word count (r = 0.78 or 0.79) was similar to that found in the other studies, but the same measure for child vocalization count (r = 0.34-0.47) was lower than the value in Cristia, Lavechin, et al., though it fell within ranges found in other non-European languages. The correlation between LENA and human conversational turn count was not high (r = 0.36-0.47), similar to the findings in other studies. Conclusions: LENA technology is similarly reliable for Korean language environments as it is for other non-English language environments. Factors affecting the accuracy of diarization include speakers' pitch, duration of utterances, age, and the presence of noise and electronic sounds.
Abstractor: As Provided
Entry Date: 2021
Accession Number: EJ1294468
Database: ERIC
Be the first to leave a comment!
You must be logged in first