Evaluating the Language ENvironment Analysis System for Korean
| Title: | Evaluating the Language ENvironment Analysis System for Korean |
|---|---|
| Language: | English |
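| Authors: | McDonald, Margarethe (ORCID 0000-0002-9620-8556); Kwon, Taeahn; Kim, Hyunji; Lee, Youngki; Ko, Eon-Suk (ORCID 0000-0003-3963-4492) |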
| Source: | Journal of Speech, Language, and Hearing Research. Mar 2021 64(3):792-808. |
| Availability: | American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org |
| Peer Reviewed: | Y |
| Page Count: | 17 |
| Publication Date: | 2021 |
| Document Type: | Journal Articles; Reports - Research |
| Descriptors: | Computational Linguistics, Korean, Audio Equipment, Accuracy, Error Patterns, Classification, Measures (Individuals), Infants, Foreign Countries, Interrater Reliability, Recall (Psychology), Speech Evaluation, Databases, Correlation, Language Acquisition |
| Geographic Terms: | South Korea |
| DOI: | 10.1044/2020_JSLHR-20-00489 |
| ISSN: | 1092-4388 |
| Abstract: | Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key children aged 7-18 months from a larger data set. We establish the identification error rate, precision, and recall of LENA classification compared to human coders. We then examine the correlation between standard LENA measures of adult word count, child vocalization count, and conversational turn count and human counts of the same measures. Results: Our identification error rate (64% or 67%), including false alarm, confusion, and misses, was similar to the rate found in Cristia, Lavechin, et al. (2020). The correlation between LENA and human counts for adult word count (r = 0.78 or 0.79) was similar to that found in the other studies, but the same measure for child vocalization count (r = 0.34-0.47) was lower than the value in Cristia, Lavechin, et al., though it fell within ranges found in other non-European languages. The correlation between LENA and human conversational turn count was not high (r = 0.36-0.47), similar to the findings in other studies. Conclusions: LENA technology is similarly reliable for Korean language environments as it is for other non-English language environments. Factors affecting the accuracy of diarization include speakers' pitch, duration of utterances, age, and the presence of noise and electronic sounds. |
| Abstractor: | As Provided |
| Entry Date: | 2021 |
| Accession Number: | EJ1294468 |
| Database: | ERIC |
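
The abstract reports three kinds of agreement between LENA and human coders: an identification error rate built from false alarms, confusions, and misses; per-class precision and recall; and a Pearson correlation between counts. The sketch below illustrates how such figures are commonly computed. It is not the authors' pipeline: the frame-level representation, the labels `FEM`, `CHI`, and `SIL`, the function names, and the toy data are all assumptions made for illustration, and the error rate follows the usual diarization convention of dividing the summed errors by the amount of reference speech.

```python
from math import sqrt

def identification_error_rate(reference, predicted, silence="SIL"):
    """Percent error over frames, relative to the reference speech frames.

    Assumed definition: (false alarm + confusion + miss) / reference speech.
    """
    false_alarm = confusion = miss = speech = 0
    for ref, pred in zip(reference, predicted):
        if ref != silence:
            speech += 1
            if pred == silence:
                miss += 1          # speech in reference, nothing predicted
            elif pred != ref:
                confusion += 1     # speech predicted, but wrong speaker class
        elif pred != silence:
            false_alarm += 1       # speech predicted where reference is silent
    return 100.0 * (false_alarm + confusion + miss) / speech

def precision_recall(reference, predicted, label):
    """Precision and recall of one class, counted frame by frame."""
    true_pos = sum(r == label and p == label for r, p in zip(reference, predicted))
    predicted_pos = sum(p == label for p in predicted)
    reference_pos = sum(r == label for r in reference)
    return true_pos / predicted_pos, true_pos / reference_pos

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of counts."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)

# Toy frame labels (hypothetical): human reference vs. automatic output.
human = ["FEM", "FEM", "CHI", "CHI", "SIL", "SIL", "FEM", "CHI"]
auto  = ["FEM", "CHI", "CHI", "SIL", "SIL", "FEM", "FEM", "CHI"]

ier = identification_error_rate(human, auto)          # one miss, one confusion, one false alarm
chi_precision, chi_recall = precision_recall(human, auto, "CHI")

# Toy per-clip counts (hypothetical): automatic vs. human adult word counts.
r = pearson_r([120, 80, 200, 40], [110, 95, 180, 60])
```

Because the denominator is reference speech only, this error rate can exceed 100% when false alarms are frequent, which is worth keeping in mind when reading figures such as the 64% and 67% reported in the abstract.
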