Evaluating the Language ENvironment Analysis System for Korean

Saved in:
Bibliographic Details
Title: Evaluating the Language ENvironment Analysis System for Korean
Language: English
Authors: McDonald, Margarethe (ORCID 0000-0002-9620-8556), Kwon, Taeahn, Kim, Hyunji, Lee, Youngki, Ko, Eon-Suk (ORCID 0000-0003-3963-4492)
Source: Journal of Speech, Language, and Hearing Research. Mar 2021 64(3):792-808.
Availability: American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
Peer Reviewed: Y
Page Count: 17
Publication Date: 2021
Document Type: Journal Articles
Reports - Research
Descriptors: Computational Linguistics, Korean, Audio Equipment, Accuracy, Error Patterns, Classification, Measures (Individuals), Infants, Foreign Countries, Interrater Reliability, Recall (Psychology), Speech Evaluation, Databases, Correlation, Language Acquisition
Geographic Terms: South Korea
DOI: 10.1044/2020_JSLHR-20-00489
ISSN: 1092-4388
Abstract: Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key children aged 7-18 months from a larger data set. We establish the identification error rate, precision, and recall of LENA classification compared to human coders. We then examine the correlation between standard LENA measures of adult word count, child vocalization count, and conversational turn count and human counts of the same measures. Results: Our identification error rate (64% or 67%), including false alarm, confusion, and misses, was similar to the rate found in Cristia, Lavechin, et al. (2020). The correlation between LENA and human counts for adult word count (r = 0.78 or 0.79) was similar to that found in the other studies, but the same measure for child vocalization count (r = 0.34-0.47) was lower than the value in Cristia, Lavechin, et al., though it fell within ranges found in other non-European languages. The correlation between LENA and human conversational turn count was not high (r = 0.36-0.47), similar to the findings in other studies. Conclusions: LENA technology is similarly reliable for Korean language environments as it is for other non-English language environments. Factors affecting the accuracy of diarization include speakers' pitch, duration of utterances, age, and the presence of noise and electronic sounds.
Abstractor: As Provided
Entry Date: 2021
Accession Number: EJ1294468
Database: ERIC
FullText Links:
  – Type: pdflink
    Url: https://content.ebscohost.com/cds/retrieve?content=AQICAHj0k_4E0hTGH8RJwT4gCJyBsGNe_WN95AvKlDbXJGqwxwERi7JQP57uy-0Do5axF842AAAA4jCB3wYJKoZIhvcNAQcGoIHRMIHOAgEAMIHIBgkqhkiG9w0BBwEwHgYJYIZIAWUDBAEuMBEEDH3cNgkFdHGw4ReTvAIBEICBmh0aXWdLx1e1ykSFi9WkvwFBarjFRoqZcmaaufDCxDMyN1OQFgWjNyvi06CJ_BCjEHsxNuEiJASTqVAze6F-QcaFCQ-9WG4QNCdDBikqcCUO86-v82cYpOjXr_HX_C1Q3xDymFShYU9lx_QqwhBiYll9uxlQUbGE4pJsPmHIdcI29ePfycroOVhJgDrEJnA5z9lWi3Nr33cmmF0=
Text:
  Availability: 0
Header DbId: eric
DbLabel: ERIC
An: EJ1294468
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 0
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Evaluating the Language ENvironment Analysis System for Korean
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22McDonald%2C+Margarethe%22">McDonald, Margarethe</searchLink> (ORCID <externalLink term="http://orcid.org/0000-0002-9620-8556">0000-0002-9620-8556</externalLink>)<br /><searchLink fieldCode="AR" term="%22Kwon%2C+Taeahn%22">Kwon, Taeahn</searchLink><br /><searchLink fieldCode="AR" term="%22Kim%2C+Hyunji%22">Kim, Hyunji</searchLink><br /><searchLink fieldCode="AR" term="%22Lee%2C+Youngki%22">Lee, Youngki</searchLink><br /><searchLink fieldCode="AR" term="%22Ko%2C+Eon-Suk%22">Ko, Eon-Suk</searchLink> (ORCID <externalLink term="https://orcid.org/0000-0003-3963-4492">0000-0003-3963-4492</externalLink>)
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <searchLink fieldCode="SO" term="%22Journal+of+Speech%2C+Language%2C+and+Hearing+Research%22"><i>Journal of Speech, Language, and Hearing Research</i></searchLink>. Mar 2021 64(3):792-808.
– Name: Avail
  Label: Availability
  Group: Avail
  Data: American Speech-Language-Hearing Association. 2200 Research Blvd #250, Rockville, MD 20850. Tel: 301-296-5700; Fax: 301-296-8580; e-mail: slhr@asha.org; Web site: http://jslhr.pubs.asha.org
– Name: PeerReviewed
  Label: Peer Reviewed
  Group: SrcInfo
  Data: Y
– Name: Pages
  Label: Page Count
  Group: Src
  Data: 17
– Name: DatePubCY
  Label: Publication Date
  Group: Date
  Data: 2021
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Journal Articles<br />Reports - Research
– Name: Subject
  Label: Descriptors
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Computational+Linguistics%22">Computational Linguistics</searchLink><br /><searchLink fieldCode="DE" term="%22Korean%22">Korean</searchLink><br /><searchLink fieldCode="DE" term="%22Audio+Equipment%22">Audio Equipment</searchLink><br /><searchLink fieldCode="DE" term="%22Accuracy%22">Accuracy</searchLink><br /><searchLink fieldCode="DE" term="%22Error+Patterns%22">Error Patterns</searchLink><br /><searchLink fieldCode="DE" term="%22Classification%22">Classification</searchLink><br /><searchLink fieldCode="DE" term="%22Measures+%28Individuals%29%22">Measures (Individuals)</searchLink><br /><searchLink fieldCode="DE" term="%22Infants%22">Infants</searchLink><br /><searchLink fieldCode="DE" term="%22Foreign+Countries%22">Foreign Countries</searchLink><br /><searchLink fieldCode="DE" term="%22Interrater+Reliability%22">Interrater Reliability</searchLink><br /><searchLink fieldCode="DE" term="%22Recall+%28Psychology%29%22">Recall (Psychology)</searchLink><br /><searchLink fieldCode="DE" term="%22Speech+Evaluation%22">Speech Evaluation</searchLink><br /><searchLink fieldCode="DE" term="%22Databases%22">Databases</searchLink><br /><searchLink fieldCode="DE" term="%22Correlation%22">Correlation</searchLink><br /><searchLink fieldCode="DE" term="%22Language+Acquisition%22">Language Acquisition</searchLink>
– Name: Subject
  Label: Geographic Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22South+Korea%22">South Korea</searchLink>
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1044/2020_JSLHR-20-00489
– Name: ISSN
  Label: ISSN
  Group: ISSN
  Data: 1092-4388
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key children aged 7-18 months from a larger data set. We establish the identification error rate, precision, and recall of LENA classification compared to human coders. We then examine the correlation between standard LENA measures of adult word count, child vocalization count, and conversational turn count and human counts of the same measures. Results: Our identification error rate (64% or 67%), including false alarm, confusion, and misses, was similar to the rate found in Cristia, Lavechin, et al. (2020). The correlation between LENA and human counts for adult word count (r = 0.78 or 0.79) was similar to that found in the other studies, but the same measure for child vocalization count (r = 0.34-0.47) was lower than the value in Cristia, Lavechin, et al., though it fell within ranges found in other non-European languages. The correlation between LENA and human conversational turn count was not high (r = 0.36-0.47), similar to the findings in other studies. Conclusions: LENA technology is similarly reliable for Korean language environments as it is for other non-English language environments. Factors affecting the accuracy of diarization include speakers' pitch, duration of utterances, age, and the presence of noise and electronic sounds.
– Name: AbstractInfo
  Label: Abstractor
  Group: Ab
  Data: As Provided
– Name: DateEntry
  Label: Entry Date
  Group: Date
  Data: 2021
– Name: AN
  Label: Accession Number
  Group: ID
  Data: EJ1294468
PLink https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=EJ1294468
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1044/2020_JSLHR-20-00489
    Languages:
      – Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 17
        StartPage: 792
    Subjects:
      – SubjectFull: Computational Linguistics
        Type: general
      – SubjectFull: Korean
        Type: general
      – SubjectFull: Audio Equipment
        Type: general
      – SubjectFull: Accuracy
        Type: general
      – SubjectFull: Error Patterns
        Type: general
      – SubjectFull: Classification
        Type: general
      – SubjectFull: Measures (Individuals)
        Type: general
      – SubjectFull: Infants
        Type: general
      – SubjectFull: Foreign Countries
        Type: general
      – SubjectFull: Interrater Reliability
        Type: general
      – SubjectFull: Recall (Psychology)
        Type: general
      – SubjectFull: Speech Evaluation
        Type: general
      – SubjectFull: Databases
        Type: general
      – SubjectFull: Correlation
        Type: general
      – SubjectFull: Language Acquisition
        Type: general
      – SubjectFull: South Korea
        Type: general
    Titles:
      – TitleFull: Evaluating the Language ENvironment Analysis System for Korean
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: McDonald, Margarethe
      – PersonEntity:
          Name:
            NameFull: Kwon, Taeahn
      – PersonEntity:
          Name:
            NameFull: Kim, Hyunji
      – PersonEntity:
          Name:
            NameFull: Lee, Youngki
      – PersonEntity:
          Name:
            NameFull: Ko, Eon-Suk
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 03
              Type: published
              Y: 2021
          Identifiers:
            – Type: issn-print
              Value: 1092-4388
          Numbering:
            – Type: volume
              Value: 64
            – Type: issue
              Value: 3
          Titles:
            – TitleFull: Journal of Speech, Language, and Hearing Research
              Type: main
ResultId 1