Modeling Lexical and Phraseological Sophistication in Oral Proficiency Interviews: A Conceptual Replication
Saved in:
| Title: | Modeling Lexical and Phraseological Sophistication in Oral Proficiency Interviews: A Conceptual Replication |
|---|---|
| Language: | English |
| Authors: | Masaki Eguchi |
| Source: | Vocabulary Learning and Instruction. 2022 11(2):1-16. |
| Availability: | Castledown Publishers. Ground Level, 470 St Kilda Road, Melbourne, 3004, Australia. Tel: +61-3-7003-8355; e-mail: contact@castledown.com; Web site: https://www.castledown.com/journals/vli |
| Peer Reviewed: | Y |
| Page Count: | 16 |
| Publication Date: | 2022 |
| Document Type: | Journal Articles Reports - Research |
| Descriptors: | Lexicology, Oral Language, Language Proficiency, Vocabulary Development, Natural Language Processing, Factor Analysis, Computational Linguistics, Bayesian Statistics, Prediction, Accuracy, Phrase Structure |
| ISSN: | 2981-9954 |
| Abstract: | Building on previous studies investigating the multidimensional nature of lexical use in task-based L2 performance, this study clarified the roles that the distinct lexical features play in predicting vocabulary proficiency in a corpus of L2 Oral Proficiency Interviews (OPI). A total of 85 OPI samples were rated by three separate raters based on a Common European Frame of Reference (CEFR) based rubric in terms of their linguistic range. The interview transcription was analyzed for 56 lexical and phraseological indices using modern natural language processing tools. The result of an exploratory factor analysis (EFA) revealed that the 56 indices tapped into 10 distinct factors of lexical use in OPI: three factors related to content words, three related to n-grams, three lexical collocation factors, and one function-word factor. A subsequent Bayesian mixed-effect ordinal regression indicated that six out of the 10 factors meaningfully predicted the CEFR levels on Range with reasonable accuracy (quadratic kappa coefficient = 0.81 with the human rating). The result highlights the distinct roles that multiple content-word, collocation, and function-word factors play in characterizing the linguistic range in a CEFR-based assessment of OPI. The implication for the assessment of lexical richness, as well as future directions of this research domain, are discussed. |
| Abstractor: | As Provided |
| Entry Date: | 2024 |
| Accession Number: | EJ1444025 |
| Database: | ERIC |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://eric.ed.gov/contentdelivery/servlet/ERICServlet?accno=EJ1444025 Name: ERIC Full Text Category: fullText Text: Full Text from ERIC |
|---|---|
| Header | DbId: eric DbLabel: ERIC An: EJ1444025 AccessLevel: 3 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Modeling Lexical and Phraseological Sophistication in Oral Proficiency Interviews: A Conceptual Replication – Name: Language Label: Language Group: Lang Data: English – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Masaki+Eguchi%22">Masaki Eguchi</searchLink> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="SO" term="%22Vocabulary+Learning+and+Instruction%22"><i>Vocabulary Learning and Instruction</i></searchLink>. 2022 11(2):1-16. – Name: Avail Label: Availability Group: Avail Data: Castledown Publishers. Ground Level, 470 St Kilda Road, Melbourne, 3004, Australia. Tel: +61-3-7003-8355; e-mail: contact@castledown.com; Web site: https://www.castledown.com/journals/vli – Name: PeerReviewed Label: Peer Reviewed Group: SrcInfo Data: Y – Name: Pages Label: Page Count Group: Src Data: 16 – Name: DatePubCY Label: Publication Date Group: Date Data: 2022 – Name: TypeDocument Label: Document Type Group: TypDoc Data: Journal Articles<br />Reports - Research – Name: Subject Label: Descriptors Group: Su Data: <searchLink fieldCode="DE" term="%22Lexicology%22">Lexicology</searchLink><br /><searchLink fieldCode="DE" term="%22Oral+Language%22">Oral Language</searchLink><br /><searchLink fieldCode="DE" term="%22Language+Proficiency%22">Language Proficiency</searchLink><br /><searchLink fieldCode="DE" term="%22Vocabulary+Development%22">Vocabulary Development</searchLink><br /><searchLink fieldCode="DE" term="%22Natural+Language+Processing%22">Natural Language Processing</searchLink><br /><searchLink fieldCode="DE" term="%22Factor+Analysis%22">Factor Analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Computational+Linguistics%22">Computational Linguistics</searchLink><br /><searchLink fieldCode="DE" term="%22Bayesian+Statistics%22">Bayesian Statistics</searchLink><br /><searchLink fieldCode="DE" term="%22Prediction%22">Prediction</searchLink><br /><searchLink fieldCode="DE" term="%22Accuracy%22">Accuracy</searchLink><br /><searchLink fieldCode="DE" term="%22Phrase+Structure%22">Phrase Structure</searchLink> – Name: ISSN Label: ISSN Group: ISSN Data: 2981-9954 – Name: Abstract Label: Abstract Group: Ab Data: Building on previous studies investigating the multidimensional nature of lexical use in task-based L2 performance, this study clarified the roles that the distinct lexical features play in predicting vocabulary proficiency in a corpus of L2 Oral Proficiency Interviews (OPI). A total of 85 OPI samples were rated by three separate raters based on a Common European Frame of Reference (CEFR) based rubric in terms of their linguistic range. The interview transcription was analyzed for 56 lexical and phraseological indices using modern natural language processing tools. The result of an exploratory factor analysis (EFA) revealed that the 56 indices tapped into 10 distinct factors of lexical use in OPI: three factors related to content words, three related to n-grams, three lexical collocation factors, and one function-word factor. A subsequent Bayesian mixed-effect ordinal regression indicated that six out of the 10 factors meaningfully predicted the CEFR levels on Range with reasonable accuracy (quadratic kappa coefficient = 0.81 with the human rating). The result highlights the distinct roles that multiple content-word, collocation, and function-word factors play in characterizing the linguistic range in a CEFR-based assessment of OPI. The implication for the assessment of lexical richness, as well as future directions of this research domain, are discussed. – Name: AbstractInfo Label: Abstractor Group: Ab Data: As Provided – Name: DateEntry Label: Entry Date Group: Date Data: 2024 – Name: AN Label: Accession Number Group: ID Data: EJ1444025 |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=EJ1444025 |
| RecordInfo | BibRecord: BibEntity: Languages: – Text: English PhysicalDescription: Pagination: PageCount: 16 StartPage: 1 Subjects: – SubjectFull: Lexicology Type: general – SubjectFull: Oral Language Type: general – SubjectFull: Language Proficiency Type: general – SubjectFull: Vocabulary Development Type: general – SubjectFull: Natural Language Processing Type: general – SubjectFull: Factor Analysis Type: general – SubjectFull: Computational Linguistics Type: general – SubjectFull: Bayesian Statistics Type: general – SubjectFull: Prediction Type: general – SubjectFull: Accuracy Type: general – SubjectFull: Phrase Structure Type: general Titles: – TitleFull: Modeling Lexical and Phraseological Sophistication in Oral Proficiency Interviews: A Conceptual Replication Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Masaki Eguchi IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 12 Type: published Y: 2022 Identifiers: – Type: issn-electronic Value: 2981-9954 Numbering: – Type: volume Value: 11 – Type: issue Value: 2 Titles: – TitleFull: Vocabulary Learning and Instruction Type: main |
| ResultId | 1 |