Classification of Open-Ended Responses to a Research-Based Assessment Using Natural Language Processing
Saved in:
| Title: | Classification of Open-Ended Responses to a Research-Based Assessment Using Natural Language Processing |
|---|---|
| Language: | English |
| Authors: | Wilson, Joseph (ORCID |
| Source: | Physical Review Physics Education Research. Jan-Jun 2022 18(1). |
| Availability: | American Physical Society. One Physics Ellipse 4th Floor, College Park, MD 20740-3844. Tel: 301-209-3200; Fax: 301-209-0865; e-mail: assocpub@aps.org; Web site: http://prst-per.aps.org |
| Peer Reviewed: | Y |
| Page Count: | 16 |
| Publication Date: | 2022 |
| Sponsoring Agency: | National Science Foundation (NSF) |
| Contract Number: | PHY1734006 |
| Document Type: | Journal Articles Reports - Research |
| Education Level: | Higher Education Postsecondary Education |
| Descriptors: | Natural Language Processing, Science Education, Physics, Artificial Intelligence, Models, Data Analysis, Classification, Student Reaction, Test Format, College Students |
| Geographic Terms: | Colorado (Boulder) |
| DOI: | 10.1103/PhysRevPhysEducRes.18.010141 |
| ISSN: | 2469-9896 |
| Abstract: | Surveys have long been used in physics education research to understand student reasoning and inform course improvements. However, to make analysis of large sets of responses practical, most surveys use a closed-response format with a small set of potential responses. Open-ended formats, such as written free response, can provide deeper insights into student thinking, but take much longer to analyze, especially with a large number of responses. Here, we explore natural language processing as a computational solution to this problem. We create a machine learning model that can take student responses from the Physics Measurement Questionnaire as input, and output a categorization of student reasoning based on different reasoning paradigms. Our model yields classifications with the same level of agreement as that between two humans categorizing the data, but can be done by a computer, and thus can be scaled for large datasets. In this work, we describe the algorithms and methodologies used to create, train, and test our natural language processing system. We also present the results of the analysis and discuss the utility of these approaches for analyzing open-response data in education research. |
| Abstractor: | As Provided |
| Entry Date: | 2022 |
| Accession Number: | EJ1355094 |
| Database: | ERIC |
| FullText | Text: Availability: 0 |
|---|---|
| Header | DbId: eric DbLabel: ERIC An: EJ1355094 AccessLevel: 3 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 0 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: Classification of Open-Ended Responses to a Research-Based Assessment Using Natural Language Processing – Name: Language Label: Language Group: Lang Data: English – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Wilson%2C+Joseph%22">Wilson, Joseph</searchLink> (ORCID <externalLink term="https://orcid.org/0000-0003-2111-507X">0000-0003-2111-507X</externalLink>)<br /><searchLink fieldCode="AR" term="%22Pollard%2C+Benjamin%22">Pollard, Benjamin</searchLink> (ORCID <externalLink term="https://orcid.org/0000-0002-5109-6415">0000-0002-5109-6415</externalLink>)<br /><searchLink fieldCode="AR" term="%22Aiken%2C+John+M%2E%22">Aiken, John M.</searchLink> (ORCID <externalLink term="https://orcid.org/0000-0003-0717-4583">0000-0003-0717-4583</externalLink>)<br /><searchLink fieldCode="AR" term="%22Lewandowski%2C+H%2E+J%2E%22">Lewandowski, H. J.</searchLink> – Name: TitleSource Label: Source Group: Src Data: <searchLink fieldCode="SO" term="%22Physical+Review+Physics+Education+Research%22"><i>Physical Review Physics Education Research</i></searchLink>. Jan-Jun 2022 18(1). – Name: Avail Label: Availability Group: Avail Data: American Physical Society. One Physics Ellipse 4th Floor, College Park, MD 20740-3844. Tel: 301-209-3200; Fax: 301-209-0865; e-mail: assocpub@aps.org; Web site: http://prst-per.aps.org – Name: PeerReviewed Label: Peer Reviewed Group: SrcInfo Data: Y – Name: Pages Label: Page Count Group: Src Data: 16 – Name: DatePubCY Label: Publication Date Group: Date Data: 2022 – Name: SourceSuprt Label: Sponsoring Agency Group: SrcSuprt Data: National Science Foundation (NSF) – Name: NumberContract Label: Contract Number Group: NumCntrct Data: PHY1734006 – Name: TypeDocument Label: Document Type Group: TypDoc Data: Journal Articles<br />Reports - Research – Name: Audience Label: Education Level Group: Audnce Data: <searchLink fieldCode="EL" term="%22Higher+Education%22">Higher Education</searchLink><br /><searchLink fieldCode="EL" term="%22Postsecondary+Education%22">Postsecondary Education</searchLink> – Name: Subject Label: Descriptors Group: Su Data: <searchLink fieldCode="DE" term="%22Natural+Language+Processing%22">Natural Language Processing</searchLink><br /><searchLink fieldCode="DE" term="%22Science+Education%22">Science Education</searchLink><br /><searchLink fieldCode="DE" term="%22Physics%22">Physics</searchLink><br /><searchLink fieldCode="DE" term="%22Artificial+Intelligence%22">Artificial Intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Models%22">Models</searchLink><br /><searchLink fieldCode="DE" term="%22Data+Analysis%22">Data Analysis</searchLink><br /><searchLink fieldCode="DE" term="%22Classification%22">Classification</searchLink><br /><searchLink fieldCode="DE" term="%22Student+Reaction%22">Student Reaction</searchLink><br /><searchLink fieldCode="DE" term="%22Test+Format%22">Test Format</searchLink><br /><searchLink fieldCode="DE" term="%22College+Students%22">College Students</searchLink> – Name: Subject Label: Geographic Terms Group: Su Data: <searchLink fieldCode="DE" term="%22Colorado+%28Boulder%29%22">Colorado (Boulder)</searchLink> – Name: DOI Label: DOI Group: ID Data: 10.1103/PhysRevPhysEducRes.18.010141 – Name: ISSN Label: ISSN Group: ISSN Data: 2469-9896 – Name: Abstract Label: Abstract Group: Ab Data: Surveys have long been used in physics education research to understand student reasoning and inform course improvements. However, to make analysis of large sets of responses practical, most surveys use a closed-response format with a small set of potential responses. Open-ended formats, such as written free response, can provide deeper insights into student thinking, but take much longer to analyze, especially with a large number of responses. Here, we explore natural language processing as a computational solution to this problem. We create a machine learning model that can take student responses from the Physics Measurement Questionnaire as input, and output a categorization of student reasoning based on different reasoning paradigms. Our model yields classifications with the same level of agreement as that between two humans categorizing the data, but can be done by a computer, and thus can be scaled for large datasets. In this work, we describe the algorithms and methodologies used to create, train, and test our natural language processing system. We also present the results of the analysis and discuss the utility of these approaches for analyzing open-response data in education research. – Name: AbstractInfo Label: Abstractor Group: Ab Data: As Provided – Name: DateEntry Label: Entry Date Group: Date Data: 2022 – Name: AN Label: Accession Number Group: ID Data: EJ1355094 |
| PLink | https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=eric&AN=EJ1355094 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.1103/PhysRevPhysEducRes.18.010141 Languages: – Text: English PhysicalDescription: Pagination: PageCount: 16 Subjects: – SubjectFull: Natural Language Processing Type: general – SubjectFull: Science Education Type: general – SubjectFull: Physics Type: general – SubjectFull: Artificial Intelligence Type: general – SubjectFull: Models Type: general – SubjectFull: Data Analysis Type: general – SubjectFull: Classification Type: general – SubjectFull: Student Reaction Type: general – SubjectFull: Test Format Type: general – SubjectFull: College Students Type: general – SubjectFull: Colorado (Boulder) Type: general Titles: – TitleFull: Classification of Open-Ended Responses to a Research-Based Assessment Using Natural Language Processing Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Wilson, Joseph – PersonEntity: Name: NameFull: Pollard, Benjamin – PersonEntity: Name: NameFull: Aiken, John M. – PersonEntity: Name: NameFull: Lewandowski, H. J. IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 01 Type: published Y: 2022 Identifiers: – Type: issn-electronic Value: 2469-9896 Numbering: – Type: volume Value: 18 – Type: issue Value: 1 Titles: – TitleFull: Physical Review Physics Education Research Type: main |
| ResultId | 1 |