Assessing Academic Language in Tenth Grade Essays Using Natural Language Processing
Saved in:
| Title: | Assessing Academic Language in Tenth Grade Essays Using Natural Language Processing |
|---|---|
| Language: | English |
| Authors: | Andrew Potter, Mitchell Shortt, Maria Goldshtein, Rod D. Roscoe |
| Source: | Grantee Submission. 2025 64. |
| Peer Reviewed: | Y |
| Page Count: | 18 |
| Publication Date: | 2025 |
| Sponsoring Agency: | Institute of Education Sciences (ED) |
| Contract Number: | R305A180261 |
| Document Type: | Journal Articles Reports - Research |
| Education Level: | Grade 10 High Schools Secondary Education |
| Descriptors: | Academic Language, Natural Language Processing, Grammar, Vocabulary Skills, Writing Instruction, Essays, Syntax, Connected Discourse, Grade 10, Computational Linguistics, High School Students, Writing Evaluation, Computer Software, Evaluators, Plagiarism, Information Sources, Language Usage |
| DOI: | 10.1016/j.asw.2025.100921 |
| ISSN: | 1075-2935 |
| Abstract: | Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to which features related to academic language explained variance in human-assigned scores of writing quality in a large corpus of source-based argumentative essays (n?=?20,820) written by 10th grade students. Using NLP tools, we identified and then calculated linguistic features from essays related to the lexical, syntactic, cohesion, and rhetorical features of academic language. Consistent with prior research findings, results from a hierarchical linear regression revealed that AL features explained 8?% of variance in writing quality when controlling for essay length. The most important AL features included cohesion with the source text, academic wording, and global cohesion. Implications for integrating NLP-produced measures of AL in writing assessment and automated writing evaluation (AWE) systems are discussed. |
| Abstractor: | As Provided |
| IES Funded: | Yes |
| Entry Date: | 2025 |
| Accession Number: | ED671643 |
| Database: | ERIC |
| Abstract: | Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to which features related to academic language explained variance in human-assigned scores of writing quality in a large corpus of source-based argumentative essays (n?=?20,820) written by 10th grade students. Using NLP tools, we identified and then calculated linguistic features from essays related to the lexical, syntactic, cohesion, and rhetorical features of academic language. Consistent with prior research findings, results from a hierarchical linear regression revealed that AL features explained 8?% of variance in writing quality when controlling for essay length. The most important AL features included cohesion with the source text, academic wording, and global cohesion. Implications for integrating NLP-produced measures of AL in writing assessment and automated writing evaluation (AWE) systems are discussed. |
|---|---|
| ISSN: | 1075-2935 |
| DOI: | 10.1016/j.asw.2025.100921 |