Assessing Academic Language in Tenth Grade Essays Using Natural Language Processing

Saved in:
Bibliographic Details
Title: Assessing Academic Language in Tenth Grade Essays Using Natural Language Processing
Language: English
Authors: Andrew Potter, Mitchell Shortt, Maria Goldshtein, Rod D. Roscoe
Source: Grantee Submission. 2025 64.
Peer Reviewed: Y
Page Count: 18
Publication Date: 2025
Sponsoring Agency: Institute of Education Sciences (ED)
Contract Number: R305A180261
Document Type: Journal Articles
Reports - Research
Education Level: Grade 10
High Schools
Secondary Education
Descriptors: Academic Language, Natural Language Processing, Grammar, Vocabulary Skills, Writing Instruction, Essays, Syntax, Connected Discourse, Grade 10, Computational Linguistics, High School Students, Writing Evaluation, Computer Software, Evaluators, Plagiarism, Information Sources, Language Usage
DOI: 10.1016/j.asw.2025.100921
ISSN: 1075-2935
Abstract: Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to which features related to academic language explained variance in human-assigned scores of writing quality in a large corpus of source-based argumentative essays (n?=?20,820) written by 10th grade students. Using NLP tools, we identified and then calculated linguistic features from essays related to the lexical, syntactic, cohesion, and rhetorical features of academic language. Consistent with prior research findings, results from a hierarchical linear regression revealed that AL features explained 8?% of variance in writing quality when controlling for essay length. The most important AL features included cohesion with the source text, academic wording, and global cohesion. Implications for integrating NLP-produced measures of AL in writing assessment and automated writing evaluation (AWE) systems are discussed.
Abstractor: As Provided
IES Funded: Yes
Entry Date: 2025
Accession Number: ED671643
Database: ERIC
Description
Abstract:Broadly defined, academic language (AL) is a set of lexical-grammatical norms and registers commonly used in educational and academic discourse. Mastery of academic language in writing is an important aspect of writing instruction and assessment. The purpose of this study was to use Natural Language Processing (NLP) tools to examine the extent to which features related to academic language explained variance in human-assigned scores of writing quality in a large corpus of source-based argumentative essays (n?=?20,820) written by 10th grade students. Using NLP tools, we identified and then calculated linguistic features from essays related to the lexical, syntactic, cohesion, and rhetorical features of academic language. Consistent with prior research findings, results from a hierarchical linear regression revealed that AL features explained 8?% of variance in writing quality when controlling for essay length. The most important AL features included cohesion with the source text, academic wording, and global cohesion. Implications for integrating NLP-produced measures of AL in writing assessment and automated writing evaluation (AWE) systems are discussed.
ISSN:1075-2935
DOI:10.1016/j.asw.2025.100921