Advancing Sustainable Development Goal 4 -- A Comparative Analysis of Large Language Models

Saved in:
Bibliographic Details
Title: Advancing Sustainable Development Goal 4 -- A Comparative Analysis of Large Language Models
Language: English
Authors: Anshul Saxena (ORCID 0000-0002-5573-2802), Bikramjit Rishi (ORCID 0000-0002-7934-0707)
Source: International Journal of Educational Management. 2025 39(7):1774-1791.
Availability: Emerald Publishing Limited. Howard House, Wagon Lane, Bingley, West Yorkshire, BD16 1WA, UK. Tel: +44-1274-777700; Fax: +44-1274-785201; e-mail: emerald@emeraldinsight.com; Web site: http://www.emerald.com/insight
Peer Reviewed: Y
Page Count: 18
Publication Date: 2025
Document Type: Journal Articles
Reports - Research
Education Level: Higher Education
Postsecondary Education
Descriptors: Artificial Intelligence, Sustainable Development, Masters Programs, Business Education, Curriculum Design, Technology Uses in Education, Readability, Readability Formulas, Computer Assisted Design
Assessment and Survey Identifiers: Flesch Reading Ease Formula, Flesch Kincaid Grade Level Formula
DOI: 10.1108/IJEM-12-2024-0837
ISSN: 0951-354X
1758-6518
Abstract: Purpose: This study investigates how generative AI models (ChatGPT, Claude and Gemini) can be systematically integrated into curriculum design using Hilda Taba's inductive model. Addressing Sustainable Development Goal 4 (SDG 4), it introduces the EduCompass framework to enhance inclusivity and instructional quality. Design/methodology/approach: A structured needs assessment with MBA students identified skill gaps in analytics and AI ethics. Deep research prompts were fed into ChatGPT, Claude and Gemini to generate curriculum components. Outputs were evaluated on accuracy, relevance and clarity by academic experts and analyzed for readability (Flesch-Kincaid), semantic similarity (cosine) and statistical significance (ANOVA). Findings: ChatGPT produced the most readable and pedagogically sound content, followed by Gemini and Claude. ChatGPT also received the highest expert ratings for clarity, accuracy and relevance, with significant differences confirmed via ANOVA. Cosine similarity showed the highest conceptual overlap between ChatGPT and Gemini. Originality/value: This is a study to empirically compare AI-generated curricula across multiple LLMs within a classical curriculum framework. EduCompass offers a replicable model for AI-enhanced, SDG-aligned curriculum design in higher education.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1492267
Database: ERIC
Description
Abstract:Purpose: This study investigates how generative AI models (ChatGPT, Claude and Gemini) can be systematically integrated into curriculum design using Hilda Taba's inductive model. Addressing Sustainable Development Goal 4 (SDG 4), it introduces the EduCompass framework to enhance inclusivity and instructional quality. Design/methodology/approach: A structured needs assessment with MBA students identified skill gaps in analytics and AI ethics. Deep research prompts were fed into ChatGPT, Claude and Gemini to generate curriculum components. Outputs were evaluated on accuracy, relevance and clarity by academic experts and analyzed for readability (Flesch-Kincaid), semantic similarity (cosine) and statistical significance (ANOVA). Findings: ChatGPT produced the most readable and pedagogically sound content, followed by Gemini and Claude. ChatGPT also received the highest expert ratings for clarity, accuracy and relevance, with significant differences confirmed via ANOVA. Cosine similarity showed the highest conceptual overlap between ChatGPT and Gemini. Originality/value: This is a study to empirically compare AI-generated curricula across multiple LLMs within a classical curriculum framework. EduCompass offers a replicable model for AI-enhanced, SDG-aligned curriculum design in higher education.
ISSN:0951-354X
1758-6518
DOI:10.1108/IJEM-12-2024-0837