Advancing Sustainable Development Goal 4 -- A Comparative Analysis of Large Language Models
Saved in:
| Title: | Advancing Sustainable Development Goal 4 -- A Comparative Analysis of Large Language Models |
|---|---|
| Language: | English |
| Authors: | Anshul Saxena (ORCID |
| Source: | International Journal of Educational Management. 2025 39(7):1774-1791. |
| Availability: | Emerald Publishing Limited. Howard House, Wagon Lane, Bingley, West Yorkshire, BD16 1WA, UK. Tel: +44-1274-777700; Fax: +44-1274-785201; e-mail: emerald@emeraldinsight.com; Web site: http://www.emerald.com/insight |
| Peer Reviewed: | Y |
| Page Count: | 18 |
| Publication Date: | 2025 |
| Document Type: | Journal Articles Reports - Research |
| Education Level: | Higher Education Postsecondary Education |
| Descriptors: | Artificial Intelligence, Sustainable Development, Masters Programs, Business Education, Curriculum Design, Technology Uses in Education, Readability, Readability Formulas, Computer Assisted Design |
| Assessment and Survey Identifiers: | Flesch Reading Ease Formula, Flesch Kincaid Grade Level Formula |
| DOI: | 10.1108/IJEM-12-2024-0837 |
| ISSN: | 0951-354X 1758-6518 |
| Abstract: | Purpose: This study investigates how generative AI models (ChatGPT, Claude and Gemini) can be systematically integrated into curriculum design using Hilda Taba's inductive model. Addressing Sustainable Development Goal 4 (SDG 4), it introduces the EduCompass framework to enhance inclusivity and instructional quality. Design/methodology/approach: A structured needs assessment with MBA students identified skill gaps in analytics and AI ethics. Deep research prompts were fed into ChatGPT, Claude and Gemini to generate curriculum components. Outputs were evaluated on accuracy, relevance and clarity by academic experts and analyzed for readability (Flesch-Kincaid), semantic similarity (cosine) and statistical significance (ANOVA). Findings: ChatGPT produced the most readable and pedagogically sound content, followed by Gemini and Claude. ChatGPT also received the highest expert ratings for clarity, accuracy and relevance, with significant differences confirmed via ANOVA. Cosine similarity showed the highest conceptual overlap between ChatGPT and Gemini. Originality/value: This is a study to empirically compare AI-generated curricula across multiple LLMs within a classical curriculum framework. EduCompass offers a replicable model for AI-enhanced, SDG-aligned curriculum design in higher education. |
| Abstractor: | As Provided |
| Entry Date: | 2026 |
| Accession Number: | EJ1492267 |
| Database: | ERIC |
| Abstract: | Purpose: This study investigates how generative AI models (ChatGPT, Claude and Gemini) can be systematically integrated into curriculum design using Hilda Taba's inductive model. Addressing Sustainable Development Goal 4 (SDG 4), it introduces the EduCompass framework to enhance inclusivity and instructional quality. Design/methodology/approach: A structured needs assessment with MBA students identified skill gaps in analytics and AI ethics. Deep research prompts were fed into ChatGPT, Claude and Gemini to generate curriculum components. Outputs were evaluated on accuracy, relevance and clarity by academic experts and analyzed for readability (Flesch-Kincaid), semantic similarity (cosine) and statistical significance (ANOVA). Findings: ChatGPT produced the most readable and pedagogically sound content, followed by Gemini and Claude. ChatGPT also received the highest expert ratings for clarity, accuracy and relevance, with significant differences confirmed via ANOVA. Cosine similarity showed the highest conceptual overlap between ChatGPT and Gemini. Originality/value: This is a study to empirically compare AI-generated curricula across multiple LLMs within a classical curriculum framework. EduCompass offers a replicable model for AI-enhanced, SDG-aligned curriculum design in higher education. |
|---|---|
| ISSN: | 0951-354X 1758-6518 |
| DOI: | 10.1108/IJEM-12-2024-0837 |