View in EDS

Assessing the Promise and Pitfalls of ChatGPT for Automated CS1-Driven Code Generation

Saved in:

Bibliographic Details
Title:	Assessing the Promise and Pitfalls of ChatGPT for Automated CS1-Driven Code Generation
Language:	English
Authors:	Muhammad Fawad Akbar Khan, Max Ramsdell, Erik Falor, Hamid Karimi
Source:	International Educational Data Mining Society. 2024.
Availability:	International Educational Data Mining Society. e-mail: admin@educationaldatamining.org; Web site: https://educationaldatamining.org/conferences/
Peer Reviewed:	Y
Page Count:	13
Publication Date:	2024
Sponsoring Agency:	National Science Foundation (NSF), Division of Equity for Excellence in STEM (EES)
Contract Number:	2321304
Document Type:	Speeches/Meeting Papers Reports - Research
Descriptors:	Artificial Intelligence, Automation, Computer Science Education, Programming, Computer Software, Engineering, Cues
Abstract:	This paper undertakes a thorough evaluation of ChatGPT's code generation capabilities, contrasting them with those of human programmers from both educational and software engineering standpoints. The emphasis is placed on elucidating its importance in these intertwined domains. To facilitate a robust analysis, we curated a novel dataset comprising 131 code-generation prompts spanning five categories. The study encompasses 262 code samples generated by ChatGPT and humans, with a meticulous manual assessment methodology prioritizing correctness, comprehensibility, and security using 14 established code quality metrics. Noteworthy strengths include ChatGPT's proficiency in crafting concise, efficient code, particularly excelling in data analysis tasks (93.1% accuracy). However, limitations are observed in handling visual-graphical challenges. Comparative analysis with human-generated code highlights ChatGPT's inclination towards modular design and superior error handling. Machine learning models effectively distinguish ChatGPT from human code with up to 88% accuracy, indicating detectable coding style disparities. By offering profound insights into ChatGPT's code generation capabilities and limitations through quantitative metrics and qualitative analysis, this study contributes significantly to the advancement of AI-based programming assistants. The curated dataset and methodology establish a robust foundation for future research in this evolving domain, reinforcing its importance in shaping the future landscape of computer science education and software engineering. [For the complete proceedings, see ED675485.]
Abstractor:	As Provided
Notes:	https://github.com/DSAatUSU/ChatGPT-promises-and-pitfalls
Entry Date:	2025
Accession Number:	ED675593
Database:	ERIC

Full Text from ERIC

Be the first to leave a comment!