Evaluating the Accuracy and Explanatory Quality of Large Language Models ChatGPT, Claude, DeepSeek, Gemini, Grok, and Le Chat in Statistical Test Selection for Hypothesis Testing Decisions.

Saved in:
Bibliographic Details
Title: Evaluating the Accuracy and Explanatory Quality of Large Language Models ChatGPT, Claude, DeepSeek, Gemini, Grok, and Le Chat in Statistical Test Selection for Hypothesis Testing Decisions.
Authors: Shukla M; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Pandey D; Community Medicine, Amar Shaheed Jodha Singh Atayiya Thakur Dariyao Singh Medical College, Fatehpur, IND., Kaur S; Community Medicine, Ganesh Shankar Vidyarthi Memorial Medical College, Kanpur, IND., Agarwal M; Physiology, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Goyal A; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Sharma H; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND.
Source: Cureus [Cureus] 2025 Oct 19; Vol. 17 (10), pp. e94949. Date of Electronic Publication: 2025 Oct 19 (Print Publication: 2025).
Publication Type: Journal Article
Journal Info: Publisher: Cureus, Inc Country of Publication: United States NLM ID: 101596737 Publication Model: eCollection Cited Medium: Print ISSN: 2168-8184 (Print) Linking ISSN: 21688184 NLM ISO Abbreviation: Cureus Subsets: PubMed not MEDLINE
Database: MEDLINE Ultimate
Full text is not displayed to guests.
Be the first to leave a comment!
You must be logged in first