View in EDS HTML Full Text PDF Full Text

Evaluating the Accuracy and Explanatory Quality of Large Language Models ChatGPT, Claude, DeepSeek, Gemini, Grok, and Le Chat in Statistical Test Selection for Hypothesis Testing Decisions.

Saved in:

Bibliographic Details
Title:	Evaluating the Accuracy and Explanatory Quality of Large Language Models ChatGPT, Claude, DeepSeek, Gemini, Grok, and Le Chat in Statistical Test Selection for Hypothesis Testing Decisions.
Authors:	Shukla M; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Pandey D; Community Medicine, Amar Shaheed Jodha Singh Atayiya Thakur Dariyao Singh Medical College, Fatehpur, IND., Kaur S; Community Medicine, Ganesh Shankar Vidyarthi Memorial Medical College, Kanpur, IND., Agarwal M; Physiology, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Goyal A; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND., Sharma H; Community Medicine, All India Institute of Medical Sciences, Raebareli, Raebareli, IND.
Source:	Cureus [Cureus] 2025 Oct 19; Vol. 17 (10), pp. e94949. Date of Electronic Publication: 2025 Oct 19 (Print Publication: 2025).
Publication Type:	Journal Article
Journal Info:	Publisher: Cureus, Inc Country of Publication: United States NLM ID: 101596737 Publication Model: eCollection Cited Medium: Print ISSN: 2168-8184 (Print) Linking ISSN: 21688184 NLM ISO Abbreviation: Cureus Subsets: PubMed not MEDLINE
Database:	MEDLINE Ultimate
Full text is not displayed to guests. Login for full access.

Be the first to leave a comment!