Comparative Evaluation of ChatGPT, Gemini, and DeepSeek for ADHD-Related Health Information
This study systematically compared how three large language models (ChatGPT GPT-4o, Gemini, and DeepSeek R1) respond to 22 commonly asked ADHD questions across four domains, finding all models achieved high accuracy (87-91%) as independently verified by psychiatry specialists. Reproducibility was strong across models, with ChatGPT showing slightly superior performance in treatment domains while Gemini and DeepSeek excelled in basic knowledge and diagnosis.
Sources
- MED — Fri Jun 12 2026 00:00:00 GMT+0000 (Coordinated Universal Time) · Read full article (translated)
Afișăm titlu + rezumat scurt în limita dreptului de autor; textul integral e la sursă.