Evaluation of ChatGPT, Gemini, and OpenEvidence in Obstetric and Gynecologic Clinical Decision Scenarios.
Background Clinicians frequently face questions that require rapid, evidence-based answers. Artificial intelligence (AI) tools are increasingly used for this purpose, yet their reliability for clinical decision-making remains uncertain. This study compared two generative large language model (LLM) systems (ChatGPT and Gemini) and a retrieval-supported clinical platform (OpenEvidence) to determine which provides the most reliable, clear, and clinically applicable information in obstetrics, gynecology, and urogynecology. Methods A cross-sectional comparative design was [...]
Author(s): Atay, Arif Onur, Atay, Feride, Ozmen, Samican, Balci, Mucahit Furkan
DOI: 10.1055/a-2899-0123
