Zero-shot interpretable biomedical literature appraisal with generative large language models.
This study aims to apply 2 decoder-based Generative Pre-trained Transformer (GPT) models (GPT-4o and GPT-o3-mini) in automating the methodological appraisal of randomized controlled trials (RCTs), under a variety of prompt designs, and to compare their performance to a fine-tuned encoder-only BioLinkBERT model.
Author(s): Zhou, Fangwen, Afzal, Muhammad, Saha, Ashirbani, Parrish, Rick, Haynes, R Brian, Iorio, Alfonso, Lokker, Cynthia
DOI: 10.1093/jamiaopen/ooag043