Uncertainty estimation in diagnosis generation from large language models: next-word probability is not pre-test probability.
To evaluate large language models (LLMs) for pre-test diagnostic probability estimation and compare their uncertainty estimation performance with a traditional machine learning classifier.
Author(s): Gao, Yanjun, Myers, Skatje, Chen, Shan, Dligach, Dmitriy, Miller, Timothy, Bitterman, Danielle S, Chen, Guanhua, Mayampurath, Anoop, Churpek, Matthew M, Afshar, Majid
DOI: 10.1093/jamiaopen/ooae154