AI as an intervention: improving clinical outcomes relies on a causal approach to AI development and validation.
The primary practice of healthcare artificial intelligence (AI) starts with model development, often using state-of-the-art AI, retrospectively evaluated using metrics lifted from the AI literature like AUROC and DICE score. However, good performance on these metrics may not translate to improved clinical outcomes. Instead, we argue for a better development pipeline constructed by working backward from the end goal of positively impacting clinically relevant outcomes using AI, leading to considerations [...]
Author(s): Joshi, Shalmali, Urteaga, Iñigo, van Amsterdam, Wouter A C, Hripcsak, George, Elias, Pierre, Recht, Benjamin, Elhadad, Noémie, Fackler, James, Sendak, Mark P, Wiens, Jenna, Deshpande, Kaivalya, Wald, Yoav, Fiterau, Madalina, Lipton, Zachary, Malinsky, Daniel, Nayan, Madhur, Namkoong, Hongseok, Park, Soojin, Vogt, Julia E, Ranganath, Rajesh
DOI: 10.1093/jamia/ocae301