A scalable framework for benchmark embedding models in semantic health-care tasks.
Text embeddings are promising for semantic tasks, such as retrieval augmented generation (RAG). However, their application in health care is underexplored due to a lack of benchmarking methods. We introduce a scalable benchmarking method to test embeddings for health-care semantic tasks.
Author(s): Soffer, Shelly, Omar, Mahmud, Gendler, Moran, Glicksberg, Benjamin S, Kovatch, Patricia, Efros, Orly, Freeman, Robert, Charney, Alexander W, Nadkarni, Girish N, Klang, Eyal
DOI: 10.1093/jamia/ocaf149