LCD benchmark: long clinical document benchmark on mortality prediction for language models.
The application of natural language processing (NLP) in the clinical domain is important because clinical documents contain rich unstructured information that is often unavailable in structured data. When applying NLP methods to a given domain, benchmark datasets are crucial: they not only guide the selection of the best-performing models but also enable assessment of the reliability of the generated outputs. Despite the recent [...]
Author(s): Yoon, WonJin; Chen, Shan; Gao, Yanjun; Zhao, Zhanzhan; Dligach, Dmitriy; Bitterman, Danielle S; Afshar, Majid; Miller, Timothy
DOI: 10.1093/jamia/ocae287