Benchmarking information extraction of physical activity from electronic health record with large language models: an natural language processing pipeline and comparative evaluation.
We aimed to develop a data model and a natural language processing (NLP) pipeline for representing physical activity (PA) in Electronic Health Records (EHRs), and to evaluate transformer- and Large Language Model (LLM)-based classifiers for sentence-level PA attribute classification.
Author(s): Yang, Han, Niu, Zhongran, Li, Mingchen, Zhou, Huixue, Xiao, Yongkang, Zhou, Sicheng, Zhan, Zaifu, Liu, Ying, Liu, Shiqin, Tignanelli, Christopher J, Melton, Genevieve B, Zhang, Rui
DOI: 10.1093/jamia/ocag101