Mayo clinic NLP system for patient smoking status identification.
This article describes our system entry for the 2006 I2B2 contest "Challenges in Natural Language Processing for Clinical Data" for the task of identifying the smoking status of patients. Our system makes the simplifying assumption that patient-level smoking status determination can be achieved by accurately classifying individual sentences from a patient's record. We created our system with reusable text analysis components built on the Unstructured Information Management Architecture and Weka [...]
Author(s): Savova, Guergana K, Ogren, Philip V, Duffy, Patrick H, Buntrock, James D, Chute, Christopher G
DOI: 10.1197/jamia.M2437