Identifying smokers with a medical extraction system.
The Clinical Language Understanding group at Nuance Communications has developed a medical information extraction system that combines a rule-based extraction engine with machine learning algorithms to identify and categorize references to patient smoking in clinical reports. The extraction engine identifies smoking references; documents that contain no smoking references are classified as UNKNOWN. For the remaining documents, the extraction engine uses linguistic analysis to associate features such as status and time [...]
Author(s): Clark, Cheryl, Good, Kathleen, Jezierny, Lesley, Macpherson, Melissa, Wilson, Brian, Chajewska, Urszula
DOI: 10.1197/jamia.M2442