Using implicit information to identify smoking status in smoke-blind medical discharge summaries.
As part of the 2006 i2b2 NLP Shared Task, we explored two methods for determining the smoking status of patients from their hospital discharge summaries when explicit smoking terms were present and when those same terms were removed. We developed a simple keyword-based classifier to determine smoking status from de-identified hospital discharge summaries. We then developed a Naïve Bayes classifier to determine smoking status from the same records after all [...]
Author(s): Wicentowski, Richard; Sydes, Matthew R
DOI: 10.1197/jamia.M2440
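
Illustrative sketch: the abstract names two approaches, a keyword-based classifier for records that contain explicit smoking terms and a Naïve Bayes classifier for the same records once those terms are removed. The Python below is a minimal sketch of that general idea, not the authors' implementation. The keyword pattern, the function and class names, the label set, the add-one smoothing, and the toy training sentences are all assumptions made for illustration; none of them come from the paper.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical keyword pattern; the actual term list used in the paper
# is not given in the abstract.
SMOKING_TERMS = re.compile(r"\b(smok\w*|tobacco|cigar\w*|nicotine)\b", re.IGNORECASE)


def mentions_smoking(text):
    """Keyword check: True if any explicit smoking term appears."""
    return SMOKING_TERMS.search(text) is not None


def remove_smoking_terms(text):
    """Make a record 'smoke-blind' by blanking out explicit smoking terms."""
    return SMOKING_TERMS.sub(" ", text)


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed variant)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> token counts
        self.doc_counts = Counter()              # label -> number of documents
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label in self.doc_counts:
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[label] / total_docs)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for token in tokens:
                score += math.log((self.word_counts[label][token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy records, already stripped of explicit smoking terms.
train_texts = [
    "patient with copd and chronic bronchitis on home oxygen",
    "emphysema noted with chronic cough and wheezing",
    "healthy patient admitted for elective knee replacement",
    "fracture of wrist after fall no chronic illness",
]
train_labels = ["smoker", "smoker", "non-smoker", "non-smoker"]

model = NaiveBayes().fit(train_texts, train_labels)
blind = remove_smoking_terms("Smoker with chronic cough, COPD and wheezing")
print(mentions_smoking(blind), model.predict(blind))
```

With explicit terms removed, the keyword check has nothing to match, so the classifier has to rely on implicit cues such as co-occurring diagnoses. That is the situation the title calls "smoke-blind".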