Automating the segmentation, date extraction, and classification of multi-report PDFs in outside medical records using optical character recognition and generative artificial intelligence.
Patients referred for specialized care often arrive with outside medical records (OMRs) compiled into multi-report PDFs that include imaging, pathology, and clinical notes in unstructured formats. Reviewing these records is time consuming and mentally taxing, increasing the risk of delayed care, clinician frustration, and missed information affecting quality of care. This study aimed to automate the segmentation, classification, and date extraction of scanned OMRs, with a focus on records relevant [...]
Author(s): Damani, Shivam, Hinton, Benjamin, Hunt, Tanner, Lawrence, Nicholas, Miller, Kurt, Rice, Melinda, Peterson, Kevin, McLaughlin, Sarah, Ryu, Alexander
DOI: 10.1093/jamiaopen/ooag027