Skip to main content

AutoCriteria: a generalizable clinical trial eligibility criteria extraction system powered by large language models

Read the abstract

Presenters

Surabhi Datta, PhD
IMO Health
Xiayang Wang, PhD, NLP
IMO Health

Statement of Purpose

Eligibility criteria in clinical trials play a pivotal role in patient recruitment and in safety and treatment evaluation, leading to advancements in medical knowledge and improved patient care. Traditional approaches to extract eligibility criteria from trial text are often laborious, time-consuming, and prone to human errors. To this end, the development of automated systems for eligibility criteria extraction using natural language processing (NLP) techniques can expedite the process and reduce errors, thereby facilitating comprehensive trial evaluation and improved recruitment processes. Moreover, computable representations that facilitate electronic health record (EHR) interoperable eligibility criteria enable trial coordinators/recruiters to automatically match them against the corresponding clinical information in EHRs and notify providers about potential patients eligible for a specific trial.

Recently, large language models have gained traction in the general as well as biomedical and clinical domains. These models encode general knowledge in their parameters and are designed to work across different NLP tasks. They also encode clinical knowledge and have been employed for various medical applications. Inspired by this, we propose to leverage such a powerful and general-purpose model, OpenAI’s GPT-4, to extract eligibility criteria from clinical trial text.

We developed AutoCriteria to extract granular eligibility criteria including identifying temporal information and criteria conditions from clinical trial documents (collected from ClinicalTrials.gov) belonging to a wide variety of diseases, ranging from oncology (e.g., breast cancer) to neurological progressive diseases (e.g., Alzheimer’s) to rare diseases (e.g., sickle cell disease). AutoCriteria prompts are flexible and generalizable across different disease domains and can be easily extended to new diseases without requiring manual annotation and training. Such a generalizable and scalable criteria extraction system could significantly streamline the patient recruitment process and expedite the construction of criteria knowledge base.

Learning Outcomes

  • Explain the importance of creating a structured representation of free-text clinical trial eligibility criteria for patient recruitment to trials.
  • Demonstrate the construction of generalizable prompts for eligibility criteria extraction from clinical trial documents across different disease domains.
Dates and Times: -
Type: JAMIA Journal Club
Course Format(s): Live Virtual
Credits:
1.00
CME
,
1.00
CNE
Price: Free
Register now
Share