Researchers at Vanderbilt University Medical Center have used natural language processing technology in an electronic medical records system to identify patients with multiple sclerosis and collect data on traits of their disease course.
The work is significant, researchers say, because much remains unknown about the course of the disease, which varies widely among patients. “Most research studies have focused on the origin of the disease, partly because of the difficulty in ascertaining sufficient longitudinal clinical data to study the disease course,” according to the study published in the Journal of the American Medical Informatics Association.
“Electronic medical records may provide such a tool. We have previously shown that genomic signals of MS risk may be replicated using EMR-derived cohorts. In this paper, we evaluated algorithms to extract detailed clinical information for the disease course of MS.”
The study used algorithms based on ICD-9 codes, text keywords and medications to identify 5,789 patients with MS, and collected detailed data on the clinical course of the patients’ disease to measure progression of disability. “For all clinical traits extracted, precision was at least 87 percent and specificity was greater than 80 percent.”