Natural Language Processing in the Electronic Medical Record: Assessing Clinician Adherence to Tobacco Treatment Guidelines

Harvard University, Cambridge, Massachusetts, United States
American Journal of Preventive Medicine (Impact Factor: 4.53). 01/2006; 29(5):434-9. DOI: 10.1016/j.amepre.2005.08.007
Source: PubMed


Comprehensively assessing care quality with electronic medical records (EMRs) is not currently possible because much data reside in clinicians' free-text notes.
We evaluated the accuracy of MediClass, an automated, rule-based classifier of the EMR that incorporates natural language processing, in assessing whether clinicians: (1) asked if the patient smoked; (2) advised them to stop; (3) assessed their readiness to quit; (4) assisted them in quitting by providing information or medications; and (5) arranged for appropriate follow-up care (i.e., the 5A's of smoking-cessation care).
We analyzed 125 medical records of known smokers at each of four HMOs in 2003 and 2004. One trained abstractor at each HMO manually coded all 500 records according to whether or not each of the 5A's of smoking cessation care was addressed during routine outpatient visits.
For each patient's record, we compared the presence or absence of each of the 5A's as assessed by each human coder and by MediClass. We measured the chance-corrected agreement between the human raters and MediClass using the kappa statistic.
For "ask" and "assist," agreement among human coders was indistinguishable from agreement between humans and MediClass (p>0.05). For "assess" and "advise," the human coders agreed more with each other than they did with MediClass (p<0.01); however, MediClass performance was sufficient to assess quality in these areas. The frequency of "arrange" was too low to be analyzed.
MediClass performance appears adequate to replace human coders of the 5A's of smoking-cessation care, allowing for automated assessment of clinician adherence to one of the most important, evidence-based guidelines in preventive health care.
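
The chance-corrected agreement mentioned in the abstract can be illustrated with a short sketch of Cohen's kappa for two raters. This is only an illustration of the statistic, not the study's analysis code: the function name `cohens_kappa` and the ten example codes below are hypothetical, and the abstract does not say which kappa variant or software the authors used.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who labeled the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each rater's
    marginal label frequencies.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must label the same non-empty set of items")

    n = len(rater_a)

    # Observed agreement: share of items on which the two raters match.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: for each label, the product of the two raters'
    # marginal probabilities of assigning it, summed over all labels.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    labels = set(freq_a) | set(freq_b)
    expected = sum((freq_a[label] / n) * (freq_b[label] / n) for label in labels)

    if expected == 1.0:
        # Both raters used one identical label throughout; kappa is undefined.
        return float("nan")

    return (observed - expected) / (1.0 - expected)


# Hypothetical data (not from the study): 1 = "advise" documented, 0 = not.
human_coder = [1, 1, 0, 1, 0, 0, 1, 1, 0, 1]
classifier = [1, 1, 0, 0, 0, 0, 1, 1, 1, 1]

kappa = cohens_kappa(human_coder, classifier)
print(f"kappa = {kappa:.3f}")
```

In this made-up example the raters agree on 8 of 10 records (0.80), chance agreement is 0.52, and kappa is therefore about 0.58. Kappa is lower than raw agreement because it discounts the matches two raters would produce by chance alone.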



Available from: Dean Forrest Sittig, Jun 26, 2015
  • ABSTRACT: MediClass is a knowledge-based system that processes both free-text and coded data to automatically detect clinical events in electronic medical records (EMRs). This technology aims to optimize both clinical practice and process control by automatically coding EMR contents regardless of data input method (e.g., dictation, structured templates, typed narrative). We report on the design goals, implemented functionality, generalizability, and current status of the system. MediClass could aid both clinical operations and health services research through enhancing care quality assessment, disease surveillance, and adverse event detection.
    Article · May 2005 · Journal of the American Medical Informatics Association
  • ABSTRACT: Pressure is building for performance measures that can be collected inexpensively and repeatedly for internal and external accountability and quality improvement. The objective of this study was to develop and test measures obtainable from administrative data covering each of the Institute of Medicine's (IOM) 6 aims. Measure definitions were developed for 3 common chronic conditions and were revised after testing the feasibility of collecting them from claims data. The setting was a large, multispecialty medical group in the Midwest and included all adult patients with diabetes, coronary heart disease, or depression. Problems identified in the original 99 measures led to refinements or elimination. The resulting 46 measures ready for use include 11 measures for 5 aims applicable to most common chronic conditions, plus 10 to 14 effectiveness measures for each condition. They have been successfully used to describe care quality changes for these patients over time. This starter set for the 6 IOM aims should be tested and expanded by others.
    Article · Sep 2006 · American Journal of Medical Quality
  • ABSTRACT: New clinical information technologies now sporadically available will soon be in routine clinical use, bringing many changes to all phases of the cancer care continuum. For example, new technologies such as: (1) The next generation Internet; (2) Real-time clinical decision support systems; (3) Off-line, population-based systems; (4) Large, integrated, individual patient-level phenotypic and genotypic databases with intelligent data mining capabilities; (5) Wireless, invasive and non-invasive physiologic monitoring devices; (6) Natural Language Processing (NLP) systems; and (7) Mathematical models of complex biological systems all have the potential to impact significantly the provision of cancer care throughout its continuum. While new information management and communication techniques and technologies will reduce many of the inefficiencies and inaccuracies of our present systems, there will be an equal, and potentially far more dangerous, set of unintended consequences. Informatics investigators, cancer specialists, and health system administrators must focus on the study of what is working and what is not, as well as, on development and testing of the new clinical information management and communication technologies, if we are to be ready for the future.
    Article · Sep 2006 · Cancer Causes and Control