Drug safety data mining with a tree-based scan statistic

Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, USA
Pharmacoepidemiology and Drug Safety (Impact Factor: 2.94). 05/2013; DOI: 10.1002/pds.3423
Source: PubMed


In post-marketing drug safety surveillance, data mining can potentially detect rare but serious adverse events. Assessing an entire collection of drug–event pairs is traditionally performed on a predefined level of granularity. It is unknown a priori whether a drug causes a very specific or a set of related adverse events, such as mitral valve disorders, all valve disorders, or different types of heart disease. This methodological paper evaluates the tree-based scan statistic data mining method to enhance drug safety surveillance.
We use a three-million-member electronic health records database from the HMO Research Network. Using the tree-based scan statistic, we assess the safety of selected antifungal and diabetes drugs, simultaneously evaluating overlapping diagnosis groups at different granularity levels, adjusting for multiple testing. Expected and observed adverse event counts were adjusted for age, sex, and health plan, producing a log likelihood ratio test statistic.
Out of 732 evaluated disease groupings, 24 were statistically significant, divided among 10 non-overlapping disease categories. Five of the 10 signals are known adverse effects, four are likely due to confounding by indication, while one may warrant further investigation.
The tree-based scan statistic can be successfully applied as a data mining tool in drug safety surveillance using observational data. The total number of statistical signals was modest and does not imply a causal relationship. Rather, data mining results should be used to generate candidate drug–event pairs for rigorous epidemiological studies to evaluate the individual and comparative safety profiles of drugs. Copyright

Download full-text


Available from: Martin Kulldorff, Mar 27, 2015
1 Follower
38 Reads
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background: Drug adverse event (AE) signal detection using the Gamma Poisson Shrinker (GPS) is commonly applied in spontaneous reporting. AE signal detection using large observational health plan databases can expand medication safety surveillance. Methods: Using data from nine health plans, we conducted a pilot study to evaluate the implementation and findings of the GPS approach for two antifungal drugs, terbinafine and itraconazole, and two diabetes drugs, pioglitazone and rosiglitazone. We evaluated 1676 diagnosis codes grouped into 183 different clinical concepts and four levels of granularity. Several signaling thresholds were assessed. GPS results were compared to findings from a companion study using the identical analytic dataset but an alternative statistical method-the tree-based scan statistic (TreeScan). Results: We identified 71 statistical signals across two signaling thresholds and two methods, including closely-related signals of overlapping diagnosis definitions. Initial review found that most signals represented known adverse drug reactions or confounding. About 31% of signals met the highest signaling threshold. Conclusions: The GPS method was successfully applied to observational health plan data in a distributed data environment as a drug safety data mining method. There was substantial concordance between the GPS and TreeScan approaches. Key method implementation decisions relate to defining exposures and outcomes and informed choice of signaling thresholds.
    Pharmaceutics 03/2013; 5(1):179-200. DOI:10.3390/pharmaceutics5010179
  • [Show abstract] [Hide abstract]
    ABSTRACT: Objective: Several disproportionality analysis methods are widely used for signal detection. The goal of this study was to compare the concordance of the performance characteristics of these methods in spontaneous reporting system of China. Methods: Algorithms including reporting odds ratio (ROR), proportional reporting ratio (PRR) and information component (IC), a composite criterion previously used by Medicines and Healthcare Products Regulatory Agency (MHRA) were compared. Kappa coefficient was used as the gauge to test the concordance. Reports received in the year 2004 and 2005 were extracted for analysis in this study. Results: After data processing, 361,872 reports representing 52,769 combinations were analysed. The analysis generated 24,022, 22,646, 5637 and 5302 signals of disproportionality by PRR, ROR, MHRA and IC, respectively. The kappa coefficient increased with the threshold of number of drug-adverse drug reactions (ADR) combination, and the coefficient exceeded 0.7 when the number of suspected drug-ADR exceeded 2. Conclusion: This study shows that different measures used are broadly comparable in spontaneous reporting system in China when two or more cases per combination have been collected.
    Expert Opinion on Drug Safety 06/2014; 13(7):1-5. DOI:10.1517/14740338.2014.915938 · 2.91 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: A limited amount of data is typically available to support product license applications. That is further complicated by the need to make some medicines available to patients at key times, using expedited drug approval pathways. In addition, increasing immunogenicity concerns have been paralleled by a myriad of biotherapeutics entering development and/or receiving regulatory approval. Postmarketing patient safety is intrinsically dependent on the correct balance of economics, regulatory oversight and legal and enforcement issues. Here, we discuss the changing landscape of pharmacovigilance, with special emphasis on postmarketing commitments and requirements, megadata analysis, regulatory responsibilities and research opportunities. Challenges and possibilities are illustrated with therapeutic drugs approved for treatment of autoimmune diseases, diabetes, cancer, rare diseases and the resurgence of gene therapy.
    Drug Discovery Today 07/2014; 19(12). DOI:10.1016/j.drudis.2014.07.011 · 6.69 Impact Factor
Show more