ArticlePDF Available

Abstract

We analyzed 45 original articles from Indian Pediatrics for appropriateness of the statistical methods. Appropriate statistical tests (93%), no use of obscure test and use of exact P value were the positive findings observed. Sample size was calculated in 24% and confidence interval in 13%. There is a need to generate awareness regarding confidence interval and sample size calculations.
INDIAN PEDIATRICS 811 VOLUME 46__SEPTEMBER 17, 2009
RESEARCH LETTERS
from immunoglobulin administration. In Group I,
only one patient (8%) expired as compared to 6/13
(46%) in Group II (P=0.04). Recovery of left
ventricular function was not significantly different
between two groups (49% vs. 46%) (P=0.13).
Our reports showed that IVIG group had
significant higher survival rate (92%) than other
group who did not receive IVIG (54%). The
therapeutic efficacy of high-dose IVIG in Kawasaki
disease has been already established(2). Other
experimental animal and human studies in acute
myocarditis have also reported better outcome with
IVIG(3-6).
Our study had few limitations. The diagnosis was
based on clinical features, CXR, ECG, and
echocardiography. Small sample size and
retrospective nature of the study were the other
hinderances. However, this study provides support
for aggressive supportive care and early use of IVIG
in acute myocarditis in children.
Anwarul Haque, Samreen Bhatti
and Fahad J Siddiqui
Department of Pediatrics and Child Health,
Aga Khan University Hospital,
Stadium Road, P O Box 3500,
Karachi 74800, Pakistan.
anwar.haq@aku.edu
REFERENCES
1. Levi D, Alejos J. Diagnosis and treatment of
pediatric viral myocarditis. Curr Opin Cardiol
2001; 4: 171-181.
2. Newburger JW, Takahashi M, Burns JC, Beiser
AS, Chung KJ, Duffy CE, et al. The treatment of
Kawasaki syndrome with intravenous gamma
globulin. N Engl J Med 1986; 315: 341-347.
3. Drucker NA, Colan SD, Lewis AB, Beiser AS,
Wessel DL, Takahashi M, et al. Gamma-globulin
treatment of acute myocarditis in the pediatric
population. 6. Circulation 1994; 89: 252-257.
4. Goland S, Czer LS, Seigel RJ, Tabak S. JordanS,
Luthringer D, et al. Intravenous immunoglobulin
treatment for acute fulminant inflammatory
cardiomyopathy: series of six patients and review
of literature. Can J Cardiol 2008; 24: 571-574.
5. McNamara DM, Rosenblum WD, Janosko KM,
Trost MK, Villaneuva FS, Demetris AJ, et al.
Intravenous immune globulin in the therapy of
myocarditis and acute cardiomyopathy. Circulation
1997; 95: 2476-2478.
6. Kishimoto C, Shioji K, Kinoshita M, Iwase T,
Tamaki S, Fujii M, et al. Treatment of acute
inflammatory cardiomyopathy with intravenous
immunoglobulin ameliorates left ventricular
function associated with suppression of
inflammatory cytokines and decreased oxidative
stress. Int J Cardiol 2003; 91:173-178.
TABLE I PATIENT CHARACTERISTICS OF TWO GROUPS
Variables IVIG- IVIG+ P
(controls) (cases) value
n = 13 n =12
Age (mo), mean(SD) 12.0 (4.9) 7.3 (5.8) 0.04
Gender (M/F) 6/7 6/6 0.6
Hepatomegaly 12 9 0.3
Cardiomegaly 12 12 0.5
ECG: Low-voltage 12 9 0.3
Initial EF (%), mean(SD) 22.5 (11.1) 17.5 (5.0) 0.17
Inotropes 1.5 (0.9) 3.0 (1.1) 0.001
EF: ejection fraction
Statistical Reporting in Indian Pediatrics
We analyzed 45 original articles from Indian Pediatrics for
appropriateness of the statistical methods. Appropriate
statistical tests (93%), no use of obscure test and use of
exact P value were the positive findings observed. Sample
size was calculated in 24% and confidence interval in
13%. There is a need to generate awareness regarding
confidence interval and sample size calculations.
Key words: India,Journal, Statistics.
Statistical errors are common in scientific
literature(1-3). We conducted this study to evaluate
the appropriateness of statistical techniques used and
types of statistical errors present in original papers
INDIAN PEDIATRICS 812 VOLUME 46__SEPTEMBER 17, 2009
RESEARCH LETTERS
published in Indian Pediatrics. The journal has
separate advisors for biostatistics(4).
All the authors independently surveyed 45 Lead
Articles and Research Papers published in Indian
Pediatrics in 2007 and 2008 on the basis of a
modified checklist(5). The results are summarized in
Table I.
Appropriate statistical tests were used in 42
(93%) articles. In few studies parametric test were
used in place of nonparametric tests. Mainly these
data were dealing with rank and scores. The
incidence of inappropriate tests is less as compared
to other studies published in this area(2,3,6).
Sample size was calculated in only 11 studies;
among the clinical trials it was calculated in 46% of
studies. Data were checked for normal distribution
in only 21% studies. Multiple hypothesis testing was
done in most of the articles. Separate test was used
for each variable. Another neglected aspect was
confidence interval, calculated in only 13% articles.
Confidence interval gives us range of the value
around the effect size of sample where population
mean actually lies. Other encouraging point is
mentioning of exact P value in most of the articles.
We observed that as compared to previous
studies in other journals, the statistical reporting of
Indian Pediatrics is better(6,7). Editors should
generate more awareness regarding confidence
interval, distribution of data, nonparametric statistics
and calculation of sample size.
Jay Karan, JP Goyal, *P Bhardwaj and
Preeti Yadav
Government Medical College, Surat; and
*Era’s Medical College, Lucknow, India.
drjayakaran@yahoo.co.in
REFERENCES
1. Curran-Everett D, Taylor S, Kafadar K. Funda-
mental concepts in statistics: elucidation and
illustration. J Appl Physiol 1998; 85: 775-786.
2. Altman DG. Statistics in medical journals: some
recent trends. Stat Med 2000; 19: 3275-3289.
3. Altman DG, Bland JM. Improving doctors’
understanding of statistics. J R Stat Soc Ser 1991;
154: 223-267.
4. Gupta P, Kaur G, Sharma B, Shah D, Choudhury P.
What is submitted and what gets accepted in Indian
Pediatrics: Analysis of submissions, review
process, decision making, and criteria for rejection.
Indian Pediatr 2006; 43: 479-489.
5. Greenhalgh T. How to read a paper? The basics of
evidence based medicine. 3rd ed. London: BMJ
Publishing House; 1997.
6. Mantha S. Statistical reporting of study results: is
there a scope for improvement. J Anaesth Clin
Pharmacol 1994; 10: 81-84.
7. Pocock SJ, Hughes MD, Lee RJ. Statistical
problems in the reporting of clinical trials. A survey
of three medical journals. N Eng J Med 1987; 317:
426-432.
TABLE I REPORTING OF STATISTICAL METHODS IN INDIAN
PEDIATRICS
Checklist points Frequency(%)n=45
Sample size calculation 11 (24.4)
Baseline comparison 43 (95.5)
Ratio + nominal data 22 (48)
Nominal data 11 (24)
Ratio data 9 (20)
Ordinal 2 (4)
Ratio + ordinal 1 (2)
Appropriate statistical test 42 (93)
Parametric test 38 (84)
Nonparametric test 7 (15)
Normal distribution analyzed 7 (15)
Obscure test 0
Post hoc subgroup analysis 0
Two tailed test 45 (100)
Outliers 0
Correlation 9 (20)
r value 7 (15)
P value <0.05 45 (100)
Confidence interval 6 (13)
Relative risk reduction 0
Absolute risk reduction 0
Number needed to treat 0
Use of software 31 (29)
... [4] Many of the clinical trials published in Indian and western medical journals are deficient in reporting of sample size calculation. [4,7,8,1,2] Help of statistician or readymade computer software can be taken for the calculation of appropriate sample size. Power calculation should also be done during designing phase. ...
... One of the fallacies during clinical trials is use of inappropriate or wrong statistical tests. [1][2][3][4][5] Number of reasons cited for considering a test to be inappropriate. As mentioned earlier before using the particular statistical tests, all the assumptions for that statistical test should be fulfilled. ...
... In many of the clinical trials published in Indian medical journals it is observed that distribution of data was not checked which is an important prerequisite before applying parametric statistical tests. [1,2] Biological variables are usually more prone to fall in skewed distribution, hence checking of distribution become very important. Variable measured from small samples are also more prone to fall under skewed distribution. ...
Article
Full-text available
Statistical methods are widely accepted tools for analyzing data and drawing appropriate conclusions. However, inappropriate statistical methods may lead to unsuitable or false conclusions. Statistical quality of articles published in Indian medical journals presents with statistical fallacies and very few attempts have been done to document it till date. Issues related to inappropriate use of statistic in interpreting clinical trial raises serious doubt about the validity of the results published and there generalization to patient population. Clinical trials are considered to have one of the highest evidences in terms of efficacy of any intervention and hence results reported in these clinical trials help in appropriate decision making in clinical practices. Present manuscript is an attempt to have an insight into issues related to inappropriate use of statistic during clinical trials and suggests possible methods by which such fallacies can be averted.
... We believe that this work is important as very few studies are done regarding reporting of statistics in Indian medical journals. [18,19] Each author surveyed these negative clinical trials for reporting of sample size calculation and power on predesigned proforma. We looked for reporting of the sample size and various components of sample size calculation (reporting of power, comment on power, reporting of delta, reason for delta, assumption of the control group, reason for the control group, confidence interval, and reporting of one tailed or two tailed). ...
... [21,22] In a survey done by us on articles published in Indian medical journals, Indian pediatrics, and Indian Journal of Pharmacology, we found that sample size calculation was not mentioned in any article of Indian Journal of Pharmacology and in Indian pediatrics it was mentioned in 24% of articles. [18,19] There is also ethical importance of calculation of the sample size and power. It is observed that ethics committees did not permit the conduction of clinical trial having an unnecessary large sample size or small sample ...
Article
Background and Aim: It is observed that negative clinical trials published in medical journals are poor in reporting of sample size calculation, various components of calculation of sample size. It is also observed that they are underpowered to detect the actual difference between the treatment outcomes. Because of scarcity of these data for Indian medical journals we designed this study to critically analyze the Indian medical journals for reporting of sample size, components of sample size and power. We calculated post hoc power for 30% and 50% difference between the treatment outcomes. Materials and Methods: All the negative clinical trials published in five Indian medical journals (Indian Journal of Pharmacology (IJP), Indian Pediatrics (IP), Indian Journal of Dermatology (IJD), Indian Journal of Dermatology, Vanereology and Leprology (IJDVL), and Journal of Postgraduate Medicine (JPGM)), between 2001 and 2008, were analyzed by each author for reporting of the sample size and components of the sample size. Post hoc power for 30% and 50% differences between the outcome was calculated by G Power software. All data were expressed as frequency, percentages, and 95% confidence interval around the percentages with the help of SPSS ver. 17. Results: The median sample size was observed to be 33 (range 9-85). Power was calculated in 28 (41.1%, 95% CI 30.2% to 53%) trials. The sample size was not calculated in 34 (50%, 95% CI 38.4% to 61.5%) trials. The full sample size was calculated in only 2 (2.9%, 95% CI 0.8% to 10.1%) trials. Post hoc power above 80% for 30% of difference was reported in 3 (4.4%, 95% CI 1.5% to 12%) trials and for 50% difference in outcome it was reported in 42 (61.7%, 95% CI 49.8% to 72.3) trials. Conclusion: Negative clinical trials published in five Indian journals are poor in reporting of sample size calculation. It is also observed that most of trials are underpowered to see the 30% and 50% difference between the outcomes. There is a need to generate more awareness regarding the sample size and power calculation.
... Use of wrong or inappropriate statistical test is a common phenomenon observed in articles published in biomedical journals. [1][2][3][4] Wrong statistical tests can be seen in many conditions like use of paired test for unpaired data or use of parametric statistical tests for the data which does not follow the normal distribution or incompatibility of statistical tests with the type of data, etc. [5] Because of the availability of different types of statistical software, performing the statistical tests become easy, but selection of appropriate statistical test is still a problem. ...
... This type of data is observed to be used most frequently. [4] Example of ration data is height, weight, length, etc. In this type of data, it can be said meaningfully that 10 m of length is double than 5 m. ...
... ommonly used multivariable methods in medical literature are multiple linear regression, logistic regression and Cox-proportional hazards model. [1] Multivariable logistic regression (MLR) is used when response is binary and explanatories are two or more, which could be continuous, categorical, or ranked. MLR is commonly applied for two purposes: (i) to determine the effect of a specific covariate after adjustment for the potential confounders; and (ii) to estimate the likelihood of ...
... thods. A recent study conducted on 196 original articles published in two Indian pharmacology journals between 2002 and 2010 found that only 22% reported appropriate descriptive statistics. Only two out of 196 articles mentioned about the assumptions related to the statistical method applied, and 69.3% articles used appropriate statistical methods. [1] ...
Article
Full-text available
Use of multivariable logistic regression (MLR) modeling has steeply increased in the medical literature over the past few years. Testing of model assumptions and adequate reporting of MLR allow the reader to interpret results more accurately. To review the fulfillment of assumptions and reporting quality of MLR in selected Indian medical journals using established criteria. Analysis of published literature. Medknow.com publishes 68 Indian medical journals with open access. Eight of these journals had at least five articles using MLR between the years 1994 to 2008. Articles from each of these journals were evaluated according to the previously established 10-point quality criteria for reporting and to test the MLR model assumptions. SPSS 17 software and non-parametric test (Kruskal-Wallis H, Mann Whitney U, Spearman Correlation). One hundred and nine articles were finally found using MLR for analyzing the data in the selected eight journals. The number of such articles gradually increased after year 2003, but quality score remained almost similar over time. P value, odds ratio, and 95% confidence interval for coefficients in MLR was reported in 75.2% and sufficient cases (>10) per covariate of limiting sample size were reported in the 58.7% of the articles. No article reported the test for conformity of linear gradient for continuous covariates. Total score was not significantly different across the journals. However, involvement of statistician or epidemiologist as a co-author improved the average quality score significantly (P=0.014). Reporting of MLR in many Indian journals is incomplete. Only one article managed to score 8 out of 10 among 109 articles under review. All others scored less. Appropriate guidelines in instructions to authors, and pre-publication review of articles using MLR by a qualified statistician may improve quality of reporting.
... Hanif and Ajmal [16] also reported a comparable rate of inappropriate statistical tests (28.8%), Allam et al. [4] and Šimundić and Nikolac [17] reported higher rates (53.2% and 61.8%, respectively), while Karan et al. [18] reported a lower rate (7.0%). ...
Article
Full-text available
Background To identify and report flaws of Internet-published articles in the Journal of the Egyptian National Cancer Institute (JENCI), Cairo University, through a retrospective documentary study on articles published during the period from 2011 to 2016. All sections were reviewed against a collective checklist. Articles were grouped by publication year into 2 intervals: early (from 2011 to 2013) and recent (from 2014 to 2016) to identify changes in study characteristics over time. Results The study included 139 original articles. Half of the titles represented aim and 9.4% represented study design. Abstracts were concise, clear, with structured writing format in 98.6%, 93.5%, and 35.3%, respectively. Most introductions included the study aim, while 41% had a rationale. Study timing was reported in 59.0%, while the study design was reported in 25.9%. Inclusion and exclusion criteria were clearly reported in 43.1% and 40.1%, respectively. Statistical methods were mentioned in 80.6%, complete in 30.4%, and appropriate in 85.7%. Four studies reported sample size estimation. Only 52.5% and 58.3% of results were exhaustive and answer the research question, respectively. Incorrect statistical calculations occurred in 41.0%, inappropriate statistical tests or descriptive parameter selection in 26.6%, while inappropriate test application occurred in 49.1%. About 60% of discussions did not completely cover results, 31.7% fully justified the findings, 56.1% followed a logical flow, and 36.7% had contradiction within the text. Conclusions were mostly linked to aim, imprecise, and extrapolating beyond results. On comparing both periods, only a significant less misuse of statistical terms, more reporting conflict of interest, more missing references for cited texts in the recent period, and more participation of NCI over other institutes in the early period were found. Conclusion Articles published in JENCI (from 2011 to 2016) had many methodological and reporting defects and some points of strength. Using the collective checklist developed by this study, continuous training of researchers, involving epidemiologists throughout the whole research process, and applying strict journal reporting and publication rules should be encouraged.
... In most of the Intuition only few of the researchers follow the rule during the data analysis. In various research works published in biomedical journals journal we can see and observed the use of wrong or inappropriate statistical test used in the data analysis [1,2]. In order to do the statistical calculation and analysis there are lot of software. ...
Article
Full-text available
The main objective of writing this paper is to provide the algorithm of choosing the statistical tools in the data analysis. Choosing an appropriate test one of the most important task in research. So, that right test will give the valid conclusion and wrong test may give the misleading inference. In order to choose the right statistical test we should be familiar with the different variable and their nature (for parametric, test of normality)
... The most widely used approaches to multivariable models in clinical studies are multivariable linear regression, multivariable logistic regression (MLR), and proportional hazards regression [1] . The independent variables in MLR model can be continuous, categorical, or ordinal. ...
Article
Full-text available
Multivariable logistic regression (MLR) has been increasingly used in Chinese clinical medical research during the past few years. However, few evaluations of the quality of the reporting strategies in these studies are available. To evaluate the reporting quality and model accuracy of MLR used in published work, and related advice for authors, readers, reviewers, and editors. A total of 316 articles published in 5 leading Chinese clinical medical journals with high impact factor from January 2010 to July 2015 were selected for evaluation. Articles were evaluated according 12 established criteria for proper use and reporting of MLR models. Among the articles, the highest quality score was 9, the lowest 1, and the median 5 (4–5). A total of 85.1% of the articles scored below 6. No significant differences were found among these journals with respect to quality score (χ² = 6.706, P = .15). More than 50% of the articles met the following 5 criteria: complete identification of the statistical software application that was used (97.2%), calculation of the odds ratio and its confidence interval (86.4%), description of sufficient events (>10) per variable, selection of variables, and fitting procedure (78.2%, 69.3%, and 58.5%, respectively). Less than 35% of the articles reported the coding of variables (18.7%). The remaining 5 criteria were not satisfied by a sufficient number of articles: goodness-of-fit (10.1%), interactions (3.8%), checking for outliers (3.2%), collinearity (1.9%), and participation of statisticians and epidemiologists (0.3%). The criterion of conformity with linear gradients was applicable to 186 articles; however, only 7 (3.8%) mentioned or tested it. The reporting quality and model accuracy of MLR in selected articles were not satisfactory. In fact, severe deficiencies were noted. Only 1 article scored 9. We recommend authors, readers, reviewers, and editors to consider MLR models more carefully and cooperate more closely with statisticians and epidemiologists. Journals should develop statistical reporting guidelines concerning MLR.
... Use of wrong or inappropriate statistical test is thus very commonly observed in various research works published in biomedical journals. 2,3 Unlike in the past, the researchers of today neither have to perform statistical calculations manually nor do they have to be familiar with the mathematical principles of a statistical test. There are excellent softwares which can perform any statistical test in a very short time. ...
Article
Full-text available
The young researcher today is confronted with a choice of hundreds of statistical tests, both while reading research works or while planning own research. The principles guiding the choice of statistical tests are simple and this article, aimed at the young researcher, aims to demystify the same.
... [7][8][9] Except for very attempts that too with smaller samples for statistical reporting and methodological aspects of clinical trials, this issue is not adequately addressed for articles published in various Indian Medical Journals. [10][11][12] Present study aims to document use of various study designs and statistical methods used by authors in articles published in Indian Medical Journals. from these journals were analysed independently by fi rst and second author for study design and statistical methods (k = 0.85 for study design and 0.89 for statistical analysis). ...
Article
Aims and Objectives: Types of study designs and statistical method utilized by authors for articles published in foreign journals has been widely documented. Few such attempts are also done for articles published in Chinese and Pakistani Journals but no such attempt has been documented for articles published in Indian Medical Journals. This manuscript is an attempt to appraising articles published in Indian Medical Journals in terms of study designs and statistical methods used. Materials and Methods: All the original articles published in 2009 from 14 PubMed indexed Indian Medical Journals were analyzed for study designs and statistical methods. These journals were Annals of Indian Academy of Neurology (AIAN), Indian Journal of Orthopedics (IJOrtho), Indian Journal of Critical Care Medicine (IJCCM), Indian Journal of Dermatology, Venereology and Leprology (IJDVL), Indian Journal of Nephrology (IJN), Indian Journal of Dermatology (IJD), Indian Journal of Ophthalmology (IJO), Indian Journal of Urology (IJU), Indian Journal of Anesthesia (IJA), Indian Journal of Psychiatry (IJPsy), Indian Pediatrics (IP), Indian Journal of Medical Research (IJMR), Indian Journal of Medical Science (IJMS) and Indian Journal of Community Medicine (IJCM). Values are shown as frequency and percentages. Results: Majority of the published articles were based on prospective study designs (60.8%). Clinical trials were as method of study was documented in 12.5% articles. Three most common tests were test (23.2%), Chi-square test (21.9%) and analysis of variance (ANOVA) (10.5%). However, statistical methods were not at all used by 26.2% articles. Conclusions: Knowledge regarding prospective studies and statistical methods related to contingency table, test and ANOVA needs to be updated to the Undergraduate and Postgraduate students.
Article
Full-text available
Fundamental concepts in statistics form the cornerstone of scientific inquiry. If we fail to understand fully these fundamental concepts, then the scientific conclusions we reach are more likely to be wrong. This is more than supposition: for 60 years, statisticians have warned that the scientific literature harbors misunderstandings about basic statistical concepts. Original articles published in 1996 by the American Physiological Society's journals fared no better in their handling of basic statistical concepts. In this review, we summarize the two main scientific uses of statistics: hypothesis testing and estimation. Most scientists use statistics solely for hypothesis testing; often, however, estimation is more useful. We also illustrate the concepts of variability and uncertainty, and we demonstrate the essential distinction between statistical significance and scientific importance. An understanding of concepts such as variability, uncertainty, and significance is necessary, but it is not sufficient; we show also that the numerical results of statistical analyses have limitations.
Article
Full-text available
Objectives To identify the characteristics of the manuscripts submitted to the Indian Pediatrics; attributes of the peer-review process and decision-making; and factors associated with their acceptance or rejection. Methods All submissions to Indian Pediatrics during 2002 were analyzed by a retrospective review of records. Manuscripts were categorized by their place of origin (Indian vs. foreign), geographic region of India (north, south, east, west, central), submitting institution (teaching vs. non-teaching), subject (general pediatrics, systemic pediatrics, neonatology, genetic syndrome, allied sub-specialities, etc.), and type of article (research paper, case report, images, letter to editor, review, etc.). Manuscript details were recorded in a database that also included information on peer reviewer assignment, editorial and reviewer comments, and final disposition of the manuscript. Characteristics of accepted and rejected manuscripts were compared. Results Indian Pediatrics received 687 manuscripts for consideration in the year 2002; mostly from Indian authors (89%). Maximum contributions were received from North India (236, 39%) followed by 165 (27%) from South, 95 (16%) from West, 90 (15%) from Central and 26 (4%) from Eastern part of India. Of 687 papers, 457 (66%) articles qualified for peer review. Agreement between the reviewers was not significantly greater than that expected by chance; kappa for inter-rater agreement was 0.35, 0.17 and 0.21 between any two sets of reviewers for 431, 228 and 203 articles, respectively (P <0.005). Of 687 submitted manuscripts, 294(43%) were accepted, 347(50%) were rejected and no decision was possible on 46(7%) manuscripts. The top reasons for rejection were ‘absence of a message’, ‘lack of originality’, ‘inadequate methods’, ‘not relevant to journal’, ‘overinterpretation of results’, ‘unsatisfactory writing style’, ‘inaccurate/inconsistent/insufficient data’, and ‘inappropriate statistical analysis’, in that order. Median number of days (IQR) needed to reach the final decision was 81 (25–210) d; ranging from 8 (3–29.5) d for Images to180 (90–341) d for Research papers. No preference for acceptance was noted for foreign articles, geographic region of India, type of institution, or a particular topic, on both univariate and multivariate analysis. Conclusion Indian Pediatrics is receiving contributions from all over India. Majority of the manuscripts are peer-reviewed. Of every 10 articles submitted, almost 4 are accepted. Median time interval from submission to final decision is less than 3 months. The decision-making is not influenced by the place of origin of manuscript.
Article
Statistics is of ever increasing importance in medicine. We review the need for statistical knowledge in medicine, what doctors need to know about statistics and whether the state of statistical knowledge among doctors is adequate. We then consider the ways in which doctors acquire statistical knowledge. We discuss undergraduate and post-graduate teaching, the quality of many text-books, the example given by papers in medical journals, the use of statistical computer programs and statistical consultancy. Lastly we discuss ways in which both doctors and statisticians can act to improve matters.
Article
We compared the efficacy of intravenous gamma globulin plus aspirin with that of aspirin alone in reducing the frequency of coronary-artery abnormalities in children with acute Kawasaki syndrome in a multicenter, randomized trial. Children randomly assigned to the gamma globulin group received intravenous gamma globulin, 400 mg per kilogram of body weight per day, for four consecutive days; both treatment groups received aspirin, 100 mg per kilogram per day, through the 14th day of illness, then 3 to 5 mg per kilogram per day. Two-dimensional echocardiograms were interpreted blindly and independently by two or more readers. Two weeks after enrollment, coronary-artery abnormalities were present in 18 of 78 children (23 percent) in the aspirin group, as compared with 6 of 75 (8 percent) in the gamma globulin group (P = 0.01). Seven weeks after enrollment, abnormalities were present in 14 of 79 children (18 percent) in the aspirin group and in 3 of 79 (4 percent) in the gamma globulin group (P = 0.005). No child had serious adverse effects from receiving gamma globulin. We conclude that high-dose intravenous gamma globulin is safe and effective in reducing the prevalence of coronary-artery abnormalities when administered early in the course of Kawasaki syndrome.
Article
Reports of clinical trials often contain a wealth of data comparing treatments. This can lead to problems in interpretation, particularly when significance testing is used extensively. We examined 45 reports of comparative trials published in the British Medical Journal, the Lancet, or the New England Journal of Medicine to illustrate these statistical problems. The issues we considered included the analysis of multiple end points, the analysis of repeated measurements over time, subgroup analyses, trials of multiple treatments, and the overall number of significance tests in a trial report. Interpretation of large amounts of data is complicated by the common failure to specify in advance the intended size of a trial or statistical stopping rules for interim analyses. In addition, summaries or abstracts of trials tend to emphasize the more statistically significant end points. Overall, the reporting of clinical trials appears to be biased toward an exaggeration of treatment differences. Trials should have a clearer predefined policy for data analysis and reporting. In particular, a limited number of primary treatment comparisons should be specified in advance. The overuse of arbitrary significance levels (for example, P less than 0.05) is detrimental to good scientific reporting, and more emphasis should be given to the magnitude of treatment differences and to estimation methods such as confidence intervals.
Article
Myocardial damage in myocarditis is mediated, in part, by immunological mechanisms. High-dose intravenous gamma-globulin (IVIG) is an immunomodulatory agent that is beneficial in myocarditis secondary to Kawasaki disease, as well as in murine myocarditis. Since 1990, the routine management of presumed acute myocarditis at Children's Hospital, Boston, and Children's Hospital, Los Angeles, has included administration of high-dose IVIG. We treated 21 consecutive children presenting with presumed acute myocarditis with IVIG, 2 g/kg, over 24 hours, in addition to anticongestive therapies. A comparison group comprised 25 recent historical control patients meeting identical eligibility criteria but not receiving IVIG therapy. Left ventricular function was assessed during five time intervals: 0 to 7 days, 1 to 3 weeks, 3 weeks to 3 months, 3 to 6 months, and 6 to 12 months. At presentation, the IVIG and non-IVIG groups had comparable left ventricular enlargement and poor fractional shortening. Compared with the non-IVIG group, those treated with IVIG had a smaller mean adjusted left ventricular end-diastolic dimension and higher fractional shortening in the periods from 3 to 6 months (P = .008 and P = .033, respectively) and 6 to 12 months (P = .072 and P = .029, respectively). When adjusting for age, biopsy status, intravenous inotropic agents, and angiotensin-converting enzyme inhibitors, patients treated with IVIG were more likely to achieve normal left ventricular function during the first year after presentation (P = .03). By 1 year after presentation, the probability of survival tended to be higher among IVIG-treated patients (.84 versus .60, P = .069). We observed no adverse effects of IVIG administration. These data suggest that use of high-dose IVIG for treatment of acute myocarditis is associated with improved recovery of left ventricular function and with a tendency to better survival during the first year after presentation.
Article
Although an autoimmune pathogenesis has been postulated for dilated cardiomyopathy, immunosuppressive therapy has not been shown to be effective in clinical trials. Immune modulatory therapy with immune globulin is an effective therapy for Kawasaki disease in children, and recent data suggest that it improves ventricular function in children with new-onset dilated cardiomyopathies. The role of immune globulin therapy in adults with this disorder has not previously been evaluated. Ten patients were treated with high-dose intravenous immune globulin infusions (2 g/kg). All were hospitalized with NYHA class III to IV heart failure, left ventricular ejection fraction (LVEF) < 0.40, and symptoms for < 6 months at the time of presentation. One patient died before the completion of therapy. The remaining 9 were discharged, and LVEF was reassessed 12 months after therapy. LVEF improved from 0.24 +/- 0.02 (mean +/- SEM) at baseline to 0.41 +/- 0.04 at follow-up (P = .003). All 9 patients improved functionally to NYHA class I to II, and there have been no subsequent hospitalizations for heart failure during the course of follow-up. In this series of patients with new-onset dilated cardiomyopathy treated with high-dose immune globulin, LVEF improved 17 EF units. The effectiveness of intravenous immune globulin therapy in this disorder should be evaluated in a randomized, multicenter trial.
Article
I review some areas of medical statistics that have gained prominence over the last 5-10 years: meta-analysis, evidence-based medicine, and cluster randomized trials. I then consider several issues relating to data analysis and interpretation, many relating to the use and misuse of hypothesis testing, drawing on recent reviews of the use of statistics in medical journals. I also consider developments in the reporting of research in medical journals.
Article
Acute myocarditis is characterized by the rapid development of life-threatening congestive heart failure and arrhythmias. Although the initial stages of this disorder apparently result from direct cytopathic effects on the atrial and ventricular myocardium, later stages of progressive decompensation may result from immune-mediated myocyte destruction. There has been recent improvement in understanding the role of this immunologic cascade. As a result, treatment now begins earlier in the course of the disease and can target both the virus and the immune response. Our ability to implement mechanical support in children as a bridge to transplant or recovery, even in children presenting in the final stages of their disease, has led to an improved outcome regarding morbidity and mortality.