ArticlePDF Available

Statistical Algorithms in Review Manager 5

Authors:

Abstract and Figures

i b i n1 Control i c i d i n2
Content may be subject to copyright.
Statistical algorithms in Review Manager 5
Jonathan J Deeks and Julian PT Higgins
on behalf of the Statistical Methods Group
of The Cochrane Collaboration
August 2010
Data structure
Consider a meta-analysis of k studies. When the studies have a dichotomous (binary) outcome the
results of each study can be presented in a 2×2 table (Table 1) giving the numbers of participant
who do or do not experience the event in each of the two groups (here called experimental (or 1)
and control (or 2)).
Table 1: Binary data
Study i
Event No event Total
Experimental
i
a
i
b
i
n
1
Control
i
c
i
d
i
n
2
If the outcome is a continuous measure, the number of participants in each of the two groups, their
mean response and the standard deviation of their responses are required to perform meta-analysis
(Table 2).
Table 2: Continuous data
Study
i
Group
size
Mean
response
Standard
deviation
Experimental
i
n
1
i
m
1
i
sd
1
Control
i
n
2
i
m
2
i
sd
2
If the outcome is analysed by comparing observed with expected values (for example using the
Peto method or a log-rank approach for time-to-event data), then ‘O – E’ statistics and their
variances are required to perform the meta-analysis. Group sizes may also be entered by the review
author, but are not involved in the analysis.
Table 3: O minus E and variance
Study
i
O minus E
Variance of
(O minus E)
Group size
(experimental)
Group size
(control)
i
Z
i
V
i
n
1
i
n
2
For other outcomes a generic approach can be used, the user directly specifying the values of the
intervention effect estimate and its standard error for each study (the standard error may be
calculable from a confidence interval). ‘Ratio’ measures of effect effects (e.g. odds ratio, risk ratio,
hazard ratio, ratio of means) will normally be expressed on a log-scale, ‘difference’ measures of
1
effect (e.g. risk difference, differences in means) will normally be expressed on their natural scale.
Group sizes can optionally be entered by the review author, but are not involved in the analysis.
Table 4: Generic data
Study
i
Estimate of
effect
Standard error of
estimate
Group size
(experimental)
Group size
(control)
ˆ
i
θ
{
}
ˆ
SE
i
θ
i
n
1
i
n
2
Formulae for individual studies
Individual study estimates: dichotomous outcomes
Peto odds ratio
For study idenote the cell counts as in Table n1, with
iii
ba
+
=
1
e given by
, , and let
ar
i
d
ii
cn +=
2
iii
nnN
21
+= . For the Peto method, the individual odds ratios
,
exp
i
Peto i
i
Z
OR
V
=
⎩⎭
.
The logarithm of the odds ratio has standard error
()
{}
,
1
SE ln
Peto i
i
OR
V
=
,
where
i
Z
is the ‘O – E’ statistic:
]
E
ii i
Z
aa=− ,
with
[]
(
)
1
E
ii i
i
i
nac
a
N
+
=
(the expected number of events in the experimental intervention group), and
(
)
(
)
()
12
2
1
iiiii i
i
ii
nn a c b d
V
NN
++
=
(the hypergeometric variance of ).
i
a
Odds ratio
For methods other than the Peto method, the odds ratio for each study is given by
ii
i
ii
ad
OR
bc
=
,
the standard error of the log odds ratio being
()
{}
1111
SE ln
i
iii i
OR
abcd
=
+++
.
Risk ratio
The risk ratio for each study is given by
1
2
/
/
ii
i
ii
an
RR
cn
=
,
2
the standard error of the log risk ratio being
()
{}
12
111 1
SE ln
i
ii i
RR
acn n
=+
i
.
Risk difference
The risk difference for each study is given by
12
ii
i
ii
ac
RD
nn
=−,
with standard error
{}
33
12
SE
ii i i
i
ii
ab cd
RD
nn
=+
.
Empty cells
Where zeros cause problems with computation of effects or standard errors, 0.5 is added to all cells
( , , , ) for that study, except when
i
a
i
b
i
c
i
d 0
=
=
ii
ca or 0
=
=
ii
db , when the relative effect
measures and
i
OR
i
R
R are undefined.
Individual study estimates: continuous outcomes
Denote the number of participants, mean and standard deviation as in Table 2, and let
iii
nnN
21
+
=
and
()()
22
112
11
2
iii
i
i
nsdnsd
s
N
−+
=
2i
i
be the pooled standard deviation across the two groups.
Difference in means (mean difference)
The difference in means (referred to as mean difference) is given by
12ii
M
Dmm
=
,
with standard error
{}
22
12
12
SE
ii
i
ii
sd sd
MD
nn
=+.
Standardized difference in means (standardized mean difference)
There are several popular formulations of the standardized mean difference. The one implemented
in RevMan is Hedges’ adjusted g, which is very similar to Cohen's d, but includes an adjustment
for small sample bias
12
3
1
49
ii
i
ii
mm
SMD
sN
⎛⎞
=−
⎜⎟
⎝⎠
,
with standard error
{}
()
2
12
SE
23.9
ii
i
ii i
NSMD
SMD
nn N
=+
4
.
3
Individual study estimates: O – E and variance
For study ithe effect estimate is given by
ˆ
i
i
i
Z
V
θ= ,
with standard error
{}
1
ˆ
SE θ=
i
i
V
.
The effect estimate is either of a log odds ratio or a log hazard ratio, depending on how the
observed and expected values were derived.
Individual study estimates: Generic method
As the user directly enters the intervention effect estimates and their standard errors no further
processing is needed. All types of intervention effects are eligible for this method, but it might be
most useful when intervention effects have been calculated in a way which makes special
consideration of design (e.g. cluster randomized and cross-over trials), are adjusted for other effects
(adjusted effects from non-randomized studies) or are not covered by existing methods (e.g. ratios
of means, relative event rates).
Meta-analysis methods
All summations are over i, from 1 to the number of studies, unless otherwise specified.
Mantel-Haenszel methods for combining results across studies
Odds ratio
The Mantel-Haenszel summary log odds ratio is given by
()
,
,
ln ln
⎛⎞
=
⎝⎠
M
Hi i
MH
MH i
wOR
OR
w
, (1)
and the Mantel-Haenszel summary odds ratio by
,
,
M
Hi i
MH
MH i
wOR
OR
w
=
,
where each study’s odds ratio is given weight
,
ii
MH i
i
bc
w
N
=
.
The summary log odds ratio has standard error given by
()
{}
2
1
SE ln
2
MH
EFGH
OR
2
R
RS S
+
=++
⎝⎠
, (2)
where
ii
i
ad
R
N
=
;
ii
i
bc
S
N
=
;
4
()
2
iiii
i
adad
E
N
+
=
;
(
)
2
iii
i
adbc
F
N
+
=
i
;
()
2
iiii
i
bcad
G
N
+
=
;
(
)
2
iiii
i
bcbc
H
N
+
=
.
Risk ratio
The Mantel-Haenszel summary log risk ratio is given by
()
,
,
ln ln
⎛⎞
=
⎝⎠
M
Hi i
MH
MH i
wRR
RR
w
, (3)
and the Mantel-Haenszel summary risk ratio by
,
,
M
Hi i
MH
MH i
wRR
RR
w
=
,
where each study’s risk ratio is given weight
(
)
,
ii i
MH i
i
ca b
w
N
+
= .
The summary log risk ratio has standard error given by
()
{}
SE ln
MH
P
RR
R
S
=
, (4)
where
()
12
2
ii i i iii
i
nn a c acN
P
N
+−
=
;
2ii
i
an
R
N
=
;
1ii
i
cn
S
N
=
.
Risk difference
The Mantel-Haenszel summary risk difference is given by
,
,
M
Hi i
MH
MH i
wRD
RD
w
=
, (5)
where each study’s risk difference is given weight
12
,
ii
MH i
i
nn
w
N
= .
The summary risk difference has standard error given by
{}
2
SE
MH
J
RD
K
=
, (6)
where
33
21
2
12
ii i i i i
iii
abn cdn
J
nnN
+
=
;
12ii
i
nn
K
N
=
.
Test for heterogeneity
The heterogeneity test statistic is given by
(
)
2
ˆˆ
MH i i MH
Qwθ
,
5
where represents the log odds ratio, log risk ratio or risk difference and the are the weights
calculated as
ˆ
θ
i
w
{
}
2
ˆ
1SE
i
θ rather than the weights used for the Mantel-Haenszel meta-analyses.
Under the null hypothesis that there are no differences in intervention effect among studies this
follows a chi-squared distribution with
1
k degrees of freedom (where is the number of studies
contributing to the meta-analysis).
k
The statistic I
2
is calculated as
(
)
2
1
max 100% ,0
MH
MH
Qk
I
Q
−−
⎩⎭
This measures the extent of inconsistency among the studies’ results, and is interpreted as
approximately the proportion of total variation in study estimates that is due to heterogeneity rather
than sampling error.
Inverse-variance methods for combining results across studies
Inverse-variance methods are used to pool log odds ratios, log risk ratios and risk differences as one
of the analysis options for binary data, to pool all mean differences and standardized mean
differences for continuous data, and also for combining intervention effect estimates in the generic
method. In the general formula the intervention effect estimate is denoted by , which is the
study’s log odds ratio, log risk ratio, risk difference, mean difference or standardized mean
difference, or the estimate of intervention effect in the generic method. The individual effect sizes
are weighted according to the reciprocal of their variance (calculated as the square of the standard
error given in the individual study section above) giving
ˆ
i
θ
{}
()
2
1
ˆ
SE
i
i
w =
θ
.
These are combined to give a summary estimate
ˆ
ˆ
ii
IV
i
w
w
θ
θ=
. (7)
with
{
}
1
ˆ
SE
IV
i
w
θ=
. (8)
The heterogeneity statistic is given by a similar formula as for the Mantel-Haenszel method:
(
)
2
ˆˆ
IV i i IV
Qwθ
.
Under the null hypothesis that there are no differences in intervention effect among studies this
follows a chi-squared distribution with
1
k degrees of freedom (where is the number of studies
contributing to the meta-analysis). I
2
is calculated as
k
(
)
2
1
max 100% ,0
IV
IV
Qk
I
Q
−−
⎩⎭
.
Peto's method for combining results across studies
The Peto summary log odds ratio is given by
6
()
(
)
,
ln
ln =
iPeto
Peto
i
VOR
OR
V
i
. (9)
and the summary odds ratio by
(
)
,
ln
exp
iPetoi
Peto
i
VOR
OR
V
=
⎩⎭
,
where the odds ratio
,
P
eto i
OR is calculated using the approximate method described in the individual
study section, and are the hypergeometric variances.
i
V
The log odds ratio has standard error
()
{}
1
SE ln
Peto
i
OR
V
=
. (10)
The heterogeneity statistic is given by
()
()
{
}
2
2
,
ln ln
Peto i Peto i Peto
Q V OR OR=−
.
Under the null hypothesis that there are no differences in intervention effect among studies this
follows a chi-squared distribution with
1
k degrees of freedom (where is the number of studies
contributing to the meta-analysis). I
2
is calculated as
k
(
)
2
1
max 100% ,0
Peto
Peto
Qk
I
Q
−−
⎩⎭
.
O – E and variance method for combining studies
This is an implementation of the Peto method, which allows its application to time-to-event data as
well as binary data. The summary effect estimate is given by
ˆ
ˆ
θ
θ=
ii
i
V
V
, (11)
where the estimate, , from study i is calculated from
ˆ
i
θ
i
Z
and as for individual studies. The
summary effect is either a log odds ratio or a log hazard ratio (the user should specify which). The
effect estimate (on a non-log scale) is given by
i
V
ˆ
effect estimate exp
θ
=
⎩⎭
ii
i
V
V
,
and is either an odds ratio or a hazard ratio.
The effect estimate (on the log scale) has standard error
{
}
1
ˆ
SE
i
V
θ=
. (12)
The heterogeneity statistic is given by
(
)
22
ˆˆ
Peto i i
QV
=
θ−θ
.
Under the null hypothesis that there are no differences in intervention effect among studies this
follows a chi-squared distribution with
1
k degrees of freedom (where is the number of studies
contributing to the meta-analysis). I
2
is calculated as
k
7
(
)
2
1
max 100% ,0
Peto
Peto
Qk
I
Q
−−
⎩⎭
.
DerSimonian and Laird random-effects models
Under the random-effects model, the assumption of a common intervention effect is relaxed, and
the effect sizes are assumed to have a distribution
(
)
2
,
i
N
θ
∼θτ.
The estimate of is given by
2
τ
()
()
2
2
1
ˆ
max , 0
iii
Qk
www
−−
τ=
⎩⎭
∑∑
,
where the are the inverse-variance weights, calculated as
i
w
{}
2
1
ˆ
SE
i
i
w =
θ
,
for log odds ratio, log risk ratio, risk difference, mean difference, standardized mean difference, or
for the intervention effect in the generic method, as appropriate.
For continuous data and for the generic method, Q is . For binary data, either or
IV
Q
IV
Q
M
H
Q may
be taken. Both are implemented in RevMan 5 (and this is the only difference between random-
effects methods under ‘Mantel-Haenszel’ and ‘inverse-variance’ options). Again, for odds ratios,
risk ratios and other ratio effects, the effect size is taken on the natural logarithmic scale.
Each study’s effect size is given weight
{}
2
2
1
ˆ
ˆ
SE
i
i
w
=
θ
.
The summary effect size is given by
ˆ
ˆ
ii
DL
i
w
w
θ
θ=
, (13)
and
{
}
1
ˆ
SE
DL
i
w
θ=
. (14)
Note that in the case where the heterogeneity statistic is less than or equal to its degrees of
freedom , the estimate of the between study variation,
, is zero, and the weights coincide
with those given by the inverse-variance method.
Q
)1( k
2
ˆ
τ
Confidence intervals
The )%1(100
α
confidence interval for
ˆ
θ
is given by
)
to
{
}
()
ˆˆ
SE 1 2θ+ θ Φ −α ,
{
}
(
ˆˆ
SE 1 2θ− θ Φ −α
where is the log odds ratio, log risk ratio, risk difference, mean difference, standardized mean
difference or generic intervention effect estimate, and
ˆ
θ
Φ
is the standard normal deviate. For log
odds ratios, log risk ratios and generic intervention effects entered on the log scale (and identified
8
as such by the review author), the point estimate and confidence interval limits are exponentiated
for presentation.
Test statistics
Test for presence of an overall intervention effect
In all cases, the test statistic is given by
()
ˆ
ˆ
SE
Z
θ
=
θ
,
where the odds ratio, risk ratio and other ratio measures are again considered on the log scale.
Under the null hypothesis that there is no overall effect of intervention effect this follows a standard
normal distribution.
Test for comparison of subgroups
The test is valid for all methods. It is based on the notion of performing a test for heterogeneity
across
subgroups rather than across studies. Let
ˆ
θ
j
be the summary effect size for subgroup j, with
standard error
{
}
ˆ
θ
j
SE . The summary effect size may be based on either a fixed-effect or a
random-effects meta-analysis. For fixed-effect meta-analyses, these numbers correspond to above
equations (1) and (2); (3) and (4); (5) and (6); (7) and (8); (9) and (10); or (11) and (12), each
applied within each subgroup. For random-effects meta-analyses, these numbers correspond to
equations (13) and (14), each applied within each subgroup. Note that for ratio measures, all
computations here are performed on the log scale.
First we compute a weight for each subgroup:
{}
2
1
ˆ
SE
=
θ
j
j
w ,
then we perform a (fixed-effect) meta-analysis of the summary effect sizes across subgroups:
ˆ
ˆ
θ
θ=
jj
tot
j
w
w
.
The test statistic for differences across subgroups is given by
(
)
2
ˆˆ
θ
int j j tot
Qw .
Under the null hypothesis that there are no differences in intervention effect across subgroups this
follows a chi-squared distribution with
1
S degrees of freedom (where S is the number of
subgroups with summary effect sizes).
I
2
for differences across subgroups is calculated as
(
)
2
1
max 100% ,0
−−
⎩⎭
int
int
QS
I
Q
.
This measures the extent of inconsistency across the subgroups’ results, and is interpreted as
approximately the proportion of total variation in subgroup estimates that is due to genuine
variation across subgroups rather than sampling error.
9
Note. An alternative formulation for fixed-effect meta-analyses (inverse variance and Peto methods
only) is as follows. The
Q statistic defined by either or
IV
Q
P
eto
Q is calculated separately for each of
the
S subgroups and for the totality of studies, yielding statistics , …, and . The test
statistic is given by
1
Q
S
Q
tot
Q
1=
=−
S
int tot j
j
QQ Q
.
This is identical to the test statistic given above, in these specific situations.
10
11
Bibliography
Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. Introduction to Meta-analysis. John Wiley
& Sons, 2009.
Breslow NE, Day NE. Combination of results from a series of 2x2 tables; control of confounding.
In: Statistical Methods in Cancer Research, Volume 1: The analysis of case-control data. IARC
Scientific Publications No.32. Lyon: International Agency for Health Research on Cancer, 1980.
Deeks JJ, Altman DG, Bradburn MJ. Statistical methods for examining heterogeneity and
combining results from several studies in a meta-analysis. In: Egger M, Davey Smith G, Altman
DG. Systematic Reviewes and Healthcare: meta-analysis in context. BMJ Publications (in press).
DerSimonian R, Laird N. Meta-analysis in clinical trials. Controlled Clinical Trials 1986; 7: 177-
188.
Greenland S, Robins J. Estimation of a common effect parameter from sparse follow-up data.
Biometrics 1985;41: 55-68.
Greenland S, Salvan A. Bias in the one-step method for pooling study results. Statistics in Medicine
1990; 9:247-252.
Hedges LV, Olkin I. Statistical Methods for Meta-analysis. San Diego: Academic Press 1985.
Chapter 5.
Higgins JPT, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analysis.
BMJ 2003; 327: 557-560.
Mantel N, Haenszel W. Statistical aspects of the analysis of data from retrospective studies of
disease. Journal of the National Cancer Institute 1959;22: 719-748.
Robins J, Greenland S, Breslow NE. A general estimator for the variance of the Mantel-Haenszel
odds ratio. American Journal of Epidemiolgy 1986; 124:719-723.
Rosenthal R. Parametric measures of effect size. In: Cooper H, Hedges LV (eds.). The Handbook
of Research Synthesis. New York: Russell Sage Foundation, 1994.
Sinclair JC, Bracken MB. Effective Care of the Newborn infant.Oxford: Oxford University Press
1992.Chapter 2.
Yusuf S, Peto R, Lewis J, Collins R, Sleight P. Beta blockade during and after myocardial
infarction: an overview of the randomized trials. Progress in Cardiovascular Diseases 1985;27:335-
371.
... Paired analyses were applied to all crossover trials with the use of a within-individual correlation coefficient between the treatments of 0.5, as described by Elbourne et al., to calculate the SEs [29][30][31]. To mitigate a unit-of-analysis error, when arms of trials with multiple intervention or control arms were used more than once, the corresponding sample size was divided by the number of times it was used for the calculation of the standard error of the pooled effect [32]. ...
... a Because all included trials were randomized controlled trials, the certainty of the evidence was graded as high for all outcomes by default and then downgraded or upgraded based on prespecified criteria. Criteria for downgrades included risk of bias (ROB) (downgraded if most trials were considered to be at high ROB); inconsistency (downgraded if there was substantial unexplained heterogeneity: I 2 ≥ 50%; PQ < 0.10); indirectness (downgraded if there were factors absent or present relating to the participants, interventions, or outcomes that limited the generalizability of the results); imprecision (downgraded if the 95% confidence intervals crossed the minimally important difference (MID) for harm or benefit set at 0.1 mmol/L (5%) for LDL-C, non-HDL-C, HDL-C, and TG and ± 0.04 g/L for apoB [32][33][34][35], or there was a concern with the robustness of the estimate resulting from sensitivity analyses); and publication bias (downgraded if there was evidence of publication bias based on the funnel plot asymmetry and/or significant Egger's or Begg's test (p < 0.10) with the confirmation of evidence of small study effects by adjustment using the trim-and-fill analysis of Duval and Tweedie [42]). The criteria for upgrades included a significant dose-response gradient that supports the direction of the pooled effect estimate. ...
... a Because all included trials were randomized controlled trials, the certainty of the evidence was graded as high for all outcomes by default and then downgraded or upgraded based on prespecified criteria. Criteria for downgrades included risk of bias (ROB) (downgraded if most trials were considered to be at high ROB); inconsistency (downgraded if there was substantial unexplained heterogeneity: I 2 ≥ 50%; P Q < 0.10); indirectness (downgraded if there were factors absent or present relating to the participants, interventions, or outcomes that limited the generalizability of the results); imprecision (downgraded if the 95% confidence intervals crossed the minimally important difference (MID) for harm or benefit set at 0.1 mmol/L (5%) for LDL-C, non-HDL-C, HDL-C, and TG and ± 0.04 g/L for apoB [32][33][34][35], or there was a concern with the robustness of the estimate resulting from sensitivity analyses); and publication bias (downgraded if there was evidence of publication bias based on the funnel plot asymmetry and/or significant Egger's or Begg's test (p < 0.10) with the confirmation of evidence of small study effects by adjustment using the trim-and-fill analysis of Duval and Tweedie [42]). The criteria for upgrades included a significant dose-response gradient that supports the direction of the pooled effect estimate. ...
Article
Full-text available
Background: Many clinical practice guidelines recommend dietary pulses for the prevention and management of cardiovascular disease and diabetes. The impact of extracted pulse proteins remains unclear. We therefore conducted a systematic review and meta-analysis of randomized controlled trials of the effect of extracted pulse proteins on therapeutic lipid targets. Methods and Findings: MEDLINE, Embase, and the Cochrane Library were searched through April 2024 for trials of ≥3-weeks. The primary outcome was low-density lipoprotein-cholesterol (LDL-C). The secondary outcomes were other lipid targets. Independent reviewers extracted data and assessed the risk of bias. Subgroup analyses included by pulse type and the certainty of evidence was assessed using GRADE. Results: Seven included trials (14 trial comparisons, n = 453) with a median of 4-weeks duration and dose of 35 g/day showed that extracted pulse proteins decreased LDL-C by −0.23 mmol/L (95% confidence interval: −0.36 to −0.10 mmol/L, p < 0.001). Similar effects were observed for non-high-density lipoprotein-cholesterol and apolipoprotein B. No interactions were found by pulse type. Subgroup analyses revealed effect modification by sex, with greater proportions of females seeing greater reductions. GRADE was generally moderate. Conclusions: Extracted pulse proteins likely result in moderate reductions in LDL-C and other lipid targets. Future studies on various types of extracted pulse proteins including assessments by sex are warranted.
... Se calcularon los OR con sus respectivos IC del 95% para cuantificar el riesgo asociado a cada factor (Deeks y Higgins, 2007). Esto proporciona una medida clara de la relación entre los factores de riesgo y la actividad motriz. ...
Article
Full-text available
Introducción: Los niveles de resolución de problemas permiten caracterizar dos grupos uno denominado “apropiado” y otro “no apropiado” que pueden servir para mejorar el nivel de creatividad y el rendimiento académico de los estudiantes. Por tanto, las dimensiones de la creatividad motriz y las habilidades de los dos grupos para resolver problemas deben ser investigadas de manera longitudinal y espacial en estudiantes. Metodología: Se identificaron dos grupos de resolución de problemas y se establecerán diferencias estadísticamente significativas de las medias entre los grupos identificados a través de un ANOVA y/o MANOVA en las puntuaciones de creatividad motriz. Aplicamos estadísticas correlacionales de Spearman. La muestra fue no aleatoria y basada en conveniencia. Se segmentaron datos por género y edad usando SPSS, y se realizaron análisis estadísticos como regresión múltiple, Odds Ratio (OR), intervalos de confianza (IC) para los riesgos. Se utilizaron redes neuronales para evaluar la importancia de las variables. Resultados: Se obtuvo diferencias entre las medias de nivel de creatividad motriz en los dos grupos de resolución de problemas. La comprobación de las diferencias de medias está sustentada en el estadístico de Fisher. las pruebas de correlación de Pearson y Spearman indicaron una relación significativa entre la creatividad motriz y la resolución de problemas con diversos objetos. El test de Kolmogorov-Smirnov confirmó la distribución normal de las variables. Discusión: Los hallazgos sugieren que la creatividad motriz está significativamente influenciada por las habilidades para resolver problemas, subrayando la necesidad de programas educativos que incorporen actividades físicas creativas. Similares resultados son establecidos con pruebas de comparación de medias de pruebas paramétricas (t- Student) y no paramétricas (U de Mann-Whitney). Conclusiones: Es importante intervenir el grupo denominado “no apropiado” para la mejora de su nivel de creatividad motriz. Asimismo, la creatividad motriz está estrechamente vinculada a la resolución de problemas, y las estrategias educativas que promueven el movimiento creativo pueden impactar significativamente en el desarrollo integral de los estudiantes.
... Shapiro-Wilk test), the correlation between the study groups was analyzed using a two-tailed, paired t-test, where P<0.05 meant statistically significant. who met Petersen's criteria (17,18). GraphPad Prism, version 9 (San Diego, CA, USA) was used for the analyses and derivation of figures. ...
Article
Full-text available
Background Left ventricular non-compaction (LVNC) is still a pathology around which there are numerous controversies regarding the criteria for its diagnosis, presentation, prognosis, and even classification into the appropriate group of diseases. So far, about 190 genes in which mutations may be associated with LVNC have been described, and in each of them, several to several dozen different loci have been discovered. We decided to analyze the frequency of single nucleotide variants (SNVs) in correlation to Petersen’s criteria. Methods We retrospectively analyzed the results of cardiac magnetic resonance (CMR) studies. Twenty-three patients who met Petersen’s criteria agreed to participate in the research and take blood samples for genetic testing. Next, we prospectively included 24 volunteers who did not meet Petersen’s criteria. Petersen’s criteria were complied with ratio of non-compacted to compacted myocardium (NC/C) ≥2.3. A total of 47 DNA samples were analyzed based on the selected regions of the following genes: β-myosin heavy chain (MYH7), α-cardiac actin (ACTC1), cardiac troponin T (TNNT2), myosin binding protein-C (MYBPC3), LIM-domain binding protein 3 (LBD3), and taffazin (TAZ). Results In total, 248 substitutions in exons and introns were obtained for all analyzed samples. No statistically significant differences were detected between the mentioned groups. No significant difference in either downward or upward trends in the number of substitutions in relation to the increasing trabeculation is observed. We indicated differences in the occurrence of the studied SNVs between groups, especially for rs8037241 (3’UTR region of ACTC1) and rs2675686 (LDB3), but they also did not show statistical significance. Although we did not find a significant correlation between the co-occurrence of individual mutations with LVNC, it is worth noting that the presence of one of the four mutations in the range rs8037241 (ACTC1 3’UTR), rs3729998 (TNNT2e. 12), and rs727503240 (MYH7e. 39) increases the risk of LVNC more than 4 times. An inverse association between the number of SNVs and the meeting the Petersen’s criteria was demonstrated for studied LDB3 region and rs397516254 in exon 39 of the MYH7 gene. Conclusions To our knowledge, no studies have been published comparing the prevalence of selected SNVs in a group of healthy subjects and in a group meeting the Petersen criteria for LVNC. Among both completely healthy individuals who did not meet the Petersen criteria for LVNC as well as those with symptoms who met these criteria we found a similar incidence of SNVs in the ACTC1, TNNT2, LDB3 and MYH7 genes segments analyzed. Further studies are required to confirm or exclude “potentially protective” SNV in the 39th exon of MYH7 (rs397516254) and the role of co-occurrence of individual SNVs in rs8037241 (ACTC1 3’UTR), rs3729998 (TNNT2), and rs727503240 (MYH7) for the increase of the risk of LVNC.
... Where relative risk, number needed to treat and confidence intervals could be calculated but were not provided by the authors, these statistics were calculated by the research team using the methods of Altman [84,85]. Where zeros caused problems with computation of the relative risk or its standard error, 0.5 was added to all cells as per the methods of Pagano and Gauvreau, Deeks and Higgins, and Altman [86,87]. Relative risk was preferred to odds ratios as per the guidance of Sackett et al. [88]. ...
Article
Full-text available
Globally, emergency medical services (EMSs) report that their demand is dominated by non-emergency (such as urgent and primary care) requests. Appropriately managing these is a major challenge for EMSs, with one mechanism employed being specialist community paramedics. This review guides policy by evaluating the economic impact of specialist community paramedic models from a healthcare system perspective. A multidisciplinary team (health economics, emergency care, paramedicine, nursing) was formed, and a protocol registered on PROSPERO (CRD42023397840) and published open access. Eligible studies included experimental and analytical observational study designs of economic evaluation outcomes of patients requesting EMSs via an emergency telephone line (‘000’, ‘111’, ‘999’, ‘911’ or equivalent) responded to by specialist community paramedics, compared to patients attended by usual care (i.e. standard paramedics). A three-stage systematic search was performed, including Peer Review of Electronic Search Strategies (PRESS) and Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA). Two independent reviewers extracted and verified 51 unique characteristics from 11 studies, costs were inflated and converted, and outcomes were synthesised with comparisons by model, population, education and reliability of findings. Eleven studies (n = 7136 intervention group) met the criteria. These included one cost-utility analysis (measuring both costs and consequences), four costing studies (measuring cost only) and six cohort studies (measuring consequences only). Quality was measured using Joanna Briggs Institute tools, and was moderate for ten studies, and low for one. Models included autonomous paramedics (six studies, n = 4132 intervention), physician oversight (three studies, n = 932 intervention) and/or special populations (five studies, n = 3004 intervention). Twenty-one outcomes were reported. Models unanimously reduced emergency department (ED) transportation by 14–78% (higher quality studies reduced emergency department transportation by 50–54%, n = 2639 intervention, p < 0.001), and costs were reduced by AU3381227perattendanceinfourstudies(n=2962).Onestudyperformedaneconomicevaluation(n=1549),findingboththatthecostswerereducedbyAU338–1227 per attendance in four studies (n = 2962). One study performed an economic evaluation (n = 1549), finding both that the costs were reduced by AU454 per attendance (although not statistically significant), and consequently that the intervention dominated with a > 95% chance of the model being cost effective at the UK incremental cost-effectiveness ratio threshold. Community paramedic roles within EMSs reduced ED transportation by approximately half. However, the rate was highly variable owing to structural (such as local policies) and stochastic (such as the patient’s medical condition) factors. As models unanimously reduced ED transportation—a major contributor to costs—they in turn lead to net healthcare system savings, provided there is sufficient demand to outweigh model costs and generate net savings. However, all models shift costs from EDs to EMSs, and therefore appropriate redistribution of benefits may be necessary to incentivise EMS investment. Policymakers for EMSs could consider negotiating with their health department, local ED or insurers to introduce a rebate for successful community paramedic non-ED-transportations. Following this, geographical areas with suitable non-emergency demand could be identified, and community paramedic models introduced and tested with a prospective economic evaluation or, where this is not feasible, with sufficient data collection to enable a post hoc analysis.
... 19 To pool data across studies using inverse variance methods, the data are transformed on to an additive scale using the natural logarithm. 20 Publication bias was assessed using funnel plots and explored using trim and fill methods where applicable. 17 Due to a lack of studies included in the meta-analysis, subgroup analyses were not explored. ...
Article
Full-text available
Background Intravenous fluid therapy is a ubiquitous intervention for the management of patients with sepsis, however excessive cumulative fluid balance has been shown to result in worse outcomes. Hyperoncotic albumin is presented in low volumes, is an effective resuscitation fluid and may have effects beyond plasma volume expansion alone. This systematic review aimed to assess the efficacy, safety and effectiveness of hyperoncotic albumin solutions in the management of sepsis. Methods We searched four databases and two trial registries for controlled clinical trials of hyperoncotic albumin for management of sepsis. Review outcomes were mortality, need for renal replacement therapy, cumulative-fluid balance, and need for organ support. We used methods guided by the Cochrane Handbook for reviews of clinical interventions. Studies were assessed using Cochrane’s Risk of Bias 2 tool. We performed pairwise meta-analysis where possible. Certainty of evidence was assessed using GRADE. Results We included six trials; four (2772 patients) were meta-analysed. Most studies had moderate or high risk of bias. There was no significant difference in 28-day mortality for septic patients receiving hyperoncotic albumin compared to other intravenous fluids (OR 0.95, [95% CI: 0.8–1.12]); in patients with septic shock (2013 patients) there was a significant reduction (OR 0.82 [95% CI: 0.68–0.98]). There was no significant difference in safety outcomes. Hyperoncotic albumin was associated with variable reduction in early cumulative fluid balance and faster resolution of shock. Conclusions There is no good-quality evidence to support the use of hyperoncotic albumin in patients with sepsis, but it may reduce short-term mortality in the sub-groups with septic shock. It appears safe in terms of need for renal replacement therapy and is associated with reduced early cumulative fluid balance and faster resolution of shock. Larger, better quality randomised controlled trials in patients with septic shock may enhance the certainty of these findings. Review registration PROSPERO ref: CRD42021150674
... Since the baselines can only make text extractions, we manually convert them into numbers suitable for meta-analysis. 44 This made very strong baselines since they combined LLM extraction with human post-processing. We assessed the performance using the Accuracy metric. ...
Preprint
Full-text available
Automatic medical discovery by AI is a dream of many. One step toward that goal is to create an AI model to understand clinical studies and synthesize clinical evidence from the literature. Clinical evidence synthesis currently relies on systematic reviews of clinical trials and retrospective analyses from medical literature. However, the rapid expansion of publications presents challenges in efficiently identifying, summarizing, and updating evidence. We introduce TrialMind, a generative AI-based pipeline for conducting medical systematic reviews, encompassing study search, screening, and data extraction phases. We utilize large language models (LLMs) to drive each pipeline component while incorporating human expert oversight to minimize errors. To facilitate evaluation, we also create a benchmark dataset TrialReviewBench, a custom dataset with 870 annotated clinical studies from 25 meta-analysis papers across various medical treatments. Our results demonstrate that TrialMind significantly improves the literature review process, achieving high recall rates (0.897-1.000) in study searching from over 20 million PubMed studies and outperforming traditional language model embeddings-based methods in screening (Recall@20 of 0.227-0.246 vs. 0.000-0.102). Furthermore, our approach surpasses direct GPT-4 performance in result extraction, with accuracy ranging from 0.65 to 0.84. We also support clinical evidence synthesis in forest plots, as validated by eight human annotators who preferred TrialMind over the GPT-4 baseline with a winning rate of 62.5%-100% across the involved reviews. Our findings suggest that an LLM-based clinical evidence synthesis approach, such as TrialMind, can enable reliable and high-quality clinical evidence synthesis to improve clinical research efficiency.
Article
Background: Personal listening devices (PLDs) include all electronic gadgets that allow users to listen to audio uninterrupted for prolonged periods without disturbing the people around. PLDs are used by youngsters not only to listen to recreational music, but also to hear audio books, online videos and for online educational programs specifically in the current COVID 19 pandemic scenario. Objectives: To assess the pattern of usage of personal listening devices and the knowledge of its harmful effects among students of professional colleges in central Kerala. Materials & Method: A cross sectional study was conducted on 363 students of a private medical and engineering college in central Kerala using a semi-structured questionnaire. Result: Out of total 363 students who gave consent for study, 77% of medical students and 56% of engineering students were PLD users. The students used PLDs for both educational and recreational purposes (mainly gaming). Earphones/in ear devices were used by 52% of the students. There is predominance in the number of students who have been using PLDs for 2-5 years (54.3% medical and 37.2% engineering) and currently using it 3-5 days per week for more than 4 hours per day. 63% of students listen to more than 50% volume of the device and 80% of the users increase the volume in a noisy environment. 35% of the medical students and 37% of engineering students clean their PLDs frequently. Only a few students (13%) share their earphones, but most students (60%) do not clean it before sharing. Although medical students were able to correctly answer most of the knowledge questions better than the engineering students, it was found that only 24.4% of the medical and 10.1% of the engineering students knew that the highest safe exposure level is 85dB of sound up to a maximum of 8 hours.
Article
Full-text available
The effect of practice schedule on retention and transfer has been studied since the first publication on contextual interference (CI) in 1966. However, strongly advocated by scientists and practitioners, the CI effect also aroused some doubts. Therefore, our objective was to review the existing literature on CI and to determine how it affects retention in motor learning. We found 1255 articles in the following databases: Scopus, EBSCO, Web of Science, PsycINFO, ScienceDirect, supplemented by the Google Scholar search engine. We screened full texts of 294 studies, of which 54 were included in the meta-analysis. In the meta-analyses, two different models were applied, i.e., a three-level mixed model and random-effects model with averaged effect sizes from single studies. According to both analyses, high CI has a medium beneficial effect on the whole population. These effects were statistically significant. We found that the random practice schedule in laboratory settings effectively improved motor skills retention. On the contrary, in the applied setting, the beneficial effect of random practice on the retention was almost negligible. The random schedule was more beneficial for retention in older adults (large effect size) and in adults (medium effect size). In young participants, the pooled effect size was negligible and statically insignificant.
Chapter
Full-text available
Article
The one-step (Peto) method for obtaining pooled effect estimates can yield extremely biased results when applied to unbalanced data. Even for balanced studies, the one-step estimate may incorporate an unacceptable degree of bias. In place of the one-step estimate, we recommend use of ordinary Mantel-Haenszel, weighted least squares, or maximum likelihood estimates whenever the total number of events is adequate for such methods. If the total number of events is small, we recommend exact methods.
Article
Long-term beta blockade for perhaps a year or so following discharge after an MI is now of proven value, and for many such patients mortality reductions of about 25% can be achieved. No important differences are clearly apparent among the benefits of different beta blockers, although some are more convenient than others (or have slightly fewer side effects), and it appears that those with appreciable intrinsic sympathomimetic activity may confer less benefit. If monitored, the side effects of long-term therapy are not a major problem, as when they occur they are easily reversible by changing the beta blocker or by discontinuation of treatment. By contrast, although very early IV short-term beta blockade can definitely limit infarct size, more reliable information about the effects of such treatment on mortality will not be available until a large trial (ISIS) reports later this year, with data on some thousands of patients entered within less than 4 hours of the onset of pain. Our aim has been not only to review the 65-odd randomized beta blocker trials but also to demonstrate that when many randomized trials have all applied one general approach to treatment, it is often not appropriate to base inference on individual trial results. Although there will usually be important differences from one trial to another (in eligibility, treatment, end-point assessment, and so on), physicians who wish to decide whether to adopt a particular treatment policy should try to make their decision in the light of an overview of all these related randomized trials and not just a few particular trial results. Although most trials are too small to be individually reliable, this defect of size may be rectified by an overview of many trials, as long as appropriate statistical methods are used. Fortunately, robust statistical methods exist--based on direct, unweighted summation of one O-E value from each trial--that are simple for physicians to use and understand yet provide full statistical sensitivity. These methods allow combination of information from different trials while avoiding the unjustified direct comparison of patients in one trial with patients in another. (Moreover, they can be extended of such data that there is no real need for the introduction of any more complex statistical methods that might be more difficult for physicians to trust.) Their robustness, sensitivity, and avoidance of unnecessary complexity make these particular methods an important tool in trial overviews.
Article
This commentary presents a new esti- of matching variables, unmatched varimator of the variance of the natural log of ables, or both. This notation encompasses the Mantel-Haenszel odds ratio and illus- the full spectrum of matched and untrates its application in typical case-control matched designs. For example, in the pairdesigns. In contrast with any single previ- matched study, MI and Mo are always one; ously proposed estimator (1-7), this esti- in a design with J controls uniquely mator is easily computed and can be used matched to each case (1: J matching), MI in the analysis of data from any sampling is always one and Mo is always J; in an design (e.g., individually matched, category unmatched design, MI and Mo will usually matched, or unmatched). Furthermore, in vary in an arbitrary fashion across the a series of Monte Carlo simulations, this strata. estimator yielded confidence limits as good Suppose now that the parameter of inas or better than those obtained using pre- terest is the odds ratio and the odds ratio vious estimators (8). has a common (constant) value across strata. If R = AD/T and S = BC/T, the BACKGROUND Mantel-Haenszel estirnator2f the common We will use the following notation to odds ratio is defined as ORMH = R+/S+, represent data values in a basic stratum