Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM.

Applied Physiology Laboratory, Division of Physical Therapy, Des Moines University-Osteopathic Medical Center, Des Moines, Iowa 50312, USA.
The Journal of Strength and Conditioning Research (Impact Factor: 1.86). 03/2005; 19(1):231-40. DOI: 10.1519/15184.1
Source: PubMed

ABSTRACT Reliability, the consistency of a test or measurement, is frequently quantified in the movement sciences literature. A common metric is the intraclass correlation coefficient (ICC). In addition, the SEM, which can be calculated from the ICC, is also frequently reported in reliability studies. However, there are several versions of the ICC, and confusion exists in the movement sciences regarding which ICC to use. Further, the utility of the SEM is not fully appreciated. In this review, the basics of classic reliability theory are addressed in the context of choosing and interpreting an ICC. The primary distinction between ICC equations is argued to be one concerning the inclusion (equations 2,1 and 2,k) or exclusion (equations 3,1 and 3,k) of systematic error in the denominator of the ICC equation. Inferential tests of mean differences, which are performed in the process of deriving the necessary variance components for the calculation of ICC values, are useful to determine if systematic error is present. If so, the measurement schedule should be modified (removing trials where learning and/or fatigue effects are present) to remove systematic error, and ICC equations that only consider random error may be safely used. The use of ICC values is discussed in the context of estimating the effects of measurement error on sample size, statistical power, and correlation attenuation. Finally, calculation and application of the SEM are discussed. It is shown how the SEM and its variants can be used to construct confidence intervals for individual scores and to determine the minimal difference needed to be exhibited for one to be confident that a true change in performance of an individual has occurred.

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Objective: The purpose of this study was to assess the effects of high-voltage electrical stimulation (HVES), continuous short wave diathermy, and physical exercise on arterial blood flow in the lower limbs of diabetic women with peripheral arterial disease. Methods: A crossover study was carried out involving 15 diabetic women (mean age of 77.87 ± 6.20 years) with a diagnosis of peripheral arterial disease. One session of each therapeutic resource was held, with a 7-day washout period between protocols. Blood flow velocity was evaluated before each session and 0, 20, 40 and 60 minutes after the administration of each protocol. Two-way repeated-measures analysis of variance with Bonferroni post hoc test was used for the intragroup and intergroup comparisons. Results: In the intragroup analysis, a significant reduction (P < .05) was found in blood flow velocity in the femoral and popliteal arteries over time with HVES and physical exercise and in the posterior tibial artery with the physical exercise protocol. However, no significant differences were found in the intergroup analysis (P > .05). Conclusion: Proximal blood circulation in the lower limb of diabetic women with peripheral arterial disease was increased by a single session of HVES and physical exercise, whereas distal circulation was only increased with physical exercise.
    Journal of Manipulative and Physiological Therapeutics 01/2015; · 1.25 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Assessment of control of posture using a task battery that represents work-related postural conditions is highly recommended for providing a comprehensive understanding of collective postural demands. However, dearth of evidence exists on the reliability of a task battery, thus precluding its use as an outcome measure in field research. This study investigated the intrasession reliability and systematic variation of force plate derived centre of pressure (COP) measures obtained during repeated performance of a task battery (lifting task, limits of stability and bipedal and unipedal stance). COP signals obtained during each task performance were processed to derive various time-domain COP measures. Statistical analyses revealed that 13 of the 19 COP measures displayed excellent relative (ICC(2,3) ≥ 0.75) and acceptable absolute reliability (SEM%: ≤ 10). Although COP measures displayed systematic variation, the differences were less or equal to the measurement error, except COP measures of unipedal stance and limits of stability. The chosen task battery is reliable and can be used for comprehensive evaluation of control of posture, in both field and laboratory research. Practitioner Summary: Repeated evaluation of multiple tasks together sequentially could introduce measurement variability. This study investigated intrasession reliability of a task battery representing common work-related postures. The chosen task battery was found to be reliable with acceptable measurement error and can be used in field research settings for evaluation of control of posture.
    Ergonomics 01/2015; · 1.61 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background Limited research exists examining the impact of nutrition on golfing performance. This study’s purpose was to determine the impact of daily supplementation with an over-the-counter dietary supplement on golf performance. Methods Healthy men (30.3 ± 6.9 y, 183.1 ± 5.6 cm, 86.7 ± 11.9 kg), with a 5–15 handicap were assigned in a double-blind, placebo-controlled manner to ingest for 30 days either a placebo (PLA, n = 13) or a dietary supplement containing creatine monohydrate, coffea arabica fruit extract, calcium fructoborate and vitamin D (Strong Drive™, SD, n = 14). Subjects ingested two daily doses for the first two weeks and one daily dose for the remaining two weeks. Participants followed their normal dietary habits and did not change their physical activity patterns. Two identical testing sessions in a pre/post fashion were completed consisting of a fasting blood sample, anthropometric measurements, 1-RM bench press, upper body power and golf swing performance using their driver and 7-iron. Data were analyzed using two-way mixed factorial ANOVAs and ANCOVA when baseline differences were present. Statistical significance was established a priori at p ≤ 0.05. Results ANCOVA revealed significantly greater (post-test) best drive distance (p = 0.04) for SD (+5.0% [+13.6 yards], ES = 0.75) as well as a tendency (p = 0.07) for average drive distance to increase (+8.4% [+19.6 yards], ES = 0.65), while no such changes were found with PLA (−0.5% [−1.2 yards], ES = 0.04 and +1.3% [+2.8 yards], ES = 0.08, respectively). Both groups experienced significant increases in body mass and 1-RM bench press (p < 0.001). No other significant group × time interactions were found. For the SD group only, within-group analysis confirmed significant improvements in set 1 average (+8.9%, p = 0.001) and peak velocity (+6.8%, p < =0.01). No changes were noted for reported adverse events, pain inventories, quality of life or any measured blood parameter. Conclusions SD supplementation for 30 days significantly improved best drive distance more than placebo. Supplementation was well tolerated and did not result in any clinically significant changes in markers of health or adverse events/side effect profiles.
    Journal of the International Society of Sports Nutrition 01/2015; 12(1). · 1.50 Impact Factor

Full-text (2 Sources)

Available from
May 20, 2014