Clinical Rehabilitation 2007; 21: 640–647
Sensitivity, specificity and predictive value
of the clinical trunk muscle endurance tests
in low back pain
Amir Massoud Arab, Mahyar Salavati Department of Physical Therapy, University of Social Welfare and Rehabilitation
Sciences, Evin, Ismaeil Ebrahimi Faculty of Rehabilitation, Iran University of Medical Sciences and Mohammad Ebrahim
Mousavi Orthopaedics, University of Social Welfare and Rehabilitation Sciences, Evin, Tehran, Iran
Received 1st August 2006; returned for revisions 28th November 2006; revised manuscript accepted 10th December 2006.
Objective: To describe the sensitivity, specificity, positive predictive value,
negative predictive value and diagnostic accuracy of five clinical tests used to
measure trunk muscle endurance in low back pain.
Design: A cross-sectional non-experimental design.
Setting: Orthopaedic and physical therapy departments of four hospitals and
outpatient physical therapy clinics, Tehran, Iran.
Subjects: A convenience sample of 200 subjects participated in this study.
Subjects were categorized into four groups: men without low back pain (N = 50,
mean (SD) age = 38 (12) years), women without low back pain (N = 50, mean
(SD) age = 43 (11) years), men with low back pain (N = 50, mean (SD) age = 39
(12) years) and women with low back pain (N = 50, mean (SD) age = 43 (12)
years).
Main measures: Five clinical static endurance tests of trunk muscles (the
Sorensen test, prone isometric chest raise test, prone double straight-leg raise
test, supine isometric chest raise test and supine double straight-leg raise test)
were performed in each group.
Results: Receiver operating characteristic (ROC) curve analysis, performed
separately for men and women, showed that the prone double straight-leg raise
test had the highest sensitivity, specificity and predictive value in low back
pain among the tests performed.
Conclusions: It seems that the prone double straight-leg raise test has higher
sensitivity, specificity and predictive value in low back pain than the other tests
and could be a useful clinical method for testing spinal muscle endurance to
predict the probability of the occurrence of low back pain.
Address for correspondence: Amir Massoud Arab, Department of
Physical Therapy, University of Social Welfare and Rehabilitation
Sciences, Evin, Koodakyar Ave., PO Box 19834, Tehran, Iran.
Low back pain is one of the most common and costly
musculoskeletal complaints in today’s societies,
affecting up to 70–80% of the population with at least
one episode during their lifetime.
Despite its high
incidence and detrimental effects on individuals’
activities, the exact causes of mechanical low back
pain are not yet fully understood, as no
approach to diagnosis or treatment has been shown to
be clearly effective.
In recent decades the main
focus has been placed on trunk muscle endurance and
its association with low back pain. The back extensor
muscles are considered to be postural muscles that aid
in maintaining upright standing posture and controlling
lumbar forward bending.
Numerous studies have
shown a significant decrease in back extensor muscle
endurance in patients with low back pain.
It has been reported that evaluation of the endurance of
trunk extensor muscles has greater discriminative
validity than evaluation of muscle strength
and could be a very good predictor of low back pain.
Some electromyographic studies indicate
that the paraspinal muscles in patients with low back
pain have a faster fatigue rate compared with those in
asymptomatic subjects.
Moreover, some investigators have focused on the endurance of the trunk
flexors in low back pain because of their significant
role in normal function of the lumbo-pelvic area.
It has also been reported that abdominal muscular
endurance in patients with low back pain is less than
that in the normal healthy population,
and apparent
loss of muscle control following trunk muscle fatigue
could be considered one of the important causes
of low back pain.
Thus testing trunk muscle
endurance would seem to be very important in the
prediction, prevention and rehabilitation of low back pain.
Several types of testing methods, such as static
endurance tests, active measures of endurance, and
isokinetic and electromyographic testing, have been studied in
the literature.
Of the different available assessment
strategies, isometric endurance testing seems to be
cost-effective, easy and quick to perform, and requires
no special equipment in the clinic, so clinicians would
choose it for measuring trunk muscle endurance.
Different static endurance testing methods
and evidence regarding their utilization have been
reported in the literature. Most commonly, they are:
the prone isometric chest raise test as described by Ito
et al., McIntosh et al. and others;
the prone double straight-leg raise test as described by McIntosh
et al. and Moreau et al.;
the supine isometric chest raise
test as described by Ito et al. and McIntosh et al.;
the supine double straight-leg raise test
as described by McIntosh et al.;
and the Sorensen test.
The diagnostic accuracy and suitability
of a clinical test can be measured by comparing the test
results to the true condition of the patient. The most
widely used measures for evaluating the accuracy
and suitability of clinical tests with binary outcomes are the
sensitivity, specificity and predictive values of the test.
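As an illustration of these definitions, the following minimal Python sketch computes the measures from the four cells of a 2 × 2 table that compares the test result with the true condition. The class name, function name and example counts are hypothetical and are not data from this study.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Cells of a 2 x 2 table (hypothetical helper, not from the paper)."""
    tp: int  # condition present, test positive
    fn: int  # condition present, test negative
    fp: int  # condition absent, test positive
    tn: int  # condition absent, test negative


def diagnostic_metrics(c: ConfusionCounts) -> dict:
    """Return the standard accuracy measures for a binary test."""
    total = c.tp + c.fn + c.fp + c.tn
    return {
        # proportion of subjects with the condition who test positive
        "sensitivity": c.tp / (c.tp + c.fn),
        # proportion of subjects without the condition who test negative
        "specificity": c.tn / (c.tn + c.fp),
        # proportion of positive tests that are truly positive
        "ppv": c.tp / (c.tp + c.fp),
        # proportion of negative tests that are truly negative
        "npv": c.tn / (c.tn + c.fn),
        # proportion of all subjects classified correctly
        "accuracy": (c.tp + c.tn) / total,
    }


# Invented counts for 50 subjects with and 50 without the condition.
example = ConfusionCounts(tp=45, fn=5, fp=10, tn=40)
metrics = diagnostic_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Note that the predictive values, unlike sensitivity and specificity, depend on the proportion of subjects with the condition in the sample.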
Several studies have shown a significant difference
between normal subjects and those with low back pain
in these tests. However, a more in-depth review of the literature
reveals that most previous studies considered only
one of these tests in a relatively small sample, and
the several available measures of trunk muscle endurance
have not been compared with one another. The current
study examined five clinical isometric
trunk muscle endurance tests together in a relatively large
sample of subjects with and without low back pain and
identified the sensitivity, specificity and predictive values
of each test, to describe how well people with and
without low back pain can be classified on
the basis of their clinical endurance test values.
Two hundred subjects between the ages of 20 and
65 were selected from four hospitals.
All the individuals who participated in the study
filled out a simple health questionnaire. Those who met
the selection criteria were included in the study. All the
subjects signed an informed consent form approved by
the human subjects committee at the University of
Social Welfare and Rehabilitation Sciences before
participating in the study. Subjects were categorized into
four groups of men and women with and without low
back pain: men without low back pain (N = 50, mean
(SD) age 38 (12) years), women without low back pain
(N = 50, mean (SD) age 43 (11) years), men with low
back pain (N = 50, mean (SD) age 39 (12) years), and
women with low back pain (N = 50, mean (SD) age 43
(12) years). The mean age, height and weight of the
subjects in each group are shown in Table 1.
Selection criteria
Subjects were included if they had no history of
spinal surgery, no spinal or pelvic fracture, no
history of hospitalization for severe trauma or injuries
from a car accident, no history of osteoarthritis or
fracture of the lower extremities and no history
of any systemic disease, such as arthritis or
tuberculosis. Control subjects were evaluated and found
to have no complaint of any pain or dysfunction
in the low back, thoracic or neck area or the lower
extremities, and no neuromuscular disorders.
Patients were included if they had a history of low
back pain for more than six weeks before the study,
or had intermittent back pain and had experienced at
least three episodes of low back pain, each lasting
more than one week, during the year before the study.
None of the patients or control subjects had
referred leg pain.
Reliability assessment
Using 30 asymptomatic subjects (15 male and 15
female volunteers), we assessed intratester and
intertester reliability of the measurements. The first
examiner completed the tests on a subject and then,
after 15 minutes, repeated the tests in a random order
on the same subject. The second examiner then tested
the subject, following the same procedure.
The description of the measurement procedure for
each test was as follows:
● Sorensen test: This is the most widely used test in
published studies evaluating the isometric
endurance of trunk extensor muscles. During the
test, the patient was on the examining table in the
prone position with the upper edge of the iliac crests
aligned with the edge of the table. The lower body
was fixed to the table by three straps, located around
the pelvis, knees and ankles. With the arms folded
across the chest, the patient was asked to maintain
the unsupported upper body in horizontal position
until he or she could no longer control the posture or
had no more tolerance for the procedure.
● Prone isometric chest raise test: This was done
with the subject lying prone on a treatment table
with a pad under the abdomen and the arms
along the sides. The subject was instructed to lift
the upper trunk about 30 degrees from the table
while flexing the neck and to hold the sternum
off the floor as much as possible. The test
consisted of holding this position as long as possible
while breathing normally. The detailed
procedure for this test is described by Ito et al.
● Prone double straight-leg raise test: The subject’s
position was prone with hips extended, the hands
underneath the forehead and the arms perpendicular
to the body. The subject was then instructed to
raise both legs until knee clearance was achieved.
The examiner monitored knee clearance by sliding
one hand under the thighs. The time was recorded
in seconds, and the test was terminated when the
subject was no longer able to maintain knee clearance.
The detailed procedure for this test is
described by McIntosh et al.
Table 1 Descriptive statistics for the age, height, weight and the clinical endurance test scores in subjects with and without low back pain

                                           Men                         Women
Variables                                  Without LBP   With LBP      Without LBP   With LBP
                                           (N = 50)      (N = 50)      (N = 50)      (N = 50)
Age (years)                                38 (12)       39 (12)       43 (11)       43 (12)
Height (cm)                                170 (6)       172 (7)       166 (7)       160 (6)
Weight (kg)                                70 (12)       69 (11)       68 (13)       67 (10)
Sorensen test (s)                          35 (7)        27 (8)        36 (7)        25 (6)
Prone isometric chest raise test (s)       40 (9)        33 (15)       52 (18)       30 (7)
Supine isometric chest raise test (s)      43 (9)        33 (5)        32 (5)        28 (6)
Prone double straight-leg raise test (s)   38 (6)        26 (4)        35 (5)        26 (3)
Supine double straight-leg raise test (s)  28 (4)        24 (5)        28 (4)        23 (3)

Values are mean (SD).
LBP, low back pain.
● Supine isometric chest raise test: This was done
with the subject lying supine on a treatment table
with the hands crossed on his or her chest. The
knees and hips were in 90 degrees of flexion. The
subject was instructed to lift the neck and upper trunk
from the table and hold this position as long as
possible.
● Supine double straight-leg raise test: For this
test we followed the method described by
McIntosh et al.
to assess the endurance of the
lower abdominal muscles. The subject began in
the supine-lying position, hips extended, with the
hands lying beside the trunk. The subject was
then instructed to raise both legs about 20 degrees
from the floor and hold this position as long as
possible without any tilting of the pelvis. The
examiner monitored pelvic tilt during the test. The
time was recorded in seconds and the test was
terminated when the subject was no longer able to
maintain knee clearance.
The examiner administered the clinical tests in random
order, with no fixed sequence assigned to particular subjects.
The research was reviewed and approved by
the Human Subject Committee at the University of Social
Welfare and Rehabilitation Sciences.
Data analysis
The intraclass correlation coefficient (ICC), based on a
two-way random effects model, was used to assess
intratester and intertester reliability of the measurements as
described by Shrout and Fleiss.
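The two-way ICC forms of Shrout and Fleiss can be computed from the mean squares of a subjects-by-raters analysis of variance. The sketch below shows one way to do this in Python for the single-measure forms ICC(2,1) and ICC(3,1). The function name and the example ratings are illustrative assumptions, not the software or data used in this study.

```python
def icc_two_way(scores):
    """Return (ICC(2,1), ICC(3,1)) for a subjects x raters table.

    `scores` is a list of rows, one row per subject and one column
    per rater (or per repeated session for intratester reliability).
    """
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(scores[i][j] for i in range(n)) / n for j in range(k)]

    # Sums of squares of the two-way layout without replication.
    ss_subjects = k * sum((m - grand) ** 2 for m in row_means)
    ss_raters = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_subjects - ss_raters

    bms = ss_subjects / (n - 1)            # between-subjects mean square
    jms = ss_raters / (k - 1)              # between-raters mean square
    ems = ss_error / ((n - 1) * (k - 1))   # residual mean square

    # Two-way random effects, absolute agreement, single measure.
    icc_2_1 = (bms - ems) / (bms + (k - 1) * ems + k * (jms - ems) / n)
    # Two-way mixed effects, consistency, single measure.
    icc_3_1 = (bms - ems) / (bms + (k - 1) * ems)
    return icc_2_1, icc_3_1


# Illustrative ratings: 6 subjects scored by 4 raters (not study data).
ratings = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
icc21, icc31 = icc_two_way(ratings)
print(f"ICC(2,1) = {icc21:.2f}, ICC(3,1) = {icc31:.2f}")
```

In this example ICC(2,1) is much lower than ICC(3,1) because the raters differ systematically in level, which counts against absolute agreement but not against consistency.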
Receiver operating characteristic (ROC) curve analysis in
MedCalc statistical software (MedCalc, Mariakerke,
Belgium) was used to determine a cut-off value for
each test, and the sensitivity, specificity, predictive
values and area under the curve of each test were
calculated. The ROC curve is a plot of sensitivity versus
1 − specificity of a variable assessed against an
external criterion.
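A minimal Python sketch of this construction follows. It assumes that a shorter holding time indicates low back pain, so a subject is classified as positive when the holding time is at or below a threshold. It takes the cut-off as the threshold that maximizes sensitivity + specificity − 1 (the Youden index), which is one common criterion and not necessarily the rule applied by MedCalc. The function names and holding times are hypothetical.

```python
def roc_points(lbp_times, control_times):
    """Return (threshold, sensitivity, specificity) for every candidate cut-off.

    A subject is called positive when the holding time is <= threshold.
    """
    thresholds = sorted(set(lbp_times) | set(control_times))
    points = []
    for t in thresholds:
        sensitivity = sum(x <= t for x in lbp_times) / len(lbp_times)
        specificity = sum(x > t for x in control_times) / len(control_times)
        points.append((t, sensitivity, specificity))
    return points


def best_cutoff(points):
    """Pick the threshold with the largest Youden index (sens + spec - 1)."""
    return max(points, key=lambda p: p[1] + p[2] - 1)


def area_under_curve(lbp_times, control_times):
    """AUC as the probability that a random subject with low back pain
    holds for a shorter time than a random control (ties count one half)."""
    wins = sum(
        (p < c) + 0.5 * (p == c)
        for p in lbp_times
        for c in control_times
    )
    return wins / (len(lbp_times) * len(control_times))


# Hypothetical holding times in seconds (not data from this study).
example_lbp = [22, 24, 25, 26, 27, 28, 30, 31]
example_controls = [29, 32, 34, 35, 36, 38, 40, 41]

cutoff = best_cutoff(roc_points(example_lbp, example_controls))
area = area_under_curve(example_lbp, example_controls)
print("cut-off, sensitivity, specificity:", cutoff)
print("area under the ROC curve:", area)
```

Plotting sensitivity against 1 − specificity for all returned points gives the ROC curve; an area of 1.0 corresponds to perfect separation of the two groups and 0.5 to chance.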
Equivalently, the ROC curve is the
representation of the trade-offs between sensitivity
and specificity. Having or not having low back pain
was used as the external criterion for constructing
the ROC curves in this study. MedCalc statistical
software reports, as the cut-off score, the value of the
tested variable with the highest combination of sensitivity
and specificity, that is, the value that best discriminates between
subjects with and without the condition when the
tested variable is used as a diagnostic tool. Separate cut-off
values and ROC curves were obtained for men and women.
Descriptive statistics for the subjects and test scores in
each group are presented in Table 1. Table 2 presents
the ICC for each test taken in the pilot study. Except
for the Sorensen test, all other ICC values were
greater than 0.80 (Table 2).
The cut-off value, sensitivity, specificity, positive
predictive value, negative predictive value and area
under the ROC curve for the tests in men and women
are presented in Table 3. ROC curve analysis, performed
separately for men and women, showed that
although all tests had reasonably good sensitivity
and specificity in low back pain,
the prone double straight-leg raise test had the highest
sensitivity, specificity and predictive value (Table
3, Figures 1 and 2). It also had the highest area under
the ROC curve of all the tests in both
men and women. Other tests had high sensitivity with
relatively low specificity or vice versa (Table 3).
Our data indicate a relatively good sensitivity and
specificity and predictive value in all performed tests
Table 2 Intraclass correlation coefficient values for intratester and intertester reliability for the measurements performed in the study (N = 30 subjects)

Measurements                            Tester 1    Tester 2    Intertester
                                        ICC(3,1)    ICC(3,1)    ICC(2,1)
Sorensen test                           0.80        0.79        0.78
Prone isometric chest raise test        0.90        0.89        0.90
Supine isometric chest raise test       0.92        0.90        0.89
Prone double straight-leg raise test    0.87        0.85        0.83
Supine double straight-leg raise test   0.84        0.85        0.79

ICC, intraclass correlation coefficient.
in low back pain. This finding is in accordance with
other studies showing a significant decrease in trunk
muscle endurance in patients with chronic low back
pain.
Because these muscles are rich in larger
diameter type I muscle fibres,
they are suited to
support low levels of activity for long periods of
time. Investigators have attributed the decreased
muscle endurance found in patients with low back
pain to higher muscle metabolite levels resulting from
prolonged muscle tension and spasm, muscle
deconditioning and inhibition of the paraspinal muscles
in response to pain and decreased activity.
However, the significance of this study was in
assessing several clinical isometric tests that have
been used to measure trunk muscle endurance together
to compare the relative significance of each test
Figure 1 ROC curve for the performed tests in men (horizontal axis: 100 − specificity). Soren,
Sorensen test; SICR, supine isometric chest raise test;
SDSLR, supine double straight-leg raise test; PICR, prone
isometric chest raise test; PDSLR, prone double straight-leg
raise test.
Figure 2 ROC curve for the performed tests in women (horizontal axis: 100 − specificity).
Soren, Sorensen test; SICR, supine isometric chest raise
test; SDSLR, supine double straight-leg raise test; PICR,
prone isometric chest raise test; PDSLR, prone double
straight-leg raise test.
Table 3 The cut-off score, sensitivity, specificity, predictive value and area under the ROC curve for the performed tests

Tests                                   Cut-off score   Sen.    Spec.   +PV     −PV     Area
Sorensen test
  Men                                   >28             92.3    76.0    80.8    90.0    0.85
  Women                                 >29             84.3    84.6    84.3    84.6    0.90
Prone isometric chest raise test
  Men                                   >31             80.8    80.0    80.08   80.0    0.79
  Women                                 >33             98.0    84.6    86.2    97.8    0.93
Supine isometric chest raise test
  Men                                   >34             96.2    72.0    78.1    94.7    0.88
  Women                                 >24             99.4    32.7    59.3    99.4    0.63
Prone double straight-leg raise test
  Men                                   >30             96.2    100     100     96.2    0.99
  Women                                 >29             100     92.3    92.7    100     0.97
Supine double straight-leg raise test
  Men                                   >25             92.3    80.0    82.8    90.9    0.83
  Women                                 >25             98.0    84.6    86.2    97.8    0.95

Sen., sensitivity; Spec., specificity; +PV, positive predictive value; −PV, negative predictive value; Area, area under the ROC
curve (maximum = 1.0).
Equal numbers of men (100) and women (100) were used for all conditions.
or abdomen.
Latikka et al. also reported a 50% failure
rate in performing the Sorensen test.
