ArticlePDF Available

Automated Collection of Fatigue Ratings at the Top of Descent: A Practical Commercial Airline Tool

Authors:

Abstract and Figures

There is a need to develop an efficient and accurate way of assessing pilot fatigue in commercial airline operations. We investigated the validity of an automated system to collect pilot ratings of alertness at the top of descent, comparing the data obtained with existing results from previous studies and those predicted by the validated SAFE fatigue model. Boeing 777 pilots were prompted to enter a Samn-Perelli fatigue scale rating directly into the flight management system of the aircraft shortly prior to descent on a variety of short- and long-haul commercial flights. These data were examined to evaluate whether the patterns were in line with predicted effects of duty length, crew number, and circadian factors. We also compared the results with data from previous studies as well as SAFE model predictions for equivalent routes. The effects of duty length, time of day, and crew complement were in line with expected trends and with data from previous studies; the correlation with predictions from the SAFE model was high (r = 0.88). Fatigue ratings were greater on longer trips (except where mitigated by adding an extra pilot) and on overnight sectors (4.68 vs 3.77). The results suggest that the automated collection of subjective ratings is a valid way to collect data on fatigue in an airline setting. This method has potential benefits for the crew in assessing fatigue risk prior to approach, as part of a fatigue risk management system, with the possibility of wider safety benefits.
Content may be subject to copyright.
Aviation, Space, and Environmental Medicine x Vol. 82, No. 11 x November 2011 1037
RESEARCH ARTICLE
P OWELL DMC, S PENCER MB, P ETRIE KJ. Automated collection of
fatigue ratings at the top of descent: a practical commercial airline
tool. Aviat Space Environ Med 2011; 82:1037 41.
Introduction: There is a need to develop an effi cient and accurate way
of assessing pilot fatigue in commercial airline operations. We investi-
gated the validity of an automated system to collect pilot ratings of alert-
ness at the top of descent, comparing the data obtained with existing
results from previous studies and those predicted by the validated SAFE
fatigue model. Methods: Boeing 777 pilots were prompted to enter a
Samn-Perelli fatigue scale rating directly into the fl ight management sys-
tem of the aircraft shortly prior to descent on a variety of short- and long-
haul commercial fl ights. These data were examined to evaluate whether
the patterns were in line with predicted effects of duty length, crew num-
ber, and circadian factors. We also compared the results with data from
previous studies as well as SAFE model predictions for equivalent routes.
Results: The effects of duty length, time of day, and crew complement
were in line with expected trends and with data from previous studies;
the correlation with predictions from the SAFE model was high (r 5
0.88). Fatigue ratings were greater on longer trips (except where miti-
gated by adding an extra pilot) and on overnight sectors (4.68 vs 3.77).
Discussion: The results suggest that the automated collection of subjec-
tive ratings is a valid way to collect data on fatigue in an airline setting.
This method has potential benefi ts for the crew in assessing fatigue risk
prior to approach, as part of a fatigue risk management system, with the
possibility of wider safety benefi ts.
Keywords: fatigue , intervention , work hours , circadian rhythm , duty
time limitations .
A N IMPORTANT ISSUE in commercial airline oper-
ations is the evaluation of the effect of different sec-
tors and work patterns on pilot fatigue. This information
is useful for identifying problem sectors and duties that
may compromise the safety of the operation. Such data
would also be useful for monitoring changes in fatigue
which may indicate the need for an intervention, such as
adjusting departure times, re-specifying prefl ight rest
provisions, or perhaps adding an additional pilot ( 4 , 5 ).
One of the diffi culties of collecting ongoing fatigue
measures from pilots is that the methodologies currently
available are labor-intensive and costly. Typically, these
have involved researchers either accompanying crew
on a duty and collecting fatigue ratings over the course
of each sector, or briefi ng crew prior to departure to fi ll
out various fatigue ratings and reaction time tasks dur-
ing each fl ight ( 8 ). While these approaches produce valid
estimates of fatigue levels in specifi c duties, they are
impractical to use across the whole of a large airline’s
operation. What is needed to assess fatigue across the
whole of a commercial airline’s operation is a valid mea-
sure of fatigue at critical phases of fl ight that can be
completed by crew quickly and easily without the use of
other personnel or equipment.
In this study we evaluated the validity of pilots enter-
ing their fatigue rating directly into the fl ight manage-
ment system of the aircraft just prior to the top of descent.
To do this we designed a system that prompts pilots to
enter a Samn-Perelli fatigue rating scale ( 10 ) for each
crewmember into a special screen. This allowed for the
routine collection of fatigue data from the crews of those
aircraft fi tted with this modifi cation.
We evaluated the validity of this methodology in a
mixture of short- and long-haul operations. Firstly, we
predicted that fatigue levels in the automated top-
of-descent ratings would be greater in long-haul fl ights
(operating across time zones and usually at night) than
in short-haul duties (generally daylight hours, returning
to home base). Further, we predicted that in short-haul
ights, fatigue levels would be greater at the end of duty,
on the return sector, compared to the end of the outward
sector. We also predicted that in long-haul fl ights, fa-
tigue would be less for the daylight fl ights than for the
otherwise similar overnight fl ights and, fi nally, that the
rating method would refl ect the predicted benefi cial
effect of a fourth pilot on longer duties. We compared
the results collected by the automated top-of-descent
method with data gathered in previous studies using
questionnaires and with predictions made by the widely
used fatigue predictive model, SAFE ( 1 ).
METHODS
Subjects
Pilots ying Air New Zealand 777-200 aircraft were
contacted in advance of the study and invited to partici-
pate. All participants were pilots already scheduled to
y the studied patterns on the days available for testing;
From Air New Zealand, Auckland, New Zealand; the University of
Otago, Wellington, New Zealand; and the University of Auckland,
Auckland, New Zealand.
This manuscript was received for review in May 2011 . It was
accepted for publication in August 2011 .
Address correspondence and reprint requests to: David M. C. Powell,
Aviation Medicine Specialist, Air New Zealand, Private Bag 92007,
Auckland 1142, New Zealand; david.powell@airnz.co.nz .
Reprint & Copyright © by the Aerospace Medical Association,
Alexandria, VA.
DOI: 10.3357/ASEM.3115.2011
Automated Collection of Fatigue Ratings at the Top
of Descent: A Practical Commercial Airline Tool
David M. C. Powell , Mick B. Spencer ,
and Keith J. Petrie
1038 Aviation, Space, and Environmental Medicine x Vol. 82, No. 11 x November 2011
TOP-OF-DESCENT ALERTNESS — POWELL ET AL.
no pilots were excluded from testing. Data were col-
lected noninvasively and anonymously during the nor-
mal work of the pilots, and participation was voluntary.
No demographic data were collected from the subjects
and, as the collection of data was anonymous, it is not
known which pilots chose to participate.
Procedure
The modifi cation to enable the top-of-descent fatigue
ratings was made on the fl ight management system
software of Boeing 777-200 aircraft. This aircraft was op-
erating a mixture of long-haul (10 13 h fl ight time) inter-
national sectors and out-and-back duties from home
base in New Zealand to destinations in Australia and
the Pacifi c Islands, returning to home base within the
same duty period. The long-haul international sectors,
with only one exception, operated through the night,
with three or four pilot crews to allow each pilot the op-
portunity to sleep in the aircraft crew rest compartment.
The exception was a daylight fl ight from New Zealand
to Japan, with three pilots. The short-haul duties were
out-and-back duties, conducted between the hours of
0750 (earliest scheduled departure) and 2000 (latest
scheduled arrival). Some of these were fl own by two pi-
lots, without crew rest; however, the fl ights between
Auckland and the Cook Islands, and on occasions those
between Auckland and Melbourne, being slightly lon-
ger, had three pilots, allowing each pilot a rest period
away from the fl ight deck.
Measures: An input page was programmed into the
aircraft fl ight management system of the aircraft, asking
pilots to input their alertness levels according to the
7-point Samn-Perelli fatigue scale ( 10 ). The input screen
is shown in Fig. 1 . The pilots were prompted, at 20 min
prior to descent, to conduct the procedure. This involved
Fig. 1. Input screen.
the pilots discussing their alertness ratings and one of
them entering the scores on the appropriate screen.
For each pilot a score was entered anonymously into
one of the available boxes. To avoid the effects of sleep
inertia, pilots were asked not to provide a rating within
15 min after waking from bunk rest. Once entered, the
data were transmitted to a ground-based station, then
de-identifi ed by removal of the fi elds containing date
(day of the month) and aircraft registration; the informa-
tion added to the database consisted of the airports of
departure and destination, airline fl ight number, time of
day, and month/year, along with the alertness scores.
Statistical Analysis
The returns from individual fl ights were combined
into groups corresponding to the different routes. Then
an analysis of variance procedure was used to split the
sum of squares due to the means over individual fl ights
(weighted by the number of scores on each fl ight), into
two separate sums of squares, one between routes, the
other within routes. Mean values over routes were com-
pared using an error term obtained from the fl ights-
within-routes sum of squares.
Planned comparisons were made as follows: between
all long-haul and short-haul routes; between mean
scores for the four short-haul routes (eight fl ights, four
out and four back); between the outward long-haul
ight to Tokyo and the outward long-haul fl ight to Hong
Kong, which were of similar duration but at different
times of day; and, fi nally, between the long-haul four-
pilot fl ights to Vancouver and Beijing, which departed
in the evening with a four-pilot crew, and the (slightly
shorter) fl ight to Hong Kong, which departed in the late
evening with a three-pilot crew.
A comparison with existing data was based on the re-
sults from two-pilot operations. In a previous study ( 7 ),
information on fatigue, collected from pilots at top of
descent using standard fatigue questionnaires, was
summarized in the form of a series of trend curves which
related fatigue to the timing and duration of the duty
period. The output from this representation, extrapo-
lated where required for longer fl ights, was compared
with the fatigue ratings for all the outbound fl ights in
this study. Inbound fl ights were excluded due to poten-
tial confounding effects of time-zone change and/or
multiple sectors ( 6 ).
Finally, a comparison was made between the auto-
mated top-of-descent ratings for the 23 routes and the
predictions of the SAFE fatigue model ( 2 ), which has
been validated with respect to air transport operations.
As some of the parameters used by the model varied
within many of the routes (e.g., layover duration, crew
size, relief or main crew), the predictions were based on
average values using estimates for the distribution of
the parameters within the sample.
RESULTS
Over a period of 1 yr, 4629 ratings were obtained;
this represents well over 50% of available fl ights and a
Aviation, Space, and Environmental Medicine x Vol. 82, No. 11 x November 2011 1039
TOP-OF-DESCENT ALERTNESS — POWELL ET AL.
response rate of approximately 38% of the pilots who
could have participated. A summary of the fatigue rat-
ings for the individual routes collected with the auto-
matic process is presented in Table I . The means and
variances for the 23 routes are given in Fig. 2A , where
the routes have been ordered with respect to their mean
rating, from the least fatiguing (Auckland Fiji) to the
most fatiguing (Hong Kong – Auckland).
There were clear differences relating to the type of
ight undertaken. The highest set of scores was obtained
at the end of long-haul nighttime sectors. The lowest
scores were the fi rst (outbound) sector of short-haul
daylight trips, followed by the second (return) sector.
An intermediate rating was obtained from the sole long-
haul daytime sector of Auckland-Tokyo. Overall, fatigue
levels on the long-haul routes were signifi cantly higher
than on the short-haul routes [F(1,1491) 5 1850.7, P ,
0.001].
We analyzed the short-haul fl ights more closely ( Fig.
2B ) to determine whether the onboard fatigue assess-
ment refl ected the expected variations with duty pat-
tern. The scores on return were considerably higher than
at the end of the outward fl ight [F(1,1491) 5 209.8, P ,
0.001]. There were also signifi cant differences between
the four individual routes [F(3,1491) 5 11.9, P , 0.001]
on both the outward and return fl ights: the Brisbane
ights were less fatiguing than those to and from
Melbourne ( P , 0.01), which departed the earliest, and
the Fiji fl ights, which departed latest (at noon) were less
fa tiguing than those involving the other three short-haul
destinations (Brisbane P , 0.05; Cook Islands and
Melbourne P , 0.01). The Cook Islands fl ights and, on
some occasions the Melbourne fl ights, had three pilots,
whereas the other short-haul fl ights had two; however,
since the fl ights were during daylight, the presence of
the third pilot was unlikely to have resulted in bunk
sleep.
To examine whether the automated alertness assess-
ment was sensitive to the effects of time of day, we com-
pared an overnight fl ight with a daytime fl ight of similar
duration. The average fatigue score at the end of the
overnight Auckland to Hong Kong fl ight was signifi -
cantly higher than at the end of the daytime Auckland to
Tokyo fl ight, which was of similar duration [4.68 vs.
3.77; F(1,1491) 5 96.3, P , 0.001].
We also tested whether the automated alertness as-
sessment showed the effect of an additional pilot by
comparing evening fl ights from Auckland-Vancouver
and Auckland-Beijing with four-pilot crews with an
evening three-pilot Auckland-Hong Kong fl ight. Fatigue
using this assessment method was signifi cantly higher
at the end of the fl ight to Hong Kong with a three-pilot
crew [4.21 vs. 4.68; F(1,1491) 5 16.4, P , 0.001] than at
the end of the four-pilot fl ight to Vancouver; however,
this fl ight departed earlier in the evening. The Beijing
ight was a later departure, like Hong Kong and, al-
though it carried an extra pilot, there was no signifi cant
difference between these two fl ights.
We previously examined Samn-Perelli ratings com-
pleted by pilots on paper at the top of descent from re-
gional two-pilot operations. From these results we
derived a set of trend curves based on start time and ap-
proximate duty duration ( 7 ). In this study we compared
those trend curves to the fatigue scores obtained by the
automated collection method. The comparison between
the fatigue scores on the outward fl ights with those from
the predictions derived from previous data is illustrated
in Fig. 3 . The two-crew fl ights were generally in very
close agreement. However, the fatigue scores on the
three and four pilot overnight fl ights, which allow for
TABLE I. RESULTS OF FATIGUE RATINGS FOR INDIVIDUAL ROUTES.
From To
Takeoff
(Approx. Local Time)
Flight
Duration (h) No. of Pilots Type of Flight
Prior Nights
Layover No. of Flights Mean Variance
Auckland Fiji 1200 3.0 2 S/H out 0 36 1.88 0.69
Auckland Brisbane 1000 3.5 2 S/H out 0 69 2.18 0.94
Auckland Cook Islands 1100 3.8 3 S/H out 0 34 2.31 0.82
Auckland Melbourne 0800 3.8 2/3 S/H out 0 75 2.56 1.12
Fiji Auckland 1600 3.0 2 S/H back 0 37 2.89 0.61
Brisbane Auckland 1300 3.0 2 S/H back 0 75 3.20 0.99
Melbourne Auckland 1200 3.4 2/3 S/H back 0 94 3.43 0.69
Cook Islands Auckland 1700 3.5 3 S/H back 0 37 3.45 0.68
Auckland Tokyo 1000 11.2 3 L/H out 0 104 3.77 0.69
Auckland Vancouver 2000 13.2 4 L/H out 0 26 4.21 0.49
Auckland Shanghai 0000 12.5 3/4 L/H out 0 63 4.36 0.61
Auckland San Francisco 2000 12.2 3/4 L/H out 0 92 4.44 0.62
Auckland Beijing 2300 13.5 4 L/H out 0 34 4.55 0.52
Auckland Hong Kong 0000 11.5 3 L/H out 0 129 4.72 0.64
Hong Kong London 0800 13.2 3 L/H out 1,2 1 99 4.40 0.47
Vancouver Auckland 2000 14.0 4 L/H back 2 1 21 4.44 0.74
Tokyo Christchurch 1800 12.0 3 L/H back 1,2 1 33 4.52 0.83
Beijing Auckland 1200 13.2 4 L/H back 2 1 40 4.62 0.71
Tokyo Auckland 1800 11.0 3 L/H back 1,2 1 83 4.62 0.45
Shanghai Auckland 1400 11.5 3/4 L/H back 2 1 57 4.65 0.40
San Francisco Auckland 2000 13.2 3/4 L/H back 2 1 106 4.68 0.50
London Hong Kong 2100 12.2 3 L/H back 2 1 91 4.73 0.60
Hong Kong Auckland 1800 10.8 3 L/H back 1,2 1 148 4.81 0.56
1040 Aviation, Space, and Environmental Medicine x Vol. 82, No. 11 x November 2011
TOP-OF-DESCENT ALERTNESS — POWELL ET AL.
bunk rest, were consistently lower than predictions
which were based on a two-crew operation.
We also compared the average scores for the 23 indi-
vidual routes with the predictions of the SAFE model
for the same routes ( Fig. 4 ). The overall correlation was
strong (r 5 0.88, P , 0.001).
DISCUSSION
The automated method of collecting subjective fa-
tigue ratings was relatively simple to implement and
yielded large quantities of data in a nonintrusive way.
Furthermore, we found that the automated fatigue rat-
ings at the top of descent responded as expected to fac-
tors incorporating crew size, time of day, length of duty,
and circadian changes. Of note was that no average
scores on any route were above 5.0, which is often taken
as a critical value ( 2 ), in keeping with the results of pre-
vious studies on the same routes. Scores on the daylight
“ out-and-back ” ights were also all lower than those on
the long-haul (mostly nighttime) duties. On these out-
and-back duties, the mean scores were all lower prior to
the fi rst approach than prior to the second approach at
the end of duty, so that fatigue was increasing with duty
length. The comparison between the different out-and-
back duties showed the expected differences based on
duty start time, but there was no reduction in fatigue
associated with the presence of a third pilot on these
Fig. 2. Samn-Perelli fatigue scores by sector. A) All duties (mean 6 SD); B) out-and-back daylight duties (mean 6 SE).
Fig. 3. Mean Samn-Perelli scores vs. predictions extrapolated from
previous two-pilot results.
Fig. 4. Mean scores vs. predictions of the bio-mathematical SAFE
model.
duties. This apparent lack of benefi t from the additional
pilot may be explained by the fact that these duties
occurred at times of day when it was unlikely that the
pilots would sleep during their rest breaks. This would
be expected to reduce the benefi t of the extra pilot ( 3 ).
The pattern of results on the long haul duties also
supported the validity of the collection procedure.
Among these duties, the sole daylight sector (Auckland-
Tokyo) scored signifi cantly lower than the other duties,
which were all operated through the hours of darkness,
as would be expected ( 9 ). For example, the Auckland-
Hong Kong sector was of the same duration and crew
complement as Auckland-Tokyo, but departed in the
late evening rather than in the morning; the results from
the top-of-descent automated ratings showed a signifi -
cantly higher mean level of fatigue for the Hong Kong
night sector than Auckland-Tokyo. We also examined
the effect of an additional pilot by comparing the late
evening Auckland-Hong Kong three-pilot sector with
the late evening four-pilot sector Auckland-Beijing
and the early evening four-pilot sector from Auckland-
Vancouver. A fourth pilot is only added to mitigate against
longer sectors by allowing additional in-fl ight rest. It is,
therefore, not unexpected that there was no difference
between the similarly timed three-pilot (Hong Kong)
and four-pilot (Beijing) fl ights; it is likely that the ob-
served difference between the Hong Kong and the
Aviation, Space, and Environmental Medicine x Vol. 82, No. 11 x November 2011 1041
TOP-OF-DESCENT ALERTNESS — POWELL ET AL.
earlier Vancouver fl ights was related to the different de-
parture times.
We evaluated the validity of the automated collection
of pilot fatigue ratings at the top of descent by compar-
ing the results obtained by this method with data from
previous studies and to the results predicted by a vali-
dated fatigue model. We fi rst compared the results from
this study with trend curves derived from a previous
top of descent study using standard questionnaires.
When comparing the data from the current study, it was
seen that the effect of an augmented (three or more pi-
lots) crew was to decrease the fatigue level from that ex-
pected from the previously published trend curves
which were based on two-pilot crews. This is as ex-
pected, since the augmented crew arrangement provides
opportunities for in-fl ight rest which are not possible in
a standard two-pilot crew ( 3 ).
There is increasing use of bio-mathematical models in
predicting fatigue in fl ight operations ( 4 ). Although some
of these models have been well validated in studies of
aircrew, there is a need for continual updating and valida-
tion of the models. Our analysis showed close agreement
between the outputs of one such model, SAFE, and the
top-of-descent data, suggesting that the model and top-
of-descent alertness ratings may complement each other.
A strength of this study is that it addresses a practical
problem faced by commercial airlines: it introduces a
methodology which provides a method of gathering sub-
jective data at a critical phase of fl ight without the need
for specialized testing. This makes it possible to evaluate
fatigue in a large sample of pilots engaged in an actual
airline roster, thereby enabling a more reliable and repre-
sentative measure of day-to-day operations.
There are some possible limitations to this study. We
did not collect information on the work patterns of the
pilots prior to the schedules under study. In addition,
we have not studied their sleep patterns prefl ight or in
ight. These limitations were an inevitable consequence
of the anonymous and brief nature of this method for
collecting data. The choice of the time just prior to com-
mencing descent does have a potential drawback relat-
ing to in-fl ight rest: one (or in a four-pilot crew, possibly
two) of the pilots may have just returned from bunk rest
and it is possible that, despite being asked not to, some
pilots undertook the rating when still suffering from the
effects of sleep inertia. This could lead to an overesti-
mate of the levels of underlying fatigue at the top of de-
scent in some pilots on long-haul fl ights. Finally, there
was no performance testing to accompany the subjec-
tive ratings and the potential, therefore, exists for distor-
tion of the results by some pilots.
However, the large numbers of ratings obtained and
the relatively small variability across the data set tend to
suggest that the potential for distortion by a few indi-
viduals was minimal. A further important benefi t of this
approach is that it encourages a discussion by the crew of
their fatigue and alertness levels just prior to commenc-
ing the approach. This enables them to integrate fatigue
into the threat assessment when briefi ng that approach
and, thus, offers a direct safety benefi t for the operation.
There is potential for further work in this area: in par-
ticular, data could be collected at other phases of fl ight,
such as pre-departure. The analysis would be enhanced
by amending the software to input automatically the
number of pilots present on each fl ight and potentially
by collecting information on the in-fl ight and prefl ight
sleep history of each pilot. We also believe that these
ndings have implications for fl ight-deck design, in
which there is a search for better methods of managing
fatigue and alertness. Many airlines have introduced
ight data analysis programs such as Flight Operations
Quality Assurance, which take information from the air-
craft data frame on a range of fl ight path parameters and
control inputs; these data are de-identifi ed and analyzed
in detail as part of the airline safety management sys-
tems. If in-fl ight alertness data could be integrated into
such programs, signifi cant headway could be made into
determining the safety consequences of different levels
of crew fatigue.
We have demonstrated that onboard ratings at top of
descent are a useful method for identifying problem
ights and for examining trends across the operation.
The data are collected easily, in large numbers, in a non-
intrusive fashion. As predicted, the fatigue scores re-
sponded as expected to duty length, time zone shifts,
and night fl ying, and correlated well both with previous
questionnaire data and with predictions from an aircrew
fatigue model. There is potential for the further devel-
opment and application of this methodology.
ACKNOWLEDGMENTS
Authors and affi liations: David M. C. Powell, M.B.Ch.B., FAFOEM,
Air New Zealand and University of Otago, Auckland, New Zealand;
Mick B. Spencer, B.A., M.Sc., MB Spencer Ltd., Sandhurst, Berks, UK;
and Keith J. Petrie, M.A., Ph.D., University of Auckland, Auckland,
New Zealand.
REFERENCES
1. Belyavin AJ, Spencer MB . Modelling performance and alertness:
the QinetiQ approach . Aviat Space Environ Med 2004 ; 75 ( 3,
Suppl. ) A93 – 103 .
2. Civil Aviation Authority . Aircrew fatigue: a review of research
undertaken on behalf of the UK Civil Aviation Authority .
Norwich, UK : The Stationery Offi ce ; 2005 . CAA Paper
2005/04 .
3. Eriksen CA, Akerstedt T, Nilsson JP . Fatigue in trans-Atlantic
airline operations: diaries and actigraphy for two- vs. three-
pilot crews . Aviat Space Environ Med 2006 ; 77 : 605 – 12 .
4. Gander P, Hartley L, Powell D, Cabon P, Hitchcock E, et al. Fatigue
risk management: organizational factors at the regulatory and
industry/company level . Accid Anal Prev 2011 ; 43 : 573 – 90 .
5. Goode JH . Are pilots at risk of accidents due to fatigue? J Safety
Res 2003 ; 34 : 309 – 13 .
6. Nicholson AN, Pascoe PA, Spencer MB, Stone BM, Roehrs T, Roth
T . Sleep after transmeridian ights . Lancet 1986 ; 328 : 1205 – 8 .
7. Powell D, Spencer MB, Holland D, Petrie KJ . Fatigue in two-pilot
operations: implications for fl ight and duty time limitations .
Aviat Space Environ Med 2008 ; 79 : 1047 – 50 .
8. Powell DM, Spencer MB, Petrie KJ . Fatigue in airline pilots after
an additional day’s layover period . Aviat Space Environ Med
2010 ; 81 : 1013 – 7 .
9. Samel A, Wegmann H-M, Vejvoda M, Drescher J, Gundel A,
et al. Two-crew operations: stress and fatigue during long-haul
night fl ights . Aviat Space Environ Med 1997 ; 68 : 679 – 87 .
10. Samn SW, Perelli LP . Estimating aircrew fatigue: A technique with
implications to airlift operations . Brooks AFB, TX : USAF School
of Aerospace Medicine ; 1982 : Technical Report No. SAM-TR-
82-21
... For example, shift work sleep-wake disorder can cause workers to fall asleep while working, and this is out of their control [13]. Pilots, air-traffic controllers and general maintenance technicians have been frequently the subject of studies of fatigue and workload [14][15][16][17]. This is not the case for de-icing ground crews, despite the fact that this job is critical to ensuring safe takeoff in northern regions. ...
... According to Lerman et al. [11], sleepiness is the propensity to fall asleep, while fatigue is the response of the body to lack of sleep or in some cases to extended physical or mental effort. Due to its non-specific origin and to the lack of consensus on its measurement, there is no short, universally accepted definition of fatigue [12][13][14][15][16][17][18][19]. The International Civil Aviation Organization (ICAO) defines fatigue as "A physiological state of reduced mental or physical performance capability resulting from sleep loss or extended wakefulness, circadian phase, or workload (mental and/or physical activity) that can impair a crew member's alertness and ability to safely operate an aircraft or perform safety related duties." ...
... This scale is one of the most used in the subjective measurement of fatigue [27]. It has been used in the study of railway workers [28] and also for measuring fatigue among [15]. ...
Article
Full-text available
BACKGROUND: Fatigue and workload experienced by aircraft de-icing personnel have been largely neglected in occupational health and safety research. OBJECTIVE: To provide an initial assessment of fatigue and workload among de-icing ground crews. METHODS: Company records were used to reveal possible relationships between different variables (age, seniority, truck type, and work shift). A group of 20 volunteer participants (17 men and 3 women) rated their level of fatigue before and after one shift using the Samn-Perelli fatigue scale. Workload was evaluated using the NASA-TLX method at the end of the shift. RESULTS: The average fatigue experienced by de-icing worker was significantly greater (P = 0.043) for the technicians in open-basket trucks than for the ones in trucks with a cabin (4.43 vs 3.37). Furthermore, there was a significant age difference (P= 0.048) in the perceived level of fatigue (4.1 vs 3.1), with younger workers (< 30 years) reporting a higher level than older workers (≥ 30 years). Overall NASA-TLX score were not significant (P> 0.05) for any of the factors tested: type of truck, shift and age. CONCLUSIONS: The results suggest that particular attention should be paid to young technicians and technicians working in open-basket trucks, since the fatigue levels reported in association with these factors were higher.
... The SP is a 7-point fatigue scale (1 = fully alert, wide awake; 7 = completely exhausted, unable to function effectively) on which participants are asked for the level of fatigue they are currently experiencing. A cut-off of 6 and above has previously been used to indicate critical fatigue (Powell et al., 2011). Finally, two scales were selected to measure workload. ...
... 2019). The Samn-Perelli Fatigue Scale has been incorporated directly into the flight management system of a commercial Air New Zealand 777-200 aircraft for inflight recording of pilot fatigue(Powell DMC, Spencer MB, and Petrie KJ, 2011). ...
... Regulators also usually require the operator to have an approved Fatigue Risk Management System (FRMS) in place before a ULR operation can commence. In New Zealand there is no regulatory option for an operator to obtain an approved FRMS, but Air New Zealand (Air NZ) has been an industry leader in the development of these processes (7) and has undertaken numerous previous data collection studies to inform the management of fatigue among flight crew (8)(9)(10)(11) and cabin crew (not published) within their organization. Because FRMS is a data-driven means of managing fatigue-related safety risk and is "based on scientific principles, knowledge, and operational experience", (12) sharing information on the utilization and effectiveness of fatigue risk mitigations on existing ULR routes will inform the planning of new ULR operations. ...
Article
Full-text available
Introduction Ultra-long range (ULR) flights have the potential to increase fatigue-related risk for cabin crew, if the extended flight times are associated with extended wakefulness, sleep loss and higher levels of crew fatigue. ULR flights may also require longer opportunities for recovery sleep. This study evaluates the utilization of fatigue risk mitigations for cabin crew operating the Auckland – Chicago ULR route with a two-day layover. Methods 65 cabin crew (45 women; aged 20–59 years) wore an actigraph and completed a sleep/duty diary for 3 local nights prior to, throughout, and for 3 local nights after a ULR trip. Crewmembers rated their fatigue (Samn-Perelli Crew Status Check), sleepiness (Karolinska Sleepiness Scale), and workload (OW; NASA-TLX) at key times during each flight. Jet lag was rated each day at home and during layover. Results Fatigue and sleepiness were highest at top-of-descent and after landing and were higher on the inbound flight than on the outbound flight. For every hour of additional sleep in-flight, top-of-descent fatigue ratings decreased by 0.24 points and top-of-descent sleepiness ratings decreased by 0.38, whereas top-of-descent fatigue and sleepiness ratings increased by 0.24 points with every 10-point increase in OW ratings. Crew slept more in the 24-hours prior to the outbound ( M = 8.5 h) and inbound flights ( M = 9.1 h) compared to pre-trip baseline days ( M = 8.2 h). Post-trip, crew slept more during the first day ( M = 9.9 h) compared to baseline, with 95% taking a daytime nap. Jet lag ratings decreased daily on return home but were still higher on the fourth day than on the day of the outbound flight. Discussion Cabin crew prepare for ULR flights by obtaining more sleep prior to departure. However, large individual differences in sleep and declining jet lag ratings across pre-trip days suggest that some crewmembers may still be recovering from a previous trip. Further refinement of in-flight sleep strategies and workload mitigations could be considered for managing fatigue risk at top-of-descent. Findings also highlight the importance of a protected period of post-trip rest to facilitate cabin crews' recovery from the effects of sleep restriction and circadian disruption associated with this ULR trip.
... May and Baldwin (2009) proposed that fatigue can be classified as either task-related or sleep-related. Since fatigue can be viewed as a feeling of 'tiredness' experienced by an individual, subjective methods have been employed in measuring the manifestations of fatigue (Ahsberg et al., 2000;Kar et al., 2010;Powell et al., 2011). Fatigue can also be described by looking at performance decrements during a task, although Phillips (2015) stated concerns about the consistency of such measures across different task contexts. ...
Article
Full-text available
Fatigue resulting from driving has been the subject of interest in many studies, particularly due to its pertinent role in road traffic crashes. Fatigue can be evaluated by certain indicators, such as changes in neural activity. The objective of this study was to characterize fatigue associated with prolonged simulated driving by employing electroencephalography. Fourteen male participants were recruited and asked to drive a simulator for five hours in the morning. All participants had two resting conditions the night prior to the experiment (sufficient sleep or partial sleep deprivation). Subjective responses clearly demonstrated an increase in fatigue as a function of driving duration. Data from brain wave activities, however, did not present clear, consistent changes as fatigue progressed. These findings suggest that theta waves can be used as manifestations of fatigue and temporal waves as the selected cortical area of concern.
... The self-reported data were collected by the 7-point Samn-Perelli Fatigue Index (Samn & Perelli, 1982). This fatigue index is known to have high test-retest reliability (Miller & Narvaez, 1986) and has been extensively used in Aviation contexts (Gander et al., 2013(Gander et al., , 2015Honn et al., 2016;Millar, 2012;Powell et al., 2007Powell et al., , 2011Roach et al., 2012). ...
Article
Full-text available
Pilots in long-duration flight missions in single-seat aircraft may be affected by fatigue. This study determined associations between cognitive performance, emotions and physiological activation and deactivation – measured by heart rate variability (HRV) – in a simulated 11-h flight mission in the 39 Gripen aircraft. Twelve participants volunteered for the study. Perceived fatigue was measured by the Samn-Perelli Fatigue Index (SPFI). Cognitive performance was measured by non-executive and executive tasks. Emotions were assessed by the Circumplex Affect Space instrument. HRV was considered in relation to the cognitive tasks in four time points – Hours 3, 5, 7, 9 – and their associations with emotional ratings. Results indicated a decrease in performance in the non-executive task after approximately 7 h. This result was correlated with self-reported measures of fatigue. HRV, assessed by indices of parasympathetic modulation, remained unchanged for both non-executive and executive tasks over time (p > .05 for all). Significant correlations were observed between emotions and HRV; with increased boredom, increased passiveness, decreased stimulation, and decreased activeness, HRV indicators increased (p < .05). This suggests that a low self-regulatory effort for maintaining performance in these conditions was prevalent and that pilots could adapt to some degree to the demands and fatigue of long-duration missions.
... For example, the aviation industry has trialled the use of the Samn-Perelli scale at key points in the flight, presented and recorded via the flight management system. 43 Similarly, presentation of a PVT on a mobile device can be used at appropriate times while on duty. However, care must be taken that using the techniques in this way does not interfere with the primary task. ...
Article
On a 24/7 railway, managing the risks of operator fatigue is crucial for maintaining the reliability and safety of the system. Whilst there is a substantial body of scientific literature covering the causes and effects of fatigue, a recent accident investigation report exposed gaps in our applied knowledge regarding barriers to reporting fatigue, and the problems of monitoring and mitigating the risk arising from fatigue. This paper responds to that call by reviewing literature and practices regarding fatigue management in rail and other safety-critical industries, before focusing on potential solutions for assessing, monitoring and counteracting fatigue both before and during a shift. The authors conclude by identifying areas for future work in this field.
Article
OCCUPATIONAL APPLICATIONS We conducted a study to evaluate fatigue and workload among workers performing complex assembly tasks. We investigate several predictors of fatigue, including subjective workload estimates, sleep duration, the shift being worked, and production levels. High levels of fatigue were reported in one-third of the shifts evaluated. The main predictors of high fatigue were workload estimates, working evening shifts, and baseline fatigue. Among the six dimensions of workload, only mental demand and frustration were predictors of high fatigue. Mental demand was also rated highest. Participants reported less than seven hours of sleep in 60% of the nights evaluated. These results suggest that managers and supervisors should consider cognitive workload as a key contributing factor to fatigue in complex manual assembly. Similarly, work schedule planning should consider shift duration, start times, and end times, because of the negative influence on fatigue and the potential disruptions on sleep among workers.
Article
Objective: To assess the fatigue risk is an important challenge in improving flight safety in aviation industry. The aim of this study was to develop a comprehensive fatigue risk management indicators system and a fatigue questionnaire for Chinese civil aviation pilots. Methods: Participants included 74 (all males) civil aviation pilots. They finished the questionnaire in 20 minutes before a flight mission. The estimation of internal consistency with Cronbach's α and Student's t test as well as Pearson's correlation analysis were the main statistical methods. Results: The results revealed that the fatigue questionnaire had acceptable internal consistency reliability and construct validity; there were significant differences on fatigue scores between international and domestic flight pilots. And some international flight pilots, who had taken medications as a sleep aid, had worse sleep quality than those had not. The long-endurance flight across time zones caused significant differences in circadian rhythm. Conclusions: The fatigue questionnaire can be used to measure Chinese civil aviation pilots' fatigue, which provided a reference for fatigue risk management system to civil aviation pilots.
Article
Background: This study examined whether subjective measurements of in-flight sleep could be a reliable alternative to actigraphic measurements for monitoring pilot fatigue in a large-scale survey. Methods: Pilots (3-pilot crews) completed a 1-page survey on outbound and inbound long-haul flights crossing 1-7 time zones (N = 586 surveys) between 53 city pairs with 1-d layovers. Across each flight, pilots documented flight start and end times, break times, and in-flight sleep duration and quality if they attempted sleep. They also rated their fatigue (Samn-Perelli Crew Status Check) and sleepiness (Karolinska Sleepiness Scale) at top of descent (TOD). Mixed model ANCOVA was used to identify independent factors associated with sleep duration, quality, and TOD measures. Domicile time was used as a surrogate measure of circadian phase. Results: Sleep duration increased by 10.2 min for every 1-h increase in flight duration. Sleep duration and quality varied by break start time, with significantly more sleep obtained during breaks starting between (domicile) 22:00-01:59 and 02:00-05:59 compared to earlier breaks. Pilots were more fatigued and sleepy at TOD on flights arriving between 02:00-05:59 and 06:00-09:59 domicile time compared to other flights. With every 1-h increase in sleep duration, sleepiness ratings at TOD decreased by 0.6 points and fatigue ratings decreased by 0.4 points. Discussion: The present findings are consistent with previous actigraphic studies, suggesting that self-reported sleep duration is a reliable alternative to actigraphic sleep in this type of study, with use of validated measures, sufficiently large sample sizes, and where fatigue risk is expected to be low. van den Berg MJ, Wu LJ, Gander PH. Subjective measurements of in-flight sleep, circadian variation, and their relationship with fatigue. Aerosp Med Hum Perform. 2016; 87(10):869-875.
Article
Full-text available
This report proposes a method of assessing aircrew fatigue based on work/rest profiles. Possible circadian desynchronization and cumulative fatigue an aircrew may have experienced are considered. The method was used to assess aircrew fatigue during computer-simulated airlift operations. It shows quantitatively how flying-hour limitations can affect average aircrew fatigue and system performance.
Article
Full-text available
We investigated the effect of an additional day's layover on reducing fatigue in two different duties: a two-pilot crew flying between Auckland and Brisbane, and a three-pilot crew flying between Auckland and Los Angeles. Pilots completed a reaction time task, the Samn-Perelli fatigue scale, and the Karolinska Sleepiness Scale on both outward and return flights. The flights were conducted with and without a 1-d layover (Brisbane) and with a 1- or 2-d layover (Los Angeles). On the Brisbane route, the addition of a layover resulted in a significant reduction of fatigue, sleepiness, and reaction time. At top of descent, Samn-Perelli fatigue was reduced from over 5.0 to under 4.5. In contrast, the addition of an extra day layover in Los Angeles had no significant effect on the same measures during the return flight; on both flights Samn-Perelli fatigue was over 5.0 at top of descent. The results suggest that the addition of an extra night's layover has different effects depending on the type of operation. Layover periods need to ensure adequate opportunity to recover from any sleep deficit arising from the outbound journey, but the benefit of increased layover time may be limited if time-zone shifts cause a mismatch between local time and the circadian rhythm of sleep.
Article
Full-text available
Two-pilot operations make up the majority of commercial flights. Fatigue is an important consideration in these operations as there is little opportunity for in-flight rest. We investigated the role of duty length, time of day, and whether one or two sectors were flown on reported fatigue at the top of descent in two-pilot regional operations. Pilots flying two-pilot operations ranging from 3-12 h completed Samn-Perelli fatigue ratings prior to descent at the end of each rostered duty over a 12-wk period. We collected 3023 usable ratings (72% of rostered duties) comprising 26% single and 74% double sector duties. We found that time of day has a marked effect on the pattern of fatigue at the start of the duty and on the rate at which fatigue levels increased, with the highest levels in the window of circadian low (0200-0600). Fatigue also increased with the length of duty and was 0.56 higher at the end of a two-sector compared with a single-sector duty. The results imply authorities should consider increasing existing limits for daytime duties and reducing those for nighttime two-pilot operations.
Conference Paper
The basis of the QinetiQ alertness model used at the heart of the System for Aircrew Fatigue Evaluation (SAFE) free-standing software tool is described. A number of extensions to the basic model that are applicable to the civil aviation environment are outlined, based on the analysis of eight studies involving sleep diaries gathered from pilots undertaking a range of duty schedules. The relationship between subjective alertness and performance of laboratory tasks is described and it is concluded that different tasks are affected differentially by fatigue. In general, there is a larger impact on the incidence of errors than on response time. The implementation of the full model in the integrated Performance Modeling Environment is described and the application of both the alertness and performance models to two scenarios provided by the workshop organizers is outlined. it is concluded that further work is needed in three areas: cumulative fatigue, the impact of sleep architecture, and the prediction of performance for complex tasks in systems.
Article
Nocturnal sleep and daytime sleep latencies, recorded electroencephalographically after westward and eastward flights across the North Atlantic involving time zone shifts of 5 h, were influenced by the time of the flight and by subsequent displacement of the rest period. After the westward flight there was sleep disturbance during the latter part of the first night. However, there was persistent disturbance of sleep after the eastward flight. A rapidly eliminated hypnotic may be useful for the first night or two after a westward flight and for a few nights after an overnight eastward flight.
Article
As part of a research program concerning legal aspects of two-pilot operations on long-haul routes, the purpose of the study was to investigate two-crew extended range operations during a flight roster with two consecutive night flights and a short layover. Present flight time regulations may not be adequate for two-crew minimum operations. The study was conducted in cooperation with a German airline company on the route Frankfurt (FRA)-Mahe (SEZ). There were 11 rotations (22 flights) that were investigated by pre-, in- and post-flight data collection each time from the two pilots. Recordings included sleep, taskload, fatigue and stress by measurement of EEG, ECG, motor activity, and subjective ratings. The average actual flight times were 9:15 h (FRA-SEZ) and 9:53 h (SEZ-FRA). All flights took place at night. The layover duration in Mahe was 13:30 h during day-time. During layover, sleep was shortened by 2 h on average compared with 8-h baseline sleep. The two consecutive night duties resulted in a sleep loss of 9.3 h upon return to home base. Inflight ratings of taskload showed moderate grades, but for fatigue ratings an increasing level was observed. Fatigue was more pronounced during the return flight and several pilots scored their fatigue at a critical level. Motor activity, brainwave activity (occurrences of micro-events) and heart rate indicated drowsiness and a low state of vigilance and alertness during both night flights, but these effects were more pronounced during the second flight. From the findings it is concluded that a duty roster, as conducted in this study, may impose excessive demands on mental and physiological capacity.
Article
There is concern in the aviation community that pilot schedules can lead to fatigue and increased chance of an aviation accident. Yet despite this concern, there is little empirical analysis showing the relationship between pilot schedules and commercial aviation accidents. This study attempts to demonstrate an empirical relationship between pilot schedules and aviation accidents. Data for human factors-related accidents and pilot work patterns were identified. The distribution of pilot work schedule parameters for the accidents was compared to that for all pilots using a chi-square test to determine if the proportions of accidents and length of duty exposure were the same. If the distributions are the same, then one could infer that pilot human factor accidents are not affected by work schedule parameters. The proportion of accidents associated with pilots having longer duty periods is higher than the proportion of longer duty periods for all pilots. There is a discernible pattern of increased probability of an accident as duty time increases for commercial aircraft pilots in the United States. The analysis suggests that establishing limits on duty time for commercial pilots would reduce risk. Such a rule is likely to be expensive and could substantially impact the commercial airlines. In return, there is likely to be a reduction in the risk of commercial aviation accidents due to pilot fatigue.
Article
The aim was to compare intercontinental flights with two-pilot and three-pilot crews with respect to fatigue/sleepiness and sleep, as there is considerable economic pressure on the airlines to use two-pilot crews. Twenty pilots participated. Data were collected before, during, and after outbound and homebound flights using a sleep/wake diary (sleepiness ratings every 2-3 h) and wrist actigraphy. The duration of flights was approximately 8 h, and six time zones were crossed. The same pilots participated in both conditions. Napping during the outbound flight was 26 min for the two-pilot crew, and 48 min for the three-pilot crew. Napping during the homebound flight was 54 min and 1 h 6 min, respectively, and the difference was directly related to the time allotted for sleep. Subjective sleepiness was significantly higher for the two-pilot condition in both directions, peaking a few hours into the flight. Performance at top of descent for the two-pilot condition was rated as lower than the three-pilot condition. In the overall evaluation questionnaire there was a significant negative attitude toward two-crew operations. Sleep, sleepiness, subjective performance, boredom, mood, and layover sleep were assessed as having deteriorated in the two-pilot condition. The homebound flight was associated with considerably higher levels of sleepiness than the outbound flight. The study indicates that the reduction of crew size by one pilot is associated with moderately increased levels of sleepiness. It is also suggested that time allotted to sleep in the two-pilot condition might be somewhat extended to improve alertness.
Article
The basis of the QinetiQ alertness model used at the heart of the System for Aircrew Fatigue Evaluation (SAFE) free-standing software tool is described. A number of extensions to the basic model that are applicable to the civil aviation environment are outlined, based on the analysis of eight studies involving sleep diaries gathered from pilots undertaking a range of duty schedules. The relationship between subjective alertness and performance of laboratory tasks is described and it is concluded that different tasks are affected differentially by fatigue. In general, there is a larger impact on the incidence of errors than on response time. The implementation of the full model in the Integrated Performance Modeling Environment is described and the application of both the alertness and performance models to two scenarios provided by the workshop organizers is outlined. It is concluded that further work is needed in three areas: cumulative fatigue, the impact of sleep architecture, and the prediction of performance for complex tasks in systems.