While it is generally accepted that holistic processing facilitates face recognition, recent studies suggest that poor recognition might also arise from imprecise perception of local features in the face. This study examined the extent to which holistic and featural processing relate to individual differences in face recognition ability (FRA) during face learning (Experiment 1) and face recognition (Experiment 2). Participants performed two tasks: (1) the “Cambridge Face Memory Test-Chinese”, which measured participants’ FRAs, and (2) an “old/new recognition memory test” comprising whole faces (preserving holistic and featural processing) and faces revealed through a dynamic aperture (impairing holistic processing but preserving featural processing). Participants recognised faces more accurately when holistic information was preserved than when it was impaired. Better use of holistic processing during both face learning and face recognition was associated with better FRAs. However, enhanced featural processing during recognition, but not during learning, was related to better FRAs. Together, our findings demonstrate that good face recognition depends on distinct roles played by holistic and featural processing at different stages of face recognition.
Holistic and featural processing’s link to face recognition varies by individual and task
Bryan Qi Zheng Leong, Alejandro J. Estudillo & Ahamed Miah Hussain Ismail
Recognising the identity of an individual by perceiving their face is a fundamental social skill. Most human faces adhere to a standard template and configuration of facial features such as the eyes, nose, and mouth. While the isolated processing of different facial features is known as “featural processing”, the combination of these facial features and their configuration into a whole is referred to as “holistic processing”1. Although both processes are believed to contribute to face recognition, the popular view is that holistic processing is relatively more crucial2–4. However, the contribution of holistic and featural processing to different stages of the face recognition process (i.e., learning vs. recognition) and their relationship with individual differences in face recognition are largely unknown. This study aims to shed light on these questions.
In typical adults, the face inversion, composite face and part-whole tasks are conventionally used to demonstrate the dominance of holistic processing in face recognition5,6. In the inversion effect, recognition is more accurate for upright faces (experimental condition) than for inverted faces (control condition), since inversion impairs holistic processing3,7,8. In the composite effect4,9,10, when the top half of one identity’s face is spatially aligned with the bottom half of another identity (experimental condition), the two halves are fused to create an illusory identity, and this impairs recognising the source identity of each half. However, this impairment disappears when the two halves are misaligned (control condition) and holistic processing is disrupted. In the part-whole effect11–13, recognising an individual part (e.g., a nose) of a previously learnt face is more accurate when it is presented in the context of a whole face (experimental condition) rather than as an isolated part (control condition). Face parts are believed to be encoded by engaging holistic processes that integrate them into a whole, and therefore part recognition is best when the same processes can be engaged during recognition (i.e., whole condition). Interestingly, some studies have reported positive correlations between these indexes and face identification14–16, pointing to holistic processing as the underlying mechanism explaining individual differences in face recognition (but see Konar et al.17; Verhallen et al.18).
However, there is also emerging evidence suggesting that featural processing is important for face identification too. For instance, Cabeza and Kato19 found that participants were equally prone to falsely recognise novel faces (what they called “prototype faces”) that only had either holistic information or featural information preserved from previously learnt faces. This indicates that both holistic and featural information were encoded and stored, and that they may be equally important in face recognition. More recently, DeGutis et al.14 used the part-whole and composite tasks to demonstrate that both holistic and featural processing contribute independently and significantly to face recognition abilities (FRA). First, they obtained an independent measure of recognition based on featural processing by calculating the accuracy for the control conditions (e.g., part condition in the part-whole task) where holistic information is disrupted. Second, they regressed the variance of the control conditions out of the experimental conditions (e.g., whole condition in the part-whole task) to obtain an independent score of holistic processing. They found significant positive correlations between these independent estimates of holistic as well as featural processing and their measures of FRA (scores in the Cambridge Face Memory Test; CFMT20). Furthermore, it has also been suggested that featural processing is more important for the recognition of unfamiliar faces than familiar faces21. For example, Lobmaier and Mast22 found that matching two sequentially presented faces is relatively more impaired when the two faces are blurred (i.e., to disrupt featural processing) than when they are scrambled (i.e., to disrupt holistic processing), but this disadvantage for blurred faces was more pronounced for novel faces than previously learnt faces.
With conventional measures of holistic processing (i.e., composite, part-whole and inversion effects), the assumption is that their experimental manipulations (e.g., misaligning faces in the composite task) disrupt holistic processing. However, these measures are not free of criticism, as secondary factors could drive the same effects too23. For example, in the part-whole task, faces are always encoded as wholes, which suggests that the part-whole effect could be driven by encoding specificity24. Further, the experimental condition generally contains more facial information than the control condition. Here, the so-called holistic advantage measured by the part-whole effect could reflect differences in the amount of featural information contained in the two conditions. Recent studies have also questioned the functional significance of the composite face task6,25. For instance, Fitousi25 showed that aligned composite faces (that are often used to demonstrate interference from holistic processing) were not affected by the Garner interference paradigm. In other words, participants were perfectly capable of selectively attending to target facial features even when other irrelevant features were manipulated, casting doubt on the claim that holistic processing interferes with perception in aligned composites. To control for secondary cognitive factors, studies have often combined these two holistic measures with the inversion effect. Following this argument, the pure contribution of holistic processing would be observed when the part-whole and composite effects are only present with upright faces and disappear for inverted faces23.
With regard to the inversion task, the most common interpretation is that the upright condition facilitates holistic processing3,8. If that is the case, when observers are forced to view both upright and inverted faces in a featural manner, the inversion effect should be reduced, or disappear. Murphy and Cook26 used the fixed-trajectory aperture paradigm (FTAP) to examine this hypothesis. This paradigm has two conditions: (1) the “whole” condition in which the entire face is visible to the observer, and (2) the “aperture” condition in which a transparent, rectangular window smoothly moves from the top of the face to the bottom, revealing parts of the face in a sequential order. Murphy and Cook26 found that faces are recognised better in the whole conditions compared to the aperture conditions (i.e., the “aperture effect”), suggesting that the dynamic aperture successfully disrupts holistic processing. Interestingly, the magnitude of the inversion effect (i.e., the difference between the upright condition and the inverted condition) was comparable in both the whole and aperture conditions (see also Murphy and Cook27). This is in stark contrast with the holistic accounts of the face inversion effect, which predict that an inversion effect should only be observed when the entire face is fully visible.
Therefore, Murphy and Cook’s findings challenge the view that face inversion disrupts only holistic processing, while at the same time providing a paradigm that systematically disrupts or facilitates holistic processing. Interestingly, the FTAP is also a good paradigm to measure individual differences in holistic and featural processing. For example, Tsantani et al.28 showed that Developmental Prosopagnosics (DPs) are less accurate in recognising upright faces in both the whole and the aperture conditions, compared to typical adults without face recognition deficits. However, the magnitude of the holistic advantage (i.e., higher accuracy in the whole compared to the aperture condition) was similar between DPs and typical adults. This shows that DPs are impaired in processing faces featurally but not holistically.
Learning and recognition
Recognising the identity of an unfamiliar face is a product of at least two exposures to the same face. In its simplest order, the first exposure results in the observer learning the identity of the face and during the second exposure, the observer recognises a face they have learnt before. The distinction between these two stages is supported by neuroimaging evidence showing that different brain regions were involved during the learning and recognition of faces29. Interestingly, most studies attempting to examine the contribution of holistic and featural processing to face identification do not specifically address the role of these processes in the learning and recognition of faces.
Some studies have used oculomotor behaviour to index the processes involved during visual sampling30. Measuring fixations, Henderson et al.31 found that face recognition is better if observers were allowed to freely fixate on the face during learning, rather than being forced to learn faces with just a single fixation. Further, eye movement patterns during recognition were comparable between conditions in which participants learnt faces by freely fixating them and by means of a single fixation. These findings suggest that, although recognition ability depends on how observers sampled facial information during learning, the information sampling strategy employed by observers during recognition is independent of how faces were learnt. Henderson et al.31 also reported that when observers freely fixated on faces during learning and recognition, fixations were largely directed at internal facial features. Although these fixations were attributed to processing holistic information, we could also assume that they served a simpler purpose of separately encoding individual features at high resolution, in other words, featural processing32,33. Lastly, Henderson et al. also reported that when observers were allowed to freely explore faces, fixations during recognition were much more restricted than those during learning. This could suggest greater reliance on featural processing during learning and/or greater reliance on holistic processing during recognition. While both interpretations are possible, there is no way to be certain of the purpose of fixations, as they can be used, at best, as indirect measures of these processes33.
A recent study by Dunn et al.34, using a gaze-contingent paradigm, further examined the contributions of both holistic and featural processing in face recognition at the learning and recognition stages. Faces were viewed either in full-view or through circular apertures varying in size. When observers were allowed to sample faces freely during face learning and face recognition, super-recognisers (SRs) had a broader gaze distribution and more exploratory fixations than control participants. Most importantly, SRs were consistently better than control participants regardless of the aperture size. This indicates that the underlying perceptual processes contributing to superior face recognition can be explained by featural processing. Interestingly, these differences were more evident during face learning than during face recognition. In line with Henderson et al.31, these findings suggest that broader exploration of the face during face learning facilitates face recognition and could quantitatively explain individual differences in face recognition.
The present study
To explore the contribution of holistic and featural processing at learning and recognition and their relationship with individual differences in FRA, the present study uses the FTAP in each stage separately. In Experiment 1, to isolate the contribution of holistic and featural processing during learning, faces were learned either through an aperture or in full-view. However, during the recognition stage, all faces were viewed in full-view. In Experiment 2, all faces were viewed in their entirety during learning. However, during the recognition stage, some faces were viewed through an aperture while others were viewed in their entirety. This allowed us to isolate the contribution of holistic and featural processing during the recognition stage to FRA. In addition, to measure individual differences in face recognition, observers performed the Cambridge Face Memory Test-Chinese (CFMT-Chi)35, a highly reliable and valid measure of individual differences in face recognition skills36.
Holistic and featural processing abilities were assessed with the FTAP in an old/new recognition memory task (RMT). The task involved two stages as shown in Fig. 1: a “learning” stage where participants learn a series of faces, and a “recognition” stage, where participants attempt to recognise the learnt faces among a set of faces that contains new faces too. In Experiment 1, faces in the learning stage were presented in their entirety (“whole condition”) or through the fixed-trajectory aperture (“aperture condition”). Faces in the recognition stage were always presented in their entirety for both conditions; thus, scores here were always computed from the recognition of full faces. This manipulation was reversed in Experiment 2. Learning stage faces were always presented in their entirety, whereas recognition stage faces were shown in their entirety or through the aperture. Hence, scores here were computed from the recognition of either full or aperture faces. Briefly, recognition performance in the aperture condition of the RMT informs us how good our participants are at featural processing. The improvement in performance in the whole condition compared to the aperture condition of the RMT is a measure of the magnitude of the holistic advantage experienced by participants, i.e., how good they were at holistic processing. To obtain a standardised measure of FRA, we used the CFMT-Chi35. Correlating the aperture condition’s accuracy and the holistic advantage calculated from the old/new recognition task with the CFMT-Chi scores would tell us to what extent featural and holistic processing relate to FRA, respectively.
Figure 1. Chronological procedure and examples of stimuli in the old/new recognition memory task used in Experiment 1. In the aperture condition (centre right), a dynamic window moves smoothly across the face image from top to bottom (images from left to right).
The maximum achievable score (i.e., the sum of correct responses) for the CFMT-Chi is 72; our current sample had a mean score of 57.98 (SD = 8.93) in Experiment 1 and 58.28 (SD = 8.53) in Experiment 2. As revealed by a two-tailed independent-samples t-test, the mean CFMT-Chi scores for both experiments were not significantly different from each other, t(171) = −0.23, p = 0.820, ηp² = −0.035. This shows that our participants’ FRAs are largely similar between the two experiments, as well as to those of previous studies35–39. Mean accuracy scores of the RMT were calculated separately for each of the two viewing conditions: “whole” and “aperture” (Fig. 2). Two-tailed paired-samples t-tests were conducted to compare accuracy scores between the two conditions of the RMT. In Experiment 1, there was a significant difference in the mean scores between the conditions, t(86) = 5.67, p < 0.001, ηp² = 0.607, in which mean accuracy in the whole condition (M = 0.672, SD = 0.117) was significantly higher than that of the aperture condition (M = 0.590, SD = 0.104). Similarly, in Experiment 2, we found that there was a significant difference in accuracy between the two conditions, t(85) = 11.21, p < 0.001, ηp² = 1.209, in which mean accuracy for the whole condition (M = 0.759, SD = 0.116) was higher than that of the aperture condition (M = 0.586, SD = 0.120). In both experiments, one-sample t-tests revealed that the accuracy in the aperture condition was significantly better than chance (accuracy more than 0.5) at the group level: t(86) = 7.978, p < 0.001 (for Experiment 1) and t(85) = 6.638, p < 0.001 (for Experiment 2). A further independent-samples t-test confirmed that these mean accuracies are comparable between experiments, t(172) = 0.222, p = 0.824.
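The paired-samples and one-sample t statistics described above can be sketched as follows. This is a minimal illustration rather than the authors' analysis script: the function names and the four accuracy values per condition are hypothetical, and only the t statistic and its degrees of freedom are computed (no p value).

```python
from math import sqrt
from statistics import mean, stdev


def one_sample_t(values, mu):
    """Return (t, df) for a one-sample t-test of `values` against `mu`."""
    n = len(values)
    standard_error = stdev(values) / sqrt(n)  # stdev uses the n - 1 denominator
    return (mean(values) - mu) / standard_error, n - 1


def paired_t(condition_a, condition_b):
    """Return (t, df) for a paired-samples t-test (a one-sample test on the differences)."""
    differences = [a - b for a, b in zip(condition_a, condition_b)]
    return one_sample_t(differences, 0.0)


# Hypothetical per-participant accuracies, for illustration only.
whole_accuracy = [0.7, 0.8, 0.6, 0.9]
aperture_accuracy = [0.6, 0.6, 0.5, 0.7]

t_paired, df_paired = paired_t(whole_accuracy, aperture_accuracy)
t_chance, df_chance = one_sample_t(aperture_accuracy, 0.5)  # chance level is 0.5
print(f"whole vs aperture: t({df_paired}) = {t_paired:.3f}")
print(f"aperture vs chance: t({df_chance}) = {t_chance:.3f}")
```

A paired test is used for whole versus aperture because each participant contributes a score to both conditions, whereas the comparison against chance is a one-sample test on the aperture scores alone.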
Traditionally, the holistic advantage has been calculated using subtraction methods6. In the case of the FTAP, this method would involve subtracting the mean accuracy in the aperture condition from the mean accuracy in the whole condition. However, subtraction methods can be difficult to interpret14, as a lower value for the aperture effect can indicate close to ceiling performance in the aperture condition, close to floor performance in the whole condition, or both. Thus, in the present study, we used the “regression” method6,14 to calculate the holistic advantage experienced by participants in the whole condition, after accounting for the variation in performance that the whole condition shares with the aperture condition. Using the equation of the line of best fit of the overall scores, each participant’s expected score on the whole condition was calculated based on their performance in the aperture condition. The residual, i.e., the difference between a participant’s observed and expected whole-condition score, was then taken as their holistic advantage. In other words, accuracy in the aperture condition is regressed out of the whole condition to compute residuals, which we termed “residuals of aperture effect” (RAE). A higher RAE score indicates stronger holistic processing.
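The regression method described above can be illustrated with a short sketch. It assumes an ordinary least-squares line of best fit with whole-condition accuracy as the outcome and aperture-condition accuracy as the predictor; the function names and the example accuracies are hypothetical and are not taken from the study's data.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def residuals_of_aperture_effect(aperture, whole):
    """Observed whole-condition accuracy minus the accuracy expected from aperture accuracy."""
    slope, intercept = fit_line(aperture, whole)
    return [w - (intercept + slope * a) for a, w in zip(aperture, whole)]


# Hypothetical per-participant accuracies, for illustration only.
aperture_accuracy = [0.50, 0.55, 0.60, 0.65, 0.70]
whole_accuracy = [0.60, 0.70, 0.65, 0.80, 0.75]

rae = residuals_of_aperture_effect(aperture_accuracy, whole_accuracy)
for participant, score in enumerate(rae, start=1):
    print(f"participant {participant}: RAE = {score:+.3f}")
```

By construction, these residuals average zero and are uncorrelated with aperture accuracy, which is why they can serve as a holistic measure that is statistically independent of the featural one.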
Next, we ran a number of Pearson’s product-moment correlation tests for data obtained from Experiment 1. First, to explore if both tasks are measuring similar constructs, we correlated the accuracies of the whole condition in the RMT with the CFMT-Chi scores. The test showed a significant positive correlation between the two tasks, r(85) = 0.334, p = 0.002. Second, to explore the relationship between featural processing ability and FRA, we correlated the accuracies of the aperture condition with the CFMT-Chi scores, and the test showed no significant correlation between the two, r(85) = −0.002, p = 0.986 (Fig. 3a). Third, to explore the relationship between holistic processing ability and FRA, we correlated measures of holistic advantage with CFMT-Chi scores. There was a significant positive correlation between the RAE scores and CFMT-Chi scores (Fig. 3b), r(85) = 0.347, p < 0.001. For Experiment 2, we found a positive correlation between the accuracy in the whole condition and the respective scores on the CFMT-Chi, r(84) = 0.489, p < 0.001. There was also a strong positive correlation between the accuracy in the aperture condition and the respective scores on the CFMT-Chi (Fig. 3c), r(84) = 0.570, p < 0.001. Specifically, the higher the participants’ FRA, the more accurate they were in the “aperture” condition. Additionally, there was a significant positive correlation between the RAE and CFMT-Chi scores (Fig. 3d), r(84) = 0.354, p < 0.001. Since some participants were performing at (or close to) chance level in our experimental conditions (especially for the aperture conditions of both experiments), it is possible that floor effects can account for some correlations (or the lack thereof). To address this, we also correlated whole accuracy, aperture accuracy and RAE scores with CFMT-Chi scores after excluding participants who did not score above chance, as identified by binomial probability tests. Importantly, the pattern of results remained the same (see Online Supplementary Material).
Figure 2. Mean accuracies for the whole (blue) and aperture (green) conditions from (a) Experiment 1 and (b) Experiment 2. Black-filled circles represent accuracy scores from individual participants.
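A binomial probability test of this kind can be sketched as below. The sketch assumes a one-sided exact test against a chance level of 0.5; the trial count of 40, the example score and the 0.05 criterion are hypothetical values chosen for illustration, as the number of trials per condition and the criterion used are not given in this section.

```python
from math import comb


def binomial_p_above_chance(n_correct, n_trials, chance=0.5):
    """One-sided exact probability of n_correct or more correct responses under chance."""
    return sum(
        comb(n_trials, k) * chance**k * (1 - chance) ** (n_trials - k)
        for k in range(n_correct, n_trials + 1)
    )


# Hypothetical participant: 27 correct responses out of 40 trials.
p_value = binomial_p_above_chance(27, 40)
scored_above_chance = p_value < 0.05  # hypothetical inclusion criterion
print(f"p = {p_value:.4f}, above chance: {scored_above_chance}")
```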
To compare the strengths of correlations between the two experiments and between the whole and aperture conditions within each experiment, we transformed the Pearson’s correlation coefficient values into z scores (i.e., Fisher’s r to z transformation)40. We found a significant difference in coefficients between Experiments 1 and 2 for the correlations between aperture accuracy and CFMT-Chi (z = −4.197, p < 0.001), but not for the correlations between RAE and CFMT-Chi (z = −0.052, p = 0.479). Specifically, the correlation coefficient between aperture accuracy and CFMT-Chi was larger in Experiment 2 than in Experiment 1. Additionally, the correlation coefficients between aperture accuracy and CFMT-Chi, and between RAE and CFMT-Chi, were significantly different in Experiment 1 (z = −2.359, p = 0.009) and Experiment 2 (z = 1.788, p = 0.037). In particular, the correlation with CFMT-Chi was stronger for RAE (i.e., holistic processing) in Experiment 1, but the correlation with CFMT-Chi was stronger for aperture accuracy (i.e., featural processing) in Experiment 2. Lastly, for the correlations between whole-condition accuracy and CFMT-Chi scores, the coefficients were comparable between Experiments 1 and 2 (z = −1.211, p = 0.113).
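The between-experiment comparison can be sketched with the standard formula for two independent correlations, z = (atanh(r1) − atanh(r2)) / sqrt(1/(n1 − 3) + 1/(n2 − 3)). The sample sizes of 87 and 86 are inferred here from the reported degrees of freedom (n = df + 2), and the one-tailed p value is an assumption that appears consistent with the reported p = 0.479 for z = −0.052; neither is stated explicitly in the text. The function name is hypothetical.

```python
from math import atanh, erfc, sqrt


def compare_independent_correlations(r1, n1, r2, n2):
    """Fisher r-to-z test for the difference between two independent correlations.

    Returns the z statistic and a one-tailed p value from the standard normal.
    """
    standard_error = sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (atanh(r1) - atanh(r2)) / standard_error
    p_one_tailed = 0.5 * erfc(abs(z) / sqrt(2))
    return z, p_one_tailed


# Correlations with CFMT-Chi as reported above; n inferred as df + 2.
z_aperture, p_aperture = compare_independent_correlations(-0.002, 87, 0.570, 86)
z_rae, p_rae = compare_independent_correlations(0.347, 87, 0.354, 86)
print(f"aperture accuracy: z = {z_aperture:.3f}, p = {p_aperture:.3g}")
print(f"RAE: z = {z_rae:.3f}, p = {p_rae:.3f}")
```

Under these assumptions the sketch gives values close to the reported z = −4.197 for aperture accuracy and z = −0.052 for RAE.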
The purpose of the present study was to examine the role of holistic and featural processing in face recognition ability (FRA). Both experiments showed that forcing observers to rely on featural processing with a small aperture reduced recognition accuracy significantly. This impairment was observed irrespective of whether the aperture was applied during face learning or recognition. One unique characteristic of our study is that we measured to what extent featural and holistic processing can explain FRA at different stages of face recognition, separately. In Experiment 1, we found that accuracy for recognising faces learnt through featural processing was uniform, albeit poor, across the whole spectrum of FRA. To our knowledge, no past study had systematically restricted participants along the FRA spectrum to featural processing during face learning. Accordingly, our findings are novel in isolating the contribution of featural processing during face learning to face recognition ability. Based on our findings, featural processing during face learning does not account for individual differences in face recognition abilities.
Figure 3. Correlation analyses from Experiments 1 (black) and 2 (grey). Black circles and grey annuli represent scores from individual participants in Experiment 1 and Experiment 2, respectively. Black solid lines and grey dashed lines are least-squares regression fits to individual data from Experiments 1 and 2, respectively.
In Experiment 2, we found that individuals with better FRA were also better at using featural processing during recognition than individuals with poor FRA. This suggests that featural processing during face recognition contributes to identifying learnt faces, and it is in support of past findings showing that good recognisers make good use of featural processing when attempting to recognise a learnt face. These past studies have used various tasks to assess the contribution of featural processing (e.g., part-whole task, familiar face recognition test) in recognising famous faces as well as recently learnt unfamiliar faces28,41,42. Nonetheless, there are also some exceptions34,43. One could argue that the lack of correlation between featural processing ability and FRA in Experiment 1 is a result of floor effects. However, accuracies were comparable across both experiments and above chance. In addition, individual differences in the aperture condition were related to FRA in Experiment 2, but not in Experiment 1. Therefore, floor effects are unlikely to explain a lack of correlation in Experiment 1.
In line with Dunn et al.34, we found that featural processing is positively associated with FRA. However, we only found this correlation during face recognition (i.e., Experiment 2) and not during face learning (i.e., Experiment 1). These disparities could be the result of our viewing manipulations. Dunn et al. allowed observers to actively explore the faces, whereas the FTAP constrains all observers to learn faces in a similar fashion, which could interfere with unique perceptual encoding strategies used by good recognisers. For instance, Dunn et al. found that super-recognisers (SRs) had broader gaze distributions and more fixations than typical observers, but these differences were more apparent during face learning. In contrast, Abudarham et al.43 showed that Developmental Prosopagnosics (DPs) and SRs are similarly good at featural processing. However, DPs tend to be heterogeneous in their deficits, with some cases having featural processing deficits and some not44,45, and these deficits can be qualitatively different from those of neurotypicals with poor FRA (e.g., atypical sampling of faces)46–48.
Additionally, our RAE scores showed that people’s ability to process faces holistically (but not featurally) during face learning could be a strong determinant of their FRA. The relationship found in Experiment 2 further supports previous findings showing that higher face recognition abilities are associated with stronger holistic processing14–16. Nonetheless, why would good recognisers, as we found, rely more on processing holistic but not featural representations of a face during face learning? We encounter a large number of faces in everyday life. Obviously, the more faces we can store in our memory, the better our social interactions would be. However, storing individual features of every single face we encounter would be very taxing for human memory. Holistic representations provide a way of reducing this memory load, by allowing us to store more identities in the form of a simplified gist (see Curby and Gauthier49; Pertzov et al.50). Moreover, holistic information of faces is more stable in memory than featural information4,7,51. For example, Peters and Kemner52 showed that long-term memory for faces is better when face identities were learnt from their low spatial frequencies conveying holistic information than from their high spatial frequencies conveying fine details of features. Given that holistic representations allow us to efficiently utilise memory and form stable traces over time, it would be expected that good recognisers make better use of holistic processing than featural processing during face learning.
Why would good recognisers rely on both holistic and featural processing during recognition, but not during face learning? Some studies have demonstrated that when we attempt to recognise a face, we follow a coarse-to-fine strategy53–55. Here, a holistic representation of the to-be-recognised face is initially matched to face representations in our memory to narrow down the most likely candidate representations53. Next, in an empirical sense, features of the to-be-recognised face are compared with those selected representations in memory, whereby identity-specific, distinct features could help to distinguish a learnt identity from other similar-looking faces. Extending this explanation to our case, it appears important that we compare a to-be-recognised face to memory representations both at the holistic and featural levels, and good recognisers might be adept at doing both.
We would also like to emphasise an interesting finding of our study. In the aperture condition of Experiment 1, when participants’ face learning was restricted to featural processing, even good recognisers failed to use this information. However, in Experiment 2, when we allowed participants to learn faces freely (i.e., not restricting the processing), good recognisers were able to recognise these faces better even when holistic processing was largely interrupted during recognition due to the aperture. As the FRA of participants decreased, this advantage with featural information diminished. Based on this, we can claim that forming a holistic representation when learning a face is also important for good recognisers to effectively use featural information during recognition. If that is the case, a weak holistic representation formed by poor recognisers during learning may have led to poor use of both holistic and featural information during recognition (as shown in Experiment 2; Fig. 3).
However, our study is not without limitations. First, we did not account for congruency effects between face learning and face recognition. Previous research has shown the importance of congruency in face identification56–58. For example, faces learned with a ski mask are better recognised when they are also presented with a ski mask compared to full-view faces57. In our study, there is an incongruence between learning and recognition, as the aperture was only applied during the learning (Experiment 1) or recognition (Experiment 2) stage. However, as all our participants were given the same tasks, it is unlikely that incongruence between learning and recognition explains any observed relationships between face recognition skills and the different conditions of the FTAP.
Second, it could be argued that the FTAP also disrupts featural processing. For example, the FTAP might impair the encoding of featural information at the learning stage, which would explain why aperture accuracy was not associated with FRA in Experiment 1. However, this also seems unlikely. Research has shown that holistic processing is mostly engaged by the presence of a whole and intact face12,59,60. Importantly, this whole and intact face processing is indeed prevented by the aperture. To ensure the serial processing of each facial feature, the aperture used in this study was created to be large enough to reveal the entire eye and mouth regions, and approximately 75% of the nose. Therefore, although the serial presentation of the features through the aperture might also impair some featural processing, it seems implausible that this disruption is comparable to that of holistic processing. In fact, if such disruptions were comparable, observers would not be able to use either featural or holistic processing, and performance in the aperture conditions should be at chance levels61. However, as our results showed, participants’ performance in the aperture condition was above chance levels in both experiments.
Third, we applied the regression method to compute the holistic advantage of participants. While this
approach does control for variance in the aperture condition14, one important limitation of the regression method
is its assumption of a linear relationship between the whole and aperture conditions. In fact, as shown by the
weak correlations, it is possible that a non-linear model could better explain the relationship between the whole
and aperture conditions.
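As a rough illustration of the regression method discussed above, the sketch below regresses whole-condition accuracy on aperture-condition accuracy by ordinary least squares and treats each participant's residual as a holistic advantage score. The function name and the accuracy values are hypothetical, and the exact computation used in the study may differ.

```python
from statistics import mean

def holistic_advantage(whole, aperture):
    """Residuals of whole-condition scores after a linear (OLS) fit on aperture scores."""
    mx, my = mean(aperture), mean(whole)
    sxx = sum((x - mx) ** 2 for x in aperture)
    sxy = sum((x - mx) * (y - my) for x, y in zip(aperture, whole))
    slope = sxy / sxx
    intercept = my - slope * mx
    # A positive residual means better whole-face performance than the
    # aperture score alone would predict under the linear model.
    return [y - (intercept + slope * x) for x, y in zip(aperture, whole)]

# Hypothetical proportion-correct scores for five participants
whole = [0.90, 0.75, 0.80, 0.60, 0.85]
aperture = [0.70, 0.65, 0.60, 0.55, 0.75]
residuals = holistic_advantage(whole, aperture)
```

By construction the residuals are uncorrelated with the aperture scores, which is the sense in which the method controls for aperture variance. The linearity assumption enters through the straight-line fit.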
In conclusion, we show that poor FRA arises from the poor encoding of holistic and featural information
during face recognition. We also show that enhanced holistic (but not featural) processing during face learning
contributes to better FRA. In addition, our findings raise the intriguing possibility that good recognisers’ ability
to effectively utilise featural information during recognition may depend on the extent to which faces are
processed holistically during learning. We demonstrate this using the FTAP, which addresses several limitations of
other paradigms (i.e., inversion, composite and part-whole tasks). Moreover, the FRA of our sample is broad, to
the extent of capturing individuals with FRAs (according to CFMT scores) similar to DPs and SRs identified in
past studies, as well as those in between. Therefore, we provide reliable insight into the contribution of holistic
and featural processing during face learning and face recognition.
An a-priori power analysis using G*Power62 estimated that a sample size of 82 is required to detect a moderate
effect size of 0.3 with a statistical power of 80% (α = 0.05), for a Pearson’s test of correlation between FRA and
the conditions of the RMT. In Experiment 1, we recruited 87 Malaysian Chinese participants (44 females) with
no known clinical diagnosis of a mental health disorder, with ages ranging from 18 to 54 years (M = 25.00 years,
SD = 5.29). For Experiment 2, we recruited 86 healthy Malaysian Chinese participants (70 females), with
ages ranging from 18 to 47 years (M = 22.34 years, SD = 5.10). Participants were paid 5 Malaysian Ringgits as
compensation for their time. All participants reported normal or corrected-to-normal vision. Digital informed
consent was obtained prior to participation. All experimental procedures were approved by the Science and
Engineering Research Ethics Committee of the University of Nottingham Malaysia (approval code: BLQZ210421).
We confirm that all experiments were performed in accordance with relevant guidelines and regulations.
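For readers who want to sanity-check the a-priori power analysis reported above, the following sketch uses the standard Fisher z approximation for the sample size of a correlation test. It assumes a two-tailed test, which the text does not specify, and the function name is hypothetical. G*Power relies on its own routines, so the approximation should land close to, but not necessarily exactly on, the reported 82.

```python
from math import atanh, ceil
from statistics import NormalDist

def n_for_correlation(r, alpha=0.05, power=0.80):
    """Approximate N needed to detect correlation r (Fisher z, two-tailed assumed)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)  # critical value for a two-tailed test
    z_beta = z.inv_cdf(power)           # quantile matching the desired power
    return ceil(((z_alpha + z_beta) / atanh(r)) ** 2 + 3)

n = n_for_correlation(0.3)
```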
This study was conducted using the online experimental platform Testable (www.testable.org)63. The study
comprised two tasks: the CFMT-Chi35 and an old/new recognition memory task (RMT) with two viewing conditions
(whole or aperture viewing). Participants used their own computers (laptops or desktops) to complete the two
tasks online in a web browser. To minimise differences in the visible size of stimuli across different computer
screens, participants were required to adjust the length of a horizontal yellow line that appeared on the screen
to match the size of a debit/credit card they possessed. Based on this, the testing platform calculates how many
pixels correspond to one centimetre, and all stimuli within the study were rescaled using this mapping to the
required dimensions in centimetres. All face stimuli were edited and cropped using Adobe Photoshop CS6, while
the dynamic aperture was created in MATLAB R2019b (MathWorks).
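The card-matching calibration described above amounts to a simple unit conversion. The sketch below is an illustration only: it assumes the line is matched to the long edge of a standard ID-1 card (85.60 mm), and the function names and the 428 px example are hypothetical rather than taken from the testing platform.

```python
CARD_WIDTH_CM = 8.56  # long edge of an ISO/IEC 7810 ID-1 card (assumed reference)

def pixels_per_cm(matched_line_px, card_width_cm=CARD_WIDTH_CM):
    """Pixels per centimetre implied by the line length the participant settled on."""
    return matched_line_px / card_width_cm

def cm_to_px(size_cm, px_per_cm):
    """Convert a required physical size to the nearest whole pixel count."""
    return round(size_cm * px_per_cm)

# Hypothetical participant whose card matched a 428 px line
ppc = pixels_per_cm(428)
width_px = cm_to_px(4, ppc)   # 4 cm wide stimulus background
height_px = cm_to_px(5, ppc)  # 5 cm tall stimulus background
```

On such a screen the 4 × 5 cm RMT background maps onto its nominal 200 × 250 px; on denser or coarser displays the pixel dimensions change while the physical size stays fixed.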
Stimuli and procedure
Cambridge face memory test-Chinese (CFMT-Chi)
We used the validated Chinese version of the Cambridge Face Memory Test (i.e., CFMT-Chi), and all faces and
procedures were the same as those used in the original paper by McKone et al.35 Face images were those of men in
their 20s and early 30s in neutral expressions, and each individual was photographed in the same range of poses
and lighting conditions. For this task, six unique target identities and 46 unique distractor identities were used.
For each identity, three face images from three different viewpoints (one left 1/3 profile, one full-frontal and one
right 1/3 profile) were used. Similar to the original version, only male faces were used because sex differences in
observers have been reported for recognising female but not male faces64. These faces did not contain external
features, such as hair, and no facial blemishes were visible. They were greyscale faces (approximately 160 pixels
(px) in width and 195 px in height; assuming participants had a seating distance of 57 cm, the faces subtended
approximately 3.2° and 4° in width and height, respectively) embedded in the centre of a uniformly grey
background that is 200 px wide and 240 px tall (4 × 4.8 cm; see McKone et al.35 for further details).
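The visual angles quoted above follow from the stimulus size and the assumed 57 cm viewing distance. A minimal sketch of that arithmetic is given below; the function name is hypothetical, and the 50 px/cm scale is inferred from the stated 200 px = 4 cm mapping.

```python
from math import atan, degrees

def visual_angle_deg(size_cm, distance_cm=57.0):
    """Visual angle subtended by an object of size_cm viewed from distance_cm."""
    return degrees(2 * atan(size_cm / (2 * distance_cm)))

PX_PER_CM = 50  # inferred from the 200 px = 4 cm background
face_w_deg = visual_angle_deg(160 / PX_PER_CM)  # face width, 3.2 cm
face_h_deg = visual_angle_deg(195 / PX_PER_CM)  # face height, 3.9 cm
```

At 57 cm, one centimetre on the screen subtends roughly one degree, which is why the angular sizes track the sizes in centimetres so closely.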
The CFMT-Chi was presented using the standard procedure which consists of a total of 72 trials presented
over three different stages (18 in the Learning, 30 in the Novel and 24 in the Noise stages). In all trials that test
face memory, there were three simultaneously presented faces (one learned target and two distractors) and
participants were required to select which of them was the learnt face, by pressing the keys “1” for the left, “2”
for the middle, “3” for the right image.
Old/new recognition memory task (RMT)
Face images were those of Malaysian Chinese males in their early or mid-20s in neutral expressions. All
individuals were photographed in the same range of poses and lighting conditions in the Face Laboratory at the
University of Nottingham Malaysia, where informed consent to publish identifying information/images
was obtained. For each identity, only frontal view face images were used. All external features in the faces were
removed. The faces were then resized to approximately 160 px in width and exactly 195 px in height (subtending
approximately 3.2° × 4° at a viewing distance of 57 cm), converted into greyscale and embedded in the centre of
a uniformly black background of 200 × 250 px (4 × 5 cm).
In Experiment 1, the RMT consisted of four blocks (two whole and two aperture conditions). The order of the four
blocks was randomized across participants. Each block started with an initial “learning” stage, followed by a filler
task and finally a “recognition” stage. In any given block, the learning stage showed the faces of six unique identities
to participants. The recognition stage sequentially presented the same six identities (“old”) randomly intermixed
with six new and unique identities that the participants had not seen before (“new”), leading to a total of 12 test
faces. This led to a total of 48 unique faces (i.e., 24 old and 24 new unique identities) that were used throughout
the entire experiment. In the learning stage of the “whole” condition, each trial started with a white central
fixation cross (22 × 22 px; 0.4 × 0.4 cm) shown for 500 ms, followed by a fully visible unique face stimulus presented
in the centre of the screen for 1000 ms (Fig. 1). Old faces presented in the “whole” condition during the
recognition stage were exactly the same as those in the learning stage. In contrast, in the learning stage of the
“aperture” condition, the face image was shown through a dynamic window that moved smoothly from the top of the face
to the bottom, revealing features of the face in a sequential order (Fig. 1). The dynamic window started and ended
with a fully black display. The height of the aperture that moved from top to bottom was 12% (i.e., 30 px) of the
overall height of the face and took approximately 6200 ms to move across the entire face (i.e., black-to-black
display). The sequential display and frame rate generated a smooth aperture motion (~ 11 frames per second).
All sequences were constructed from a series of bitmap images and saved as .GIF files. For both conditions, six
of such trials were presented in the learning stage, and participants were asked to learn and memorize all six
faces for a subsequent recognition stage.
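To make the timing of the dynamic aperture concrete, the sketch below derives one possible frame schedule from the figures given above (a 30 px window, about 6200 ms from black to black, roughly 11 frames per second) for a 250 px tall image. It is not the original MATLAB routine: the linear sweep, the rounding, and the function names are assumptions made for illustration.

```python
def aperture_schedule(img_h=250, ap_h=30, duration_s=6.2, fps=11):
    """Visible row ranges (first, last-exclusive) for each frame of the sweep."""
    n_frames = round(duration_s * fps)
    start, end = -ap_h, img_h  # window begins fully above and ends fully below the image
    step = (end - start) / (n_frames - 1)
    windows = []
    for i in range(n_frames):
        top = round(start + i * step)
        # Clip the window to the image, so the first and last frames are empty (black)
        windows.append((max(top, 0), min(top + ap_h, img_h)))
    return windows

def apply_aperture(image, window, fill=0):
    """Black out every row of a 2D pixel list that lies outside the window."""
    first, last = window
    return [row if first <= y < last else [fill] * len(row)
            for y, row in enumerate(image)]

windows = aperture_schedule()
```

Each masked frame could then be written out as a bitmap and assembled into an animated GIF, in line with the description above.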
Following the learning stage, in both conditions, participants were given a short filler task that involved
mathematical calculations (e.g., “5 6/2 + 10 = ?”), which took less than a minute to complete. This was followed
by the recognition stage. During this stage, the 12 test faces were sequentially presented over 12 trials. Each trial
began with a 500 ms presentation of a white central fixation cross. This was followed by the presentation of a
fully visible face that remained on the centre of the screen until a response was recorded. The participants were
required to indicate whether they had previously seen this face in the learning stage, by pressing the key “Q” on
the keyboard if they had seen it and the key “P” if they had not seen it before. Participants were instructed to
respond as quickly and accurately as possible. In both stages, the presentation timing was adopted from previous
studies using the FTAP26,27. In the whole condition, old faces presented during recognition and learning were
both fully visible. However, in the aperture condition, old faces shown during learning were viewed through an
aperture and when the same identities were shown during recognition, they were fully visible.
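The old/new responses described above can be scored per block along the following lines. This is a sketch under assumptions: it takes “Q” as an “old” response and “P” as a “new” response, as stated in the text, while the data layout, the function name, and the example responses are hypothetical.

```python
def score_block(trials):
    """Score one block; trials is a list of (is_old, key) pairs, key being 'Q' or 'P'."""
    hits = sum(1 for is_old, key in trials if is_old and key == "Q")
    correct_rejections = sum(1 for is_old, key in trials if not is_old and key == "P")
    n_old = sum(1 for is_old, _ in trials if is_old)
    n_new = len(trials) - n_old
    return {
        "accuracy": (hits + correct_rejections) / len(trials),
        "hit_rate": hits / n_old,
        "false_alarm_rate": (n_new - correct_rejections) / n_new,
    }

# Hypothetical block: six old and six new faces, five hits and four correct rejections
trials = ([(True, "Q")] * 5 + [(True, "P")]
          + [(False, "P")] * 4 + [(False, "Q")] * 2)
scores = score_block(trials)
```

With equal numbers of old and new faces, a participant responding at random would be expected to reach an accuracy of 0.5, which is the chance level against which aperture performance is judged.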
In Experiment 2, experimental procedures and stimuli used were similar to Experiment 1, except for the
following changes in the old/new RMT. Irrespective of the experimental condition (whole or aperture),
participants were always shown a white central fixation cross, followed by fully visible faces for 1000 ms in the
learning stage. A total of six unique faces (i.e., old faces) were shown in each block. During the “recognition stage”,
they were shown the 12 test faces that were either in full-view (for the “whole” condition) or viewed through an
aperture (for the “aperture” condition). Faces to be recognised stayed on screen for the same duration of 6200 ms
in both conditions, and this was followed by a black screen that remained until a response was recorded.
Responses could also be provided while the faces were shown or after the faces were removed from the screen,
either of which terminated the trial. Similar to Experiment 1, participants pressed the key “Q” or “P” to indicate
whether they had seen each test face in the learning stage or not, respectively.
