Fig 5 - uploaded by Maciej Henneberg
Content may be subject to copyright.
Recognition responses to face arrays when the assessors ( n = 15) had no knowledge of the facial approximation. Dashed line indicates the recognition rate expected by chance for each of the faces in the face array (10%). ‘‘NI’’ indicates instances of ‘‘no identification’’. The target individual or face number 4, is indicated by the black arrow.
Context in source publication
Context 1
... approximation (the face array may be biased and hence some distractor faces may not be seen to be plausible alternatives to the target individual and/or the facial approximation). To determine if any type I bias was present in the face array, sequential and simultaneous presentation trials were conducted without the facial approximation. Fifteen adult assessors (11 females, 4 males: mean age 23 years, standard deviation 12 years) who did not recognize, either personally or as having been seen in the media, any of the faces in the face array were asked to determine, without the facial approximation, who the murder victim (target individual) was in the face array. Assessors had no information apart from the photographs themselves to base this decision on. Assessors first participated in the sequential face array presentation and then the simultaneous line-up. In the sequential face array assessors where shown one face image at a time in a random order. Assessors were forced to decide for each face ‘‘yes’’ or ‘‘no’’ if the face was that of the murder victim. Assessors were aware that if they made an identification of ‘‘yes’’ the trial was completed (they would not see the other faces in the sequence during this test) and if they answered ‘‘no’’ that they would not be able to change their mind to this face at a later point in the trial. The investigator (CNS) held more cards than those included in the face array test so that assessors could not anticipate the end of the sequence. In the sequential trial cards were held about 1 m in front of the assessor and at arm’s length from the investigator at 90% to his line of sight so investigator cues, if there were any, were not obvious to assessors (who were hopefully concentrating on the cards and hence not attending to any cues, particularly facial ones, if expressed by the investigator). After assessors had made an identification, or if they proceeded through all the faces in the sequence without making an identification, all the faces in the array were presented simultaneously, but in a random order, to the assessor for him/her to change their identification decision from the sequential trial if they so wished. Twenty new assessors (14 females, 6 males: mean age 20 years, standard deviation 4 years) who did not recognize, either personally or as having been seen in the media, any of the faces in the face array, were recruited for the main project: to attempt to correctly identify who the target individual was from the facial approximation. Identical procedures were followed as indicated above for face pool testing without the facial approximation, except of course that this time the assessors had access to a facial approximation. Although assessors had the option of not choosing any face in the simultaneous line-up, a chance rate of 10% was used as it was found that almost all assessors selected a face (see Section 3), and hence appeared biased in this respect (according to chance one would expect a 50% response of ‘‘not there’’, instead it seemed all assessors were choosing a face and hence each face in the array had a 10% chance of being selected). Responses were recorded categorically as correct (target face identification) or incorrect (distractor identification or ‘‘no identification’’ response) for statistical analysis. Observed data were compared to expected frequen- 1 cies by Fisher’s Exact tests in the JMP 4.0 statistical package. As we were only interested in responses larger than chance (and because chance rates were so small that lesser differences would be difficult to detect) confidence intervals for one tailed tests were used. Assessor’s resemblance ratings of both facial approximations were in accordance with previous indications from other individuals, including other forensic facial approximation experts, that resemblance was high. It is worth noting that Betty Pat Gatliff, a high profile forensic artist and facial approximation practitioner, indicated to the first author in 2000 that the face appeared similar enough that she would have expected a positive result had it been advertised. Both facial approximations received high, but similar, resemblance rating results (around 7 out of 10), although the facial approximation without hair tended to be rated higher than the facial approximation with hair (Fig. 4). Data distributions for the no hair facial approximation displayed slightly more left skew than those for the facial approximation with hair (Fig. 4). Hence in recognition tests reported here we used the facial approximation without hair as this is expected to favour positive recognition responses according to traditional facial approximation theory (greater resemblance, greater accuracy and possibly more frequent correct recognitions). Sequential and simultaneous face array tests, done without assessors seeing a facial approximation, showed that the target individual (face number 4) was identified at rates well above other individuals (Fig. 5) and that responses were highly similar between the sequential and simultaneous presentation methods. For sequential line-ups, the identification of the target individual (face number 4) in comparison to other faces was statistically significant ( p < 0.08) in six out of nine cases (i.e., for face numbers 1, 2, 5, 6, 7, and 8), although the other three cases (face numbers 3, 9, and 10) followed similar trends. In no case was a ‘‘not there’’ response made for either the sequential or the simultaneous face array. During post-experiment feed back assessors expressed their thoughts why they thought they could tell who the murder victim was without the facial approximation. Frequently these explanations included: ‘‘the target photograph . . . would not be flattering’’, ‘‘ . . . would not be clear’’, and/or ‘‘ . . . would present the person smiling’’. When the facial approximation without hair was presented to assessors who then attempted to identify the target individual from the sequential face array, identification of the target individual (face number 4) did not increase (Fig. 6). Face number 4 was not identified above chance rates at statistically significant levels. Face number 4 was also identified some- what less, but not at rates statistically different from that observed when the facial approximation was not presented to assessors (in both sequential and simultaneous procedures). During sequential trials the majority of responses were the default response of ‘‘no identification’’, as most individuals (60%) completed the sequential presentation without identifying any face. The increased number of ‘‘not there’’ responses for the sequential face array, facial approximation present scenario, was highly statistically significant ( p < 0.001) in comparison to: (i) the sequential face array, facial approximation not present scenario; (ii) the simultaneous face array, facial approximation not present scenario; and (iii) the simultaneous face array, facial approximation present scenario. Thus, when facial approximations were presented to assessors in the facial approximation present scenario, the number of false identifications decreased in comparison to all other testing conditions. The similarity ratings of the facial approximation to the target individual clearly indicate high resemblance, as the modes observed for the facial approximations were high (7 out of 10 for the facial approximation with hair; and 6/10 and 8/10 for facial approximations without hair; Fig. 4). Such resemblance ratings seem comparable with reports of resemblance ratings for other ‘‘successful’’ facial approximations [14–17]. However, despite attempts to encourage ‘‘favourable’’ recognition results here (by using the facial approximation that achieved the highest resemblance, and not selecting a face pool that included distractors who looked like the target individual) recognition frequencies of the target individual were poor, and in fact tended to be less than those when assessors made an identification without the use of the facial approximation . Furthermore, when assessors used the facial approximation in sequential trials the ‘‘no identification responses’’ increased dramatically in contrast to responses when facial approximations were not used. These results offer strong support for claims that resemblance ratings do not indicate the recognizability, and hence accuracy, of facial approximations [2], and that facial approximations are not recognized frequently or reliably above rates expected by chance [1]. Results were also consistent with eyewitness research that indicates sequential face arrays are preferable to simultaneous presentations [9–13]. Sequential face arrays were found to dramatically reduce the number of false identifications whilst not greatly affecting the rate of correct identifications [9–13]. This suggests that the high numbers of false hits in other studies (see e.g. [1]) using simultaneous testing procedures may be a result of the protocols employed not due to recognitions of the facial approximations alone. Therefore, the primary weakness of the facial approximation method seems to be a lack of frequent correct recognitions, rather than a vast number of false recognitions. These findings suggest that sequential face array procedures should be employed in future facial approximation research. The lack of identification of distractor face numbers 5, 8 and 10 in recognition tests that included the facial approximation indicates that these distractor faces were not functional. That is, these faces were so dissimilar to the facial approximation that they were essentially ignored by assessors. This could be interpreted as meaning that facial approximation methods used here successfully eliminated 30% of the sample, which may perhaps be useful in forensic casework. However, we suspect that such elimination of people to whom the skull does not belong does not require the specific construction of a ...