Figure - available from: PLOS One
This content is subject to copyright.
T test and Wilcoxon signed-rank tests examine different null hypotheses about the data distribution
The graph depicts four different sets of simulated EDIs (A to D) sampled from Gaussian distributions with different means and variances. Gray points indicate the sample data. The superimposed black point depicts the mean of each sample. The sample variance around its mean is depicted by black lines emanating from the mean (i.e., the black dot). Each set consists of 12 values, representing 12 subjects, which are submitted to both a t test and a Wilcoxon signed-ranked test. Both tests are right-tailed (meaning the tail extending into the positive direction) and test the null hypothesis that the data come from a distribution with zero mean (meaning no exemplar information across participants). p-values were pronounced significant (Y) if they passed the conventional threshold of p<0.05. Otherwise, they were unsignificant (N). (A) The sample mean is positive and the sample variance is small, hence both tests pass the significance threshold. (B) The sample mean is positive. However, an extreme value in the sample inflates the variance. Therefore the t test does not return a significant p-value anymore. (C) The mean is positive, but the sample contains a considerable number of negative EDIs. Accounting for the weight of the ranks of the data points, the Wilcoxon-sigend rank test penalizes this presence of negative data points, therefore its p-value does not pass the significance threshold. (D) The sample mean is close to zero, hence neither test indicates significant exemplar information.

T test and Wilcoxon signed-rank tests examine different null hypotheses about the data distribution The graph depicts four different sets of simulated EDIs (A to D) sampled from Gaussian distributions with different means and variances. Gray points indicate the sample data. The superimposed black point depicts the mean of each sample. The sample variance around its mean is depicted by black lines emanating from the mean (i.e., the black dot). Each set consists of 12 values, representing 12 subjects, which are submitted to both a t test and a Wilcoxon signed-ranked test. Both tests are right-tailed (meaning the tail extending into the positive direction) and test the null hypothesis that the data come from a distribution with zero mean (meaning no exemplar information across participants). p-values were pronounced significant (Y) if they passed the conventional threshold of p<0.05. Otherwise, they were unsignificant (N). (A) The sample mean is positive and the sample variance is small, hence both tests pass the significance threshold. (B) The sample mean is positive. However, an extreme value in the sample inflates the variance. Therefore the t test does not return a significant p-value anymore. (C) The mean is positive, but the sample contains a considerable number of negative EDIs. Accounting for the weight of the ranks of the data points, the Wilcoxon-sigend rank test penalizes this presence of negative data points, therefore its p-value does not pass the significance threshold. (D) The sample mean is close to zero, hence neither test indicates significant exemplar information.

Source publication

Similar publications

Article
Full-text available
Vehicle re-identification (re-id) technology refers to a vehicle matching under a non-overlapping domain, that is, to confirm whether the vehicle target taken by cameras in different positions at different times is the same vehicle. Different identities of the same type of vehicles are one of the most challenging factors in the field of vehicle re-...
Preprint
Full-text available
Passage retrieval aims to retrieve relevant passages from large collections of the open-domain corpus. Contextual Masked Auto-Encoding has been proven effective in representation bottleneck pre-training of a monolithic dual-encoder for passage retrieval. Siamese or fully separated dual-encoders are often adopted as basic retrieval architecture in t...
Article
Full-text available
La recherche empirique sur la problématique de la discrimination génétique (DG) au Québec est généralement limitée. À ce titre, cette étude vise, au moyen d’une méthodologie qualitative, à explorer et à recueillir les opinions, les expériences et les connaissances des Québécois à l’égard de la DG. En concordance avec les objectifs exploratoires de...
Article
Full-text available
Benchmark tests are employed when testing for racial discrimination by police. Neil and Winship (2019) emphasize that such tests are threatened by Simpson’s paradox, but avoid analyzing the paradox causally. They consequently cannot elucidate the link between statistical quantities and discrimination hypotheses. Simpson’s paradox reveals that the s...
Article
Full-text available
Specifying surface reflectances in a simple and perceptually informative way would be beneficial for many areas of research and application. We assessed whether a ${3} \times {3}$ matrix may be used to approximate how a surface reflectance modulates the sensory color signal across illuminants. We tested whether observers could discriminate between...