Performance of five two-sample location tests for skewed distributions with unequal variances.

Ullevål Department of Research Administration, Oslo University Hospital, N-0407 Oslo, Norway.
Contemporary clinical trials (Impact Factor: 1.99). 08/2009; 30(5):490-6. DOI: 10.1016/j.cct.2009.06.007
Source: PubMed

ABSTRACT Tests for comparing the locations of two independent populations are associated with different null hypotheses, but results are often interpreted as evidence for or against equality of means or medians. We examine the appropriateness of this practice by investigating the performance of five frequently used tests: the two-sample T test, the Welch U test, the Yuen-Welch test, the Wilcoxon-Mann-Whitney test, and the Brunner-Munzel test. Under combined violations of normality and variance homogeneity, the true significance level and power of the tests depend on a complex interplay of several factors. In a wide ranging simulation study, we consider scenarios differing in skewness, skewness heterogeneity, variance heterogeneity, sample size, and sample size ratio. We find that small differences in distribution properties can alter test performance markedly, thus confounding the effort to present simple test recommendations. Instead, we provide detailed recommendations in Appendix A. The Welch U test is recommended most frequently, but cannot be considered an omnibus test for this problem.

Download full-text


Available from: Morten Wang Fagerland, Jul 04, 2015
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: A novel coherence estimation method for small data sets is presented for interferometric synthetic aperture radar (SAR) (InSAR) data processing and geoscience applications. The method selects homogeneous pixels in both the spatial and temporal spaces by means of local and nonlocal adaptive techniques. Reliable coherence estimation is carried out by using such pixels and by correcting the bias in the estimated coherence caused by the non-Gaussianity in high-resolution SAR scenes. As an example, the proposed method together with coherence decomposition is applied to extract the temporal decorrelation component over an area in Macao. The results show that the proposed algorithms work well over various types of land cover. Moreover, the coherence change with time can be more accurately detected compared to other conventional methods.
    IEEE Transactions on Geoscience and Remote Sensing 10/2014; 52(10):6584-6596. DOI:10.1109/TGRS.2014.2298408 · 2.93 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The objective of this study was to evaluate the effect of bilateral versus unilateral cochlear implants and the importance of the inter-implant interval. Seventy-three prelingually deaf children received sequential bilateral cochlear implants. Speech recognition in quiet with the first, second and with both implants simultaneously was evaluated at the time of the second implantation and after 12 and 24 months. Mean bilateral speech recognition 12 and 24 months after the second implantation was significantly higher than that obtained with either the first or the second implant. The addition of a second implant was demonstrated to have a beneficial effect after both 12 and 24 months. Speech recognition with the second implant increased significantly during the first year. A small, non-significant improvement was observed during the second year. The inter-implant interval significantly influenced speech recognition with the second cochlear implant both at 12 and 24 months, and bilateral speech recognition at 12 months, but not at 24 months. A small, but statistically significant improvement in speech recognition was found with bilateral cochlear implants compared with a unilateral implant. A major increase in speech recognition occurred with the second cochlear implant during the first year. A shorter time interval between the two implantations resulted in better speech recognition with the second implant. However, no definitive time-point was found for when the second implant could no longer add a positive effect.
    International journal of pediatric otorhinolaryngology 11/2011; 76(1):95-9. DOI:10.1016/j.ijporl.2011.10.009 · 1.32 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Security inspection and testing require experts in security who think like an attacker. Security experts need to know code locations on which to focus their testing and inspection efforts. Since vulnerabilities are rare occurrences, locating vulnerable code locations can be a challenging task. We investigated whether software metrics obtained from source code and development history are discriminative and predictive of vulnerable code locations. If so, security experts can use this prediction to prioritize security inspection and testing efforts. The metrics we investigated fall into three categories: complexity, code churn, and developer activity metrics. We performed two empirical case studies on large, widely used open-source projects: the Mozilla Firefox web browser and the Red Hat Enterprise Linux kernel. The results indicate that 24 of the 28 metrics collected are discriminative of vulnerabilities for both projects. The models using all three types of metrics together predicted over 80 percent of the known vulnerable files with less than 25 percent false positives for both projects. Compared to a random selection of files for inspection and testing, these models would have reduced the number of files and the number of lines of code to inspect or test by over 71 and 28 percent, respectively, for both projects.
    IEEE Transactions on Software Engineering 11/2011; 37:772-787. DOI:10.1109/TSE.2010.81 · 2.29 Impact Factor