Article

Reconstructing sibling relationships in wild populations.

Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA.
Bioinformatics (impact factor: 5.47). 08/2007; 23(13):i49-56. DOI:10.1093/bioinformatics/btm219 pp.i49-56
Source: PubMed

ABSTRACT Reconstruction of sibling relationships from genetic data is an important component of many biological applications. In particular, the growing application of molecular markers (microsatellites) to study wild populations of plant and animals has created the need for new computational methods of establishing pedigree relationships, such as sibgroups, among individuals in these populations. Most current methods for sibship reconstruction from microsatellite data use statistical and heuristic techniques that rely on a priori knowledge about various parameter distributions. Moreover, these methods are designed for data with large number of sampled loci and small family groups, both of which typically do not hold for wild populations. We present a deterministic technique that parsimoniously reconstructs sibling groups using only Mendelian laws of inheritance. We validate our approach using both simulated and real biological data and compare it to other methods. Our method is highly accurate on real data and compares favorably with other methods on simulated data with few loci and large family groups. It is the only method that does not rely on a priori knowledge about the population under study. Thus, our method is particularly appropriate for reconstructing sibling groups in wild populations.

0 0
 · 
0 Bookmarks
 · 
46 Views

Keywords

biological applications
 
genetic data
 
growing application
 
heuristic techniques
 
large family groups
 
Mendelian laws
 
microsatellite data use statistical
 
molecular markers
 
new computational methods
 
parsimoniously reconstructs sibling groups
 
real biological data
 
real data
 
reconstructing sibling groups
 
sibgroups
 
sibship reconstruction
 
simulated data
 
small family groups
 
study wild populations
 
various parameter distributions
 
wild populations