Predicting the proportion of essential genes in mouse duplicates based on biased mouse knockout genes.
ABSTRACT In the yeast or nematode, the proportion of essential genes in duplicates is lower than in singletons (single-copy genes), due to the functional redundancy. One may expect that it should be the same in the mouse genome. However, based on the publicly available mouse knockout data, it was observed that the proportion of essential genes in duplicates is similar to that in singletons. The most straightforward interpretation, as claimed in a recent study, is that duplicate genes may have a negligible role in the mouse genetic robustness. Here we show that in the current mouse knockout dataset, recently duplicated genes have been highly underrepresented, leading to an overestimation of the proportion of essential genes in duplicates. After estimating the duplication time of mouse duplication events, we have developed a simple bias-correcting procedure and shown that the bias-corrected proportion of essential genes in mouse duplicates is significantly lower than that in singletons.
- SourceAvailable from: PubMed CentralPLoS Computational Biology 07/2014; 10(7):e1003754. · 4.83 Impact Factor
- PLoS Computational Biology 07/2014; 10(7):e1003758. · 4.83 Impact Factor
- [Show abstract] [Hide abstract]
ABSTRACT: In contrast to S. cerevisiae and C. elegans, analyses based on the current knockout (KO) mouse phenotypes led to the conclusion that duplicate genes had almost no role in mouse genetic robustness. It has been suggested that the bias of mouse KO database toward ancient duplicates may possibly cause this knockout duplicate puzzle, that is, a very similar proportion of essential genes (P E ) between duplicate genes and singletons. In this paper, we conducted an extensive and careful analysis for the mouse KO phenotype data and corroborated a strong effect of duplicate genes on mouse genetics robustness. Moreover, the effect of duplicate genes on mouse genetic robustness is duplication-age dependent, which holds after ruling out the potential confounding effect from coding-sequence conservation, protein-protein connectivity, functional bias, or the bias of duplicates generated by whole genome duplication (WGD). Our findings suggest that two factors, the sampling bias toward ancient duplicates and very ancient duplicates with a proportion of essential genes higher than that of singletons, have caused the mouse knockout duplicate puzzle; meanwhile, the effect of genetic buffering may be correlated with sequence conservation as well as protein-protein interactivity.BioMed Research International 01/2014; 2014:758672. · 2.71 Impact Factor