Conference Paper

Protein interaction prediction for mouse PDZ domains using dipeptide composition features

Key Lab. of Syst. Biol., Chinese Acad. of Sci., Shanghai, China
DOI: 10.1109/ISB.2011.6033143 Conference: Systems Biology (ISB), 2011 IEEE International Conference on
Source: IEEE Xplore

ABSTRACT

The PDZ domain is one of the largest families of protein domains that are involved in targeting and routing specific proteins in signaling pathways. PDZ domains mediate protein-protein interactions by binding the C-terminal peptides of their target proteins. Using the dipeptide feature encoding, we develop a PDZ domain interaction predictor using a support vector machine that achieves a high accuracy rate of 82.49%. Since most of the dipeptide compositions are redundant and irrelevant, we propose a new hybrid feature selection technique to select only a subset of these compositions that are useful for interaction prediction. Our experimental results show that only approximately 25% of dipeptide features are needed and that our method increases the accuracy by 3%. The selected dipeptide features are analyzed and shown to have important roles on specificity pattern of PDZ domains.

Download full-text

Full-text

Available from: Songyot Nakariyakul

Similar Publications