Article

The InterPro database, an integrated documentation resource for protein families, domains and functional sites

01/2002
Source: CiteSeer

ABSTRACT Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1 000 000 hits from 462 500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to interhelp@ebi.ac.uk.

  • Article: RNA-Seq analysis of splicing in Plasmodium falciparum uncovers new splice junctions, alternative splicing and splicing of antisense transcripts.
    ABSTRACT: Over 50% of genes in Plasmodium falciparum, the deadliest human malaria parasite, contain predicted introns, yet experimental characterization of splicing in this organism remains incomplete. We present here a transcriptome-wide characterization of intraerythrocytic splicing events, as captured by RNA-Seq data from four timepoints of a single highly synchronous culture. Gene model-independent analysis of these data in conjunction with publicly available RNA-Seq data with HMMSplicer, an in-house developed splice site detection algorithm, revealed a total of 977 new 5' GU-AG 3' and 5 new 5' GC-AG 3' junctions absent from gene models and ESTs (11% increase to the current annotation). In addition, 310 alternative splicing events were detected in 254 (4.5%) genes, most of which truncate open reading frames. Splicing events antisense to gene models were also detected, revealing complex transcriptional arrangements within the parasite's transcriptome. Interestingly, antisense introns overlap sense introns more than would be expected by chance, perhaps indicating a functional relationship between overlapping transcripts or an inherent organizational property of the transcriptome. Independent experimental validation confirmed over 30 new antisense and alternative junctions. Thus, this largest assemblage of new and alternative splicing events to date in Plasmodium falciparum provides a more precise, dynamic view of the parasite's transcriptome.
    Nucleic Acids Research 01/2011; 39(9):3820-35. · 8.03 Impact Factor
  • Article: PredUs: a web server for predicting protein interfaces using structural neighbors.
    ABSTRACT: We describe PredUs, an interactive web server for the prediction of protein-protein interfaces. Potential interfacial residues for a query protein are identified by 'mapping' contacts from known interfaces of the query protein's structural neighbors to surface residues of the query. We calculate a score for each residue to be interfacial with a support vector machine. Results can be visualized in a molecular viewer and a number of interactive features allow users to tailor a prediction to a particular hypothesis. The PredUs server is available at: http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:PredUs.
    Nucleic Acids Research 05/2011; 39(Web Server issue):W283-7. · 8.03 Impact Factor
  • Article: Role of plant-specific N-terminal domain of maize CK2β1 subunit in CK2β functions and holoenzyme regulation.
    ABSTRACT: Protein kinase CK2 is a highly pleiotropic Ser/Thr kinase ubiquitous in eukaryotic organisms. CK2 is organized as a heterotetrameric enzyme composed of two types of subunits: the catalytic (CK2α) and the regulatory (CK2β). The CK2β subunits enhance the stability, activity and specificity of the holoenzyme, but they can also perform functions independently of the CK2 tetramer. CK2β regulatory subunits in plants differ from their animal or yeast counterparts, since they present an additional specific N-terminal extension of about 90 amino acids that shares no homology with any previously characterized functional domain. Sequence analysis of the N-terminal domain of land plant CK2β subunit sequences reveals its arrangement through short, conserved motifs, some of them including CK2 autophosphorylation sites. By using maize CK2β1 and a deleted version (ΔNCK2β1) lacking the N-terminal domain, we have demonstrated that CK2β1 is autophosphorylated within the N-terminal domain. Moreover, the holoenzyme composed of CK2α1/ΔNCK2β1 is able to phosphorylate different substrates more efficiently than CK2α1/CK2β1 or CK2α alone. Transient overexpression of CK2β1 and ΔNCK2β1 fused to GFP in different plant systems shows that the presence of the N-terminal domain enhances aggregation in nuclear speckles and stabilizes the protein against proteasome degradation. Finally, bimolecular fluorescence complementation (BiFC) assays show the nuclear and cytoplasmic location of the plant CK2 holoenzyme, in contrast to the individual CK2α/β subunits mainly observed in the nucleus. Altogether, our results support the hypothesis that the plant-specific N-terminal domain of CK2β subunits is involved in the down-regulation of the CK2 holoenzyme activity and in the stabilization of CK2β1 protein. In summary, the data presented in this work suggest that this domain was acquired by plants for regulatory purposes.
    PLoS ONE 01/2011; 6(7):e21909. · 4.09 Impact Factor

Keywords

6804 different regular expressions
annotation
distant relationships
efforts
families
fingerprints
functional sites
Hidden Markov Models
inferring protein function
integrated documentation resource
post-translational modification encoded
ProDom database projects
protein families
relevant member database(s)
sequence-based searches
Signature databases