An improved scoring scheme for predicting glycan structures from gene expression data.

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan.
Genome informatics. International Conference on Genome Informatics 02/2007; 18:237-46. DOI: 10.1142/9781860949920_0023
Source: PubMed

ABSTRACT The prediction of glycan structures from gene expression of glycosyltransferases (GTs) is a challenging new area in computational biology because the biosynthesis of glycan chains is under the control of GT expression. In this paper, we developed a new method for predicting glycan structures from gene expression data. The proposed method has two main original aspects. First, we increase the number of predictable glycan structure candidates by estimating missing glycans from a global glycan structure map, which enables us to predict new glycan structures that are not stored in the database. Second, we propose a more general scoring scheme based on real-valued gene expression intensity rather than converting it into binary information. We applied the proposed method to predicting cancer-specific glycan structures from gene expression profiles of patients with acute lymphocytic leukemia (ALL) and acute myelocytic leukemia (AML). We confirmed that several of the predicted glycan structures correspond to known cancer-specific glycan structures reported in the literature, and that our method outperforms the previous methods at a statistically significant level.
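The contrast the abstract draws between binary and real-valued scoring can be illustrated with a minimal sketch. This is not the paper's scoring formula, which the abstract does not give. It assumes that each candidate glycan is described by a set of required GTs, that expression values are normalized to [0, 1], and that a score is a simple average. The function names, gene labels, threshold, and expression values are all hypothetical.

```python
from statistics import mean


def binary_score(required_gts, expression, threshold=0.5):
    """Fraction of required GTs called 'expressed' after thresholding.

    Assumption: a GT counts as expressed when its value exceeds `threshold`.
    """
    return mean(
        1.0 if expression.get(gt, 0.0) > threshold else 0.0
        for gt in required_gts
    )


def real_valued_score(required_gts, expression):
    """Mean expression intensity over the required GTs (no thresholding)."""
    return mean(expression.get(gt, 0.0) for gt in required_gts)


# Hypothetical normalized expression profile for one sample.
expression = {"GT_A": 0.9, "GT_B": 0.6, "GT_C": 0.1}

# Hypothetical candidate glycans and the GTs assumed necessary to build them.
glycans = {
    "glycan_1": ["GT_A", "GT_B"],
    "glycan_2": ["GT_B", "GT_C"],
    "glycan_3": ["GT_A", "GT_C"],
}

binary = {name: binary_score(gts, expression) for name, gts in glycans.items()}
real = {name: real_valued_score(gts, expression) for name, gts in glycans.items()}

for name in glycans:
    print(name, binary[name], round(real[name], 3))
```

In this toy profile the binary scores of glycan_2 and glycan_3 tie, because each has exactly one GT above the threshold, while the real-valued scores separate them. That loss of ranking information under binarization is the kind of effect a real-valued scheme is meant to avoid.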

  • ABSTRACT: Glycosylation serves essential functions on many proteins produced in biopharmaceutical manufacturing, making it mandatory to thoroughly consider its biogenesis during the production process. Glycoengineering efforts involve the rational design of glycosylation through adjustments in culturing conditions or genetic modifications. Computational models have been developed to aid this process, aiming to offer cheaper and faster alternatives to costly screening strategies. Recently, these models have been successfully utilized to predict glycosylation of products of industrial relevance. Furthermore, systems-level analyses of glycan diversity are elucidating deeper insights into the mechanisms underlying glycosylation. As computational models of glycosylation continue to be expanded, refined, and leveraged for detailed analysis of glycomics data, they will become invaluable resources for cell line development and glycoengineering.
    Current Opinion in Biotechnology 12/2014; 30:218–224. DOI:10.1016/j.copbio.2014.08.004 · 8.04 Impact Factor
  • ABSTRACT: Searchable mass spectral libraries for glycans may be enhanced using a B2 ion library. Using a quadrupole ion-trap mass spectrometer, successive fragmentations of sodiated oligosaccharides were carried out in the positive ion mode. In B,Y-type fragmentation, disaccharide B2 ions are generated which correspond to specific glycosidic linkages using progressive MS stages. Fragmentation of "B2 ions" corresponding to glycosidic linkages such as Hex-Fuc, Hex-Hex, Hex-HexNAc, HexNAc-Hex and HexNAc-HexNAc were systematically studied in low energy CID and collected to form a "B2 library". Linkages produce characteristic fragmentation patterns in the absence of cross-ring fragmentation. Patterns of "B2 ions" rely on relative stability of glycosidic bonds and carbohydrate-metal complexes in the gas phase. MS(n) studies of linear, branched trisaccharides and tetrasaccharides show that isomers for which B2 ion information is not available are rarely a problem in practice by their absence in an isomeric sequence or by their scarcity in nature. This MS strategy for linkage determination of carbohydrates aided by a "B2 library" was developed with a scope for expansion, providing an improved tool for glycomics. We validated this method examining levels of expressed activities of two glycosyl transferases in cancer cell lines: β3(B3GALNT2) and β4GalNAcT(B4GALNT3&4) that generate GalNAcβ3GlcNAcβ and GalNAcβ4GlcNAcβ.
    Journal of Proteomics 08/2014; DOI:10.1016/j.jprot.2014.07.013 · 3.93 Impact Factor
  • ABSTRACT: Glycoproteins for treating human diseases have revolutionized the health care industry. However, controlling glycosylation has been a challenge as small variations in glycan structure can be responsible for significant changes in key therapeutic properties. Manipulation of glycan biosynthesis can be particularly complex since the process is not directly encoded on the genome but depends on multiple variables such as enzymes’ activity, selectivity, localization, expression host, and process parameters and conditions. Furthermore, a particular glycoprotein may include many different glycan structures due to differences in processing that occur for each individual molecule. The present chapter focuses on experimental and computational approaches to direct N-glycosylation in expression systems for generation of biotherapeutics of superior value. Glycoengineering-based manipulations of glycan structures using glycosyltransferases, modification of precursor biosynthetic pathways, and predictions of glycosylation patterns using mathematical models are described including examples from the literature as a means of optimizing glycoform distributions in cells.

