Antal Novak

Cornell University, Ithaca, NY, USA

Are you Antal Novak?

Claim your profile

Publications (4)24.98 Total impact

  • Source
    Article: Automatic parameter learning for multiple local network alignment.
    [show abstract] [hide abstract]
    ABSTRACT: We developed Graemlin 2.0, a new multiple network aligner with (1) a new multi-stage approach to local network alignment; (2) a novel scoring function that can use arbitrary features of a multiple network alignment, such as protein deletions, protein duplications, protein mutations, and interaction losses; (3) a parameter learning algorithm that uses a training set of known network alignments to learn parameters for our scoring function and thereby adapt it to any set of networks; and (4) an algorithm that uses our scoring function to find approximate multiple network alignments in linear time. We tested Graemlin 2.0's accuracy on protein interaction networks from IntAct, DIP, and the Stanford Network Database. We show that, on each of these datasets, Graemlin 2.0 has higher sensitivity and specificity than existing network aligners. Graemlin 2.0 is available under the GNU public license at http://graemlin.stanford.edu .
    Journal of computational biology: a journal of computational molecular cell biology 09/2009; 16(8):1001-22. · 1.69 Impact Factor
  • Source
    Article: Graemlin: general and robust alignment of multiple large interaction networks.
    [show abstract] [hide abstract]
    ABSTRACT: The recent proliferation of protein interaction networks has motivated research into network alignment: the cross-species comparison of conserved functional modules. Previous studies have laid the foundations for such comparisons and demonstrated their power on a select set of sparse interaction networks. Recently, however, new computational techniques have produced hundreds of predicted interaction networks with interconnection densities that push existing alignment algorithms to their limits. To find conserved functional modules in these new networks, we have developed Graemlin, the first algorithm capable of scalable multiple network alignment. Graemlin's explicit model of functional evolution allows both the generalization of existing alignment scoring schemes and the location of conserved network topologies other than protein complexes and metabolic pathways. To assess Graemlin's performance, we have developed the first quantitative benchmarks for network alignment, which allow comparisons of algorithms in terms of their ability to recapitulate the KEGG database of conserved functional modules. We find that Graemlin achieves substantial scalability gains over previous methods while improving sensitivity.
    Genome Research 10/2006; 16(9):1169-81. · 13.61 Impact Factor
  • Source
    Article: Traffic-based feedback on the web.
    [show abstract] [hide abstract]
    ABSTRACT: Usage data at a high-traffic web site can expose information about external events and surges in popularity that may not be accessible solely from analyses of content and link structure. We consider sites that are organized around a set of items available for purchase or download, consider, for example, an e-commerce site or collection of online research papers, and we study a simple indicator of collective user interest in an item, the batting average, defined as the fraction of visits to an item's description that result in an acquisition of that item. We develop a stochastic model for identifying points in time at which an item's batting average experiences significant change. In experiments with usage data from the Internet Archive, we find that such changes often occur in an abrupt, discrete fashion, and that these changes can be closely aligned with events such as the highlighting of an item on the site or the appearance of a link from an active external referrer. In this way, analyzing the dynamics of item popularity at an active web site can help characterize the impact of a range of events taking place both on and off the site.
    Proceedings of the National Academy of Sciences 05/2004; 101 Suppl 1:5254-60. · 9.68 Impact Factor
  • Article: Traffic-Based Feedback on the Web
    [show abstract] [hide abstract]
    ABSTRACT: Usage data at a high-traffic Web site can expose information about external events and surges in popularity that may not be accessible solely from analyses of content and link structure.
    03/2004;