Deepak Lachwani

University of Florida, Gainesville, Florida, United States

Publications (2) · 1.4 Total impact

  • ABSTRACT: We consider the problem of similarity search in databases with costly metric distance measures. Given limited main memory, our goal is to develop a reference-based index that reduces the number of comparisons in order to answer a query. The idea in reference-based indexing is to select a small set of reference objects that serve as a surrogate for the other objects in the database. We consider novel strategies for selection of references and assigning references to database objects. For dynamic databases with frequent updates, we propose two incremental versions of the selection algorithm. Our experimental results show that our selection and assignment methods far outperform competing methods.
    The VLDB Journal 07/2008; 17(5):1231-1251. · 1.40 Impact Factor
  • ABSTRACT: We consider the problem of similarity search in a very large sequence database with edit distance as the similarity measure. Given limited main memory, our goal is to develop a reference-based index that reduces the number of costly edit distance computations in order to answer a query. The idea in reference-based indexing is to select a small set of reference sequences that serve as a surrogate for the other sequences in the database. We consider two novel strategies for selecting references as well as a new strategy for assigning references to database sequences. Our experimental results show that our selection and assignment methods far outperform competitive methods. For example, our methods prune up to 20 times as many sequences as the Omni method, and as many as 30 times as many sequences as frequency vectors. Our methods also scale nicely for databases containing many and/or very long sequences.
    Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, Korea, September 12-15, 2006; 01/2006
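
The general idea behind reference-based indexing, as the abstracts describe it, can be illustrated with a minimal Python sketch. It is not the authors' method: it does not implement their reference selection or assignment strategies, and it simply assigns every reference to every database sequence. The function names (`edit_distance`, `build_index`, `range_query`) are hypothetical and chosen for illustration only. The sketch precomputes distances from each sequence to the references, then uses the triangle inequality to skip sequences whose lower bound on the distance to the query already exceeds the search radius.

```python
from typing import List, Sequence, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # delete ca
                            curr[j - 1] + 1,            # insert cb
                            prev[j - 1] + (ca != cb)))  # substitute or match
        prev = curr
    return prev[-1]


def build_index(db: Sequence[str], refs: Sequence[str]) -> List[List[int]]:
    """Precompute the distance from every database sequence to every reference."""
    return [[edit_distance(r, s) for r in refs] for s in db]


def range_query(query: str, radius: int, db: Sequence[str],
                refs: Sequence[str],
                index: List[List[int]]) -> Tuple[List[str], int]:
    """Return sequences within `radius` of `query`, plus the number of
    database sequences whose distance to the query was actually computed."""
    q_to_ref = [edit_distance(query, r) for r in refs]
    hits: List[str] = []
    computed = 0
    for seq, seq_to_ref in zip(db, index):
        # Triangle inequality: |d(q, r) - d(r, s)| <= d(q, s) for every reference r.
        lower = max(abs(qr - sr) for qr, sr in zip(q_to_ref, seq_to_ref))
        if lower > radius:
            continue  # pruned without computing the costly distance
        computed += 1
        if edit_distance(query, seq) <= radius:
            hits.append(seq)
    return hits, computed
```

In this sketch the query still pays for one distance computation per reference, so the savings depend on how many database sequences the lower bound eliminates. That is why the choice of references matters, which is the question the two papers address.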

Publication Stats

31 Citations
1.40 Total Impact Points

Institutions

  • 2006
    • University of Florida
      • Department of Computer and Information Science and Engineering
      Gainesville, Florida, United States