An Efficient Semantic Similarity Search on XML Documents
ABSTRACT In this paper, we study the use of XML path queries to search for XML fragments in a collection of XML documents. We present efficient techniques that employ a bit-sliced, Bloom-filtered signature file as a filter, and we investigate an approach that measures the semantic similarity between a query and the XML documents in the collection by considering both their structure and their content. The search result is a ranked list of XML fragments. Experiments with a set of path queries on a variety of XML documents show that the approach improves both the precision and the efficiency of the search.
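To make the filtering idea concrete, the following is a minimal sketch of a bit-sliced Bloom-filter signature file. It is not the implementation evaluated in this paper. The class name, the parameters num_bits and num_hashes, the use of SHA-256 for hashing, and the representation of paths as plain strings are all illustrative assumptions. Each document's paths are hashed into a fixed-width signature, and the signatures are stored column-wise (one bitmap per bit position), so a query only has to AND together the slices for the bits it sets.

```python
import hashlib


class BitSlicedSignatureFile:
    """Hypothetical sketch: Bloom-filter signatures stored as bit slices."""

    def __init__(self, num_bits=64, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # slices[i] is an integer bitmap: bit d is set if document d
        # has bit i set in its signature.
        self.slices = [0] * num_bits
        self.num_docs = 0

    def _positions(self, path):
        # Derive num_hashes bit positions for a path (assumed hashing scheme).
        positions = set()
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{path}".encode("utf-8")).digest()
            positions.add(int.from_bytes(digest[:8], "big") % self.num_bits)
        return positions

    def add_document(self, paths):
        # Register a document by the set of paths it contains.
        doc_id = self.num_docs
        self.num_docs += 1
        for path in paths:
            for pos in self._positions(path):
                self.slices[pos] |= 1 << doc_id
        return doc_id

    def candidates(self, query_paths):
        # Start with every document, then AND in only the slices
        # that correspond to bits set by the query paths.
        result = (1 << self.num_docs) - 1
        for path in query_paths:
            for pos in self._positions(path):
                result &= self.slices[pos]
        return [d for d in range(self.num_docs) if (result >> d) & 1]


index = BitSlicedSignatureFile()
d0 = index.add_document(["/book/title", "/book/author/name"])
d1 = index.add_document(["/article/title", "/article/year"])
# Always contains d0; may also contain false positives.
print(index.candidates(["/book/title"]))
```

A filter of this kind never misses a document that contains all the query paths, but it may return false positives, so the surviving candidates still have to be checked and ranked by the similarity measure.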