Publications (2)0 Total impact

Conference Paper: An approximation algorithm for finding skeletal points for density based clustering approaches
[Show abstract] [Hide abstract]
ABSTRACT: Clustering is the problem of finding relations in a data set in an supervised manner. These relations can be extracted using the density of a data set, where density of a data point is defined as the number of data points around it. To find the number of data points around another point, region queries are adopted. Region queries are the most expensive construct in density based algorithm, so it should be optimized to enhance the performance of density based clustering algorithms specially on large data sets. Finding the optimum set of region queries to cover all the data points has been proven to be NPcomplete. This optimum set is called the skeletal points of a data set. In this paper, we proposed a generic algorithms which fires region queries at most 6 times the optimum number of region queries (has 6 as approximation factor). Also, we have extend this generic algorithm to create a DBSCAN (the most wellknown density based algorithm) derivative, named ADBSCAN. Presented experimental results show that ADBSCAN has a better approximation to DBSCAN than the DBRSH (the most wellknown randomized density based algorithm) in terms of performance and quality of clustering, specially for large data sets.Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on; 05/2009 
Conference Paper: An approximation algorithm for finding skeletal points for density based clustering approaches.
[Show abstract] [Hide abstract]
ABSTRACT: Clustering is the problem of finding relations in a data set in an supervised manner. These relations can be extracted using the density of a data set, where density of a data point is defined as the number of data points around it. To find the number of data points around another point, region queries are adopted. Region queries are the most expensive construct in density based algorithm, so it should be optimized to enhance the performance of density based clustering algorithms specially on large data sets. Finding the optimum set of region queries to cover all the data points has been proven to be NPcomplete. This optimum set is called the skeletal points of a data set. In this paper, we proposed a generic algorithms which fires region queries at most 6 times the optimum number of region queries (has 6 as approximation factor). Also, we have extend this generic algorithm to create a DBSCAN (the most wellknown density based algorithm) derivative, named ADBSCAN. Presented experimental results show that ADBSCAN has a better approximation to DBSCAN than the DBRSH (the most wellknown randomized density based algorithm) in terms of performance and quality of clustering, specially for large data sets.Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2009, part of the IEEE Symposium Series on Computational Intelligence 2009, Nashville, TN, USA, March 30, 2009  April 2, 2009; 01/2009
Publication Stats
2  Citations  
Top Journals
Institutions

2009

Sharif University of Technology
 Department of Computer Engineering
Teheran, Tehrān, Iran
