Content uploaded by Charu C. Aggarwal
Author content
All content in this area was uploaded by Charu C. Aggarwal on Oct 14, 2013
Content may be subject to copyright.
Outlier Detection for High Dimensional Data
Charu C. Aggarwal
IBM T. J. Watson Research Center
Yorktown Heights, NY 10598
charu@us.ibm.com
Philip S. Yu
IBM T. J. Watson Research Center
Yorktown Heights, NY 10598
psyu@us.ibm.com
ABSTRACT
1. INTRODUCTION
Permission to make digital or hard copies of part or all of this work or
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. To copy otherwise, to
republish, to post on servers, or to redistribute to lists, requires prior
specific permission and/or a fee.
ACM SIGMOD 2001 May 21-24, Santa Barbara, California USA
Copyright 2001 ACM 1-58113-332-4/01/05…$5.00
37
x
x
x
x
x
xx
x
x
x
x
x
x
*
o
B
A
View 1
x
x
x
x
x
x
x
x
xx
A
View 2
*
oB
View 3
x
xx
x
xx
x
x
x
x
x
oB
*A
View 4
x
x
x
x
x
x
x
x
x
x
*
A
o B
38
1.1 Desiderata for High Dimensional Outlier
Detection Algorithms
1.2 Defining Outliers in Lower Dimensional
Projections
1.3 DefiningAbnormalLowerDimensionalPro-
jections
1.4 A Note on the Nature of the Problem
39
2. EVOLUTIONARY ALGORITHMS FOR
OUTLIER DETECTION
40
2.1 An Overview of Evolutionary Search
41
2.2 The Evolutionary Outlier Detection Algo-
rithm
42
2.3 Postprocessing Phase
2.4 Choice of Projection Parameters
43
3. EMPIRICAL RESULTS
3.1 An Intuitive Evaluation of Results
44
4. CONCLUSIONS
45
5. REFERENCES
46