Please use this identifier to cite or link to this item:
https://open.uns.ac.rs/handle/123456789/31176
Title: | Reverse nearest neighbors in unsupervised distance-based outlier detection | Authors: | Radovanović Miloš Nanopoulos A. Ivanović Mirjana |
Issue Date: | 2015 | Journal: | IEEE Transactions on Knowledge and Data Engineering | Abstract: | © 2014 IEEE. Outlier detection in high-dimensional data presents various challenges resulting from the 'curse of dimensionality.' A prevailing view is that distance concentration, i.e., the tendency of distances in high-dimensional data to become indiscernible, hinders the detection of outliers by making distance-based methods label all points as almost equally good outliers. In this paper, we provide evidence supporting the opinion that such a view is too simple, by demonstrating that distance-based methods can produce more contrasting outlier scores in high-dimensional settings. Furthermore, we show that high dimensionality can have a different impact, by reexamining the notion of reverse nearest neighbors in the unsupervised outlier-detection context. Namely, it was recently observed that the distribution of points' reverse-neighbor counts becomes skewed in high dimensions, resulting in the phenomenon known as hubness. We provide insight into how some points (antihubs) appear very infrequently in k-NN lists of other points, and explain the connection between antihubs, outliers, and existing unsupervised outlier-detection methods. By evaluating the classic k-NN method, the angle-based technique designed for high-dimensional data, the density-based local outlier factor and influenced outlierness methods, and antihub-based methods on various synthetic and real-world data sets, we offer novel insight into the usefulness of reverse neighbor counts in unsupervised outlier detection. | URI: | https://open.uns.ac.rs/handle/123456789/31176 | ISSN: | 1041-4347 | DOI: | 10.1109/TKDE.2014.2365790 |
Appears in Collections: | PMF Publikacije/Publications |
Show full item record
SCOPUSTM
Citations
168
checked on Apr 29, 2023
Page view(s)
27
Last Week
8
8
Last month
0
0
checked on May 10, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.