Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/15097
Title: Nearest neighbors in high-dimensional data : The emergence and influence of hubs
Authors: Radovanović M.
Nanopoulos A.
Ivanović, Mirjana 
Issue Date: 15-Sep-2009
Journal: ACM International Conference Proceeding Series
Abstract: High dimensionality can pose severe difficulties, widely recognized as different aspects of the curse of dimensionality. In this paper we study a new aspect of the curse pertaining to the distribution of k-occurrences, i.e., the number of times a point appears among the k nearest neighbors of other points in a data set. We show that, as dimensionality increases, this distribution becomes considerably skewed and hub points emerge (points with very high k-occurrences). We examine the origin of this phenomenon, showing that it is an inherent property of highdimensional vector space, and explore its influence on applications based on measuring distances in vector spaces, notably classification, clustering, and information retrieval. Copyright 2009.
URI: https://open.uns.ac.rs/handle/123456789/15097
ISBN: 9781605585161
DOI: 10.1145/1553374.1553485
Appears in Collections:PMF Publikacije/Publications

Show full item record

SCOPUSTM   
Citations

12
checked on Nov 20, 2023

Page view(s)

23
Last Week
8
Last month
0
checked on May 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.