LoCH: A neighborhood-based multidimensional projection technique for high-dimensional sparse spaces
Research output: Journal contributions › Journal articles › Research › peer-review
Standard
In: Neurocomputing, Vol. 150, No. Part B, 20.02.2015, p. 546-556.
Research output: Journal contributions › Journal articles › Research › peer-review
Harvard
APA
Vancouver
Bibtex
}
RIS
TY - JOUR
T1 - LoCH
T2 - A neighborhood-based multidimensional projection technique for high-dimensional sparse spaces
AU - Fadel, Samuel G.
AU - Fatore, Francisco M.
AU - Duarte, Felipe S.L.G.
AU - Paulovich, Fernando V.
PY - 2015/2/20
Y1 - 2015/2/20
N2 - On the last few years multidimensional projection techniques have advanced towards defining faster and user-centered approaches. However, most of existing methods are designed as generic tools without considering particular features of the data under processing, such as the distance distribution when the data is embedded into a certain metric space. In this paper we split the projection techniques into two groups, global and local techniques, conduct an analysis of them, and present a novel local technique specially designed for projecting heavy tail distance distributions, such as the one produced by high-dimensional sparse spaces. This novel approach, called Local Convex Hull (LoCH), relies on an iterative process that seeks to place each point close to the convex hull of its nearest neighbors. The accuracy, in terms of neighborhood preservation, is confirmed by a set of comparisons and tests, showing that LoCH is capable of successfully segregating groups of similar instances embedded in high-dimensional sparse spaces and of defining the borders between them, significantly better than most projection techniques.
AB - On the last few years multidimensional projection techniques have advanced towards defining faster and user-centered approaches. However, most of existing methods are designed as generic tools without considering particular features of the data under processing, such as the distance distribution when the data is embedded into a certain metric space. In this paper we split the projection techniques into two groups, global and local techniques, conduct an analysis of them, and present a novel local technique specially designed for projecting heavy tail distance distributions, such as the one produced by high-dimensional sparse spaces. This novel approach, called Local Convex Hull (LoCH), relies on an iterative process that seeks to place each point close to the convex hull of its nearest neighbors. The accuracy, in terms of neighborhood preservation, is confirmed by a set of comparisons and tests, showing that LoCH is capable of successfully segregating groups of similar instances embedded in high-dimensional sparse spaces and of defining the borders between them, significantly better than most projection techniques.
KW - High-dimensional sparse space
KW - Local multidimensional projection
KW - Visual data mining
KW - Business informatics
UR - http://www.scopus.com/inward/record.url?scp=84922676246&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2014.07.071
DO - 10.1016/j.neucom.2014.07.071
M3 - Journal articles
AN - SCOPUS:84922676246
VL - 150
SP - 546
EP - 556
JO - Neurocomputing
JF - Neurocomputing
SN - 0925-2312
IS - Part B
ER -