A general framework for subspace detection in unordered multidimensional data

作者:

Highlights:

摘要

The analysis of large volumes of unordered multidimensional data is a problem confronted by scientists and data analysts every day. Often, it involves searching for data alignments that emerge as well-defined structures or geometric patterns in datasets. For example, straight lines, circles, and ellipses represent meaningful structures in data collected from electron backscatter diffraction, particle accelerators, and clonogenic assays. Also, customers with similar behavior describe linear correlations in e-commerce databases. We describe a general approach for detecting data alignments in large unordered noisy multidimensional datasets. In contrast to classical techniques such as the Hough transforms, which are designed for detecting a specific type of alignment on a given type of input, our approach is independent of the geometric properties of the alignments to be detected, as well as independent of the type of input data. Thus, it allows concurrent detection of multiple kinds of data alignments, in datasets containing multiple types of data. Given its general nature, optimizations developed for our technique immediately benefit all its applications, regardless the type of input data.

论文关键词:Hough transform,Geometric algebra,Parameter space,Subspace detection,Shape detection,Blade,Grassmannian,Coordinate chart,Line,Circle,Plane,Sphere,Conic section,Flat,Round,Quadric

论文评审过程:Received 14 August 2011, Revised 30 December 2011, Accepted 19 February 2012, Available online 5 March 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.02.033