cPCA++: An efficient method for contrastive feature learning

作者:

Highlights:

• In this work, we propose a new data visualization and clustering technique for discovering discriminative structures in high-dimensional data.

• This technique, referred to as cPCA++, is motivated by the fact that the interesting features of a “target” dataset may be obscured by high variance components during traditional PCA.

• By analyzing what is referred to as a “background” dataset (i.e., one that exhibits the high variance principal components but not the interesting structures), our technique is capable of efficiently highlighting the structures that are unique to the “target” dataset.

• Similar to another recently proposed algorithm called “contrastive PCA” (cPCA), the proposed cPCA++ method identifies important dataset-specific patterns that are not detected by traditional PCA in a wide variety of settings.

• However, unlike cPCA, the proposed cPCA++ method does not require a parameter sweep, and as a result, it is significantly more efficient.

摘要

•In this work, we propose a new data visualization and clustering technique for discovering discriminative structures in high-dimensional data.•This technique, referred to as cPCA++, is motivated by the fact that the interesting features of a “target” dataset may be obscured by high variance components during traditional PCA.•By analyzing what is referred to as a “background” dataset (i.e., one that exhibits the high variance principal components but not the interesting structures), our technique is capable of efficiently highlighting the structures that are unique to the “target” dataset.•Similar to another recently proposed algorithm called “contrastive PCA” (cPCA), the proposed cPCA++ method identifies important dataset-specific patterns that are not detected by traditional PCA in a wide variety of settings.•However, unlike cPCA, the proposed cPCA++ method does not require a parameter sweep, and as a result, it is significantly more efficient.

论文关键词:PCA,Contrastive PCA,Feature learning,Dimensionality reduction

论文评审过程:Received 18 May 2021, Revised 13 September 2021, Accepted 18 October 2021, Available online 26 October 2021, Version of Record 17 December 2021.

论文官网地址:https://doi.org/10.1016/j.patcog.2021.108378