Penalized PCA approaches for B-spline expansions of smooth functional data

作者:

Highlights:

摘要

Functional principal component analysis (FPCA) is a dimension reduction technique that explains the dependence structure of a functional data set in terms of uncorrelated variables. In many applications the data are a set of smooth functions observed with error. In these cases the principal components are difficult to interpret because the estimated weight functions have a lot of variability and lack of smoothness. The most common way to solve this problem is based on penalizing the roughness of a function by its integrated squared d-order derivative. Two alternative forms of penalized FPCA based on B-spline basis expansions of sample curves and a simpler discrete penalty that measures the roughness of a function by summing squared d-order differences between adjacent B-spline coefficients (P-spline penalty) are proposed in this paper. The main difference between both smoothed FPCA approaches is that the first uses the P-spline penalty in the least squares approximation of the sample curves in terms of a B-spline basis meanwhile the second introduces the P-spline penalty in the orthonormality constraint of the algorithm that computes the principal components. Leave-one-out cross-validation is adapted to select the smoothing parameter for these two smoothed FPCA approaches. A simulation study and an application with chemometric functional data are developed to test the performance of the proposed smoothed approaches and to compare the results with non penalized FPCA and regularized FPCA.

论文关键词:Functional data,Principal component analysis,B-spline expansion,Roughness penalty,P-splines

论文评审过程:Available online 5 March 2013.

论文官网地址:https://doi.org/10.1016/j.amc.2013.02.009