Seminar
Notice
Statistical
Laboratory
DATE
AND TIME:
PLACE: 319
Snedecor
SPEAKER: Debashis
Paul
Department
of Statistics
TITLE: Principal components analysis for high
dimensional
data
Suppose
we have i.i.d. observations from a multivariate Gaussian distribution with mean
and covariance matrix
. We consider the problem of estimating the leading
eigenvectors of
when the dimension
of the observation
vectors increases with the sample size
. We work under the setup where the covariance matrix is a
finite rank perturbation of identity. We show that even though the ordinary
principal components analysis may fail to yield consistent estimators of the
eigenvectors, if the data can be sparsely represented in some known basis, then
a scheme based on first selecting a set of significant coordinates and then
applying PCA to the submatrix of sample covariance matrix corresponding to the
selected coordinates, gives better estimates. Under suitable sparsity
restrictions, we show that the risk of the proposed estimator has the optimal
rate of convergence when measured in a squared-error type loss. We demonstrate
the performance of our method through simulation studies and discuss some
potential applications. We also state some new results about the behavior of
the eigenvalues and eigenvectors of sample covariance matrix when
converges to a
positive constant.
COFFEE:
Seminar schedules and abstracts
are available via WWW: http://www.stat.iastate.edu/