James-Stein for the leading eigenvector

Lisa R Goldberg; Alec N Kercheval

doi:10.1073/pnas.2207046120

James-Stein for the leading eigenvector

Proc Natl Acad Sci U S A. 2023 Jan 10;120(2):e2207046120. doi: 10.1073/pnas.2207046120. Epub 2023 Jan 5.

Authors

Lisa R Goldberg^{1

2}, Alec N Kercheval³

Affiliations

¹ Department of Economics and Consortium for Data Analytics in Risk, University of California, Berkeley, CA 94720.
² Aperio by BlackRock, Sausalito, CA 94965.
³ Department of Mathematics, Florida State University, Tallahassee, FL 32306.

Abstract

Recent research identifies and corrects bias, such as excess dispersion, in the leading sample eigenvector of a factor-based covariance matrix estimated from a high-dimension low sample size (HL) data set. We show that eigenvector bias can have a substantial impact on variance-minimizing optimization in the HL regime, while bias in estimated eigenvalues may have little effect. We describe a data-driven eigenvector shrinkage estimator in the HL regime called "James-Stein for eigenvectors" (JSE) and its close relationship with the James-Stein (JS) estimator for a collection of averages. We show, both theoretically and with numerical experiments, that, for certain variance-minimizing problems of practical importance, efforts to correct eigenvalues have little value in comparison to the JSE correction of the leading eigenvector. When certain extra information is present, JSE is a consistent estimator of the leading eigenvector.

Keywords: asymptotic regime; covariance matrix; factor model; optimization; shrinkage.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Bias*
Sample Size