Matrix Concentration Inequalities and Free Probability II. Two-sided Bounds and Applications

Afonso Bandeira, Giorgio Cipolloni, Ramon van Handel, Dominik Schröder

preprint(2024)

Summary

We determine the approximate location of the extreme eigenvalues for a large class of random matrix models. These two-sided bounds are fundamentally beyond the reach of classical matrix concentration inequalities.

Example: Sample covariance matrix

Consider a rank-one perturbation of the identity matrix $I\in \mathbb R^{p\times p}$ as a population covariance matrix

\Sigma = I + \lambda vv^\top,

where $v\in\mathbb{R}^p$ is a unit vector and $\lambda>0$ is a parameter. Then draw $n$ random vectors $x_1, \ldots, x_n$ from the distribution $\mathcal N(0, \Sigma)$ and consider the sample covariance matrix

\hat \Sigma = \frac{1}{n}\sum_{i=1}^n x_i x_i^\top.

We are able to show that this model exhibits two phase transitions. Denoting the the ratio of dimensions $p,n$ by $\delta:=p/n$ the largest eigenvalue $\lambda_{\max}(\hat\Sigma-\Sigma)$ satisfies

\lambda_{\max} \approx \begin{cases} (1 + \sqrt\delta)^2 - 1, & \text{ if } \lambda < 1 + \sqrt\delta,\\ \frac{1 + \lambda}{2\lambda} (\sqrt\delta + \sqrt{\delta + 4 \lambda}) \sqrt\delta, & \text{ if } \lambda \geq 1 + \sqrt\delta, \end{cases}

while the smallest eigenvalue $\lambda_{\min}(\hat\Sigma-\Sigma)$ satisfies

\lambda_{\min} \approx \begin{cases} (1 - \sqrt\delta)^2 - 1, & \text{ if } \lambda < 1 - \sqrt\delta,\\ \frac{1 + \lambda}{2\lambda} (\sqrt\delta - \sqrt{\delta + 4 \lambda}) \sqrt\delta, & \text{ if } \lambda \geq 1 - \sqrt\delta \end{cases}

exactly as the corresponding free model suggests.

Numerical illustration

MaxMin

The grey histogram represents the empirical distribution of sample covariance eigenvalues, while the solid curve is the spectral density of the corresponding free model. The coloured histograms represent the empirical distribution of the largest and smallest eigenvalues of the sample covariance matrix. Here √δ ≈ 0.45 so that the two phase transitions occur at λ ≈ 0.55 and λ ≈ 1.45.

Abstract

The first paper in this series introduced a new family of nonasymptotic matrix concentration inequalities that sharply capture the spectral properties of very general Gaussian (as well as non-Gaussian) random matrices in terms of an associated noncommutative model. These methods achieved matching upper and lower bounds for smooth spectral statistics, but only provided upper bounds for the spectral edges. Here we obtain matching lower bounds for the spectral edges, completing the theory initiated in the first paper. The resulting two-sided bounds enable the study of applications that require an exact determination of the spectral edges to leading order, which is fundamentally beyond the reach of classical matrix concentration inequalities. To illustrate their utility, we undertake a detailed study of phase transition phenomena for spectral outliers of nonhomogeneous random matrices.

Matrix Concentration Inequalities and Free Probability II. Two-sided Bounds and Applications

Summary

Example: Sample covariance matrix

Numerical illustration

Abstract

Paper