Notebook 16 – Math 2121, Fall 2020

In this notebook we explore more random matrices and their eigenvalues.

Last time we saw that the eigenvalues of a large orthogonal matrix chosen uniformly at random are, with high probability, evenly spaced complex numbers in the unit circle {zC:|z|=1}.

Thus, the process of choosing a random eigenvalue of a very large, uniformly random orthogonal matrix is indistiguishable from choosing an element of {zC:|z|=1} uniformly at random.

Today we will examine a similar phenomenon for random symmetric matrices.

795 μs
1.6 ms
10.5 s
2.4 s

First let's discuss a few more details about the standard normal distribution.

Here is an intuitive definition of this probability distribution:

  • Suppose some students are taking an exam with q different True/False questions.

  • On each question, a student has equal chances of answering correctly or incorrectly.

  • Students receive +1 points for a correct answer, and 1 points otherwise.

  • The final score on the exam is the sum of these ±1 points divided by q.

Since we divide by q instead of q, it is possible to receive a score as low as q or as high as +q.

Definition: The standard normal distribution is the probability distribution that describes the random scores on this exam when the number of questions is very large.

1.9 ms
questions
10000
1.1 μs
students
50000
2.4 μs

To observe this empirically, we can simulate the scores for 50000 students on a 10000 question exam.

13.5 μs
50000×1 Array{Float64,2}:
 -1.98
  0.98
  0.08
 -0.1
 -0.36
 -0.34
  0.26
  ⋮
 -0.06
 -2.22
  0.1
 -1.08
  0.96
 -0.24
7.9 s

Now we compare the histogram of scores to a histogram of 50000 samples of the standard normal distribution (computed using a Julia library function).

These plots should look almost the same if our parameters are large enough.

13.0 μs
8.1 s

All matrices today have real entries.

A square matrix S=ST is symmetric if it is equal to its transpose. For example:

10.0 μs
3×3 Array{Int64,2}:
 1  3  5
 3  0  6
 5  6  2
37.5 ms

All eigenvalues of a symmetric matrix are real numbers. For example:

9.3 μs
869 ms

Compare with the eigenvalues of a random (non-symmetric) square matrix:

6.7 μs
171 ms

Here is a proof of this fact in general.

Assume S=ST is an n×n symmetric matrix. This means S has all real entries.

Suppose 0vCn is an eigenvector for S with eigenvalue λC, so that

Sv=λvandSv¯=λ¯v¯.

The product v¯Tv=v2 is a positive real number since v0.

On the other hand we have both

v¯TSv=v¯T(Sv)=v¯T(λv)=λv¯Tv=λv2

and, because S=ST, also

v¯TSv=(STv¯)Tv=(Sv¯)Tv=(λ¯v¯)Tv=λ¯v¯Tv=λ¯v2.

These expressions must be equal, and the number v20 is nonzero, so λ=λ¯R.

12.4 μs
273 ns

The set of symmetric n×n matrices has infinite volume, so it does not make sense to take about a symmetric matrices chosen uniformly at random.

Here is a different way of sampling a random symmetric matrix.

6.7 μs

First, we generate a random matrix with entries from the standard normal distribution.

4.4 μs
n
5000
2.2 μs
367 ms

Now to get a random symmetric matrix, we can just form the sum A+AT.

To make later plots nicer, we also divide by the normalizing constant 8n.

4.4 μs
S
5000×5000 Array{Float64,2}:
 -0.00284525   0.0031909    -0.0115228   …   0.00735712  -0.00644351   -0.0120708
  0.0031909   -0.0210216     0.00159086     -0.0109171   -0.00906114    0.0124589
 -0.0115228    0.00159086    0.0177357      -0.00719807  -0.00122543   -0.00144259
 -0.0088009    0.00934093   -0.00193897      0.017209    -0.00196763    0.0053541
  0.010613    -0.000949646  -0.00718409      0.0034996    0.00706539   -0.0124317
 -0.00219485  -0.00013567   -0.0111961   …   0.00365358  -0.000357063  -0.00754908
 -0.00102919   0.00232089   -0.00990527     -0.0034211   -0.00695158    0.00374101
  ⋮                                      ⋱                             
  0.010426     0.0027998     0.00216689     -0.0119353   -0.00485233    0.00267533
  0.00492574  -0.00829167    0.0041719   …   0.0131883   -0.00866356   -0.00655834
 -0.0016651   -0.000921308   0.00991947     -0.0120893    0.00111046    0.0184311
  0.00735712  -0.0109171    -0.00719807     -0.00223987   0.00224638   -0.0108681
 -0.00644351  -0.00906114   -0.00122543      0.00224638  -0.00971652    0.0104294
 -0.0120708    0.0124589    -0.00144259     -0.0108681    0.0104294    -0.00556288
669 ms

Here are the (real) eigenvalues of this matrix, in sorted order:

10.0 μs
eig
22.3 s

The histogram of these eigenvalues, when n is large, has a surprising shape.

This histogram is the unit semicircle with height rescaled by factor 2/π so that its area is exactly 1.

7.6 μs
0.6366197723675814
2.8 ms
916 ms

Thus, the eigenvalues of a large random symmetric matrix with entries from the standard normal distribution obey yet another probability law that is neither the uniform distribution nor the normal distribution.

Instead, these random eigenvalues follow what is called the Wigner semicircle distribution.

9.8 μs

We can again observe this empirically using Julia's built-in functions for the semicircle distribution:

2.9 μs
396 ms