The Rademacher distribution on ℝ: uniform on {-1, +1}. This is the probability measure assigning mass 1/2 to each of +1 and -1.
Instances For
Predicate: the entries of a random matrix X are i.i.d. Rademacher random variables. This means the n×d entries, viewed as random variables indexed by (Fin n × Fin d), are mutually independent and each has the Rademacher distribution (uniform on {-1, +1}).
Instances For
Probabilistic Statement of Proposition 2.16 #
The proof of Proposition 2.16 combines three steps:
- Per-pair Hoeffding bound (Theorem 1.9): for each off-diagonal pair (j,l), P(|(XᵀX/n)_{jl}| > t) ≤ 2exp(-nt²/2), since the products εᵢⱼεᵢₗ are iid Rademacher random variables.
- Union bound: summing over at most (d choose 2) unordered pairs, P(|XᵀX/n - I|∞ > t) ≤ d²exp(-nt²/2) (the factor 2 from Hoeffding is absorbed: (d choose 2)·2 = d(d-1) ≤ d²).
- Analytical bound: setting t = 1/(14k) and using the sample size condition n ≥ 392k²(log(1/δ) + 2log(d)) yields d²exp(-n/(392k²)) ≤ δ.
Step 1 requires connecting the Rademacher iid structure to Hoeffding's inequality through product measure and coordinate independence — probability infrastructure that the book invokes by reference ("Hoeffding: Theorem 1.9"). Steps 1-2 are therefore axiomatized as a single result. Step 3 is proved below.
Helper lemmas for the Hoeffding + union bound proof #
The complement of AssumptionINC for Rademacher entries is contained in the union of off-diagonal bad events, since diagonal entries are exactly 1.
Independence of row-wise product RVs (infrastructure lemma).
For an iid Rademacher matrix X, the products ξᵢ = X_{i,j₁}·X_{i,j₂} across different rows i are independent. This follows from the mutual independence of the matrix entries: ξᵢ depends only on entries (i,j₁) and (i,j₂), and different rows involve disjoint sets of entries.
The book invokes this implicitly when applying Hoeffding to the products. This is standard measure theory (products of independent RVs with disjoint coordinate dependence are independent) but requires substantial infrastructure to formalize from first principles.
Measurability of entry functions from an iid Rademacher matrix.
Each entry X_{i,j} is a measurable function. This follows directly from the measurability condition in IsIIDRademacherMatrix.
The integral of the identity function with respect to the Rademacher measure is zero. This is because E[Rademacher] = (1/2)·1 + (1/2)·(-1) = 0 by symmetry.
For a measurable function with Rademacher distribution, its integral (mean) is zero.
Zero mean of Rademacher products (infrastructure lemma).
For an iid Rademacher matrix X with j₁ ≠ j₂, the product X_{i,j₁}·X_{i,j₂} has mean zero: E[X_{i,j₁}·X_{i,j₂}] = E[X_{i,j₁}]·E[X_{i,j₂}] = 0·0 = 0. The first equality uses independence of entries in the same row at different columns; the second uses E[Rademacher] = 0 by symmetry.
Per-pair Hoeffding bound (Theorem 1.9 applied to products of Rademacher entries).
For fixed j₁ ≠ j₂ and a random Rademacher matrix X, the products ξᵢ = X_{i,j₁}·X_{i,j₂} are independent RVs bounded in [-1,1] with mean 0. By Hoeffding's inequality (Theorem 1.9 with aᵢ=-1, bᵢ=1), the sample mean satisfies: P(|(1/n)∑ξᵢ| > t) ≤ 2·exp(-nt²/2).
The bound 2·exp(-nt²/2) comes from Hoeffding with ∑(bᵢ-aᵢ)² = 4n: exp(-2(nt)²/(4n)) = exp(-nt²/2), applied to both tails.
The complement of AssumptionINC is contained in the union over upper-triangular off-diagonal pairs, using the symmetry of XᵀX.
Hoeffding + union bound for iid Rademacher matrices (Theorem 1.9 + union bound).
For an n×d random matrix X with i.i.d. Rademacher (±1) entries, P(¬ INC(k)) ≤ d²·exp(-n·(1/(14k))²/2) = d²·exp(-n/(392k²)).
This combines Hoeffding's inequality (Theorem 1.9) on each off-diagonal pair (j,l) — noting that the products εᵢⱼ·εᵢₗ are iid Rademacher, so the mean (1/n)∑εᵢⱼεᵢₗ has two-sided tail ≤ 2exp(-nt²/2) — with the union bound over (d choose 2) ≤ d²/2 unordered pairs, absorbing the factor of 2 to get d²·exp(-nt²/2).
Tail bound for the incoherence condition (book proof, lines 1715-1718).
For an n×d random matrix X with i.i.d. Rademacher (±1) entries, P(¬ INC(k)) ≤ d²·exp(-n·(1/(14k))²/2) = d²·exp(-n/(392k²)).
This combines Hoeffding's inequality (Theorem 1.9) on each off-diagonal pair with the union bound over (d choose 2) ≤ d² pairs, as stated in the book's proof. The diagonal entries equal 1 exactly since Rademacher squares are 1.
Proved from the hoeffding_union_bound_for_rademacher axiom.
Helper: if the complement of a measurable set has small probability, then the set itself has large probability.
Analytical bound (book proof, lines 1720-1726). Setting t = 1/(14k), the tail bound becomes d²·exp(-n/(392k²)). This is at most δ when n ≥ 392k²(log(1/δ) + 2log(d)), since exp(-n/(392k²)) ≤ exp(-(log(1/δ) + 2log(d))) = δ/d².
Proposition 2.16
Part 2: Existence of INC(k) matrices #
The book states: "It implies that there exist matrices that satisfy Assumption INC(k) for n ≳ k²log(d)."
This is a direct consequence of the probabilistic statement: since for any δ ∈ (0,1), the probability of INC(k) is positive when n ≥ 392k²(log(1/δ) + 2log(d)), such matrices must exist. In particular, taking δ = 1/2, we need n ≥ 392k²(log 2 + 2log(d)) ≲ k²log(d) (for d large).
The fair coin measure on Fin 2: assigns mass 1/2 to each of 0 and 1.
Instances For
Existence of an i.i.d. Rademacher probability space.
For any dimensions n, d, there exists a probability space (Ω, μ) equipped with an n×d random matrix X whose entries are i.i.d. Rademacher (±1) random variables, and such that the INC(k) event is measurable for all k.
This is the product measure construction ∏ᵢⱼ Rademacher on {-1,+1}^{n×d}, which the book assumes implicitly when discussing random Rademacher matrices. Constructed using Ω = (Fin n × Fin d) → Fin 2 with the product of fair coin measures, mapping each coordinate to ±1 via coinToSign.
Proposition 2.16, Part 2 (Existence of INC(k) matrices).
There exist n×d matrices satisfying Assumption INC(k) whenever n is sufficiently large relative to k²log(d). Specifically, for any n ≥ 392k²(log 2 + 2log(d)), there exists a matrix X ∈ {-1,+1}^{n×d} satisfying INC(k).
This follows from Proposition 2.16 (Part 1): taking δ = 1/2, the probability that a random Rademacher matrix satisfies INC(k) is at least 1/2 > 0, so such a matrix must exist (probabilistic method).