Lemma 1.12: Square of sub-Gaussian is sub-exponential #
Let X subG(σ²). Then Z = X² - E[X²] is sub-exponential: Z subE(16σ²).
The proof in the book uses:
- Power series expansion of E[exp(sZ)]
- Jensen's inequality (twice) to bound centered moments
- Lemma 1.4 for moment bounds E[X^{2k}] ≤ (2σ²)^k · k!
- Geometric series summation for |8sσ²| < 1
- The inequality 1 + x ≤ exp(x)
This yields: E[exp(sZ)] ≤ 1 + 128s²σ⁴ ≤ exp(s²(16σ²)²/2) for |s| ≤ 1/(16σ²).
Auxiliary lemmas #
For all x ∈ ℝ, x² ≤ exp(x) + exp(-x).
Proof: By the Taylor expansion exp(y) ≥ 1 + y + y²/2 + y³/6 for y ≥ 0
(from Real.sum_le_exp_of_nonneg), applied to y = |x|, we get
exp(|x|) ≥ 1 + |x| + x²/2 + |x|³/6 ≥ x²
(using the algebraic fact |x|³ - 3|x|² + 6|x| + 6 ≥ 0 for |x| ≥ 0).
Since exp(-|x|) > 0 and exp(x) + exp(-x) = exp(|x|) + exp(-|x|),
we conclude x² ≤ exp(|x|) ≤ exp(x) + exp(-x).
If X has integrable exponential moments for all s, then X² is integrable. This uses the pointwise bound x² ≤ exp(x) + exp(-x).
Helper lemmas for the analytical MGF bound #
Helper: The DCT interchange for the tail of the exp series applied to a centered
random variable. The textbook cites this step by reference ("by the Dominated Convergence
Theorem"). The interchange is fully proved using integral_tsum.
Sub-Gaussian even moment bound (Rigollet, line 722, via Lemma 1.4). For X ~ subG(σ²): E[X^{2k}] ≤ 2 · (2σ²)^k · k!. Derived from the sub-Gaussian tail bound and the layer cake formula. Referenced in the textbook proof of Lemma 1.12.
Integrability of exp(|s*(X²-c)|) when X is sub-Gaussian and |s| is small.
Similar to exp_sq_centered_integrable, this is a technical condition for DCT.
Since |s*(X²-c)| ≤ |s|*X² + |s|*c, the integrability reduces to that of
exp(|s|*X²), which follows from the same moment bound argument.
Cited by reference in the textbook (Rigollet, Lemma 1.12).
Integrability of exp(s*(X²-c)) when X is sub-Gaussian and |s| is small.
This follows from exp_abs_sq_centered_integrable since
exp(x) ≤ exp(|x|) for all x ∈ ℝ.
DCT MGF expansion (Rigollet, line 714).
For X ~ subG(σ²) and Z = X² - E[X²], the MGF can be bounded by its Taylor series with absolute values on the moments:
E[exp(sZ)] ≤ 1 + Σ_{k≥2} |s|^k · E[|Z|^k] / k!
The proof uses:
- Taylor expansion: exp(x) = 1 + x + Σ_{k≥2} x^k/k!
- E[Z] = 0, so the k=1 term vanishes
- Dominated convergence theorem to interchange E and Σ
- |E[Z^k]| ≤ E[|Z|^k] (Jensen on absolute value)
Reference: Rigollet, High-Dimensional Statistics, Lemma 1.12, line 714. The dominated convergence interchange is cited by reference.
Jensen/centering bound (Rigollet, lines 718–720). For Z = X² - E[X²]: E[|Z|^k] ≤ 2^k · E[X^{2k}]. Uses |a-b|^k ≤ 2^{k-1}(|a|^k + |b|^k) and Jensen's inequality (E[X²])^k ≤ E[X^{2k}]. Referenced but not explicitly proved in the textbook.
Per-term MGF bound (Rigollet, lines 718–722).
For X ~ subG(σ²) and Z = X² - E[X²], for each k ≥ 2: E[|Z|^k] ≤ (2k)! · σ^{2k} · 2^k
More precisely, the terms in the series satisfy: |s|^k · E[|Z|^k] / k! ≤ (8|s|σ²)^k
The proof combines:
- Jensen/centering (line 718–720): |a-b|^k ≤ 2^{k-1}(|a|^k + |b|^k) and (E[X²])^k ≤ E[X^{2k}] (Jensen for convex x^k, k ≥ 2) giving E[|Z|^k] ≤ 2^k · E[X^{2k}]
- Lemma 1.4 (line 722): E[X^{2k}] ≤ 2(2σ²)^k · k! → E[|Z|^k] ≤ 2^{k+1} · (2σ²)^k · k! → |s|^k · E[|Z|^k] / k! ≤ 2 · (4|s|σ²)^k ≤ (8|s|σ²)^k
Reference: Rigollet, High-Dimensional Statistics, Lemma 1.12, lines 718–722. The centering bound and Lemma 1.4 moment bound are now fully proved.
Key theorem and main result #
Analytical MGF bound from Lemma 1.12 proof.
For X ~ subG(σ²) and Z = X² - E[X²], for |s| ≤ 1/(16σ²): E[exp(sZ)] ≤ 1 + 128s²σ⁴
This combines:
dct_mgf_expansion: E[exp(sZ)] ≤ 1 + Σ_{k≥2} |s|^k E[|Z|^k]/k! (DCT interchange)per_term_mgf_bound: |s|^k E[|Z|^k]/k! ≤ (8|s|σ²)^k (Jensen + Lemma 1.4)geometric_series_bound: Σ_{k≥2} (8|s|σ²)^k ≤ 128s²σ⁴ (geometric series)
Reference: Rigollet, High-Dimensional Statistics, Lemma 1.12, lines 714–726.
Lemma 1.12 (Square of sub-Gaussian is sub-exponential).
Let X subG(σ²). Then the random variable Z = X² - E[X²] is sub-exponential
with parameter λ = 16σ², i.e., Z subE(16σ²).
The proof establishes:
- Positivity: 16σ² > 0 (from σ² > 0)
- Integrability: Z = X² - E[X²] is integrable (derived from sub-Gaussian via the bound x² ≤ exp(x) + exp(-x))
- Mean zero: E[Z] = E[X² - E[X²]] = 0 (by linearity of expectation)
- MGF bound: E[exp(sZ)] ≤ exp(s²(16σ²)²/2) for |s| ≤ 1/(16σ²)
This follows from the analytical bound E[exp(sZ)] ≤ 1 + 128s²σ⁴
(from
analytical_mgf_bound_for_centered_square) combined with the standard inequality 1 + x ≤ exp(x).