Inline Definitions #
A random variable X : Ω → ℝ is sub-Gaussian with variance proxy σ² w.r.t. measure μ
if its moment generating function satisfies 𝔼[exp(tX)] ≤ exp(t²σ²/2) for all t ∈ ℝ.
Instances For
A random matrix E : Ω → Matrix (Fin n) (Fin T) ℝ is sub-Gaussian with variance proxy σ²
(denoted E ~ subG_{n×T}(σ²)) if for all unit vectors u ∈ S^{n-1} and v ∈ S^{T-1},
the random variable ω ↦ uᵀ E(ω) v is sub-Gaussian with variance proxy σ².
Instances For
Problem 4.1, Part 1: Rank-based bound #
Problem 4.1, Part 1 (Rank bound). Book statement (Rigollet, Problem 4.1.1): "Using the results of Chapter 2, show that there exists an estimator Θ̂ ∈ ℝ^{d×T} such that (1/n) ‖XΘ̂ − XΘ*‖_F² ≲ σ² · rT / n with probability .99, where r denotes the rank of X."
Proof idea: Decompose Y = XΘ* + E column-wise into T independent univariate regression problems Y^(j) = Xθ*^(j) + ε^(j). Apply the Chapter 2 projection estimator to each column (giving MSE ≲ σ²r/n per column), then sum over T columns. The union bound is not needed since each column's bound holds in expectation.
The estimator takes Y as input (not ω), preventing the trivial witness Θ̂ = Θ*.
Problem 4.1, Part 2: Sparsity-based bound #
Problem 4.1, Part 2 (Sparsity bound). Book statement (Rigollet, Problem 4.1.2): "There exists an estimator Θ̂ ∈ ℝ^{d×T} such that (1/n) ‖XΘ̂ − XΘ*‖_F² ≲ σ² · |Θ*|₀ · log(ed) / n with probability .99."
Proof idea: Decompose column-wise into T independent regression problems. Apply the Chapter 2 LASSO or best-subset estimator to each column j, giving MSE ≲ σ² · |θ*^(j)|₀ · log(ed) / n. Sum over j = 1,…,T to get total MSE ≲ σ² · |Θ*|₀ · log(ed) / n, since Σⱼ |θ*^(j)|₀ = |Θ*|₀.
The estimator takes Y as input (not ω), preventing the trivial witness Θ̂ = Θ*.