Documentation

Atlas.HighDimensionalStatistics.code.Chapter4.Problem_4_2

Local Definitions #

def Chapter4.Problem42.frobSq {m p : } (A : Matrix (Fin m) (Fin p) ) :

Squared Frobenius norm: ‖M‖_F² = Σᵢⱼ Mᵢⱼ².

Instances For
    noncomputable def Chapter4.Problem42.opNorm {m p : } (A : Matrix (Fin m) (Fin p) ) :

    Operator norm of a matrix (abstract placeholder). In the book this is the largest singular value ‖A‖_op = σ₁(A).

    Instances For

      Problem 4.2, Part 1: SVD thresholding bound #

      theorem Chapter4.Problem42.problem_4_2_part1 {Ω : Type u_1} [MeasurableSpace Ω] {μ : MeasureTheory.Measure Ω} [MeasureTheory.IsProbabilityMeasure μ] {n d T : } (hn : 0 < n) (X : Matrix (Fin n) (Fin d) ) (Θstar : Matrix (Fin d) (Fin T) ) (σ : ) ( : 0 < σ) (E Y : ΩMatrix (Fin n) (Fin T) ) (hY : ∀ (ω : Ω), Y ω = X * Θstar + E ω) (Mhat : ΩMatrix (Fin n) (Fin T) ) (τ₀ : ) (hτ₀_pos : 0 < τ₀) (hτ₀_bound : τ₀ ^ 2 σ ^ 2 * (max d T)) (hLemma42 : μ {ω : Ω | opNorm (E ω) τ₀} ENNReal.ofReal 0.99) (C₀ : ) (hC₀ : 0 < C₀) (hThm43 : ∀ (ω : Ω), opNorm (E ω) τ₀frobSq (Mhat τ₀ ω - X * Θstar) C₀ * Θstar.rank * τ₀ ^ 2) (hMeas : MeasurableSet {ω : Ω | opNorm (E ω) τ₀}) :
      ∃ (C : ) (τ : ), 0 < C 0 < τ μ {ω : Ω | 1 / n * frobSq (Mhat τ ω - X * Θstar) C * σ ^ 2 * Θstar.rank * (max d T) / n} ENNReal.ofReal 0.99

      Problem 4.2, Part 1 — SVD thresholding bound with probability ≥ 0.99.

      The proof combines Lemma 4.2 (operator norm bound on noise) with Theorem 4.3 (deterministic SVT approximation bound). Given:

      • τ₀ is a threshold with τ₀² ≤ σ²(d ∨ T)
      • Lemma 4.2: μ {ω | ‖E(ω)‖_op ≤ τ₀} ≥ 0.99
      • Theorem 4.3 (deterministic): when ‖E(ω)‖_op ≤ τ₀, ‖M̂(τ₀,ω) − XΘ*‖_F² ≤ C₀ · rank(Θ*) · τ₀²

      We combine these to show the probabilistic MSE bound.

      Problem 4.2, Part 2: Existence of projection P with PM̂ = XΘ̂ #

      theorem Chapter4.Problem42.problem_4_2_part2 {Ω : Type u_1} [MeasurableSpace Ω] {μ : MeasureTheory.Measure Ω} [MeasureTheory.IsProbabilityMeasure μ] {n d T : } (hn : 0 < n) (X : Matrix (Fin n) (Fin d) ) (Θstar : Matrix (Fin d) (Fin T) ) (σ : ) ( : 0 < σ) (E Y : ΩMatrix (Fin n) (Fin T) ) (hY : ∀ (ω : Ω), Y ω = X * Θstar + E ω) (Mhat : ΩMatrix (Fin n) (Fin T) ) (P : Matrix (Fin n) (Fin n) ) (hP_proj : P * P = P) (hP_col : ∀ (Θ : Matrix (Fin d) (Fin T) ), P * (X * Θ) = X * Θ) (hP_contract : ∀ (A : Matrix (Fin n) (Fin T) ), frobSq (P * A - X * Θstar) frobSq (A - X * Θstar)) (ThetaHat : ΩMatrix (Fin d) (Fin T) ) (hPM : ∀ (τ : ) (ω : Ω), P * Mhat τ ω = X * ThetaHat τ ω) (τ₀ : ) (hτ₀_pos : 0 < τ₀) (hτ₀_bound : τ₀ ^ 2 σ ^ 2 * (max d T)) (hLemma42 : μ {ω : Ω | opNorm (E ω) τ₀} ENNReal.ofReal 0.99) (C₀ : ) (hC₀ : 0 < C₀) (hThm43 : ∀ (ω : Ω), opNorm (E ω) τ₀frobSq (Mhat τ₀ ω - X * Θstar) C₀ * Θstar.rank * τ₀ ^ 2) (hMeas : MeasurableSet {ω : Ω | opNorm (E ω) τ₀}) :
      ∃ (Q : Matrix (Fin n) (Fin n) ) (Th : ΩMatrix (Fin d) (Fin T) ), (∀ (τ : ) (ω : Ω), Q * Mhat τ ω = X * Th τ ω) ∃ (C : ) (τ : ), 0 < C 0 < τ μ {ω : Ω | 1 / n * frobSq (X * Th τ ω - X * Θstar) C * σ ^ 2 * Θstar.rank * (max d T) / n} ENNReal.ofReal 0.99

      Problem 4.2, Part 2 — There exists an n×n projection matrix P onto col(X) such that PM̂ = XΘ̂ for some Θ̂, and the same MSE rate holds.

      The key ideas:

      • P is the orthogonal projector onto the column space of X (so P² = P, P(XΘ) = XΘ)
      • P is a contraction in Frobenius norm: ‖PA − PB‖_F ≤ ‖A − B‖_F
      • PM̂ lies in col(X), so PM̂ = XΘ̂ for some Θ̂
      • The MSE of XΘ̂ = PM̂ is no worse than the MSE of M̂ by the contraction property