Documentation

Atlas.HighDimensionalStatistics.code.Chapter4.Problem_4_1

Inline Definitions #

def Chapter4.Problem41.frobeniusNormSq {m : Type u_1} {p : Type u_2} [Fintype m] [Fintype p] (A : Matrix m p ) :

Squared Frobenius norm: ‖M‖_F² = Σᵢⱼ Mᵢⱼ².

Instances For
    def Chapter4.Problem41.matrixL0Norm {m : Type u_1} {p : Type u_2} [Fintype m] [Fintype p] [DecidableEq ] (M : Matrix m p ) :

    Number of nonzero entries in a matrix (the ℓ₀ "norm").

    Instances For
      def Chapter4.Problem41.IsSubGaussian {Ω : Type u_1} [MeasurableSpace Ω] (X : Ω) (σsq : ) (μ : MeasureTheory.Measure Ω) :

      A random variable X : Ω → ℝ is sub-Gaussian with variance proxy σ² w.r.t. measure μ if its moment generating function satisfies 𝔼[exp(tX)] ≤ exp(t²σ²/2) for all t ∈ ℝ.

      Instances For
        def Chapter4.Problem41.IsSubGaussianMatrix {Ω : Type u_1} [MeasurableSpace Ω] {n T : } (E : ΩMatrix (Fin n) (Fin T) ) (σsq : ) (μ : MeasureTheory.Measure Ω) :

        A random matrix E : Ω → Matrix (Fin n) (Fin T) ℝ is sub-Gaussian with variance proxy σ² (denoted E ~ subG_{n×T}(σ²)) if for all unit vectors u ∈ S^{n-1} and v ∈ S^{T-1}, the random variable ω ↦ uᵀ E(ω) v is sub-Gaussian with variance proxy σ².

        Instances For

          Problem 4.1, Part 1: Rank-based bound #

          theorem Chapter4.Problem41.problem_4_1_part1 {Ω : Type u_1} [MeasurableSpace Ω] (μ : MeasureTheory.Measure Ω) [MeasureTheory.IsProbabilityMeasure μ] {n d T : } (hn : 0 < n) (X : Matrix (Fin n) (Fin d) ) (Θstar : Matrix (Fin d) (Fin T) ) (E Y : ΩMatrix (Fin n) (Fin T) ) (hY : ∀ (ω : Ω), Y ω = X * Θstar + E ω) (σ : ) ( : 0 < σ) (hE : IsSubGaussianMatrix E (σ ^ 2) μ) :
          ∃ (Θhat : Matrix (Fin n) (Fin T) Matrix (Fin d) (Fin T) ) (C : ), 0 < C μ {ω : Ω | frobeniusNormSq (X * Θhat (Y ω) - X * Θstar) / n C * σ ^ 2 * X.rank * T / n} ENNReal.ofReal 0.99

          Problem 4.1, Part 1 (Rank bound). Book statement (Rigollet, Problem 4.1.1): "Using the results of Chapter 2, show that there exists an estimator Θ̂ ∈ ℝ^{d×T} such that (1/n) ‖XΘ̂ − XΘ*‖_F² ≲ σ² · rT / n with probability .99, where r denotes the rank of X."

          Proof idea: Decompose Y = XΘ* + E column-wise into T independent univariate regression problems Y^(j) = Xθ*^(j) + ε^(j). Apply the Chapter 2 projection estimator to each column (giving MSE ≲ σ²r/n per column), then sum over T columns. The union bound is not needed since each column's bound holds in expectation.

          The estimator takes Y as input (not ω), preventing the trivial witness Θ̂ = Θ*.

          Problem 4.1, Part 2: Sparsity-based bound #

          theorem Chapter4.Problem41.problem_4_1_part2 {Ω : Type u_1} [MeasurableSpace Ω] (μ : MeasureTheory.Measure Ω) [MeasureTheory.IsProbabilityMeasure μ] {n d T : } (hn : 0 < n) (hd : 0 < d) (X : Matrix (Fin n) (Fin d) ) (Θstar : Matrix (Fin d) (Fin T) ) (E Y : ΩMatrix (Fin n) (Fin T) ) (hY : ∀ (ω : Ω), Y ω = X * Θstar + E ω) (σ : ) ( : 0 < σ) (hE : IsSubGaussianMatrix E (σ ^ 2) μ) :
          ∃ (Θhat : Matrix (Fin n) (Fin T) Matrix (Fin d) (Fin T) ) (C : ), 0 < C μ {ω : Ω | frobeniusNormSq (X * Θhat (Y ω) - X * Θstar) / n C * σ ^ 2 * (matrixL0Norm Θstar) * Real.log (Real.exp 1 * d) / n} ENNReal.ofReal 0.99

          Problem 4.1, Part 2 (Sparsity bound). Book statement (Rigollet, Problem 4.1.2): "There exists an estimator Θ̂ ∈ ℝ^{d×T} such that (1/n) ‖XΘ̂ − XΘ*‖_F² ≲ σ² · |Θ*|₀ · log(ed) / n with probability .99."

          Proof idea: Decompose column-wise into T independent regression problems. Apply the Chapter 2 LASSO or best-subset estimator to each column j, giving MSE ≲ σ² · |θ*^(j)|₀ · log(ed) / n. Sum over j = 1,…,T to get total MSE ≲ σ² · |Θ*|₀ · log(ed) / n, since Σⱼ |θ*^(j)|₀ = |Θ*|₀.

          The estimator takes Y as input (not ω), preventing the trivial witness Θ̂ = Θ*.