Documentation

Atlas.HighDimensionalStatistics.code.Chapter3.Problem_3_1

theorem Chapter3.problem_3_1_ls_oracle_inequality {n M : } (hn : 0 < n) (Φ : Matrix (Fin n) (Fin M) ) (f : Fin n) (σ : ) ( : 0 < σ) (μ : MeasureTheory.Measure (Fin n)) [MeasureTheory.IsProbabilityMeasure μ] (θhat : (Fin n)Fin M) (hLS : ∀ (ε : Fin n) (θ : Fin M), (f + ε - Φ.mulVec (θhat ε)) ⬝ᵥ (f + ε - Φ.mulVec (θhat ε)) (f + ε - Φ.mulVec θ) ⬝ᵥ (f + ε - Φ.mulVec θ)) (hVar : ∀ (i : Fin n), (ε : Fin n), ε i ^ 2 μ σ ^ 2) (hMean : ∀ (i : Fin n), (ε : Fin n), ε i μ = 0) :
∃ (C : ), 0 < C (ε : Fin n), MSE (Φ.mulVec (θhat ε)) f μ (⨅ (θ : Fin M), MSE (Φ.mulVec θ) f) + C * σ ^ 2 * M / n

Problem 3.1 — Exact oracle inequality for the LS estimator. Given the regression model Y = f + ε with noise variance proxy σ², the LS estimator θ̂^LS satisfies: E[MSE(φ_{θ̂^LS})] ≤ inf_θ MSE(φ_θ) + C σ² M / n for some universal constant C > 0.

Here the LS estimator is modeled as a map θ̂ : (Fin n → ℝ) → Fin M → ℝ from noise realizations ε to parameter vectors, where θ̂(ε) minimizes |f + ε − Φθ|² over θ. The expectation is over ε ∼ μ.