Documentation

Atlas.HighDimensionalStatistics.code.Chapter2.Problem_2_3

Problem 2.3: Improved Constant for Hard Thresholding Bound #

From Rigollet Chapter 2, Problem 2.3.

In the proof of Theorem 2.11, the bound |θ̂ᴴᴿᴰ_j - θ*_j| ≤ 4·min(|θ*_j|, τ) can be improved to |θ̂ᴴᴿᴰ_j - θ*_j| ≤ 3·min(|θ*_j|, τ).

That is, on the event A = {max_j |ξ_j| ≤ τ} (where y_j = θ*_j + ξ_j), we have: |θ̂ᴴᴿᴰ_j - θ*_j| ≤ 3 · min(|θ*_j|, τ)

Key idea (tighter case analysis) #

noncomputable def Rigollet.hardThresh {d : } (τ : ) (y : Fin d) :
Fin d

Hard thresholding estimator (coordinate-wise): keeps y_j if |y_j| > 2τ, else sets to 0.

Instances For
    theorem Rigollet.problem_2_3_improved_hard_threshold_bound (d : ) (τ : ) ( : 0 < τ) (θstar ξ y : Fin d) (hy : ∀ (j : Fin d), y j = θstar j + ξ j) (hA : ∀ (j : Fin d), |ξ j| τ) (j : Fin d) :
    |hardThresh τ y j - θstar j| 3 * min |θstar j| τ

    Problem 2.3: On the event A = {∀ j, |ξ_j| ≤ τ}, the hard thresholding estimator satisfies the improved bound (with constant 3 instead of 4):

    |θ̂ᴴᴿᴰ_j - θ*_j| ≤ 3 · min(|θ*_j|, τ)

    Here y_j = θ*_j + ξ_j is the sub-Gaussian sequence model, and θ̂ᴴᴿᴰ_j = y_j · 𝟙(|y_j| > 2τ) is the hard thresholding estimator.