Documentation

Atlas.ProbabilisticMethodsInCombinatorics.code.RandomProjection

Numerical fact $\log 2 < 1$, used as a constant bound in the recentering arguments.

For $r \le \log 2$, we have $1 \le 2 \exp(-r)$. Used to handle the trivial part of sub-Gaussian tail bounds when the threshold is small.

theorem RandomProjection.tail_recenter_toReal {Ω : Type u_1} [MeasurableSpace Ω] {μ : MeasureTheory.Measure Ω} [MeasureTheory.IsProbabilityMeasure μ] {m : } (hm : 0 < m) {Y : Ω} {med center : } (hmed : s0, (μ {ω : Ω | |Y ω - med| s}).toReal 2 * Real.exp (-(m * s ^ 2) / 4)) :
c > 0, t0, (μ {ω : Ω | |Y ω - center| t}).toReal 2 * Real.exp (-c * m * t ^ 2)

Recentering lemma in the toReal formulation: given a sub-Gaussian tail bound around the median, produce a sub-Gaussian tail bound around an arbitrary center center, with some positive constant $c$ depending on the gap and the dimension.

noncomputable def RandomProjection.normProj (m d : ) (hd : d m) (z : EuclideanSpace (Fin m)) :

Norm of the projection of $z \in \mathbb{R}^m$ onto the first $d$ coordinates, $\| (z_1, \dots, z_d) \|_2$.

Instances For
    theorem RandomProjection.normProj_lipschitz {m d : } (hd : d m) (x y : EuclideanSpace (Fin m)) :
    |normProj m d hd x - normProj m d hd y| x - y

    The projection norm is $1$-Lipschitz: $|\,\|Px\| - \|Py\|\,| \le \|x - y\|$.

    theorem RandomProjection.random_projection_concentration {m d : } (hd : d m) (hm : 0 < m) {Ω : Type u_1} [MeasurableSpace Ω] {μ : MeasureTheory.Measure Ω} [MeasureTheory.IsProbabilityMeasure μ] {Z : ΩEuclideanSpace (Fin m)} (hLevy : ∀ (f : EuclideanSpace (Fin m)), (∀ (x y : EuclideanSpace (Fin m)), |f x - f y| x - y)∃ (med : ), s0, (μ {ω : Ω | |f (Z ω) - med| s}).toReal 2 * Real.exp (-(m * s ^ 2) / 4)) :
    c > 0, t0, (μ {ω : Ω | |normProj m d hd (Z ω) - (d / m)| t}).toReal 2 * Real.exp (-c * m * t ^ 2)

    Lemma 9.4.24 (random projection / Johnson-Lindenstrauss concentration). Under a Lévy sub-Gaussian concentration assumption for $1$-Lipschitz functions of a random $Z \in \mathbb{R}^m$, the projection norm $\|P_d Z\|$ concentrates around $\sqrt{d/m}$ with sub-Gaussian tails: $\mu(|\,\|P_d Z\| - \sqrt{d/m}\,| \ge t) \le 2 \exp(-c m t^2)$ for some $c > 0$.

    Identity $\exp(2 \log 2) = 4$.

    For $e \ge 2 \log 2$, we have $\exp(e)/2 \ge \exp(e/2)$.

    If $e \ge \log 2$, then $\lfloor \exp(e) \rfloor \ge 2$.

    theorem NearlyEquidistantPoints.nearly_equidistant_points (hJL : C > 0, ∀ (ε : ), 0 < ε∀ (m N d : ) (X : Fin NEuclideanSpace (Fin m)), d > C * ε⁻¹ ^ 2 * Real.log N∃ (f : Fin NEuclideanSpace (Fin d)), ∀ (i j : Fin N), i j → (1 - ε) * dist (X i) (X j) dist (f i) (f j) dist (f i) (f j) (1 + ε) * dist (X i) (X j)) (hSimplex : ∀ (N : ), 2 N∃ (X : Fin NEuclideanSpace (Fin (N - 1))), ∀ (i j : Fin N), i jdist (X i) (X j) = 1) :
    c > 0, ∀ (ε : ), 0 < ε∀ (d : ), 0 < d∃ (N : ) (S : Fin NEuclideanSpace (Fin d)), N Real.exp (c * ε ^ 2 * d) ∀ (i j : Fin N), i j1 - ε dist (S i) (S j) dist (S i) (S j) 1 + ε

    Nearly-equidistant points in $\mathbb{R}^d$. Combining Johnson-Lindenstrauss with the standard simplex construction, one can embed exponentially many points $S_1, \dots, S_N$ in $\mathbb{R}^d$ with all pairwise distances in $[1 - \varepsilon, 1 + \varepsilon]$, where $N \ge \exp(c \varepsilon^2 d)$.