Chernoff Analytic Helpers for Lemma 5.14 #
Key combinatorial and analytic inequalities used in the proof of the Sparse Varshamov-Gilbert bound (Lemma 5.14) from Chapter 5.4 of Rigollet's "High Dimensional Statistics."
Main results #
Combinatorial (binomial coefficient bounds) #
binom_ratio_ineq: For j < k ≤ n: (n-k+1)·(j+1) ≤ (n-j)·kchoose_step_bound: For j < k ≤ n: (n-k+1)·C(n,j) ≤ k·C(n,j+1)choose_iterated_bound: (n-k+1)^m · C(n,t) ≤ k^m · C(n,t+m)choose_ratio_bound: (n-k+1)^{k-2t} · C(n,t) ≤ k^{k-2t} · C(n,k-t) for 2t ≤ kvandermonde_decomposition: C(d,k) = Σ C(k,i)·C(d-k,j) over antidiagonal
Analytic (power and logarithmic bounds) #
power_ineq_for_chernoff: For R ≥ 4: (2(R-1))⁴ ≥ 1+Rchernoff_power_bound: For d ≥ 8k: ((d-2k)/k)⁴ ≥ 1+d/(2k)log_ratio_bound: (k/2)·log((d-2k+1)/k) ≥ (k/8)·log(1+d/(2k))exp_le_ratio_pow: exp(k/8·log(1+d/(2k))) ≤ ((d-2k+1)/k)^{k/2}
Overview of the proof strategy #
The textbook proof of Lemma 5.14 bounds the size of Hamming balls in the space of k-sparse binary vectors. The key inequality is:
|ball(x)| / |C₀(k)| = P(ρ(ω, x₀) < k/2) ≤ 2^k · (1+d/(2k))^{-k/2}
This file provides the combinatorial underpinning: the ratio C(n,k-t)/C(n,t) (where n = d-k) is bounded below by ((d-2k+1)/k)^{k-2t}. This ratio arises from the consecutive binomial coefficient ratio (n-j)/(j+1) ≥ (n-k+1)/k for j < k ≤ n. Combined with the power inequality (2(R-1))⁴ ≥ 1+R for R ≥ 4, we obtain the exponential bound needed for the Chernoff counting bound.
Fundamental arithmetic inequality for binomial coefficient ratios. For j < k ≤ n: (n-k+1)·(j+1) ≤ (n-j)·k.
In ℤ, this is equivalent to (n+1)·(k-j-1) ≥ 0, which is always true. This formalizes the fact that each consecutive ratio C(n,j+1)/C(n,j) = (n-j)/(j+1) is at least (n-k+1)/k.
For j < k ≤ n: (n-k+1) · C(n, j) ≤ k · C(n, j+1).
This formalizes the bound C(n,j+1)/C(n,j) = (n-j)/(j+1) ≥ (n-k+1)/k,
using Nat.choose_succ_right_eq which gives C(n,j+1)·(j+1) = C(n,j)·(n-j).
Iterated binomial coefficient bound: (n-k+1)^m · C(n,t) ≤ k^m · C(n, t+m).
Equivalently: C(n, t+m) / C(n, t) ≥ ((n-k+1)/k)^m, using the product telescope
of consecutive ratios. Each step uses choose_step_bound.
Choose ratio bound: (n-k+1)^{k-2t} · C(n,t) ≤ k^{k-2t} · C(n, k-t) for 2t ≤ k ≤ n.
This is the key combinatorial inequality for the Chernoff counting bound. It bounds the ratio C(n,k-t)/C(n,t) ≥ ((n-k+1)/k)^{k-2t}.
In the Lemma 5.14 context with n = d-k and d ≥ 8k: C(d-k, k-t) / C(d-k, t) ≥ ((d-2k+1)/k)^{k-2t} ≥ 6^{k-2t} (since (d-2k+1)/k ≥ (6k+1)/k > 6).
Vandermonde convolution specialized to our setting: C(d, k) = Σ_{(i,j) ∈ antidiagonal k} C(k, i) · C(d-k, j).
This decomposition counts k-element subsets of {1,...,d} by partitioning them according to how many elements fall in the first k positions vs the remaining d-k positions. It is the key identity connecting the total count C(d,k) = |C₀(k)| to the Hamming ball size Σ_{t<k/2} C(k,t)·C(d-k,t).
For R ≥ 4: (2(R-1))⁴ ≥ 1 + R.
This is the key algebraic inequality connecting binomial coefficient ratios to the Chernoff bound exponent. When R = d/(2k) with d ≥ 8k, this gives the fourth-power bound needed to convert the log ratio bound (k/2)·log(base) into (k/8)·log(1+R).
For d ≥ 8k, k ≥ 1: (k/2) · log((d-2k+1)/k) ≥ (k/8) · log(1 + d/(2k)).
This is the key inequality connecting binomial coefficient ratios to the exponential bound in the Chernoff method of Lemma 5.14.
Proof strategy: Setting R = d/(2k) ≥ 4:
- (d-2k+1)/k ≥ 2(R-1) > 0
- (2(R-1))⁴ ≥ 1+R (by
power_ineq_for_chernoff) - Taking logs: 4·log(2(R-1)) ≥ log(1+R)
- Multiplying by k/8: (k/2)·log(2(R-1)) ≥ (k/8)·log(1+R)
- Since log((d-2k+1)/k) ≥ log(2(R-1)), the result follows.
Main exponential bound for the Chernoff counting bound.
For d ≥ 8k, k ≥ 1: exp(k/8 · log(1 + d/(2k))) ≤ ((d-2k+1)/k)^{k/2}
This connects the Chernoff bound exponent to the binomial coefficient ratio. The left side is the packing number lower bound (1+d/(2k))^{k/8}, and the right side bounds the product of consecutive binomial coefficient ratios ∏_{j=0}^{k/2-1} (n-j)/(j+1) ≥ ((n-k+1)/k)^{k/2} where n = d-k.
Combined with choose_iterated_bound, this gives:
C(d-k, k) ≥ ((d-2k+1)/k)^k · C(d-k, 0) ≥ (1+d/(2k))^{k/8}.
For t < n/2: C(n, t) ≤ C(n, t+1). This is Nat.choose_le_succ_of_lt_half_left
from Mathlib, stated here for reference.