Berry–Esseen theorem

In probability theory, the central limit theorem states that, under certain circumstances, the probability distribution of the scaled mean of a random sample converges to a normal distribution as the sample size increases to infinity. Under stronger assumptions, the Berry–Esseen theorem, or Berry–Esseen inequality, gives a more quantitative result, because it also specifies the rate at which this convergence takes place by giving a bound on the maximal error of approximation between the normal distribution and the true distribution of the scaled sample mean. The approximation is measured by the Kolmogorov–Smirnov distance. In the case of independent samples, the convergence rate is $n -1/2$ , where $n$ is the sample size, and the constant is estimated in terms of the third absolute normalized moment.

Statement of the theorem

Statements of the theorem vary, as it was independently discovered by two mathematicians, Andrew C. Berry (in 1941) and Carl-Gustav Esseen (1942), who then, along with other authors, refined it repeatedly over subsequent decades.

Identically distributed summands

One version, sacrificing generality somewhat for the sake of clarity, is the following:

There exists a positive constant C such that if X₁, X₂, ..., are i.i.d. random variables with E(X₁) = 0, E(X₁²) = σ² > 0, and E(|X₁|³) = ρ < ∞,^{[note 1]} and if we define

[math]\displaystyle{ Y_n = {X_1 + X_2 + \cdots + X_n \over n} }[/math]

the sample mean, with F_n the cumulative distribution function of

[math]\displaystyle{ {Y_n \sqrt{n} \over {\sigma}}, }[/math]

and Φ the cumulative distribution function of the standard normal distribution, then for all x and n,

[math]\displaystyle{ \left|F_n(x) - \Phi(x)\right| \le {C \rho \over \sigma^3\sqrt{n}}.\ \ \ \ (1) }[/math]

Illustration of the difference in cumulative distribution functions alluded to in the theorem.

That is: given a sequence of independent and identically distributed random variables, each having mean zero and positive variance, if additionally the third absolute moment is finite, then the cumulative distribution functions of the standardized sample mean and the standard normal distribution differ (vertically, on a graph) by no more than the specified amount. Note that the approximation error for all n (and hence the limiting rate of convergence for indefinite n sufficiently large) is bounded by the order of n^−1/2.

Calculated values of the constant C have decreased markedly over the years, from the original value of 7.59 by (Esseen 1942), to 0.7882 by (van Beek 1972), then 0.7655 by (Shiganov 1986), then 0.7056 by (Shevtsova 2007), then 0.7005 by (Shevtsova 2008), then 0.5894 by (Tyurin 2009), then 0.5129 by (Korolev Shevtsova), then 0.4785 by (Tyurin 2010). The detailed review can be found in the papers (Korolev Shevtsova) and (Korolev Shevtsova). The best estimate (As of 2012), C < 0.4748, follows from the inequality

[math]\displaystyle{ \sup_{x\in\mathbb R}\left|F_n(x) - \Phi(x)\right| \le {0.33554 (\rho+0.415\sigma^3)\over \sigma^3\sqrt{n}}, }[/math]

due to (Shevtsova 2011), since σ³ ≤ ρ and 0.33554 · 1.415 < 0.4748. However, if ρ ≥ 1.286σ³, then the estimate

[math]\displaystyle{ \sup_{x\in\mathbb R}\left|F_n(x) - \Phi(x)\right| \le {0.3328 (\rho+0.429\sigma^3)\over \sigma^3\sqrt{n}}, }[/math]

which is also proved in (Shevtsova 2011), gives an even tighter upper estimate.

(Esseen 1956) proved that the constant also satisfies the lower bound

[math]\displaystyle{ C\geq\frac{\sqrt{10}+3}{6\sqrt{2\pi}} \approx 0.40973 \approx \frac{1}{\sqrt{2\pi}} + 0.01079 . }[/math]

Non-identically distributed summands

Let X₁, X₂, ..., be independent random variables with E(X_i) = 0, E(X_i²) = σ_i² > 0, and E(|X_i|³) = ρ_i < ∞. Also, let

[math]\displaystyle{ S_n = {X_1 + X_2 + \cdots + X_n \over \sqrt{\sigma_1^2+\sigma_2^2+\cdots+\sigma_n^2} } }[/math]

be the normalized n-th partial sum. Denote F_n the cdf of S_n, and Φ the cdf of the standard normal distribution. For the sake of convenience denote

[math]\displaystyle{ \vec{\sigma}=(\sigma_1,\ldots,\sigma_n),\ \vec{\rho}=(\rho_1,\ldots,\rho_n). }[/math]

In 1941, Andrew C. Berry proved that for all n there exists an absolute constant C₁ such that

[math]\displaystyle{ \sup_{x\in\mathbb R}\left|F_n(x) - \Phi(x)\right| \le C_1\cdot\psi_1,\ \ \ \ (2) }[/math]

where

[math]\displaystyle{ \psi_1=\psi_1\big(\vec{\sigma},\vec{\rho}\big)=\Big({\textstyle\sum\limits_{i=1}^n\sigma_i^2}\Big)^{-1/2}\cdot\max_{1\le i\le n}\frac{\rho_i}{\sigma_i^2}. }[/math]

Independently, in 1942, Carl-Gustav Esseen proved that for all n there exists an absolute constant C₀ such that

[math]\displaystyle{ \sup_{x\in\mathbb R}\left|F_n(x) - \Phi(x)\right| \le C_0\cdot\psi_0, \ \ \ \ (3) }[/math]

where

[math]\displaystyle{ \psi_0=\psi_0\big(\vec{\sigma},\vec{\rho}\big)=\Big({\textstyle\sum\limits_{i=1}^n\sigma_i^2}\Big)^{-3/2}\cdot\sum\limits_{i=1}^n\rho_i. }[/math]

It is easy to make sure that ψ₀≤ψ₁. Due to this circumstance inequality (3) is conventionally called the Berry–Esseen inequality, and the quantity ψ₀ is called the Lyapunov fraction of the third order. Moreover, in the case where the summands X₁, ..., X_n have identical distributions

[math]\displaystyle{ \psi_0=\psi_1=\frac{\rho_1}{\sigma_1^3\sqrt{n}}, }[/math]

and thus the bounds stated by inequalities (1), (2) and (3) coincide apart from the constant.

Regarding C₀, obviously, the lower bound established by (Esseen 1956) remains valid:

[math]\displaystyle{ C_0\geq\frac{\sqrt{10}+3}{6\sqrt{2\pi}} = 0.4097\ldots. }[/math]

The upper bounds for C₀ were subsequently lowered from the original estimate 7.59 due to (Esseen 1942) to (considering recent results only) 0.9051 due to (Zolotarev 1967), 0.7975 due to (van Beek 1972), 0.7915 due to (Shiganov 1986), 0.6379 and 0.5606 due to (Tyurin 2009) and (Tyurin 2010). (As of 2011) the best estimate is 0.5600 obtained by (Shevtsova 2010).

Multidimensional version

As with the multidimensional central limit theorem, there is a multidimensional version of the Berry–Esseen theorem.^[1]^[2]

Let [math]\displaystyle{ X_1,\dots,X_n }[/math] be independent [math]\displaystyle{ \mathbb R^d }[/math]-valued random vectors each having mean zero. Write [math]\displaystyle{ S_n = \sum_{i=1}^n X_i }[/math] and assume [math]\displaystyle{ \Sigma_n = \operatorname{Cov}[S_n] }[/math] is invertible. Let [math]\displaystyle{ Z_n\sim\operatorname{N}(0,{\Sigma_n}) }[/math] be a [math]\displaystyle{ d }[/math]-dimensional Gaussian with the same mean and covariance matrix as [math]\displaystyle{ S_n }[/math]. Then for all convex sets [math]\displaystyle{ U\subseteq\mathbb R^d }[/math],

[math]\displaystyle{ \big|\Pr[S_n\in U]-\Pr[Z_n\in U]\,\big| \le C d^{1/4} \gamma_n }[/math],

where [math]\displaystyle{ C }[/math] is a universal constant and [math]\displaystyle{ \gamma_n=\sum_{i=1}^n \operatorname{E}\big[\|\Sigma_n^{-1/2}X_i\|_2^3\big] }[/math] (the third power of the L² norm).

The dependency on [math]\displaystyle{ d^{1/4} }[/math] is conjectured to be optimal, but might not be.^[2]

Notes

↑ Since the random variables are identically distributed, X₂, X₃, ... all have the same moments as X₁.

References

↑ Bentkus, Vidmantas. "A Lyapunov-type bound in R^d." Theory of Probability & Its Applications 49.2 (2005): 311–323.
↑ ^2.0 ^2.1 Raič, Martin (2019). "A multivariate Berry--Esseen theorem with explicit constants". Bernoulli 25 (4A): 2824–2853. doi:10.3150/18-BEJ1072. ISSN 1350-7265.

Berry, Andrew C. (1941). "The Accuracy of the Gaussian Approximation to the Sum of Independent Variates". Transactions of the American Mathematical Society 49 (1): 122–136. doi:10.1090/S0002-9947-1941-0003498-3.
Durrett, Richard (1991). Probability: Theory and Examples. Pacific Grove, CA: Wadsworth & Brooks/Cole. ISBN:0-534-13206-5.
Esseen, Carl-Gustav (1942). "On the Liapunoff limit of error in the theory of probability". Arkiv för Matematik, Astronomi och Fysik A28: 1–19. ISSN 0365-4133.
Esseen, Carl-Gustav (1956). "A moment inequality with an application to the central limit theorem". Skand. Aktuarietidskr. 39: 160–170.
Feller, William (1972). An Introduction to Probability Theory and Its Applications, Volume II (2nd ed.). New York: John Wiley & Sons. ISBN:0-471-25709-5.
Korolev, V. Yu.; Shevtsova, I. G. (2010a). "On the upper bound for the absolute constant in the Berry–Esseen inequality". Theory of Probability and Its Applications 54 (4): 638–658. doi:10.1137/S0040585X97984449.
Korolev, Victor; Shevtsova, Irina (2010b). "An improvement of the Berry–Esseen inequality with applications to Poisson and mixed Poisson random sums". Scandinavian Actuarial Journal 2012 (2): 1–25. doi:10.1080/03461238.2010.485370.
Manoukian, Edward B. (1986). Modern Concepts and Theorems of Mathematical Statistics. New York: Springer-Verlag. ISBN:0-387-96186-0.
Serfling, Robert J. (1980). Approximation Theorems of Mathematical Statistics. New York: John Wiley & Sons. ISBN:0-471-02403-1.
Shevtsova, I. G. (2008). "On the absolute constant in the Berry–Esseen inequality". The Collection of Papers of Young Scientists of the Faculty of Computational Mathematics and Cybernetics (5): 101–110.
Shevtsova, Irina (2007). "Sharpening of the upper bound of the absolute constant in the Berry–Esseen inequality". Theory of Probability and Its Applications 51 (3): 549–553. doi:10.1137/S0040585X97982591.
Shevtsova, Irina (2010). "An Improvement of Convergence Rate Estimates in the Lyapunov Theorem". Doklady Mathematics 82 (3): 862–864. doi:10.1134/S1064562410060062.
Shevtsova, Irina (2011). "On the absolute constants in the Berry Esseen type inequalities for identically distributed summands". arXiv:1111.6554 [math.PR].
Shiganov, I.S. (1986). "Refinement of the upper bound of a constant in the remainder term of the central limit theorem". Journal of Soviet Mathematics 35 (3): 109–115. doi:10.1007/BF01121471.
Tyurin, I.S. (2009). "On the accuracy of the Gaussian approximation". Doklady Mathematics 80 (3): 840–843. doi:10.1134/S1064562409060155.
Tyurin, I.S. (2010). "An improvement of upper estimates of the constants in the Lyapunov theorem". Russian Mathematical Surveys 65 (3(393)): 201–202. doi:10.1070/RM2010v065n03ABEH004688.
van Beek, P. (1972). "An application of Fourier methods to the problem of sharpening the Berry–Esseen inequality". Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 23 (3): 187–196. doi:10.1007/BF00536558.
Zolotarev, V. M. (1967). "A sharpening of the inequality of Berry–Esseen". Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 8 (4): 332–342. doi:10.1007/BF00531598.

External links

Gut, Allan & Holst Lars. Carl-Gustav Esseen, retrieved Mar. 15, 2004.
Hazewinkel, Michiel, ed. (2001), "Berry–Esseen inequality", Encyclopedia of Mathematics, Springer Science+Business Media B.V. / Kluwer Academic Publishers, ISBN 978-1-55608-010-4, https://www.encyclopediaofmath.org/index.php?title=p/b015760

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Berry–Esseen theorem. Read more

[1] Since the random variables are identically distributed, X₂, X₃, ... all have the same moments as X₁.

[2] Bentkus, Vidmantas. "A Lyapunov-type bound in R^d." Theory of Probability & Its Applications 49.2 (2005): 311–323.

[:0-3] 2.0 ^2.1 Raič, Martin (2019). "A multivariate Berry--Esseen theorem with explicit constants". Bernoulli 25 (4A): 2824–2853. doi:10.3150/18-BEJ1072. ISSN 1350-7265.

[note 1]

[1]

[2]

Anonymous

Search

Berry–Esseen theorem

Namespaces

More

Page actions

Contents

Statement of the theorem

Identically distributed summands

Non-identically distributed summands

Multidimensional version

See also

Notes

References

External links

Navigation

Navigation

Help

Translate

Wiki tools

Wiki tools

Anonymous

Search

Berry–Esseen theorem

Statement of the theorem

Identically distributed summands

Non-identically distributed summands

Multidimensional version

See also

Notes

References

External links

Navigation

Wiki tools

Page tools

Other projects

Categories