Riesz–Fischer theorem

From HandWiki

In mathematics, the Riesz–Fischer theorem in real analysis is any of a number of closely related results concerning the properties of the space L2 of square integrable functions. The theorem was proven independently in 1907 by Frigyes Riesz and Ernst Sigismund Fischer.

For many authors, the Riesz–Fischer theorem refers to the fact that the Lp spaces [math]\displaystyle{ L^p }[/math] from Lebesgue integration theory are complete.

Modern forms of the theorem

The most common form of the theorem states that a measurable function on [math]\displaystyle{ [-\pi, \pi] }[/math] is square integrable if and only if the corresponding Fourier series converges in the Lp space [math]\displaystyle{ L^2. }[/math] This means that if the Nth partial sum of the Fourier series corresponding to a square-integrable function f is given by [math]\displaystyle{ S_N f(x) = \sum_{n=-N}^{N} F_n \, \mathrm{e}^{inx}, }[/math] where [math]\displaystyle{ F_n, }[/math] the nth Fourier coefficient, is given by [math]\displaystyle{ F_n =\frac{1}{2\pi}\int_{-\pi}^\pi f(x)\, \mathrm{e}^{-inx}\, \mathrm{d}x, }[/math] then [math]\displaystyle{ \lim_{N \to \infty} \left\Vert S_N f - f \right\|_2 = 0, }[/math] where [math]\displaystyle{ \|\,\cdot\,\|_2 }[/math] is the [math]\displaystyle{ L^2 }[/math]-norm.

Conversely, if [math]\displaystyle{ \{a_n\} \, }[/math] is a two-sided sequence of complex numbers (that is, its indices range from negative infinity to positive infinity) such that [math]\displaystyle{ \sum_{n=-\infty}^\infty \left|a_n\right\vert^2 \lt \infty, }[/math] then there exists a function f such that f is square-integrable and the values [math]\displaystyle{ a_n }[/math] are the Fourier coefficients of f.

This form of the Riesz–Fischer theorem is a stronger form of Bessel's inequality, and can be used to prove Parseval's identity for Fourier series.

Other results are often called the Riesz–Fischer theorem (Dunford Schwartz). Among them is the theorem that, if A is an orthonormal set in a Hilbert space H, and [math]\displaystyle{ x \in H, }[/math] then [math]\displaystyle{ \langle x, y\rangle = 0 }[/math] for all but countably many [math]\displaystyle{ y \in A, }[/math] and [math]\displaystyle{ \sum_{y\in A} |\langle x,y\rangle|^2 \le \|x\|^2. }[/math] Furthermore, if A is an orthonormal basis for H and x an arbitrary vector, the series [math]\displaystyle{ \sum_{y\in A} \langle x,y\rangle \, y }[/math] converges commutatively (or unconditionally) to x. This is equivalent to saying that for every [math]\displaystyle{ \varepsilon \gt 0, }[/math] there exists a finite set [math]\displaystyle{ B_0 }[/math] in A such that [math]\displaystyle{ \|x - \sum_{y\in B} \langle x,y\rangle y \| \lt \varepsilon }[/math] for every finite set B containing B0. Moreover, the following conditions on the set A are equivalent:

  • the set A is an orthonormal basis of H
  • for every vector [math]\displaystyle{ x \in H, }[/math]
[math]\displaystyle{ \|x\|^2 = \sum_{y\in A} |\langle x,y\rangle|^2. }[/math]

Another result, which also sometimes bears the name of Riesz and Fischer, is the theorem that [math]\displaystyle{ L^2 }[/math] (or more generally [math]\displaystyle{ L^p, 0 \lt p \leq \infty }[/math]) is complete.

Example

The Riesz–Fischer theorem also applies in a more general setting. Let R be an inner product space consisting of functions (for example, measurable functions on the line, analytic functions in the unit disc; in old literature, sometimes called Euclidean Space), and let [math]\displaystyle{ \{\varphi_n\} }[/math] be an orthonormal system in R (e.g. Fourier basis, Hermite or Laguerre polynomials, etc. – see orthogonal polynomials), not necessarily complete (in an inner product space, an orthonormal set is complete if no nonzero vector is orthogonal to every vector in the set). The theorem asserts that if the normed space R is complete (thus R is a Hilbert space), then any sequence [math]\displaystyle{ \{c_n\} }[/math] that has finite [math]\displaystyle{ \ell^2 }[/math] norm defines a function f in the space R.

The function f is defined by [math]\displaystyle{ f = \lim_{n \to \infty} \sum_{k=0}^n c_k \varphi_k, }[/math] limit in R-norm.

Combined with the Bessel's inequality, we know the converse as well: if f is a function in R, then the Fourier coefficients [math]\displaystyle{ (f,\varphi_n) }[/math] have finite [math]\displaystyle{ \ell^2 }[/math] norm.

History: the Note of Riesz and the Note of Fischer (1907)

In his Note, (Riesz 1907) states the following result (translated here to modern language at one point: the notation [math]\displaystyle{ L^2([a, b]) }[/math] was not used in 1907).

Let [math]\displaystyle{ \left\{\varphi_n\right\} }[/math]be an orthonormal system in [math]\displaystyle{ L^2([a, b]) }[/math] and [math]\displaystyle{ \left\{a_n\right\} }[/math] a sequence of reals. The convergence of the series [math]\displaystyle{ \sum a_n^2 }[/math] is a necessary and sufficient condition for the existence of a function f such that [math]\displaystyle{ \int_a^b f(x) \varphi_n(x) \, \mathrm{d}x = a_n \quad \text{ for every } n. }[/math]

Today, this result of Riesz is a special case of basic facts about series of orthogonal vectors in Hilbert spaces.

Riesz's Note appeared in March. In May, (Fischer 1907) states explicitly in a theorem (almost with modern words) that a Cauchy sequence in [math]\displaystyle{ L^2([a, b]) }[/math] converges in [math]\displaystyle{ L^2 }[/math]-norm to some function [math]\displaystyle{ f \in L^2([a, b]). }[/math] In this Note, Cauchy sequences are called "sequences converging in the mean" and [math]\displaystyle{ L^2([a, b]) }[/math] is denoted by [math]\displaystyle{ \Omega. }[/math] Also, convergence to a limit in [math]\displaystyle{ L^2 }[/math]–norm is called "convergence in the mean towards a function". Here is the statement, translated from French:

Theorem. If a sequence of functions belonging to [math]\displaystyle{ \Omega }[/math] converges in the mean, there exists in [math]\displaystyle{ \Omega }[/math] a function f towards which the sequence converges in the mean.

Fischer goes on proving the preceding result of Riesz, as a consequence of the orthogonality of the system, and of the completeness of [math]\displaystyle{ L^2. }[/math]

Fischer's proof of completeness is somewhat indirect. It uses the fact that the indefinite integrals of the functions gn in the given Cauchy sequence, namely [math]\displaystyle{ G_n(x) = \int_a^x g_n(t) \, \mathrm{d}t, }[/math] converge uniformly on [math]\displaystyle{ [a, b] }[/math] to some function G, continuous with bounded variation. The existence of the limit [math]\displaystyle{ g \in L^2 }[/math] for the Cauchy sequence is obtained by applying to G differentiation theorems from Lebesgue's theory.
Riesz uses a similar reasoning in his Note, but makes no explicit mention to the completeness of [math]\displaystyle{ L^2, }[/math] although his result may be interpreted this way. He says that integrating term by term a trigonometric series with given square summable coefficients, he gets a series converging uniformly to a continuous function F  with bounded variation. The derivative f  of F, defined almost everywhere, is square summable and has for Fourier coefficients the given coefficients.

Completeness of Lp,  0 < p ≤ ∞

For some authors, notably Royden,[1] the Riesz-Fischer Theorem is the result that [math]\displaystyle{ L^p }[/math] is complete: that every Cauchy sequence of functions in [math]\displaystyle{ L^p }[/math] converges to a function in [math]\displaystyle{ L^p, }[/math] under the metric induced by the p-norm. The proof below is based on the convergence theorems for the Lebesgue integral; the result can also be obtained for [math]\displaystyle{ p \in [1,\infty] }[/math] by showing that every Cauchy sequence has a rapidly converging Cauchy sub-sequence, that every Cauchy sequence with a convergent sub-sequence converges, and that every rapidly Cauchy sequence in [math]\displaystyle{ L^p }[/math] converges in [math]\displaystyle{ L^p. }[/math]

When [math]\displaystyle{ 1 \leq p \leq \infty, }[/math] the Minkowski inequality implies that the Lp space [math]\displaystyle{ L^p }[/math] is a normed space. In order to prove that [math]\displaystyle{ L^p }[/math] is complete, i.e. that [math]\displaystyle{ L^p }[/math] is a Banach space, it is enough (see e.g. Banach space) to prove that every series [math]\displaystyle{ \sum u_n }[/math] of functions in [math]\displaystyle{ L^p(\mu) }[/math] such that [math]\displaystyle{ \sum \|u_n\|_p \lt \infty }[/math] converges in the [math]\displaystyle{ L^p }[/math]-norm to some function [math]\displaystyle{ f \in L^p(\mu). }[/math] For [math]\displaystyle{ p \lt \infty, }[/math] the Minkowski inequality and the monotone convergence theorem imply that [math]\displaystyle{ \int \left(\sum_{n=0}^\infty |u_n|\right)^p \, \mathrm{d}\mu \le \left(\sum_{n=0}^{\infty} \|u_n\|_p\right)^p\lt \infty, \ \ \text{ hence } \ \ f = \sum_{n=0}^\infty u_n }[/math] is defined [math]\displaystyle{ \mu }[/math]–almost everywhere and [math]\displaystyle{ f \in L^p(\mu). }[/math] The dominated convergence theorem is then used to prove that the partial sums of the series converge to f in the [math]\displaystyle{ L^p }[/math]-norm, [math]\displaystyle{ \int \left|f - \sum_{k=0}^{n} u_k\right|^p \, \mathrm{d}\mu \le \int \left( \sum_{\ell \gt n} |u_\ell| \right)^p \, \mathrm{d}\mu \rightarrow 0 \text{ as } n \rightarrow \infty. }[/math]

The case [math]\displaystyle{ 0 \lt p \lt 1 }[/math] requires some modifications, because the p-norm is no longer subadditive. One starts with the stronger assumption that [math]\displaystyle{ \sum \|u_n\|_p^p \lt \infty }[/math] and uses repeatedly that [math]\displaystyle{ \left|\sum_{k=0}^n u_k \right|^p \le \sum_{k=0}^n |u_k|^p \text{ when } p \lt 1 }[/math] The case [math]\displaystyle{ p = \infty }[/math] reduces to a simple question about uniform convergence outside a [math]\displaystyle{ \mu }[/math]-negligible set.

See also

References

  1. Royden, H. L. (13 February 2017). Real analysis. Fitzpatrick, Patrick, 1946- (Fourth ed.). New York, New York. ISBN 9780134689494. OCLC 964502015. 
  • Beals, Richard (2004), Analysis: An Introduction, New York: Cambridge University Press, ISBN 0-521-60047-2 .
  • Dunford, N.; Schwartz, J.T. (1958), Linear operators, Part I, Wiley-Interscience .
  • Fischer, Ernst (1907), "Sur la convergence en moyenne", Comptes rendus de l'Académie des sciences 144: 1022–1024 .
  • Riesz, Frigyes (1907), "Sur les systèmes orthogonaux de fonctions", Comptes rendus de l'Académie des sciences 144: 615–619 .