Return to computing page for the first course APMA0330
Return to computing page for the second course APMA0340
Return to computing page for the fourth course APMA0360
Return to Mathematica tutorial for the first course APMA0330
Return to Mathematica tutorial for the second course APMA0340
Return to Mathematica tutorial for the fourth course APMA0360
Return to the main page for the course APMA0330
Return to the main page for the course APMA0340
Return to the main page for the course APMA0360
Introduction to Linear Algebra with Mathematica

Glossary

Preface

Orthogonalization is critical in elimination of redundancy between variables, ensures independent contributions in models, and improves numerical stability in computations. It simplifies complex systems, such as in Machine Learning and Linear Algebra, by creating independent components (bases) that make calculations faster, more stable, and easier to interpret

Orthogonal Systems

Two functions f(x) and g(x) from Hilbert space 𝔏² are called orthogonal if their inner product—typically defined as the definite integral of their product over a specific interval —equals zero $ \displaystyle \quad \left( \left\langle f, g \right\rangle = \int_a^b f(x)^{\ast} g(x)\,w(x)\,{\text d} x = 0 \right) . \quad $ Here 𝑎 and b are some real numbers, possibly infinite, w(x) is a weight function, and $ \displaystyle \quad f(x)^{\ast} = \overline{f(x)} \quad $ is complex conjugate function to f(x).

Let X be a real or complex-vector space. A norm of a vector f, denoted as ∥f∥ is a function ∥·∥ : X → [0, ∞) that assigns a non-negative real number to every element f ∈ X, representing its length, magnitude, or size. It acts as a generalization of distance, satisfying properties of non-negativity, absolute scalability (∥k f∥ = |k| ∥f∥ for any scalar k), and the triangle inequality (∥f + g∥ ≤ ∥f∥ + ∥g∥).

A normalized function is a function whose norm (typically the 𝔏² norm) is exactly 1.

In this section, we mostly consider real Hilbert space 𝔏², equipped with a weighted Euclidean norm:

\[ \| f \| = + \left( \int_a^b |f(x)|^2 w(x)\,{\text d} x \right)^{1/2} = + \left( \langle f \mid f \rangle \right)^{1/2} , \]

where w(x) is a weight and "plus" indicated that the positive branch is taken for square root.

A set {ω₁(x), ω₂(x), …} of linearly independent functions from 𝔏² is called orthonormal system if every vector has a unit length (norm of 1) and all distinct vectors are mutually orthogonal (inner product is 0), \[ \left\langle \omega_n , \omega_k \right\rangle = \begin{cases} 1, & \quad \mbox{if}\quad n=k , \\ 0, & \quad \mbox{otherwise}. \end{cases} \]

The following set of trigonometric functions constitutes a typical orthonormal system:

\[ \omega_n (x) = \sqrt{\frac{2}{\pi}}\,\sin (nx) , \qquad -0 \le x \le \pi , \quad \forall n \in \mathbb{Z}. \]

Let f ∈ 𝔏² and { ω₁, ω₂, … , ωₙ …} be an ordered system of orthonormal vectors in the Hilbert space. Numbers 𝑎ₙ = ⟨f, ωₙ⟩ are called the Fourier coefficients of function f with respect to the orthonormal system { ωₙ }, and series $ \displaystyle \quad \sum_n a_n \omega_n (x) \quad $ is referred to as the Fourier series of function f. Its finite partial sum is \[ S_N (f; x) = \sum_{n=1}^N a_n \omega_n (x) . \]

Theorem 1 (Bessel inequality): If function f ∈ 𝔏² and the ordered set {ω₁(x), ω₂(x), …} forms an orthonormal system, then the Bessel inequality holds: \[ \sum_{n=1}^{\infty} \left\vert a_n \right\vert^2 = \sum_{n=1}^{\infty} \left\vert \left\langle f, \omega_n \right\rangle \right\vert^2 \leqslant \| f \|^2 . \]

Let us determine the norm of the difference \begin{align*} 0 &\le \| f(x) - S_n (f; x) \|^2 = \left\langle f(x) - S_n (f; x) , f(x) - S_n (f; x) \right\rangle \\ &= \left\langle f, f \right\rangle - \left\langle f, S_n \right\rangle - \left\langle S_n , f \right\rangle + \left\langle S_n , S_n \right\rangle \\ &= \| f \|^2 - \left\langle f, \sum_{k=1}^n a_k \omega_k \right\rangle - \left\langle \sum_{k=1}^n a_k \omega_k , f \right\rangle + \left\langle \sum_{k=1}^n a_k \omega_k , \sum_{k=1}^n a_k \omega_k \right\rangle \\ &= \| f \|^2 - \sum_{k=1}^n a_k \left\langle f, \omega_k \right\rangle - \sum_{k=1}^n a_k^{\ast} \left\langle\omega_k , f\right\rangle + \sum_{k,i=1}^n a_i a_k^{\ast} \left\langle \omega_i , \omega_k \right\rangle \\ &= \| f \|^2 - \sum_{k=1}^n \left\vert a_k \right\vert^2 . \end{align*} Therefore, we get the inequality \[ \sum_{k=1}^n \left\vert a_k \right\vert^2 \leqslant \| f \|^2 \] that is valid for any positive integer n. So application of limit when n → ∞ gives the Bessel inequality.

Example 1: Consider the Hilbert space 𝔏²([-π , π]) with the standard inner product \[ \langle f , g \rangle = \int_{-\pi}^{\pi} f(x)\,g(x)\,{\text d}x \] because it is a real vector space. Take the orthonormal system \[ \omega_1 (x) =\frac{1}{\sqrt{2\pi}},\qquad \omega_2 (x)= \frac{\cos x}{\sqrt{\pi}} ,\qquad \omega_3 (x) = \frac{\sin x}{\sqrt{\pi}} . \] These are the first three functions of the Fourier orthonormal basis. Let the function be \[ f(x)=x. \] Step 1. Compute the Fourier coefficients 𝑎ₙ = ⟨ f , ωₙ ⟩. Coefficient for ω₁(x) = 1/√2π, we have \[ a_1 =\left< x , \frac{1}{\sqrt{2\pi}}\right> =\frac{1}{\sqrt{2\pi}}\int_{-\pi}^{\pi} x\, {\text d}x = 0 \] because the integrand is odd.

Coefficient for ω₂(x) = cosx/√π is \[ a_2 = \frac{1}{\sqrt{\pi}} \int _{-\pi}^{\pi} x\,\cos x\, {\text d}x = 0 \] again odd integrand.

Coefficient for ω₃(x) = sinx/√π becomes \[ a_3 = \frac{1}{\sqrt{\pi}} \int_{-\pi}^{\pi} x\,\sin x\, {\text d}x. \] Here the integrand is even, so \[ a_3 =\frac{2}{\sqrt{\pi}} \int_0^{\pi} x\,\sin x\, {\text d}x. \] Integrate by parts: \[ \int_0^{\pi} x\,\sin x\, {\text d}x = \left[ -x\cos x\right]_0^{\pi} + \int_0^{\pi} \cos x\, {\text d}x = \pi +0 = \pi . \] Thus, \[ a_3 =\frac{2}{\sqrt{\pi}}\cdot \pi = 2\,\sqrt{\pi} . \]

Integrate[x*Sin[x], {x, -Pi, Pi}]/Pi

2 \[Pi]

So the first three coefficients are: \[ a_1 =0,\qquad a_2 =0,\qquad a_3 =2\sqrt{\pi} . \]

Apply Bessel’s inequality \[ \sum _{n=1}^{\infty }|a_n|^2\leq \| f\| ^2. \] Let’s compute both sides. The left-hand side (partial sum) is \[ |a_1|^2+|a_2|^2+|a_3|^2 =0+0+4\pi = 4\pi \approx 12.5664 . \] the right-hand side: the norm of f(x) = x \[ \| f\|^2 = \int_{-\pi}^{\pi} x^2\, {\text d}x = 2\int_0^{\pi} x^2\, {\text d}x = \frac{2\,\pi ^3}{3} \approx 20.6709 . \] Compare numerical values, we conclude that \[ 4\pi \leq \frac{2\pi^3}{3}, \] which confirms Bessel’s inequality.

What this example shows:

Even if we take only one nonzero coefficient (𝑎₃ = 2√π), the sum of squares \[ |a_3|^2=4\pi \] is already bounded above by the total energy ∥ f ∥² ≈ 20.6709. Adding more orthonormal functions can only increase the left-hand side, but it can never exceed ∥ f ∥². This illustrates the geometric meaning: the squared lengths of the projections of f onto any orthonormal system cannot exceed the squared length of f itself. ■

End of Example 1

The Bessel inequality tells us that the series

\[ \sum_{k\ge 1} \left\vert a_k \right\vert^2 \]

converges. Therefore, the general term of this series tends to zero,

\[ \lim_{k\to\infty} a_k = \lim_{k\to\infty} \left\langle f, \omega_k \right\vert = 0 . \]

If the equality

\[ \sum_{k\ge 1} \left\vert a_k \right\vert^2 = \sum_{k\ge 1} \left\vert \left\langle f, \omega_k \right\rangle \right\vert^2 = \| f \|^2 = \int_a^b |f(x)|^2 {\text d}x \]

holds, it is called Parseval's identity.

Note that Parseval's identity is equivalent to

\[ f(x) = \underset{n\to\infty}{\mbox{l.i.m.}} \,S_n (f;x) \quad \iff \quad \lim_{n\to\infty} \left\| f(x) - S_n (f;x) \right\| = 0 , \]

where "l.i.m." denotes the limit in 𝔏². Indeed, if Parseval's identity is valid, then

\[ \left\| f(x) - S_n (f;x) \right\|^2 = \| f \|^2 - \sum_{k=1}^n \left\vert a_k \right\vert^2 . \]

This expression tends to zero as n → ∞.

Example 2: We start with the classical Fourier series \[ f(x) \,\sim\, \frac{a_0}{2} + \sum_{n\ge 0} \left( a_n \cos \left( \frac{n\pi x}{\ell} \right) + b_n \sin \left( \frac{n\pi x}{\ell} \right) \right) , \] where \[ \begin{split} a_n &= \frac{1}{\ell} \int_{-\ell}^{+\ell} f(x)\,\cos \left( \frac{n\pi x}{\ell} \right) {\text d}x , \qquad n=0,1,2, \ldots , \\ b_n &= \frac{1}{\ell} \int_{-\ell}^{+\ell} f(x)\,\sin \left( \frac{n\pi x}{\ell} \right) {\text d}x , \qquad n=1,2,\ldots . \end{split} \] Corresponding Parseval's identity is \[ \frac{1}{\ell} \.\| f \|^2 = \frac{1}{\ell} \int_{-\ell}^{+\ell} |f(x) |^2 {\text d} x = \frac{1}{2}\, a_0^2 + \sum_{n\ge 1} \left( a_n^2 + b_n^2 \right) . \] In complex form, the Fourier expansion reads \[ f(t) \,\sim\, \mbox{V.P.} \sum_{n=-\infty}^{\infty} c_n e^{{\bf j}2\pi nt/T} , \quad c_n = \frac{1}{T} \int_0^T f(t)\,e^{-{\bf j} 2\pi nt/T} {\text d}t . \] where "V.P." stands for the Cauchy Principal value.

Parseval's identity for the complex Fourier series states that the average power of a periodic signal is equal to the sum of the squared magnitudes of its complex Fourier coefficients. It equates the time-domain energy over one period to the frequency-domain representation: \[ \frac{1}{T} \int_0^T |f(t)|^2 {\text d}t = \mbox{V.P.} \sum_{n=-\infty}^{\infty} \left\vert c_n \right\vert^2 , \quad c_n = \frac{1}{T} \int_0^T f(t)\,e^{-{\bf j} 2\pi nt/T} {\text d}t . \]

Example: We consider the Heaviside function: \[ H(t) = \begin{cases} 1, &\quad\mbox{for } t > 0, \\ \frac{1}{2} , &\quad\mbox{for } t =0 , \\ 0, &\quad\mbox{for } t < 0. \end{cases} \] Using Mathematica we evaluate the Fourier coefficients

Integrate[HeavisideTheta[x]*Cos[n*Pi*x/2], {x, -2, 2}]

(2 Sin[n \[Pi]])/(n \[Pi])

Integrate[Sin[n*Pi*x/2], {x, 0, 2}]

(2 - 2 Cos[n \[Pi]])/(n \[Pi])

\[ b_n = \frac{1}{2} \int_{-2}^2 H(t)\,\sin\left( \frac{n\pi x}{2} \right) {\text d}x = \begin{cases} \frac{2}{n\pi}, &\quad \mbox{if } n = 2k+1 , \\ 0, &\quad \mbox{if } n = 2k . \end{cases} \] Therefore, the Fourier series for the Heaviside function on interval [−2,2] becomes \[ H(t) = \frac{1}{2} + \frac{2}{\pi} \sum_{k\ge 1} \frac{1}{(2k-1)}\,\sin \left( (2k-1)\,\frac{\pi t}{2} \right) . \] We plot 10-term approximation:

S10[t_] = 1 /2 + (2/Pi)*Sum[Sin[(2*k - 1)*Pi*t/2]/(2*k - 1), {k, 1, 10}]; Plot[{HeavisideTheta[t], S10[t]}, {t, -2, 2}, PlotStyle -> {{Thick, Red}, {Thick, Blue}}]

Figure 2.1: Fourier approximations of the Heaviside function with n = 10 terms (blue).

Parseval's identity reads \[ \frac{1}{2} \,\| H(t) \|^2 = \frac{1}{2} \,\int_0^2 {\text d}t = 1 = \frac{1}{2} + \frac{4}{\pi^2} \sum_{k\ge 1} \frac{1}{(2k-1)^2} . \] Mathematica confirms:

1/2 + 4*Sum[1/(2*k - 1)^2 , {k, 1, Infinity}]/Pi^2

Legendre polynomials can be defined by Rodrigues' formula: \[ P_n (x) = \frac{1}{2^n n!}\,\frac{{\text d}^n}{{\text d} x^n} \left( x^2 -1 \right)^n = \frac{1}{2^n} \sum_{k=0}^{\lfloor n/2 \rfloor} (-1)^k \binom{n}{k} \binom{2n-2k}{n} x^{n-2k} , \] or via recurrence \[ \begin{split} \left( n+1 \right) P_{n+1} (x) &= \left( 2n+1 \right) x\,P_n (x) -n\,P_{n-1} (x) , \qquad n=1,2,\ldots , \\ P_0 (x) &= 1, \qquad P_1 (x) = x , \quad P_2 (x) = \frac{1}{2} \left( 3 x^2 -1 \right) . \end{split} \] A function f from Hilbert space 𝔏²([−1,1]) can be expanded into Fourier--Legendre series with respect to Legendre polynomials \[ f \,\sim\, \sum_{n\ge 0} a_n P_n (x) , \qquad a_n = \left( n + \frac{1}{2} \right) \cdot c_n , \quad c_n = \int_{-1}^1 f(x)\,P_n (x)\,{\text d} x . \tag{L.1} \] Parseval's identity: \[ \| f(x) \|^2 = \int_{-1}^{+1} | f(x) |^2 {\text d} x = \sum_{n\ge 0} \left( n + \frac{1}{2} \right)^{-1} a_n^2 = \sum_{n\ge 0} \left( n + \frac{1}{2} \right) c_n^2 . \tag{L.2} \]

Example: We apply Legendre expansion (L.1) to f(x) = xⁿ. Because monomial xⁿ and Legendre's polynomial Pₖ(x) have definite parity, only terms with the same parity as n, the Legendre coefficients appear also in the same parity. \[ x^n = \sum_{k=0}^n a_k P_k (x) , \qquad a_k = \left( k + \frac{1}{2} \right) c_k, \quad c_k = \int_{-1}^1 x^n P_k (x)\,{\text d}x , \] where \[ a_k = \frac{(2k+1)\, n!}{(n-k)!!\, (n+k+1)!!},\qquad n-k\ \mathrm{even}. \]

Here is Mathematica's code for evaluation of coefficients in the Legendre expansion:

LegendreExpandMonomial[n_] := Module[{k, a}, Table[If[EvenQ[n - k], a = (2 k + 1)* n!/((n - k)!! (n + k + 1)!!); a Subscript[P, k], 0], {k, 0, n}] // Total]

We set n = 7 and obtain

LegendreExpandMonomial[7]

\[ x^7 = \frac{16}{429}\, P_7 (x) + \frac{8}{39}\, P_5 (x) + \frac{14}{33}\, P_3 (x) + \frac{1}{3}\, P_1 (x) . \] Mathematica confirms:

Simplify[ 16*LegendreP[7, x]/429 + 8*LegendreP[5, x]/39 + 14*LegendreP[3, x]/33 + LegendreP[1, x]/3]

x^7

Parseval's identity (L.2) reads \[ \| x^7 \|^2 = \int_{-1}^1 x^{14} \,{\text d} x = \frac{2}{15} = \frac{2}{15} \cdot \left( \frac{16}{429} \right)^2 + \frac{2}{11}\cdot \left( \frac{8}{39} \right)^2 + \frac{2}{7} \cdot \left( \frac{14}{33} \right)^2 + \frac{2}{3} \cdot \left( \frac{1}{3} \right)^2 . \] Mathematica confirms:

2/15*(16/429)^2 + 2/11 *(8/39)^2 + 2/7 * (14/33)^2 + 2/3*(1/3)^2

2/15

Another example: \[ x^3 =\frac{2}{5}P_3(x)+\frac{3}{5}P_1(x). \]

Chebyshev expansions:

The Chebyshev expansion of a function f ∈ 𝔏²([−1, 1], w dx) is its series representation of the form (that converges in the norm of the Hilbert space) \[ f(x)\sim \sum _{n=0}^{\infty }a_n\, X_n(x), \] where Xₙ is one of Chebyshev polynomials: Tₙ, Uₙ, Vₙ, or Wₙ. These polynomials were first presented by the Russian scientist Pafnuty Chebyshev in a paper read before the St. Petersburg Academy in 1853.

Parseval's identity corresponding to Chebyshev expansions always has the form \[ \| f(x) \|^2 = \int _{-1}^1|f(x)|^2 w(x)\, {\text d}x = \sum _{n=0}^{\infty} a_n^2\, \| X_n\| ^2, \] where ∥ Xₙ ∥² is the squared norm of the polynomial under its weight.

Chebyshev polynomials of the first kind are defined on interval [−1,1] by the formula \[ T_n (\cos\theta ) = \cos \left( n\,\theta \right) \] or by recurrence \[ \begin{split} T_{n+1} (x) &= 2x\,T_n (x) - T_{n-1} (x) , \qquad n=1,2,\ldots , \\ T_0 (x) &= 1, \qquad T_1 (x) = x , \quad T_2 (x) = 2 x^2 -1 . \end{split} \] or via the Jacobi polynomials \[ T_n (x) = \frac{P_n^{(-1/2, -1/2)} (x)}{P_n^{(-1/2, -1/2)} (1)} = \frac{2^{2n} \left( n! \right)^2}{(2n)!}\, P_n^{(-1/2, -1/2)} (x) . \]

Expand[JacobiP[3, -1/2, -1/2, x]*2^6 *6^2 /6!]

-3 x + 4 x^3

Chebyshev polynomials of the first kind form an orthogonal basis in the Hilbert space ℌ = 𝔏²_w ≡ 𝔏²([−1,1], w dx) with weight \[ w_T (x) =\frac{1}{\sqrt{1-x^2}} , \quad w(\cos\theta ) = \frac{1}{\sin\theta} . \] The inner product of two functions, \[ \langle f \mid g \rangle = \int_{-1}^1 f(x)\,g(x)\,w(x)\,{\text d} x \] and corresponding norms of Chebyshev polynomials are \[ \| T_0\|^2 = \int_{-1}^1 \frac{{\text d}x}{\sqrt{1-x^2}} = \pi ,\qquad \| T_n\|^2 = \frac{\pi}{2}\quad (n\geq 1). \]

Integrate[1/Sqrt[1 - x^2], {x, -1, 1}]

\[Pi]

Integrate[(ChebyshevT[3, x])^2 /Sqrt[1 - x^2], {x, -1, 1}]

\[Pi]/2

Parseval's identity corresponding to Chebyshev expansion \[ f(x) = \frac{1}{2}\, a_0 + \sum_{n\ge 1} a_n T_n (x) \tag{T.1} \] is \[ \frac{2}{\pi} \,\| f \|^2 = \frac{2}{\pi} \,\int_{-1}^1 f^2 (x)\,\frac{{\text d}x}{\sqrt{1 - x^2}} = \frac{1}{2}\, a_0^2 + \sum_{n\ge 1} a_n^2 , \qquad a_n = \int_{-1}^1 f(x)\,T_n (x)\,\frac{{\text d}x}{\sqrt{1 - x^2}} . \tag{T.2} \]

Example: We want to expand the monomial xⁿ into the Fourier--Chebyshev series with respect to the Chebyshev polynomials of the first kind: \[ x^n = \frac{1}{2}\, c_{n,0} + \sum _{k=1}^n c_{n,k}\, T_k (x), \tag{T.3} \] where \[ c_{n,k} = \frac{1}{2^{n-1}} \binom{n}{(n - k)/2} \] Because upon substitution x = cosθ into manomial $ \displaystyle \ x^n =\cos^n \theta \ $ has definite parity, only terms with the same parity as n appear in expansion (T.3). Since Chebyshev expansions for monomials xⁿ with odd n involve T₀ ≡ 1 whose norm is different from norms of other Chebyshev polynomials, the corresponding Parseval's identity reflects this situation similarly to the classical trigonometric Fourier series.

ChebyshevTExpandMonomial[n_] := Module[{k, c}, Sum[ If[EvenQ[n - k], c = If[k == 0, If[EvenQ[n], Binomial[n, n/2]/2^n, 0], Binomial[n, (n - k)/2]/2^(n - 1) ]; c Subscript[T, k], 0 ], {k, 0, n} ] // Total ] ChebyshevTCoefficients[n_] := Table[ If[EvenQ[n - k], If[k == 0, If[EvenQ[n], Binomial[n, n/2]/2^n, 0], Binomial[n, (n - k)/2]/2^(n - 1) ], 0 ], {k, 0, n} ]

We set n = 7 and obtain \[ x^7 = \frac{1}{64} \left[ 35\, T_1 (x) + 21\, T_3 (x) + 7\,T_5 (x) + T_7 (x) \right] . \]

ChebyshevTExpandMonomial[7]

Total[(35 Subscript[T, 1])/64 + (21 Subscript[T, 3])/64 + ( 7 Subscript[T, 5])/64 + Subscript[T, 7]/64]

We check this expansion with Mathematica:

Simplify[(35*x + 21*ChebyshevT[3, x] + 7*ChebyshevT[5, x] + ChebyshevT[7, x])/64]

x^7

Parseval's identity for this expansion becomes \[ \frac{2}{\pi} \,\| x^7 \|^2 = \frac{2}{\pi} \,\int_{-1}^1 \frac{x^{14}}{\sqrt{1- x^2}}\,{\text d}x = \frac{429}{1024} = \frac{1}{2^{12}} \left[ 35^2 + 21^2 + 7^2 +1 \right] \] Mathematica confirms:

(35^2 + 21^2 + 7^2 + 1)/2^(12)

429/1024

Since Fourier--Chebyshev expansions for even power monomials involve T₀ = 1, we present a corresponding example: \[ x^6 = \frac{1}{32} \left[ 10\,T_0 (x) + 15\, T_2 (x) + 6\, T_4 (x) + T_6 (x) \right] . \]

ChebyshevTExpandMonomial[6]

Total[(5 Subscript[T, 0])/16 + (15 Subscript[T, 2])/32 + ( 3 Subscript[T, 4])/16 + Subscript[T, 6]/32]

Parseval's identity for this expansion becomes \[ \frac{2}{\pi} \,\| x^6 \|^2 = \frac{2}{\pi} \,\int_{-1}^1 \frac{x^{12}}{\sqrt{1- x^2}}\,{\text d}x = \frac{231}{512} = \frac{1}{2^{10}} \left[ \frac{1}{2}\,20^2 + 15^2 + 6^2 +1 \right] \] Mathematica confirms:

(20^2 /2 + 15^2 + 6^2 + 1)/2^(10)

231/512

Chebyshev polynomials of the second kind are defined on interval {−1,1} by the formula \[ U_n (\cos\theta ) = \frac{\sin \left( n+1 \right) \theta}{\sin\theta} \] or by recurrence \[ \begin{split} U_{n+1} (x) &= 2x\,U_n (x) - U_{n-1} (x) , \qquad n=1,2,\ldots , \\ U_0 (x) &= 1, \qquad U_1 (x) = 2x , \quad U_2 (x) = 4 x^2 -1 . \end{split} \] or via Jacobi polynomials \[ U_n (x) = \left( n+1 \right) \frac{P_n^{(1/2, 1/2)} (x)}{P_n^{(1/2, 1/2)} (1)} = \left( n+1 \right) \frac{n!\,\Gamma \left( \frac{3}{2} \right)}{\Gamma \left( n + \frac{3}{2} \right)}\, P_n^{(1/2, 1/2)} (x) = \left( n+1 \right) \frac{n! 2^n}{\left( 2n+1 \right) !!}\, P_n^{(1/2, 1/2)} (x) . \] Any function f ∈ 𝔏²([−1, 1], w dx) can be expanded into convergent Chebyshev series \[ f \,\sim\, \sum_{k\ge 0} a_{k} U_k (x) , \tag{U.1} \] where 𝑎ₖ = ⟨ f∣Uₖ ⟩, and the inner product in Hilbert space 𝔏³_w is \[ \langle f \mid g \rangle = \int_{-1}^1 f(x)\,g(x)\,w(x)\,{\text d}x , \quad w_U (x) =\sqrt{1-x^2}. \] The norms of Chebyshev polynomials of the second kind are all the same for any index: \[ \| U_n\| ^2 = \int_{-1}^1 U_n^2 (x)\,\sqrt{1-x^2}\,{\text d}x = \frac{\pi }{2}. \] Parseval identity: \[ \| f \|^2 = \int_{-1}^1 f^2 (x)\,U_n (x) \,\sqrt{1-x^2}\,{\text d}x = = \frac{\pi}{2} \sum_{k\ge 0} a_{k}^2 . \tag{U.3} \]

Example: We are looking for expansions of monomials xⁿ with respect to Chebyshev polynomials of the second kind \[ x^n = \sum_{k=0}^n a_{n,k} U_k (x) , \] where \[ a_{n,k} = \frac{1}{2^n} \left[ \binom{n}{n - k)/2} - \binom{n}{(n - k - 2)/2} \right] . \tag{U.2} \] Mathematica helps to determine these coefficients:

ChebyshevUExpandMonomial[n_] := Module[{k, c}, Sum[If[EvenQ[n - k], c = 2^-n (Binomial[n, (n - k)/2] - Binomial[n, (n - k - 2)/2]); c Subscript[U, k], 0], {k, 0, n}]]; ChebyshevUCoefficients[n_] := Table[If[EvenQ[n - k], 2^-n (Binomial[n, (n - k)/2] - Binomial[n, (n - k - 2)/2]), 0], {k, 0, n}]

We set n = 7 and obtain \[ x^7 = \frac{1}{128} \left[ 14\,U_1 (x) + 14\, U_3 (x) + 6\, U_5 (x) + U_7 (x) \right] . \]

ChebyshevUExpandMonomial[7]

(7 Subscript[U, 1])/64 + (7 Subscript[U, 3])/64 + ( 3 Subscript[U, 5])/64 + Subscript[U, 7]/128

We check with Mathematica:

Simplify[(14*ChebyshevU[1, x] + 14*ChebyshevU[3, x] + 6*ChebyshevU[5, x] + ChebyshevU[7, x])/128]

x^7

Parseval's identity reads \[ \frac{2}{\pi}\,\| x^7 \|^2 = \frac{2}{\pi}\,\int_{-1}^1 x^{14} \sqrt{1 - x^2}\,{\text d}x = \frac{429}{16384} = \frac{1}{128^2} \left[ 14^2 + 14^2 + 6^2 + 1 \right] . \]

(2*14^2 + 6^2 + 1)/128^2

429/128^2

We calculate also the norm squared:

2*Integrate[x^(14)*Sqrt[1 - x^2], {x, -1, 1}]/Pi

429/16384

2*Integrate[(Cos[t])^(14) *(Sin[t])^2, {t, 0, Pi}]/Pi

429/16384

Chebyshev polynomials of the third kind are defined by the formula \[ V_n (\cos\theta ) = \frac{\cos \left( n + \frac{1}{2} \right)\theta}{\cos\left( \frac{\theta}{2} \right)} \] or by recurrence \[ \begin{split} V_{n+1}(x) &= 2x \, V_n(x) - V_{n-1}(x) , \qquad n = 1,2,\ldots , \\ V_0 (x) &= 1 , \quad V_1 (x) = 2x - 1, \quad V_2 (x) = 4 x^2 - 2x -1 . \end{split} \] or through Chebyshev polynomials of the first and second kinds \[ V_n (x) = T_n (x) + \left( x-1 \right) U_{n-1}(x) , \qquad n=1,2,\ldots , \] or via Jacobi polynomials \[ V_n (x) = \frac{P_n^{(-1/2, 1/2)}(x)}{P_n^{(-1/2, 1/2)}(1)} = \frac{4^n \left( n! \right)^2}{(2n)!}\,P_n^{(-1/2, 1/2)}(x) . \] Let ℌ = 𝔏²_w ≡ 𝔏²([−1,1], w dx) be the (real) Hilbert space, equipped with the inner product \[ \langle f \mid g \rangle = \int_{-1}^1 f(x)\,g(x)\,w(x)\,{\text d} x , \quad w_V(x) = \frac{\sqrt{1+x}}{\sqrt{1-x}} . . \] Chebyshev polynomials of the third kind Vₙ form an orthogonal system in the Hilbert space ℌ = 𝔏²([−1, 1], w dx) with weight \[ w(x) = (1 - x)^{-1/2} \cdot (1 + x)^{1/2} = \sqrt{\frac{1+x}{1-x}} . \] Any function f from ℌ can be extended into convergent (in the norm of the Hilbert space) series over Chebyshev polynomials of the third kind: \[ f (x) = \sum_{n\ge 0} b_n V_n (x) , \] where \[ b_n = \frac{\langle f \mid V_n \rangle}{\langle V_n \mid V_n \rangle} = \frac{1}{\pi}\,\langle f \mid V_n \rangle = \frac{1}{\pi}\,\int_{-1}^1 f(x)\,V_n (x)\,\sqrt{\frac{1+x}{1-x}}\,{\text d} x \] because norms of all Chebyshev polynomials of the third kind are the same, \[ \| V_n\|^2 = \int_{-1}^1 V_n^2 (x)\,\sqrt{\frac{1+x}{1-x}}\,{\text d} x = \pi , \quad n=0,1,2,\ldots . \] Then Parseval identity will be as follows: \[ \frac{1}{\pi}\, \| f \|^2 = \frac{1}{\pi}\, \int_{-1}^1 f^2 (x)\,\sqrt{\frac{1+x}{1-x}}\,{\text d}x = \sum_{n\ge 0} b_n^2 . \]

Example: We consider expansion of monomials xⁿ into series with respect to Chebyshev polynomials of the third kind \[ x^n = \sum_{k=0}^n b_{n,k} V_n (x) , \] where \begin{align*} \pi\,b_{n,k} &= \langle x^n \mid V_k \rangle = \langle x^n \mid T_k \rangle + \langle x^n \mid \left( x-1 \right) U_{k-1} \rangle \\ &= \langle x^n \mid T_k \rangle + \langle x^{n+1} \mid U_{k-1} \rangle - \langle x^n \mid U_{k-1} \rangle \\ &= \frac{1}{2^{n-1}} \binom{n}{(n-k)/2} + \frac{1}{2^{n+1}} \left[ \binom{n+1}{(n-k)/2} - \binom{n+1}{(n-k-2)/2} \right] \\ &\quad - \frac{1}{2^n} \left[ \binom{n}{(n-k-1)/2} - \binom{n}{(n-k-3)/2} \right] \end{align*} We can let Mathematica do these integrals symbolically.

ChebyshevV[n_, x_] = Expand[ChebyshevT[n, x] + (x - 1)*ChebyshevU[n - 1, x]]; ChebyshevVExpandMonomial[n_] := Module[{k, w, num, den, c}, w[x_] := (1 - x)^(-1/2) (1 + x)^(1/2); Sum[num = Integrate[x^n ChebyshevV[k, x] w[x], {x, -1, 1}]; den = Integrate[ChebyshevV[k, x]^2 w[x], {x, -1, 1}]; c = Simplify[num/den]; c Subscript[V, k], {k, 0, n}] // Total // Expand]

We set n = 7 and obtain

ChebyshevVExpandMonomial[7]

Total[(35 Subscript[V, 0])/128 + (35 Subscript[V, 1])/128 + ( 21 Subscript[V, 2])/128 + (21 Subscript[V, 3])/128 + ( 7 Subscript[V, 4])/128 + (7 Subscript[V, 5])/128 + Subscript[V, 6]/ 128 + Subscript[V, 7]/128]

\[ x^7 = \frac{1}{128} \left[ 35 + 35\,V_1 + 21\, V_2 + 21\, V_3 + 7\,V_4 + 7\,V_5 + V_6 + V_7 \right] . \] We check this expansion with Mathematica:

Expand[(35*2*x + 21*ChebyshevV[2, x] + 21*ChebyshevV[3, x] + 7*ChebyshevV[4, x] + 7*ChebyshevV[5, x] + ChebyshevV[6, x] + ChebyshevV[7, x])/128]

x^7

Finally, we verify Parseval's identity: \[ \frac{1}{\pi} \,\| x^7 \|^2 = \frac{1}{\pi} \,\int_{-1}^1 x^{14} \,\sqrt{\frac{1+x}{1-x}}\,{\text d}x = \frac{429}{2048} = \frac{1}{2^{13}} \left[ 35^2 + 21^2 + 7^2 +1 \right] . \] Mathematica confirms:

(35^2 + 21^2 + 7^2 + 1)/2^(13)

429/2048

Chebyshev polynomials of the fourth kind are defined by the formula \[ W_n (\cos\theta ) = \frac{\sin \left( \frac{2n+1}{2}\,\theta \right)}{\sin \left( \frac{\theta}{2} \right)} = \sum_{k = \lceil n/2 \rceil}^n \binom{k}{n-k} 2^{2k-n-1} (-1)^{n-k} x^{2k-n-1} \left( 2x +2 - \frac{n}{k} \right) , \quad n\ge 2, , \qquad x= \cos\theta , \qquad n=0,1,2,\ldots . \] or by recurrence \[ \begin{split} W_{n+1} (x) &= 2x\, W_n (x) - W_{n-1} (x) , \\ W_0 (x) &= 1, \quad W_1 (x) = 2x+1 , \quad W_2 (x) = 4x^2 + 2x -1 . \end{split} \] or through Chebyshev polynomials of the second kind \[ W_n (x) = U_n (x) + U_{n-1} (x) , \qquad n=1,2,\ldots . \]

ChebyshevW[n_, x_] = Expand[ChebyshevU[n, x] + ChebyshevU[n - 1, x]];

or via Jacobi's polynomials \[ W_n (x) = \left( 2n+1 \right) \frac{P_n^{(1/2, -1/2)} (x)}{P_n^{(1/2, -1/2)} (1)} = \left( 2n+1 \right) \frac{n!}{(3/2)_n}\,P_n^{(1/2, -1/2)} (x) = \left( 2n+1 \right) \frac{4^n \left( n! \right)^2}{(2n+1)!} \, P_n^{(1/2, -1/2)} (x) . \] They can be determined by Rodrigues' formula \[ W_n (x) = \frac{(-1)^n}{ 2^n n!} \,\sqrt{\frac{1+x}{1-x}} \,\frac{{\text d}^n}{{\text d}x^n} \left[ \left( 1 - x^2 \right)^{n} \sqrt{\frac{1-x}{1+x}} \right] , \qquad n=0,1,2,\ldots . \tag{W.5} \] The leading coefficient of polynomial Wₙ(x) is 2ⁿ.

Chebyshev polynomials of the fourth kind form an orthogonal system in Hilbert space ℌ = 𝔏²_w ≡ 𝔏²([−1,1], w dx), with weight \[ w_W (x) =\frac{\sqrt{1-x}}{\sqrt{1+x}} . \] These polynomials all have the same norm: \[ \| W_n\|^2 = \int_{-1}^1 W_n^2 (x) \,\sqrt{\frac{1-x}{1+x}} \,{\text d}x = \pi , \qquad n=0,1,2,\ldots . \] Then any function f from ℌ can be expanded into convergent in the norm series \[ f(x) = \sum_{n\ge 0} b_n W_n (x) , \qquad b_n = \frac{1}{\pi}\,\int_{-1}^1 f(x)\,W_n (x)\,\sqrt{\frac{1-x}{1+x}} \,{\text d}x . \tag{W.1} \] Parseval identity: \[ \frac{1}{\pi}\,\| f \|^2 = \frac{1}{\pi}\,\int_{-1}^1 f^2 (x)\,\sqrt{\frac{1-x}{1+x}}\,{\text d}x = \sum_{n\ge 0} b_n^2 . \tag{W.2} \]

Example: We are looking for expansions of monomials xⁿ with respect to Chebyshev polynomials of the fourth kind \[ x^n = \sum_{k=0}^n b_{n,k} W_k (x) , \] We can let Mathematica do these integrals symbolically.

ChebyshevWExpandMonomial[n_] := Module[{k, w, num, den, c}, w[x_] := (1 - x)^(1/2) (1 + x)^(-1/2); Sum[ num = Integrate[x^n ChebyshevW[k, x] w[x], {x, -1, 1}]; den = Integrate[ChebyshevW[k, x]^2 w[x], {x, -1, 1}]; c = Simplify[num/den]; c Subscript[W, k], {k, 0, n} ] // Total // Expand ]

We set n = 7 and obtain

ChebyshevWExpandMonomial[7]

Total[-((35 Subscript[W, 0])/128) + (35 Subscript[W, 1])/128 - ( 21 Subscript[W, 2])/128 + (21 Subscript[W, 3])/128 - ( 7 Subscript[W, 4])/128 + (7 Subscript[W, 5])/128 - Subscript[W, 6]/ 128 + Subscript[W, 7]/128]

\[ x^7 = \frac{1}{2^7} \left[ -35 + 35\, W_1 - 21\,W_2 + 21\, W_3 - 7\, W_4 + 7\,W_5 - W_6 + W_7 \right] . \] We verify this expansion with Mathematica:

Expand[(35*(2*x) - 21*ChebyshevW[2, x] + 21*ChebyshevW[3, x] - 7*ChebyshevW[4, x] + 7*ChebyshevW[5, x] - ChebyshevW[6, x] + ChebyshevW[7, x] )/2^7]

x^7

Parseval's identity reads

Integrate[x^(14) *Sqrt[1 - x]/Sqrt[1 + x], {x, -1, 1}]/Pi

429/2048

\[ \frac{1}{\pi}\,\| x^7 \|^2 = \frac{429}{2048} = \frac{1}{2^{13}} \left[ 35^2 + 21^2 + 7^2 + 1 \right] \]

Laguerre polynomials were invented and studied by Pafnuty Chebyshev in 1859. Therefore, these polynomials were known in nineteen century as Chebyshev--Laguerre polynomials. There is no evidence that Edmond Laguerre (1834–1886) used them.

Laguerre polynomials can be defined by the Rodrigues formula, \[ L_n (x) = \frac{e^x}{n!}\,\frac{{\text d}^n}{{\text d} x^n} \left( e^{-x} x^n \right) = \sum_{k=0}^n \binom{n}{k} \frac{(-1)^k}{k!}\, x^k , \] or recurrence relation \[ \begin{split} \left( n+1 \right) L_{n+1} (x) &= \left( 2n+1 -x \right) L_n (x) -n\, L_{n-1} (x) , \qquad n=1,2,\ldots , \\ L_0 (x) &= 1, \quad L_1 (x) = 1- x , \quad L_2 (x) = \frac{1}{2} \left( x^2 -4x +2 \right) . . \end{split} \] The Laguerre polynomials are orthogonal on [0, ∞) with weight w: \[ \int _0^{\infty } L_m(x)\,L_n(x)\,e^{-x}\, {\text d}x =\delta _{mn}. \] For any function f ∈ 𝔏²([0,∞), e^−xdx), its Laguerre expansion becomes \[ f(x)\sim \sum _{n=0}^{\infty }a_n\, L_n(x), \] with coefficients \[ a_n =\int _0^{\infty } f(x)\, L_n(x)\, e^{-x}\, {\text d}x. \] Because the system is orthonormal, Parseval takes the simplest possible form: That’s the entire identity — no extra factors, because the classical Laguerre polynomials are already normalized in 𝔏²([0,∞), e^−xdx). \[ \| f \|^2 = \int_0^{\infty} |f(x)|^2 e^{-x}{\text d} x = \sum_{n\ge 0} a_n^2 , \qquad a_n = \int_{0}^{\infty} f(x)\,L_n (x)\, e^{-x} {\text d}x . \]

Example: The monomial xⁿ has the finite expansion \[ x^n\; =\; n!\, \sum _{k=0}^n (-1)^k {n \choose k}\, L_k(x). \] So the coefficient of Lₖ(x) is \[ a_{n,k} =n!\, (-1)^k {n \choose k},\qquad k=0,1,\dots ,n. \] You can check quickly with Mathematica:

LaguerreExpandMonomial[n_] := Sum[n! (-1)^k Binomial[n, k] Subscript[L, k], {k, 0, n}] LaguerreExpandMonomialPoly[n_] := Expand @ LaguerreExpandMonomial[n]

We set n = 7 and obtain

LaguerreExpandMonomial[7]

5040 Subscript[L, 0] - 35280 Subscript[L, 1] + 105840 Subscript[L, 2] - 176400 Subscript[L, 3] + 176400 Subscript[L, 4] - 105840 Subscript[L, 5] + 35280 Subscript[L, 6] - 5040 Subscript[L, 7]

\[ x^7 = 5040 - 35280 L_1 (x) + 105840 L_2 (x) - 176400 L_3 (x) + 176400 L_4 (x) - 105840 L_5 (x) + 35280 L_6 (x) - 5040 L_7 (x) . \] We check this expansion with Mathematica:

Expand[5040 - 35280*LaguerreL[1, x] + 105840*LaguerreL[2, x] - 176400*LaguerreL[3, x] + 176400*LaguerreL[4, x] - 105840*LaguerreL[5, x] + 35280*LaguerreL[6, x] - 5040*LaguerreL[7, x]]

x^7

The Parseval identity for the Laguerre series states \[ \| x^7 \|^2 = \int_0^{\infty} x^{14} e^{-x} {\text d}x = 87178291200 = 2\left[ 5040^2 + 35280^2 + 2105840^2 + 176400^2 \right] . \] We verify Parseval's identity with Mathematica:

2*5040^2 + 2*35280^2 + 2*105840^2 + 2*176400^2

87178291200

Hermite polynomials conventionally denoted by Hₙ(x) were introduced and studied in detail by Pafnuty Chebyshev. Five years later, in 1864 they were overlooked by the French mathematician Charles Hermite (1822--1901). Therefore, these polynomials were known in 19th century as Chebyshev--Hermite polynomials.

The "physicist's Hermite polynomials" are given by the Rodrigues formula, \[ H_n (x) = (-1)^n e^{x^2} \frac{{\text d}^n}{{\text d} x^n}\, e^{-x^2} \] or recurrence relation \[ \begin{split} H_{n+1} (x) &= 2x\, H_n (x) - 2n\, H_{n-1} (x) , \qquad n=1,2,\ldots , \\ H_0 (x) &= 1, \quad H_1 (x) = 2x, \quad H_2 (x) = 4x^2 -2 . \end{split} \] The orthogonality relation \[ \int _{-\infty }^{\infty }H_m(x)H_n(x)e^{-x^2}\, {\text d}x = 2^n n!\sqrt{\pi}\, \delta _{mn} \] tells us that the ordered system of Hermite polynomials { Hₙ(x) }_n≥0 forms an orthogonal basis in Hilbert space ℌ = 𝔏²(ℝ, w dx)), equipped with the inner product \[ \langle f \mid g \rangle = \int_{-\infty}^{\infty} f(x)\, g(x)\,w(x)\,{\text d}x , \qquad w(x) = e^{-x^2} . \] Any function f ∈ 𝔏²_w ≡ 𝔏²(ℝ, w dx)), square integrable with weight $ \displaystyle \quad w(x) = e^{-x^2} , $ can be expanded into Fourier--Hermite series \[ f(x) \sim \sum_{n\ge 0} \frac{1}{2^n n! \sqrt{\pi}}\, c_n H_n (x) = \sum_{n\ge 0} a_n H_n (x) , \qquad c_n = \int_{-\infty}^{\infty} f(x)\, H_n (x)\,e^{-x^2} {\text d}x . \] Therefore, Parseval’s identity becomes: \[ \| f \|^2 = \int_{\mathbb{R}} | f(x) |^2 e^{-x^2} {\text d}x = \sum_{n\ge 0} \frac{1}{2^n n! \sqrt{\pi}}\, c_n^2 = \sum_{n\ge 0} \left( 2^n n! \sqrt{\pi} \right) a_n^2 . \]

Example: We use the familiar formulas \[ x^n = \frac{n!}{2^n} \sum_{k=0}^{\lfloor n/2 \rfloor} \frac{1}{k! \left( n -2k \right) !}\, H_{n-2k} (x) ; \tag{H.1} \] \[ \mbox{erf} (x) = \frac{2}{\sqrt{\pi}} \int_0^x e^{-t^2} {\text d}t = \frac{1}{\sqrt{2\pi}} \sum_{k\ge 0} \frac{(-1)^k}{k! \left( 2k+1 \right) 2^{3k}}\, H_{2k+1} (x) . \tag{H.2} \] If we choose n = 7, we get \[ x^7 = \frac{1}{2^7}\,H_7 (x) + \frac{21}{64}\, H_5 (x) + \frac{105}{32}\,H_3 (x) + \frac{105}{16}\, H_1 (x) . \]

Simplify[ HermiteH[7, x]/2^7 + 21*HermiteH[5, x]/64 + 105*HermiteH[3, x]/32 + 105*HermiteH[1, x]/16]

x^7

The left hand-side of Parseval’s identity is \[ \| x^7 \|^2 = \int_{\mathbb{R}} x^{14} e^{-x^2} {\text d} x = \frac{135135}{128}\,\sqrt{\pi} \approx 1871.25 , \]

Integrate[x^(14) *Exp[-x^2], {x, -Infinity, Infinity}]

(135135 Sqrt[\[Pi]])/128

We ask Mathematica to evaluate the corresponding right hand-side in Parseval's identity: \[ \sqrt{\pi} \left( \frac{7!}{2^7} + 2^5 5! \left( \frac{21}{64} \right)^2 + 2^3 3! \left( \frac{105}{32} \right)^2 + 2 \left( \frac{105}{16} \right)^2 \right) = \frac{135135}{128}\,\sqrt{\pi} . \]

7!/2^7 + 5! * 2^5 *(21/64)^2 + 3! * 2^3 *(105/32)^2 + 2*(105/16)^2

135135/128

This proves Parseval's identity for x⁷ expansion into Hermite series.

▣

In many situations, it is preferable to use the Hermite functions (also called Gauss--Hermite functions): \[ \psi_n (x) =\frac{1}{\sqrt{2^n n!\sqrt{\pi }}}\, H_n(x)\, e^{-x^2/2}, \qquad n=0,1,2,\ldots , \] that form an orthonormal basis of 𝔏²(ℝ). For instance, these functions are eigenfunctions of the Fourier transform. Any real square integrable function can be expanded into convergent in ℌ series \[ f(x) =\sum_{n=0}^{\infty } c_n\, \psi_n (x), \qquad c_n = \int_{\mathbb{R}} f(x)\,\psi_n (x)\,{\text d} x . \] Then Parseval's identity becomes the clean Hilbert‑space identity: \[ \| f \|^2 = \sum_{n\ge 0} c_n^2 . \] This is the form used in quantum mechanics (harmonic oscillator eigenfunctions).

Example: Let us expand $ \displaystyle \quad f(x) =x^2 e^{-x^2/2} \ $ into Gauss-Hermite series. Because f is even, only even Hermite functions ψ_2k can appear in its expansion. In fact, we’ll see that only ψ₀ and ψ₂ are needed for this expansion.

First, we write functions ψ₀ and ψ₂ explicitely: \[ \psi _0(x)=\frac{1}{\sqrt{2^0 0!\sqrt{\pi }}}e^{-x^2/2}H_0(x) =\pi^{-1/4} e^{-x^2/2}, \] and \[ \psi_2(x) =\frac{1}{\sqrt{2^2 2!\sqrt{\pi }}}e^{-x^2/2}H_2(x) =\frac{1}{\sqrt{8\sqrt{\pi }}}e^{-x^2/2}\left( 4x^2-2\right) = \frac{1}{\sqrt{2\sqrt{\pi }}}e^{-x^2/2}\left( 2x^2 -1\right) \] because \[ H_0(x)=1,\quad H_2 (x) =4x^2-2. \] We look for constants 𝑎, b such that \[ f(x) =x^2 e^{-x^2/2} =a\, \psi_0 (x) + b\, \psi_2 (x). \] Substitute the explicit forms: \[ a\, \psi_0 (x) + b\, \psi_2(x) =e^{-x^2/2}\left[ a\, \pi ^{-1/4}+b\, \frac{4x^2-2}{\sqrt{8\sqrt{\pi }}}\right] . \] We want this to be equal to $ \displaystyle \ x^2e^{-x^2/2},\ $ so we match the polynomial in brackets to x²: \[ x^2 =A+B\left( 4x^2-2 \right) , \] where \[ A=a\, \pi ^{-1/4},\quad B=\frac{b}{\sqrt{8\sqrt{\pi }}}. \] Expanding: \[ x^2=4Bx^2+(A-2B). \] Match coefficients of x² and the constant term: \begin{align*} 4B &=1\quad \Longrightarrow \quad B=\frac{1}{4}, \\ A-2B&=0\quad \Longrightarrow \quad A=2B=\frac{1}{2}. \end{align*} Now recover 𝑎, b: \begin{align*} a&=A\, \pi^{1/4} =\frac{1}{2}\, \pi ^{1/4}, \\ b&=B\, \sqrt{8\sqrt{\pi }}=\frac{1}{4}\, \sqrt{8\sqrt{\pi }}=\frac{1}{4}\cdot 2\sqrt{2}\, \pi ^{1/4}=\frac{\sqrt{2}}{2}\, \pi ^{1/4}=\frac{\pi ^{1/4}}{\sqrt{2}}. \end{align*} So the expansion becomes \[ x^2 e^{-x^2/2} =\frac{\pi^{1/4}}{2}\, \psi_0 (x) + \frac{\pi^{1/4}}{\sqrt{2}}\, \psi_2 (x). \] Because { ψₙ } is orthonormal, you could also obtain these coefficients directly via \[ a=\langle f,\psi_0\rangle ,\quad b=\langle f,\psi_2\rangle , \] and all other ⟨ f, ψₙ ⟩ vanish by orthogonality and parity.

Finally, we state Parseval's identity \[ \| f \|^2 = \int_{-\infty}^{\infty} x^4 e^{-x^2} {\text d}x = \frac{3}{4}\,\sqrt{\pi} = a^2 + b^2 . \]

Integrate[x^4 /Exp[x^2], {x, -Infinity, Infinity}]

(3 Sqrt[\[Pi]])/4

■

End of Example 2

Reverse statement: if $ \displaystyle \quad \left\| f(x) - S_n (f;x) \right\| \quad $ tends to zero as n → ∞, then the difference $ \displaystyle \quad \| f \|^2 - \sum_{k=1}^n \left\vert a_k \right\vert^2 \quad $ tends to zero, namely, numerical series $ \displaystyle \quad\sum_{k=1}^n \left\vert a_k \right\vert^2 \quad $ converges to ∥ f ∥².

Theorem 2: If 𝑎ₙ = ⟨ f, ωₙ ⟩ are Fourier coefficients of function f with respect to an orthonormal system { ωₙ }, then the following inequality \[ \left\| f(x) - S_n (f; x) \right\| \leqslant \left\| f - \sum_{k=1}^n b_k \omega_k \right\| \] holds for any numbers b₁, b₂, … , bₙ.

This property is known as the extreme (or optimal) property of the Fourier series in the sense that it provides the best possible approximation of a function in the least-squares sense among all possible approximations of the same degree. The n-th partial sum Sₙ(f; x) minimizes the mean-square error $ \displaystyle \quad \int_a^b \left\vert f(x) - S_n (f; x) \right\vert^2 {\text d} x \quad $ compared to any other linear combination of functions from the given system of orthonormal functions.

Let us consider square norm of the difference \begin{align*} \left\| f - \sum_{k=1}^n b_k \omega_k \right\|^2 &= \left\langle f - \sum_{k=1}^n b_k \omega_k , f - \sum_{k=1}^n b_k \omega_k \right\rangle \\ &= \| f \|^2 - \sum_{k=1}^n b_k^{\ast} \left\langle f, \omega_k \right\rangle - \sum_{k=1}^n b_k \left\langle \omega_k , f \right\rangle + \sum_{k=1}^n \left\vert b_k \right\vert^2 \\ &= \| f \|^2 - \sum_{k=1}^n a_k b_k^{\ast} - \sum_{k=1}^n a_k^{\ast} b_k + \sum_{k=1}^n \left\vert b_k \right\vert^2 + \sum_{k=1}^n \left\vert a_k \right\vert^2 - \sum_{k=1}^n \left\vert a_k \right\vert^2 \\ &= \| f \|^2 - \sum_{k=1}^n \left\vert a_k \right\vert^2 + \sum_{k=1}^n \left\vert a_k - b_k \right\vert^2 \\ &= \left\| f(x) - S_n (f; x) \right\|^2 + \sum_{k=1}^n \left\vert a_k - b_k \right\vert^2 \\ & \geqslant \left\| f(x) - S_n (f; x) \right\|^2 . \end{align*}

Example 3: We work in Hilbert space 𝔏²[-1,1] with inner product \[ \langle f,g\rangle =\int _{-1}^1f(x)g(x)\, {\text d}x. \] The (unnormalized) Legendre polynomials of degree n can be defined via Rodrigues' formula \[ P_n (x) = \frac{1}{2^n n!} \,\frac{{\text d}^n}{{\text d} x^n} \left( x^2 -1 \right)^n , \qquad n=0,1,2,\ldots , \] or recursively \[ \left( n+1 \right) P_{n+1} (x) = \left( 2n+1 \right) x\, P_n (x) -n\,P_{n-1} (x) , \qquad P_0 = 1, \quad P_1 (x) = x. \] They satisfy the orthogonal relation: \[ \int _{-1}^1 P_n(x)\,P_m(x)\, dx=0\quad (n\neq m),\qquad \int _{-1}^1P_n(x)^2\, {\text d}x =\frac{2}{2n+1}. \] To get an orthonormal system { ωₙ }, we set \[ \omega _n(x)=\sqrt{\frac{2n+1}{2}}\, P_n(x). \] Then ⟨ ωₙ , ω_m ⟩ = δ_nm, the Kronecker delta. We choose a function \[ f(x)=x^2. \] The classical Legendre expansion of this function is \[ x^2 =\frac{1}{3}P_0(x)+\frac{2}{3}P_2(x). \] In terms of the orthonormal basis ωₙ, write \[ P_0(x)=\sqrt{\frac{2}{1}}\, \omega_0 (x) =\sqrt{2}\, \omega _0(x),\qquad P_2(x)=\sqrt{\frac{2}{5}}\, \omega_2 (x). \] So \[ x^2=\frac{1}{3}\sqrt{2}\, \omega _0(x)+\frac{2}{3}\sqrt{\frac{2}{5}}\, \omega _2(x). \] Thus, the Fourier coefficients 𝑎ₙ = ⟨ f , ωₙ ⟩ are \[ a_0=\frac{\sqrt{2}}{3},\quad a_2=\frac{2}{3}\sqrt{\frac{2}{5}},\quad a_1=0,\quad a_n=0\ (n\geq 3). \] The second partial sum (up to n = 2) is \[ S_2 (f;x) = a_0\omega _0(x)+a_1\omega _1(x)+a_2\omega _2(x)=\frac{\sqrt{2}}{3}\, \omega _0(x)+\frac{2}{3}\sqrt{\frac{2}{5}}\, \omega _2(x), \] which in fact equals f(x) exactly in this case (since the expansion stops at P₂). To see the theorem in a nontrivial way, imagine we only take a partial sum up to n = 1: \[ S_1(f;x)=a_0\omega _0(x)+a_1\omega _1(x)=\frac{\sqrt{2}}{3}\, \omega _0(x). \] Illustrating the inequality with an arbitrary linear combination, we invoke the theorem that says: for any numbers b₀, b₁, we have \[ \left\| f(x) - S_1 (f; x) \right\| \leqslant \left\| f - \sum_{k=0}^1 b_k \omega_k \right\| = \left\| f - b_0 \omega_0 - b_1 \omega_1 \right\| , \] because, in general, \[ \left\| f(x) - S_n (f; x) \right\| \leqslant \left\| f - \sum_{k=1}^n b_k \omega_k \right\| . \] Here \[ f-S_1(f)=f-\frac{\sqrt{2}}{3}\, \omega _0=\left( \frac{2}{3}\sqrt{\frac{2}{5}}\, \omega _2\right) +\sum _{n\geq 3}a_n\omega _n, \] so f - S₁(f) is orthogonal to ω₀;, ω₁ (it lives entirely in the orthogonal complement of span{ ω₀;, ω₁ } ). Now take any other approximation \[ g(x)=b_0\omega _0(x)+b_1\omega _1(x). \] Then S₁(f) - g lies in span{ ω₀;, ω₁ }. So these two terms are orthogonal, and by Pythagoras, \[ \| f-g\| ^2 =\| f-S_1(f)\| ^2 +\| S_1(f)-g\| ^2\; \geq \; \| f-S_1(f)\| ^2. \] Taking square roots gives exactly the inequality \[ \| f-S_1(f)\| \leq \| f-g\| \] for every choice of b₀, b₁. That is, among all linear combinations of ω₀;, ω₁, the Fourier–Legendre partial sum S₁(f) is the unique best 𝔏²-approximation to f.

So in this Legendre example, the theorem is realized by the fact that the Fourier–Legendre partial sum Sₙ(f) is the orthogonal projection of f onto span{ ω₀;, ω₁, … , ωₙ }, and any other choice of coefficients b_k only moves you farther away in 𝔏²-norm.

Why this is the correct formula?

Sₙ(f) is the orthogonal projection of f onto span{ ω₀;, ω₁, … , ωₙ }.
Therefore \[ f-S_n(f)\; \perp \; \mathrm{span}\{ \omega _1,\dots ,\omega _n\} . \]
Meanwhile, \[ S_n(f)-g\in \mathrm{span}\left\{ \omega _1,\dots ,\omega _n \right\} . \] So the decomposition above splits f-g into two orthogonal components.

■

End of Example 3

Because Theorem 2 plays a pivotal role, we restate it in the framework of an abstract Hilbert space, keeping the original numbering for convenience. We trust that the reader is sufficiently mature mathematically to distinguish the general setting from the particular case of 𝔏². The Hilbert‑space version of the best‑approximation property will be essential in the developments that follow.

Theorem 2: (Best Approximation by Fourier Partial Sums in a Hilbert Space) Let ℌ be a Hilbert space and let { ω₁, ω₂, … , ωₙ } be an orthonormal system in ℌ. For any f ∈ ℌ, define the Fourier coefficients $ \displaystyle \quad a_k=\langle f,\omega _k\rangle ,\qquad k=1,\dots ,n, \quad $ and the Fourier partial sum $ \displaystyle \quad S_n (f; x)=\sum _{k=1}^na_k\, \omega _k. \quad $ Then for every choice of scalars b₁, b₂, … , bₙ, \[ \left\| \, f(x) - S_n (f; x) \right\| \leqslant \left\| \, f - \sum_{k=1}^n b_k \omega_k \right\| . \] Thus, Sₙ(f) is the unique best approximation to f from the subspace \[ V_n =\mathrm{span}\left\{ \omega _1 , \omega_2 , \dots ,\omega _n \right\} . \]

Note on separability. This theorem does not require the Hilbert space to be separable. Separability is only needed when one wants a countable orthonormal basis and infinite Fourier expansions. For a finite orthonormal system, the theorem holds in any Hilbert space.

Let \[ g=\sum _{k=1}^nb_k\, \omega _k\in V_n. \] Since \[ S_n(f)=\sum _{k=1}^n\langle f,\omega _k\rangle \, \omega _k \] is the orthogonal projection of f onto Vₙ, we have \[ f-S_n(f)\; \perp \; V_n. \] But Sₙ(f) - g ∈ Vₙ. Therefore, using the Pythagorean identity, we get \[ \| f-g\|^2 =\| f-S_n(f)\|^2 + \| S_n(f)-g\|^2\; \geq \; \| f-S_n(f)\| ^2. \] Taking square roots gives the desired inequality. Thus Sₙ(f) is the unique best approximation to f from Vₙ.

Example 4: Let \[ L_n (x) = \frac{e^x}{n!}\,\frac{{\text d}^n}{{\text d}x^n} \left( e^{-x} x^n \right) = \frac{1}{n!} \left( \frac{\text d}{{\text d}x} -1 \right)^n x^n , \qquad n=0,1,\ldots , \] be the (standard) Laguerre polynomials satisfying the orthogonality relation \[ \int _0^{\infty }L_m(x)L_n(x)e^{-x}\, {\text d}x = \delta_{mn} . \] Thus, the system \[ \omega _n(x) =L_n (x),\qquad n=0,1,2,\dots \] is already orthonormal in Hilbert space 𝔏²([0, ∞), e^−xdx).

Choose a function f(x) = x ∈ ℌ ≡ 𝔏²([0, ∞), e^−xdx).

We compute its Fourier–Laguerre coefficients: \[ a_n =\langle f,\omega _n\rangle =\int _0^{\infty }x\, L_n(x)\, e^{-x}\, {\text d}x. \] A standard identity for Laguerre polynomials is: \[ \int _0^{\infty }x\, L_n(x)\, e^{-x}\, {\text d}x = \left\{ \, \begin{array}{ll}\phantom{-}1,&\quad n=0,\\ -1,&\quad n=1,\\ \phantom{-}0,&\quad n\geq 2.\end{array}\right. \]

Integrate[x*LaguerreL[0, x]/Exp[x], {x, 0, Infinity}]

Integrate[x*LaguerreL[1, x]/Exp[x], {x, 0, Infinity}]

-1

Integrate[x*LaguerreL[2, x]/Exp[x], {x, 0, Infinity}]

Thus, \[ a_0=1,\qquad a_1=-1,\qquad a_n=0\quad (n\geq 2). \] So the Fourier–Laguerre expansion of f(x) = x is \[ x=1\cdot L_0(x)-1\cdot L_1(x). \]

Let us approximate f(x) = x using only the subspace \[ V_0 =\mathrm{span}\{ L_0\} . \] The best approximation is the projection \[ S_1(f)(x)=a_0 L_0(x) =1. \] Now take any other approximation of the form \[ g(x)=b_0L_0(x) =b_0. \] The theorem states: \[ \| x - L_0 \| = \| x - 1 \| \le \| x - b_0 \| . \] Let us verify this explicitly. Compute the norms \[ \| x-b_0\|^2 =\int _0^{\infty }(x-b_0)^2 e^{-x}\, {\text d}x. \] A direct computation gives \[ \int _0^{\infty }(x-b_0)^2e^{-x}\, {\text d}x =2-2b_0 +b_0^2. \]

Integrate[(x - b0)^2 *Exp[-x] , {x, 0, Infinity}]

2 - 2 b0 + b0^2

Thus, \[ \| x-b_0\|^2 =(b_0-1)^2 +1. \] The minimum occurs at b₀ = 1, giving \[ \| x-1\| ^2=1. \] Therefore, for all b₀, \[ \| x-1\|^2 =1\; \leq \; (b_0-1)^2+1=\| x-b_0\| ^2, \] which is exactly the theorem. ■

End of Example 4

In theory of finite-dimensional vector spaces, you learn a number of equivalent alternative characterizations of a complete set of basis vectors. The corresponding problem in separable Hilbert space is concerned about representing functions as linear combinations of some given set of functions that are usually chosen as orthonormal systems. In other words, we are facing a problem of series expansions of functions in terms of a given set.

The first question we must face is that of defining the completeness of an orthonormal system of functions in separable Hilbert space such as 𝔏². We could say that an orthonormal system of functions { ωₙ(x) } is complete if any function f(x) in Hilbert space is expressible as a linear combination of the ωₙ(x):

\[ f(x) = \sum_n c_n \omega_n (x) , \]

the series converging to f at every point x. This would provide a close analog of the idea of completeness of a set of basis vectors in finite dimensional spaces. However, this criterion of convergence is, for many purposes, unnecessarily severe and exclusive because it would lead to non-existence of complete orthonormal system of functions in Hilbert space. Hence, instead of demanding strict pointwise convergence, we shall weaken the convergence criterion that will permit existence of complete sets.

A sequence of functions hₙ(x) converges in the mean (or mean square) to h(x) if

\[ \lim_{n\to\infty} \int_a^b \left\vert h(x) - h_n (x) \right\vert^2 {\text d}x = 0, \]

that is, if for every ε > 0 there exists N(ε) such that for n > N(ε), \[ \int_a^b \left\vert h(x) - h_n (x) \right\vert^2 {\text d}x < \varepsilon . \] We abbreviate this type of convergence by writing l.i.m. hₙ = h. A series $ \displaystyle \quad \sum_{n\ge 1} s_n (x) \quad $ converges in mean to h(x) if \[ \lim_{n\to\infty} \int_a^b \left\vert h(x) - \sum_{k=1}^n s_k (x) \right\vert^2 {\text d} x = 0 . \]

Now we define completeness of an orthonormal system in terms of mean convergence.

Let g(x) ∈ 𝔏²(𝑎, b), and let { ω_k(x) } be an ordered orthonormal system of functions in this Hilbert space. If there exist a list of constants { cₙ } such that the sequence of partial sums \[ g_n (x) = \sum_{k=1}^n c_k \omega_k (x) \] converges in the mean to g(x), then the system of functions { ω_k(x) } is called a complete orthonormal set.

Equivalently, if the mean square error can be made arbitrarily small, \[ \lim_{n\to\infty} \int_a^b \left\vert g(x) - g_n (x) \right\vert^2 {\text d}x = \lim_{n\to\infty} \int_a^b \left\vert g(x) - \sum_{k=1}^n c_k \omega_k (x) \right\vert^2 {\text d} x = 0 , \] then the set { ω_k(x) } is a complete orthonormal system.

It should be noted that coefficients { c_k } are independent of n. Thus, as n increases and we include more terms in the partial sum gₙ approximating g, the earlier coefficients do not change. So we can write

\[ f(x) = \sum_{k\ge 1} c_k \omega_k (x) \]

when infinite series approximates a function in the mean. However, we prefer to omit the argument in this relation when we deal with Hilbert space 𝔏².

Example 5: There are known four kinds of Chebyshev polynomials that are usually denoted by Tₙ(x), Uₙ(x), Vₙ(x), and Wₙ(x), n = 0, 1, 2, …. They form orthogonal systems in Hilbert spaces ℌ_x = 𝔏²([−1, 1], w dx) of square integrable on [−1, 1] functions with weights \[ w_1 (x) = \frac{1}{\sqrt{1-x^2}}, \quad w_2 (x) = \sqrt{1-x^2}, \quad w_3 (x) = \sqrt{\frac{1+x}{1-x}} , \quad w_4 (x) = \sqrt{\frac{1-x}{1+x}} , \] respectively. The substitution \[ x = \cos\theta , \quad \theta \in [0, \pi ], \quad x \in [-1,1] \] is a smooth strictly decreasing bijection between (0, π) and (−1, 1). Its Jacobian is \[ {\text d}x = - \sin\theta\,{\text d}\theta , \qquad \sin\theta = +\sqrt{1- x^2} . \] So any integral on [−1, 1] can be rewritten as an integral on (0, π) with weight coming from sinθ (Jacobian), w(cosθ), and conversely. This is the basic mechanism behind all the Chebyshev systems: they are trigonometric systems in disguise, transported from θ-space to x-space by substitution x = cosθ.

Let w(x) be a positive weight on (−1, 1), that is used to define the Hilbert space ℌ_x = 𝔏²([−1,1], wdx). We want to relate it to a space of θ. We evaluate a norm of a function from this space \[ \| f \|^2_x = \int_{-1}^1 \left\vert f(x) \right\vert^2 w(x)\,{\text d}x = \int_0^{\pi} \left\vert f(\cos\theta ) \right\vert^2 w(\cos\theta )\,\sin\theta\,{\text d}\theta = \| f \|^2_{\theta} . \] Hence, we define a map U : ℌ_x → ℌ_θ by \[ (Uf)(\theta ) = f(\cos\theta ), \] which is an isometry from ℌ_x into ℌ_θ because x ↦ arccos(x) is a bijection between (−1, 1) and (0, π). This isomorphism is onto since every g ∈ ℌ_θ can be written as g(θ) = f(cosθ) for a unique f ∈ ℌ_x. Thus, \[ U \ :\ ℌ_x \mapsto ℌ_{\theta} \quad \mbox{is a unitary isomorphism}. \] Suppose { ϕₙ } is an orthogonal system in ℌ_θ. Define \[ \psi_n (x) := \phi_n (\mbox{arccos}x), \qquad x \in [-1, 1]. \] Then ψₙ = U⁻¹ϕₙ, and orthogonality is preserved: \[ \left\langle \psi_k , \psi_n \right\rangle_x = \left\langle U\psi_k , U \psi_n \right\rangle_{\theta} = \left\langle \phi_k , \phi_n \right\rangle_{\theta} \] Hence { ψₙ } is orthogonal in ℌ_x. More importantly, completeness is preserved. If the linear span of { ϕₙ } is dense in ℌ_θ, then for any f ∈ ℌ_x, \[ U\,f \in ℌ_{\theta} \] can be approximated in norm by finite linear combination of ϕₙ. Applying U⁻¹, we see that f can be approximated in norm by finite linear combinations of ψₙ. Symbolically, \[ \overline{\mbox{span}\{ \phi_n \}} = ℌ_{\theta} \quad \iff \quad \overline{\mbox{span}\{ \psi_n \}} = ℌ_{x} . \] So completeness is not something we have to re-prove in x-space; it is inherited from the trigonometric system via the unitary map. Let { ϕₙ } be a complete orthogonal system in ℌ_θ and ψₙ = U⁻¹ϕₙ, the corresponding system in ℌ_x. Suppose f ∈ ℌ_x is orthogonal to all ψₙ: \[ \left\langle f, \psi_n \right\rangle_x = 0 \qquad \forall n . \] Apply U: \[ 0 = \left\langle f, \psi_n \right\rangle_x = \left\langle U\,f, U\,\psi_n \right\rangle_{\theta} = \left\langle U\,f, \phi_n \right\rangle_{\theta} , \quad \forall n . \] So U f is orthogonal to all ϕₙ. By completeness of { ϕₙ } in ℌ_θ, this forces U f = 0 in ℌ_θ; hence, f = 0 in ℌ_x. Therefore, \[ \left\{ \psi_n \right\}^{\perp} = \{ 0 \} , \] which is exactly statement that { ψₙ } is complete in ℌ_x.

This is the core of the completeness proof for Chebyshev systems; once you know the trigonometric system is complete, the rest is just the unitary change of variables. We demonstrate this approach for each Chebyshev polynomial system.

For Chebyshev systems, the weight w(x) is chosen so that the corresponding weight μ(θ) in ℌ_θ becomes very simple. The Chebyshev polynomial of the first kind is expressed as \[ T_n (\cos (\theta ) = \cos (n\theta ) , \] which is just cosine system, known to be complete. The inner product in both spaces ℌ_x and ℌ_θ are related via unitary transformation U \[ \left\langle f , T_n \right\rangle = \int_{-1}^1 f(x)\,T_n (x) \left( \frac{{\text d}x}{\sqrt{1-x^2}} \right) = \int_0^{\pi} f(\cos\theta )\,\cos (n\theta ) \left( \frac{\sin\theta\,{\text d}\theta}{\sin\theta} \right) = \int_0^{\pi} f(x)\,\cos (n\theta ) \,{\text d}\theta = \left\langle U\,f , \cos (n\theta ) \right\rangle_{\theta} . \]

For the Chebyshev polynomial of the second kind, we have \[ U_n (\cos\theta ) = \frac{\sin \left( \left( n+1 \right) \theta \right)}{\sin\theta} , \qquad \sqrt{1-x^2}\,{\text d}x = \sin^2 \theta \,{\text d}\theta . \] Then \[ \left\langle f, U_n \right\rangle = \int_{-1}^1 f(x)\,U_n (x)\,\sqrt{1-x^2}\,{\text d}x = \int_0^{\pi} f(\cos\theta )\, \sin \left( \left( n+1 \right) \theta \right) \sin\theta\, {\text d}\theta . \] So we have \[ \left\langle f, U_n \right\rangle_x = \left\langle (U\,f)(\theta )\,\sin\theta , \sin ((n+1)\theta ) \right\rangle , \] which is the expansion of product (U f) sinθ into sin-Fourier series.

For Chebyshev polynomials of the third kind, we have \[ V_n (\cos\theta ) = \frac{\cos \left( n + \frac{1}{2} \right)\theta}{\cos\frac{\theta}{2}} = \frac{\cos \frac{(2n+1)\theta}{2}}{\cos\frac{\theta}{2}}. \] The inner product in Hilbert space ℌ_x = 𝔏²([−1,1], w₃dx) with weight $ \displaystyle \quad w_3 (x) = \sqrt{\frac{1+x}{1-x}} \quad $ is related to the inner product in ℌ_θ: \[ \left\langle f , V_n \right\rangle_x = \int_{-1}^1 f(x)\,V_n (x) \left( \sqrt{\frac{1+x}{1-x}}\,{\text d}x \right) = \int_0^{\pi} f(\cos\theta )\, \frac{\cos \left( n + \frac{1}{2} \right)\theta}{\cos\frac{\theta}{2}} \left( \frac{\cos\frac{\theta}{2}}{\sin\frac{\theta}{2}} \,\sin\theta\,{\text d}\theta\right) \] because \[ \frac{1 + \cos\theta}{1-\cos\theta} = \left( \frac{\cos\frac{\theta}{2}}{\sin\frac{\theta}{2}} \right)^2 . \] Upon simplification, we reduce \[ \left\langle f , V_n \right\rangle_x = 2 \int_0^{\pi} f(\cos\theta )\,\cos \left( n + \frac{1}{2} \right)\theta \,\cos\frac{\theta}{2}\,{\text d}\theta = \left\langle (U\,f)(\theta )\,2\,\cos\frac{\theta}{2} , \cos \left( n + \frac{1}{2} \right)\theta \right\rangle , \] which is the cosine-Fourier expansion of function 2(U f(θ) cos(θ/2).

For Chebyshev polynomials of the fourth kind, we have the expansion \[ f(x) \sim \sum_{n\ge 0} a_n W_n (x) \] for arbitrary function f ∈ ℌ_x. Under transformation x = cosθ, it is equivalent to trigonometric expansion \[ \left( U\,f \right) (\theta ) = f(\cos\theta ) \sim \sum_{n\ge 0} a_n \,\frac{\sin (n+1/2)\theta}{\sin (\theta /2)} \] because \[ W_n (\cos\theta ) = \frac{\sin \left( n + \frac{1}{2} \right)\theta}{\sin\frac{\theta}{2}} . \] The coefficients are \[ a_n = \left\langle f , W_n \right\rangle_x = \int_{-1}^1 f(x)\,W_n (x) \left( \sqrt{\frac{1-x}{1+x}}\,{\text d}x \right) = \int_0^{\pi} f(\cos\theta )\,\frac{\sin \left( n + \frac{1}{2} \right)\theta}{\sin\frac{\theta}{2}} \left( \frac{\sin\frac{\theta}{2}}{\cos\frac{\theta}{2}}\,\sin\theta \,{\text d}\theta \right) \] Upon simplification, we get \[ a_n = \left\langle f , W_n \right\rangle_x = \int_0^{\pi} f(\cos\theta )\,2\,\sin\frac{\theta}{2}\,\sin (n+1/2)\theta\,{\text d}\theta = \left\langle 2\,(U\,f)(\theta )\, \sin\frac{\theta}{2}\,, \,\sin (n+1/2)\theta \right\rangle . \] If we multiply both sides by 2sin(θ/2), we get a pure sine series \[ f(\cos\theta )\,2\,\sin \frac{\theta}{2} \sim \sum_{n\ge 0} c_n \sin ((n+1/2)\theta ) , \quad c_n = \langle \cdot , \sin (n+1/2)\theta \rangle . \] whith respect to the standard Lebesgue measure dθ. All the familiar machinery---orthogonality, Parseval, convergence theorems---lives here, in the sine system. The Chebyshev-IV story is just this sine story, pulled back through the unitary map.

Thus, the role of x = cosθ in completeness proofs is not cosmetic: it is the bridge that turns polynomials questions on [−1,1] into trigonometric questions on [0, π], where the structure is already fully understood. ■

End of Example 5

Since mean convergence does not necessarily imply pointwise or uniform convergence, it is clear that the completeness of an orthonormal system of functions { ωₙ(x) } expressed by the relation

\[ \lim_{n\to\infty} \int_a^b \left\vert f(x) - \sum_{k=1}^n a_k \omega_k (x) \right\vert^2 {\text d} x = 0 \]

or symbolically $ \displaystyle \quad f(x) = \underset{n\to\infty}{\mbox{l.i.m.}} \sum_{k=1}^n a_k \omega_k (x) \quad $ does not apply that this identity holds at every point because we may equate f(x) and expansion series if the series converges pointwise.

If the set { ωₙ(x) } of orthonormal functions is complete, then the equal sign holds in Bessel's inequality and we observe Parseval's identity for every function f ∈ 𝔏². Therefore, Parseval's identity is also called the completeness relation, which can be states as

\[ \langle f, g \rangle = \sum_{n\ge 1} \langle f, \omega_n \rangle \,\langle \omega_n, g \rangle . \]

A set of orthonormal functions is said to be closed if no nonzero function is orthogonal to every function in the set.

Theorem 3: A set of orthonormal functions in Hilbert space is complete if and only if it is closed.

We first prove that completeness of the set implies that the set is closed. Assume that there is a nonzero function f(x) such that \[ \langle \omega_k, f \rangle = c_k = \int_a^b \omega_k^{\ast} f(x)\,{\text d}x = 0 \] for all k. Then \[ \lim_{n\to\infty} \int_a^b \left\vert f - \sum_{k=1}^n c_k \omega_k \right\vert^2 {\text d}x = \int_a^b \left\vert f \right\vert^2 {\text d}x \ne 0. \] Hence, the set { ωₙ(x) } is not complete. Thus, completeness of an orthonormal system of functions implies that there are no functions that are orthogonal to every member of the set.

We now prove the converse: if the orthonormal system is closed, then it is complete. If it is not complete, then the completeness relation \[ \langle f, f \rangle = \sum_{k\ge 1} \left\vert \langle \omega_k , f \rangle \right\vert^2 \] is not satisfied. Hence, there exists some function f(x) such that \[ \| f \|^2 > \sum_{n\ge 1} \left\vert a_n \right\vert^2 , \qquad a_n = \langle \omega_n , f \rangle . \] Since the above infinite series converges, the sequence of partial sums \[ g_n = \sum_{k=1}^n a_k \omega_k (x) \] is a Cauchy sequence in Hilbert space. This sequence of partial sums must converge in mean because of the completeness of the space. Let us call this limit g(x) and 𝑎ₙ = ⟨ ωₙ , g ⟩. Therefore, ⟨ ωₙ , g ⟩ = ⟨ ωₙ , f ⟩, so ⟨ ωₙ , f − g ⟩ = 0. Hence, f − g is orthogonal to ωₙ for all n. We now show that the norm of f − g is not equal to zero, so system { ωₙ(x) } is not closed, contrary to our assumption. It will then follow by contradiction that the system { ωₙ(x) } is complete and proof will be finished.

Using inequality \[ \| x - y \| \ge \left\vert \| x \| - \| y \| \right\vert . \] we have \[ \| f - g \| = \| f - g_n - \left( g - g_n \right) \| \ge \left\vert \| f - g_n \| - \| g - g_n \| \right\vert \] for all n. Now as n → ∞, we know that ∥g − gₙ∥ → 0, whereas \[ \| f - g_n \|^2 = \left\| f - \sum_{k=1}^n a_k \omega_k \right\|^2 = \| f \|^2 = \sum_{k=1}^n \left\vert a_k \right\vert^2 > 0 \] for all n by assumption. Thus, ∥f − g∥ > 0 and the proof is complete.

Example 6: We work in the Hilbert space ℌ = 𝔏²(ℝ), that is equipped with the inner product \[ \langle f, g\rangle =\int _{-\infty }^{\infty} \overline{f(x)}\, g(x)\, {\text d}x = \int _{\mathbb{R}} f^{\ast}(x) \, g(x)\, {\text d}x , \] where $ \displaystyle \quad \overline{f(x)} = f^{\ast}(x) \quad $ denotes complex conjugate of function f(x). Let $ \displaystyle \quad \left\{ \psi_n \right\}_{n=0}^{\infty } \quad $ be the Hermite functions, an orthonormal system in Hilbert space 𝔏²(ℝ): \[ \psi _n(x)=c_n\, H_n(x)\, e^{-x^2/2}, \] where Hₙ are the Hermite polynomials and cₙ are normalization constants chosen so that \[ \langle \psi _n,\psi _m\rangle =\delta _{nm}. \] A classical result states that the Hermite functions form a complete orthonormal system in 𝔏²(ℝ): for every f ∈ 𝔏²(ℝ), there exist coefficients { 𝑎ₙ } such that \[ f=\sum _{n=0}^{\infty }a_n\, \psi _n\quad \mathrm{in\ }𝔏^2\mathrm{-sense}, \] i.e., \[ \lim _{N\rightarrow \infty }\left\| \,f-\sum _{n=0}^Na_n\, \psi _n\right\| _{L^2(\mathbb{R})}=0. \] By Theorem 3, this is equivalent to saying that the Hermite system is closed in the sense that:

If f ∈ 𝔏²(ℝ) satisfies ⟨ f , ψₙ ⟩ = 0 for all n, then f = 0 in 𝔏²(ℝ).

So for Hermite functions:

Completeness: every 𝔏²-function can be approximated (in mean square) by finite Hermite expansions.
Closedness: the only function orthogonal to all Hermite functions is the zero function.

These two properties are equivalent by Theorem 3, and the Hermite system is a concrete, classical example where both hold.

Contrast: a non-complete, non-closed orthonormal subset. Consider only the first two Hermite functions { ψ₀ , ψ₁ }. This is still an orthonormal set, but not complete in 𝔏²(ℝ): for example, \[ f(x)=\psi _2(x) \] is orthogonal to both ψ₀ and psi;₁, yet f ≠ 0. Thus, the set { ψ₀, ψ₁ } is not closed in the sense of Theorem 3, and indeed not complete.

This illustrates Theorem 3:

The full Hermite system { ψₙ }_n≥0: orthonormal, complete, and closed.
A proper orthonormal subset { ψ₀ , ψ₁ }: orthonormal but not complete, hence not closed (there exists a nonzero function orthogonal to all of them).

■

End of Example 6

The following statement is valid for general Hilbert space, not only for 𝔏².

Theorem 4: Let ℌ be a Hilbert space and { ωₙ(x) } be an ordered complete orthonormal system in ℌ. For any f ∈ ℌ, the Fourier series $ \displaystyle \quad \sum_{n\ge 1} \left\langle \omega_n , f \right\rangle \omega_n (x) \quad $ converges to f in the mean square (i.e., in the norm of ℌ).

Step 1: Define the partial sums. Let \[ a_n=\langle f,\omega _n\rangle ,\qquad S_N(f)=\sum _{n=1}^Na_n\, \omega _n. \] We must show \[ \lim _{N\rightarrow \infty }\| f-S_N(f)\| =0. \]

Step 2: Use completeness (density of finite linear combinations). Completeness of { ωₙ(x) } means that the linear span of { ωₙ(x) } is dense in ℌ.

Equivalently: for every ε > 0, there exists a finite linear combination \[ g=\sum _{k=1}^M c_k\, \omega _k \] such that \[ \| f-g\| <\varepsilon . \] Fix such a g and M.

Step 3: Best approximation property of S_M(f). By Theorem 2 (orthogonal projection is best approximation), we know that among all linear combinations of ω₁, … , ω_M, \[ S_M(f)=\sum _{k=1}^M\langle f,\omega _k\rangle \, \omega _k \] is the closest to f. That is, \[ \| \,f-S_M(f)\| \; \leq \; \left\| f-\sum _{k=1}^Mc_k\, \omega _k\right\| =\| f-g\| <\varepsilon . \] So we have found an M such that \[ \| \,f -S_M(f)\| <\varepsilon . \]

Step 4: Monotonicity of the error and convergence. For N > M, we have \[ S_N(f)=S_M(f)+\sum _{k=M+1}^Na_k\, \omega _k. \] The difference \[ f-S_N(f) \] is orthogonal to each ωₙ, and the additional tail $ \displaystyle \quad \sum _{k=M+1}^N a_k\, \omega _k \quad $ lies in the span of { ω_M+1, … , ω_N }. A standard computation (or Bessel’s inequality) shows that \[ \| f-S_N(f)\| ^2=\| f\| ^2-\sum _{k=1}^N|a_k|^2 \] is a decreasing sequence in N. In particular, \[ \| f-S_N(f)\| \leq \| f-S_M(f)\| <\varepsilon \quad \mathrm{for\ all\ }N\geq M. \] Thus, \[ \limsup _{N\rightarrow \infty }\| f-S_N(f)\| \leq \varepsilon . \] Since ε > 0 was arbitrary, we conclude \[ \lim _{N\rightarrow \infty }\| f-S_N(f)\| =0. \] This is exactly convergence in mean square (in the norm of the Hilbert space).

So the key logical structure is:

Theorem 3: orthonormal system is complete ⇔ closed (no nonzero vector orthogonal to all).
Completeness ⇒ density of finite linear combinations.
Theorem 2: orthogonal projection onto span{ ω₁, ω₂, … , ωₙ } is the best approximation.

Put together ⇒ Fourier partial sums converge in mean square to f.

Example 7: We work in the weighted Hilbert space ℌ with inner product: \[ \langle f,g\rangle =\int _{-1}^1 f(x)\,g(x)\, \frac{{\text d}x}{\sqrt{1-x^2}}. \] The Chebyshev polynomials of the first kind Tₙ(x) can be defined by \[ T_n(\cos \theta ) = \cos (n\theta ), \] with the orthogonality relations \[ \int _{-1}^1 T_m(x)\,T_n(x)\, \frac{{\text d}x}{\sqrt{1-x^2}} =\left\{ \, \begin{array}{ll}\textstyle 0,&\textstyle m\neq n,\\ \pi ,&\textstyle n=m=0,\\ \frac{\pi }{2},&\textstyle n=m\geq 1.\end{array}\right. \] Chebyshev polynomials of the first kind Tₙ(x) are specific cases of Jacobi polynomials: \[ T_n (x) = \frac{2^{2n}}{\binom{2n}{n}}\, P^{(-1/2, -1/2)}_n (x) , \qquad n=0,1,2,\ldots . \]

Let us define an orthonormal system { ωₙ }_n≥0 by \[ \omega_0(x)=\frac{1}{\sqrt{\pi }}\, T_0(x) = \frac{1}{\sqrt{\pi }},\qquad \omega _n(x)=\sqrt{\frac{2}{\pi }}\, T_n (x),\quad n\geq 1. \] Then { ωₙ } is a complete orthonormal system in ℌ = 𝔏²([−1,1], 1/√(1−x²)).

Theorem 4 in this setting claims that for any f ∈ ℌ, its Fourier--Chebyshev series converges in the mean to f when Fourier–Chebyshev coefficients are evaluated as \[ a_n=\langle f,\omega _n\rangle ,\qquad S_N(x)=\sum _{n=0}^N a_n\, \omega _n(x). \]

Let’s take f(x) = |x| on [-1,1]. We form its Fourier–Chebyshev series: \[ f(x)\sim \sum _{n=0}^{\infty} a_n\, \omega _n(x),\qquad a_n=\langle f,\omega _n\rangle . \] Because f is even and Tₙ is even for even n, odd for odd n, all odd coefficients 𝑎ₙ vanish: \[ a_{2k+1}=0, \qquad k=0,1,2,\ldots . \] So only even terms remain.

We. compute the coefficients via θ-substitution. Use x = cosθ, θ ∈ [0, π]. Then \[ {\text d}x=-\sin \theta \, {\text d}\theta ,\qquad \sqrt{1-x^2}=\sin \theta ,\qquad \frac{{\text d}x}{\sqrt{1-x^2}} = -{\text d}\theta . \] Also |x| = |cosθ|, and on [0, π], \[ |\cos \theta |=\left\{ \, \begin{array}{ll}\textstyle \cos \theta ,&\textstyle 0\leq \theta \leq \frac{\pi }{2},\\ -\cos \theta ,&\textstyle \frac{\pi }{2}\leq \theta \leq \pi .\end{array}\right. \] For n ≥ 1, \[ a_n = \langle f,\omega _n\rangle =\sqrt{\frac{2}{\pi }}\int _{-1}^1|x|\, T_n(x)\, \frac{{\text d}x}{\sqrt{1-x^2}}=\sqrt{\frac{2}{\pi }}\int _0^{\pi }|\cos \theta |\cos (n\theta )\, {\text d}\theta , \]

Integrate[Abs[Cos[t]]*Cos[n*t], {t, 0, Pi}]

(-2 Cos[(n \[Pi])/2] + n Sin[n \[Pi]])/(-1 + n^2)

and \[ a_0 = \langle f,\omega _0 \rangle = \frac{1}{\sqrt{\pi}} \int_{-1}^1 \frac{|x|}{\sqrt{1-x^2}} \,{\text d}x = \frac{1}{\sqrt{\pi}} \int_0^{\pi} |\cos\theta |\,{\text d}\theta = 2\,\frac{1}{\sqrt{\pi}} , \]

Integrate[Abs[x]/Sqrt[1 - x^2], {x, -1, 1}]

Integrate[Abs[Cos[t]], {t, 0, Pi}]

So \[ a_0 = \frac{2}{\sqrt{\pi}}, \qquad a_n = \sqrt{\frac{2}{\pi }} \left( -\frac{2}{n^2 -1}\,\cos \frac{n\pi}{2} \right) , \quad n=1,2,\ldots . \] One can show (standard Fourier–cosine computation) that \[ |\cos \theta |=\frac{2}{\pi }+\frac{4}{\pi }\sum _{k=1}^{\infty }\frac{(-1)^k}{1-4k^2}\cos (2k\theta ), \] so the coefficients are: \[ a_0=\langle f,\omega _0\rangle =\frac{1}{\sqrt{\pi }}\int _{-1}^1|x|\, \frac{dx}{\sqrt{1-x^2}}=\frac{2}{\sqrt{\pi }}, \] and for k ≥ 1, \[ a_{2k}=\sqrt{\frac{2}{\pi }}\int _0^{\pi }|\cos \theta |\cos (2k\theta )\, {\text d}\theta =\sqrt{\frac{2}{\pi }} \cdot \frac{2(-1)^k}{1-4k^2}, \] while \[ a_{2k+1}=0. \] Thus, the Fourier–Chebyshev series is genuinely infinite: \[ f(x)=|x|\sim a_0\, \omega _0(x)+\sum _{k=1}^{\infty }a_{2k}\, \omega _{2k}(x), \] with \[ a_0 = \frac{2}{\sqrt{\pi }},\qquad a_{2k}=\sqrt{\frac{2}{\pi }}\cdot \frac{4}{\pi }\cdot \frac{(-1)^k}{1-4k^2},\quad k\geq 1. \] In terms of Tₙ, this is the classical expansion \[ |x| = \frac{2}{\pi }+\frac{4}{\pi }\sum _{k=1}^{\infty }\frac{(-1)^k}{1-4k^2}\, T_{2k}(x), \] which does not terminate.

For illustration of Theorem 4, we define partial sums \[ S_N(x)=\sum _{n=0}^N a_n\, \omega _n(x). \] We use Mathematica to plot some partial sums.

s5[x_] = 2/Pi + 4*Sum[(-1)^k /(1 - 4*k^2) *ChebyshevT[2*k, x], {k, 1, 5}]/Pi; s20[x_] = 2/Pi + 4* Sum[(-1)^k /(1 - 4*k^2) *ChebyshevT[2*k, x], {k, 1, 20}]/Pi; Plot[{s5[x], s20[x]}, {x, -1, 1}, PlotStyle -> {{Thick, Red}, {Thick, Blue}}]

Figure 6.1: Chebyshev approximations with n = 5 terms (red) and n = 20 terms (blue).

As you can observe, even five term approximation gives a reasonable accuracy.

Theorem 4 (in this Chebyshev setting) says that the corresponding series converges in the mean.

Therefore:

The Chebyshev–Fourier series of |x| is infinite.
The partial sums S_N are Chebyshev polynomials (finite combinations of Tₙ).
These polynomials approximate |x| in the mean square sense with respect to the weight w(x) = 1/√(1-x²).
The 𝔏²_w-error of approximation goes to zero as N → ∞.

That is exactly Theorem 4 in a concrete, nontrivial, infinite‑series example. ■

End of Example 7

Theorem 5: (Uniqueness of Fourier Coefficients) Let ℌ be a Hilbert space and let { ωₙ(x) } be a complete orthonormal system in ℌ. If f ∈ ℌ, then the Fourier coefficients \[ a_n = \langle f, \omega_n \rangle \] uniquely determine f (up to equality almost everywhere in the 𝔏² case).

Equivalently:

If f,g ∈ ℌ satisfy \[ \langle f, \omega_n \rangle = \langle g, \omega_n \rangle \qquad \forall n , \] then \[ f = g \quad \mbox{in } ℌ , \] In 𝔏²-spaces, this means \[ f(x) = g(x) \quad \mbox{ almost everywhere}. \]

Suppose that two functions, f and g, have the same expansion coefficients, that is, \[ a_k = \langle \omega_k , f \rangle = \langle \omega_k , g \rangle . \] Hence, ⟨ ω_k , f − g ⟩ = 0, so by Theorem 3, f − g = 0, giving f = g

Now consider the converse problem. Does a given function have unique set of expansion coefficients? Assume that \[ \lim_{n\to\infty} \left\| \, f - \sum_{k=1}^n a_k \omega_k \right\| = \lim_{n\to\infty} \left\| \, f - \sum_{k=1}^n b_k \omega_k \right\| ; \] that is, assume that there are two partial sums with different expansion coefficients that converge in the mean to the same function f. If the expansion coefficients are unique, then 𝑎ₙ = bₙ for all n. To prove this, we observe that \begin{align*} \left\| \sum_{k=1}^n a_k \omega_k -\sum_{k=1}^n b_k \omega_k \right\| &= \left\| \sum_{k=1}^n a_k \omega_k - f + f - \sum_{k=1}^n b_k \omega_k \right\| \\ &\le \left\| \, f - \sum_{k=1}^n a_k \omega_k \right\| + \left\| \, f - \sum_{k=1}^n b_k \omega_k \right\| , \end{align*} where we have used the triangle inequality. Now given any ε, we can choose by assumption an n large enough so that both the last two norms are less than ε/2. Therefore, for such an n, \[ \left\| \sum_{k=1}^n a_k \omega_k - \sum_{k=1}^n b_k \omega_k \right\| = \left\| \sum_{k=1}^n \left( a_k - b_k \right) \omega_k \right\| = \left[ \sum_{k=1}^n \left( a_k - b_k \right)^2 \right]^{1/2} < \varepsilon . \] However, this can only be true if 𝑎ₙ = bₙ. So the expansion coefficients of a given function are unique. Since the set { ωₙ(x) } is complete orthonormal system of functions, it follows that 𝑎ₙ = bₙ = ⟨ ωₙ , f ⟩, the Fourier coefficients.

Example 8: Chebyshev polynomials of the second kind satisfy \[ U_n(\cos \theta )=\frac{\sin ((n+1)\theta )}{\sin \theta } = \sum_{i=0}^n P_{n-i} \left( \cos\theta\right) P_i \left( \cos\theta\right) , \]

Simplify[ Sum[LegendreP[4 - i, Cos[t]]*LegendreP[i, Cos[t]], {i, 0, 4}]]

1 + 2 Cos[2 t] + 2 Cos[4 t]

Simplify[ChebyshevU[4, Cos[t]]]

1 + 2 Cos[2 t] + 2 Cos[4 t]

where P_i is the Legendre polynomial of degree i. The Chebyshev polynomials of second kind are orthogonal on [-1,1] with weight \[ w(x)=\sqrt{1-x^2}. \] Define the Hilbert space ℌ = 𝔏²([-1,1], w(x) dx), with inner product \[ \langle f,g\rangle =\int _{-1}^1f(x)g(x)\sqrt{1-x^2}\, {\text d}x. \] The orthogonality relation becomes \[ \int _{-1}^1U_m(x)U_n(x)\sqrt{1-x^2}\, {\text d}x = \frac{\pi }{2}\, \delta _{mn} , \] where δ_mn is the Kronecker delta. Thus, an orthonormal system is \[ \omega _n(x)=\sqrt{\frac{2}{\pi }}\, U_n(x),\qquad n\geq 0. \] This system is complete in ℌ, so Theorem 4 applies.

Let’s take f(x) = |x|. This function is even, continuous, and non‑polynomial — perfect for an infinite Chebyshev‑Uₙ expansion.

Compute the Fourier–Chebyshev coefficients: \[ a_n =\langle f,\omega _n\rangle =\sqrt{\frac{2}{\pi }}\int _{-1}^1|x|\, U_n(x)\sqrt{1-x^2}\, {\text d}x. \] Use the substitution x = cosθ, θ ∈ [0, π]: \[ {\text d}x = -\sin \theta \, {\text d}\theta , \quad \sqrt{1-x^2}=\sin \theta , \quad |x|=|\cos \theta | \] Then \[ a_n = \sqrt{\frac{2}{\pi }}\int _0^{\pi }|\cos \theta |\sin ((n+1)\theta )\sin \theta \, {\text d}\theta . \]

Integrate[Abs[Cos[t]]*Sin[(n + 1)*t]*Sin[t], {t, 0, Pi}, Assumptions -> n \[Element] Integers]

(-2 Cos[(n \[Pi])/2] + Sin[n \[Pi]])/(-3 + 2 n + n^2)

So Mathematica tells us that \[ a_n = \sqrt{\frac{2}{\pi }}\, \frac{-2}{-3 + 2 n + n^2}\,\cos \left( \frac{n\pi}{2} \right) = \sqrt{\frac{2}{\pi }}\, \frac{-2}{(n-1)(n+3)} \times \begin{cases} 0, & \quad \mbox{if $n$ is odd}, \\ (-1)^k & \quad \mbox{if }\ n= 2k, \quad k=0,1,2,\ldots . \end{cases} \] Because |cosθ| is even around π/2, only even n survive. A classical Fourier computation yields the explicit formula: \[ a_{2k+1} = 0,\qquad a_{2k} = -2\sqrt{\frac{2}{\pi}}\cdot \frac{(-1)^k}{(2k-1)(2k+3)}. \] Thus, the Fourier–Chebyshev series is genuinely infinite. Putting the coefficients together, we obtain \[ |x|\sim \sum _{k=0}^{\infty }a_{2k}\, \omega _{2k}(x) = -\frac{4}{\pi}\, \sum _{k=0}^{\infty} \frac{(-1)^k}{(2k-1)(2k+3)} \, U_{2k}(x). \] This is a true infinite series — it does not terminate.

For illustration of Theorem 4, we define the partial sums \[ S_N(x) =\sum _{n=0}^N a_n\, \omega _n(x). \] We use Mathematica to plot some partial sums with 5 and 20 terms.

u5[x_] = -4* Sum[(-1)^k *ChebyshevU[2*k, x]/(2*k + 3)/(2*k - 1), {k, 0, 5}]/Pi; u20[x_] = -4* Sum[(-1)^k *ChebyshevU[2*k, x]/(2*k + 3)/(2*k - 1), {k, 0, 20}]/Pi; Plot[{u5[x], u20[x]}, {x, -1, 1}, PlotStyle -> {{Thick, Red}, {Thick, Blue}}]

Figure 7.1: Chebyshev-2 approximations with n = 5 terms (red) and n = 20 terms (blue).

Each partial sum S_N is a (finite) linear combination of Chebyshev polynomial of the second kind.
These polynomials approximate |x| in the mean‑square sense with respect to the weight √(1-x²).
The approximation improves monotonically as N → ∞.
The infinite series converges in the exact sense guaranteed by Theorem 4.

We present another illustrative example of Theorem 4 using Chebyshev polynomials of the third kind, usually denoted by Vₙ(x). This gives you a genuinely infinite Fourier–Chebyshev expansion in a weighted Hilbert space, exactly parallel to the Uₙ example.

Chebyshev polynomials of the third kind Vₙ form an orthogonal system in the Hilbert space ℌ = 𝔏⊃([−1, 1], w dx) with weight \[ w(x) = (1 - x)^{-1/2} \cdot (1 + x)^{1/2} = \sqrt{\frac{1+x}{1-x}} . \] Chebyshev polynomials of the third kind Vₙ(x) are defined by either the recurrence relation \[ \begin{split} V_{n+1}(x) &= 2x \, V_n(x) - V_{n-1}(x) , \qquad n = 1,2,\ldots , \\ V_0 (x) &= 1 , \quad V_1 (x) = 2x - 1, \end{split} \] or via Chebyshev polynomials of the first and second kind \[ V_n (x) = T_n (x) - \left( 1-x \right) U_{n-1}(x) , \] or through Jacobi polynomials \[ V_n (x) = \frac{n!}{(1/2)_n}\, P_n^{(-1/2, 1/2)}(x) = \frac{P_n^{(-1/2, 1/2)}(x)}{P_n^{(-1/2, 1/2)}(1)} = \frac{4^n \left( n! \right)^2}{(2n)!}\,P_n^{(-1/2, 1/2)}(x) , \] where (1/2)ₙ is the Pochhammer symbol (rising factorial).

Similar to other Chebyshev polynomials, we can express Vₙ as \[ V_{n} (\cos\theta ) = \frac{\cos \left( \frac{2n+1}{2}\,\theta \right)}{\cos \left( \frac{\theta}{2} \right)} . \] Chebyshev polynomials of the third kind are orthogonal on [-1,1] with weight \[ w(x) = (1 - x)^{-1/2} \cdot (1 + x)^{1/2} = \sqrt{\frac{1+x}{1-x}} . \] Then the inner product in ℌ is \[ \langle f,g\rangle =\int_{-1}^1 f(x)\,g(x)\, w(x)\, {\text d}x = \int_{-1}^1 f(x)\,g(x)\, \sqrt{\frac{1+x}{1-x}} \, {\text d}x . \] The orthogonality relation becomes \[ \int _{-1}^1 V_m(x)\,V_n(x)\, w(x)\, {\text d}x =\frac{\pi }{2}\, \delta _{mn}. \] Thus, the corresponding orthonormal system becomes \[ \omega _n(x) =\sqrt{\frac{2}{\pi}}\, V_n(x),\qquad n\geq 0. \] This system is complete in ℌ = 𝔏²([−1,1], wdx), so Theorem 4 applies. The Fourier coefficients are determined as \[ a_n = \langle f, \omega_n \rangle = \sqrt{\frac{2}{\pi}} \int_0^{\pi} f(\cos\theta )\,2\cos\left( \frac{\theta}{2} \right) \cos \left( n + \frac{1}{2} \right) \theta \,{\text d}\theta , \quad n=0,1,2,\ldots . \] For function f(x) = |x|, Mathematica evaliate \[ a_n = \sqrt{\frac{2}{\pi}} \cdot \frac{1}{n \left( n^2 -1 \right) \left( n+2 \right)} \left[ 2n^2 \sin \left( \frac{n\pi}{2} \right) -4n\,\cos \left( \frac{n\pi}{2} \right) -2n^2 \cos \left( \frac{n\pi}{2} \right) -2\,\sin\left( \frac{n\pi}{2} \right) \right] , \quad n\ge 2 . \]

2*Integrate[Abs[Cos[t]]*Cos[(n + 1/2)*t]*Cos[t/2], {t, 0, Pi}]

(-4 n Cos[(n \[Pi])/2] - 2 n^2 Cos[(n \[Pi])/2] - 2 Sin[(n \[Pi])/2] + 2 n^2 Sin[(n \[Pi])/2] + Sin[n \[Pi]] + n Sin[n \[Pi]] + n^2 Sin[n \[Pi]])/((-1 + n) n (1 + n) (2 + n))

Choose another function with a genuinely infinite expansion: \[ f(x)=\sqrt{1+x}. \] This is a natural choice because:

it is not a polynomial,
it is square‑integrable with respect to the weight w(x),
its Chebyshev–Vₙ expansion is infinite.

Compute the Fourier–Chebyshev coefficients: \[ a_n =\langle f,\omega _n\rangle =\sqrt{\frac{2}{\pi }}\int _{-1}^1 \sqrt{1+x}\, V_n(x)\, w(x)\, {\text d}x. \] Use the substitution x = cosθ, θ ∈ [0, π]. Then: \[ V_n (\cos \theta ) = \frac{\cos (n + 1/2)\theta )}{\cos ( \theta/2 )} , \quad n=0,1,2,\ldots , \] and \[ {\text d}x = -\sin \theta \, {\text d}\theta , \qquad \sqrt{1-x} = \sqrt{2}\,\sin (\theta /2) . \] Putting everything together, we get \[ a_n = \sqrt{\frac{2}{\pi }}\int_{0}^{\pi} \frac{1+\cos\theta}{\sqrt{1-\cos\theta}} \, \frac{\cos (n + 1/2)\theta )}{\cos ( \theta/2 )}\,\sin\theta\,{\text d}\theta , \]

2*Integrate[(1 + Cos[t])*Cos[(n + 1/2)*t]* Sin[t/2]/Sqrt[1 - Cos[t]], {t, 0, Pi}, Assumptions -> n \[Element] Integers]

-((8 Sqrt[2] Cos[n \[Pi]])/((3 + 2 n) (-1 + 4 n^2)))

which we simplify using trigonometric identities, This gives \[ a_n = \sqrt{\frac{2}{\pi }}\int_{0}^{\pi} \sqrt{2}\left( 1 + \cos\theta \right)\cos (n + 1/2)\theta )\,{\text d}\theta . \] Now we ask Mathematica to evaluate the integral:

Sqrt[2]*Integrate[(1 + Cos[t])*Cos[(n + 1/2)*t], {t, 0, Pi}]

(8 Sqrt[2] Cos[n \[Pi]])/(3 + 2 n - 12 n^2 - 8 n^3)

This is exactly the same output when Mathematica was applied to unsimplified expression \[ \sqrt{1+x} \,\sim\, \frac{2}{\pi} \sum_{n\ge 0} \frac{8\sqrt{2} (-1)^n}{3 + 2 n - 12 n^2 - 8 n^3} \,V_n (x) . \] Define partial sums \[ S_N(x)=\sum _{n=0}^N a_n\, \omega _n(x) = \sqrt{\frac{2}{\pi}} \sum _{n=0}^N a_n\,V_n (x) , \] Interpretation:

Each S_N is a finite Chebyshev polynomial of the third kind.
These polynomials approximate \sqrt{1+x} in the mean‑square sense with respect to the weight w(x)=1/(\sqrt{1-x^2}(1+x)).
The approximation improves monotonically as N → ∞.
The infinite series converges exactly as Theorem 4 guarantees.

This is a perfect demonstration of Theorem 4 using the third‑kind Chebyshev system, with a non‑polynomial function and a true infinite Fourier expansion. ■

End of Example 8

A subset $ \displaystyle \ \mathcal{D} \ $ of a Hilbert space ℌ is dense if every element of ℌ can be approximated arbitrarily well (in the norm of ℌ) by elements of $ \displaystyle \ \mathcal{D}. \ $ Concretely, for every f ∈ ℌ and every ε > 0, there exists g ∈ $ \displaystyle \mathcal{D} \quad $ with \[ \| f-g\|_H < \varepsilon . \] This is the analytic expression of the geometric idea that $ \displaystyle \quad \mathcal{D} \quad $ “fills” the space: no nonzero vector is orthogonal to all of $ \displaystyle \quad \mathcal{D}. \quad $ Equivalently, \[ \overline{\mathcal{D}} = ℌ\quad \Longleftrightarrow \quad \mathcal{D}^{\perp } = \{ 0\} . \]

Completeness of an orthogonal system { ϕₙ } ⊆ ℌ means that its finite linear combinations form a dense set. In that case, every f ∈ ℌ admits a convergent Fourier expansion \[ f\sim \sum _{n=0}^{\infty }c_n\phi _n, \] with convergence in the norm of ℌ. This is the setting in which Parseval’s identity and the usual Fourier approximation theory operate.

A set of trigonometric function

\begin{equation} \label{EqOrtho.1} 1, \ \cos x , \ \sin x , \ cos 2x, \ \sin 2x , \ \ldots ,\ \cos (nx) , \ \sin (nx) , \ \ldots , \end{equation}

provides a typical example of a complete orthogonal system in Hilbert space 𝔏²([−π, π]) of square integrable functions on any interval of length 2π. Functions in this system are not normalized, so we need to divide each of them by corresponding norm to obtain the orthonormal set of functions:

\[ \left\{ \frac{1}{\sqrt{2\pi}},\ \frac{\cos n\theta}{\sqrt{\pi}} ,\ \frac{\sin n\theta}{\sqrt{\pi}} \right\}_{n\geq 1}. \]

Its finite linear combinations—the trigonometric polynomials—are dense in 𝔏²(−π,π). This follows from the orthogonality relations

\[ \int _{-\pi}^{\pi }\cos m\theta \, \cos n\theta \, {\text d}\theta =\pi \delta _{mn},\qquad \int_{-\pi}^{\pi} \sin m\theta \, \sin n\theta \, {\text d}\theta =\pi \delta _{mn}, \]

and the fact that no nonzero 𝔏²-function can be orthogonal to all sines and cosines. Parseval’s identity gives

\[ \frac{1}{\pi} \int_{-\pi}^{\pi} \vert f(x) \vert^2 {\text d}x = \frac{1}{2}\,a_0^2 + \sum_{n\ge 1} \left( a_n^2 + b_n^2 \right) . \]

The trigonometric system is therefore the canonical model of a complete orthogonal system. Its density also follows from the Sturm--Liouville theory. Indeed, let us consider the unbounded differential operator

\[ L=-\frac{{\text d}^2}{{\text d}x^2} \]

on Hilbert space ℌ = 𝔏²(−π, π) with periodic boundary conditions

\[ f(-\pi ) = f(\pi ),\qquad f'(-\pi ) = f'(\pi ). \]

This is a self‑adjoint non-negative operator on a Hilbert space ℌ. Its eigenfunctions are exactly \eqref{EqOrtho.1} with eigenvalues {n²}. The spectral theory says that for such a regular self‑adjoint problem on a finite interval, the eigenfunctions form a complete orthogonal set in 𝔏². “Complete” here means their closed linear span is the whole space. So the trigonometric system is dense in 𝔏²(−π,π) because it is the eigenbasis of a self‑adjoint operator with discrete spectrum.

Theorem 6: An orthonormal system { ωₙ(x) } in Hilbert space ℌ is closed if and only if Parseval's identity holds for a dense family of functions.

Necessary condition is obvious.

We need to prove only its sufficience. Let us consider operation Sₙ that transfers a function f(x) into a linear combination spanned on first n functions ω₁ , ω₂ , … , ωₙ by calculating n-th partial Fourier sum Sₙ(f; x). This transformation satisfies the following properties:

Sₙ(f₁ + f₂) = Sₙ(f₁) + Sₙ(f₂);
Sₙ(λ f) = λ Sₙ(f), λ ∈ ℂ;
∥Sₙ(f)∥ ≤ ∥f∥.

Property "a" reflects the fact that the Fourier coefficient of a sum of two functions is the sum of Fourier coefficients of corresponding functions, which follows from the corresponding property of inner product. This is also true for property "b". Property "c" is just a finite version of Bessel's inequality: \[ \sum_{i=1}^n \left\vert \langle f , \omega_i \rangle \right\vert^2 \le \| f \|^2 . \]

For any ε > 0 and any function f ∈ 𝔏², find a function g(x) from a dense set A such that \[ \| f - g \| < \frac{\varepsilon}{3} . \] Since for functions from family A the closeness condition is valid, there exists an integer N such that for n ≥ N, we have \[ \| g - S_n (g) \| \le \frac{\varepsilon}{3} . \] Let us estimate the norm of the difference: \begin{align*} \| f - S_n (f) \| &= \| f -g + g - S_n (g) + S_n (g) - S_n (f) \| \\ &\leqslant \| f - g \| + \| g - S_n (g) \| + \| S_n (g) - S_n (f) \| \\ &\leqslant 2 \| f - g \| + \| g - S_n (g) \| < \varepsilon \end{align*} for n ≥ N.

Before we work on the nest example, we need some preliminary information.

A function f : [0,1] → ℝ is a dyadic step function if there exists some integer N such that: \[ f(x) = c_n\quad \mathrm{for\ }x\in \left[ \frac{n}{2^N},\, \frac{n+1}{2^N}\right) ,\qquad n=0,1,\dots ,2^N -1. \] The constants cₙ can be arbitrary real numbers.

The following dyadic function \[ \psi (x) = \begin{cases} \phantom{-}1, &\quad \mbox{for } 0 \le x < 1/2 , \\ -1 &\quad \mbox{for } 1/2 \le x < 1 , \\ \phantom{-}0, &\quad \mbox{elsewhere.} \end{cases} \] is called the mother wavelet or the Haar function..

The Haar sequence was proposed in 1909 by the Hungarian mathematician Alfréd Haar (1885--1933). A dyadic step function is a step function whose “steps’’ occur at dyadic rationals, meaning numbers of the form \[ \frac{k}{2^n},\qquad k,n\in \mathbb{Z}. \] It is one of the fundamental building blocks in analysis, probability, and harmonic analysis because it aligns perfectly with binary subdivision of the interval. A dyadic step function on an interval (usually [0,1]) is a function that is:

Piecewise constant.
Constant on each dyadic interval \[ I_{n,k}=\left[ \frac{k}{2^n},\, \frac{k+1}{2^n}\right) . \]
Allowed to jump only at dyadic points $ \displaystyle \quad \frac{k}{2^n}. $

Example 9: Let ℌ = 𝔏²(ℝ), with usuall inner product. Define the Haar mother wavelet: \[ \psi _{j,k}(x)=2^{j/2}\, \psi (2^jx-k) \] for j,k ∈ ℤ. Then { ψ_j,k }_{j,k ∈ ℤ} is an orthonormal system in 𝔏²(ℝ).

It is a dense family in ℌ where Parseval's identity holds. Let us consider the set \[ \mathcal{D} = \left\{ f\in 𝔏^2(\mathbb{R}):f\mathrm{\ has\ compact\ support\ and\ is\ piecewise\ constant\ on\ dyadic\ intervals\ }[m2^{-N},(m+1)2^{-N})\right\} . \] These dyadic step functions are dense in 𝔏²(ℝ) (they approximate any 𝔏² function by local averaging on fine dyadic grids).

Wavelet expansion on $ \displaystyle \quad \mathcal{D}\ : \ $ For such an f, only finitely many Haar coefficients \[ c_{j,k}=\langle f,\psi _{j,k}\rangle \] are nonzero, because f is constant on sufficiently fine dyadic intervals and has compact support.

Parseval's identity on $ \displaystyle \quad \mathcal{D}: \ $ For each $ \displaystyle \quad f\in \mathcal{D}, $ \[ \| f\| _{𝔏^2(\mathbb{R})}^2=\sum _{j,k}|\langle f,\psi _{j,k}\rangle |^2, \] where the sum is actually finite (so there is no convergence issue). Thus Parseval’s identity holds for all f in the dense set $ \displaystyle \quad \mathcal{D}. $

Theorem 6 says: An orthonormal system { ωₙ } in ℌ is closed (complete) if and only if Parseval’s identity holds for a dense family of functions.

We have: { ψ_j,k } is orthonormal in 𝔏²(ℝ). There is a dense set $ \displaystyle \quad \mathcal{D} \subset 𝔏^2(\mathbb{R}) $ (dyadic step functions) such that Parseval's identity holds for every $ \displaystyle \quad f\in \mathcal{D}. $ Therefore, by Theorem 6, the Haar wavelet system { ψ_j,k } is closed, i.e., complete in 𝔏²(ℝ). We have:

{ ψ_j,k } is orthonormal in 𝔏²(ℝ).
There is a dense set $ \displaystyle \quad \mathcal{D} \subset 𝔏^2;(\mathbb{R}) $ (dyadic step functions) such that Parseval holds for every f ∈ $ \displaystyle \quad \mathcal{D}. $

Therefore, by Theorem 6, the Haar wavelet system { ψ_j,k } is closed, i.e., complete in 𝔏²(ℝ). ■

End of Example 9

Orthogonalization

We recall some definitions from Linear Algebra.

For given finite list of functions { ϕ₁, ϕ₂, … , ϕₙ } and scalars b₁, b₂, … , bₙ, the expression \[ b_1 \phi_1 + b_2 \phi_2 + \cdots + b_n \phi_n \] is called the linear combination functions ϕ₁, ϕ₂, … , ϕₙ.

A set of functions ϕ₁, ϕ₂, … , ϕₙ is called linear independent if no function in the set can be expressed as a linear combination (constant multiple) of the others. A set of functions is linearly independent if b₁ϕ₁ + b₂ϕ₂ + ⋯ + bₙϕₙ = 0 mplies all constants b_i are zero.

An (infinite) set of functions { ϕₙ }, n = 1, 2, … , is called linearly independent if any finite subset of these functions is linearly independent.

Suppose we know an ordered basis { ϕₙ }_ₙ≥1 in a separable Hilbert space ℌ with inner product ⟨·∣·⟩. This means that elements in the system { ϕₙ } are linearly independent and the set of all finite linear combinations $ \displaystyle \ \sum_{i=1}^n c_i \phi_i , \ $ where coefficients c_i are arbitrary scalars and integer n can be any positive integer, form a dense subset in ℌ.

We are going to find an orthogonal basis { ψₙ } using Gram–Schmidt process. We start with a basis of two elements { ϕ₁, ϕ₂ } that are linearly independent. let us take the first of them as our first element in a new orthogonal basis, so let ψ₁ = ϕ₁. As our next vector, we choose a difference between ϕ₂ and its projection on ϕ₁:

\[ \psi_2 = \phi_2 - \frac{\langle \phi_2 \mid \phi_1 \rangle}{\langle \phi_1 \mid \phi_1 \rangle}\,\phi_1 = \frac{\langle \phi_2 \mid \phi_1 \rangle}{\| \phi_1 \|^2} \,\phi_1 . \]

This vector ψ₂ is linearly independent of ψ₁ because it is not a scalar multiple of ψ₁ = ϕ₁, but it is a linear combination of two elements, ϕ₁ and ϕ₂. This observation is a core of orthogonalization.

For arbitrary n ∈ ℤ, we build a system of n elements { ψ₁, ψ₂, … , ψₙ } that is linearly independent and mutually orthogonal:

\begin{align*} \psi_1 &= \phi_1 , \\ \psi_2 &= \phi_1 + b_{2,1}\phi_1 , \\ \psi_3 &= \phi_3 + b_{3,2} \phi_3 + b_{3,1} \phi_1 , \\ \vdots& \quad \vdots \\ \psi_n &= \phi_n + b_{n,n-1} \phi_{n-1} + b_{n,n-2}\phi_{n-2} + \cdots + b_{n,1} \phi_1 . \end{align*}

Our objective is to determine coefficients b_i,j in such a way that vectors ψ₁, ψ₂, … , ψₙ are non-zero and mutually orthogonal (then they will be linearly independent). In other words, the list of n elements { ψ₁, ψ₂, … , ψₙ } forms an orthogonal basis for the span of { ϕ₁, ϕ₂, … , ϕₙ }. We build the required system { ψ₁, ψ₂, … , ψₙ } inductively. Suppose that such system of n elements has been already determined. We seek the next element ψ_n+1 as a linear combination:

\[ \psi_{n+1} = \phi_{n+1} + c_1 \psi_1 + c_2 \psi_2 + \cdots + c_n \psi_n . \]

For each i = 1, 2, … , n, we evaluate the dot product of the ψ_n+1 with ψ_i to obtain

\[ 0 = \langle \psi_{n+1} \mid \psi_i \rangle = \langle \phi_{n+1} \mid \psi_i \rangle + c_i \langle \psi_i \mid \psi_i \rangle \]

because the list { ψ₁, ψ₂, … , ψₙ } is mutually orthogonal by construction. Since ⟨ ψ_i ∣ ψ_i ⟩ = ∥ψ_i∥² ≠ 0, the above equation has a unique solution, c_i. Hence, vector ψ_n+1 is orthogonal to all previously determined elements { ψ₁, ψ₂, … , ψₙ }. Moreover, this vector ψ_n+1 is non-zero. Indeed, substituting into the formula

\[ \psi_{n+1} = \phi_{n+1} + \sum_{i=1}^n c_i \psi_i \]

instead of each ψ_i its expression through ϕ_i, we get

\[ \psi_{n+1} = \phi_{n+1} + \sum_{i=1}^n c_i \sum_{j=1}^i b_{i,j} \phi_j . \]

This expression cannot be zero because elements in the list { ϕ₁, ϕ₂, … , ϕₙ, ϕ_n+1 } are linearly independent, and the right hand-side is just their linear combination.

Now we prove that the system { ψ₁, ψ₂, … , ψₙ, …} is complete. Choose arbitrary f ∈ ℌ and ε > 0. Since the set of all linear combinations $ \displaystyle \ \sum_{k=1}^n c_k \phi_k \ $ is dense in ℌ, then there exist scalars b₁, b₂, … , bₙ and integer n such that

\[ \left\| \, f - \sum_{k=1}^n b_k \phi_k \right\| < \varepsilon . \]

Any element ϕ_i ∈ ℌ is expressed as a linear combination of vectors { ψ;₁, ψ;₂, … , ψₙ }, therefore, the inequality above can be rewritten as

\[ \left\| \, f - \sum_{k=1}^n c_k \psi_k \right\| < \varepsilon , \]

\[ \left\| \, f - \sum_{k=1}^n d_k \omega_k \right\| < \varepsilon , \qquad \omega_k = \frac{\psi_k}{\| \psi_k \|} . \]

According to Theorem 2, we have

\[ \left\| \, f - \sum_{k=1}^n a_k \omega_k \right\| \le \left\| \, f - \sum_{k=1}^n d_k \omega_k \right\| < \varepsilon , \]

where 𝑎_i = ⟨ f∣ω_i ⟩ are Fourier coefficients of function f. This means that the orthogonal systems { ω_i } and so { ψ_i } are complete.

The Gram-Schmidt process does not generally produce a unique output, as the result depends heavily on the order of the input vectors. While it produces a unique, consistent orthonormal basis for a specific ordered set of input vectors, reordering the input vectors will result in a different, albeit valid, orthogonal basis. We summarize our observations in the following statement.

Lemma: (Uniqueness of Gram–Schmidt with sign/phase convention) Let V be an inner product space over 𝔽 = ℝ or ℂ, and let { ϕ₁, ϕ₂, … , ϕₙ } be a linearly independent ordered family in V. Suppose { ψ₁, ψ₂, … , ψₙ } ⊂ V satisfies:

Orthogonality: \[ \langle \psi _i,\psi _j\rangle =0\quad \mathrm{for\ }i\neq j. \]
Span preservation: \[ \mbox{span} \{ \phi _1,\dots ,\phi _k \} =\operatorname{span} \{ \psi _1,\dots ,\psi _k\} \quad \mathrm{for\ all\ }k=1,\dots ,n. \]
Sign/phase convention: \[ \langle \varphi _k,\psi _k\rangle >0\quad \mathrm{for\ all\ }k\quad \mathrm{(over\ }\mathbb{R}), \] or more generally over ℂ, \[ \langle \varphi _k,\psi _k\rangle \in (0,\infty )\subset \mathbb{R}\quad \mathrm{(i.e.,\ real\ and\ positive)}. \]

Then the family { ψ₁, ψ₂, … , ψₙ } is uniquely determined by { ϕ₁, ϕ₂, … , ϕₙ }. In particular, any two such families coincide.

We argue by induction on k.

Step 1: Base case n = 1. We need to show:

If ψ₁ is a nonzero vector with span{ ψ₁ } = span{ ϕ₁ } and ⟨ ϕ₁∣ψ₁ ⟩ > 0, then ψ₁ is uniquely determined.

Since ψ₁ ∈ span{ ϕ₁ }, there exists a scalar α ≠ 0 such that \[ \psi_1 =\alpha \, \phi_1 . \] Then \[ \langle \varphi _1,\psi _1\rangle =\langle \varphi _1,\alpha \varphi _1\rangle =\alpha \, \langle \varphi _1,\varphi _1\rangle . \] Because ϕ₁ ≠ 0, we have ⟨ ϕ₁∣ϕ₁ ⟩ > 0. The condition ⟨ ϕ₁∣ψ₁ ⟩ > 0 forces α > 0 (in ℝ), so α is uniquely determined. Hence ψ₁ is unique.

(Over ℂ, the same argument shows α must be a positive real scalar, i.e., we fix the phase.)

Step 2: Inductive hypothesis Assume the lemma holds for k-1:
If { ψ₁, ψ₂, … , ψ_k-1 } and $ \displaystyle \ \{ \tilde{\psi}_1 , \tilde{\psi}_2 , \dots ,\tilde{\psi}_{k-1}\} \ $ are two orthogonal families such that \[ \operatorname{span} \{ \varphi _1,\dots ,\varphi _j\} =\operatorname{span} \{ \psi _1,\dots ,\psi _j\} =\operatorname{span} \{ \tilde {\psi }_1,\dots ,\tilde {\psi }_j\} \mbox{ for all } j\leq k-1, \ \langle \varphi _j,\psi _j\rangle >0 \] and \[ \langle \varphi _j,\tilde {\psi }_j\rangle >0 \quad \forall j\leq k-1, \] then $ \displaystyle \ \psi _j = \tilde {\psi }_j \ $ for all j ≤ k-1.

We now prove uniqueness for the k-th vector.
Step 3: Uniqueness of ψₖ. Suppose we have two Gram–Schmidt outputs \[ \{ \psi _1,\dots ,\psi _k\} \quad \mathrm{and}\quad \{ \tilde {\psi }_1,\dots ,\tilde {\psi }_k\} \] satisfying:

Both are orthogonal families.
For each j ≤ k, \[ \mbox{span} \{ \phi _1,\dots ,\phi _j\} =\mbox{span} \{ \psi _1,\dots ,\psi _j\} =\operatorname{span} \{ \tilde {\psi }_1,\dots ,\tilde {\psi }_j\} . \]
\[ \langle \varphi _j,\psi _j\rangle >0 \ \mbox{ and }\ \langle \varphi _j,\tilde {\psi }_j\rangle >0 \quad \forall j\leq k. \]

By the inductive hypothesis, we already know \[ \psi _j =\tilde {\psi }_j\quad \mathrm{for\ }j=1,\dots ,k-1. \] Now look at ψₖ and $ \displaystyle \quad \tilde{\psi}_k. $

Both lie in the same subspace: By span preservation, \[ \psi _k,\tilde {\psi }_k\in \mbox{span} \{ \phi _1,\dots ,\phi _k\} = \mbox{span} \{ \psi _1,\dots ,\psi _k\} = \mbox{span} \{ \tilde {\psi }_1,\dots ,\tilde {\psi }_k\} . \]
Both are orthogonal to the previous vectors: Since each family is orthogonal, \[ \langle \psi _k,\psi _j\rangle =0,\quad \langle \tilde {\psi }_k ,\tilde{\psi}_j\rangle = 0\quad \mathrm{for\ }j \le k-1 . \] Using $ \displaystyle \quad \psi _j=\tilde {\psi }_j \quad $ for j = 1, 2, … , k−1, \[ W_{k-1}:=\mbox{span} \{ \psi _1,\dots ,\psi _{k-1}\} = \mbox{span} \{ \phi _1,\dots ,\phi _{k-1}\} . \]
They live in the same 1-dimensional complement: Consider the subspace \[ V_k := \mbox{span} \{ \phi_1, \phi_2 , \dots ,\phi _k\} . \] We know W_k-1 ⊂ Vₖ, and $ \displaystyle \ \psi_k , \tilde{\psi}_k \in V_k \ $ while both are orthogonal to W_k-1. In an inner product space, the orthogonal complement of W_k-1 inside Vₖ is 1-dimensional (because { ϕ₁, ϕ₂, … , ϕₖ } are linearly independent and we’ve already accounted for k-1 dimensions in W_k-1.

Hence, ψₖ and $ \displaystyle \ \tilde {\psi }_k \ $ must be scalar multiples of each other: \[ \tilde {\psi}_k =\alpha \, \psi _k\quad \mathrm{for\ some\ }\alpha \neq 0. \]
Use the positivity condition to fix the scalar: We have \[ \langle \phi_k,\tilde{\psi}_k \rangle = \langle \phi_k , \alpha \psi_k\rangle =\alpha \, \langle \phi _k ,\psi_k \rangle . \] Both $ \displaystyle \ \langle \phi_k , \tilde{\psi }_k\rangle \ \mbox{ and } \ \langle \phi_k , \psi_k\rangle \ $ are strictly positive real numbers by assumption. This forces α > 0. Therefore, \[ \tilde {\psi }_k =\psi_k . \]

This completes the inductive step. By induction on k, we have shown: For a given ordered basis { ϕ₁, ϕ₂, … , ϕₙ }, any orthogonal family { ψ₁, ψ₂, … , ψₙ } satisfying

span{ ϕ₁, ϕ₂, … , ϕₖ } = span{ ψ₁, ψ₂, … , ψₖ } for all k,
⟨ ϕₖ∣ψₖ ⟩ > 0 for all k, is unique.

As an important application of orthogonalization procedure, let us consider 𝒫⟦x⟧, the set of all polynomials with real coefficients and 𝒫_≤n⟦x⟧ be its subset of polynomials of degree up to n. Both these vector spaces have a basis (in the usual sense), consisting of monomials 1, x, x², x³, …, which terminates for 𝒫_≤n⟦x⟧.

The set of all polynomials is dense in the space ℭ([𝑎, b]) of all continuous functions on finite interval [𝑎, b]. Stone--Weierstrass's theorem assures us that every continuous function defined on a closed finite interval [𝑎, b] can be uniformly approximated as closely as desired by a polynomial function. Since the set of continuous functions is dence in Hilbert space ℌ = 𝔏², the set of polynomials is dense in ℌ. So we conclude that the set of all monomials { xⁿ } is dense in the Hilbert space ℌ.

However, the Stone---Weierstrass theorem would be false if it claimed to produce a uniformly convergent power series, known as Taylor's series. For instance, these is no power series that converges uniformly to the continuous function √x in the interval [0,1]. Taylor series are beautiful theoretically and essential in analysis, but numerically they are too fragile, especially for practical numerical approximations because they behave poorly outside a narrow region and are expensive or unstable to compute. The core issue is that Taylor series are local objects, while numerical approximations usually need global stability and accuracy.

Weierstrass's theorem for approximations by a sequence of polynomials is in one sense much stronger than Taylor's theorem for expansion in power series. Weierstrass's theorem demonstrates the existence of polynomial approximations outside the radius of convergence of a Taylor series. However, there is, in general, no possibility of rearranging the uniform convergent sequence of polynomials that approximate any continuous function as to produce a convergent Taylor's series. Therefore, polynomials are more suitable for approximations than power series. The following examples demonstrate how we can generate orthogonal polynomials from the set of monomials { xⁿ }_n≥0 using the Gram–Schmidt process.

Gram-Schmidt orthogonalization is a method that takes a non-orthogonal list of linearly independent function and literally constructs an orthogonal ordered set in Hilbert space 𝔏^²_w over an arbitrary interval and with respect to an arbitrary weighting function w. Here for convenience, all functions are assumed to be real. Three cases of possible intervals include [−1,1], semi-infinite interval [0,∞), and ℝ, are the most important ones; the corresponding outputs of the Gram--Schmidt orthogonalizations of the same list of monomials are called the classical orthogonal polynomials, and we will study these three case in the following examples.

Example 10: The Legendre polynomials appear in many different contenxts, so they can be defined, for instance, via the Rodrigues' formula \[ P_n (x) = \frac{1}{2^n n!}\,\frac{{\text d}^n}{{\text d}x^n} \left( x^2 -1 \right)^n = \frac{1}{2^n}\,\sum_0^{\lfloor n/2 \rfloor} (-1)^{k} \frac{(2n-2k)!}{k! \,(n-k)! \, (n-2k)!}\, x^{n-2k} , \tag{9.1} \] or by recurrence \[ \left( n+1 \right) P_{n+1} (x) = \left( 2n+1 \right) P_n - n\,P_{n-1} (x) , \qquad P_0 = 1, \quad P_1 (x) = x . \tag{9.2} \]

u[i_,x_] = x^i ; a[i_, j_] = Integrate[u[i, x]*u[j, x], {x, -1, 1}]; gram[n_] = Det[Table[a[i, j], {i, 0, n}, {j, 0, n}]]; gram[-1] = 1; \[CapitalPhi][n_, x_] = Det[Append[Table[a[i, j], {i, 0, n}, {j, 0, n}], x^Range[0, n]]] // Simplify; \[CapitalPhi][n_, x_] = \[CapitalPhi][n, x] /Sqrt[gram[n - 1]*gram[n]]; Table[{n, \[CapitalPhi][n, x], LegendreP[n, x]}, {n, 0, 5}] // Simplify

The polynomials were named "Legendre coefficients" by the British mathematician Isaac Todhunter in honor of the French mathematician Adrien-Marie Legendre (1752–1833), who was the first to introduce and study them. Todhunter called the functions "coefficients", instead of "polynomials", because they appear as coefficients in the expansion of the generating function; Todhunter also introduced the notation Pₙ, which is still generally used. Legendre's polynomials have been introduced by Legendre in a memoir Sur l'attraction des sphéroïdes homogènes published in the Mémoires de Mathématiques et de Physique, présentés à l'Académie royale des sciences par sçavants étrangers, Tome x, pp. 411–435, Paris, 1785.

With the aid of Mathematica, we plot Legendre's polynomial P₅ for real and complex argument.

Plot[LegendreP[5, x], {x, -1.04, 1.04}, PlotStyle -> Thick, PlotLegends -> "Legendre[5,x]"]
ComplexPlot3D[LegendreP[5, z], {z, -1 - I, 1 + I}, PlotLegends -> Automatic]

Figure 9.1: Legendre polynomial P₅(x) on real axis.

Figure 9.2: Legendre polynomial P₅(z) for complex variable.

We use the Gram-Schmidt orthogonalization procedure to generate Legendre polynomials from the list of monomials \[ 1 , \ x, \ x^2 , \ x^3 , \ \ldots , x^n , \ \ldots , \] in Hilbert space ℌ = 𝔏²([−1,1]) with inner product \[ \langle f \mid g \rangle = \int_{-1}^1 f(x)\,g(x)\,{\text d} x. \]

The first two items, ϕ₀ = 1 and ϕ₁ = x, in the list { xⁿ } are orthogonal: \[ \langle \phi_0 \mid \phi_1 \rangle = \langle 1 \mid x \rangle = \int_{-1}^1 x\,{\text d} x = 0 . \] So we can set ψ₀ = ϕ₀ = 1 and ψ₁ = ϕ₁ = x. The next element in the orthogonal list is taken as a linear combination \[ \psi_2 = \phi_2 + b_{20}\psi_0 + b_1\psi_1 = x^2 + b_{20} + b_{21} x . \] This vector ψ₂ must be orthogonal to both, ψ₀ and ψ₁. So this leads to \begin{align*} 0&= \langle \psi_2 \mid \phi_0 \rangle = \langle x^2 \mid 1 \rangle + b_{20} \langle 1 \mid 1 \rangle + b_{21} \langle x \mid 1 \rangle = \frac{2}{3} + 2\,b_{20} , \\ 0&= \langle \psi_2 \mid \phi_1 \rangle = \langle x^2 \mid x \rangle + b_{20} \langle 1 \mid x \rangle + b_{21} \langle x \mid x \rangle = b_{21}\,\frac{2}{3} . \end{align*}

Integrate[1, {x, -1,

Integrate[x^2 , {x, -1,

2/3

Hence we get b₂₀ = −⅓ and b₂₁ = 0. So \[ \psi_2 (x) = x^2 - \frac{1}{3} , \qquad P_2 (x) = \frac{1}{2} \left( 3 x^2 -1 \right) . \] For n = 3, we set \[ \psi_3 (x) = x^3 + b_{30}\phi_0 + b_{31}\phi_1 + b_{32}\phi_2 = x^3 + b_{30} + b_{31} x + b_{32} x^2 . \] Its inner product with ϕ₀, ϕ₁, and ϕ₂ must be zero, so \begin{align*} 0&= \langle \psi_3 \mid \phi_0 \rangle = \langle x^3 \mid 1 \rangle + b_{30} \langle 1 \mid 1 \rangle + b_{31} \langle x \mid 1 \rangle + b_{32} \langle x^2 \mid 1 \rangle = 2\,b_{30} + b_{32} \frac{2}{3} , \\ 0&= \langle \psi_3 \mid \phi_1 \rangle = \langle x^3 \mid x \rangle + b_{30} \langle 1 \mid x \rangle + b_{31} \langle x \mid x \rangle + b_{32} \langle x^2 \mid x \rangle = \frac{2}{5} + b_{31}\,\frac{2}{3} , \\ 0&= \langle \psi_3 \mid \phi_2 \rangle = \langle x^3 \mid x^2 \rangle + b_{30} \langle 1 \mid x^2 \rangle + b_{31} \langle x \mid x^2 \rangle + b_{32} \langle x^2 \mid x^2 \rangle = b_{30}\,\frac{2}{3} + b_{32} \,\frac{2}{5} . \end{align*} Solving the corresponding system of equations \begin{align*} 0&= 2\,b_{30} + b_{32} \frac{2}{3} , \\ 0&= \frac{2}{5} + b_{31}\,\frac{2}{3} , \\ 0&= b_{30}\,\frac{2}{3} + b_{32} \,\frac{2}{5} , \end{align*} we obtain

Solve[{b30*2 + b32*2/3 == 0, 2/5 + b31*2/3 == 0, b30*2/3 + b32*2/5 == 0}, {b30, b31, b32}]

\[ \psi_3 = x^3 - \frac{3}{5}\,x , \qquad P_3 (x) = \frac{1}{2} \left( 5 x^3 - 3x \right) . \] For n = 4, we set \[ \psi_4 (x) = x^4 + b_{40}\phi_0 + b_{41}\phi_1 + b_{42}\phi_2 + b_{43}\phi_3 \] Taking the inner product with ϕ₀, ϕ₁, ϕ₂, and ϕ₃, we get \begin{align*} 0&= \langle \psi_4 \mid \phi_0 \rangle = \langle x^4 \mid 1 \rangle + b_{40} \langle 1 \mid 1 \rangle + b_{41} \langle x \mid 1 \rangle + b_{42} \langle x^2 \mid 1 \rangle + b_{43} \langle x^3 \mid 1 \rangle = \frac{2}{5} + 2\,b_{40} + b_{42} \frac{2}{3} , \\ 0&= \langle \psi_4 \mid \phi_1 \rangle = \langle x^4 \mid x \rangle + b_{40} \langle 1 \mid x \rangle + b_{41} \langle x \mid x \rangle + b_{42} \langle x^2 \mid x \rangle + b_{43} \langle x^3 \mid x \rangle = b_{41}\,\frac{2}{3} + b_{43} \,\frac{2}{5} , \\ 0&= \langle \psi_4 \mid \phi_2 \rangle = \langle x^4 \mid x^2 \rangle + b_{40} \langle 1 \mid x^2 \rangle + b_{41} \langle x \mid x^2 \rangle + b_{42} \langle x^2 \mid x^2 \rangle + b_{43} \langle x^3 \mid x^2 \rangle = \frac{2}{7} + b_{40}\,\frac{2}{3} + b_{42} \,\frac{2}{5} , \\ 0&= \langle \psi_4 \mid \phi_3 \rangle = \langle x^4 \mid x^3 \rangle + b_{40} \langle 1 \mid x^3 \rangle + b_{41} \langle x \mid x^3 \rangle + b_{42} \langle x^2 \mid x^3 \rangle + b_{43} \langle x^3 \mid x^3 \rangle = b_{41}\,\frac{2}{5} + b_{43} \,\frac{2}{7} . \end{align*} Solving this system of equations, we obtain \[ \psi_4 (x) = x^4 + \frac{3}{35} - \frac{6}{7}\, x^2 , \qquad P_4 (x) = \frac{1}{8} \left( 35\,x^4 -30\, x^2 + 3 \right) . \]

Solve[{2/5 + b40*2 + b42*2/3 == 0, b41*2/3 + b43*2/5 == 0, 2/7 + b40*2/3 + b42*2/5 == 0, b41*2/5 + b43*2/7 == 0}, {b40, b41, b42, b43}]

These formulas allow us to observe that applying the Gram–Schmidt process to the ordered basis $ \displaystyle \quad 1,\; x,\; x^2,\; x^3,\; \dots \quad $ produces an orthogonal sequence of monic polynomials { qₙ }_{n ≥ 0}, each of degree n, which are constant multiples of the classical Legendre polynomials (9.1).

Since our algorithm of the Gram–Schmidt version keeps the leading coefficient 1, then each qₙ(x) is monic: \[ q_n(x) = x^n + \mathrm{(lower\ degree\ terms)}. \] By construction:

degree of qₙ(x) = n,
⟨ qₙ∣qₖ ⟩ = 0 for n ≠ k,
and the leading coefficient of qₙ is 1.

Fix n. Consider all polynomials of degree at most n. Among them, look at monic degree-n polynomials: \[ p(x) = x^n + a_{n-1} x^{n-1} + \dots + a_0 . \] The orthogonality conditions \[ \langle p \mid x^k \rangle = 0,\quad k=0,1,\dots ,n-1 \] are n linear equations in the n unknowns 𝑎₀, 𝑎₁, … , 𝑎_n-1. The Gram–Schmidt construction shows that this system has a solution (namely, the coefficients of qₙ are determined). Because the inner product is nondegenerate on polynomials, this system has a unique solution. Hence:

Lemma: (Uniqueness) For each n, there is a unique monic polynomial pₙ of degree n that is orthogonal to all polynomials of lower degree (equivalently, to 1, x, x², … , x^n-1).

So the Gram–Schmidt orthogonalization produces exactly these monic polynomials qₙ that are uniquely identified by the above three properties. We are going to show that Legendre's polynomials, upon multiplication by some non-zero factor, also satisfy these three properties, Due to uniqueness, these polynomials must be exactly those that are generated by the Gram--Schmidt process.

Legendre polynomials satisfy the same orthogonality conditions, \[ \langle P_n \mid P_k \rangle = \int_{-1}^1 P_n (x)\, P_k (x)\,{\text d}x = 0 \quad\mbox{for}\quad n \ne k . \] For k < n, we integrate by parts \begin{align*} A_{kn} &= \langle x^k \mid P_n \rangle = \int_{-1}^1 x^k P_n (x)\,{\text d}x \\ &= \frac{1}{2^n n!} \int_{-1}^1 x^k \left[ \left( x^2 -1 \right)^n \right]^{(n)} {\text d}x \\ &= \left. \frac{x^k}{2^n n!} \,\frac{{\text d}^{n-1}}{{\text d}x^{n-1}} \left( x^2 -1 \right)^n \right\vert_{x=-1}^{x=1} - \frac{k}{2^n n!} \int_{-1}^1 x^{k-1}\left[ \left( x^2 -1 \right)^n \right]^{(n-1)} {\text d}x \tag{9.3} \end{align*} Since polynomial (x² − 1)ⁿ has zeroes of multiplicity n at points ±1, all its derivatives up to the order n−1 vanish at these points. Hence, all terms outside integral in Eq.(9.3) are zero. Then we integrate k−1 times by parts integral (9.3) and obtain for k < n that \begin{align*} A_{kn} &= \frac{(-1)^k k!}{2^n n!} \int_{-1}^1 \left[ \left( x^2 -1 \right)^n \right]^{(n-k)} {\text d}x \\ &= \left. \frac{(-1)^k k!}{2^n n!} \,\frac{{\text d}^{n-k-1}}{{\text d}x^{n-k-1}} \left( x^2 -1 \right)^n \right\vert_{x=-1}^{x=1} = 0 . \end{align*} So Legendre's polynomials are orthogonal.

The leading coefficient of the classical Legendre polynomial (9.1) is \[ \frac{(2n)^{\underline{n}}}{2^n n!} = \frac{2n \left( 2n-1 \right) \cdots \left( n+1 \right)}{2^n n!} = \frac{(2n)!}{2^n \left( n! \right)^2} . \] Therefore, if we divide by this coefficient the Legendre polynomial, we obtain a monic polynomial: \[ \tilde{P}_n (x) = \frac{2^n \left( n! \right)^2}{(2n)!}\, P_n (x) = \frac{n!}{(2n)!}\,\frac{{\text d}^n}{{\text d}x^n} \left( x^2 -1 \right)^n . \]

Let us calculate their norms: \begin{align*} \| P_n \|^2 &= \int_{-1}^1 P_n^2 (x)\,{\text d} x = \frac{(2n)!}{2^n \left( n! \right)^2} \int_{-1}^1 x^n P_n (x)\,{\text d} x \\ &= \frac{(2n)!}{2^{2n} \left( n! \right)^2} \int_{-1}^1 x^n \left[ \left( x^2 -1 \right)^n \right]^{(n)} {\text d} x \\ &= (-1)^n \frac{(2n)!}{2^{2n} \left( n! \right)^2} \int_{-1}^1 \left( x^2 -1 \right)^n {\text d} x . \end{align*} The latter integral we also evaluate by integrating by parts. \begin{align*} (-1)^n \frac{(2n)!}{2^{2n} \left( n! \right)^2} \,\| P_n \|^2 &= \int_{-1}^1 \left( x+1 \right)^n \left( x-1 \right)^n {\text d} x \\ &= \left. \left( x+1 \right)^n \left( x-1 \right)^{n+1} \frac{1}{n+1} \right\vert_{x=-1}^{x=1} - \frac{n}{n+1} \int_{-1}^1 \left( x+1 \right)^{n-1} \left( x-1 \right)^{n+1} {\text d} x \\ &= (-1)^n \,\frac{\left( n! \right)^2}{(2n)!} \int_{-1}^1 \left( x-1 \right)^{2n} {\text d} x \\ &= (-1)^n \,\frac{\left( n! \right)^2}{(2n)!} \,\frac{2^{2n+1}}{2n+1} . \end{align*} Finally, we get \[ ,\| P_n \|^2 = \int_{-1}^1 P_n^2 (x) \,{\text d}x = \frac{2}{2n+1} . \]

We summarize some facts regarding the (classical) Legendre polynomials Pₙ that are defined by Rodrigues’ formula (9.1).

Degree of Pₙ = n.
Orthogonality: \[ \int _{-1}^1 P_n(x)P_m(x)\, dx=0,\quad n\neq m. \] In particular, for fixed n, Pₙ is orthogonal to every polynomial of degree less than n.
Leading coefficient: Pₙ has leading term \[ P_n(x)=\frac{2^n}{(2n)!}\, (2n-1)!!\, x^n + \cdots , \] so the leading coefficient is nonzero.

Thus, if we define the monic Legendre polynomial \[ \tilde {P}_n(x) := \frac{1}{\mathrm{(leading\ coefficient\ of\ }P_n)}\, P_n(x) =\frac{2^n(n!)^2}{(2n)!}\, P_n(x), \] then:

$ \displaystyle \ \tilde {P}_n \ $ is monic,
Degree of monic polynomial $ \displaystyle \ \tilde {P}_n \ $ is n,
$ \displaystyle \ \tilde {P}_n \ $ is orthogonal to all polynomials of degree less than n.

So $ \displaystyle \ \tilde {P}_n \ $ satisfies exactly the same defining properties as qₙ. For each n, there is a unique monic polynomial qₙ of degree n that is orthogonal to all polynomials of lower degree (equivalently, to 1, x, x², … , x^n-1). Hence, the Gram–Schmidt process produces exactly this unique monic polynomial qₙ that is equal to $ \displaystyle \ \tilde {P}_n . $ ■

End of Example 10

ClearAll[x, inner, gsMonic] (*Inner product on[-1,1]*) inner[f_, g_] := Integrate[f g, {x, -1, 1}] (*Gram\[Dash]Schmidt producing monic polynomials*) gsMonic[n_] := Module[{basis, orth = {}, v, proj, p, lc}, basis = Table[x^k, {k, 0, n - 1}]; Do[v = basis[[k]]; proj = Sum[(inner[v, orth[[j]]]/ inner[orth[[j]], orth[[j]]]) orth[[j]], {j, Length[orth]}]; p = Expand[v - proj]; (*make polynomial monic*)lc = Coefficient[p, x, Exponent[p, x]]; p = Expand[p/lc]; AppendTo[orth, p], {k, Length[basis]}]; orth] (*First five monic orthogonal polynomials*) TableForm[gsMonic[5]]

Example 11: We consider Hilbert space ℌ = 𝔏²([0, ∞), e^−xdx) with inner product \[ \langle f \mid g\rangle \; =\; \int _0^{\infty} f(x)\, g(x)\, e^{-x}\, {\text d}x. \] Monomials \[ \left\{ x^n \right\} , \qquad n= 0,1,2,\ldots \] form a linearly independent sequence that is dense in this Hilbert space. Since they are not orthogonal, we apply the Gram--Schmidt process to transform them into orthogonal system of monic polynomials { ψₙ }_n≥0, each of them is a constant multiple of the classical Laguaerre polynomial, usually denoted by Lₙ(x).

The Laguerre polynomials, named after the French mathematician Edmond Laguerre (1834–1886), may be defined by the Rodrigues formula, \[ L_n (x) = \frac{e^x}{n!}\,\frac{{\text d}^n}{{\text d} x^n} \left( e^{-x} x^n \right) = \frac{1}{n!} \left( \frac{\text d}{{\text d}x} -1 \right)^n x^n , \quad n=0,1,2,\ldots . \tag{10.1} \] Actually, there is no evidence that Edmond Laguerre worked or was familiar with the Laguaerre polynomials that were invented by Pafnuty Chebyshev in 1859, \[ L_n (x) = \sum_{k=0}^n \frac{(-1)^k}{k!} \,\binom{n}{k} \,x^k , \tag{10.2} \] where $ \displaystyle \ \binom{n}{k} = \frac{n!}{k!\,(n-k)!} \ $ is the binomial coefficient.

An orthogonalization procedure is applied to a given linearly independent sequence { ϕ₀, ϕ₁, … , ϕₙ, …} to define \begin{align*} u_0 &= \phi_0 , \\ u_n &= \phi_n - \sum_{k=0}^{n-1} \frac{\langle \phi_n \mid u_k \rangle}{\langle u_k \mid u_k \rangle}\, \phi_k , \quad n\ge 1. \end{align*} Then { uₙ } is an orthogonal sequence, and each uₙ lies in span{ ϕ₀, ϕ₁, … , ϕₙ }.

If we want monic polynomials, we simply normalize each uₙ by dividing by its leading coefficient: \[ p_n(x)\; =\; \frac{1}{\mathrm{(leading\ coefficient\ of\ }u_n)}\, u_n(x). \] Then:

Each pₙ is a polynomial of degree n.
Each pₙ is monic.
The family { pₙ }_n≥0 is orthogonal with respect to ⟨ · ∣ · ⟩.

When a sequence of monic polynomials { ϕ₀, ϕ₁, … , ϕₙ, …} is given as, for isntance, { 1, x, x², … , xⁿ, …}, the Gram–Schmidt orthogonalization produces a unique sequence { pₙ }_n≥0 of monic orthogonal polynomials for this inner product.

The key idea now is: Laguerre polynomials also form such a sequence. If we show that the (appropriately normalized) Laguerre polynomials are monic and orthogonal with respect to the same inner product, then a uniqueness argument will force { pₙ } to coincide with them.

The Laguerre polynomials satisfy the orthogonality relation \[ \langle L_n \mid L_m \rangle = \int _0^{\infty }L_n(x)\, L_m(x)\, e^{-x}\, {\text d}x = \begin{cases} 0, &\quad n\neq m , \\ 1 , &\quad n=m . \end{cases} \] Mathematica confirms:

Integrate[LaguerreL[5, x]*LaguerreL[5, x]/Exp[x], {x, 0, Infinity}]

For our purposes, it is convenient to consider the monic version of Laguerre polynomials: \[ \hat {L}_n(x)\; :=\; (-1)^nn!\, L_n(x). \] We will show:

$ \displaystyle \ \hat {L}_n(x) \ $ is monic of degree n.
List of monic polynomials $ \displaystyle \ \{ \hat {L}_n\} $ is orthogonal with respect to $ \displaystyle \ \langle f,g\rangle =\int _0^{\infty }f(x)g(x)e^{-x}\, {\text d}x.$
Once that is done, uniqueness of monic orthogonal polynomials will give $ \displaystyle \ \hat {L}_n = p_n \ $ for all n.

Starting from the Rodrigues formula, we apply Leibniz’ rule \[ \left( f\,g \right)^{(n)} = \sum_{k=0}^n \binom{n}{k} f^{(n-k)} g^{(k)} , \] to the product e^−xxⁿ. The derivative of the power function xⁿ is \[ \frac{{\text d}^{\, n-k}}{{\text d}x^{\, n-k}}\left( x^n \right) = \frac{n!}{k!}\, x^k,\qquad 0\leq k\leq n. \] On the other hand, the derivative of the exponential function is well-known: \[ \frac{{\text d}^{\, k}}{{\text d}x^{\, k}}(e^{-x})=(-1)^k e^{-x}. \] Substitution back into Rodrigues’ formula yields \[ L_n (x) =\frac{e^x}{n!}\sum_{k=0}^n {n \choose k}\, \frac{n!}{k!}x^k\, (-1)^ke^{-x} = \sum _{k=0}^n{n \choose k}\, \frac{(-1)^k}{k!}\, x^k. \] So we obtain the explicit formula showing that the Laguerre polynomial Lₙ(x) is a finite sum of powers x^k with k = 0,1, 2, … , n. The coefficient of the highest power xⁿ is \[ (-1)^n{n \choose n}\frac{1}{n!} = \frac{(-1)^n}{n!}\neq 0, \] So the degree of Lₙ(x) is exactly n.

Therefore, \[ \hat {L}_n(x) =(-1)^n n!\,L_n(x) \] has leading coefficient 1, i.e., $ \displaystyle \ \hat {L}_n \ $ is monic of degree n.

We now show that { Lₙ } (and hence $ \displaystyle \ \{ \hat {L}_n\} $ ) is orthogonal with respect to $ \displaystyle \ \langle f,g\rangle =\int _0^{\infty }f(x)g(x)e^{-x}\, {\text d}x. \ $ Take m < n. Consider \[ \int _0^{\infty }L_n(x)\, x^m\, e^{-x}\, {\text d}x. \] Using the Rodrigues formula, we get \[ \int _0^{\infty} x^m\, \frac{e^x}{n!}\,\frac{{\text d}^n}{{\text d} x^n} \left( e^{-x} x^n \right) e^{-x}\, {\text d}x = \int _0^{\infty} x^m\, \frac{1}{n!}\,\frac{{\text d}^n}{{\text d} x^n} \left( e^{-x} x^n \right) {\text d}x . \] Integrate by parts n times. Each integration by parts moves one derivative from $ \displaystyle \ \frac{{\text d}^n}{{\text d}x^n}(e^{-x}x^n) \ $ onto x^m. Because m < n, after m + 1 derivatives of x^m becomes zero. More systematically:

After k integrations by parts, the integrand involves $ \displaystyle \ \frac{{\text d}^{n-k}}{{\text d}x^{n-k}}(e^{-x}x^n) \times \frac{d^k}{dx^k}x^m. $
For k>m, $ \displaystyle \ \frac{{\text d}^k}{{\text d}x^k}x^m=0. $

Boundary terms vanish because e^−xxⁿ decays at ∞ and is 0 at 0 for n ≥ 1. Thus, the integral is zero: \[ \int _0^{\infty }L_n(x)\, x^m\, e^{-x}\, {\text d}x =0,\quad m < n . \] Now, any polynomial q(x) of degree < n is a linear combination of monomials { 1, x, x², … , xⁿ⁻¹ }, so by linearity, \[ \int _0^{\infty }L_n(x)\, q(x)\, e^{-x}\, {\text d}x = 0,\quad \deg q < n . \] In particular, for m < n, L_m(x) has degree m < n, so \[ \int _0^{\infty }L_n(x)\, L_m(x)\, e^{-x}\, {\text d}x=0,\quad m < n . \] Thus, { Lₙ } is an orthogonal sequence, and so is $ \displaystyle \ \{ \hat{L}_n\} , $ since multiplying by nonzero constants does not affect orthogonality.

Uniqueness of monic orthogonal polynomials follows from a simple but powerful uniqueness lemma.

Lemma: (Uniqueness) For each n, there is a unique monic polynomial pₙ of degree n that is orthogonal to all polynomials of lower degree (equivalently, to 1, x, x², … , x^n-1).

Proof. Fix n, and suppose that there exist two polynomials pₙ and qₙ of degree n that satisfy conditions of Lemma. Since both pₙ and qₙ are of degree n and monic, their difference \[ r_n(x):=p_n(x)-q_n(x) \] is a polynomial of degree at most n-1. Because { pₖ } is orthogonal and pₙ is orthogonal to all polynomials of degree < n, we have \[ \langle p_n,r_n\rangle =0, \] since rₙ has degree ≤ n-1.

On the other hand, \[ 0 = \langle p_n,r_n\rangle =\langle p_n,p_n-q_n\rangle =\langle p_n,p_n\rangle -\langle p_n,q_n\rangle . \] But qₙ is also orthogonal to all polynomials of degree < n, and pₙ has degree n, so the only way ⟨ pₙ , qₙ ⟩ can be nonzero if pₙ and qₙ are linearly dependent. However, they are both monic of the same degree, so if they are linearly dependent, each of them must be a scalar multiple of another. This scalar must be 1, i,e,, qₙ ≡ pₙ. In that case, rₙ = 0 and we done.

A more direct argument: since rₙ has degree ≤ n-1, orthogonality of pₙ and qₙ to all polynomials of degree ≤ n-1 implies \[ \langle p_n, r_n\rangle = \langle q_n, r_n\rangle = 0. \] Since rₙ = pₙ − qₙ has degree ≤ n-1, we also get \[ \langle q_n,r_n\rangle =0\; \Rightarrow \; \langle q_n,p_n\rangle =\langle q_n,q_n\rangle . \] But $ \displaystyle \ \langle p_n,q_n\rangle =\langle q_n,p_n\rangle , \ $ so \[ \langle p_n,p_n\rangle =\langle q_n,q_n\rangle . \] Then \[ 0=\langle p_n,r_n\rangle =\langle p_n,p_n\rangle -\langle p_n,q_n\rangle =\langle p_n,p_n\rangle -\langle q_n,q_n\rangle =0, \] which is consistent but doesn’t yet force equality. To pin it down, note that if rₙ ≠ 0, then rₙ has degree ≤ n-1 and is nonzero, so it cannot be orthogonal to all polynomials of degree ≤ n-1 unless the inner product is degenerate. Since our inner product is nondegenerate on polynomials (because $ \displaystyle \ \int _0^{\infty }|p(x)|^2e^{-x}dx=0 \ $ implies p ≡ 0), we must have rₙ = 0. Hence, pₙ = qₙ.

In short: for a nondegenerate inner product, there is at most one monic orthogonal polynomial of each degree. ▣

This gives us a conclusion: the Gram–Schmidt process gives the Laguerre polynomials.

Application of the Gram–Schmidt process to the sequence of monomials { 1, x, x², … , xⁿ, …} produces a sequence of monic orthogonal polynomials { pₙ(x) }_n≥0, where \[ p_n(x)=x^n+\mathrm{(lower\ degree\ terms)},\quad \deg p_n=n, \] and \[ \langle p_n,p_m\rangle =0\quad \mathrm{for\ }n\neq m. \] These pₙ(x) will turn out to satisfy a three-term recurrence (usually called the difference equation): \[ x\, p_n(x)\; =\; p_{n+1}(x)+\alpha _n\, p_n(x)+\beta _n\, p_{n-1}(x),\qquad n\geq 1, \] with \[ p_0 (x) = 1,\qquad p_1(x) =x-\alpha _0, \] and real coefficients αₙ, βₙ > 0. The reason is purely Gram–Schmidt/linear algebra:

Degree constraint: x pₖ has degree n+1, so it can be expressed in the basis { p₀, p₁, … , p_n+1 }.
Monicity: both x pₖ and p_n+1 are monic of degree n+1, so the coefficient in front of p_n+1 must be 1.
Orthogonality: the component of x pₙ along pₖ for k ≤ n-2 must vanish, because \[ \langle xp_n, p_k\rangle =\langle p_n, xp_k\rangle , \]
and x pₖ is a linear combination of p₀, p₁, … , p_k+1, all orthogonal to pₙ when k+1 ≤ n-1. So ⟨ x pₖ∣pₖ ⟩ = 0 if |n-k|>1. Hence, only p_n+1, pₖ, p_n-1 can appear in the recurrence.

Key observation: x pₙ lives in span{ p₀, p₁, … , p_k+1 }. Since degree of pₙ = n, we have degree of (x pₙ) = n+1. The orthogonal polynomials { pₖ } form a basis of the polynomial space, so \[ xp_n(x)=\sum _{k=0}^{n+1}a_{n,k}\, p_k(x). \] Because p_n+1 is monic, the coefficient of xⁿ⁺¹ on both sides forces \[ a_{n,n+1}=1. \] So we can write \[ xp_n(x)=p_{n+1}(x)+\sum _{k=0}^n a_{n,k}\, p_k(x). \] Orthogonality kills all but three terms: \[ \langle x\,p_n, p_j\rangle =0\quad \mathrm{for\ }j\leq n-2 . \]

Now use orthogonality, and take inner product with p_j: \[ \left\langle x\,p_n \mid p_j \right\rangle = \left\langle p_{n+1} \mid p_j \right\rangle + \sum _{k=0}^n a_{n,k}\left\langle p_k,p_j\right\rangle . \] Moreover, the coefficients are given directly by inner products: \[ \alpha_n =\frac{\langle xp_n , p_n\rangle }{\langle p_n , p_n\rangle },\qquad \beta _n =\frac{\langle xp_n , p_{n-1}\rangle }{\langle p_{n-1} , p_{n-1}\rangle }. \]

In our case, we know that pₙ = (−1)ⁿn! Lₙ, so \[ \langle p_n , p_n\rangle = \| p_n \|^2 = \left( n! \right)^2 \int_0^{\infty} L_n^2 (x)\,e^{-x} {\text d} x = \left( n! \right)^2 . \] Since \[ x\,L_n (x) = \left( 2n+1 \right) L_n (x) - \left( n+1 \right) L_{n+1} (x) -n\,L_{n-1} (x) , \] we find \[ \langle xp_n \mid p_n\rangle = \left( 2n+1 \right) \| p_n \|^2 = \left( 2n+1 \right) \left( n! \right)^2 . \] This gives \[ \alpha_n = 2n+1 , \qquad \beta_n = -n . \] ■

End of Example 11

Example 12: Like the other classical orthogonal polynomials, the Hermite polynomials can be defined from several different starting points. In this example, we use orthogonalization procedure to derive these polynomials. They were invented and studied in detail by the Russian scientist Pafnuty Chebyshev in 1859. Five years later, Charles Hermite provided a deep analysis of these polynomials. Therefore, they were known for almost 100 years as Chebyshev--Hermite polynomials. It should be noted that these polynomials were mentioned previously in 1810 by Pierre-Simon Laplace in scarcely recognizable form.

Hermite polynomials, denoted by Hₙ(x), can be defined recursively \[ H_{n+1} (x) = 2x\,H_n (x) - 2n\,H_{n-1} (x) , \qquad H_0 (x) = 1, \quad H_1 (x) = 2x. \tag{11.1} \] Mathematica has a build-in command:

HermiteH[5, x] == Expand[2*x*HermiteH[4, x] - 8*HermiteH[3, x]]

True

We plot H₅ in real and complex domain:

Plot[HermiteH[5, x], {x, -2.5, 2.5}, PlotStyle -> Thick, PlotLegends -> "Hermite[5,x]"]
ComplexPlot3D[HermiteH[5, z], {z, -1 - I, 1 + I}, PlotLegends -> Automatic]

Figure 11.1: Hermite polynomial H₅(x) on real axis.

Figure 11.2: Hermite polynomial H₅(z) for complex variable.

Be aware that there are the "probabilist's Hermite polynomials", given by \[ \mbox{He}_n ( x ) = ( − 1 )^n e^{x^2 /2} \frac{{\text d}^n}{{\text d} x^n}\, e^{− x^2 /2} , \qquad n=0,1,2,\ldots . \]

The "physicist's Hermite polynomial" Hₙ(x) can be derived by orthogonalization from the list of monomials \[ \phi_n (x) = x^n \qquad \left( n=0,1,2,\ldots \right) , \] in Hilbert space ℌ = 𝔏²(ℝ, wdx) = 𝔏^²_w with the weight function \[ w(x) = e^{-x^2} . \] These monomials { xⁿ } are linearly independent, but not orthogonal in ℌ. We verify their linearly independence with Mathematica by showing that their Wronskian is not zero.

G[k_] := Table[x^n, {n, 0, k}]; W[k_, m_] := D[G[k], {x, m}]; WR[k_] := Table[W[k, m], {m, 0, k}]; Det[WR[6]]

24883200

Starting with n = 0, let \[ \psi_0 = \phi_0 = 1 , \quad \omega_0 = \frac{1}{\| 1 \|} = \frac{1}{\pi^{1/4}} = \pi^{-1/4} \] because \[ \int_{-\infty}^{\infty} {\text d} x = \sqrt{\pi} . \]

Integrate[Exp[-x^2], {x, -Infinity, Infinity}]

Sqrt[\[Pi]]

For n = 1, let \[ \psi_1 = \phi_1 + b_{10}\phi_0 = x + b_{10} . \] This function is orthogonal to ϕ₀ if its inner product with this function vanishes: \[ 0 = \langle \psi_1 \mid \phi_0 \rangle = \int_{-\infty}^{\infty} \left( x + b_{10} \right) e^{-x^2} {\text d} x = b_{10} \int_{-\infty}^{\infty} e^{-x^2} {\text d} x . \] Since the integral of b₁₀ is non-zero, we conclude that b₁₀ = 0. Hence, ψ₁ = x.

For n = 2, let \[ \psi_2 = \phi_2 + b_{20}\phi_0 + b_{21} \phi_1 = x^2 + b_{20} + b_{21} x . \] Its inner product with ϕ₀ = 1 gives \[ 0 = \langle \psi_2 \mid 1 \rangle = \int_{-\infty}^{\infty} \left( x^2 + b_{20} + b_{21} x \right) e^{-x^2} {\text d} x = b_{20} \sqrt{\pi} + \frac{\sqrt{\pi}}{2} . \]

Integrate[x^2 * Exp[-x^2], {x, -Infinity, Infinity}]

Sqrt[\[Pi]]/2

This yields a linear equation \[ b_{20} + \frac{1}{2} = 0 \qquad \Longrightarrow \qquad b_{20} = - \frac{1}{2} . \] Projection onto ϕ₀: \[ \mathfrak{\mathrm{proj}_{\mathrm{\phi}_0}}(x^2) =\frac{\langle x^2,1\rangle }{\langle 1,1\rangle }\phi_0 =\frac{\frac{\sqrt{\pi }}{2}}{\sqrt{\pi }}\cdot 1=\frac{1}{2}. \] Taking the inner product with ϕ₁ = x, we get \[ 0 = \langle \psi_2 \mid x \rangle = b_{21} \| x^2 \|^2 \qquad \Longrightarrow \qquad b_{21} = 0 . \] Hence, ψ₂ = x² − ½ = 4 H₂(x). Projection onto ϕ₁ is zero.

For n = 3, let \[ \psi_3 = \phi_3 + b_{30}\phi_0 + b_{31} \phi_1 +b_{32} \phi_2 = x^3 + b_{32} x^2 + b_{31} x + b_{30} . \] Taking the inner product with 1, we get \[ 0 = \langle \psi_3 \mid 1 \rangle = b_{30} \| 1 \|^2 + b_{32} \langle x^2 \mid 1 \rangle = \sqrt{\pi} \left( b_{30} + \frac{b_{32}}{2} \right) . \] Similarly, we have \begin{align*} 0 &= \langle \psi_3 \mid x \rangle = b_{31} \langle x \mid x \rangle + \langle x^3 \mid x \rangle = \sqrt{\pi} \left( \frac{1}{2}\,b_{31} + \frac{3}{4} \right) , \\ 0 &= \langle \psi_3 \mid x^2 \rangle = b_{32} \langle x^2 \mid x^2 \rangle + b_{30} \langle 1 \mid x^2 \rangle = \sqrt{\pi} \left( b_{32}\,\frac{3}{4} + b_{30}\,\frac{1}{2} \right) . \end{align*} Therefore, ψ₃ = x³ − 3/2 = 8 H₃(x). So empirically, we have \[ \psi_n (x) = 2^n H_n (x) = \left( -2 \right)^n e^{x^2} \frac{{\text d}^n}{{\text d} x^n} \left( e^{-x^2} \right) , \quad n = 0,1,2,\ldots . \tag{11.2} \] This system of functions { ψₙ(x) } is orthogonal by construction. We are going to prove that the Gram--Schmidt process defines (up to a non-zero multiple) the "physicists" Hermite polynomials Hₙ(x) in two ways. First, we show that the orthogonalization leads to recurrence (11.1); then we rederive the Rodrigues formula for the n-th Hermite polynomial Hₙ(x).

We are going to derive a three‑term recurrence from Gram–Schmidt orthogonalization procedure. A key structural fact: for any orthogonal polynomial system { pₙ } with respect to a positive weight on an interval, multiplication by x acts tridiagonally: \[ x\,p_n (x) = a_{n+1} p_{n+1}(x) + b_n p_n(x) + a_n p_{n-1}(x), \] with suitable coefficients 𝑎ₙ, bₙ.

Here, because the weight $ \displaystyle \ e^{-x^2} \ $ is even and pₙ has parity (-1)ⁿ, we get bₙ = 0 (the integrals defining bₙ vanish by parity). So we expect \[ x\,p_n(x) = a_{n+1}p_{n+1}(x) + a_n p_{n-1}(x). \] To compute the coefficients, we take inner products with p_n+1 and p_n-1. \[ \langle xp_n,p_{n+1}\rangle =a_{n+1}\langle p_{n+1},p_{n+1}\rangle , \] so we determine coefficient 𝑎_n+1: \[ a_{n+1}=\frac{\langle xp_n,p_{n+1}\rangle }{\| p_{n+1}\| ^2}. \] Coefficient 𝑎ₙ: \[ \langle x\,p_n \mid p_{n-1}\rangle = a_n\langle p_{n-1} \mid p_{n-1}\rangle . \] Hence, \[ a_n=\frac{\langle xp_n,p_{n-1}\rangle }{\| p_{n-1}\| ^2}. \] Because pₙ is monic (leading coefficient is 1), the leading term of x pₙ is xⁿ⁺¹, and the leading term of p_n+1 is also xⁿ⁺¹. This forces \[ a_{n+1}=1. \] Thus, the recurrence simplifies to \[ x\,p_n(x) = p_{n+1}(x) + a_n p_{n-1}(x). \] We can determine 𝑎ₙ by comparing norms or by direct integration. A standard computation (or comparison with the known Hermite recurrence) gives \[ a_n = \frac{n}{2}. \] So the Gram–Schmidt polynomials satisfy \[ p_{n+1}(x) = x\,p_n(x)-\frac{n}{2}p_{n-1}(x), \] with \[ p_0(x)=1,\quad p_1(x)=x. \] Multiplying by 2ⁿ, define \[ H_n (x) = 2^n p_n(x). \] Then \[ H_{n+1}(x) = 2x\,H_n(x)-2n\,H_{n-1}(x), \] which is exactly the standard Hermite recurrence (11.1). So the three‑term recurrence is inherited from Gram–Schmidt process.

Next we prove that Gram–Schmidt polynomials are exactly Hermite polynomials up to some numerical multiple. We start by deriving the Rodrigues' formula. Let us evaluate derivatives of the weight function: \begin{align*} u &= e^{-x^2} , \\ u' &= -2x\,e^{-x^2} , \\ u'' &= \left( 4x^2 -2 \right) e^{-x^2} , \\ u''' &= - \left( 8x^3 -12\,x \right) e^{-x^2} , \\ \vdots & \qquad \vdots \\ u^{(n)} &= (-1)^n H_n (x)\, e^{-x^2} , \end{align*} where Hₙ(x) is the Hermite polynomial. From the above, we have \[ H_n (x) = (-1)^n u^{(n)} (x) = (-1)^n e^{x^2} \frac{{\text d}^n}{{\text d} x^n} \, e^{-x^2} , \quad n=0,1,2,\ldots . \] This polynomial is of degree n and its leading coefficient is 2ⁿ.

We prove it by induction. For n = 0 and n = 1, the statement is true. Suppose that it is valid up to n, that is, the polynomial Hₙ(x) is of degree n and its leading coefficient is 2ⁿ. We determine the Hermite polynomial of order n+1: \begin{align*} u^{(n+1)} &= (-1)^n e^{-x^2} \frac{\text d}{{\text d}x}\, H_n (x) -2x \left( -1 \right)^n H_n (x)\, e^{-x^2} \\ &= (-1)^{n+1} 2x\,H_n (x)\, e^{-x^2} + (-1)^n e^{-x^2} \frac{\text d}{{\text d}x}\, H_n (x) . \end{align*} The first multiple 2x Hₙ(x) is a polynomial of degree n+1 and its leading term is 2ⁿ⁺¹ because Hₙ is of degree n and has leading coefficient 2ⁿ by assumption. The second term is a polynomial of degree n−1, so it does not contribute to the leading term.

Let us introduce the function \[ \psi_n (x) = H_n (x)\, e^{-x^2 /2} . \] For any n, function ψₙ is square integrable, so the system { ψₙ }_n≥0 is orthogonal in 𝔏²(ℝ). Indeed, integration by parts yields \begin{align*} \langle u^{(n)} \mid v \rangle &= \int_{-\infty}^{\infty} u^{(n)} (x)\,v(x)\,{\text d}x \\ &= \left[ u^{(n-1)}(x)\,v(x) - u^{(n-2)}(x)\,v'(x) + \cdots + (-1)^{n-1} u(x)\, v^{(n-1)} (x) \right]_{-\infty}^{\infty} \\ & \quad +(-1)^n \int_{\mathbb{R}} u(x)\,v^{(n)} {\text d}x . \tag{11.3} \end{align*} Assuming n > k, we calculate \begin{align*} \langle \psi_n \mid \psi_k \rangle &= \int_{-\infty}^{\infty} H_n (x)\,H_k (x)\, e^{-x^2} {\text d}x \\ &= (-1)^n \int_{\mathbb{R}} u^{(n)} (x) \,H_k (x)\,{\text d}x . \end{align*} Applying integration by parts formula (11.3), we get \begin{align*} \langle \psi_n \mid \psi_k \rangle &= (-1)^n \int_{\mathbb{R}} u^{(n)} (x) \,H_k (x)\,{\text d}x \\ &= (-1)^{2n} \int_{\mathbb{R} u(x)\, H_k^{(n)} \,{\text d} x = 0 \end{align*} because Hₖ(x) is a polynomial of degree k < n. ■

End of Example 12

ClearAll[weightedInner, gramSchmidtPolys, hermiteFromGS]; (* Weighted inner product with Gaussian weight *) weightedInner[f_, g_, x_] := Integrate[f[x] g[x] Exp[-x^2], {x, -Infinity, Infinity}, Assumptions -> Element[x, Reals] ]; (* Gram–Schmidt on monomials 1, x, ..., x^N *) gramSchmidtPolys[N_Integer?NonNegative, x_Symbol] := Module[ {v, p, proj, inner}, v[n_] := x^n; p[0] = v[0]; Do[ p[k] = v[k] - Sum[ inner = weightedInner[p[j] &, v[k] &, x]/weightedInner[p[j] &, p[j] &, x]; inner p[j], {j, 0, k - 1} ]; (* make monic *) p[k] = Expand[p[k]/Coefficient[p[k], x, k]]; , {k, 1, N} ]; Table[p[n], {n, 0, N}] ]; (* Rescale to physicists' Hermite polynomials *) hermiteFromGS[N_Integer?NonNegative, x_Symbol] := Module[ {p = gramSchmidtPolys[N, x]}, Table[2^n p[[n + 1]], {n, 0, N}] ]; (* Example: first 4 polynomials *) pList = gramSchmidtPolys[3, x] hList = hermiteFromGS[3, x]

pList will give the Gram–Schmidt polynomials.
hList will give the standard Hₙ(x).

Byron, F.W. and Fuller, R.W., Mathematics of Classical and Quantum Physics, Dover Publications, 1992.
I. Todhunter, An Elementary Treatise on Laplace's, Lamé's, and Bessel's Functions, MacMillan, 1875 (London).

Return to Mathematica page
Return to the main page (APMA0340)
Return to the Part 1 Basic Concepts
Return to the Part 2 Fourier Series
Return to the Part 3 Integral Transformations
Return to the Part 4 Parabolic PDEs
Return to the Part 5 Hyperbolic PDEs
Return to the Part 6 Elliptic PDEs
Return to the Part 6P Potential Theory
Return to the Part 7 Numerical Methods

MATHEMATICA
TUTORIAL

under the terms of the GNU General Public License (GPL)

Part 4.3: Orthogonality

Email: Prof. Vladimir Dobrushkin ()

Contents [hide]

Glossary

Preface

Orthogonal Systems

Orthogonalization

MATHEMATICA TUTORIAL under the terms of the GNU General Public License (GPL) Part 4.3: Orthogonality

Email: Prof. Vladimir Dobrushkin ()

Contents [hide]

Glossary

Preface

Orthogonal Systems

Orthogonalization

MATHEMATICA
TUTORIAL

under the terms of the GNU General Public License (GPL)

Part 4.3: Orthogonality