Vieta's formulas

From HandWiki
Short description: Relating coefficients and roots of a polynomial
François Viète

In mathematics, Vieta's formulas relate the coefficients of a polynomial to sums and products of its roots. They are named after François Viète (more commonly referred to by the Latinised form of his name, "Franciscus Vieta").

Basic formulas

Any general polynomial of degree n [math]\displaystyle{ P(x) = a_n x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 }[/math] (with the coefficients being real or complex numbers and an ≠ 0) has n (not necessarily distinct) complex roots r1, r2, ..., rn by the fundamental theorem of algebra. Vieta's formulas relate the polynomial coefficients to signed sums of products of the roots r1, r2, ..., rn as follows:

[math]\displaystyle{ \begin{cases} r_1 + r_2 + \dots + r_{n-1} + r_n = -\dfrac{a_{n-1}}{a_n} \\[1ex] (r_1 r_2 + r_1 r_3 + \cdots + r_1 r_n) + (r_2r_3 + r_2r_4+\cdots + r_2r_n)+\cdots + r_{n-1}r_n = \dfrac{a_{n-2}}{a_{n}} \\[1ex] {} \quad \vdots \\[1ex] r_1 r_2 \cdots r_n = (-1)^n \dfrac{a_0}{a_n}. \end{cases} }[/math]

 

 

 

 

([[#equation_*|*]])

Vieta's formulas can equivalently be written as [math]\displaystyle{ \sum_{1\le i_1 \lt i_2 \lt \cdots \lt i_k\le n} \left(\prod_{j = 1}^k r_{i_j}\right)=(-1)^k\frac{a_{n-k}}{a_n} }[/math] for k = 1, 2, ..., n (the indices ik are sorted in increasing order to ensure each product of k roots is used exactly once).

The left-hand sides of Vieta's formulas are the elementary symmetric polynomials of the roots.

Vieta's system (*) can be solved by Newton's method through an explicit simple iterative formula, the Durand-Kerner method.

Generalization to rings

Vieta's formulas are frequently used with polynomials with coefficients in any integral domain R. Then, the quotients [math]\displaystyle{ a_i/a_n }[/math] belong to the field of fractions of R (and possibly are in R itself if [math]\displaystyle{ a_n }[/math] happens to be invertible in R) and the roots [math]\displaystyle{ r_i }[/math] are taken in an algebraically closed extension. Typically, R is the ring of the integers, the field of fractions is the field of the rational numbers and the algebraically closed field is the field of the complex numbers.

Vieta's formulas are then useful because they provide relations between the roots without having to compute them.

For polynomials over a commutative ring that is not an integral domain, Vieta's formulas are only valid when [math]\displaystyle{ a_n }[/math] is not a zero-divisor and [math]\displaystyle{ P(x) }[/math] factors as [math]\displaystyle{ a_n(x-r_1)(x-r_2)\dots(x-r_n) }[/math]. For example, in the ring of the integers modulo 8, the quadratic polynomial [math]\displaystyle{ P(x) = x^2-1 }[/math] has four roots: 1, 3, 5, and 7. Vieta's formulas are not true if, say, [math]\displaystyle{ r_1=1 }[/math] and [math]\displaystyle{ r_2=3 }[/math], because [math]\displaystyle{ P(x)\neq (x-1)(x-3) }[/math]. However, [math]\displaystyle{ P(x) }[/math] does factor as [math]\displaystyle{ (x-1)(x-7) }[/math] and also as [math]\displaystyle{ (x-3)(x-5) }[/math], and Vieta's formulas hold if we set either [math]\displaystyle{ r_1=1 }[/math] and [math]\displaystyle{ r_2=7 }[/math] or [math]\displaystyle{ r_1=3 }[/math] and [math]\displaystyle{ r_2=5 }[/math].

Example

Vieta's formulas applied to quadratic and cubic polynomials:

The roots [math]\displaystyle{ r_1, r_2 }[/math] of the quadratic polynomial [math]\displaystyle{ P(x) = ax^2 + bx + c }[/math] satisfy [math]\displaystyle{ r_1 + r_2 = -\frac{b}{a}, \quad r_1 r_2 = \frac{c}{a}. }[/math]

The first of these equations can be used to find the minimum (or maximum) of P; see Quadratic equation § Vieta's formulas.

The roots [math]\displaystyle{ r_1, r_2, r_3 }[/math] of the cubic polynomial [math]\displaystyle{ P(x) = ax^3 + bx^2 + cx + d }[/math] satisfy [math]\displaystyle{ r_1 + r_2 + r_3 = -\frac{b}{a}, \quad r_1 r_2 + r_1 r_3 + r_2 r_3 = \frac{c}{a}, \quad r_1 r_2 r_3 = -\frac{d}{a}. }[/math]

Proof

Vieta's formulas can be proved by expanding the equality [math]\displaystyle{ a_n x^n + a_{n-1}x^{n-1} +\cdots + a_1 x+ a_0 = a_n (x-r_1) (x-r_2) \cdots (x-r_n) }[/math] (which is true since [math]\displaystyle{ r_1, r_2, \dots, r_n }[/math] are all the roots of this polynomial), multiplying the factors on the right-hand side, and identifying the coefficients of each power of [math]\displaystyle{ x. }[/math]

Formally, if one expands [math]\displaystyle{ (x-r_1) (x-r_2) \cdots (x-r_n), }[/math] the terms are precisely [math]\displaystyle{ (-1)^{n-k}r_1^{b_1}\cdots r_n^{b_n} x^k, }[/math] where [math]\displaystyle{ b_i }[/math] is either 0 or 1, accordingly as whether [math]\displaystyle{ r_i }[/math] is included in the product or not, and k is the number of [math]\displaystyle{ r_i }[/math] that are included, so the total number of factors in the product is n (counting [math]\displaystyle{ x^k }[/math] with multiplicity k) – as there are n binary choices (include [math]\displaystyle{ r_i }[/math] or x), there are [math]\displaystyle{ 2^n }[/math] terms – geometrically, these can be understood as the vertices of a hypercube. Grouping these terms by degree yields the elementary symmetric polynomials in [math]\displaystyle{ r_i }[/math] – for xk, all distinct k-fold products of [math]\displaystyle{ r_i. }[/math]

As an example, consider the quadratic [math]\displaystyle{ f(x) = a_2x^2 + a_1x + a_0 = a_2(x - r_1)(x - r_2) = a_2(x^2 - x(r_1 + r_2) + r_1 r_2). }[/math]

Comparing identical powers of [math]\displaystyle{ x }[/math], we find [math]\displaystyle{ a_2=a_2 }[/math], [math]\displaystyle{ a_1=-a_2 (r_1+r_2) }[/math] and [math]\displaystyle{ a_0 = a_2 (r_1r_2) }[/math], with which we can for example identify [math]\displaystyle{ r_1+r_2 = - a_1/a_2 }[/math] and [math]\displaystyle{ r_1r_2 = a_0/a_2 }[/math], which are Vieta's formula's for [math]\displaystyle{ n=2 }[/math].

Alternate Proof (Mathematical Induction)

Vieta's formulas also formulas can be proven by induction.

Inductive Hypothesis:

Let [math]\displaystyle{ {P(x)} }[/math] be a n degree polynomial, with real or complex roots [math]\displaystyle{ {r_1},{r_2},{\cdots},{r_n} }[/math].[math]\displaystyle{ {P(x)} = {a_n}{x^n}+{{a_{n-1}}{x^{n-1}}}+{\cdots}+{{a_{1}}{x}}+{{a}_{0}} = {{a_n}{x^{n}}}-{a_n}{({r_1}+{r_2}+{\cdots}+{r_n}){x^{n-1}}}+{\cdots}+ {{(-1)^{n}}{ (a_n)}{({r_1}{r_2}{\cdots}{r_n})}}, { a_n} \neq 0 }[/math]Base Case, [math]\displaystyle{ n = 2 }[/math] (quadratic):

Let [math]\displaystyle{ {a_2},{a_1} }[/math] be coefficients of the quadratic and [math]\displaystyle{ a_0 }[/math]be the constant term. Similarly, let [math]\displaystyle{ {r_1},{r_2} }[/math] be the roots of the quadratic:[math]\displaystyle{ {a_2 x^2}+{a_1 x} + a_0 = {a_2}{(x-r_1)(x-r_2)} }[/math]Expand the right side using distributive property:[math]\displaystyle{ {a_2 x^2}+{a_1 x} + a_0 = {a_2}{({x^2}-{r_1x}-{r_2x}+{r_1}{r_2})} }[/math]Collect like terms:[math]\displaystyle{ {a_2 x^2}+{a_1 x} + a_0 = {a_2}{({x^2}-{({r_1}+{r_2}){x}}+{r_1}{r_2})} }[/math]Apply distributive property again:[math]\displaystyle{ {a_2 x^2}+{a_1 x} + a_0 = {{a_2}{x^2}-{{a_2}({r_1}+{r_2}){x}}+{a_2}{({r_1}{r_2})}} }[/math]The inductive hypothesis has now been proven true for n = 2.

Induction Step:

Assuming the inductive hypothesis holds true for all [math]\displaystyle{ n\geqslant 2 }[/math], it must be true for all [math]\displaystyle{ n+1 }[/math].[math]\displaystyle{ {P(x)} = {a_{n+1}}{x^{n+1}}+{{a_{n}}{x^{n}}}+{\cdots}+{{a_{1}}{x}}+{{a}_{0}} }[/math]By the factor theorem, [math]\displaystyle{ {(x-r_{n+1})} }[/math] can be factored out of [math]\displaystyle{ P(x) }[/math] leaving a 0 remainder. Note that the roots of the polynomial in the square brackets are [math]\displaystyle{ r_1,r_2,\cdots,r_n }[/math]:[math]\displaystyle{ {P(x)} = {(x-r_{n+1})} {[{\frac{{a_ {n+ 1}}{x^ {n+1}}+{{a_{n}}{x^{n}}}+{\cdots}+{{a_{1}}{x}}+{{a}_{0}}}{x- r_{n +1}}}]} }[/math]Factor out [math]\displaystyle{ a_{n+1} }[/math], the leading coefficient [math]\displaystyle{ P(x) }[/math], from the polynomial in the square brackets:[math]\displaystyle{ {P(x)} ={(a_{n+{1}})}{(x-r_{n+1})} {\frac{{a_{n}} {x^{n}}}{(a_{n+{1}})}}+{\cdots}+{\frac {a_{1}}{(a_{n+{1}})} {x}}+ {{\frac{a_0}{{(a_{n+{1}})}}}}} {x- r_{n +1}}}]} }[/math]For simplicity sake, allow the coefficients and constant of polynomial be denoted as [math]\displaystyle{ \zeta }[/math]:[math]\displaystyle{ P(x) = {(a_ {n+1})}{(x-r_ {n+1})}{[{x^n}+{\zeta_{n-1}x^{n-1}}+{\cdots}+{\zeta_0}]} }[/math]Using the inductive hypothesis, the polynomial in the square brackets can be rewritten as:[math]\displaystyle{ P(x) = {(a_ {n+1})} {(x-r_ {n+1})} {[{{x^{n}}}-{({r_1}+{r_2}+{\cdots}+{r_n}){x^{n-1}}}+{\cdots}+ {{(-1)^{n}}{({r_1}{r_2}{\cdots}{r_n})}}]} }[/math]Using distributive property:[math]\displaystyle{ P(x) = {(a_ {n+1})}{({x} {[{{x^{n}}}-{({r_1}+{r_2}+{\cdots}+{r_n}){x^{n-1}}}+{\cdots}+ {{(-1)^{n}}{({r_1}{r_2}{\cdots}{r_n})}}]} {- r_ {n+1}} {[{{x^{n}}}-{({r_1}+{r_2}+{\cdots}+{r_n}){x^{n-1}}}+{\cdots}+ {{(-1)^{n}}{({r_1}{r_2}{\cdots}{r_n})}}]} )} }[/math]After expanding and collecting like terms:[math]\displaystyle{ \begin{align} {P(x)} = {{a_{n+1}}{x^{n+1}}}-{a_{n+1}}{({r_1}+{r_2}+{\cdots}+{r_n}+{r_{n+1}}){x^{n}}}+{\cdots}+ {{(-1)^{n+1}}{({r_1}{r_2}{\cdots}{r_n}{r_{n+1}})}} \\ \end{align} }[/math]The inductive hypothesis holds true for [math]\displaystyle{ n+1 }[/math], therefore it must be true [math]\displaystyle{ \forall n \in \mathbb{N} }[/math]

Conclusion:[math]\displaystyle{ {a_ n}{x^n}+{{a_{n-1}}{x^{n-1}}}+{\cdots}+{{a_{1}}{x}}+{{a}_{0}} = {{a_n}{x^{n}}}-{a_n}{({r_1}+{r_2}+{\cdots}+{r_n}){x^{n-1}}}+{\cdots}+ {{(-1)^{n}}{({r_1}{r_2}{\cdots}{r_n})}} }[/math]By dividing both sides both sides by [math]\displaystyle{ a_{n} }[/math], it proves the Vieta's formulas true.

History

As reflected in the name, the formulas were discovered by the 16th-century French mathematician François Viète, for the case of positive roots.

In the opinion of the 18th-century British mathematician Charles Hutton, as quoted by Funkhouser,[1] the general principle (not restricted to positive real roots) was first understood by the 17th-century French mathematician Albert Girard:

...[Girard was] the first person who understood the general doctrine of the formation of the coefficients of the powers from the sum of the roots and their products. He was the first who discovered the rules for summing the powers of the roots of any equation.

See also

References

  1. (Funkhouser 1930)