- Proofs of Fermat's theorem on sums of two squares
Fermat's theorem on sums of two squares asserts that an oddprime number "p" can be expressed as:
with
integer "x" and "y" if and only if "p" is congruent to 1 (mod 4). The statement was announced by Fermat in 1640, but he supplied no proof.The "only if" clause is easy: a
perfect square is congruent to 0 or 1 modulo 4, hence a sum of two squares is congruent to 0, 1, or 2. An odd prime number is congruent to either 1 or 3 modulo 4, and the second possibility has just been ruled out. The first proof that such a representation exists was given byLeonhard Euler in 1747 and was quite complicated. Since then, many different proofs have been found. Among them, the proof usingMinkowski's theorem aboutconvex set s andDon Zagier 's stunningly short proof based on involutions especially stand out.Euler's proof by infinite descent
Euler succeeded in proving Fermat's theorem on sums of two squares in1747 , when he was forty years old. He communicated this in a letter to Goldbach dated6 May 1747 . The proof relies on infinite descent, and proceeds in five steps; the fifth step below is from another letter to Goldbach written in1749 , as the first letter was vague on that final step:1. "The product of two numbers, each of which is a sum of two squares, is itself a sum of two squares."
::This is simply a restatement of the
Brahmagupta-Fibonacci identity .2. "If a number which is a sum of two squares is divisible by a prime which is a sum of two squares, then the quotient is a sum of two squares."
::Indeed, suppose for example that is divisible by and that this latter is a prime. Then divides
:::
::Since is a prime, it divides one of the two factors. Suppose that it divides . Since
:::
::(
Brahmagupta-Fibonacci identity ) it follows that must divide . So the equation can be divided by the square of . Dividing the expression by yields::::
::and thus expresses the quotient as a sum of two squares, as claimed.
::If divides , a similar argument holds by using
:::
::(
Brahmagupta-Fibonacci identity ).3. "If a number which can be written as a sum of two squares is divisible by a number which is not a sum of two squares, then the quotient has a factor which is not a sum of two squares."
::Suppose divides and that the quotient, factored into its prime factors is . Then . If all factors can be written as sums of two squares, then we can divide successively by , , etc., and applying the previous step we deduce that each quotient is a sum of two squares. This until we get to , concluding that would have to be the sum of two squares. So, by contraposition, if is not the sum of two squares, then at least one of the primes is not the sum of two squares.
4. "If and are relatively prime then every factor of is a sum of two squares."
::This is the step that uses infinite descent. Let be a factor of . We can write :::::where and are at most half of in absolute value. This gives::::::Therefore, must be divisible by , say . If and are not relatively prime, then their gcd cannot divide (if it did, then it would divide and which we assume are relatively prime). Thus the square of the gcd divides (as it divides ), giving us an expression of the form for relatively prime and , and with no more than half of , since
:::
::If and are relatively prime, then we can use them directly instead of switching to and .
::If is not the sum of two squares, then by the third step there must be a factor of which is not the sum of two squares; call it . This gives an infinite descent, going from to a smaller number , both not the sums of two squares but dividing a sum of two squares. Since an infinite descent is impossible, we conclude that must be expressible as a sum of two squares, as claimed.
5. "Every prime of the form is a sum of two squares."
::If , then by
Fermat's Little Theorem each of the numbers is congruent to one modulo . The differences are therefore all divisible by . Each of these differences can be factored as:::::Since is prime, it must divide one of the two factors. If in any of the cases it divides the first factor, then by the previous step we conclude that is itself a sum of two squares (since and differ by , they are relatively prime). So it is enough to show that cannot always divide the second factor. If it divides all differences , then it would divide all differences of successive terms, all differences of the differences, and so forth. Since the th differences of the sequence are all equal to , the th differences would all be constant and equal to , which is certainly not divisible by . Therefore, cannot divide all the second factors which proves that is indeed the sum of two squares.Lagrange's proof through quadratic forms
Lagrange gave a proof in 1770 based on his general theory of integral
quadratic forms . The following is a slight simplification of his argument, due to Gauss, which appears in article 182 of theDisquisitiones Arithmeticae .A (binary) quadratic form will be taken to be an expression of the form with integers. A number is said to be "represented by the form" if there exist integers such that . Fermat's theorem on sums of two squares is then equivalent to the statement that a prime is represented by the form (i.e., , ) exactly when is congruent to modulo .
The
discriminant of the quadratic form is defined to be (this is the definition due to Gauss; Lagrange did not require the term to have even coefficient, and defined the discriminant as ). The discriminant of is then equal to .Two forms and are "equivalent" if and only if there exist substitutions with integer coefficients: : with such that, when substituted into the first form, yield the second. Equivalent forms are readily seen to have the same discriminant. Moreover, it is clear that equivalent forms will represent exactly the same integers.
Lagrange proved that all forms of discriminant are equivalent. Thus, to prove Fermat's theorem it is enough to find "any" form of discriminant that represents . To do this, it suffices to find an integer such that divides . For, finding such an integer, we can consider the form:which has discriminant and represents by setting and .
Suppose then that . Again we invoke Fermat's Little Theorem: for any relatively prime to , we know that divides . Moreover, by a theorem of Lagrange, the number of solutions modulo to a congruence of degree modulo is at most (this follows since the integers modulo form a field, and a polynomial of degree has at most roots). So the congruence has at most solutions among the numbers . Therefore, there exists some positive integer strictly smaller than (and so relatively prime to ) such that does not divide . Since divides , must divide . Setting completes the proof.
Dedekind's two proofs using Gaussian integers
Dedekind gave at least two proofs of Fermat's theorem on sums of two squares, both using the arithmetical properties of theGaussian integers . One appears in section 27 of his exposition of ideals published in1877 ; the second appeared in Supplement XI toDirichlet 's "Lectures on Number Theory", and was published in1894 .1. First proof. If is a positive odd rational prime, then we have , where is the imaginary square root of . Consequently, if is any Gaussian integer, then:If is congruent to modulo , then each Gaussian integer satisfies the congruence :and therefore the ideal is the product of two different prime ideals of degree in Z [] . Since Z [] is a principal ideal domain, (in fact, a
Euclidean domain ), every ideal is principal, generated by an element of minimal norm. So if is a generator of one of the ideal factors of , then we must have which gives Fermat's theorem.2. Second proof. This proof builds on Lagrange's result that if , then there must be a number of the form which is a multiple of . Since does not divide the Gaussian factors nor (the quotients are clearly equal to , which are not Gaussian integers), despite dividing the product, it follows that cannot be a prime in Z [] . Since it is not a prime, we must have a factorization in the Gaussian integers of the form for some integers and , which immediately yields that .
Zagier's "one-sentence proof"
Let "p" = 4"k" + 1 be prime. The set "S" = {("x", "y", "z") ∈ N3: "x"2 + 4"yz" = "p"} is finite and has two
involution s, an obvious one ("x", "y", "z") → ("x", "z", "y"), whose fixed points correpond to representations of "p" as a sum of two squares, and a more complicated one,:
which has exactly one fixed point, (1, 1, "k"). However, the number of fixed points of an involution of a finite set "S" has the same parity as the
cardinality of "S", therefore, this number is odd for the first involution as well, proving that "p" is a sum of two squares.This proof, due to Zagier, is a simplification of an earlier proof by Heath-Brown, which in turn was inspired by a proof of Liouville. The technique of the proof is a combinatorial analogue of the topological principle that the
Euler characteristic s of atopological space with an involution and of its fixed point set have the same parity and is reminiscent of the use of "sign-reversing involutions" in the proofs of combinatorial bijections.References
*Richard Dedekind, "The theory of algebraic integers".
*Harold M. Edwards, "Fermat's Last Theorem. A genetic introduction to algebraic number theory". Graduate Texts in Mathematics no. 50, Springer-Verlag, NY, 1977.
*C. F. Gauss, "Disquisitiones Arithmeticae " (English Edition). Transl. by Arthur A. Clarke. Springer-Verlag, 1986.
*D. R. Heath-Brown, "Fermat's two squares theorem". Invariant, 11 (1984) pp. 3-5.
*John Stillwell , Introduction to "Theory of Algebraic Integers" by Richard Dedekind. Cambridge Mathematical Library, Cambridge University Press, 1996.
*Don Zagier , "A one-sentence proof that every prime p ≡ 1 mod 4 is a sum of two squares". Amer. Math. Monthly 97 (1990), no. 2, 144, doi|id=10.2307/2323918External links
* [http://planetmath.org/encyclopedia/ProofOfThuesLemma.html Two more proofs at PlanetMath.org]
* [http://www.math.unh.edu/~dvf/532/Zagier A one-sentence proof of the theorem]
* [http://eprints.maths.ox.ac.uk/677/ reprint of Heath-Brown's proof, with commentary]
Wikimedia Foundation. 2010.