Jump to content

Schroder-Bernstein theorem

From Wikipedia, the free encyclopedia
Theorem in set theory

In set theory, the Schroder-Bernstein theorem states that, if there exist injective functions f : A - B and g : B - A between the sets A and B, then there exists a bijective function h : A - B.

In terms of the cardinality of the two sets, this classically implies that if |A| <= |B| and |B| <= |A|, then |A| = |B|; that is, A and B are equipotent. This is a useful feature in the ordering of cardinal numbers.

The theorem is named after Ernst Schroder and Felix Bernstein. It is also known as the Cantor-Bernstein theorem or Cantor-Schroder-Bernstein theorem, after Georg Cantor, who first published it (albeit without proof).

Proof

[edit]
Konig's definition of a bijection h:A - B from given example injections f:A - B and g:B - A. An element in A and B is denoted by a number and a letter, respectively. The sequence 3 - e - 6 - ... is an A-stopper, leading to the definitions h(3) = f(3) = e, h(6) = f(6), .... The sequence d - 5 - f - ... is a B-stopper, leading to h(5) = g-1(5) = d, .... The sequence ... - a - 1 - c - 4 - ... is doubly infinite, leading to h(1) = g-1(1) = a, h(4) = g-1(4) = c, .... The sequence b - 2 - b is cyclic, leading to h(2) = g-1(2) = b.

The following proof is attributed to Julius Konig.[1]

Assume without loss of generality that A and B are disjoint. For any a in A or b in B we can form a unique two-sided sequence of elements that are alternately in A and B, by repeatedly applying f {\displaystyle f} and g - 1 {\displaystyle g^{-1}} to go from A to B and g {\displaystyle g} and f - 1 {\displaystyle f^{-1}} to go from B to A (where defined; the inverses f - 1 {\displaystyle f^{-1}} and g - 1 {\displaystyle g^{-1}} are understood as partial functions.)

- f - 1 ( g - 1 ( a ) ) - g - 1 ( a ) - a - f ( a ) - g ( f ( a ) ) - {\displaystyle \cdots \rightarrow f^{-1}(g^{-1}(a))\rightarrow g^{-1}(a)\rightarrow a\rightarrow f(a)\rightarrow g(f(a))\rightarrow \cdots }

For any particular a, this sequence may terminate to the left or not, at a point where f - 1 {\displaystyle f^{-1}} or g - 1 {\displaystyle g^{-1}} is not defined.

By the fact that f {\displaystyle f} and g {\displaystyle g} are injective functions, each a in A and b in B is in exactly one such sequence to within identity: if an element occurs in two sequences, all elements to the left and to the right must be the same in both, by the definition of the sequences. Therefore, the sequences form a partition of the (disjoint) union of A and B. Hence it suffices to produce a bijection between the elements of A and B in each of the sequences separately, as follows:

Call a sequence an A-stopper if it stops at an element of A, or a B-stopper if it stops at an element of B. Otherwise, call it doubly infinite if all the elements are distinct or cyclic if it repeats. See the picture for examples.

  • For an A-stopper, the function f {\displaystyle f} is a bijection between its elements in A and its elements in B.
  • For a B-stopper, the function g {\displaystyle g} is a bijection between its elements in B and its elements in A.
  • For a doubly infinite sequence or a cyclic sequence, either f {\displaystyle f} or g {\displaystyle g} will do ( g {\displaystyle g} is used in the picture).

Corollary for surjective pair

[edit]

If we assume the axiom of choice, then a pair of surjective functions f {\displaystyle f} and g {\displaystyle g} also implies the existence of a bijection.[citation needed] We construct an injective function h : B - A from f - 1 {\displaystyle f^{-1}} by picking a single element from the inverse image of each point in B . {\displaystyle B.} The surjectivity of f {\displaystyle f} guarantees the existence of at least one element in each such inverse image. We do the same to obtain an injective function k : A - B from g - 1 . {\displaystyle g^{-1}.} The Schroder-Bernstein theorem then can be applied to the injections h and k.

Examples

[edit]
Bijective function from [ 0 , 1 ] - [ 0 , 1 ) {\displaystyle [0,1]\to [0,1)}
Note: [ 0 , 1 ) {\displaystyle [0,1)} is the half open set from 0 to 1, including the boundary 0 and excluding the boundary 1.
Let f : [ 0 , 1 ] - [ 0 , 1 ) {\displaystyle f:[0,1]\to [0,1)} with f ( x ) = x / 2 {\displaystyle f(x)=x/2} ; and g : [ 0 , 1 ) - [ 0 , 1 ] {\displaystyle g:[0,1)\to [0,1]} with g ( x ) = x {\displaystyle g(x)=x} ; the two injective functions.
In line with that procedure C 0 = { 1 } , C k = { 2 - k } , C = k = 0 C k = { 1 , 1 2 , 1 4 , 1 8 , . . . } {\displaystyle C_{0}=\{1\},\;C_{k}=\{2^{-k}\},\;C=\bigcup _{k=0}^{\infty }C_{k}=\{1,{\tfrac {1}{2}},{\tfrac {1}{4}},{\tfrac {1}{8}},...\}}
Then h ( x ) = { x 2 , f o r x C x , f o r x [ 0 , 1 ] \ C {\displaystyle h(x)={\begin{cases}{\frac {x}{2}},&\mathrm {for} \ x\in C\\x,&\mathrm {for} \ x\in [0,1]\smallsetminus C\end{cases}}} is a bijective function from [ 0 , 1 ] - [ 0 , 1 ) {\displaystyle [0,1]\to [0,1)} .
Bijective function from [ 0 , 2 ) - [ 0 , 1 ) 2 {\displaystyle [0,2)\to [0,1)^{2}}
Let f : [ 0 , 2 ) - [ 0 , 1 ) 2 {\displaystyle f:[0,2)\to [0,1)^{2}} with f ( x ) = ( x / 2 ; 0 ) {\displaystyle f(x)=(x/2;0)} ;
Then for ( x ; y ) [ 0 , 1 ) 2 {\displaystyle (x;y)\in [0,1)^{2}} one can use the expansions x = k = 1 a k 10 - k {\displaystyle x=\sum _{k=1}^{\infty }a_{k}\cdot 10^{-k}} and y = k = 1 b k 10 - k {\displaystyle y=\sum _{k=1}^{\infty }b_{k}\cdot 10^{-k}} with a k , b k { 0 , 1 , . . . , 9 } {\displaystyle a_{k},b_{k}\in \{0,1,...,9\}}
and now one can set g ( x ; y ) = k = 1 ( 10 a k + b k ) 10 - 2 k {\displaystyle g(x;y)=\sum _{k=1}^{\infty }(10\cdot a_{k}+b_{k})\cdot 10^{-2k}} which defines an injective function [ 0 , 1 ) 2 - [ 0 , 2 ) {\displaystyle [0,1)^{2}\to [0,2)} . (Example: g ( 1 3 ; 2 3 ) = 0.363636... = 4 11 {\displaystyle g({\tfrac {1}{3}};{\tfrac {2}{3}})=0.363636...={\tfrac {4}{11}}} )
And therefore a bijective function h {\displaystyle h} can be constructed with the use of f ( x ) {\displaystyle f(x)} and g - 1 ( x ) {\displaystyle g^{-1}(x)} .
In this case C 0 = [ 1 , 2 ) {\displaystyle C_{0}=[1,2)} is still easy but already C 1 = g ( f ( C 0 ) ) = g ( { ( x ; 0 ) | x [ 1 2 , 1 ) } ) {\displaystyle C_{1}=g(f(C_{0}))=g(\{(x;0)|x\in [{\tfrac {1}{2}},1)\,\})} gets quite complicated.
Note: Of course there's a more simple way by using the (already bijective) function definition g 2 ( x ; y ) = 2 k = 1 ( 10 a k + b k ) 10 - 2 k {\displaystyle g_{2}(x;y)=2\cdot \sum _{k=1}^{\infty }(10\cdot a_{k}+b_{k})\cdot 10^{-2k}} . Then C {\displaystyle C} would be the empty set and h ( x ) = g 2 - 1 ( x ) {\displaystyle h(x)=g_{2}^{-1}(x)} for all x.

History

[edit]

The traditional name "Schroder-Bernstein" is based on two proofs published independently in 1898. Cantor is often added because he first stated the theorem in 1887, while Schroder's name is often omitted because his proof turned out to be flawed while the name of Richard Dedekind, who first proved it, is not connected with the theorem. According to Bernstein, Cantor had suggested the name equivalence theorem (Aquivalenzsatz).[2]

Cantor's first statement of the theorem (1887)[3]
  • 1887 Cantor publishes the theorem, however without proof.[3][2]
  • 1887 On July 11, Dedekind proves the theorem (not relying on the axiom of choice)[4] but neither publishes his proof nor tells Cantor about it. Ernst Zermelo discovered Dedekind's proof and in 1908[5] he publishes his own proof based on the chain theory from Dedekind's paper Was sind und was sollen die Zahlen?[2][6]
  • 1895 Cantor states the theorem in his first paper on set theory and transfinite numbers. He obtains it as an easy consequence of the linear order of cardinal numbers.[7][8][9] However, he could not prove the latter theorem, which is shown in 1915 to be equivalent to the axiom of choice by Friedrich Moritz Hartogs.[2][10]
  • 1896 Schroder announces a proof (as a corollary of a theorem by Jevons).[11]
  • 1897 Bernstein, a 19-year-old student in Cantor's Seminar, presents his proof.[12][13]
  • 1897 Almost simultaneously, but independently, Schroder finds a proof.[12][13]
  • 1897 After a visit by Bernstein, Dedekind independently proves the theorem a second time.
  • 1898 Bernstein's proof (not relying on the axiom of choice) is published by Emile Borel in his book on functions.[14] (Communicated by Cantor at the 1897 International Congress of Mathematicians in Zurich.) In the same year, the proof also appears in Bernstein's dissertation.[15][2]
  • 1898 Schroder publishes his proof[16] which, however, is shown to be faulty by Alwin Reinhold Korselt in 1902 (just before Schroder's death),[17] (confirmed by Schroder),[2][18] but Korselt's paper is published only in 1911.

Both proofs of Dedekind are based on his famous 1888 memoir Was sind und was sollen die Zahlen? and derive it as a corollary of a proposition equivalent to statement C in Cantor's paper,[7] which reads A B C and |A| = |C| implies |A| = |B| = |C|. Cantor observed this property as early as 1882/83 during his studies in set theory and transfinite numbers and was therefore (implicitly) relying on the axiom of choice.

Prerequisites

[edit]

The 1895 proof by Cantor relied, in effect, on the axiom of choice by inferring the result as a corollary of the well-ordering theorem.[8][9] However, Konig's proof given above shows that the result can also be proved without using the axiom of choice.

On the other hand, Konig's proof uses the principle of excluded middle to draw a conclusion through case analysis. As such, the above proof is not a constructive one. In fact, in a constructive set theory such as intuitionistic set theory I Z F {\displaystyle {\mathsf {IZF}}} , which adopts the full axiom of separation but dispenses with the principle of excluded middle, assuming the Schroder-Bernstein theorem implies the latter.[19] In turn, there is no proof of Konig's conclusion in this or weaker constructive theories. Thus, the theorem is not provable in intuitionistic set theory and is generally not accepted as constructively valid.[20]

There is also a proof which uses Tarski's fixed point theorem.[21]

See also

[edit]

Notes

[edit]
  1. ^ Konig, J. (1906). "Sur la theorie des ensembles". Comptes Rendus Hebdomadaires des Seances de l'Academie des Sciences (in French). 143: 110-112.
  2. ^ a b c d e f Hausdorff, Felix (2002), Brieskorn, Egbert; Chatterji, Srishti D.; et al. (eds.), Grundzuge der Mengenlehre (in German) (1. ed.), Berlin/Heidelberg: Springer, p. 587, ISBN 978-3-540-42224-2 - Original edition (1914)
  3. ^ a b Cantor, Georg (1887), "Mitteilungen zur Lehre vom Transfiniten", Zeitschrift fur Philosophie und philosophische Kritik (in German), 91: 81-125
    Reprinted in: Cantor, Georg (1932), Adolf Fraenkel (Lebenslauf); Ernst Zermelo (eds.), Gesammelte Abhandlungen mathematischen und philosophischen Inhalts (in German), Berlin: Springer, pp. 378-439 Here: p.413 bottom
  4. ^ Dedekind, Richard (1932), Robert Fricke; Emmy Noether; Oystein Ore (eds.), Gesammelte mathematische Werke (in German), vol. 3, Braunschweig: Friedr. Vieweg & Sohn, pp. 447-449 (Ch.62)
  5. ^ Zermelo, Ernst (1908), Felix Klein; Walther von Dyck; David Hilbert; Otto Blumenthal (eds.), "Untersuchungen uber die Grundlagen der Mengenlehre I", Mathematische Annalen (in German), 65 (2): 261-281, here: p.271-272, doi:10.1007/bf01449999, ISSN 0025-5831, S2CID 120085563
  6. ^ Dedekind, Richard (1888), Was sind und was sollen die Zahlen? (in German) (2., unchanged (1893) ed.), Braunschweig: Friedr. Vieweg & Sohn
  7. ^ a b Cantor, Georg (1932), Adolf Fraenkel (Lebenslauf); Ernst Zermelo (eds.), Gesammelte Abhandlungen mathematischen und philosophischen Inhalts (in German), Berlin: Springer, pp. 285 ("Satz B")
  8. ^ a b Cantor, Georg (1895). "Beitrage zur Begrundung der transfiniten Mengenlehre (1)". Mathematische Annalen. 46 (4): 481-512 (Theorem see "Satz B", p.484). doi:10.1007/bf02124929. S2CID 177801164.
  9. ^ a b (Cantor, Georg (1897). "Beitrage zur Begrundung der transfiniten Mengenlehre (2)". Mathematische Annalen (in German). 49 (2): 207-246. doi:10.1007/bf01444205. S2CID 121665994.)
  10. ^ Hartogs, Friedrich M. (1915), Felix Klein; Walther von Dyck; David Hilbert; Otto Blumenthal (eds.), "Uber das Problem der Wohlordnung", Mathematische Annalen (in German), 76 (4): 438-443, doi:10.1007/bf01458215, ISSN 0025-5831, S2CID 121598654
  11. ^ Schroder, Ernst (1896). "Uber G. Cantorsche Satze". Jahresbericht der Deutschen Mathematiker-Vereinigung (in German). 5: 81-82.
  12. ^ a b Deiser, Oliver (2010), Einfuhrung in die Mengenlehre - Die Mengenlehre Georg Cantors und ihre Axiomatisierung durch Ernst Zermelo, Springer-Lehrbuch (in German) (3rd, corrected ed.), Berlin/Heidelberg: Springer, pp. 71, 501, doi:10.1007/978-3-642-01445-1, ISBN 978-3-642-01444-4
  13. ^ a b Suppes, Patrick (1972), Axiomatic Set Theory (1. ed.), New York: Dover Publications, pp. 95 f, ISBN 978-0-486-61630-8
  14. ^ Borel, Emile (1898), Lecons sur la theorie des fonctions (in French), Paris: Gauthier-Villars et fils, pp. 103 ff
  15. ^ Bernstein, Felix (1901), Untersuchungen aus der Mengenlehre (in German), Halle a. S.: Buchdruckerei des Waisenhauses
    Reprinted in: Bernstein, Felix (1905), Felix Klein; Walther von Dyck; David Hilbert (eds.), "Untersuchungen aus der Mengenlehre", Mathematische Annalen (in German), 61 (1): 117-155, (Theorem see "Satz 1" on p.121), doi:10.1007/bf01457734, ISSN 0025-5831, S2CID 119658724
  16. ^ Schroder, Ernst (1898), Kaiserliche Leopoldino-Carolinische Deutsche Akademie der Naturforscher (ed.), "Ueber zwei Definitionen der Endlichkeit und G. Cantor'sche Satze", Nova Acta (in German), 71 (6): 303-376 (proof: p.336-344)
  17. ^ Korselt, Alwin R. (1911), Felix Klein; Walther von Dyck; David Hilbert; Otto Blumenthal (eds.), "Uber einen Beweis des Aquivalenzsatzes", Mathematische Annalen (in German), 70 (2): 294-296, doi:10.1007/bf01461161, ISSN 0025-5831, S2CID 119757900
  18. ^ Korselt (1911), p.295
  19. ^ Pradic, Cecilia; Brown, Chad E. (2019). "Cantor-Bernstein implies Excluded Middle". arXiv:1904.09193 [math.LO].
  20. ^ Carruccio, Ettore (2006). Mathematics and Logic in History and in Contemporary Thought. Transaction Publishers. p. 354. ISBN 978-0-202-30850-0.
  21. ^ Roland Uhl. "Tarski's Fixed Point Theorem". MathWorld. Example 3.

References

[edit]
[edit]
Overview
Axioms
Operations
  • Concepts
  • Methods
Set types
Theories
Set theorists
General
Theorems
(list),
paradoxes
Logics
Traditional
Propositional
Predicate
Set theory
Types
of sets
Maps,
cardinality
Theories
Formal
systems

(list),
language,
syntax
Example
axiomatic
systems

(list)
Proof theory
Model theory
Computability
theory
Related