Dark Mode

Jump to content

Ordinal collapsing function

From Wikipedia, the free encyclopedia
Set-theoretic function
This article includes a list of references, related reading, or external links, but its sources remain unclear because it lacks inline citations. Please help improve this article by introducing more precise citations. (June 2022) (Learn how and when to remove this message)

In mathematical logic and set theory, an ordinal collapsing function (or projection function) is a technique for defining (notations for) certain recursive large countable ordinals, whose principle is to give names to certain ordinals much larger than the one being defined, perhaps even large cardinals (though they can be replaced with recursively large ordinals at the cost of extra technical difficulty), and then "collapse" them down to a system of notations for the sought-after ordinal. For this reason, ordinal collapsing functions are described as an impredicative manner of naming ordinals.

The details of the definition of ordinal collapsing functions vary, and get more complicated as greater ordinals are being defined, but the typical idea is that whenever the notation system "runs out of fuel" and cannot name a certain ordinal, a much larger ordinal is brought "from above" to give a name to that critical point. An example of how this works will be detailed below, for an ordinal collapsing function defining the Bachmann-Howard ordinal (i.e., defining a system of notations up to the Bachmann-Howard ordinal).

The use and definition of ordinal collapsing functions is inextricably intertwined with the theory of ordinal analysis, since the large countable ordinals defined and denoted by a given collapse are used to describe the ordinal-theoretic strength of certain formal systems, typically[1][2] subsystems of second-order arithmetic (such as those seen in reverse mathematics), extensions of Kripke-Platek set theory, Bishop-style systems of constructive mathematics or Martin-Lof-style systems of intuitionistic type theory.

Ordinal collapsing functions are typically denoted using some variation of either the Greek letter ps {\displaystyle \psi } (psi) or th {\displaystyle \theta } (theta).

An example leading up to the Bachmann-Howard ordinal

[edit]

The choice of the ordinal collapsing function given as example below imitates greatly the system introduced by Buchholz[3] but is limited to collapsing one cardinal for clarity of exposition. More on the relation between this example and Buchholz's system will be said below.

Definition

[edit]

Let O {\displaystyle \Omega } stand for the first uncountable ordinal o 1 {\displaystyle \omega _{1}} , or, in fact, any ordinal that is an e {\displaystyle \varepsilon } -number and guaranteed to be greater than all the countable ordinals that will be constructed (for example, the Church-Kleene ordinal is adequate for our purposes; but we will work with o 1 {\displaystyle \omega _{1}} because it allows the convenient use of the word countable in the definitions).

We define a function ps {\displaystyle \psi } (which will be non-decreasing and continuous), taking an arbitrary ordinal a {\displaystyle \alpha } to a countable ordinal ps ( a ) {\displaystyle \psi (\alpha )} , recursively on a {\displaystyle \alpha } , as follows:

Assume ps ( b ) {\displaystyle \psi (\beta )} has been defined for all b < a {\displaystyle \beta <\alpha } , and we wish to define ps ( a ) {\displaystyle \psi (\alpha )} .
Let C ( a ) {\displaystyle C(\alpha )} be the set of ordinals generated starting from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } and O {\displaystyle \Omega } by recursively applying the following functions: ordinal addition, multiplication and exponentiation and the function ps | a {\displaystyle \psi {\upharpoonright _{\alpha }}} , i.e., the restriction of ps {\displaystyle \psi } to ordinals b < a {\displaystyle \beta <\alpha } . (Formally, we define C ( a ) 0 = { 0 , 1 , o , O } {\displaystyle C(\alpha )_{0}=\{0,1,\omega ,\Omega \}} and inductively C ( a ) n + 1 = C ( a ) n { b 1 + b 2 , b 1 b 2 , b 1 b 2 : b 1 , b 2 C ( a ) n } { ps ( b ) : b C ( a ) n b < a } {\displaystyle C(\alpha )_{n+1}=C(\alpha )_{n}\cup \{\beta _{1}+\beta _{2},\beta _{1}\cdot \beta _{2},{\beta _{1}}^{\beta _{2}}:\beta _{1},\beta _{2}\in C(\alpha )_{n}\}\cup \{\psi (\beta ):\beta \in C(\alpha )_{n}\land \beta <\alpha \}} for all natural numbers n {\displaystyle n} and we let C ( a ) {\displaystyle C(\alpha )} be the union of the C ( a ) n {\displaystyle C(\alpha )_{n}} for all n {\displaystyle n} .)
Then ps ( a ) {\displaystyle \psi (\alpha )} is defined as the smallest ordinal not belonging to C ( a ) {\displaystyle C(\alpha )} .

In a more concise (although more obscure) way:

ps ( a ) {\displaystyle \psi (\alpha )} is the smallest ordinal that cannot be expressed from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } and O {\displaystyle \Omega } using sums, products, exponentials, and the ps {\displaystyle \psi } function itself (to previously constructed ordinals less than a {\displaystyle \alpha } ).

Here is an attempt to explain the motivation for the definition of ps {\displaystyle \psi } in intuitive terms: since the usual operations of addition, multiplication and exponentiation are not sufficient to designate ordinals very far, we attempt to systematically create new names for ordinals by taking the first one that does not have a name yet, and whenever we run out of names, rather than invent them in an ad hoc fashion or using diagonal schemes, we seek them in the ordinals far beyond the ones we are constructing (beyond O {\displaystyle \Omega } , that is); so we give names to uncountable ordinals and, since in the end the list of names is necessarily countable, ps {\displaystyle \psi } will "collapse" them to countable ordinals.

Computation of values of ps

[edit]

To clarify how the function ps {\displaystyle \psi } is able to produce notations for certain ordinals, we now compute its first values.

Predicative start

[edit]

First consider C ( 0 ) {\displaystyle C(0)} . It contains ordinals 0 , 1 , 2 , 3 , o , o + 1 , o + 2 , o 2 , o 3 , o 2 , o 3 , o o , o o o {\displaystyle 0,1,2,3,\omega ,\omega +1,\omega +2,\omega \cdot 2,\omega \cdot 3,\omega ^{2},\omega ^{3},\omega ^{\omega },\omega ^{\omega ^{\omega }}} and so on. It also contains such ordinals as O , O + 1 , o O + 1 , O O {\displaystyle \Omega ,\Omega +1,\omega ^{\Omega +1},\Omega ^{\Omega }} . The first ordinal that it does not contain is e 0 {\displaystyle \varepsilon _{0}} (which is the limit of o {\displaystyle \omega } , o o {\displaystyle \omega ^{\omega }} , o o o {\displaystyle \omega ^{\omega ^{\omega }}} and so on -- less than O {\displaystyle \Omega } by assumption). The upper bound of the ordinals it contains is e O + 1 {\displaystyle \varepsilon _{\Omega +1}} (the limit of O {\displaystyle \Omega } , O O {\displaystyle \Omega ^{\Omega }} , O O O {\displaystyle \Omega ^{\Omega ^{\Omega }}} and so on), but that is not so important. This shows that ps ( 0 ) = e 0 {\displaystyle \psi (0)=\varepsilon _{0}} .

Similarly, C ( 1 ) {\displaystyle C(1)} contains the ordinals which can be formed from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } and this time also e 0 {\displaystyle \varepsilon _{0}} , using addition, multiplication and exponentiation. This contains all the ordinals up to e 1 {\displaystyle \varepsilon _{1}} but not the latter, so ps ( 1 ) = e 1 {\displaystyle \psi (1)=\varepsilon _{1}} . In this manner, we prove that ps ( a ) = e a {\displaystyle \psi (\alpha )=\varepsilon _{\alpha }} inductively on a {\displaystyle \alpha } : the proof works, however, only as long as a < e a {\displaystyle \alpha <\varepsilon _{\alpha }} . We therefore have:

ps ( a ) = e a = ph 1 ( a ) {\displaystyle \psi (\alpha )=\varepsilon _{\alpha }=\varphi _{1}(\alpha )} for all a <= z 0 {\displaystyle \alpha \leq \zeta _{0}} , where z 0 = ph 2 ( 0 ) {\displaystyle \zeta _{0}=\varphi _{2}(0)} is the smallest fixed point of a - e a {\displaystyle \alpha \mapsto \varepsilon _{\alpha }} .

(Here, the ph {\displaystyle \varphi } functions are the Veblen functions defined starting with ph 1 ( a ) = e a {\displaystyle \varphi _{1}(\alpha )=\varepsilon _{\alpha }} .)

Now ps ( z 0 ) = z 0 {\displaystyle \psi (\zeta _{0})=\zeta _{0}} but ps ( z 0 + 1 ) {\displaystyle \psi (\zeta _{0}+1)} is no larger, since z 0 {\displaystyle \zeta _{0}} cannot be constructed using finite applications of ph 1 : a - e a {\displaystyle \varphi _{1}\colon \alpha \mapsto \varepsilon _{\alpha }} and thus never belongs to a C ( a ) {\displaystyle C(\alpha )} set for a <= O {\displaystyle \alpha \leq \Omega } , and the function ps {\displaystyle \psi } remains "stuck" at z 0 {\displaystyle \zeta _{0}} for some time:

ps ( a ) = z 0 {\displaystyle \psi (\alpha )=\zeta _{0}} for all z 0 <= a <= O {\displaystyle \zeta _{0}\leq \alpha \leq \Omega } .

First impredicative values

[edit]

Again, ps ( O ) = z 0 {\displaystyle \psi (\Omega )=\zeta _{0}} . However, when we come to computing ps ( O + 1 ) {\displaystyle \psi (\Omega +1)} , something has changed: since O {\displaystyle \Omega } was ("artificially") added to all the C ( a ) {\displaystyle C(\alpha )} , we are permitted to take the value ps ( O ) = z 0 {\displaystyle \psi (\Omega )=\zeta _{0}} in the process. So C ( O + 1 ) {\displaystyle C(\Omega +1)} contains all ordinals which can be built from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } , the ph 1 : a - e a {\displaystyle \varphi _{1}\colon \alpha \mapsto \varepsilon _{\alpha }} function up to z 0 {\displaystyle \zeta _{0}} and this time also z 0 {\displaystyle \zeta _{0}} itself, using addition, multiplication and exponentiation. The smallest ordinal not in C ( O + 1 ) {\displaystyle C(\Omega +1)} is e z 0 + 1 {\displaystyle \varepsilon _{\zeta _{0}+1}} (the smallest e {\displaystyle \varepsilon } -number after z 0 {\displaystyle \zeta _{0}} ).

We say that the definition ps ( O ) = z 0 {\displaystyle \psi (\Omega )=\zeta _{0}} and the next values of the function ps {\displaystyle \psi } such as ps ( O + 1 ) = e z 0 + 1 {\displaystyle \psi (\Omega +1)=\varepsilon _{\zeta _{0}+1}} are impredicative because they use ordinals (here, O {\displaystyle \Omega } ) greater than the ones that are being defined (here, z 0 {\displaystyle \zeta _{0}} ).

Values of ps up to the Feferman-Schutte ordinal

[edit]

The fact that ps ( O + a ) {\displaystyle \psi (\Omega +\alpha )} equals e z 0 + a {\displaystyle \varepsilon _{\zeta _{0}+\alpha }} remains true for all a <= z 1 = ph 2 ( 1 ) {\displaystyle \alpha \leq \zeta _{1}=\varphi _{2}(1)} . (Note, in particular, that ps ( O + z 0 ) = e z 0 2 {\displaystyle \psi (\Omega +\zeta _{0})=\varepsilon _{\zeta _{0}\cdot 2}} : but since now the ordinal z 0 {\displaystyle \zeta _{0}} has been constructed there is nothing to prevent from going beyond this). However, at z 1 = ph 2 ( 1 ) {\displaystyle \zeta _{1}=\varphi _{2}(1)} (the first fixed point of a - e a {\displaystyle \alpha \mapsto \varepsilon _{\alpha }} beyond z 0 {\displaystyle \zeta _{0}} ), the construction stops again, because z 1 {\displaystyle \zeta _{1}} cannot be constructed from smaller ordinals and z 0 {\displaystyle \zeta _{0}} by finitely applying the e {\displaystyle \varepsilon } function. So we have ps ( O 2 ) = z 1 {\displaystyle \psi (\Omega \cdot 2)=\zeta _{1}} .

The same reasoning shows that ps ( O ( 1 + a ) ) = ph 2 ( a ) {\displaystyle \psi (\Omega \cdot (1+\alpha ))=\varphi _{2}(\alpha )} for all a <= ph 3 ( 0 ) = e 0 {\displaystyle \alpha \leq \varphi _{3}(0)=\eta _{0}} , where ph 2 {\displaystyle \varphi _{2}} enumerates the fixed points of ph 1 : a - e a {\displaystyle \varphi _{1}\colon \alpha \mapsto \varepsilon _{\alpha }} and ph 3 ( 0 ) {\displaystyle \varphi _{3}(0)} is the first fixed point of ph 2 {\displaystyle \varphi _{2}} . We then have ps ( O 2 ) = ph 3 ( 0 ) {\displaystyle \psi (\Omega ^{2})=\varphi _{3}(0)} .

Again, we can see that ps ( O a ) = ph 1 + a ( 0 ) {\displaystyle \psi (\Omega ^{\alpha })=\varphi _{1+\alpha }(0)} for some time: this remains true until the first fixed point G 0 {\displaystyle \Gamma _{0}} of a - ph a ( 0 ) {\displaystyle \alpha \mapsto \varphi _{\alpha }(0)} , which is the Feferman-Schutte ordinal. Thus, ps ( O O ) = G 0 {\displaystyle \psi (\Omega ^{\Omega })=\Gamma _{0}} is the Feferman-Schutte ordinal.

Beyond the Feferman-Schutte ordinal

[edit]

We have ps ( O O + O a ) = ph G 0 + a ( 0 ) {\displaystyle \psi (\Omega ^{\Omega }+\Omega ^{\alpha })=\varphi _{\Gamma _{0}+\alpha }(0)} for all a <= G 1 {\displaystyle \alpha \leq \Gamma _{1}} where G 1 {\displaystyle \Gamma _{1}} is the next fixed point of a - ph a ( 0 ) {\displaystyle \alpha \mapsto \varphi _{\alpha }(0)} . So, if a - G a {\displaystyle \alpha \mapsto \Gamma _{\alpha }} enumerates the fixed points in question (which can also be noted ph ( 1 , 0 , a ) {\displaystyle \varphi (1,0,\alpha )} using the many-valued Veblen functions) we have ps ( O O ( 1 + a ) ) = G a {\displaystyle \psi (\Omega ^{\Omega }(1+\alpha ))=\Gamma _{\alpha }} , until the first fixed point ph ( 1 , 1 , 0 ) {\displaystyle \varphi (1,1,0)} of the a - G a {\displaystyle \alpha \mapsto \Gamma _{\alpha }} itself, which will be ps ( O O + 1 ) {\displaystyle \psi (\Omega ^{\Omega +1})} (and the first fixed point ph ( 2 , 0 , 0 ) {\displaystyle \varphi (2,0,0)} of the a - ph ( 1 , a , 0 ) {\displaystyle \alpha \mapsto \varphi (1,\alpha ,0)} functions will be ps ( O O 2 ) {\displaystyle \psi (\Omega ^{\Omega \cdot 2})} ). In this manner:

  • ps ( O O 2 ) {\displaystyle \psi (\Omega ^{\Omega ^{2}})} is the Ackermann ordinal (the range of the notation ph ( a , b , g ) {\displaystyle \varphi (\alpha ,\beta ,\gamma )} defined predicatively),
  • ps ( O O o ) {\displaystyle \psi (\Omega ^{\Omega ^{\omega }})} is the "small" Veblen ordinal (the range of the notations ph ( ) {\displaystyle \varphi (\cdot )} defined predicatively using finitely many variables),
  • ps ( O O O ) {\displaystyle \psi (\Omega ^{\Omega ^{\Omega }})} is the "large" Veblen ordinal (the range of the notations ph ( ) {\displaystyle \varphi (\cdot )} defined predicatively using transfinitely-but-predicatively-many variables),
  • the limit ps ( e O + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1})} of ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O O ) {\displaystyle \psi (\Omega ^{\Omega })} , ps ( O O O ) {\displaystyle \psi (\Omega ^{\Omega ^{\Omega }})} , etc., is the Bachmann-Howard ordinal: after this our function ps {\displaystyle \psi } is constant, and we can go no further with the definition we have given.

Ordinal notations up to the Bachmann-Howard ordinal

[edit]

We now explain more systematically how the ps {\displaystyle \psi } function defines notations for ordinals up to the Bachmann-Howard ordinal.

A note about base representations

[edit]

Recall that if d {\displaystyle \delta } is an ordinal that is a power of o {\displaystyle \omega } (for example o {\displaystyle \omega } itself, or e 0 {\displaystyle \varepsilon _{0}} , or O {\displaystyle \Omega } ), any ordinal a {\displaystyle \alpha } can be uniquely expressed in the form d b 1 g 1 + ... + d b k g k {\displaystyle \delta ^{\beta _{1}}\gamma _{1}+\ldots +\delta ^{\beta _{k}}\gamma _{k}} , where k {\displaystyle k} is a natural number, g 1 , ... , g k {\displaystyle \gamma _{1},\ldots ,\gamma _{k}} are non-zero ordinals less than d {\displaystyle \delta } , and b 1 > b 2 > > b k {\displaystyle \beta _{1}>\beta _{2}>\cdots >\beta _{k}} are ordinal numbers (we allow b k = 0 {\displaystyle \beta _{k}=0} ). This "base d {\displaystyle \delta } representation" is an obvious generalization of the Cantor normal form (which is the case d = o {\displaystyle \delta =\omega } ). Of course, it may quite well be that the expression is uninteresting, i.e., a = d a {\displaystyle \alpha =\delta ^{\alpha }} , but in any other case the b i {\displaystyle \beta _{i}} must all be less than a {\displaystyle \alpha } ; it may also be the case that the expression is trivial (i.e., a < d {\displaystyle \alpha <\delta } , in which case k <= 1 {\displaystyle k\leq 1} and g 1 = a {\displaystyle \gamma _{1}=\alpha } ).

If a {\displaystyle \alpha } is an ordinal less than e O + 1 {\displaystyle \varepsilon _{\Omega +1}} , then its base O {\displaystyle \Omega } representation has coefficients g i < O {\displaystyle \gamma _{i}<\Omega } (by definition) and exponents b i < a {\displaystyle \beta _{i}<\alpha } (because of the assumption a < e O + 1 {\displaystyle \alpha <\varepsilon _{\Omega +1}} ): hence one can rewrite these exponents in base O {\displaystyle \Omega } and repeat the operation until the process terminates (any decreasing sequence of ordinals is finite). We call the resulting expression the iterated base O {\displaystyle \Omega } representation of a {\displaystyle \alpha } and the various coefficients involved (including as exponents) the pieces of the representation (they are all < O {\displaystyle <\Omega } ), or, for short, the O {\displaystyle \Omega } -pieces of a {\displaystyle \alpha } .

Some properties of ps

[edit]
  • The function ps {\displaystyle \psi } is non-decreasing and continuous (this is more or less obvious from its definition).
  • If ps ( a ) = ps ( b ) {\displaystyle \psi (\alpha )=\psi (\beta )} with b < a {\displaystyle \beta <\alpha } then necessarily C ( a ) = C ( b ) {\displaystyle C(\alpha )=C(\beta )} . Indeed, no ordinal b ' {\displaystyle \beta '} with b <= b ' < a {\displaystyle \beta \leq \beta '<\alpha } can belong to C ( a ) {\displaystyle C(\alpha )} (otherwise its image by ps {\displaystyle \psi } , which is ps ( a ) {\displaystyle \psi (\alpha )} would belong to C ( a ) {\displaystyle C(\alpha )} -- impossible); so C ( b ) {\displaystyle C(\beta )} is closed by everything under which C ( a ) {\displaystyle C(\alpha )} is the closure, so they are equal.
  • Any value g = ps ( a ) {\displaystyle \gamma =\psi (\alpha )} taken by ps {\displaystyle \psi } is an e {\displaystyle \varepsilon } -number (i.e., a fixed point of b - o b {\displaystyle \beta \mapsto \omega ^{\beta }} ). Indeed, if it were not, then by writing it in Cantor normal form, it could be expressed using sums, products and exponentiation from elements less than it, hence in C ( a ) {\displaystyle C(\alpha )} , so it would be in C ( a ) {\displaystyle C(\alpha )} , a contradiction.
  • Lemma: Assume d {\displaystyle \delta } is an e {\displaystyle \varepsilon } -number and a {\displaystyle \alpha } an ordinal such that ps ( b ) < d {\displaystyle \psi (\beta )<\delta } for all b < a {\displaystyle \beta <\alpha } : then the O {\displaystyle \Omega } -pieces (defined above) of any element of C ( a ) {\displaystyle C(\alpha )} are less than d {\displaystyle \delta } . Indeed, let C ' {\displaystyle C'} be the set of ordinals all of whose O {\displaystyle \Omega } -pieces are less than d {\displaystyle \delta } . Then C ' {\displaystyle C'} is closed under addition, multiplication and exponentiation (because d {\displaystyle \delta } is an e {\displaystyle \varepsilon } -number, so ordinals less than it are closed under addition, multiplication and exponentiation). And C ' {\displaystyle C'} also contains every ps ( b ) {\displaystyle \psi (\beta )} for b < a {\displaystyle \beta <\alpha } by assumption, and it contains 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } . So C ' C ( a ) {\displaystyle C'\supseteq C(\alpha )} , which was to be shown.
  • Under the hypothesis of the previous lemma, ps ( a ) <= d {\displaystyle \psi (\alpha )\leq \delta } (indeed, the lemma shows that d C ( a ) {\displaystyle \delta \not \in C(\alpha )} ).
  • Any e {\displaystyle \varepsilon } -number less than some element in the range of ps {\displaystyle \psi } is itself in the range of ps {\displaystyle \psi } (that is, ps {\displaystyle \psi } omits no e {\displaystyle \varepsilon } -number). Indeed: if d {\displaystyle \delta } is an e {\displaystyle \varepsilon } -number not greater than the range of ps {\displaystyle \psi } , let a {\displaystyle \alpha } be the least upper bound of the b {\displaystyle \beta } such that ps ( b ) < d {\displaystyle \psi (\beta )<\delta } : then by the above we have ps ( a ) <= d {\displaystyle \psi (\alpha )\leq \delta } , but ps ( a ) < d {\displaystyle \psi (\alpha )<\delta } would contradict the fact that a {\displaystyle \alpha } is the least upper bound -- so ps ( a ) = d {\displaystyle \psi (\alpha )=\delta } .
  • Whenever ps ( a ) = d {\displaystyle \psi (\alpha )=\delta } , the set C ( a ) {\displaystyle C(\alpha )} consists exactly of those ordinals g {\displaystyle \gamma } (less than e O + 1 {\displaystyle \varepsilon _{\Omega +1}} ) all of whose O {\displaystyle \Omega } -pieces are less than d {\displaystyle \delta } . Indeed, we know that all ordinals less than d {\displaystyle \delta } , hence all ordinals (less than e O + 1 {\displaystyle \varepsilon _{\Omega +1}} ) whose O {\displaystyle \Omega } -pieces are less than d {\displaystyle \delta } , are in C ( a ) {\displaystyle C(\alpha )} . Conversely, if we assume ps ( b ) < d {\displaystyle \psi (\beta )<\delta } for all b < a {\displaystyle \beta <\alpha } (in other words if a {\displaystyle \alpha } is the least possible with ps ( a ) = d {\displaystyle \psi (\alpha )=\delta } ), the lemma gives the desired property. On the other hand, if ps ( a ) = ps ( b ) {\displaystyle \psi (\alpha )=\psi (\beta )} for some b < a {\displaystyle \beta <\alpha } , then we have already remarked C ( a ) = C ( b ) {\displaystyle C(\alpha )=C(\beta )} and we can replace a {\displaystyle \alpha } by the least possible with ps ( a ) = d {\displaystyle \psi (\alpha )=\delta } .

The ordinal notation

[edit]

Using the facts above, we can define a (canonical) ordinal notation for every g {\displaystyle \gamma } less than the Bachmann-Howard ordinal. We do this by induction on g {\displaystyle \gamma } .

If g {\displaystyle \gamma } is less than e 0 {\displaystyle \varepsilon _{0}} , we use the iterated Cantor normal form of g {\displaystyle \gamma } . Otherwise, there exists a largest e {\displaystyle \varepsilon } -number d {\displaystyle \delta } less or equal to g {\displaystyle \gamma } (this is because the set of e {\displaystyle \varepsilon } -numbers is closed): if d < g {\displaystyle \delta <\gamma } then by induction we have defined a notation for d {\displaystyle \delta } and the base d {\displaystyle \delta } representation of g {\displaystyle \gamma } gives one for g {\displaystyle \gamma } , so we are finished.

It remains to deal with the case where g = d {\displaystyle \gamma =\delta } is an e {\displaystyle \varepsilon } -number: we have argued that, in this case, we can write d = ps ( a ) {\displaystyle \delta =\psi (\alpha )} for some (possibly uncountable) ordinal a < e O + 1 {\displaystyle \alpha <\varepsilon _{\Omega +1}} : let a {\displaystyle \alpha } be the greatest possible such ordinal (which exists since ps {\displaystyle \psi } is continuous). We use the iterated base O {\displaystyle \Omega } representation of a {\displaystyle \alpha } : it remains to show that every piece of this representation is less than d {\displaystyle \delta } (so we have already defined a notation for it). If this is not the case then, by the properties we have shown, C ( a ) {\displaystyle C(\alpha )} does not contain a {\displaystyle \alpha } ; but then C ( a + 1 ) = C ( a ) {\displaystyle C(\alpha +1)=C(\alpha )} (they are closed under the same operations, since the value of ps {\displaystyle \psi } at a {\displaystyle \alpha } can never be taken), so ps ( a + 1 ) = ps ( a ) = d {\displaystyle \psi (\alpha +1)=\psi (\alpha )=\delta } , contradicting the maximality of a {\displaystyle \alpha } .

Note: Actually, we have defined canonical notations not just for ordinals below the Bachmann-Howard ordinal but also for certain uncountable ordinals, namely those whose O {\displaystyle \Omega } -pieces are less than the Bachmann-Howard ordinal (viz.: write them in iterated base O {\displaystyle \Omega } representation and use the canonical representation for every piece). This canonical notation is used for arguments of the ps {\displaystyle \psi } function (which may be uncountable).

Examples

[edit]

For ordinals less than e 0 = ps ( 0 ) {\displaystyle \varepsilon _{0}=\psi (0)} , the canonical ordinal notation defined coincides with the iterated Cantor normal form (by definition).

For ordinals less than e 1 = ps ( 1 ) {\displaystyle \varepsilon _{1}=\psi (1)} , the notation coincides with iterated base e 0 {\displaystyle \varepsilon _{0}} notation (the pieces being themselves written in iterated Cantor normal form): e.g., o o e 0 + o {\displaystyle \omega ^{\omega ^{\varepsilon _{0}+\omega }}} will be written e 0 o o {\displaystyle {\varepsilon _{0}}^{\omega ^{\omega }}} , or, more accurately, ps ( 0 ) o o {\displaystyle \psi (0)^{\omega ^{\omega }}} . For ordinals less than e 2 = ps ( 2 ) {\displaystyle \varepsilon _{2}=\psi (2)} , we similarly write in iterated base e 1 {\displaystyle \varepsilon _{1}} and then write the pieces in iterated base e 0 {\displaystyle \varepsilon _{0}} (and write the pieces of that in iterated Cantor normal form): so o o e 1 + e 0 + 1 {\displaystyle \omega ^{\omega ^{\varepsilon _{1}+\varepsilon _{0}+1}}} is written e 1 e 0 o {\displaystyle {\varepsilon _{1}}^{\varepsilon _{0}\omega }} , or, more accurately, ps ( 1 ) ps ( 0 ) o {\displaystyle \psi (1)^{\psi (0)\,\omega }} . Thus, up to z 0 = ps ( O ) {\displaystyle \zeta _{0}=\psi (\Omega )} , we always use the largest possible e {\displaystyle \varepsilon } -number base which gives a non-trivial representation.

Beyond this, we may need to express ordinals beyond O {\displaystyle \Omega } : this is always done in iterated O {\displaystyle \Omega } -base, and the pieces themselves need to be expressed using the largest possible e {\displaystyle \varepsilon } -number base which gives a non-trivial representation.

Note that while ps ( e O + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1})} is equal to the Bachmann-Howard ordinal, this is not a "canonical notation" in the sense we have defined (canonical notations are defined only for ordinals less than the Bachmann-Howard ordinal).

Conditions for canonicalness

[edit]

The notations thus defined have the property that whenever they nest ps {\displaystyle \psi } functions, the arguments of the "inner" ps {\displaystyle \psi } function are always less than those of the "outer" one (this is a consequence of the fact that the O {\displaystyle \Omega } -pieces of a {\displaystyle \alpha } , where a {\displaystyle \alpha } is the largest possible such that ps ( a ) = d {\displaystyle \psi (\alpha )=\delta } for some e {\displaystyle \varepsilon } -number d {\displaystyle \delta } , are all less than d {\displaystyle \delta } , as we have shown above). For example, ps ( ps ( O ) + 1 ) {\displaystyle \psi (\psi (\Omega )+1)} does not occur as a notation: it is a well-defined expression (and it is equal to ps ( O ) = z 0 {\displaystyle \psi (\Omega )=\zeta _{0}} since ps {\displaystyle \psi } is constant between z 0 {\displaystyle \zeta _{0}} and O {\displaystyle \Omega } ), but it is not a notation produced by the inductive algorithm we have outlined.

Canonicalness can be checked recursively: an expression is canonical if and only if it is either the iterated Cantor normal form of an ordinal less than e 0 {\displaystyle \varepsilon _{0}} , or an iterated base d {\displaystyle \delta } representation all of whose pieces are canonical, for some d = ps ( a ) {\displaystyle \delta =\psi (\alpha )} where a {\displaystyle \alpha } is itself written in iterated base O {\displaystyle \Omega } representation all of whose pieces are canonical and less than d {\displaystyle \delta } . The order is checked by lexicographic verification at all levels (keeping in mind that O {\displaystyle \Omega } is greater than any expression obtained by ps {\displaystyle \psi } , and for canonical values the greater ps {\displaystyle \psi } always trumps the lesser or even arbitrary sums, products and exponentials of the lesser).

For example, ps ( O o + 1 ps ( O ) + ps ( O o ) ps ( O 2 ) 42 ) ps ( 1729 ) o {\displaystyle \psi (\Omega ^{\omega +1}\,\psi (\Omega )+\psi (\Omega ^{\omega })^{\psi (\Omega ^{2})}42)^{\psi (1729)\,\omega }} is a canonical notation for an ordinal which is less than the Feferman-Schutte ordinal: it can be written using the Veblen functions as ph 1 ( ph o + 1 ( ph 2 ( 0 ) ) + ph o ( 0 ) ph 3 ( 0 ) 42 ) ph 1 ( 1729 ) o {\displaystyle \varphi _{1}(\varphi _{\omega +1}(\varphi _{2}(0))+\varphi _{\omega }(0)^{\varphi _{3}(0)}42)^{\varphi _{1}(1729)\,\omega }} .

Concerning the order, one might point out that ps ( O O ) {\displaystyle \psi (\Omega ^{\Omega })} (the Feferman-Schutte ordinal) is much more than ps ( O ps ( O ) ) = ph ph 2 ( 0 ) ( 0 ) {\displaystyle \psi (\Omega ^{\psi (\Omega )})=\varphi _{\varphi _{2}(0)}(0)} (because O {\displaystyle \Omega } is greater than ps {\displaystyle \psi } of anything), and ps ( O ps ( O ) ) = ph ph 2 ( 0 ) ( 0 ) {\displaystyle \psi (\Omega ^{\psi (\Omega )})=\varphi _{\varphi _{2}(0)}(0)} is itself much more than ps ( O ) ps ( O ) = ph 2 ( 0 ) ph 2 ( 0 ) {\displaystyle \psi (\Omega )^{\psi (\Omega )}=\varphi _{2}(0)^{\varphi _{2}(0)}} (because O ps ( O ) {\displaystyle \Omega ^{\psi (\Omega )}} is greater than O {\displaystyle \Omega } , so any sum-product-or-exponential expression involving ps ( O ) {\displaystyle \psi (\Omega )} and smaller value will remain less than ps ( O O ) {\displaystyle \psi (\Omega ^{\Omega })} ). In fact, ps ( O ) ps ( O ) {\displaystyle \psi (\Omega )^{\psi (\Omega )}} is already less than ps ( O + 1 ) {\displaystyle \psi (\Omega +1)} .

Standard sequences for ordinal notations

[edit]

To witness the fact that we have defined notations for ordinals below the Bachmann-Howard ordinal (which are all of countable cofinality), we might define standard sequences converging to any one of them (provided it is a limit ordinal, of course). Actually we will define canonical sequences for certain uncountable ordinals, too, namely the uncountable ordinals of countable cofinality (if we are to hope to define a sequence converging to them...) which are representable (that is, all of whose O {\displaystyle \Omega } -pieces are less than the Bachmann-Howard ordinal).

The following rules are more or less obvious, except for the last:

  • First, get rid of the (iterated) base d {\displaystyle \delta } representations: to define a standard sequence converging to a = d b 1 g 1 + + d b k g k {\displaystyle \alpha =\delta ^{\beta _{1}}\gamma _{1}+\cdots +\delta ^{\beta _{k}}\gamma _{k}} , where d {\displaystyle \delta } is either o {\displaystyle \omega } or ps ( ) {\displaystyle \psi (\cdots )} (or O {\displaystyle \Omega } , but see below):
    • if k {\displaystyle k} is zero then a = 0 {\displaystyle \alpha =0} and there is nothing to be done;
    • if b k {\displaystyle \beta _{k}} is zero and g k {\displaystyle \gamma _{k}} is successor, then a {\displaystyle \alpha } is successor and there is nothing to be done;
    • if g k {\displaystyle \gamma _{k}} is limit, take the standard sequence converging to g k {\displaystyle \gamma _{k}} and replace g k {\displaystyle \gamma _{k}} in the expression by the elements of that sequence;
    • if g k {\displaystyle \gamma _{k}} is successor and b k {\displaystyle \beta _{k}} is limit, rewrite the last term d b k g k {\displaystyle \delta ^{\beta _{k}}\gamma _{k}} as d b k ( g k - 1 ) + d b k {\displaystyle \delta ^{\beta _{k}}(\gamma _{k}-1)+\delta ^{\beta _{k}}} and replace the exponent b k {\displaystyle \beta _{k}} in the last term by the elements of the fundamental sequence converging to it;
    • if g k {\displaystyle \gamma _{k}} is successor and b k {\displaystyle \beta _{k}} is also, rewrite the last term d b k g k {\displaystyle \delta ^{\beta _{k}}\gamma _{k}} as d b k ( g k - 1 ) + d b k - 1 d {\displaystyle \delta ^{\beta _{k}}(\gamma _{k}-1)+\delta ^{\beta _{k}-1}\delta } and replace the last d {\displaystyle \delta } in this expression by the elements of the fundamental sequence converging to it.
  • If d {\displaystyle \delta } is o {\displaystyle \omega } , then take the obvious 0 , 1 , 2 , 3 , ... {\displaystyle 0,1,2,3,\ldots } as the fundamental sequence for d {\displaystyle \delta } .
  • If d = ps ( 0 ) {\displaystyle \delta =\psi (0)} then take as fundamental sequence for d {\displaystyle \delta } the sequence o , o o , o o o , ... {\displaystyle \omega ,\omega ^{\omega },\omega ^{\omega ^{\omega }},\ldots }
  • If d = ps ( a + 1 ) {\displaystyle \delta =\psi (\alpha +1)} then take as fundamental sequence for d {\displaystyle \delta } the sequence ps ( a ) , ps ( a ) ps ( a ) , ps ( a ) ps ( a ) ps ( a ) , ... {\displaystyle \psi (\alpha ),\psi (\alpha )^{\psi (\alpha )},\psi (\alpha )^{\psi (\alpha )^{\psi (\alpha )}},\ldots }
  • If d = ps ( a ) {\displaystyle \delta =\psi (\alpha )} where a {\displaystyle \alpha } is a limit ordinal of countable cofinality, define the standard sequence for d {\displaystyle \delta } to be obtained by applying ps {\displaystyle \psi } to the standard sequence for a {\displaystyle \alpha } (recall that ps {\displaystyle \psi } is continuous and increasing, here).
  • It remains to handle the case where d = ps ( a ) {\displaystyle \delta =\psi (\alpha )} with a {\displaystyle \alpha } an ordinal of uncountable cofinality (e.g., O {\displaystyle \Omega } itself). Obviously it doesn't make sense to define a sequence converging to a {\displaystyle \alpha } in this case; however, what we can define is a sequence converging to some r < a {\displaystyle \rho <\alpha } with countable cofinality and such that ps {\displaystyle \psi } is constant between r {\displaystyle \rho } and a {\displaystyle \alpha } . This r {\displaystyle \rho } will be the first fixed point of a certain (continuous and non-decreasing) function x - h ( ps ( x ) ) {\displaystyle \xi \mapsto h(\psi (\xi ))} . To find it, apply the same rules (from the base O {\displaystyle \Omega } representation of a {\displaystyle \alpha } ) as to find the canonical sequence of a {\displaystyle \alpha } , except that whenever a sequence converging to O {\displaystyle \Omega } is called for (something which cannot exist), replace the O {\displaystyle \Omega } in question, in the expression of a = h ( O ) {\displaystyle \alpha =h(\Omega )} , by a ps ( x ) {\displaystyle \psi (\xi )} (where x {\displaystyle \xi } is a variable) and perform a repeated iteration (starting from 0 {\displaystyle 0} , say) of the function x - h ( ps ( x ) ) {\displaystyle \xi \mapsto h(\psi (\xi ))} : this gives a sequence 0 , h ( ps ( 0 ) ) , h ( ps ( h ( ps ( 0 ) ) ) ) , ... {\displaystyle 0,h(\psi (0)),h(\psi (h(\psi (0)))),\ldots } tending to r {\displaystyle \rho } , and the canonical sequence for ps ( a ) = ps ( r ) {\displaystyle \psi (\alpha )=\psi (\rho )} is ps ( 0 ) {\displaystyle \psi (0)} , ps ( h ( ps ( 0 ) ) ) {\displaystyle \psi (h(\psi (0)))} , ps ( h ( ps ( h ( ps ( 0 ) ) ) ) ) {\displaystyle \psi (h(\psi (h(\psi (0)))))} ... If we let the n {\displaystyle n} th element (starting at 0 {\displaystyle 0} ) of the fundamental sequence for d {\displaystyle \delta } be denoted as d [ n ] {\displaystyle \delta [n]} , then we can state this more clearly using recursion. Using this notation, we can see that d [ 0 ] = ps ( 0 ) {\displaystyle \delta [0]=\psi (0)} quite easily. We can define the rest of the sequence using recursion: d [ n ] = ps ( h ( d [ n - 1 ] ) ) {\displaystyle \delta [n]=\psi (h(\delta [n-1]))} . (The examples below should make this clearer.)

Here are some examples for the last (and most interesting) case:

  • The canonical sequence for ps ( O ) {\displaystyle \psi (\Omega )} is: ps ( 0 ) {\displaystyle \psi (0)} , ps ( ps ( 0 ) ) {\displaystyle \psi (\psi (0))} , ps ( ps ( ps ( 0 ) ) ) {\displaystyle \psi (\psi (\psi (0)))} ... This indeed converges to r = ps ( O ) = z 0 {\displaystyle \rho =\psi (\Omega )=\zeta _{0}} after which ps {\displaystyle \psi } is constant until O {\displaystyle \Omega } .
  • The canonical sequence for ps ( O 2 ) {\displaystyle \psi (\Omega 2)} is: ps ( 0 ) {\displaystyle \psi (0)} , ps ( O + ps ( 0 ) ) {\displaystyle \psi (\Omega +\psi (0))} , ps ( O + ps ( O + ps ( 0 ) ) ) , ... {\displaystyle \psi (\Omega +\psi (\Omega +\psi (0))),\ldots } This indeed converges to the value of ps {\displaystyle \psi } at r = O + ps ( O 2 ) = O + z 1 {\displaystyle \rho =\Omega +\psi (\Omega 2)=\Omega +\zeta _{1}} after which ps {\displaystyle \psi } is constant until O 2 {\displaystyle \Omega 2} .
  • The canonical sequence for ps ( O 2 ) {\displaystyle \psi (\Omega ^{2})} is: ps ( 0 ) , ps ( O ps ( 0 ) ) , ps ( O ps ( O ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega \psi (0)),\psi (\Omega \psi (\Omega \psi (0))),\ldots } This converges to the value of ps {\displaystyle \psi } at r = O ps ( O 2 ) {\displaystyle \rho =\Omega \psi (\Omega ^{2})} .
  • The canonical sequence for ps ( O 2 3 + O ) {\displaystyle \psi (\Omega ^{2}3+\Omega )} is ps ( 0 ) , ps ( O 2 3 + ps ( 0 ) ) , ps ( O 2 3 + ps ( O 2 3 + ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega ^{2}3+\psi (0)),\psi (\Omega ^{2}3+\psi (\Omega ^{2}3+\psi (0))),\ldots } This converges to the value of ps {\displaystyle \psi } at r = O 2 3 + ps ( O 2 3 + O ) {\displaystyle \rho =\Omega ^{2}3+\psi (\Omega ^{2}3+\Omega )} .
  • The canonical sequence for ps ( O O ) {\displaystyle \psi (\Omega ^{\Omega })} is: ps ( 0 ) , ps ( O ps ( 0 ) ) , ps ( O ps ( O ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega ^{\psi (0)}),\psi (\Omega ^{\psi (\Omega ^{\psi (0)})}),\ldots } This converges to the value of ps {\displaystyle \psi } at r = O ps ( O O ) {\displaystyle \rho =\Omega ^{\psi (\Omega ^{\Omega })}} .
  • The canonical sequence for ps ( O O 3 ) {\displaystyle \psi (\Omega ^{\Omega }3)} is: ps ( 0 ) , ps ( O O 2 + O ps ( 0 ) ) , ps ( O O 2 + O ps ( O O 2 + O ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega ^{\Omega }2+\Omega ^{\psi (0)}),\psi (\Omega ^{\Omega }2+\Omega ^{\psi (\Omega ^{\Omega }2+\Omega ^{\psi (0)})}),\ldots } This converges to the value of ps {\displaystyle \psi } at r = O O 2 + O ps ( O O 3 ) {\displaystyle \rho =\Omega ^{\Omega }2+\Omega ^{\psi (\Omega ^{\Omega }3)}} .
  • The canonical sequence for ps ( O O + 1 ) {\displaystyle \psi (\Omega ^{\Omega +1})} is: ps ( 0 ) , ps ( O O ps ( 0 ) ) , ps ( O O ps ( O O ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega ^{\Omega }\psi (0)),\psi (\Omega ^{\Omega }\psi (\Omega ^{\Omega }\psi (0))),\ldots } This converges to the value of ps {\displaystyle \psi } at r = O O ps ( O O + 1 ) {\displaystyle \rho =\Omega ^{\Omega }\psi (\Omega ^{\Omega +1})} .
  • The canonical sequence for ps ( O O 2 + O 3 ) {\displaystyle \psi (\Omega ^{\Omega ^{2}+\Omega 3})} is: ps ( 0 ) , ps ( O O 2 + O 2 + ps ( 0 ) ) , ps ( O O 2 + O 2 + ps ( O O 2 + O 2 + ps ( 0 ) ) ) , ... {\displaystyle \psi (0),\psi (\Omega ^{\Omega ^{2}+\Omega 2+\psi (0)}),\psi (\Omega ^{\Omega ^{2}+\Omega 2+\psi (\Omega ^{\Omega ^{2}+\Omega 2+\psi (0)})}),\ldots }

Here are some examples of the other cases:

  • The canonical sequence for o 2 {\displaystyle \omega ^{2}} is: 0 {\displaystyle 0} , o {\displaystyle \omega } , o 2 {\displaystyle \omega 2} , o 3 {\displaystyle \omega 3} ...
  • The canonical sequence for ps ( o o ) {\displaystyle \psi (\omega ^{\omega })} is: ps ( 1 ) {\displaystyle \psi (1)} , ps ( o ) {\displaystyle \psi (\omega )} , ps ( o 2 ) {\displaystyle \psi (\omega ^{2})} , ps ( o 3 ) {\displaystyle \psi (\omega ^{3})} ...
  • The canonical sequence for ps ( O ) o {\displaystyle \psi (\Omega )^{\omega }} is: 1 {\displaystyle 1} , ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O ) 2 {\displaystyle \psi (\Omega )^{2}} , ps ( O ) 3 {\displaystyle \psi (\Omega )^{3}} ...
  • The canonical sequence for ps ( O + 1 ) {\displaystyle \psi (\Omega +1)} is: ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O ) ps ( O ) {\displaystyle \psi (\Omega )^{\psi (\Omega )}} , ps ( O ) ps ( O ) ps ( O ) {\displaystyle \psi (\Omega )^{\psi (\Omega )^{\psi (\Omega )}}} ...
  • The canonical sequence for ps ( O + o ) {\displaystyle \psi (\Omega +\omega )} is: ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O + 1 ) {\displaystyle \psi (\Omega +1)} , ps ( O + 2 ) {\displaystyle \psi (\Omega +2)} , ps ( O + 3 ) {\displaystyle \psi (\Omega +3)} ...
  • The canonical sequence for ps ( O o ) {\displaystyle \psi (\Omega \omega )} is: ps ( 0 ) {\displaystyle \psi (0)} , ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O 2 ) {\displaystyle \psi (\Omega 2)} , ps ( O 3 ) {\displaystyle \psi (\Omega 3)} ...
  • The canonical sequence for ps ( O o ) {\displaystyle \psi (\Omega ^{\omega })} is: ps ( 1 ) {\displaystyle \psi (1)} , ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O 2 ) {\displaystyle \psi (\Omega ^{2})} , ps ( O 3 ) {\displaystyle \psi (\Omega ^{3})} ...
  • The canonical sequence for ps ( O ps ( 0 ) ) {\displaystyle \psi (\Omega ^{\psi (0)})} is: ps ( O o ) {\displaystyle \psi (\Omega ^{\omega })} , ps ( O o o ) {\displaystyle \psi (\Omega ^{\omega ^{\omega }})} , ps ( O o o o ) {\displaystyle \psi (\Omega ^{\omega ^{\omega ^{\omega }}})} ... (this is derived from the fundamental sequence for ps ( 0 ) {\displaystyle \psi (0)} ).
  • The canonical sequence for ps ( O ps ( O ) ) {\displaystyle \psi (\Omega ^{\psi (\Omega )})} is: ps ( O ps ( 0 ) ) {\displaystyle \psi (\Omega ^{\psi (0)})} , ps ( O ps ( ps ( 0 ) ) ) {\displaystyle \psi (\Omega ^{\psi (\psi (0))})} , ps ( O ps ( ps ( ps ( 0 ) ) ) ) {\displaystyle \psi (\Omega ^{\psi (\psi (\psi (0)))})} ... (this is derived from the fundamental sequence for ps ( O ) {\displaystyle \psi (\Omega )} , which was given above).

Even though the Bachmann-Howard ordinal ps ( e O + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1})} itself has no canonical notation, it is also useful to define a canonical sequence for it: this is ps ( O ) {\displaystyle \psi (\Omega )} , ps ( O O ) {\displaystyle \psi (\Omega ^{\Omega })} , ps ( O O O ) {\displaystyle \psi (\Omega ^{\Omega ^{\Omega }})} ...

A terminating process

[edit]

Start with any ordinal less than or equal to the Bachmann-Howard ordinal, and repeat the following process so long as it is not zero:

  • if the ordinal is a successor, subtract one (that is, replace it with its predecessor),
  • if it is a limit, replace it by some element of the canonical sequence defined for it.

Then it is true that this process always terminates (as any decreasing sequence of ordinals is finite); however, like (but even more so than for) the hydra game:

  1. it can take a very long time to terminate,
  2. the proof of termination may be out of reach of certain weak systems of arithmetic.

To give some flavor of what the process feels like, here are some steps of it: starting from ps ( O O o ) {\displaystyle \psi (\Omega ^{\Omega ^{\omega }})} (the small Veblen ordinal), we might go down to ps ( O O 3 ) {\displaystyle \psi (\Omega ^{\Omega ^{3}})} , from there down to ps ( O O 2 ps ( 0 ) ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\psi (0)})} , then ps ( O O 2 o o ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\omega ^{\omega }})} then ps ( O O 2 o 3 ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\omega ^{3}})} then ps ( O O 2 o 2 3 ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\omega ^{2}3})} then ps ( O O 2 ( o 2 2 + o ) ) {\displaystyle \psi (\Omega ^{\Omega ^{2}(\omega ^{2}2+\omega )})} then ps ( O O 2 ( o 2 2 + 1 ) ) {\displaystyle \psi (\Omega ^{\Omega ^{2}(\omega ^{2}2+1)})} then ps ( O O 2 o 2 2 + O ps ( O O 2 o 2 2 + O ps ( 0 ) ) ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\omega ^{2}2+\Omega \psi (\Omega ^{\Omega ^{2}\omega ^{2}2+\Omega \psi (0)})})} then ps ( O O 2 o 2 2 + O ps ( O O 2 o 2 2 + O o o o ) ) {\displaystyle \psi (\Omega ^{\Omega ^{2}\omega ^{2}2+\Omega \psi (\Omega ^{\Omega ^{2}\omega ^{2}2+\Omega \omega ^{\omega ^{\omega }}})})} and so on. It appears as though the expressions are getting more and more complicated whereas, in fact, the ordinals always decrease.

Concerning the first statement, one could introduce, for any ordinal a {\displaystyle \alpha } less or equal to the Bachmann-Howard ordinal ps ( e O + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1})} , the integer function f a ( n ) {\displaystyle f_{\alpha }(n)} which counts the number of steps of the process before termination if one always selects the n {\displaystyle n} 'th element from the canonical sequence (this function satisfies the identity f a ( n ) = f a [ n ] ( n ) + 1 {\displaystyle f_{\alpha }(n)=f_{\alpha [n]}(n)+1} ). Then f a {\displaystyle f_{\alpha }} can be a very fast growing function: already f o o ( n ) {\displaystyle f_{\omega ^{\omega }}(n)} is essentially n n {\displaystyle n^{n}} , the function f ps ( O o ) ( n ) {\displaystyle f_{\psi (\Omega ^{\omega })}(n)} is comparable with the Ackermann function A ( n , n ) {\displaystyle A(n,n)} , and f ps ( e O + 1 ) ( n ) {\displaystyle f_{\psi (\varepsilon _{\Omega +1})}(n)} is comparable with the Goodstein function. If we instead make a function that satisfies the identity g a ( n ) = g a [ n ] ( n + 1 ) + 1 {\displaystyle g_{\alpha }(n)=g_{\alpha [n]}(n+1)+1} , so the index of the function increases it is applied, then we create a much faster growing function: g ps ( 0 ) ( n ) {\displaystyle g_{\psi (0)}(n)} is already comparable to the Goodstein function, and g ps ( O O o o ) ( n ) {\displaystyle g_{\psi (\Omega ^{\Omega ^{\omega }\omega })}(n)} is comparable to the TREE function.

Concerning the second statement, a precise version is given by ordinal analysis: for example, Kripke-Platek set theory can prove[4] that the process terminates for any given a {\displaystyle \alpha } less than the Bachmann-Howard ordinal, but it cannot do this uniformly, i.e., it cannot prove the termination starting from the Bachmann-Howard ordinal. Some theories like Peano arithmetic are limited by much smaller ordinals ( e 0 {\displaystyle \varepsilon _{0}} in the case of Peano arithmetic).

Variations on the example

[edit]

Making the function less powerful

[edit]

It is instructive (although not exactly useful) to make ps {\displaystyle \psi } less powerful.

If we alter the definition of ps {\displaystyle \psi } above to omit exponentiation from the repertoire from which C ( a ) {\displaystyle C(\alpha )} is constructed, then we get ps ( 0 ) = o o {\displaystyle \psi (0)=\omega ^{\omega }} (as this is the smallest ordinal that cannot be constructed from 0 {\displaystyle 0} , 1 {\displaystyle 1} and o {\displaystyle \omega } using addition and multiplication only), then ps ( 1 ) = o o 2 {\displaystyle \psi (1)=\omega ^{\omega ^{2}}} and similarly ps ( o ) = o o o {\displaystyle \psi (\omega )=\omega ^{\omega ^{\omega }}} , ps ( ps ( 0 ) ) = o o o o {\displaystyle \psi (\psi (0))=\omega ^{\omega ^{\omega ^{\omega }}}} until we come to a fixed point which is then our ps ( O ) = e 0 {\displaystyle \psi (\Omega )=\varepsilon _{0}} . We then have ps ( O + 1 ) = e 0 o {\displaystyle \psi (\Omega +1)={\varepsilon _{0}}^{\omega }} and so on until ps ( O 2 ) = e 1 {\displaystyle \psi (\Omega 2)=\varepsilon _{1}} . Since multiplication of O {\displaystyle \Omega } 's is permitted, we can still form ps ( O 2 ) = ph 2 ( 0 ) {\displaystyle \psi (\Omega ^{2})=\varphi _{2}(0)} and ps ( O 3 ) = ph 3 ( 0 ) {\displaystyle \psi (\Omega ^{3})=\varphi _{3}(0)} and so on, but our construction ends there as there is no way to get at or beyond O o {\displaystyle \Omega ^{\omega }} : so the range of this weakened system of notation is ps ( O o ) = ph o ( 0 ) {\displaystyle \psi (\Omega ^{\omega })=\varphi _{\omega }(0)} (the value of ps ( O o ) {\displaystyle \psi (\Omega ^{\omega })} is the same in our weaker system as in our original system, except that now we cannot go beyond it). This does not even go as far as the Feferman-Schutte ordinal.

If we alter the definition of ps {\displaystyle \psi } yet some more to allow only addition as a primitive for construction, we get ps ( 0 ) = o 2 {\displaystyle \psi (0)=\omega ^{2}} and ps ( 1 ) = o 3 {\displaystyle \psi (1)=\omega ^{3}} and so on until ps ( ps ( 0 ) ) = o o 2 {\displaystyle \psi (\psi (0))=\omega ^{\omega ^{2}}} and still ps ( O ) = e 0 {\displaystyle \psi (\Omega )=\varepsilon _{0}} . This time, ps ( O + 1 ) = e 0 o {\displaystyle \psi (\Omega +1)=\varepsilon _{0}\omega } and so on until ps ( O 2 ) = e 1 {\displaystyle \psi (\Omega 2)=\varepsilon _{1}} and similarly ps ( O 3 ) = e 2 {\displaystyle \psi (\Omega 3)=\varepsilon _{2}} . But this time we can go no further: since we can only add O {\displaystyle \Omega } 's, the range of our system is ps ( O o ) = e o = ph 1 ( o ) {\displaystyle \psi (\Omega \omega )=\varepsilon _{\omega }=\varphi _{1}(\omega )} .

If we alter the definition even more, to allow nothing except psi, we get ps ( 0 ) = 1 {\displaystyle \psi (0)=1} , ps ( ps ( 0 ) ) = 2 {\displaystyle \psi (\psi (0))=2} , and so on until ps ( o ) = o + 1 {\displaystyle \psi (\omega )=\omega +1} , ps ( ps ( o ) ) = o + 2 {\displaystyle \psi (\psi (\omega ))=\omega +2} , and ps ( O ) = o 2 {\displaystyle \psi (\Omega )=\omega 2} , at which point we can go no further since we cannot do anything with the O {\displaystyle \Omega } 's. So the range of this system is only o 2 {\displaystyle \omega 2} .

In both cases, we find that the limitation on the weakened ps {\displaystyle \psi } function comes not so much from the operations allowed on the countable ordinals as on the uncountable ordinals we allow ourselves to denote.

Going beyond the Bachmann-Howard ordinal

[edit]

We know that ps ( e O + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1})} is the Bachmann-Howard ordinal. The reason why ps ( e O + 1 + 1 ) {\displaystyle \psi (\varepsilon _{\Omega +1}+1)} is no larger, with our definitions, is that there is no notation for e O + 1 {\displaystyle \varepsilon _{\Omega +1}} (it does not belong to C ( a ) {\displaystyle C(\alpha )} for any a {\displaystyle \alpha } , it is always the least upper bound of it). One could try to add the e {\displaystyle \varepsilon } function (or the Veblen functions of so-many-variables) to the allowed primitives beyond addition, multiplication and exponentiation, but that does not get us very far. To create more systematic notations for countable ordinals, we need more systematic notations for uncountable ordinals: we cannot use the ps {\displaystyle \psi } function itself because it only yields countable ordinals (e.g., ps ( O + 1 ) {\displaystyle \psi (\Omega +1)} is, e ph 2 ( 0 ) + 1 {\displaystyle \varepsilon _{\varphi _{2}(0)+1}} , certainly not e O + 1 {\displaystyle \varepsilon _{\Omega +1}} ), so the idea is to mimic its definition as follows:

Let ps 1 ( a ) {\displaystyle \psi _{1}(\alpha )} be the smallest ordinal that cannot be expressed from all countable ordinals and O 2 {\displaystyle \Omega _{2}} using sums, products, exponentials, and the ps 1 {\displaystyle \psi _{1}} function itself (to previously constructed ordinals less than a {\displaystyle \alpha } ).

Here, O 2 {\displaystyle \Omega _{2}} is a new ordinal guaranteed to be greater than all the ordinals which will be constructed using ps 1 {\displaystyle \psi _{1}} : again, letting O = o 1 {\displaystyle \Omega =\omega _{1}} and O 2 = o 2 {\displaystyle \Omega _{2}=\omega _{2}} works.

For example, ps 1 ( 0 ) = O {\displaystyle \psi _{1}(0)=\Omega } , and more generally ps 1 ( a ) = e O + a {\displaystyle \psi _{1}(\alpha )=\varepsilon _{\Omega +\alpha }} for all countable ordinals and even beyond ( ps 1 ( O ) = ps 1 ( ps 1 ( 0 ) ) = e O 2 {\displaystyle \psi _{1}(\Omega )=\psi _{1}(\psi _{1}(0))=\varepsilon _{\Omega 2}} and ps 1 ( ps 1 ( 1 ) ) = e e O + 1 {\displaystyle \psi _{1}(\psi _{1}(1))=\varepsilon _{\varepsilon _{\Omega +1}}} ): this holds up to the first fixed point z O + 1 {\displaystyle \zeta _{\Omega +1}} of the function x - e x {\displaystyle \xi \mapsto \varepsilon _{\xi }} beyond O {\displaystyle \Omega } , which is the limit of ps 1 ( 0 ) {\displaystyle \psi _{1}(0)} , ps 1 ( ps 1 ( 0 ) ) {\displaystyle \psi _{1}(\psi _{1}(0))} and so forth. Beyond this, we have ps 1 ( a ) = z O + 1 {\displaystyle \psi _{1}(\alpha )=\zeta _{\Omega +1}} and this remains true until O 2 {\displaystyle \Omega _{2}} : exactly as was the case for ps ( O ) {\displaystyle \psi (\Omega )} , we have ps 1 ( O 2 ) = z O + 1 {\displaystyle \psi _{1}(\Omega _{2})=\zeta _{\Omega +1}} and ps 1 ( O 2 + 1 ) = e z O + 1 + 1 {\displaystyle \psi _{1}(\Omega _{2}+1)=\varepsilon _{\zeta _{\Omega +1}+1}} .

The ps 1 {\displaystyle \psi _{1}} function gives us a system of notations (assuming we can somehow write down all countable ordinals!) for the uncountable ordinals below ps 1 ( e O 2 + 1 ) {\displaystyle \psi _{1}(\varepsilon _{\Omega _{2}+1})} , which is the limit of ps 1 ( O 2 ) {\displaystyle \psi _{1}(\Omega _{2})} , ps 1 ( O 2 O 2 ) {\displaystyle \psi _{1}({\Omega _{2}}^{\Omega _{2}})} and so forth.

Now we can reinject these notations in the original ps {\displaystyle \psi } function, modified as follows:

ps ( a ) {\displaystyle \psi (\alpha )} is the smallest ordinal that cannot be expressed from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } and O 2 {\displaystyle \Omega _{2}} using sums, products, exponentials, the ps 1 {\displaystyle \psi _{1}} function, and the ps {\displaystyle \psi } function itself (to previously constructed ordinals less than a {\displaystyle \alpha } ).

This modified function ps {\displaystyle \psi } coincides with the previous one up to (and including) ps ( ps 1 ( 1 ) ) {\displaystyle \psi (\psi _{1}(1))} -- which is the Bachmann-Howard ordinal. But now we can get beyond this, and ps ( ps 1 ( 1 ) + 1 ) {\displaystyle \psi (\psi _{1}(1)+1)} is e ps ( ps 1 ( 1 ) ) + 1 {\displaystyle \varepsilon _{\psi (\psi _{1}(1))+1}} (the next e {\displaystyle \varepsilon } -number after the Bachmann-Howard ordinal). We have made our system doubly impredicative: to create notations for countable ordinals we use notations for certain ordinals between O {\displaystyle \Omega } and O 2 {\displaystyle \Omega _{2}} which are themselves defined using certain ordinals beyond O 2 {\displaystyle \Omega _{2}} .


A variation on this scheme, which makes little difference when using just two (or finitely many) collapsing functions, but becomes important for infinitely many of them, is to define

ps ( a ) {\displaystyle \psi (\alpha )} is the smallest ordinal that cannot be expressed from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } and O 2 {\displaystyle \Omega _{2}} using sums, products, exponentials, and the ps 1 {\displaystyle \psi _{1}} and ps {\displaystyle \psi } function (to previously constructed ordinals less than a {\displaystyle \alpha } ).

i.e., allow the use of ps 1 {\displaystyle \psi _{1}} only for arguments less than a {\displaystyle \alpha } itself. With this definition, we must write ps ( O 2 ) {\displaystyle \psi (\Omega _{2})} instead of ps ( ps 1 ( O 2 ) ) {\displaystyle \psi (\psi _{1}(\Omega _{2}))} (although it is still also equal to ps ( ps 1 ( O 2 ) ) = ps ( z O + 1 ) {\displaystyle \psi (\psi _{1}(\Omega _{2}))=\psi (\zeta _{\Omega +1})} , of course, but it is now constant until O 2 {\displaystyle \Omega _{2}} ). This change is inessential because, intuitively speaking, the ps 1 {\displaystyle \psi _{1}} function collapses the nameable ordinals beyond O 2 {\displaystyle \Omega _{2}} below the latter so it matters little whether ps {\displaystyle \psi } is invoked directly on the ordinals beyond O 2 {\displaystyle \Omega _{2}} or on their image by ps 1 {\displaystyle \psi _{1}} . But it makes it possible to define ps {\displaystyle \psi } and ps 1 {\displaystyle \psi _{1}} by simultaneous (rather than "downward") induction, and this is important if we are to use infinitely many collapsing functions.

Indeed, there is no reason to stop at two levels: using o + 1 {\displaystyle \omega +1} new cardinals in this way, O 1 , O 2 , ... , O o {\displaystyle \Omega _{1},\Omega _{2},\ldots ,\Omega _{\omega }} , we get a system essentially equivalent to that introduced by Buchholz,[3] the inessential difference being that since Buchholz uses o + 1 {\displaystyle \omega +1} ordinals from the start, he does not need to allow multiplication or exponentiation; also, Buchholz does not introduce the numbers 1 {\displaystyle 1} or o {\displaystyle \omega } in the system as they will also be produced by the ps {\displaystyle \psi } functions: this makes the entire scheme much more elegant and more concise to define, albeit more difficult to understand. This system is also sensibly equivalent to the earlier (and much more difficult to grasp) "ordinal diagrams" of Takeuti[5] and th {\displaystyle \theta } functions of Feferman: their range is the same ( ps 0 ( e O o + 1 ) {\displaystyle \psi _{0}(\varepsilon _{\Omega _{\omega }+1})} , which could be called the Takeuti-Feferman-Buchholz ordinal, and which describes the strength of P 1 1 {\displaystyle \Pi _{1}^{1}} -comprehension plus bar induction).


A "normal" variant

[edit]

Most definitions of ordinal collapsing functions found in the recent literature differ from the ones we have given in one technical but important way which makes them technically more convenient although intuitively less transparent. We now explain this.

The following definition (by induction on a {\displaystyle \alpha } ) is completely equivalent to that of the function ps {\displaystyle \psi } above:

Let C ( a , b ) {\displaystyle C(\alpha ,\beta )} be the set of ordinals generated starting from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } and all ordinals less than b {\displaystyle \beta } by recursively applying the following functions: ordinal addition, multiplication and exponentiation, and the function ps | a {\displaystyle \psi {\upharpoonright _{\alpha }}} . Then ps ( a ) {\displaystyle \psi (\alpha )} is defined as the smallest ordinal r {\displaystyle \rho } such that C ( a , r ) O = r {\displaystyle C(\alpha ,\rho )\cap \Omega =\rho } .

(This is equivalent, because if s {\displaystyle \sigma } is the smallest ordinal not in C ( a , 0 ) {\displaystyle C(\alpha ,0)} , which is how we originally defined ps ( a ) {\displaystyle \psi (\alpha )} , then it is also the smallest ordinal not in C ( a , 0 ) = C ( a , s ) {\displaystyle C(\alpha ,0)=C(\alpha ,\sigma )} , and furthermore the properties we described of ps {\displaystyle \psi } imply that no ordinal between s {\displaystyle \sigma } inclusive and O {\displaystyle \Omega } exclusive belongs to C ( a , s ) {\displaystyle C(\alpha ,\sigma )} .)

We can now make a change to the definition which makes it subtly different:

Let C ~ ( a , b ) {\displaystyle {\tilde {C}}(\alpha ,\beta )} be the set of ordinals generated starting from 0 {\displaystyle 0} , 1 {\displaystyle 1} , o {\displaystyle \omega } , O {\displaystyle \Omega } and all ordinals less than b {\displaystyle \beta } by recursively applying the following functions: ordinal addition, multiplication and exponentiation, and the function ps ~ | a {\displaystyle {\tilde {\psi }}{\upharpoonright _{\alpha }}} . Then ps ~ ( a ) {\displaystyle {\tilde {\psi }}(\alpha )} is defined as the smallest ordinal r {\displaystyle \rho } such that C ~ ( a , r ) O = r {\displaystyle {\tilde {C}}(\alpha ,\rho )\cap \Omega =\rho } and a C ~ ( a , r ) {\displaystyle \alpha \in {\tilde {C}}(\alpha ,\rho )} .

The first values of ps ~ {\displaystyle {\tilde {\psi }}} coincide with those of ps {\displaystyle \psi } : namely, for all a < z 0 {\displaystyle \alpha <\zeta _{0}} where z 0 = ph 2 ( 0 ) {\displaystyle \zeta _{0}=\varphi _{2}(0)} , we have ps ~ ( a ) = ps ( a ) {\displaystyle {\tilde {\psi }}(\alpha )=\psi (\alpha )} because the additional clause a C ~ ( a , r ) {\displaystyle \alpha \in {\tilde {C}}(\alpha ,\rho )} is always satisfied. But at this point the functions start to differ: while the function ps {\displaystyle \psi } gets "stuck" at z 0 {\displaystyle \zeta _{0}} for all z 0 <= a <= O {\displaystyle \zeta _{0}\leq \alpha \leq \Omega } , the function ps ~ {\displaystyle {\tilde {\psi }}} satisfies ps ~ ( z 0 ) = e z 0 + 1 {\displaystyle {\tilde {\psi }}(\zeta _{0})=\varepsilon _{\zeta _{0}+1}} because the new condition a C ~ ( a , r ) {\displaystyle \alpha \in {\tilde {C}}(\alpha ,\rho )} imposes ps ~ ( z 0 ) > z 0 {\displaystyle {\tilde {\psi }}(\zeta _{0})>\zeta _{0}} . On the other hand, we still have ps ~ ( O ) = z 0 {\displaystyle {\tilde {\psi }}(\Omega )=\zeta _{0}} (because O C ( a , r ) {\displaystyle \Omega \in C(\alpha ,\rho )} for all r {\displaystyle \rho } so the extra condition does not come in play). Note in particular that ps ~ {\displaystyle {\tilde {\psi }}} , unlike ps {\displaystyle \psi } , is not monotonic, nor is it continuous.

Despite these changes, the ps ~ {\displaystyle {\tilde {\psi }}} function also defines a system of ordinal notations up to the Bachmann-Howard ordinal: the notations, and the conditions for canonicity, are slightly different (for example, ps ( O + 1 + a ) = ps ~ ( ps ~ ( O ) + a ) {\displaystyle \psi (\Omega +1+\alpha )={\tilde {\psi }}({\tilde {\psi }}(\Omega )+\alpha )} for all a {\displaystyle \alpha } less than the common value ps ( O 2 ) = ps ~ ( O + 1 ) {\displaystyle \psi (\Omega 2)={\tilde {\psi }}(\Omega +1)} ).

Other similar ordinal collapsing functions

[edit]

Arai's ps

[edit]

Arai's ps function is an ordinal collapsing function introduced by Toshiyasu Arai (husband of Noriko H. Arai) in his paper: A simplified ordinal analysis of first-order reflection. ps O ( a ) {\displaystyle \psi _{\Omega }(\alpha )} is a collapsing function such that ps O ( a ) < O {\displaystyle \psi _{\Omega }(\alpha )<\Omega } , where O {\displaystyle \Omega } represents the first uncountable ordinal (it can be replaced by the Church-Kleene ordinal at the cost of extra technical difficulty). Throughout the course of this article, K P P N {\displaystyle {\mathsf {KP\Pi _{N}}}} represents Kripke-Platek set theory for a P N {\displaystyle {\mathsf {\Pi _{N}}}} -reflecting universe, K N {\displaystyle \mathbb {K} _{N}} is the least P N - 2 1 {\displaystyle {\mathsf {\Pi }}_{N-2}^{1}} -indescribable cardinal (it may be replaced with the least P N {\displaystyle {\mathsf {\Pi }}_{N}} -reflecting ordinal at the cost of extra technical difficulty), N {\displaystyle N} is a fixed natural number >= 3 {\displaystyle \geq 3} , and O 0 = 0 {\displaystyle \Omega _{0}=0} .

Suppose K P P N th {\displaystyle {\mathsf {KP\Pi _{N}}}\vdash \theta } for a S 1 {\displaystyle {\mathsf {\Sigma _{1}}}} ( O {\displaystyle \Omega } )-sentence th {\displaystyle {\mathsf {\theta }}} . Then, there exists a finite n {\displaystyle n} such that for a = ps O ( o n ( K N + 1 ) ) {\displaystyle \alpha =\psi _{\Omega }(\omega _{n}(\mathbb {K} _{N}+1))} , L a th {\displaystyle L_{\alpha }\models \theta } . It can also be proven that K P P N {\displaystyle {\mathsf {KP\Pi _{N}}}} proves that each initial segment { a O T : a < ps O ( o n ( K N + 1 ) ) } ; n = 1 , 2 , ... {\displaystyle \{\alpha \in OT:\alpha <\psi _{\Omega }(\omega _{n}(\mathbb {K} _{N}+1))\};n=1,2,\ldots } is well-founded, and therefore, ps O ( e K N + 1 ) {\displaystyle \psi _{\Omega }(\varepsilon _{\mathbb {K} _{N}+1})} is the proof-theoretic ordinal of K P P N {\displaystyle {\mathsf {KP\Pi _{N}}}} . One can then make the following conversions:

  • ps O ( e O + 1 ) = | K P o | = B H O {\displaystyle \psi _{\Omega }(\varepsilon _{\Omega +1})=|{\mathsf {KP\omega }}|={\mathsf {BHO}}} , where O {\displaystyle \Omega } is either the least recursively regular ordinal or the least uncountable cardinal, K P o {\displaystyle {\mathsf {KP\omega }}} is Kripke-Platek set theory with infinity and B H O {\displaystyle {\mathsf {BHO}}} is the Bachmann-Howard ordinal.
  • ps O ( O o ) = | P 1 1 - C A 0 | = B O {\displaystyle \psi _{\Omega }(\Omega _{\omega })=|{\mathsf {\Pi _{1}^{1}-CA_{0}}}|={\mathsf {BO}}} , where O o {\displaystyle \Omega _{\omega }} is either the least limit of admissible ordinals or the least limit of infinite cardinals and B O {\displaystyle {\mathsf {BO}}} is Buchholz's ordinal.
  • ps O ( e O o + 1 ) = | K P l | = T F B O {\displaystyle \psi _{\Omega }(\varepsilon _{\Omega _{\omega }+1})=|{\mathsf {KPl}}|={\mathsf {TFBO}}} , where O o {\displaystyle \Omega _{\omega }} is either the least limit of admissible ordinals or the least limit of infinite cardinals, K P l {\displaystyle {\mathsf {KPl}}} is KPi without the collection scheme and T F B O {\displaystyle {\mathsf {TFBO}}} is the Takeuti-Feferman-Buchholz ordinal.
  • ps O ( e I + 1 ) = | K P i | {\displaystyle \psi _{\Omega }(\varepsilon _{I+1})=|{\mathsf {KPi}}|} , where I {\displaystyle I} is either the least recursively inaccessible ordinal or the least weakly inaccessible cardinal and K P i {\displaystyle {\mathsf {KPi}}} is Kripke-Platek set theory with a recursively inaccessible universe.

Bachmann's ps

[edit]

The first true ordinal collapsing function, Bachmann's ps {\displaystyle \psi } was invented by Heinz Bachmann, somewhat cumbersome as it depends on fundamental sequences for all limit ordinals; and the original definition is complicated. Michael Rathjen has suggested a "recast" of the system, which goes like so:

  • Let O {\displaystyle \Omega } represent an uncountable ordinal such as o 1 {\displaystyle \omega _{1}} ;
  • Then define C O ( a , b ) {\displaystyle C^{\Omega }(\alpha ,\beta )} as the closure of b { 0 , O } {\displaystyle \beta \cup \{0,\Omega \}} under addition, ( x - o x ) {\displaystyle (\xi \rightarrow \omega ^{\xi })} and ( x - ps O ( x ) ) {\displaystyle (\xi \rightarrow \psi _{\Omega }(\xi ))} for x < a {\displaystyle \xi <\alpha } .
  • ps O ( a ) {\displaystyle \psi _{\Omega }(\alpha )} is the smallest countable ordinal r such that C O ( a , r ) O = r {\displaystyle C^{\Omega }(\alpha ,\rho )\cap \Omega =\rho }

ps O ( e O + 1 ) {\displaystyle \psi _{\Omega }(\varepsilon _{\Omega +1})} is the Bachmann-Howard ordinal, the proof-theoretic ordinal of Kripke-Platek set theory with the axiom of infinity (KP).

Buchholz's ps

[edit]

Buchholz's ps {\displaystyle \psi } is a hierarchy of single-argument functions ps n : O n - O n {\displaystyle \psi _{\nu }:{\mathsf {On}}\rightarrow {\mathsf {On}}} , with ps n ( a ) {\displaystyle \psi _{\nu }(\alpha )} occasionally abbreviated as ps n a {\displaystyle \psi _{\nu }\alpha } . This function is likely the most well known out of all ordinal collapsing functions. The definition is so:

  • Define O 0 = 1 {\displaystyle \Omega _{0}=1} and O n = n {\displaystyle \Omega _{\nu }=\aleph _{\nu }} for n > 0 {\displaystyle \nu >0} .
  • Let P ( a ) {\displaystyle P(\alpha )} be the set of distinct terms in the Cantor normal form of a {\displaystyle \alpha } (with each term of the form o x {\displaystyle \omega ^{\xi }} for x O n {\displaystyle \xi \in {\mathsf {On}}} , see Cantor normal form theorem)
  • C n 0 ( a ) = O n {\displaystyle C_{\nu }^{0}(\alpha )=\Omega _{\nu }}
  • C n n + 1 ( a ) = C n n ( a ) { g | P ( g ) C n n ( a ) } { ps n ( x ) | x a C n n ( a ) x C u ( x ) u <= o } {\displaystyle C_{\nu }^{n+1}(\alpha )=C_{\nu }^{n}(\alpha )\cup \{\gamma \mid P(\gamma )\subseteq C_{\nu }^{n}(\alpha )\}\cup \{\psi _{\nu }(\xi )\mid \xi \in \alpha \cap C_{\nu }^{n}(\alpha )\land \xi \in C_{u}(\xi )\land u\leq \omega \}}
  • C n ( a ) = n < o C n n ( a ) {\displaystyle C_{\nu }(\alpha )=\bigcup \limits _{n<\omega }C_{\nu }^{n}(\alpha )}
  • ps n ( a ) = min ( { g | g C n ( a ) } ) {\displaystyle \psi _{\nu }(\alpha )=\min(\{\gamma \mid \gamma \notin C_{\nu }(\alpha )\})}

The limit of this system is ps 0 ( e O o + 1 ) {\displaystyle \psi _{0}(\varepsilon _{\Omega _{\omega }+1})} , the Takeuti-Feferman-Buchholz ordinal.

Extended Buchholz's ps

[edit]

This ordinal collapsing function is a sophisticated extension of Buchholz's ps {\displaystyle \psi } by mathematician Denis Maksudov. The limit of this system, sometimes called the Extended Buchholz Ordinal, is much greater, equal to ps 0 ( O O O ) {\displaystyle \psi _{0}(\Omega _{\Omega _{\Omega _{\cdots }}})} where O O O . . . {\displaystyle \Omega _{\Omega _{\Omega _{...}}}} denotes the first omega fixed point. The function is defined as follows:

  • Define O 0 = 1 {\displaystyle \Omega _{0}=1} and O n = n {\displaystyle \Omega _{\nu }=\aleph _{\nu }} for n > 0 {\displaystyle \nu >0} .
  • C n 0 ( a ) = { b | b < O n } {\displaystyle C_{\nu }^{0}(\alpha )=\{\beta \mid \beta <\Omega _{\nu }\}}
  • C n n + 1 ( a ) = { b + g , ps m ( e ) | m , b , g , e C n n ( a ) e < a } {\displaystyle C_{\nu }^{n+1}(\alpha )=\{\beta +\gamma ,\psi _{\mu }(\eta )\mid \mu ,\beta ,\gamma ,\eta \in C_{\nu }^{n}(\alpha )\land \eta <\alpha \}}
  • C n ( a ) = n < o C n n ( a ) {\displaystyle C_{\nu }(\alpha )=\bigcup \limits _{n<\omega }C_{\nu }^{n}(\alpha )}
  • ps n ( a ) = min ( { g | g C n ( a ) } ) {\displaystyle \psi _{\nu }(\alpha )=\min(\{\gamma \mid \gamma \notin C_{\nu }(\alpha )\})}

Madore's ps

[edit]

This ordinal collapsing function was the same as the ps function previously used throughout this article; it is a simpler, more efficient version of Buchholz's ps function defined by David Madore. Its use in this article lead to widespread use of the function.

  • C 0 ( a ) = { 0 , 1 , o , O } {\displaystyle C_{0}(\alpha )=\{0,1,\omega ,\Omega \}}
  • C n + 1 ( a ) = { g + d , g d , g d , ps ( e ) | g , d , e C n ( a ) ; e < a } {\displaystyle C_{n+1}(\alpha )=\{\gamma +\delta ,\gamma \delta ,\gamma ^{\delta },\psi (\eta )\mid \gamma ,\delta ,\eta \in C_{n}(\alpha );\eta <\alpha \}}
  • C ( a ) = n < o C n ( a ) {\displaystyle C(\alpha )=\bigcup \limits _{n<\omega }C_{n}(\alpha )}
  • ps ( a ) = min ( { b O | b C ( a ) } ) {\displaystyle \psi (\alpha )=\min(\{\beta \in \Omega \mid \beta \notin C(\alpha )\})}

This function was used by Chris Bird, who also invented the next ordinal collapsing function.

Bird's th

[edit]

Chris Bird devised the following shorthand for the extended Veblen function ph {\displaystyle \varphi } :

  • th ( O n - 1 a n - 1 + + O 2 a 2 + O a 1 + a 0 , b ) = ph ( a n - 1 , ... , a 2 , a 1 , a 0 , b ) {\displaystyle \theta (\Omega ^{n-1}a_{n-1}+\cdots +\Omega ^{2}a_{2}+\Omega a_{1}+a_{0},b)=\varphi (a_{n-1},\ldots ,a_{2},a_{1},a_{0},b)}
  • th ( a , 0 ) {\displaystyle \theta (\alpha ,0)} is abbreviated th ( a ) {\displaystyle \theta (\alpha )}

This function is only defined for arguments less than O o {\displaystyle \Omega ^{\omega }} , and its outputs are limited by the small Veblen ordinal.

Jager's ps

[edit]

Jager's ps is a hierarchy of single-argument ordinal functions psk indexed by uncountable regular cardinals k smaller than the least weakly Mahlo cardinal M0 introduced by German mathematician Gerhard Jager in 1984. It was developed on the base of Buchholz's approach.

  • If k = I a ( 0 ) {\displaystyle \kappa =I_{\alpha }(0)} for some a < k, k - = 0 {\displaystyle \kappa ^{-}=0} .
  • If k = I a ( b + 1 ) {\displaystyle \kappa =I_{\alpha }(\beta +1)} for some a, b < k, k - = I a ( b ) {\displaystyle \kappa ^{-}=I_{\alpha }(\beta )} .
  • C k 0 ( a ) = { k - } k - {\displaystyle C_{\kappa }^{0}(\alpha )=\{\kappa ^{-}\}\cup \kappa ^{-}}
  • For every finite n, C k n + 1 ( a ) M 0 {\displaystyle C_{\kappa }^{n+1}(\alpha )\subset M_{0}} is the smallest set satisfying the following:
    • The sum of any finitely many ordinals in C k n ( a ) M 0 {\displaystyle C_{\kappa }^{n}(\alpha )\subset M_{0}} belongs to C k n + 1 ( a ) M 0 {\displaystyle C_{\kappa }^{n+1}(\alpha )\subset M_{0}} .
    • For any b , g C k n ( a ) {\displaystyle \beta ,\gamma \in C_{\kappa }^{n}(\alpha )} , ph b ( g ) C k n + 1 ( a ) {\displaystyle \varphi _{\beta }(\gamma )\in C_{\kappa }^{n+1}(\alpha )} .
    • For any b , g C k n ( a ) {\displaystyle \beta ,\gamma \in C_{\kappa }^{n}(\alpha )} , I b ( g ) C k n + 1 ( a ) {\displaystyle I_{\beta }(\gamma )\in C_{\kappa }^{n+1}(\alpha )} .
    • For any ordinal g and uncountable regular cardinal p C k n ( a ) {\displaystyle \pi \in C_{\kappa }^{n}(\alpha )} , g < p < k = g C k n + 1 ( a ) {\displaystyle \gamma <\pi <\kappa \Rightarrow \gamma \in C_{\kappa }^{n+1}(\alpha )} .
    • For any g a C k n ( a ) {\displaystyle \gamma \in \alpha \cap C_{\kappa }^{n}(\alpha )} and uncountable regular cardinal p C k n ( a ) {\displaystyle \pi \in C_{\kappa }^{n}(\alpha )} , g C p ( g ) = ps p ( g ) C k n + 1 ( a ) {\displaystyle \gamma \in C_{\pi }(\gamma )\Rightarrow \psi _{\pi }(\gamma )\in C_{\kappa }^{n+1}(\alpha )} .
  • C k ( a ) = n < o C k n ( a ) {\displaystyle C_{\kappa }(\alpha )=\bigcup \limits _{n<\omega }C_{\kappa }^{n}(\alpha )}
  • ps k ( a ) = min ( { x k | x C k ( a ) } ) {\displaystyle \psi _{\kappa }(\alpha )=\min(\{\xi \in \kappa \mid \xi \notin C_{\kappa }(\alpha )\})}

Simplified Jager's ps

[edit]

This is a sophisticated simplification of Jager's ps created by Denis Maksudov. An ordinal is a-weakly inaccessible if it is uncountable, regular and it is a limit of g-weakly inaccessible cardinals for g < a. Let I(a, 0) be the first a-weakly inaccessible cardinal, I(a, b + 1) be the first a-weakly inaccessible cardinal after I(a, b) and I(a, b) = s u p ( { I ( a , g ) | g < b } ) {\displaystyle sup(\{I(\alpha ,\gamma )\mid \gamma <\beta \})} for limit b. Restrict r and p to uncountable regular ordinals of the form I(a, 0) or I(a, b + 1). Then,

  • C 0 ( a , b ) = b { 0 } {\displaystyle C_{0}(\alpha ,\beta )=\beta \cup \{0\}}
  • C n + 1 ( a , b ) = { g + d | g , d C n ( a , b ) } { I ( g , d ) | g , d C n ( a , b ) } { ps p ( g ) | p , g , C n ( a , b ) g < a } {\displaystyle C_{n+1}(\alpha ,\beta )=\{\gamma +\delta \mid \gamma ,\delta \in C_{n}(\alpha ,\beta )\}\cup \{I(\gamma ,\delta )\mid \gamma ,\delta \in C_{n}(\alpha ,\beta )\}\cup \{\psi _{\pi }(\gamma )\mid \pi ,\gamma ,\in C_{n}(\alpha ,\beta )\land \gamma <\alpha \}}
  • C ( a , b ) = n < o C n ( a , b ) {\displaystyle C(\alpha ,\beta )=\bigcup \limits _{n<\omega }C_{n}(\alpha ,\beta )}
  • ps p ( a ) = min ( { b < p | C ( a , b ) p b } ) {\displaystyle \psi _{\pi }(\alpha )=\min(\{\beta <\pi \mid C(\alpha ,\beta )\cap \pi \subseteq \beta \})}

Rathjen's Ps

[edit]

Rathjen's Ps function is based on the least weakly compact cardinal to create large countable ordinals. For a weakly compact cardinal K, the functions M a {\displaystyle M^{\alpha }} , C ( a , p ) {\displaystyle C(\alpha ,\pi )} , Ks ( a ) {\displaystyle \Xi (\alpha )} , and Ps p x ( a ) {\displaystyle \Psi _{\pi }^{\xi }(\alpha )} are defined in mutual recursion in the following way:

  • M0 = K L i m {\displaystyle K\cap {\mathsf {Lim}}} , where Lim denotes the class of limit ordinals.
  • For a > 0, Ma is the set { p < K | C ( a , p ) K = p x C ( a , p ) a , M x {\displaystyle \{\pi is stationary in p a C ( a , p ) } {\displaystyle \pi \land \alpha \in C(\alpha ,\pi )\}}
  • C ( a , b ) {\displaystyle C(\alpha ,\beta )} is the closure of b { 0 , K } {\displaystyle \beta \cup \{0,K\}} under addition, ( x , e ) - ph ( x , e ) {\displaystyle (\xi ,\eta )\rightarrow \varphi (\xi ,\eta )} , x - O x {\displaystyle \xi \rightarrow \Omega _{\xi }} given x < K, x - Ks ( x ) {\displaystyle \xi \rightarrow \Xi (\xi )} given x < a, and ( x , p , d ) - Ps p x ( d ) {\displaystyle (\xi ,\pi ,\delta )\rightarrow \Psi _{\pi }^{\xi }(\delta )} given x <= d < a {\displaystyle \xi \leq \delta <\alpha } .
  • Ks ( a ) = min ( M a { K } ) {\displaystyle \Xi (\alpha )=\min(M^{\alpha }\cup \{K\})} .
  • For x <= a {\displaystyle \xi \leq \alpha } , Ps p x ( a ) = min ( { r M x p : C ( a , r ) p = r p , a C ( a , r ) } { p } ) {\displaystyle \Psi _{\pi }^{\xi }(\alpha )=\min(\{\rho \in M^{\xi }\cap \pi :C(\alpha ,\rho )\cap \pi =\rho \land \pi ,\alpha \in C(\alpha ,\rho )\}\cup \{\pi \})} .

Collapsing large cardinals

[edit]

As noted in the introduction, the use and definition of ordinal collapsing functions is strongly connected with the theory of ordinal analysis, so the collapse of this or that large cardinal must be mentioned simultaneously with the theory for which it provides a proof-theoretic analysis.

  • Gerhard Jager and Wolfram Pohlers[6] described the collapse of an inaccessible cardinal to describe the ordinal-theoretic strength of Kripke-Platek set theory augmented by the recursive inaccessibility of the class of ordinals (KPi), which is also proof-theoretically equivalent[1] to D 2 1 {\displaystyle \Delta _{2}^{1}} -comprehension plus bar induction. Roughly speaking, this collapse can be obtained by adding the a - O a {\displaystyle \alpha \mapsto \Omega _{\alpha }} function itself to the list of constructions to which the C ( ) {\displaystyle C(\cdot )} collapsing system applies.
  • Michael Rathjen[7] then described the collapse of a Mahlo cardinal to describe the ordinal-theoretic strength of Kripke-Platek set theory augmented by the recursive Mahloness of the class of ordinals (KPM).
  • Rathjen[8] later described the collapse of a weakly compact cardinal to describe the ordinal-theoretic strength of Kripke-Platek set theory augmented by certain reflection principles (concentrating on the case of P 3 {\displaystyle \Pi _{3}} -reflection). Very roughly speaking, this proceeds by introducing the first cardinal Ks ( a ) {\displaystyle \Xi (\alpha )} which is a {\displaystyle \alpha } -hyper-Mahlo and adding the a - Ks ( a ) {\displaystyle \alpha \mapsto \Xi (\alpha )} function itself to the collapsing system.
  • In a 2015 paper, Toshiyasu Arai has created ordinal collapsing functions ps p x - {\displaystyle \psi _{\pi }^{\vec {\xi }}} for a vector of ordinals x {\displaystyle \xi } , which collapse P n 1 {\displaystyle \Pi _{n}^{1}} -indescribable cardinals for n > 0 {\displaystyle n>0} . These are used to carry out ordinal analysis of Kripke-Platek set theory augmented by P n + 2 {\displaystyle \Pi _{n+2}} -reflection principles. [9]
  • Rathjen has investigated the collapse of yet larger cardinals, with the ultimate goal of achieving an ordinal analysis of P 2 1 {\displaystyle \Pi _{2}^{1}} -comprehension (which is proof-theoretically equivalent to the augmentation of Kripke-Platek by S 1 {\displaystyle \Sigma _{1}} -separation).[10]

Notes

[edit]
  1. ^ a b Rathjen, 1995 (Bull. Symbolic Logic)
  2. ^ Kahle, 2002 (Synthese)
  3. ^ a b Buchholz, 1986 (Ann. Pure Appl. Logic)
  4. ^ Rathjen, 2005 (Fischbachau slides)
  5. ^ Takeuti, 1967 (Ann. Math.)
  6. ^ Jager & Pohlers, 1983 (Bayer. Akad. Wiss. Math.-Natur. Kl. Sitzungsber.)
  7. ^ Rathjen, 1991 (Arch. Math. Logic)
  8. ^ Rathjen, 1994 (Ann. Pure Appl. Logic)
  9. ^ T. Arai, A simplified analysis of first-order reflection (2015).
  10. ^ Rathjen, 2005 (Arch. Math. Logic)

References

[edit]