Why are the axiom of specification is an axiom schema? Why not just a single axiom? The Next CEO of Stack OverflowHow can I write the Axiom of Specification as a sentence?What properties are allowed in comprehension axiom of ZFC?Axiom schema of specification - Existence of intersection and set differenceAxiom schema of specification (formula arguments)Universal Specification axiomWhat are the subsets of an infinite set?Axiom of Power SetWhy/When we need the axiom schema of replacement?Is the definability axiom schema consistent with ZF?Finite axiomatization of second-order NBG

How can the PCs determine if an item is a phylactery?

Compensation for working overtime on Saturdays

Physiological effects of huge anime eyes

Finitely generated matrix groups whose eigenvalues are all algebraic

Find the majority element, which appears more than half the time

Small nick on power cord from an electric alarm clock, and copper wiring exposed but intact

A hang glider, sudden unexpected lift to 25,000 feet altitude, what could do this?

Is a linearly independent set whose span is dense a Schauder basis?

Is there a rule of thumb for determining the amount one should accept for of a settlement offer?

Does int main() need a declaration on C++?

pgfplots: How to draw a tangent graph below two others?

Variance of Monte Carlo integration with importance sampling

Raspberry pi 3 B with Ubuntu 18.04 server arm64: what pi version

Planeswalker Ability and Death Timing

Simplify trigonometric expression using trigonometric identities

Strange use of "whether ... than ..." in official text

What happens if you break a law in another country outside of that country?

My boss doesn't want me to have a side project

Shortening a title without changing its meaning

My ex-girlfriend uses my Apple ID to login to her iPad, do I have to give her my Apple ID password to reset it?

Is a distribution that is normal, but highly skewed, considered Gaussian?

Noise during hard braking

Is it OK to decorate a log book cover?

Man transported from Alternate World into ours by a Neutrino Detector



Why are the axiom of specification is an axiom schema? Why not just a single axiom?



The Next CEO of Stack OverflowHow can I write the Axiom of Specification as a sentence?What properties are allowed in comprehension axiom of ZFC?Axiom schema of specification - Existence of intersection and set differenceAxiom schema of specification (formula arguments)Universal Specification axiomWhat are the subsets of an infinite set?Axiom of Power SetWhy/When we need the axiom schema of replacement?Is the definability axiom schema consistent with ZF?Finite axiomatization of second-order NBG










7












$begingroup$


So I was reading about the ZFC axioms, and apparently some of them are actually "axiom schemas." For example, there is the "axiom schema of specification," which basically says that give a set $A$ and a formula $phi(x)$, a subset of $A$ exists where all the elements satisfy $phi(x)$.



This is apparently not one axiom, but a schema of infinitely many axioms, because there is one axiom for every $phi(x)$. So that must mean that for whatever reason, just letting $phi(x)$ be an arbitrary formula does not make a valid axiom. So are there rules for what an axiom can say?



So my questions are: Why is this not allowed to be one axiom? What are the rules for what an axiom is allowed to be? And why?










share|cite|improve this question











$endgroup$







  • 1




    $begingroup$
    Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
    $endgroup$
    – Alex Kruckman
    2 days ago










  • $begingroup$
    @AlexKruckman Fair enough. I'll edit to fix that.
    $endgroup$
    – RothX
    2 days ago















7












$begingroup$


So I was reading about the ZFC axioms, and apparently some of them are actually "axiom schemas." For example, there is the "axiom schema of specification," which basically says that give a set $A$ and a formula $phi(x)$, a subset of $A$ exists where all the elements satisfy $phi(x)$.



This is apparently not one axiom, but a schema of infinitely many axioms, because there is one axiom for every $phi(x)$. So that must mean that for whatever reason, just letting $phi(x)$ be an arbitrary formula does not make a valid axiom. So are there rules for what an axiom can say?



So my questions are: Why is this not allowed to be one axiom? What are the rules for what an axiom is allowed to be? And why?










share|cite|improve this question











$endgroup$







  • 1




    $begingroup$
    Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
    $endgroup$
    – Alex Kruckman
    2 days ago










  • $begingroup$
    @AlexKruckman Fair enough. I'll edit to fix that.
    $endgroup$
    – RothX
    2 days ago













7












7








7


1



$begingroup$


So I was reading about the ZFC axioms, and apparently some of them are actually "axiom schemas." For example, there is the "axiom schema of specification," which basically says that give a set $A$ and a formula $phi(x)$, a subset of $A$ exists where all the elements satisfy $phi(x)$.



This is apparently not one axiom, but a schema of infinitely many axioms, because there is one axiom for every $phi(x)$. So that must mean that for whatever reason, just letting $phi(x)$ be an arbitrary formula does not make a valid axiom. So are there rules for what an axiom can say?



So my questions are: Why is this not allowed to be one axiom? What are the rules for what an axiom is allowed to be? And why?










share|cite|improve this question











$endgroup$




So I was reading about the ZFC axioms, and apparently some of them are actually "axiom schemas." For example, there is the "axiom schema of specification," which basically says that give a set $A$ and a formula $phi(x)$, a subset of $A$ exists where all the elements satisfy $phi(x)$.



This is apparently not one axiom, but a schema of infinitely many axioms, because there is one axiom for every $phi(x)$. So that must mean that for whatever reason, just letting $phi(x)$ be an arbitrary formula does not make a valid axiom. So are there rules for what an axiom can say?



So my questions are: Why is this not allowed to be one axiom? What are the rules for what an axiom is allowed to be? And why?







logic set-theory axioms






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited yesterday









Asaf Karagila

307k33440773




307k33440773










asked 2 days ago









RothXRothX

651713




651713







  • 1




    $begingroup$
    Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
    $endgroup$
    – Alex Kruckman
    2 days ago










  • $begingroup$
    @AlexKruckman Fair enough. I'll edit to fix that.
    $endgroup$
    – RothX
    2 days ago












  • 1




    $begingroup$
    Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
    $endgroup$
    – Alex Kruckman
    2 days ago










  • $begingroup$
    @AlexKruckman Fair enough. I'll edit to fix that.
    $endgroup$
    – RothX
    2 days ago







1




1




$begingroup$
Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
$endgroup$
– Alex Kruckman
2 days ago




$begingroup$
Not "a schema of infinite axioms", but "a schema of infinitely many axioms" (the first sounds like each axiom may be infinite).
$endgroup$
– Alex Kruckman
2 days ago












$begingroup$
@AlexKruckman Fair enough. I'll edit to fix that.
$endgroup$
– RothX
2 days ago




$begingroup$
@AlexKruckman Fair enough. I'll edit to fix that.
$endgroup$
– RothX
2 days ago










4 Answers
4






active

oldest

votes


















19












$begingroup$

This is just the choice of underlying logic. ZFC is a theory in first-order logic, and the strictures of that logical system rule out certain kinds of expressions. There are other logics, and their study comprises abstract model theory.



Very roughly, there are two competing hopes for a logical system:



  • It should be expressive: things we intuitively want to be able to say, should be say-able in the system.


  • It should be not too wild: e.g. there should be a well-behaved notion of proof.


It turns out that these are fundamentally in tension. For example, if we want proofs to be finite, then our logical system can't capture infinite structures up to isomorphism (this is the compactness theorem, essentially).



So why did we pick first-order logic after all, given that it forces us to use axiom schemata (and other inefficiencies)? Well, first-order logic seems to sit at a sweet spot here: it's fairly expressive, but also has a very well-behaved notion of proof and a more technical property called the "Lowenheim-Skolem property" which roughly says that it doesn't interact too much with set theory (indeed, it's the most expressive logic with these properties - this is due to Lindstrom).



This paper of Ferrairos may be of interest with regard to how first-order logic emerged as "the" primary logic of mathematics.






share|cite|improve this answer









$endgroup$




















    8












    $begingroup$

    In ZF, all expressions must ultimately be a syntactically valid, finite combination of variable names, the $forall$ quantifier, parentheses, the logical operations $lnot$ and $lor$, $=$ and finally $in$. That's it.



    Of course, in practice we have a lot of other symbols, like $subseteq$ and $exists$, but technically they are all defined as specific shorthands for combinations of the symbols above.



    There is no way to use these to say $forall phi(phitext is a formulatoldots)$, the way one might want to do to make the axiom schema into actual axioms.






    share|cite|improve this answer









    $endgroup$












    • $begingroup$
      I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
      $endgroup$
      – RothX
      2 days ago






    • 5




      $begingroup$
      +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
      $endgroup$
      – Alex Kruckman
      2 days ago






    • 2




      $begingroup$
      As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
      $endgroup$
      – Alex Kruckman
      2 days ago







    • 1




      $begingroup$
      (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
      $endgroup$
      – Alex Kruckman
      2 days ago


















    8












    $begingroup$

    Noah Schweber pointed out that there is a tension between expressiveness of a logic and having a nice proof theory. There is another tension, between expressiveness and inconsistency.



    More expressive logical systems were developed in the early 1930s by Church (a form of $lambda$ calculus) and separately by Curry (a form of combinatory logic, essentially a different kind of $lambda$ calculus). These logics were more expressive in the sense that they could refer to their own formulas more directly than in first-order logic, essentially by allowing variables to refer to terms or formulas.



    Unfortunately, both of these systems were shown to be inconsistent by Kleene and Rosser in a joint paper in 1935. (Church had already tried to modify his system to avoid inconsistency, but they showed his revised system was inconsistent as well as Curry's system of the time.) More information is available in the article "Paradoxes and Contemporary Logic" by Andrea Cantini and Riccardo Bruni in the Stanford Encyclopedia of Philosophy. (Recall that other, earlier logics systems that tried to be very strong, such as Russell's original system for Principia Mathematica, were also found to be inconsistent.)



    After the inconsistencies were found, Church and Curry both turned their attention to weaker systems, including the simply typed $lambda$ calculus developed by Church. The inconsistent systems slipped into history, but they are still important examples on the limits to what can be put into a logic.



    We now realize that there is a limit on how much a logic can refer to itself. Variations of Richard's paradox and Curry's paradox arise easily with too much self-reference. In a sense, first-order logic and theories such as Peano Arithmetic and ZFC stay just inside this limit. The result is that PA and ZFC are consistent but are subject to Gödel's incompleteness theorems. Adding just slightly more self-reference - which seems to be very hard to avoid in systems that can quantify over and manipulate their own formulas - tends to create systems that are inconsistent or where some terms are undefined or some formulas have undefined truth values. You can't have it all in a consistent logic.



    First-order logic avoids all of this by having no direct way for formulas or terms to refer to or quantify over other formulas or terms. We don't have to worry about undefined terms or undefined truth values, and the logic itself is consistent. A side effect is that infinite lists of formulas sometimes have to be included as infinite lists of axioms, rather than as a single axiom that quantifies over the formulas. This is usually viewed as an acceptable cost, given the other nice properties of the logic.






    share|cite|improve this answer











    $endgroup$




















      2












      $begingroup$

      The other answers have explained that these axiom schemas of ZFC have to be schemas and not axioms, because we want ZFC to be a first-order theory. The reasoning is that we need to quantify over all formulae over ZFC. In particular, the Specification schema of ZFC can be stated as:




      For each $(k+1)$-parameter sentence $φ$, ZFC has the axiom:




      $∀p[1..k] ∀A ∃B ∀x ( x∈B ⇔ x∈A ∧ φ(p[1..k],x) )$.





      Notice that we cannot have a single axiom that quantifies over all parametrized sentences (over the theory itself), because that is not (directly) expressible in first-order logic. In fact, it can be shown that ZFC cannot be finitely axiomatized (using the same language).



      But that is not quite the full story. There is a general procedure to convert any sufficiently nice first-order theory $T$ into another first-order theory $T'$ (with a different language) that interprets $T$ and yet is finitely axiomatized. Doing this to PA yields a well-known second-order theory called ACA0, and doing this to ZFC yields almost NBG.



      The basic idea to use two sorts, one sort $I$ for the original domain of $T$, and the other sort $J$ for 'classes', where each class is a subcollection of $I$ that represents some predicate over $T$. (For this, we need $T$ to support some reasonable ordered-pair encoding.) We then axiomatize the existence of the class of all objects in $I$, namely $∃C∈J ∀x∈I ( x∈C )$, which for convenience we shall state as existence of the class $ x : x∈I $. We also axiomatize the existence of the class $ (x,x) : x∈I $ to capture equality, and the existence of a class to capture each predicate/function-symbol of $T$. For example, to capture "$∈$" of ZFC we would have the class $ (x,y) : x,y∈I ∧ x∈y $. We also axiomatize existence of the singleton class $x$ for any $x∈I$, and that classes are closed under complement, union, intersection, cartesian product and projection.



      If $T$ had finitely many predicate/function symbols, then the above axioms are finitely many, and yet allow us to construct any definable class of tuples over $T$, namely $ x[1..k] : φ(x[1..k]) $ for any $k$-parameter sentence $φ$ over $T$, as a member of $J$. Furthermore, every class that we can explicitly construct corresponds to some parametrized sentence over $T$. Thus we can now express any axiom schema of $T$ that quantifies over all parametrized sentences over $T$ as a single axiom quantifying over all members of $J$, and can prove that the resulting theory $T'$ is conservative over $T$ in the specific sense that every sentence over $T$ is provable by $T$ iff its translation (i.e. restricting every quantifier to $I$) is provable by $T'$.



      Since any many-sorted first-order theory can be easily interpreted by a one-sorted first-order theory (i.e. replacing the sorts by predicate-symbols), it is inconsequential that we used two sorts in $T'$.



      Quite apart from the finite axiomatizability of NBG, some set theorists prefer to say that they are working in NBG rather than ZFC, since it intrinsically allows definition and quantification over classes. But if one wants to be able to define a class whose defining formula involves quantifying over classes (like one can in ZFC define a set whose defining formula can quantify over sets), then the result (MK) is again is no longer finitely axiomatizable (in the language of NBG), because you would need an axiom schema for Specification of classes.






      share|cite|improve this answer











      $endgroup$












      • $begingroup$
        Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
        $endgroup$
        – user21820
        yesterday











      Your Answer





      StackExchange.ifUsing("editor", function ()
      return StackExchange.using("mathjaxEditing", function ()
      StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
      StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
      );
      );
      , "mathjax-editing");

      StackExchange.ready(function()
      var channelOptions =
      tags: "".split(" "),
      id: "69"
      ;
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function()
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled)
      StackExchange.using("snippets", function()
      createEditor();
      );

      else
      createEditor();

      );

      function createEditor()
      StackExchange.prepareEditor(
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader:
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      ,
      noCode: true, onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      );



      );













      draft saved

      draft discarded


















      StackExchange.ready(
      function ()
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3168456%2fwhy-are-the-axiom-of-specification-is-an-axiom-schema-why-not-just-a-single-axi%23new-answer', 'question_page');

      );

      Post as a guest















      Required, but never shown

























      4 Answers
      4






      active

      oldest

      votes








      4 Answers
      4






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      19












      $begingroup$

      This is just the choice of underlying logic. ZFC is a theory in first-order logic, and the strictures of that logical system rule out certain kinds of expressions. There are other logics, and their study comprises abstract model theory.



      Very roughly, there are two competing hopes for a logical system:



      • It should be expressive: things we intuitively want to be able to say, should be say-able in the system.


      • It should be not too wild: e.g. there should be a well-behaved notion of proof.


      It turns out that these are fundamentally in tension. For example, if we want proofs to be finite, then our logical system can't capture infinite structures up to isomorphism (this is the compactness theorem, essentially).



      So why did we pick first-order logic after all, given that it forces us to use axiom schemata (and other inefficiencies)? Well, first-order logic seems to sit at a sweet spot here: it's fairly expressive, but also has a very well-behaved notion of proof and a more technical property called the "Lowenheim-Skolem property" which roughly says that it doesn't interact too much with set theory (indeed, it's the most expressive logic with these properties - this is due to Lindstrom).



      This paper of Ferrairos may be of interest with regard to how first-order logic emerged as "the" primary logic of mathematics.






      share|cite|improve this answer









      $endgroup$

















        19












        $begingroup$

        This is just the choice of underlying logic. ZFC is a theory in first-order logic, and the strictures of that logical system rule out certain kinds of expressions. There are other logics, and their study comprises abstract model theory.



        Very roughly, there are two competing hopes for a logical system:



        • It should be expressive: things we intuitively want to be able to say, should be say-able in the system.


        • It should be not too wild: e.g. there should be a well-behaved notion of proof.


        It turns out that these are fundamentally in tension. For example, if we want proofs to be finite, then our logical system can't capture infinite structures up to isomorphism (this is the compactness theorem, essentially).



        So why did we pick first-order logic after all, given that it forces us to use axiom schemata (and other inefficiencies)? Well, first-order logic seems to sit at a sweet spot here: it's fairly expressive, but also has a very well-behaved notion of proof and a more technical property called the "Lowenheim-Skolem property" which roughly says that it doesn't interact too much with set theory (indeed, it's the most expressive logic with these properties - this is due to Lindstrom).



        This paper of Ferrairos may be of interest with regard to how first-order logic emerged as "the" primary logic of mathematics.






        share|cite|improve this answer









        $endgroup$















          19












          19








          19





          $begingroup$

          This is just the choice of underlying logic. ZFC is a theory in first-order logic, and the strictures of that logical system rule out certain kinds of expressions. There are other logics, and their study comprises abstract model theory.



          Very roughly, there are two competing hopes for a logical system:



          • It should be expressive: things we intuitively want to be able to say, should be say-able in the system.


          • It should be not too wild: e.g. there should be a well-behaved notion of proof.


          It turns out that these are fundamentally in tension. For example, if we want proofs to be finite, then our logical system can't capture infinite structures up to isomorphism (this is the compactness theorem, essentially).



          So why did we pick first-order logic after all, given that it forces us to use axiom schemata (and other inefficiencies)? Well, first-order logic seems to sit at a sweet spot here: it's fairly expressive, but also has a very well-behaved notion of proof and a more technical property called the "Lowenheim-Skolem property" which roughly says that it doesn't interact too much with set theory (indeed, it's the most expressive logic with these properties - this is due to Lindstrom).



          This paper of Ferrairos may be of interest with regard to how first-order logic emerged as "the" primary logic of mathematics.






          share|cite|improve this answer









          $endgroup$



          This is just the choice of underlying logic. ZFC is a theory in first-order logic, and the strictures of that logical system rule out certain kinds of expressions. There are other logics, and their study comprises abstract model theory.



          Very roughly, there are two competing hopes for a logical system:



          • It should be expressive: things we intuitively want to be able to say, should be say-able in the system.


          • It should be not too wild: e.g. there should be a well-behaved notion of proof.


          It turns out that these are fundamentally in tension. For example, if we want proofs to be finite, then our logical system can't capture infinite structures up to isomorphism (this is the compactness theorem, essentially).



          So why did we pick first-order logic after all, given that it forces us to use axiom schemata (and other inefficiencies)? Well, first-order logic seems to sit at a sweet spot here: it's fairly expressive, but also has a very well-behaved notion of proof and a more technical property called the "Lowenheim-Skolem property" which roughly says that it doesn't interact too much with set theory (indeed, it's the most expressive logic with these properties - this is due to Lindstrom).



          This paper of Ferrairos may be of interest with regard to how first-order logic emerged as "the" primary logic of mathematics.







          share|cite|improve this answer












          share|cite|improve this answer



          share|cite|improve this answer










          answered 2 days ago









          Noah SchweberNoah Schweber

          128k10151294




          128k10151294





















              8












              $begingroup$

              In ZF, all expressions must ultimately be a syntactically valid, finite combination of variable names, the $forall$ quantifier, parentheses, the logical operations $lnot$ and $lor$, $=$ and finally $in$. That's it.



              Of course, in practice we have a lot of other symbols, like $subseteq$ and $exists$, but technically they are all defined as specific shorthands for combinations of the symbols above.



              There is no way to use these to say $forall phi(phitext is a formulatoldots)$, the way one might want to do to make the axiom schema into actual axioms.






              share|cite|improve this answer









              $endgroup$












              • $begingroup$
                I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
                $endgroup$
                – RothX
                2 days ago






              • 5




                $begingroup$
                +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
                $endgroup$
                – Alex Kruckman
                2 days ago






              • 2




                $begingroup$
                As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
                $endgroup$
                – Alex Kruckman
                2 days ago







              • 1




                $begingroup$
                (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
                $endgroup$
                – Alex Kruckman
                2 days ago















              8












              $begingroup$

              In ZF, all expressions must ultimately be a syntactically valid, finite combination of variable names, the $forall$ quantifier, parentheses, the logical operations $lnot$ and $lor$, $=$ and finally $in$. That's it.



              Of course, in practice we have a lot of other symbols, like $subseteq$ and $exists$, but technically they are all defined as specific shorthands for combinations of the symbols above.



              There is no way to use these to say $forall phi(phitext is a formulatoldots)$, the way one might want to do to make the axiom schema into actual axioms.






              share|cite|improve this answer









              $endgroup$












              • $begingroup$
                I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
                $endgroup$
                – RothX
                2 days ago






              • 5




                $begingroup$
                +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
                $endgroup$
                – Alex Kruckman
                2 days ago






              • 2




                $begingroup$
                As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
                $endgroup$
                – Alex Kruckman
                2 days ago







              • 1




                $begingroup$
                (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
                $endgroup$
                – Alex Kruckman
                2 days ago













              8












              8








              8





              $begingroup$

              In ZF, all expressions must ultimately be a syntactically valid, finite combination of variable names, the $forall$ quantifier, parentheses, the logical operations $lnot$ and $lor$, $=$ and finally $in$. That's it.



              Of course, in practice we have a lot of other symbols, like $subseteq$ and $exists$, but technically they are all defined as specific shorthands for combinations of the symbols above.



              There is no way to use these to say $forall phi(phitext is a formulatoldots)$, the way one might want to do to make the axiom schema into actual axioms.






              share|cite|improve this answer









              $endgroup$



              In ZF, all expressions must ultimately be a syntactically valid, finite combination of variable names, the $forall$ quantifier, parentheses, the logical operations $lnot$ and $lor$, $=$ and finally $in$. That's it.



              Of course, in practice we have a lot of other symbols, like $subseteq$ and $exists$, but technically they are all defined as specific shorthands for combinations of the symbols above.



              There is no way to use these to say $forall phi(phitext is a formulatoldots)$, the way one might want to do to make the axiom schema into actual axioms.







              share|cite|improve this answer












              share|cite|improve this answer



              share|cite|improve this answer










              answered 2 days ago









              ArthurArthur

              121k7121207




              121k7121207











              • $begingroup$
                I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
                $endgroup$
                – RothX
                2 days ago






              • 5




                $begingroup$
                +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
                $endgroup$
                – Alex Kruckman
                2 days ago






              • 2




                $begingroup$
                As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
                $endgroup$
                – Alex Kruckman
                2 days ago







              • 1




                $begingroup$
                (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
                $endgroup$
                – Alex Kruckman
                2 days ago
















              • $begingroup$
                I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
                $endgroup$
                – RothX
                2 days ago






              • 5




                $begingroup$
                +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
                $endgroup$
                – Alex Kruckman
                2 days ago






              • 2




                $begingroup$
                As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
                $endgroup$
                – Alex Kruckman
                2 days ago







              • 1




                $begingroup$
                (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
                $endgroup$
                – Alex Kruckman
                2 days ago















              $begingroup$
              I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
              $endgroup$
              – RothX
              2 days ago




              $begingroup$
              I see what you're getting at, but I feel like there's something more. You say that in ZF, all expressions must be as you described. But why? That's not one of the axioms of ZF. Is that a rule for all axioms, or does it only apply in ZF? And either way, why?
              $endgroup$
              – RothX
              2 days ago




              5




              5




              $begingroup$
              +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
              $endgroup$
              – Alex Kruckman
              2 days ago




              $begingroup$
              +1 and it's probably worth answering the question "why?": Because ZFC is a first-order theory in the language of set theory, which means that its axioms must be sentences of first-order logic in the language with a single binary relation symbol $in$. That is, the logical symbols mentioned in the answer are not chosen arbitrarily, they're the building blocks of first-order logic.
              $endgroup$
              – Alex Kruckman
              2 days ago




              2




              2




              $begingroup$
              As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
              $endgroup$
              – Alex Kruckman
              2 days ago





              $begingroup$
              As for why we want ZFC to be a first-order theory, this is a more complicated question. It essentially comes down to the fact that (1) first-order logic is restricted enough to have a good proof system, but (2) expressive enough that we can do mathematics in first-order set theory.
              $endgroup$
              – Alex Kruckman
              2 days ago





              1




              1




              $begingroup$
              (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
              $endgroup$
              – Alex Kruckman
              2 days ago




              $begingroup$
              (It seems I've said almost exactly the same things as Noah did in his concurrently written answer, but he said them better!)
              $endgroup$
              – Alex Kruckman
              2 days ago











              8












              $begingroup$

              Noah Schweber pointed out that there is a tension between expressiveness of a logic and having a nice proof theory. There is another tension, between expressiveness and inconsistency.



              More expressive logical systems were developed in the early 1930s by Church (a form of $lambda$ calculus) and separately by Curry (a form of combinatory logic, essentially a different kind of $lambda$ calculus). These logics were more expressive in the sense that they could refer to their own formulas more directly than in first-order logic, essentially by allowing variables to refer to terms or formulas.



              Unfortunately, both of these systems were shown to be inconsistent by Kleene and Rosser in a joint paper in 1935. (Church had already tried to modify his system to avoid inconsistency, but they showed his revised system was inconsistent as well as Curry's system of the time.) More information is available in the article "Paradoxes and Contemporary Logic" by Andrea Cantini and Riccardo Bruni in the Stanford Encyclopedia of Philosophy. (Recall that other, earlier logics systems that tried to be very strong, such as Russell's original system for Principia Mathematica, were also found to be inconsistent.)



              After the inconsistencies were found, Church and Curry both turned their attention to weaker systems, including the simply typed $lambda$ calculus developed by Church. The inconsistent systems slipped into history, but they are still important examples on the limits to what can be put into a logic.



              We now realize that there is a limit on how much a logic can refer to itself. Variations of Richard's paradox and Curry's paradox arise easily with too much self-reference. In a sense, first-order logic and theories such as Peano Arithmetic and ZFC stay just inside this limit. The result is that PA and ZFC are consistent but are subject to Gödel's incompleteness theorems. Adding just slightly more self-reference - which seems to be very hard to avoid in systems that can quantify over and manipulate their own formulas - tends to create systems that are inconsistent or where some terms are undefined or some formulas have undefined truth values. You can't have it all in a consistent logic.



              First-order logic avoids all of this by having no direct way for formulas or terms to refer to or quantify over other formulas or terms. We don't have to worry about undefined terms or undefined truth values, and the logic itself is consistent. A side effect is that infinite lists of formulas sometimes have to be included as infinite lists of axioms, rather than as a single axiom that quantifies over the formulas. This is usually viewed as an acceptable cost, given the other nice properties of the logic.






              share|cite|improve this answer











              $endgroup$

















                8












                $begingroup$

                Noah Schweber pointed out that there is a tension between expressiveness of a logic and having a nice proof theory. There is another tension, between expressiveness and inconsistency.



                More expressive logical systems were developed in the early 1930s by Church (a form of $lambda$ calculus) and separately by Curry (a form of combinatory logic, essentially a different kind of $lambda$ calculus). These logics were more expressive in the sense that they could refer to their own formulas more directly than in first-order logic, essentially by allowing variables to refer to terms or formulas.



                Unfortunately, both of these systems were shown to be inconsistent by Kleene and Rosser in a joint paper in 1935. (Church had already tried to modify his system to avoid inconsistency, but they showed his revised system was inconsistent as well as Curry's system of the time.) More information is available in the article "Paradoxes and Contemporary Logic" by Andrea Cantini and Riccardo Bruni in the Stanford Encyclopedia of Philosophy. (Recall that other, earlier logics systems that tried to be very strong, such as Russell's original system for Principia Mathematica, were also found to be inconsistent.)



                After the inconsistencies were found, Church and Curry both turned their attention to weaker systems, including the simply typed $lambda$ calculus developed by Church. The inconsistent systems slipped into history, but they are still important examples on the limits to what can be put into a logic.



                We now realize that there is a limit on how much a logic can refer to itself. Variations of Richard's paradox and Curry's paradox arise easily with too much self-reference. In a sense, first-order logic and theories such as Peano Arithmetic and ZFC stay just inside this limit. The result is that PA and ZFC are consistent but are subject to Gödel's incompleteness theorems. Adding just slightly more self-reference - which seems to be very hard to avoid in systems that can quantify over and manipulate their own formulas - tends to create systems that are inconsistent or where some terms are undefined or some formulas have undefined truth values. You can't have it all in a consistent logic.



                First-order logic avoids all of this by having no direct way for formulas or terms to refer to or quantify over other formulas or terms. We don't have to worry about undefined terms or undefined truth values, and the logic itself is consistent. A side effect is that infinite lists of formulas sometimes have to be included as infinite lists of axioms, rather than as a single axiom that quantifies over the formulas. This is usually viewed as an acceptable cost, given the other nice properties of the logic.






                share|cite|improve this answer











                $endgroup$















                  8












                  8








                  8





                  $begingroup$

                  Noah Schweber pointed out that there is a tension between expressiveness of a logic and having a nice proof theory. There is another tension, between expressiveness and inconsistency.



                  More expressive logical systems were developed in the early 1930s by Church (a form of $lambda$ calculus) and separately by Curry (a form of combinatory logic, essentially a different kind of $lambda$ calculus). These logics were more expressive in the sense that they could refer to their own formulas more directly than in first-order logic, essentially by allowing variables to refer to terms or formulas.



                  Unfortunately, both of these systems were shown to be inconsistent by Kleene and Rosser in a joint paper in 1935. (Church had already tried to modify his system to avoid inconsistency, but they showed his revised system was inconsistent as well as Curry's system of the time.) More information is available in the article "Paradoxes and Contemporary Logic" by Andrea Cantini and Riccardo Bruni in the Stanford Encyclopedia of Philosophy. (Recall that other, earlier logics systems that tried to be very strong, such as Russell's original system for Principia Mathematica, were also found to be inconsistent.)



                  After the inconsistencies were found, Church and Curry both turned their attention to weaker systems, including the simply typed $lambda$ calculus developed by Church. The inconsistent systems slipped into history, but they are still important examples on the limits to what can be put into a logic.



                  We now realize that there is a limit on how much a logic can refer to itself. Variations of Richard's paradox and Curry's paradox arise easily with too much self-reference. In a sense, first-order logic and theories such as Peano Arithmetic and ZFC stay just inside this limit. The result is that PA and ZFC are consistent but are subject to Gödel's incompleteness theorems. Adding just slightly more self-reference - which seems to be very hard to avoid in systems that can quantify over and manipulate their own formulas - tends to create systems that are inconsistent or where some terms are undefined or some formulas have undefined truth values. You can't have it all in a consistent logic.



                  First-order logic avoids all of this by having no direct way for formulas or terms to refer to or quantify over other formulas or terms. We don't have to worry about undefined terms or undefined truth values, and the logic itself is consistent. A side effect is that infinite lists of formulas sometimes have to be included as infinite lists of axioms, rather than as a single axiom that quantifies over the formulas. This is usually viewed as an acceptable cost, given the other nice properties of the logic.






                  share|cite|improve this answer











                  $endgroup$



                  Noah Schweber pointed out that there is a tension between expressiveness of a logic and having a nice proof theory. There is another tension, between expressiveness and inconsistency.



                  More expressive logical systems were developed in the early 1930s by Church (a form of $lambda$ calculus) and separately by Curry (a form of combinatory logic, essentially a different kind of $lambda$ calculus). These logics were more expressive in the sense that they could refer to their own formulas more directly than in first-order logic, essentially by allowing variables to refer to terms or formulas.



                  Unfortunately, both of these systems were shown to be inconsistent by Kleene and Rosser in a joint paper in 1935. (Church had already tried to modify his system to avoid inconsistency, but they showed his revised system was inconsistent as well as Curry's system of the time.) More information is available in the article "Paradoxes and Contemporary Logic" by Andrea Cantini and Riccardo Bruni in the Stanford Encyclopedia of Philosophy. (Recall that other, earlier logics systems that tried to be very strong, such as Russell's original system for Principia Mathematica, were also found to be inconsistent.)



                  After the inconsistencies were found, Church and Curry both turned their attention to weaker systems, including the simply typed $lambda$ calculus developed by Church. The inconsistent systems slipped into history, but they are still important examples on the limits to what can be put into a logic.



                  We now realize that there is a limit on how much a logic can refer to itself. Variations of Richard's paradox and Curry's paradox arise easily with too much self-reference. In a sense, first-order logic and theories such as Peano Arithmetic and ZFC stay just inside this limit. The result is that PA and ZFC are consistent but are subject to Gödel's incompleteness theorems. Adding just slightly more self-reference - which seems to be very hard to avoid in systems that can quantify over and manipulate their own formulas - tends to create systems that are inconsistent or where some terms are undefined or some formulas have undefined truth values. You can't have it all in a consistent logic.



                  First-order logic avoids all of this by having no direct way for formulas or terms to refer to or quantify over other formulas or terms. We don't have to worry about undefined terms or undefined truth values, and the logic itself is consistent. A side effect is that infinite lists of formulas sometimes have to be included as infinite lists of axioms, rather than as a single axiom that quantifies over the formulas. This is usually viewed as an acceptable cost, given the other nice properties of the logic.







                  share|cite|improve this answer














                  share|cite|improve this answer



                  share|cite|improve this answer








                  edited yesterday

























                  answered yesterday









                  Carl MummertCarl Mummert

                  67.8k7133252




                  67.8k7133252





















                      2












                      $begingroup$

                      The other answers have explained that these axiom schemas of ZFC have to be schemas and not axioms, because we want ZFC to be a first-order theory. The reasoning is that we need to quantify over all formulae over ZFC. In particular, the Specification schema of ZFC can be stated as:




                      For each $(k+1)$-parameter sentence $φ$, ZFC has the axiom:




                      $∀p[1..k] ∀A ∃B ∀x ( x∈B ⇔ x∈A ∧ φ(p[1..k],x) )$.





                      Notice that we cannot have a single axiom that quantifies over all parametrized sentences (over the theory itself), because that is not (directly) expressible in first-order logic. In fact, it can be shown that ZFC cannot be finitely axiomatized (using the same language).



                      But that is not quite the full story. There is a general procedure to convert any sufficiently nice first-order theory $T$ into another first-order theory $T'$ (with a different language) that interprets $T$ and yet is finitely axiomatized. Doing this to PA yields a well-known second-order theory called ACA0, and doing this to ZFC yields almost NBG.



                      The basic idea to use two sorts, one sort $I$ for the original domain of $T$, and the other sort $J$ for 'classes', where each class is a subcollection of $I$ that represents some predicate over $T$. (For this, we need $T$ to support some reasonable ordered-pair encoding.) We then axiomatize the existence of the class of all objects in $I$, namely $∃C∈J ∀x∈I ( x∈C )$, which for convenience we shall state as existence of the class $ x : x∈I $. We also axiomatize the existence of the class $ (x,x) : x∈I $ to capture equality, and the existence of a class to capture each predicate/function-symbol of $T$. For example, to capture "$∈$" of ZFC we would have the class $ (x,y) : x,y∈I ∧ x∈y $. We also axiomatize existence of the singleton class $x$ for any $x∈I$, and that classes are closed under complement, union, intersection, cartesian product and projection.



                      If $T$ had finitely many predicate/function symbols, then the above axioms are finitely many, and yet allow us to construct any definable class of tuples over $T$, namely $ x[1..k] : φ(x[1..k]) $ for any $k$-parameter sentence $φ$ over $T$, as a member of $J$. Furthermore, every class that we can explicitly construct corresponds to some parametrized sentence over $T$. Thus we can now express any axiom schema of $T$ that quantifies over all parametrized sentences over $T$ as a single axiom quantifying over all members of $J$, and can prove that the resulting theory $T'$ is conservative over $T$ in the specific sense that every sentence over $T$ is provable by $T$ iff its translation (i.e. restricting every quantifier to $I$) is provable by $T'$.



                      Since any many-sorted first-order theory can be easily interpreted by a one-sorted first-order theory (i.e. replacing the sorts by predicate-symbols), it is inconsequential that we used two sorts in $T'$.



                      Quite apart from the finite axiomatizability of NBG, some set theorists prefer to say that they are working in NBG rather than ZFC, since it intrinsically allows definition and quantification over classes. But if one wants to be able to define a class whose defining formula involves quantifying over classes (like one can in ZFC define a set whose defining formula can quantify over sets), then the result (MK) is again is no longer finitely axiomatizable (in the language of NBG), because you would need an axiom schema for Specification of classes.






                      share|cite|improve this answer











                      $endgroup$












                      • $begingroup$
                        Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                        $endgroup$
                        – user21820
                        yesterday















                      2












                      $begingroup$

                      The other answers have explained that these axiom schemas of ZFC have to be schemas and not axioms, because we want ZFC to be a first-order theory. The reasoning is that we need to quantify over all formulae over ZFC. In particular, the Specification schema of ZFC can be stated as:




                      For each $(k+1)$-parameter sentence $φ$, ZFC has the axiom:




                      $∀p[1..k] ∀A ∃B ∀x ( x∈B ⇔ x∈A ∧ φ(p[1..k],x) )$.





                      Notice that we cannot have a single axiom that quantifies over all parametrized sentences (over the theory itself), because that is not (directly) expressible in first-order logic. In fact, it can be shown that ZFC cannot be finitely axiomatized (using the same language).



                      But that is not quite the full story. There is a general procedure to convert any sufficiently nice first-order theory $T$ into another first-order theory $T'$ (with a different language) that interprets $T$ and yet is finitely axiomatized. Doing this to PA yields a well-known second-order theory called ACA0, and doing this to ZFC yields almost NBG.



                      The basic idea to use two sorts, one sort $I$ for the original domain of $T$, and the other sort $J$ for 'classes', where each class is a subcollection of $I$ that represents some predicate over $T$. (For this, we need $T$ to support some reasonable ordered-pair encoding.) We then axiomatize the existence of the class of all objects in $I$, namely $∃C∈J ∀x∈I ( x∈C )$, which for convenience we shall state as existence of the class $ x : x∈I $. We also axiomatize the existence of the class $ (x,x) : x∈I $ to capture equality, and the existence of a class to capture each predicate/function-symbol of $T$. For example, to capture "$∈$" of ZFC we would have the class $ (x,y) : x,y∈I ∧ x∈y $. We also axiomatize existence of the singleton class $x$ for any $x∈I$, and that classes are closed under complement, union, intersection, cartesian product and projection.



                      If $T$ had finitely many predicate/function symbols, then the above axioms are finitely many, and yet allow us to construct any definable class of tuples over $T$, namely $ x[1..k] : φ(x[1..k]) $ for any $k$-parameter sentence $φ$ over $T$, as a member of $J$. Furthermore, every class that we can explicitly construct corresponds to some parametrized sentence over $T$. Thus we can now express any axiom schema of $T$ that quantifies over all parametrized sentences over $T$ as a single axiom quantifying over all members of $J$, and can prove that the resulting theory $T'$ is conservative over $T$ in the specific sense that every sentence over $T$ is provable by $T$ iff its translation (i.e. restricting every quantifier to $I$) is provable by $T'$.



                      Since any many-sorted first-order theory can be easily interpreted by a one-sorted first-order theory (i.e. replacing the sorts by predicate-symbols), it is inconsequential that we used two sorts in $T'$.



                      Quite apart from the finite axiomatizability of NBG, some set theorists prefer to say that they are working in NBG rather than ZFC, since it intrinsically allows definition and quantification over classes. But if one wants to be able to define a class whose defining formula involves quantifying over classes (like one can in ZFC define a set whose defining formula can quantify over sets), then the result (MK) is again is no longer finitely axiomatizable (in the language of NBG), because you would need an axiom schema for Specification of classes.






                      share|cite|improve this answer











                      $endgroup$












                      • $begingroup$
                        Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                        $endgroup$
                        – user21820
                        yesterday













                      2












                      2








                      2





                      $begingroup$

                      The other answers have explained that these axiom schemas of ZFC have to be schemas and not axioms, because we want ZFC to be a first-order theory. The reasoning is that we need to quantify over all formulae over ZFC. In particular, the Specification schema of ZFC can be stated as:




                      For each $(k+1)$-parameter sentence $φ$, ZFC has the axiom:




                      $∀p[1..k] ∀A ∃B ∀x ( x∈B ⇔ x∈A ∧ φ(p[1..k],x) )$.





                      Notice that we cannot have a single axiom that quantifies over all parametrized sentences (over the theory itself), because that is not (directly) expressible in first-order logic. In fact, it can be shown that ZFC cannot be finitely axiomatized (using the same language).



                      But that is not quite the full story. There is a general procedure to convert any sufficiently nice first-order theory $T$ into another first-order theory $T'$ (with a different language) that interprets $T$ and yet is finitely axiomatized. Doing this to PA yields a well-known second-order theory called ACA0, and doing this to ZFC yields almost NBG.



                      The basic idea to use two sorts, one sort $I$ for the original domain of $T$, and the other sort $J$ for 'classes', where each class is a subcollection of $I$ that represents some predicate over $T$. (For this, we need $T$ to support some reasonable ordered-pair encoding.) We then axiomatize the existence of the class of all objects in $I$, namely $∃C∈J ∀x∈I ( x∈C )$, which for convenience we shall state as existence of the class $ x : x∈I $. We also axiomatize the existence of the class $ (x,x) : x∈I $ to capture equality, and the existence of a class to capture each predicate/function-symbol of $T$. For example, to capture "$∈$" of ZFC we would have the class $ (x,y) : x,y∈I ∧ x∈y $. We also axiomatize existence of the singleton class $x$ for any $x∈I$, and that classes are closed under complement, union, intersection, cartesian product and projection.



                      If $T$ had finitely many predicate/function symbols, then the above axioms are finitely many, and yet allow us to construct any definable class of tuples over $T$, namely $ x[1..k] : φ(x[1..k]) $ for any $k$-parameter sentence $φ$ over $T$, as a member of $J$. Furthermore, every class that we can explicitly construct corresponds to some parametrized sentence over $T$. Thus we can now express any axiom schema of $T$ that quantifies over all parametrized sentences over $T$ as a single axiom quantifying over all members of $J$, and can prove that the resulting theory $T'$ is conservative over $T$ in the specific sense that every sentence over $T$ is provable by $T$ iff its translation (i.e. restricting every quantifier to $I$) is provable by $T'$.



                      Since any many-sorted first-order theory can be easily interpreted by a one-sorted first-order theory (i.e. replacing the sorts by predicate-symbols), it is inconsequential that we used two sorts in $T'$.



                      Quite apart from the finite axiomatizability of NBG, some set theorists prefer to say that they are working in NBG rather than ZFC, since it intrinsically allows definition and quantification over classes. But if one wants to be able to define a class whose defining formula involves quantifying over classes (like one can in ZFC define a set whose defining formula can quantify over sets), then the result (MK) is again is no longer finitely axiomatizable (in the language of NBG), because you would need an axiom schema for Specification of classes.






                      share|cite|improve this answer











                      $endgroup$



                      The other answers have explained that these axiom schemas of ZFC have to be schemas and not axioms, because we want ZFC to be a first-order theory. The reasoning is that we need to quantify over all formulae over ZFC. In particular, the Specification schema of ZFC can be stated as:




                      For each $(k+1)$-parameter sentence $φ$, ZFC has the axiom:




                      $∀p[1..k] ∀A ∃B ∀x ( x∈B ⇔ x∈A ∧ φ(p[1..k],x) )$.





                      Notice that we cannot have a single axiom that quantifies over all parametrized sentences (over the theory itself), because that is not (directly) expressible in first-order logic. In fact, it can be shown that ZFC cannot be finitely axiomatized (using the same language).



                      But that is not quite the full story. There is a general procedure to convert any sufficiently nice first-order theory $T$ into another first-order theory $T'$ (with a different language) that interprets $T$ and yet is finitely axiomatized. Doing this to PA yields a well-known second-order theory called ACA0, and doing this to ZFC yields almost NBG.



                      The basic idea to use two sorts, one sort $I$ for the original domain of $T$, and the other sort $J$ for 'classes', where each class is a subcollection of $I$ that represents some predicate over $T$. (For this, we need $T$ to support some reasonable ordered-pair encoding.) We then axiomatize the existence of the class of all objects in $I$, namely $∃C∈J ∀x∈I ( x∈C )$, which for convenience we shall state as existence of the class $ x : x∈I $. We also axiomatize the existence of the class $ (x,x) : x∈I $ to capture equality, and the existence of a class to capture each predicate/function-symbol of $T$. For example, to capture "$∈$" of ZFC we would have the class $ (x,y) : x,y∈I ∧ x∈y $. We also axiomatize existence of the singleton class $x$ for any $x∈I$, and that classes are closed under complement, union, intersection, cartesian product and projection.



                      If $T$ had finitely many predicate/function symbols, then the above axioms are finitely many, and yet allow us to construct any definable class of tuples over $T$, namely $ x[1..k] : φ(x[1..k]) $ for any $k$-parameter sentence $φ$ over $T$, as a member of $J$. Furthermore, every class that we can explicitly construct corresponds to some parametrized sentence over $T$. Thus we can now express any axiom schema of $T$ that quantifies over all parametrized sentences over $T$ as a single axiom quantifying over all members of $J$, and can prove that the resulting theory $T'$ is conservative over $T$ in the specific sense that every sentence over $T$ is provable by $T$ iff its translation (i.e. restricting every quantifier to $I$) is provable by $T'$.



                      Since any many-sorted first-order theory can be easily interpreted by a one-sorted first-order theory (i.e. replacing the sorts by predicate-symbols), it is inconsequential that we used two sorts in $T'$.



                      Quite apart from the finite axiomatizability of NBG, some set theorists prefer to say that they are working in NBG rather than ZFC, since it intrinsically allows definition and quantification over classes. But if one wants to be able to define a class whose defining formula involves quantifying over classes (like one can in ZFC define a set whose defining formula can quantify over sets), then the result (MK) is again is no longer finitely axiomatizable (in the language of NBG), because you would need an axiom schema for Specification of classes.







                      share|cite|improve this answer














                      share|cite|improve this answer



                      share|cite|improve this answer








                      edited yesterday

























                      answered yesterday









                      user21820user21820

                      39.9k544159




                      39.9k544159











                      • $begingroup$
                        Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                        $endgroup$
                        – user21820
                        yesterday
















                      • $begingroup$
                        Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                        $endgroup$
                        – user21820
                        yesterday















                      $begingroup$
                      Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                      $endgroup$
                      – user21820
                      yesterday




                      $begingroup$
                      Of course, neither is my post anywhere near the full story... I just hope it can dispel any potential misconception that we cannot have a finitely axiomatized first-order theory that captures ZFC set theory. =)
                      $endgroup$
                      – user21820
                      yesterday

















                      draft saved

                      draft discarded
















































                      Thanks for contributing an answer to Mathematics Stack Exchange!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid


                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.

                      Use MathJax to format equations. MathJax reference.


                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function ()
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3168456%2fwhy-are-the-axiom-of-specification-is-an-axiom-schema-why-not-just-a-single-axi%23new-answer', 'question_page');

                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      How to write a 12-bar blues melodyI-IV-V blues progressionHow to play the bridges in a standard blues progressionHow does Gdim7 fit in C# minor?question on a certain chord progressionMusicology of Melody12 bar blues, spread rhythm: alternative to 6th chord to avoid finger stretchChord progressions/ Root key/ MelodiesHow to put chords (POP-EDM) under a given lead vocal melody (starting from a good knowledge in music theory)Are there “rules” for improvising with the minor pentatonic scale over 12-bar shuffle?Confusion about blues scale and chords

                      What if the end-user didn't have the required library?What is setup.py?What is a clean, pythonic way to have multiple constructors in Python?What does Ruby have that Python doesn't, and vice versa?What is the reason for having '//' in Python?How do I create a namespace package in Python?How to package shared objects that python modules depend on?setuptools vs. distutils: why is distutils still a thing?Navigation in Windows 10 vs code not going to virtualenv library when the same library is installed at user levelPython create package for local usePackaging a project that uses multiple python versionsWhy is permission denied on pip install except for when “--user” is included at end of command?

                      Why did Thanos need his ship to help him in the battle scene?Which actor plays Thanos in the Avengers mid-credits scene?Are there economic implications portrayed in comics where the buildings and cities are ruined almost daily?Old X-Men comic where team travels to alien world with a ring-like sun that needs recharging?Why does Ego need help sleeping?Is there an objective answer to who “the strongest Avenger” is?How did Banner get unstuck?Why did Thanos get hit?How did Thanos (or anyone) know the Infinity Stones would give him this power?Did Thanos leave Eitri alive for his after-sales service?In Avengers 1, why does Thanos need Loki?