Mathematics
From The Beginning
by
James Adrian
Please send comments or questions here,
or copy this address into your favorite email program:
jim@futurebeacon.com
This system of terms and statements is offered as a
starting point or
foundation for mathematics. Its distinctive
features are these:
Nothing may contain itself.
A and B are
equal if and only if they have the same
meaning.
The definitions of the terms
elements,
sets,
subsets,
empty sets,
ordered sets and others are different from those in other systems.
A
set is defined not only by what it contains, but also by what
it
may contain according to its criterion of element inclusion.
Definitions may contain references to
time such as the terms
previously,
before,
after, and
subsequently.
Sets that contain all of the same elements are
coincident, but
they are not necessarily
equal.
Sets are
closed under operations by means other than infinite sets.
Axioms are not used.
Mathematics is a direct consequence of our most basic perceptions; therefore, it must not be freely invented. It must be
rigorously derived. The creation of any given definition is constrained by those meanings established previous to the statement
of that definition.
In this work, a statement is not taken to be true unless it is known to be true. No assumptions are made. A statement may
not be listed as an axiom if it can be shown to be true. This eliminates axioms. It has been widely stated and believed that
such a restriction relegates math to trivial results and precludes the possibility of certain results that are regarded as valuable. This
is simply not true.
There are mathematical results that are critically needed by scientists, engineers, programmers, economists, statisticians, and many others
who are not principally (if at all) in the business of furthering mathematics itself. I would place in another category the results
valued by mathematicians that are not needed by anybody else. It is this latter category that is partly excluded from this system
by my not employing axioms. Incompleteness theorems and multiple orders of infinity are examples of mathematical results
that cannot be created within this system. It will be shown that the mathematical results that are important to non-mathematicians
can be obtained without axioms.
Some have questioned whether non-trivial theorems can be proved in a system formed without axioms. The goal is
not to create interesting or difficult challenges. It is to define methods of calculation that are useful. A system in
which the truth of inferences is often obvious is a good system, not a bad system.
An entirely empirical mathematics will accomplish yet another worthy goal: The experience of newcomers will be enhanced
but not be contradicted. This can only increase participation.
Chapter I
From Perceptions to Ordered Sets
Formal Definitions
By and large, dictionaries contain descriptions that remind us of meanings and associate some meaning with others. Sometimes,
but not always, a dictionary will state a formal definition. Providing only formal definitions is not the purpose of most
dictionaries. They provide a language reference for all words in common use whether they can be assigned a formal definition
or not.
A formal definition is a statement of the exact meaning of a term. Such a statement is logically constructed from the meanings
of previously known terms. Any statement that is offered as a definition must have all of the following properties:
A proposed definition must name the term being defined and provide a description of that term.
Here is an example: A
hydrocarbon (defined term) is
a compound consisting only of carbon and hydrogen (description).
Other sentence structures are possible, but both the defined term and its description must be somewhere in the
statement. A statement may consist of more than a single sentence.
The term being defined and the description of that term must be interchangeable in any context.
If the description of the defined term cannot truthfully be substituted for the defined term within any spoken or
written statement, then the definition is defective. It must be clear that the term and its description have exactly
the same meaning.
Other than the term being defined, the meaning of each term used in a definition must be known before
that definition is stated.
Each term used in a description must refer to only a single meaning.
In English, there are many terms that have meanings that depend upon their context, but in any mathematical
context, words and phrases must each be assigned a single meaning.
The term being defined must not appear in the description of that term.
Including the term being defined in the description of that term creates a so-called circular definition
(which is not a formal definition). The following may seem like an exception, but it's not:
A
good car is a car you like.
The trick here is that the term being defined is
good car and not
car.
Definition - A term is
well defined if and only if it is described by a statement that has all of the properties
required of a definition.
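Two of the properties listed above can be checked mechanically: that a description does not use the term it defines, and that every other term it uses is known before the definition is stated. The following Python sketch is only an illustration under stated assumptions. Each description is reduced to a hand-written list of the terms it uses, and the function name check_definitions and the sample entries are hypothetical, not part of this system.

```python
def check_definitions(undefined_terms, definitions):
    """Report problems in an ordered list of definitions.

    undefined_terms: terms understood without definition (assumed given).
    definitions:     list of (term, terms_used_in_description), in the
                     order in which the definitions are stated.
    """
    known = set(undefined_terms)
    problems = []
    for term, used in definitions:
        if term in used:
            # The defined term appears in its own description.
            problems.append(f"{term}: circular")
        for other in used:
            if other != term and other not in known:
                # A term is used before its meaning is established.
                problems.append(f"{term}: uses '{other}' before it is known")
        known.add(term)
    return problems


undefined = {"item", "contained by", "pair of things"}

in_order = [
    ("element", ["item", "contained by"]),
    ("set", ["element"]),
    ("collection", ["set", "element", "pair of things"]),
]
out_of_order = [
    ("collection", ["set", "pair of things"]),
    ("set", ["set"]),
]

print(check_definitions(undefined, in_order))
print(check_definitions(undefined, out_of_order))
```

The first call reports no problems. The second reports that "collection" uses "set" before it is known and that the sample definition of "set" is circular.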
Basic Perceptions and Terms
Sentences may contain more than a single statement. A sentence may be a statement but so may a clause. A
statement may be written as a clause in a compound sentence like this: "Birds fly and fish swim." This entire
sentence is a statement for our purposes in this document. It contains statements which could be written this way:
"Birds fly." "Fish swim." All of the sentences in quotation marks in this paragraph are statements.
Mathematics is a direct consequence of our most basic perceptions. Formal definitions cannot be the starting point for
math because they rely upon pre-existing terms. Consider these definitions:
Definition - A
term is a word or phrase.
Definition - A
name is a word, phrase, character, symbol, or mark that is used to refer to something other than itself.
If the terms used to define these terms must themselves be defined, we have no starting point. The process must start with
terms that we understand without defining them. To create mathematics, there must be
mathematical terms
at the
start that we understand precisely by some means other than by defining them. There are several important terms that we
come to understand through experience alone:
You can tell the difference between a
single thing and a
pair of things as surely as you can tell night from day
or red from blue. Children communicate those perceptions and reason with them as soon as they find out which words
other people use to refer to them. We all have an innate ability to perceive a
single thing, a
pair of things,
and
indefinitely many things. These terms are given meaning through experience in the world. Terms this
basic cannot be defined by more basic terms. They are called
undefined terms. They may also be called
empirical terms or
empirically defined terms. For example, the term
thing cannot be assigned a
scientific or mathematical definition. Attempts will always be circular. The terms
any,
some,
more than,
less than,
at least,
no more than,
at most and
few refer to perceptions,
not to definitions. These and related meanings are the starting point for our mathematical definitions. These terms
are associated with their corresponding experiences empirically.
The term
thing is intended to be thoroughly indefinite as to the nature or description of whatever is being referred
to.
Thing often refers to an object, but a list of things might sometimes be a list of terms, attributes or attitudes;
or a list of other things that are not typically thought of as objects. As used in this document, the terms
item and
thing will have exactly the same meaning.
Prior to counting and calculating, we learn some mathematically useful terms associated with the perceptions they refer to, but we also
come to understand certain fundamental features of the world as we perceive it:
To us, a pile of rocks is not merely a lot of rocks, it is an item named in the singular. We are capable of perceiving a rock pile
as a single item that is distinct from any of the things that it contains. In fact, we seem to insist on doing so. Along with
this kind of perception comes a compelling point of reasoning that I believe we all acknowledge as true: No such thing may
contain itself.
The term
contain is used in the same sense that it is used in the sentence "The bottle will contain a boat." In this
document, the terms
contained by,
contained in, and
in, are used as synonyms.
The passage of time is demonstrated by any succession of events. Event A may occur before, after or simultaneous with
event B. The meanings of the terms associated with time are exceedingly well known. Time is very often measured
and characterized with the help of mathematics; but traditionally, time has not been used to help create math itself. In my
opinion, this should change.
Mathematical structures are routinely created by precise definitions. There is no doubt that things like
a series of
events can also be precisely defined. By defining specific actions, operations and procedures, mathematical concepts
can be both created and illustrated. Nowadays, most of the procedures in this world are computer programs. I feel
that defining structures that go through time is no longer prohibited by custom or contraindicated by any compelling reason.
The term
theorem usually means a statement together with its proof, but this term is sometimes given the same meaning
as the term
conjecture, which means a statement that appears to be true, has not yet been proved or disproved, and is not
intended to be assumed true or used to help prove other statements. This meaning was used most famously in the case
of Fermat's Last Theorem. It was stated in 1637 but not proved until 1995. Statements made here
(other than historical ones) will not be called theorems unless they are stated together with their proof.
A theorem is generally regarded by mathematicians as a mathematical statement whose truth can be proved on the basis of a given
set of axioms or assumptions. Rarely does a published definition of the term allow for the possibility that a statement might
be proved on the basis of definitions and the empirical terms that precede them, but however a statement is proved, the statement
together with its proof is a theorem.
Anything may be known by more than a single name. Languages differ. Different academic disciplines and different
groups give their own chosen names to things. The name of a thing does not change the thing. This universal truth
comes in handy: It means that p and q can be shown to be equal to each other at any time after p and q are defined.
I introduce a changed definition of the term
equal:
Definition - A and B are
equal, or
A equals B if and only if A has the same meaning as B.
Names, words, phrases, symbols or descriptions that have the same
meaning are said to be
equal and said to be
equal to each other. Terms that have the same meaning are interchangeable in any context. For a pair of
things to be equal they need not have the same name or be referred to by the same term or symbol. If a pair of things are
said to be equal, it is their meanings that are the same, not their spelling or appearance. Numbers are not the only things
that can be equal. Here, there is no difference between the term
equal and the term
equivalent. The
following examples illustrate the use of the
equals sign - the symbol for equality:
A = a thing
The above statement is said this way: "A equals a thing." Here are some other examples:
B = an item
a thing = an item
a pair of things = a single thing together with another single thing
---------------
Any entity that appears on the left side of an equal sign can be substituted anywhere for the entity appearing on the
right side of the equal sign. The reverse is also true.
This leads to the easy proof of a statement that has been used as an axiom elsewhere:
Theorem - A = A.
Proof
Suppose B = A. (Anything may be known by more than a single name.)
A = A. (A may be substituted for B because A and B have the same meaning.)
---------------
Because terms that are equal are interchangeable, A = B implies B = A. Likewise, if A = B and B = C, then A = C.
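The three properties of equality stated above can be illustrated with a small Python sketch. It assumes, purely for illustration, that the meaning of each term can be represented by a token in a lookup table. The table and the function name equal are hypothetical and are not part of this system.

```python
from itertools import product

# Hypothetical table: each term is paired with a token standing for its meaning.
meaning = {
    "A": "THING",
    "B": "THING",
    "a thing": "THING",
    "an item": "THING",
    "a pair of things": "PAIR",
}


def equal(x: str, y: str) -> bool:
    """Two terms are equal if and only if they have the same meaning."""
    return meaning[x] == meaning[y]


terms = list(meaning)

# A = A for every term.
reflexive = all(equal(a, a) for a in terms)

# Whenever A = B, also B = A.
symmetric = all(
    equal(b, a) for a, b in product(terms, repeat=2) if equal(a, b)
)

# Whenever A = B and B = C, also A = C.
transitive = all(
    equal(a, c)
    for a, b, c in product(terms, repeat=3)
    if equal(a, b) and equal(b, c)
)

print(reflexive, symmetric, transitive)  # True True True
```

Because the comparison is made on meanings and not on spellings, "A" and "an item" come out equal although they share no characters.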
Throughout this document, the mathematical meaning of the word
but is exactly the same as the mathematical meaning
of the word
and. The term
but carries connotations like
notwithstanding previous considerations
and the like. It may be helpful to know that you can substitute
and for
but anywhere in a proof without
altering the mathematical meaning of its statements.
A Few Rules of Inference:
If the truth of statement P implies the truth of statement Q, and if
statement P is true, then statement Q is true.
If the truth of statement P implies the truth of statement Q, and statement
Q is false, then statement P is false.
If statement P and statement Q cannot both be true, and statement P is
true, then statement Q is false.
If P implies Q, and Q implies R, then P implies R.
If P implies Q, and Q implies P, then P is true if and only if Q is true.
If all things of type T necessarily have attribute A, and a specific thing R
is of type T, then R has attribute A.
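The first five rules above can be checked by listing every combination of true and false. The following Python sketch assumes that "P implies Q" is read as material implication (false only when P is true and Q is false). That reading is an assumption of the sketch, not a statement made in this document, and the helper names are hypothetical. The sixth rule, about things of type T, is not a rule about whole statements and is left out.

```python
from itertools import product


def implies(p: bool, q: bool) -> bool:
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


def holds_for_all(rule, arity: int) -> bool:
    """True if rule(...) is true for every assignment of truth values."""
    return all(rule(*values) for values in product([False, True], repeat=arity))


# Each rule is written in the form "premises imply conclusion".
rules = {
    "P implies Q, and P; therefore Q":
        (lambda p, q: implies(implies(p, q) and p, q), 2),
    "P implies Q, and not Q; therefore not P":
        (lambda p, q: implies(implies(p, q) and not q, not p), 2),
    "not both P and Q, and P; therefore not Q":
        (lambda p, q: implies((not (p and q)) and p, not q), 2),
    "P implies Q, and Q implies R; therefore P implies R":
        (lambda p, q, r: implies(implies(p, q) and implies(q, r), implies(p, r)), 3),
    "P implies Q, and Q implies P; therefore P if and only if Q":
        (lambda p, q: implies(implies(p, q) and implies(q, p), p == q), 2),
}

results = {name: holds_for_all(rule, arity) for name, (rule, arity) in rules.items()}
for name, ok in results.items():
    print(ok, "-", name)
```

Every rule holds in every case. By contrast, the invalid pattern "P implies Q, and Q; therefore P" fails the same check when P is false and Q is true.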
Consistency itself requires these rules of inference to be acknowledged as valid in every language, in every court of law, and in every
human field of endeavor. Indeed, a sincere objection to any one of them is evidence of mental defect. These rules were
not invented. No mathematical system disputes them. Why? Any stated rule of inference is merely a recognition
of the consistent use of our terms. Other cases may be recognized. It is not the rules that drive our reasoning. We
have adapted to a consistent world. Our reasoning acknowledges and enforces the rules by which we infer.
Other undefined observations, terms and perceptions will be discussed when we are closer to the place of their use in this document.
Containment
Definition - Item x is an
element if and only if x is
contained by another item, is
contained in another
item or is
in another item.
The term
collection is widely understood, but another similar term is needed - a term that escapes the over-learned
notion that a collection is a gathering or assemblage that necessarily involves at least a pair of things. Simply saying
that a collection may contain just a single item, or perhaps even nothing, generates a certain cognitive dissonance
brought about through the contradicting and unlearning of years of experience. It is always best to build new
knowledge upon existing knowledge without rearranging learned associations and meanings that are already well
established.
The term
box could be considered because a box could be empty, or it could contain a single thing, or many things,
or even another box; but a box is an object that might not easily be accepted as overlapping another box or containing
unique objects in common with another box. A term like
region,
space,
place or
area would
solve these problems, but they carry extraneous associations having to do with grid coordinates, directions, and the like. It
would be better to find a term that is abstract in the sense that it is not required to be an object or place - a term that is devoid of its
own peculiar associations. Such a term is
set.
The term
set is well known to mathematicians, but it may not be part of your experience. This problem is solved
by utilizing familiar meanings as properties of the new meaning.
Definition - S is a
set if and only if the following statements are true:
S does not contain itself.
S contains nothing, or S contains a single element, or S contains more than a single element.
The elements in S are each unique in S.
Each named element in S has a name that is unique among the elements named in S.
Elements in S may each have any description other than being S.
S may be defined as containing elements exclusively of a particular description.
---------------
Notice that any set contains no duplicates.
Notation - The notation for sets is the listing in curly brackets of whatever it contains:
Set X = {Jim's chair, the Washington monument, Mary's gold pen}.
The definition of the term
element permits a set to be contained in another set as an element. It is important to
realize that if element q is in set B, and set B is in set A, element q is not thereby required to be in set A. A less abstract
model that can be used to clarify this is the idea of a
list. If item q is in list B, and list B is in list A, there still may be
no entry in list A that is called q. If q is to be in list A, q must be listed in list A. If list B is listed in list A by
name, the items in list B are not automatically listed in list A.
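The list model above can be tried directly. This Python sketch uses frozenset as a stand-in for a set or list in the sense described here. That choice, and the names q, list_b, and list_a, are assumptions made only for illustration.

```python
# list_b contains the items q and r.
list_b = frozenset({"q", "r"})

# list_a contains the item p and list_b itself (by name, so to speak),
# not the items that list_b contains.
list_a = frozenset({"p", list_b})

print(list_b in list_a)  # True: list_b is an element in list_a
print("q" in list_b)     # True: q is an element in list_b
print("q" in list_a)     # False: q is not thereby an element in list_a
```

For q to be in list_a, q would have to be listed in list_a explicitly.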
We may now recognize that every collection is also a set:
Definition - A
collection is a set containing at least a pair of elements.
Of course, a set is not always a collection because some sets are empty, and some sets contain only a single thing.
Definition - An
empty set is a set that contains nothing.
Notation - S = {} shall indicate that set S is empty.
Definition - S is a
non-empty set if and only if S is a set, and S contains a single element or more than a single
element.
Notice that, since a set may contain nothing, an empty set is a set. An empty set contains nothing and does not
contain anything. Since no set may contain itself, any empty set may not contain itself. If it did, it would
contain something other than nothing. It would contain a set. This is different from the empty set of some
other systems.
Also unlike other systems, nothing prevents us from discovering that more than one uniquely defined set is empty. Because
any set S may be defined as having elements of any description (other than being S), we cannot prevent sometimes discovering
that S is empty and yet distinct from another empty set. A set is defined not merely by what it contains,
but also by what it
may contain according to its criterion of element inclusion. In the course of deducing what elements
there are in a given set, the answer is sometimes
none.
In this system, abstractness is not a value in itself. A bank account that is empty is not regarded as identical to a region of
space that contains nothing. No useful purpose is served by insisting that they are both identical to a unique empty
set. This language is consistent with the notion that math is to help us quantitatively analyze real things. Sets that
contain all of the same elements do not necessarily have the same meaning. Accordingly, when every element in set A is in
set B, and every element in set B is in set A, A and B are
coincident, but they are not necessarily
equal:
Definition - Set A and set B are
coincident if and only if every element in set A is in set B, and every element in
set B is in set A.
Definition - Set A and set B are
equal if and only if A and B have the same meaning.
Definition -
Set A is coincident with set B if and only if every element in set A is in set B, and every element in
set B is in set A.
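The difference between coincident and equal can be sketched in Python. The sketch assumes that the meaning of a set can be represented by its criterion of element inclusion written as a text label together with its elements. This is a simplification, since two differently worded labels could have the same meaning. The class CriterionSet and both sample sets are hypothetical.

```python
from typing import NamedTuple


class CriterionSet(NamedTuple):
    """A set paired with its criterion of element inclusion (illustrative model)."""
    criterion: str       # stands in for what the set may contain
    elements: frozenset  # what the set does contain


def coincident(a: CriterionSet, b: CriterionSet) -> bool:
    """Every element in a is in b, and every element in b is in a."""
    return a.elements == b.elements


def equal(a: CriterionSet, b: CriterionSet) -> bool:
    """Same meaning, modelled here as same criterion and same elements."""
    return a.criterion == b.criterion and a.elements == b.elements


empty_account = CriterionSet("dollars in a bank account", frozenset())
empty_region = CriterionSet("objects in a region of space", frozenset())

print(coincident(empty_account, empty_region))  # True
print(equal(empty_account, empty_region))       # False
```

Both sets are empty, so they are coincident, but their criteria differ, so in this model they are not equal.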
Definition - The
universal set U is the set that contains every element and every set other than itself.
This provides an ability to discuss elements that are not explicitly assigned to a named set. The phrase
for any
element v not in S is meaningful because, whatever else v may be, v is an element in U.
Definition - Set S is a
subset of set T if and only if each element in set S is also an element in set T, and S is not
coincident with T.
Definition - Set S is a
proper subset of set T if and only if S is a subset of T.
Definition - Set T is the
superset of set S if and only if S is a subset of T.
In this system, a set S may not be called a subset of set T if S is coincident with T. This means that in no sense may any set
contain itself. The term
proper subset may be used as an emphatic indication that the subset and the superset are not
coincident.
Definition - Set S is the
intersection of set T and set R if and only if each element x in S is an element in
T, and each element x in S is also an element in R, and S contains no other elements.
Definition - Set S is the
intersection of the sets in set K if and only if each element in set S is also in each
set in set K, and S contains no other elements.
Definition - X is the
union of all of the sets in set C if and only if X is a set that contains every element that
is in any set in set C, and X contains no other elements.
---------------
Definition -
Set B is the complement of set A in set C if and only if B contains all of the
elements in C that are not in A, and B does not contain A or C, and A does not contain B or C.
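The definitions of subset, intersection, union, and complement given above can be sketched with Python's frozenset. The function names are hypothetical, and frozenset is only a stand-in for a set in the sense of this system. Note that a subset here excludes the coincident case, so is_subset corresponds to what Python calls a proper subset.

```python
def is_subset(s: frozenset, t: frozenset) -> bool:
    """Each element in s is in t, and s is not coincident with t."""
    return s <= t and s != t


def intersection_of(k) -> frozenset:
    """Elements that are in each set in the non-empty set of sets k."""
    sets = list(k)
    result = set(sets[0])
    for s in sets[1:]:
        result &= s
    return frozenset(result)


def union_of(c) -> frozenset:
    """Every element that is in any set in the set of sets c."""
    result = set()
    for s in c:
        result |= s
    return frozenset(result)


def complement_in(a: frozenset, c: frozenset) -> frozenset:
    """Elements in c that are not in a, with the side conditions checked."""
    b = frozenset(c - a)
    # B must not contain A or C, and A must not contain B or C.
    if a in b or c in b or b in a or c in a:
        raise ValueError("side conditions of the definition are not met")
    return b


t = frozenset({1, 2, 3, 4})
r = frozenset({3, 4, 5})
s = frozenset({1, 2})

print(is_subset(s, t))  # True
print(is_subset(t, t))  # False: t is coincident with itself
print(intersection_of(frozenset({t, r})) == frozenset({3, 4}))       # True
print(union_of(frozenset({t, r})) == frozenset({1, 2, 3, 4, 5}))     # True
print(complement_in(s, t) == frozenset({3, 4}))                      # True
```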
Axioms
These days, an axiom is a statement that has these attributes:
It is assumed to be true.
It is incapable of being proved.
It seems useful in helping to prove other statements.
---------------
If axioms are used at all, axioms must be limited to those statements that we cannot prove. They are statements of logical
relationships that we must perceive with great certainty. If an axiom is ever found to be factually false, all statements that
rely upon the truth of that axiom are then recognized as not proved.
Consider these statements:
Anything may be known by more than a single name.
Nothing may contain itself.
If B is contained in A, then A is not contained in B.
If event A occurs before event B, then event B does not occur before event A.
If event A occurs after event B, then event B does not occur after event A.
If event A occurs before event B, and event B occurs before event C, then event A occurs before event C.
If event A occurs after event B, and event B occurs after event C, then event A occurs after event C.
If event A occurs before event B, then event B occurs after event A.
A = A.
If A = B, then B = A.
If A = B, and B = C, then A = C.
---------------
The statements listed above can be used as evidence in a proof, but they are not really axioms. Let's take a closer
look at each of them:
Anything may be known by more than a single name. If this is assumed to be true, p and q can be shown
to be equal to each other at any time after p and q are defined. This is essential, but also obvious from the universally
accepted meaning of the term
name. Its contradiction would be an impediment to proving statements, and it
may be helpful to remember that it is true, but no system disputes it. It is factually true prior to being assumed and
therefore it cannot be an axiom.
Nothing may contain itself. This statement is true of all things and it follows from the meaning of the term
contain. The statement can be used as evidence in a proof, but so can many other statements that are true by
definition or by the agreed meaning of undefined terms. It is not an axiom. The widespread understanding of
the term as it is used in sentences such as "the box does not contain the letter" is the only meaning of this term that is used in this
system. The set of all things that are not human contains itself and is therefore not a possible set. Like
the sound of one hand clapping, it owes its apparent meaning to the fact that language goes through time.
If B is contained in A, then A is not contained in B. This statement can be used to help prove other statements,
but it follows from the meaning of the term
contain.
If event A occurs before event B, then event B does not occur before event A.
If event A occurs after event B, then event B does not occur after event A.
If event A occurs before event B, and event B occurs before event C, then event A occurs before event C.
If event A occurs after event B, and event B occurs after event C, then event A occurs after event C.
If event A occurs before event B, then event B occurs after event A.
These statements introduce time and remind us that such statements can be used to help prove other statements, but they follow from
the meaning of the terms
before and
after. Since they are true by virtue of the meaning of the terms used to
express them, they are
actually true and cannot be assumed true. They are not axioms. No axiom would be
meaningful if it were a statement that could not be otherwise.
A = A.
If A = B, then B = A.
If A = B, and B = C, then A = C.
These statements follow from the meaning of the term
equal. A and B are equal, or A equals B if and only if A has
the same meaning as B. With this definition, proving all three of these statements of equality is straightforward, and so they
cannot be used as axioms.
There are no axioms. The statements considered above could not be otherwise. They are actually true and
not assumed. They are true by virtue of the definitions and the agreed meaning of the undefined terms that are used to state
them. They may be used as evidence in any proof. They are important in this system but may not be provable in other
systems.
In general, paradoxes are avoided by eliminating axioms. The argument is whether anything of value to math is left. I
argue that everything of real value is indeed left. I make the case by showing that there are other ways to obtain the results that
others have said require axioms.
It is the meaning of the terms we use that creates mathematics, physics, chemistry or any particular system. If the definitions
we make are formal and precise, they are born of other sensible definitions and also born of undefined terms (empirically defined
terms). There would be no formal definitions if no undefined terms were in use. We get our undefined terms from
perceptions. Fortunately, some of these perceptions are shared widely and recognized as true of the world. Only
these undefined terms are useful as undefined terms in a science.
There are many undefined terms that nobody bothers to mention in the development of technical jargon. "If," "we," "were," "to,"
"accept," "this," "as," and "true" are all undefined terms in most formal systems.
The long history of mistakes in math makes it necessary to point out statements that I would rather take for granted. "Nothing
may contain itself" and the others above are such statements.
Order
The development of numbers that follows requires the prior existence of
ordered sets (such as the alphabet). Therefore,
the prior existence of numbers, or of terms such as
less than or
greater than cannot be used to construct a definition
of the term
ordered set. Instead, this term is defined by starting with the undefined term
association.
An association can be prompted by information that we receive from the environment, or from a dream, or from a memory, or from
a thought. An association may be made inadvertently or intentionally in the process of thinking. We are free to
associate anything with anything else for any reason. Multiple meanings of this term are widely understood, but in order
to be used in formal definitions, a single meaning must be chosen. Consider this statement:
Lake Ontario is always associated with water, but
water is not always associated with Lake Ontario.
This is different from a pair of things being associated with each other symmetrically. Some associations only go in a
single direction, or tend toward a single direction. We need a meaning that is abstracted from any particular example, such
as the Lake Ontario example, in order to focus on the relationship and not the accidental characteristics. Here is the specific
meaning that will be used in this writing:
A is associated with B whenever it is explicitly stated that A is associated with B; and whenever A is associated with B, B is
not thereby associated with A. In such a case, B is associated with A if and only if that fact is also explicitly stated.
This is not a definition. It is a legitimate undefined term having a specific meaning with which we have extensive
experience. This is the meaning that we understood when we learned the alphabet. Few of us can quickly recite
the alphabet backwards. None of us learned the alphabet as a system of definitions leading to the establishment of an
ordered set. We simply associated each letter with the next in this precise and very limited sense.
Notation - A → B means that A is associated with B. B ← A also means that A is associated with
B. Up arrows and down arrows are used in the same way. If the arrow points away from A and toward B, then
it means that A is associated with B. (And it does
not tell us that B is associated with A.)
Definition - S is an
ordered set if and only if the following statements are true:
S is a set containing at least a pair of elements.
A single element j in S, and only j in S, is such that no other element in S is associated with it.
A single element k in S and only k in S is not associated with any element in S.
Each element p in S other than element k in S is associated with a single other element in S, and only that single other element in S.
---------------
Notation - In the case of an ordered set, the curly brackets that are normally used to enclose elements in a set are replaced
by parentheses. Unless otherwise specifically required, the starting element is written on the left and the ending element is
written on the right. Here is an example:
S = (t, u, v, w, x, y, z) = (t → u → v → w → x → y → z)
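The four conditions in the definition of an ordered set can be checked literally. In this Python sketch, an association "A is associated with B" is represented as the pair (A, B). That representation and the function name is_ordered_set are assumptions made for illustration. The sketch also assumes that no element is associated with itself, and it tests only the four conditions as written.

```python
def is_ordered_set(elements, arrows) -> bool:
    """Check the four listed conditions for an ordered set.

    elements: the elements in S.
    arrows:   a set of pairs (a, b), each meaning "a is associated with b".
    """
    elements = set(elements)
    # S is a set containing at least a pair of elements.
    if len(elements) < 2:
        return False
    # Only associations between elements in S are considered.
    arrows = {(a, b) for (a, b) in arrows if a in elements and b in elements}
    targets = {x: {b for (a, b) in arrows if a == x} for x in elements}
    sources = {x: {a for (a, b) in arrows if b == x} for x in elements}
    # Exactly one element j has no element associated with it.
    starts = [x for x in elements if not sources[x]]
    # Exactly one element k is not associated with any element.
    ends = [x for x in elements if not targets[x]]
    if len(starts) != 1 or len(ends) != 1:
        return False
    # Every element other than k is associated with exactly one element.
    return all(len(targets[x]) == 1 for x in elements if x != ends[0])


letters = "tuvwxyz"
elements = set(letters)
arrows = {(letters[i], letters[i + 1]) for i in range(len(letters) - 1)}

print(is_ordered_set(elements, arrows))                 # True
print(is_ordered_set(elements, arrows | {("u", "t")}))  # False: no starting element
print(is_ordered_set({"t"}, set()))                     # False: fewer than a pair
```

Adding the reverse association from u to t leaves no element without something associated with it, so the second condition fails.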
The term
next has a useful meaning in this context:
Definition - Element y is
next in order from x if and only if the following statements are true:
S is an ordered set.
Element x in S is associated with element y in S.
---------------
Definition - S is an
unordered set if and only if S is a set, and S is not an ordered set.
Definition - S is an
unordered pair if and only if S is an unordered set, and S contains exactly a pair of elements.
Definition - S is an
ordered pair if and only if S is an ordered set, and S contains exactly a pair of elements.
Definition - Element x in S is
the starting element of S if and only if the following statements are true:
S is an ordered set.
No element in S is associated with x.
---------------
Definition - Element z in S is
the ending element of S if and only if the following statements are true:
S is an ordered set.
Element z in S is not associated with any element in S.
---------------
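The terms next, starting element, and ending element can be illustrated by following associations one at a time. This Python sketch assumes that its input already is an ordered set and again represents "A is associated with B" as the pair (A, B). All function names are hypothetical.

```python
def starting_element(elements, arrows):
    """The single element that no element in S is associated with."""
    targets = {b for (a, b) in arrows}
    (start,) = [x for x in elements if x not in targets]
    return start


def ending_element(elements, arrows):
    """The single element that is not associated with any element in S."""
    sources = {a for (a, b) in arrows}
    (end,) = [x for x in elements if x not in sources]
    return end


def next_in_order(x, arrows):
    """The element that x is associated with, or None for the ending element."""
    for (a, b) in arrows:
        if a == x:
            return b
    return None


def in_order(elements, arrows):
    """List the elements by following associations from the starting element."""
    sequence = []
    current = starting_element(elements, arrows)
    # Stop at the ending element, or if an element were ever revisited.
    while current is not None and current not in sequence:
        sequence.append(current)
        current = next_in_order(current, arrows)
    return sequence


elements = {"t", "u", "v", "w"}
arrows = {("t", "u"), ("u", "v"), ("v", "w")}

print(starting_element(elements, arrows))  # t
print(ending_element(elements, arrows))    # w
print(next_in_order("u", arrows))          # v
print(in_order(elements, arrows))          # ['t', 'u', 'v', 'w']
```

No numbers and no notion of less than or greater than are used to recover the order. The walk relies only on which element each element is associated with.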
Definition - A is the
English alphabet if and only if A is an ordered set and A = (A, B, C, D, E, F, G, H, I, J, K, L, M,
N, O, P, Q, R, S, T, U, V, W, X, Y, Z) or A = (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, v, w, x, y, z).
Recall that set A and set B are coincident if and only if every element in set A is in set B, and every element in set B is in set A, whereas
set A and set B are equal if and only if A and B have the same meaning. If set A is an ordered set and set B is not, they are not
equal but they are coincident. It makes little sense to define A and B as equal if they differ in such a consequential characteristic.