afl-material: comparison handouts/notation.tex

equal deleted inserted replaced

-:3b9496db3fb9
+:3390e863d796
 \begin{document}
 \section*{A Crash-Course on Notation}
-There are innumerable books available about compilers,
+There are innumerable books available about compilers, automata theory
-automata theory and formal languages. Unfortunately, they
+and formal languages. Unfortunately, they often use their own
-often use their own notational conventions and their own
+notational conventions and their own symbols. This handout is meant to
-symbols. This handout is meant to clarify some of the notation
+clarify some of the notation I will use. I apologise in advance that
-I will use. I appologise in advance that sometimes I will be a
+sometimes I will be a bit fuzzy\ldots the problem is that often we
-bit fuzzy\ldots the problem is that often we want to have
+want to have convenience in our mathematical definitions (to make them
-convenience in our mathematical definitions (to make them
+readable and understandable), but other times we need pedantic
-readable and understandable), but sometimes we need precision
+precision for actual programs.
-for actual programs.
 \subsubsection*{Characters and Strings}
 The most important concept in this module are strings. Strings
 are composed of \defn{characters}. While characters are surely
 characters stand for, except we do care about the fact that
 for example the character $a$ is not equal to $b$ and so on.
 Why do I make this distinction? Because we often need to
 define functions using variables ranging over characters. We
 need to somehow say this is a variable, say $c$, ranging over
-characters, while this is the atual character \pcode{c}.
+characters, while this is the actual character \pcode{c}.
 An \defn{alphabet} is a (non-empty) finite set of characters.
 Often the letter $\Sigma$ is used to refer to an alphabet. For
 example the ASCII characters \pcode{a} to \pcode{z} form an
 alphabet. The digits $0$ to $9$ are another alphabet. The
 contain only the letters $a$ and $b$, for example. In this
 case we will state that the alphabet is the set $\{a, b\}$.
 \defn{Strings} are lists of characters. Unfortunately, there
 are many ways how we can write down strings. In programming
-languages, they are usually written as \dq{$hello$} where the
+languages, they are usually written as \dq{\texttt{hello}} where the
 double quotes indicate that we are dealing with a string. In
-programming languages, such as Scala, strings have a special
+typed programming languages, such as Scala, strings have a special
 type---namely \pcode{String} which is different from the type
-for lists of chatacters. This is because strings can be
+for lists of characters. This is because strings can be
-efficiently represented in memory, unlike general lists. Since
+efficiently represented in memory, unlike lists. Since
-\code{String} and the type of lists of characters,
+\code{String} and the type of lists of characters
-\code{List[Char]} are not the same, we need to explicitly
+(\code{List[Char]}) are not the same, we need to explicitly
 coerce elements between the two types, for example
 \begin{lstlisting}[numbers=none]
 scala> "abc".toList
 res01: List[Char] = List(a, b, c)
 \end{lstlisting}
-\noindent Since in our (mathematical) definitions we regard
+\noindent
-strings as lists of characters, we will also write
+However, we do not want to do this kind of explicit coercion in our
-\dq{$hello$} as
+pencil-and-paper, everyday arguments.  So in our (mathematical)
+definitions we regard strings as lists of characters, we will also
+write \dq{$hello$} as
 \[
 [\text{\it h, e, l, l, o}] \qquad\text{or simply}\qquad \textit{hello}
 \]
 Two strings, say $s_1$ and $s_2$, can be \defn{concatenated},
 which we write as $s_1 @ s_2$. If we regard $s_1$ and $s_2$ as
 lists of characters, then $@$ is the list-append function.
 Suppose we are given two strings \dq{\textit{foo}} and
-\dq{\textit{bar}}, then their concatenation, writen
+\dq{\textit{bar}}, then their concatenation, written
 \dq{\textit{foo}} $@$ \dq{\textit{bar}}, gives
 \dq{\textit{foobar}}. But as said above, we will often
 simplify our life and just drop the double quotes whenever it
 is clear we are talking about strings, So we will often just
 write \textit{foo}, \textit{bar}, \textit{foobar} or
 \textit{foo $@$ bar}.
-Occasionally we will use the notation $a^n$ for strings, which
+Occasionally we will use the notation $a^n$ for strings, which stands
-stands for the string of $n$ repeated $a$s. So $a^{n}b^{n}$ is
+for the string of $n$ repeated $a$s. So $a^{n}b^{n}$ is a string that
-a string that has as many $a$s by as many $b$s.
+has some number of $a$s followed by the same number of $b$s.  A simple
+property of string concatenation is \emph{associativity}, meaning
-A simple property of string concatenation is
-\emph{associativity}, meaning
 \[(s_1 @ s_2) @ s_3 = s_1 @ (s_2 @ s_3)\]
 \noindent are always equal strings. The empty string behaves
 like a \emph{unit element}, therefore
 \[
 \{1, 2, 3\}
 \]
-\noindent The notation $\in$ means \emph{element of}, so $1
+\noindent The notation $\in$ means \emph{element of}, so $1 \in \{1,
-\in \{1, 2, 3\}$ is true and $4 \in \{1, 2, 3\}$ is false.
+2, 3\}$ is true and $4 \in \{1, 2, 3\}$ is false.  Note that the
-Sets can potentially have infinitely many elements. For
+\emph{list} $[1, 2, 3]$ is something different from the \emph{set}
-example the set of all natural numbers $\{0, 1, 2, \ldots\}$
+$\{1, 2, 3\}$: in the former we care about the order and potentially
-is infinite. This set is often also abbreviated as
+several occurrences of a number; while with the latter we do not.
-$\mathbb{N}$. We can define sets by giving all elements, for
+Also sets can potentially have infinitely many elements, whereas lists
-example $\{0, 1\}$ for the set containing just $0$ and $1$,
+cannot. For example
-but also by \defn{set comprehensions}. For example the set of
+the set of all natural numbers $\{0, 1, 2, \ldots\}$ is infinite. This
-all even natural numbers can be defined as
+set is often also abbreviated as $\mathbb{N}$. Lists can be very large, but they cannot contain infinitely many elements.
+We can define sets by giving all elements, for example $\{0, 1\}$ for
+the set containing just $0$ and $1$, but also by \defn{set
+comprehensions}. For example the set of all even natural numbers can
+be defined as
 \[
 \{n\;|\;n\in\mathbb{N} \wedge n\;\text{is even}\}
 \]
 \[
 \{n\;|\; n^2 < 9 \wedge n \in \mathbb{N}\}
 \]
-\noindent Notice that set comprehensions could be used
+\noindent Can you see why this defines the set $\{0, 1, 2\}$?  Notice
-to define set union, intersection and difference:
+that set comprehensions could be used to define set union,
+intersection and difference:
 \begin{eqnarray*}
 A \cup B & \dn & \{x\;|\; x \in A \vee x \in B\}\\
 A \cap B & \dn & \{x\;|\; x \in A \wedge x \in B\}\\
 A \backslash B & \dn & \{x\;|\; x \in A \wedge x \not\in B\}
 \ldots
 \]
 \noindent but using the big union notation is more concise.
-While this stuff about sete might all look trivial or even
+As an aside: While this stuff about sets might all look trivial or even needlessly
-needlessly pedantic, \emph{Nature} is never simple. If you
+pedantic, \emph{Nature} is never simple. If you want to be amazed how
-want to be amazed how complicated sets can get, watch out for
+complicated sets can get, watch out for the last lecture just before
-the last lecture just before Christmas where I want you to
+Christmas where I want to convince you of the fact that some sets are
-convince you of the fact that some sets are more infinite than
+more infinite than others. Yes, you read correctly, there can be sets
-others. Actually that will be a fact that is very relevant to
+that are ``more infinite'' then others. If you think this is obvious:
-the material of this module.
+say you have the infinite set $\mathbb{N}\backslash\{0\} = \{1, 2, 3, 4, \ldots\}$ which is all
+the natural numbers except $0$, and then compare it to the set
+$\{0, 1, 2, 3, 4, \ldots\}$ which contains the $0$. If you think,
+the second must be more infinite\ldots{} well, then think again. Because the two
+infinite sets
+\begin{center}
+$\{1, 2, 3, 4, \ldots\}$ and
+$\{0, 1, 2, 3, 4, \ldots\}$
+\end{center}
+\noindent
+contain actually the same number of elements. Does this make sense?
+Though this might all look strange this about infinite sets will be a
+topic that is very relevant to the material of this module. It tells
+us what we can compute with a computer (actually algorithm) and what
+we cannot. But during the first 9 lectures we can go by without this
+``weird'' stuff.
 Another important notion in this module are \defn{languages}, which
 are sets of strings. One of the main goals for us will be how to
 (formally) specify languages and to find out whether a string
 is in a language or not.\footnote{You might wish to ponder
 \noindent Note the difference in the last two lines: the empty
 set behaves like $0$ for multiplication and the set $\{[]\}$
 like $1$ for multiplication ($n * 1 = n$ and $n * 0 = 0$).
-Following the language concatenation, we can define a
+Using the operation of language concatenation, we can define a
 \defn{language power} operation as follows:
 \begin{eqnarray*}
 A^0     & \dn & \{[]\}\\
 A^{n+1} & \dn & A \,@\, A^n
 \noindent This definition is by recursion on natural numbers.
 Note carefully that the zero-case is not defined as the empty
 set, but the set containing the empty string. So no matter
 what the set $A$ is, $A^0$ will always be $\{[]\}$. (There is
 another hint about a connection between the $@$-operation and
-multiplication: How is $x^n$ defined recursively and what is
+multiplication: How is $x^n$ defined in mathematics and what is
 $x^0$?)
 Next we can define the \defn{star operation} for languages:
 $A\star$ is the union of all powers of $A$, or short
 \noindent This star operation is often also called
 \emph{Kleene-star}. Unfolding the definition in \eqref{star}
 gives
 \[
-A^0 \cup A^1 \cup A^2 \cup A^3 \cup \ldots
+A\star \dn A^0 \cup A^1 \cup A^2 \cup A^3 \cup \ldots
 \]
 \noindent
 which is equal to
 \[
-\{[]\} \,\cup\, A \,\cup\, A @ A \,\cup\, A @ A @ A \,\cup\, \ldots
+A\star \dn \{[]\} \,\cup\, A \,\cup\, A @ A \,\cup\, A @ A @ A \,\cup\, \ldots
 \]
 \noindent We can see that the empty string is always in $A\star$,
 no matter what $A$ is. This is because $[] \in A^0$. To make
 sure you understand these definitions, I leave you to answer
 what $\{[]\}\star$ and $\varnothing\star$ are?
 Recall that an alphabet is often referred to by the letter
 $\Sigma$. We can now write for the set of \emph{all} strings
-over this alphabet $\Sigma\star$. In doing so we also include the
+over this alphabet as $\Sigma\star$. In doing so we also include the
 empty string as a possible string over $\Sigma$. So if $\Sigma
 = \{a, b\}$, then $\Sigma\star$ is
 \[
 \{[], a, b, aa, ab, ba, bb, aaa, aab, aba, abb, baa, bab, \ldots\}

changeset 504	3390e863d796
parent 502	81d20ccb8801
child 505	ca51b1d4dae5