afl-material: comparison handouts/ho06.tex


-:bb24d4e207b6
+:d40d7d7b85bc
 The general idea behind parser combinators is to transform the input
 into sets of pairs, like so
 \begin{center}
 $\underbrace{\text{list of tokens}}_{\text{input}}$
-$\Rightarrow$
+$\quad\Rightarrow\quad$
 $\underbrace{\text{set of (parsed part, unprocessed part)}}_{\text{output}}$
 \end{center}
 \noindent
 Given the extended effort we have spent implementing a lexer in order
 parser cannot recognise anything from the input at all, then parser
 combinators just return the empty set $\{\}$. This will indicate
 something ``went wrong''\ldots or more precisely, nothing could be
 parsed.
-Also important to note is that the type \texttt{T} for the processed
+Also important to note is that the output type \texttt{T} for the
-part is different from the input type \texttt{I} in the parse. In the
+processed part can potentially be different from the input type
-example above is just happens to be the same. The reason for the
+\texttt{I} in the parser. In the example above it just happens to be
-difference is that in general we are interested in
+the same. The reason for the difference is that in general we are
-transforming our input into something ``different''\ldots for example
+interested in transforming our input into something
-into a tree; or if we implement the grammar for arithmetic
+``different''\ldots for example into a tree; or if we implement the
-expressions, we might be interested in the actual integer number the
+grammar for arithmetic expressions, we might be interested in the
-arithmetic expression, say \texttt{1 + 2 * 3}, stands for. In this way
+actual integer number the arithmetic expression, say \texttt{1 + 2 *
-we can use parser combinators to implement relatively easily a
+3}, stands for. In this way we can use parser combinators to
-calculator, for instance.
+implement a calculator relatively easily, for instance (we shall do
+this later on).
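
The idea of returning sets of (parsed part, unprocessed part) pairs, with an output type that differs from the input type, can be illustrated with a minimal sketch. The names \texttt{Sketch}, \texttt{Parser} and \texttt{num} below are assumptions made for this illustration only and are not the definitions used in the handout; the input type is \texttt{String} and the output type is \texttt{Int}.

```scala
object Sketch {
  // Assumption for illustration: a parser is just a function from the
  // input I to a set of (parsed part, unprocessed part) pairs.
  type Parser[I, T] = I => Set[(T, I)]

  // Input type String, output type Int: consumes a non-empty prefix of
  // digits. (Very long digit sequences would overflow toInt; ignored here.)
  val num: Parser[String, Int] = in => {
    val (digits, rest) = in.span(_.isDigit)
    if (digits.isEmpty) Set.empty      // nothing could be parsed
    else Set((digits.toInt, rest))     // one way of parsing a prefix
  }

  def main(args: Array[String]): Unit = {
    println(num("123abc"))   // prints Set((123,abc))
    println(num("abc"))      // prints Set()
  }
}
```

The second call returns the empty set because no prefix of the input can be recognised as a number.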
-The main idea of parser combinators is that we can easily build parser
-combinators out of smaller components following very closely the
+The main driving force behind parser combinators is that we can easily
-structure of a grammar. In order to implement this in a
+build parser combinators out of smaller components following very
+closely the structure of a grammar. In order to implement this in a
 functional/object-oriented programming language, like Scala, we need
 to specify an abstract class for parser combinators. In the abstract
 class we specify that \texttt{I} is the \emph{input type} of the
 parser combinator and that \texttt{T} is the \emph{output type}.  This
 implies that the function \texttt{parse} takes an argument of type
 ("(" ~ E ~ ")") ==> { case ((x, y), z) => y } | NumParserInt
 \end{lstlisting}
 \end{center}
 \noindent
-Let us try out on some examples:
+Let us try out some examples:
 \begin{center}
 \begin{tabular}{rcl}
 input strings & & output of \pcode{parse_all}\medskip\\
 \texttt{\Grid{1+2+3}} & $\rightarrow$ & \texttt{Set(6)}\\
 \texttt{\Grid{4*2+3}} & $\rightarrow$ & \texttt{Set(11)}\\
 \texttt{\Grid{4*(2+3)}} & $\rightarrow$ & \texttt{Set(20)}\\
+\texttt{\Grid{(4)*((2+3))}} & $\rightarrow$ & \texttt{Set(20)}\\
 \texttt{\Grid{4/2+3}} & $\rightarrow$ & \texttt{Set()}\\
 \texttt{\Grid{1\VS +\VS 2\VS +\VS 3}} & $\rightarrow$ & \texttt{Set()}\\
 \end{tabular}
 \end{center}
 \noindent
-All examples should be quite self-explanatory: the last two do not
+Note that we call \pcode{parse_all}, not \pcode{parse}.  The examples
-produce any result because our parser did not define what to do in
+should be quite self-explanatory. The last two examples do not produce
-case of division (could be easily added) but also has no idea what to
+any integer result because our parser does not define what to do in
-do with whitescpaces. To deal with them is the task of the lexer. We
+case of division (this could easily be added), and it also has no idea what to
-can deal with them inside the grammar, but that would render many
+do with whitespaces. Dealing with them is the task of the lexer! Yes,
-grammars becoming unintelligible.
+we can deal with them inside the grammar, but that would render many
+grammars unintelligible, including this one.\footnote{If you
+think an easy solution is to extend the notion of what a number
+should be, then think again---you still would have to deal with
+cases like \texttt{\Grid{(\VS (\VS 2+3)\VS )}}. Just imagine you have
+a grammar for a full-blown language where there are numerous such cases.}
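
The behaviour in the table, and the point about whitespace, can be reproduced with a small self-contained sketch. Everything in it is an assumption made for illustration: the helper names \texttt{seq}, \texttt{alt}, \texttt{map}, \texttt{chr} and \texttt{parseAll} are hypothetical stand-ins for the handout's \texttt{\textasciitilde}, \texttt{|}, \texttt{==>} and \texttt{parse\_all}, the grammar only covers addition, multiplication and parentheses, and a simple \texttt{filterNot} call plays the role that a real lexer would have.

```scala
object CalcSketch {
  // A parser maps a string to a set of (result, unprocessed rest) pairs.
  abstract class Parser[A] {
    def parse(in: String): Set[(A, String)]
    // Keep only the results where the whole input was consumed.
    def parseAll(in: String): Set[A] =
      for ((v, rest) <- parse(in) if rest.isEmpty) yield v
  }

  // Sequencing: run p, then run q on what p left over.
  def seq[A, B](p: => Parser[A], q: => Parser[B]): Parser[(A, B)] =
    new Parser[(A, B)] {
      def parse(in: String): Set[((A, B), String)] =
        for ((a, r1) <- p.parse(in); (b, r2) <- q.parse(r1)) yield ((a, b), r2)
    }

  // Alternative: collect the results of both parsers.
  def alt[A](p: => Parser[A], q: => Parser[A]): Parser[A] =
    new Parser[A] {
      def parse(in: String): Set[(A, String)] = p.parse(in) ++ q.parse(in)
    }

  // Semantic action: transform the parsed part with f.
  def map[A, B](p: => Parser[A])(f: A => B): Parser[B] =
    new Parser[B] {
      def parse(in: String): Set[(B, String)] =
        for ((a, rest) <- p.parse(in)) yield (f(a), rest)
    }

  // Recognises exactly the character c at the start of the input.
  def chr(c: Char): Parser[Char] = new Parser[Char] {
    def parse(in: String): Set[(Char, String)] =
      if (in.nonEmpty && in.head == c) Set((c, in.tail)) else Set.empty
  }

  // Recognises a non-empty prefix of digits as an Int.
  val num: Parser[Int] = new Parser[Int] {
    def parse(in: String): Set[(Int, String)] = {
      val (ds, rest) = in.span(_.isDigit)
      if (ds.isEmpty) Set.empty else Set((ds.toInt, rest))
    }
  }

  // E ::= T + E | T      T ::= F * T | F      F ::= ( E ) | num
  lazy val E: Parser[Int] =
    alt(map(seq(T, seq(chr('+'), E))) { case (x, (_, z)) => x + z }, T)
  lazy val T: Parser[Int] =
    alt(map(seq(F, seq(chr('*'), T))) { case (x, (_, z)) => x * z }, F)
  lazy val F: Parser[Int] =
    alt(map(seq(chr('('), seq(E, chr(')')))) { case (_, (y, _)) => y }, num)

  def main(args: Array[String]): Unit = {
    println(E.parseAll("1+2+3"))       // prints Set(6)
    println(E.parseAll("4*(2+3)"))     // prints Set(20)
    println(E.parseAll("1 + 2 + 3"))   // prints Set()
    // Stand-in for a lexer: drop whitespace before parsing.
    println(E.parseAll("1 + 2 + 3".filterNot(_.isWhitespace)))  // prints Set(6)
  }
}
```

The grammar itself never mentions whitespace; removing it in a separate step beforehand is what lets the rules for \texttt{E}, \texttt{T} and \texttt{F} stay this short.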
 \end{document}
 %%% Local Variables:
 %%% mode: latex

changeset 594	d40d7d7b85bc
parent 593	bb24d4e207b6
child 595	4bf0096bc06b