afl-material: comparison handouts/ho06.tex

equal deleted inserted replaced

-:529460073b24
+:6ee22f196884
 \emph{not} a simple pair \texttt{(ab, cde)} as one might errorneously
 expects.  The parser returns the ampty set in the other examples,
 because they do not fit with what the parser is supposed to parse.
-A slightly more complicated parser is \pcode{("a" || "b") ~ "c"}.
+A slightly more complicated parser is \pcode{("a" || "b") ~ "c"}
 which parses as first character either an \texttt{a} or \texttt{b}
 followed by a \texttt{c}. This parser produces the following outputs.
 \begin{center}
 \begin{tabular}{rcl}
 input strings & & output\medskip\\
-\texttt{\Grid{abc}} & $\rightarrow$ & $\left\{((\texttt{\Grid{a}}, \texttt{\Grid{b}}), \texttt{\Grid{c}})\right\}$\\
+\texttt{\Grid{acde}} & $\rightarrow$ & $\left\{((\texttt{\Grid{a}}, \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
-\texttt{\Grid{bbc}} & $\rightarrow$ & $\left\{((\texttt{\Grid{b}}, \texttt{\Grid{b}}), \texttt{\Grid{c}})\right\}$\\
+\texttt{\Grid{bcde}} & $\rightarrow$ & $\left\{((\texttt{\Grid{b}}, \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
-\texttt{\Grid{aac}} & $\rightarrow$ & $\{\}$
+\texttt{\Grid{abde}} & $\rightarrow$ & $\{\}$
 \end{tabular}
 \end{center}
-\noindent Two more examples: first consider the parser \pcode{('a' ~
+\noindent
-'a') ~ 'a'} and the input \pcode{aaaa}:
+Now consider the parser \pcode{("a" ~ "b") ~ "c"} which parses
+\texttt{a}, \texttt{b}, \texttt{c} in sequence. This parser produces
+the following outputs.
+\begin{center}
+\begin{tabular}{rcl}
+input strings & & output\medskip\\
+\texttt{\Grid{abcde}} & $\rightarrow$ & $\left\{(((\texttt{\Grid{a}},\texttt{\Grid{b}}), \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
+\texttt{\Grid{abde}} & $\rightarrow$ & $\{\}$\\
+\texttt{\Grid{bcde}} & $\rightarrow$ & $\{\}$
+\end{tabular}
+\end{center}
+\noindent The second and third example fail, because something is
+``missing'' in the sequence we are looking for. Also notice how the
+results nest with sequences: the parsed part is a nested pair of the
+form \pcode{((a, b), c)}. Two more examples: first consider the parser
+\pcode{("a" ~ "a") ~ "a"} and the input \pcode{aaaa}:
 \begin{center}
 \begin{tabular}{rcl}
 input string & & output\medskip\\
 \texttt{\Grid{aaaa}} & $\rightarrow$ &
 $\left\{(((\texttt{\Grid{a}}, \texttt{\Grid{a}}), \texttt{\Grid{a}}), \texttt{\Grid{a}})\right\}$\\
 \end{tabular}
 \end{center}
-\noindent Notice how the results nest deeper and deeper as
+\noindent Notice how the results nests deeper and deeper as pairs (the
-pairs (the last \pcode{a} is in the unprocessed part). To
+last \pcode{a} is in the unprocessed part). To consume everything of
-consume everything of this string we can use the parser
+this string we can use the parser \pcode{(("a" ~ "a") ~ "a") ~
-\pcode{(('a' ~'a') ~ 'a') ~ 'a'}. Then the output is as
+"a"}. Then the output is as follows:
-follows:
 \begin{center}
 \begin{tabular}{rcl}
 input string & & output\medskip\\
 \texttt{\Grid{aaaa}} & $\rightarrow$ &
 \end{tabular}
 \end{center}
 \noindent This is an instance where the parser consumed
 completely the input, meaning the unprocessed part is just the
-empty string.
+empty string. So if we called \pcode{parse_all} instead of \pcode{parse}
+we would get back the result
+\[
+\left\{(((\texttt{\Grid{a}}, \texttt{\Grid{a}}), \texttt{\Grid{a}}), \texttt{\Grid{a}})\right\}
+\]
+\noindent where the unprocessed (empty) parts have been stripped away
+from the pairs; everything where the second part was not empty has
+been thrown away as ultimately-unsuccessful-parses.
 Note carefully that constructing a parser such \pcode{'a' ||
 ('a' ~ 'b')} will result in a typing error. The first
 parser has as output type a single character (recall the type

changeset 585	6ee22f196884
parent 584	529460073b24
child 586	451a95e1bc25