updated
authorChristian Urban <urbanc@in.tum.de>
Wed, 24 Oct 2018 13:07:13 +0100
changeset 585 6ee22f196884
parent 584 529460073b24
child 586 451a95e1bc25
updated
handouts/ho06.pdf
handouts/ho06.tex
Binary file handouts/ho06.pdf has changed
--- a/handouts/ho06.tex	Wed Oct 24 12:49:23 2018 +0100
+++ b/handouts/ho06.tex	Wed Oct 24 13:07:13 2018 +0100
@@ -423,21 +423,39 @@
 because they do not fit with what the parser is supposed to parse.
 
 
-A slightly more complicated parser is \pcode{("a" || "b") ~ "c"}.
+A slightly more complicated parser is \pcode{("a" || "b") ~ "c"}
 which parses as first character either an \texttt{a} or \texttt{b}
 followed by a \texttt{c}. This parser produces the following outputs.
 
 \begin{center}
 \begin{tabular}{rcl}
 input strings & & output\medskip\\
-\texttt{\Grid{abc}} & $\rightarrow$ & $\left\{((\texttt{\Grid{a}}, \texttt{\Grid{b}}), \texttt{\Grid{c}})\right\}$\\
-\texttt{\Grid{bbc}} & $\rightarrow$ & $\left\{((\texttt{\Grid{b}}, \texttt{\Grid{b}}), \texttt{\Grid{c}})\right\}$\\
-\texttt{\Grid{aac}} & $\rightarrow$ & $\{\}$
+\texttt{\Grid{acde}} & $\rightarrow$ & $\left\{((\texttt{\Grid{a}}, \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
+\texttt{\Grid{bcde}} & $\rightarrow$ & $\left\{((\texttt{\Grid{b}}, \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
+\texttt{\Grid{abde}} & $\rightarrow$ & $\{\}$
 \end{tabular}
 \end{center}
 
-\noindent Two more examples: first consider the parser \pcode{('a' ~
-'a') ~ 'a'} and the input \pcode{aaaa}:
+\noindent
+Now consider the parser \pcode{("a" ~ "b") ~ "c"} which parses
+\texttt{a}, \texttt{b}, \texttt{c} in sequence. This parser produces
+the following outputs.
+
+\begin{center}
+\begin{tabular}{rcl}
+input strings & & output\medskip\\
+\texttt{\Grid{abcde}} & $\rightarrow$ & $\left\{(((\texttt{\Grid{a}},\texttt{\Grid{b}}), \texttt{\Grid{c}}), \texttt{\Grid{de}})\right\}$\\
+\texttt{\Grid{abde}} & $\rightarrow$ & $\{\}$\\
+\texttt{\Grid{bcde}} & $\rightarrow$ & $\{\}$
+\end{tabular}
+\end{center}
+
+
+\noindent The second and third example fail, because something is
+``missing'' in the sequence we are looking for. Also notice how the
+results nest with sequences: the parsed part is a nested pair of the
+form \pcode{((a, b), c)}. Two more examples: first consider the parser
+\pcode{("a" ~ "a") ~ "a"} and the input \pcode{aaaa}:
 
 \begin{center}
 \begin{tabular}{rcl}
@@ -447,11 +465,10 @@
 \end{tabular}
 \end{center}
 
-\noindent Notice how the results nest deeper and deeper as
-pairs (the last \pcode{a} is in the unprocessed part). To
-consume everything of this string we can use the parser
-\pcode{(('a' ~'a') ~ 'a') ~ 'a'}. Then the output is as
-follows:
+\noindent Notice how the results nests deeper and deeper as pairs (the
+last \pcode{a} is in the unprocessed part). To consume everything of
+this string we can use the parser \pcode{(("a" ~ "a") ~ "a") ~
+  "a"}. Then the output is as follows:
 
 \begin{center}
 \begin{tabular}{rcl}
@@ -463,7 +480,16 @@
 
 \noindent This is an instance where the parser consumed
 completely the input, meaning the unprocessed part is just the
-empty string.
+empty string. So if we called \pcode{parse_all} instead of \pcode{parse}
+we would get back the result
+
+\[
+\left\{(((\texttt{\Grid{a}}, \texttt{\Grid{a}}), \texttt{\Grid{a}}), \texttt{\Grid{a}})\right\}
+\]
+
+\noindent where the unprocessed (empty) parts have been stripped away
+from the pairs; everything where the second part was not empty has
+been thrown away as ultimately-unsuccessful-parses.
 
 
 Note carefully that constructing a parser such \pcode{'a' ||