afl-material: comparison handouts/ho02.tex

equal deleted inserted replaced

-:6e7c3db9023d
+:1be892087df2
 \[
 s \in L(r)
 \]
 \noindent
-Clearly we cannot use the function $L$ directly in order to solve this problem, because in general
+we can look at an algorithm to solve this problem.
-the set of strings $L$ returns is infinite (recall what $L(a^*)$ is). In such cases there is no algorithm
+Clearly we cannot use the function $L$ directly for this, because in general
-then can test exhaustively, whether a string is member of this set.
+the set of strings $L$ returns is infinite (recall what $L(a^*)$ is). In such cases there is no way we can implement
+an exhaustive test for whether a string is member of this set or not.
-The algorithm we define below consists of two parts. One is the function $nullable$ which takes a
+The algorithm we will define below consists of two parts. One is the function $nullable$ which takes a
 regular expression as argument and decides whether it can match the empty string (this means it returns a
 boolean). This can be easily defined recursively as follows:
 \begin{center}
 \begin{tabular}{@ {}l@ {\hspace{2mm}}c@ {\hspace{2mm}}l@ {}}
 \[
 nullable(r) \;\;\text{if and only if}\;\; ""\in L(r)
 \]
 \noindent
-On the left-hand side we have a function we can implement; on the right we have its specification.
+Note on the left-hand side we have a function we can implement; on the right we have its specification.
-The other function is calculating a \emph{derivative} of a regular expression. This is a function
+The other function of our matching algorithm calculates a \emph{derivative} of a regular expression. This is a function
 which will take a regular expression, say $r$, and a character, say $c$, as argument and return
-a new regular expression. Beware that the intuition behind this function is not so easy to grasp on first
+a new regular expression. Be careful that the intuition behind this function is not so easy to grasp on first
 reading. Essentially this function solves the following problem: if $r$ can match a string of the form
 $c\!::\!s$, what does the regular expression look like that can match just $s$. The definition of this
 function is as follows:
 \begin{center}
 Der\,c\,A\;\dn\;\{s\,|\,c\!::\!s \in A\}
 \]
 \noindent
 which essentially transforms a set of strings $A$ by filtering out all strings that do not start with $c$ and then
-strip off the $c$ from all the remaining strings. For example suppose $A = \{"f\!oo", "bar", "f\!rak"\}$ then
+strips off the $c$ from all the remaining strings. For example suppose $A = \{"f\!oo", "bar", "f\!rak"\}$ then
 \[
 Der\,f\,A = \{"oo", "rak"\}\quad,\quad
 Der\,b\,A = \{"ar"\}  \quad \text{and} \quad
 Der\,a\,A = \varnothing
 \]
 L(der\,c\,r) = Der\,c\,(L(r))
 \]
 \noindent
 This property clarifies what regular expression $der$ calculates, namely take the set of strings
-that $r$ can match ($L(r)$), filter out all strings not starting with $c$ and strip off the $c$ from the
+that $r$ can match (that is $L(r)$), filter out all strings not starting with $c$ and strip off the $c$ from the
 remaining strings---this is exactly the language that $der\,c\,r$ can match.
-For our matching algorithm we need to lift the notion of derivatives from characters to strings. This can be
+If we want to find out whether the string $"abc"$ is matched by the regular expression $r$
+then we can iteratively apply $Der$ as follows
+\begin{enumerate}
+\item $Der\,a\,(L(r))$
+\item $Der\,b\,(Der\,a\,(L(r)))$
+\item $Der\,c\,(Der\,b\,(Der\,a\,(L(r))))$
+\end{enumerate}
+\noindent
+In the last step we need to test whether the empty string is in the set. Our matching algorithm will work similarly,
+just using regular expression instead of sets. For this we need to lift the notion of derivatives from characters to strings. This can be
 done using the following function, taking a string and regular expression as input and a regular expression
 as output.
 \begin{center}
 \begin{tabular}{@ {}l@ {\hspace{2mm}}c@ {\hspace{2mm}}l@ {\hspace{-10mm}}l@ {}}

changeset 140	1be892087df2
parent 133	09efdf5cf07c
child 217	cd6066f1056a