afl-material: comparison handouts/ho02.tex


-:617c3b0e0a81
+:499007a7bce2
 \usepackage{../graphics}
 \usepackage{../data}
 \begin{document}
-\fnote{\copyright{} Christian Urban, King's College London, 2014, 2015, 2016, 2017}
+\fnote{\copyright{} Christian Urban, King's College London, 2014, 2015, 2016, 2017, 2018}
 \section*{Handout 2 (Regular Expression Matching)}
 This lecture is about implementing a more efficient regular expression
 original set ($L(r_1)$), then $\textit{Der}\,c\,(\textit{Der}\,b\,(\textit{Der}\,a\,(L(r_1))))$
 must contain the empty string. If not, then $abc$ was not in the
 language we started with.
 Our matching algorithm using $\textit{der}$ and $\textit{nullable}$ works
-similarly, just using regular expression instead of sets. In order to
+similarly, just using regular expressions instead of sets. In order to
 define our algorithm we need to extend the notion of derivatives from single
 characters to strings. This can be done using the following
 function, taking a string and a regular expression as input and
 a regular expression as output.
 the time from the original string until the string is exhausted.
 Having $\textit{ders}$ in place, we can finally define our matching
 algorithm:
 \[
-\textit{matches}\,s\,r \dn \textit{nullable}(\textit{ders}\,s\,r)
+\textit{matches}\,r\,s \dn \textit{nullable}(\textit{ders}\,s\,r)
 \]
 \noindent
 and we can claim that
 \[
-\textit{matches}\,s\,r\quad\text{if and only if}\quad s\in L(r)
+\textit{matches}\,r\,s\quad\text{if and only if}\quad s\in L(r)
 \]
 \noindent holds, which means our algorithm satisfies the
 specification. Of course we can claim many things\ldots
 whether the claim holds any water is a different question,
 \noindent I leave it to you to contemplate whether such a
 simplification can have any impact on the correctness of our algorithm
 (will it change any answers?). Figure~\ref{scala2} gives a
 simplification function that recursively traverses a regular
 expression and simplifies it according to the rules given at the
-beginning. There are only rules for $+$, $\cdot$ and $n$-times (the
+beginning. There are only rules for $+$ and $\cdot$. There is
-latter because we added it in the second version of our
+no simplification rule for a star, because
-matcher). There is no simplification rule for a star, because
 empirical data and also a little thought showed that simplifying under
 a star is a waste of computation time. The simplification function
 will be called after every derivation.  This additional step removes
 all the ``junk'' the derivative function introduced. Does this improve
 the speed? You bet!!
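The matcher with simplification after every derivative step can be sketched in Scala as follows. This is a minimal sketch under stated assumptions, not the code of Figure~\ref{scala2}: the constructor names (\texttt{ZERO}, \texttt{ONE}, \texttt{CHAR}, \texttt{ALT}, \texttt{SEQ}, \texttt{STAR}) and the wrapper object \texttt{Matcher} are hypothetical, \texttt{nullable} and \texttt{der} follow the standard derivative definitions, and the simplification rules are assumed to be the usual ones for $+$ and $\cdot$ (dropping $\mathbf{0}$ and $\mathbf{1}$ where they are redundant and collapsing $r + r$ to $r$).

```scala
import scala.annotation.tailrec

object Matcher {
  // Assumed datatype for regular expressions; names are illustrative only.
  sealed trait Rexp
  case object ZERO extends Rexp                     // matches no string
  case object ONE extends Rexp                      // matches only the empty string
  case class CHAR(c: Char) extends Rexp
  case class ALT(r1: Rexp, r2: Rexp) extends Rexp   // r1 + r2
  case class SEQ(r1: Rexp, r2: Rexp) extends Rexp   // r1 . r2
  case class STAR(r: Rexp) extends Rexp

  // Can r match the empty string?
  def nullable(r: Rexp): Boolean = r match {
    case ZERO        => false
    case ONE         => true
    case CHAR(_)     => false
    case ALT(r1, r2) => nullable(r1) || nullable(r2)
    case SEQ(r1, r2) => nullable(r1) && nullable(r2)
    case STAR(_)     => true
  }

  // Derivative of r with respect to the character c.
  def der(c: Char, r: Rexp): Rexp = r match {
    case ZERO        => ZERO
    case ONE         => ZERO
    case CHAR(d)     => if (c == d) ONE else ZERO
    case ALT(r1, r2) => ALT(der(c, r1), der(c, r2))
    case SEQ(r1, r2) =>
      if (nullable(r1)) ALT(SEQ(der(c, r1), r2), der(c, r2))
      else SEQ(der(c, r1), r2)
    case STAR(r1)    => SEQ(der(c, r1), STAR(r1))
  }

  // Simplify only under + and . ; a star is returned unchanged.
  def simp(r: Rexp): Rexp = r match {
    case ALT(r1, r2) => (simp(r1), simp(r2)) match {
      case (ZERO, r2s) => r2s
      case (r1s, ZERO) => r1s
      case (r1s, r2s)  => if (r1s == r2s) r1s else ALT(r1s, r2s)
    }
    case SEQ(r1, r2) => (simp(r1), simp(r2)) match {
      case (ZERO, _)   => ZERO
      case (_, ZERO)   => ZERO
      case (ONE, r2s)  => r2s
      case (r1s, ONE)  => r1s
      case (r1s, r2s)  => SEQ(r1s, r2s)
    }
    case other => other
  }

  // Derivative with respect to a string, simplifying after every step.
  @tailrec
  def ders(s: List[Char], r: Rexp): Rexp = s match {
    case Nil     => r
    case c :: cs => ders(cs, simp(der(c, r)))
  }

  // Argument order follows the definition above: matches r s.
  def matches(r: Rexp, s: String): Boolean = nullable(ders(s.toList, r))
}
```

With these definitions, \texttt{Matcher.matches(SEQ(STAR(STAR(CHAR('a'))), CHAR('b')), "a" * 28)} evaluates to \texttt{false}. In this sketch the intermediate regular expression stops growing after the first derivative, which is the effect the simplification step is meant to have.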
 a^{\{n\}}$.  We need a third of this time to do the same with strings
 up to 11,000 \texttt{a}s.  Similarly, Java 8 and Python needed 30
 seconds to find out the regular expression $(a^*)^* \cdot b$ does not
 match the string of 28 \texttt{a}s. In Java 9 and later this has been
 cranked up to 39,000 \texttt{a}s, but we can do the same in the same
-amount of time for strings composed of nearly 6,000,000 \texttt{a}s:
+amount of time for strings composed of nearly 6,000,000 \texttt{a}s.
+This is shown in the following plot.
 \begin{center}
 \begin{tikzpicture}
 \begin{axis}[

changeset 571	499007a7bce2
parent 566	b153c04834eb
child 618	f4818c95a32e