cst_tests: comparison etnms/etnms.tex

equal deleted inserted replaced

-:788f4aa28bc5
+:1260b383ae2c
 formally established their correctness. We have also not yet looked
 at extended regular expressions, such as bounded repetitions,
 negation and back-references.
 \end{abstract}
+\section{Introduction}
+While we believe derivatives of regular expressions, written
+$r\backslash s$, are a beautiful concept (in terms of ease of
+implementing them in functional programming languages and in terms of
+reasoning about them formally), they have one major drawback: every
+derivative step can make regular expressions grow drastically in
+size. This in turn has negative effect on the runtime of the
+corresponding lexing algorithms. Consider for example the regular
+expression $(a+aa)^*$ and the short string $aaaaaaaaaaaa$. The
+corresponding derivative contains already 8668 nodes where we assume
+the derivative is given as a tree. The reason for the poor runtime of
+the derivative-based lexing algorithms is that they need to traverse
+such trees over and over again. The solution is to find a complete set
+of simplification rules that keep the sizes of derivatives uniformly
+small.
 \section{Recapitulation of Concepts From the Last Report}
 \subsection*{Regular Expressions and Derivatives}
 Suppose (basic) regular expressions are given by the following grammar:
 will be reduced to just 6 and stays constant, no matter how long the
 input string is.
-\section{Introduction}
+\section{Current Work and Progress}
-While we believe derivatives of regular expressions, written
-$r\backslash s$, are a beautiful concept (in terms of ease of
-implementing them in functional programming languages and in terms of
-reasoning about them formally), they have one major drawback: every
-derivative step can make regular expressions grow drastically in
-size. This in turn has negative effect on the runtime of the
-corresponding lexing algorithms. Consider for example the regular
-expression $(a+aa)^*$ and the short string $aaaaaaaaaaaa$. The
-corresponding derivative contains already 8668 nodes where we assume
-the derivative is given as a tree. The reason for the poor runtime of
-the derivative-based lexing algorithms is that they need to traverse
-such trees over and over again. The solution is to find a complete set
-of simplification rules that keep the sizes of derivatives uniformly
-small.
 For reasons beyond this report, it turns out that a complete set of
 simplification rules depends on values being encoded as
 bitsequences.\footnote{Values are the results the lexing algorithms
 generate; they encode how a regular expression matched a string.} We
 already know that the lexing algorithm using bitsequences but

changeset 126	1260b383ae2c
parent 125	788f4aa28bc5
child 127	580e044af0f7