lexing: comparison thys2/Paper/Paper.thy

-:484403cf0c9d
+:6e269f557fc5
 \cite[Page 14]{Sulzmann2014}.
 Given the growth of the
 derivatives in some cases even after aggressive simplification, this
 is a hard-to-believe fact. A similar claim about a theoretical runtime
 of @{text "O(n\<^sup>2)"} is made for the Verbatim lexer, which calculates
-POSIX matches and is based on
+tokens according to POSIX rules \cite{verbatim}. For this it uses Brzozowski's
-derivatives \cite{verbatim}. In this case derivatives are not even simplified.
+derivatives.
-They write: ``For a specific list of lexical rules, Verbatim has quadratic
+They write: ``The results of our empirical tests [..] confirm that Verbatim has
-theoretical time complexity with respect to the length of the input string.''
+$O(n^2)$ time complexity.'' \cite[Section~VII]{verbatim}.
 While their correctness proof for Verbatim is formalised in Coq, the claim about
-the runtime complexity is only supported by empirical evidence.
+the runtime complexity is only supported by some empirical evidence.
-When we
+In the context of our observation with the ``growth problem'' of derivatives,
-tried out their extracted OCaml code with our example
+we
-\mbox{@{text "(a + aa)\<^sup>*"}}, it took around 5 minutes to tokenise a
+tried out their extracted OCaml code with the example
+\mbox{@{text "(a + aa)\<^sup>*"}} as a single lexing rule, and it took us around 5 minutes to tokenise a
 string of 40 $a$'s and that increased to approximately 19 minutes when the
-string was 50 $a$'s long. Given that derivatives are not simplified in
+string was 50 $a$'s long. Given that derivatives are not simplified in the Verbatim
-the work on Verbatim, such numbers are not surprising.
+lexer, such numbers are not surprising.
 Clearly our result of having finite
 derivatives might sound rather weak in this context, but we think such efficiency claims
 really require further scrutiny.\medskip
 \noindent

changeset 460	6e269f557fc5
parent 459	484403cf0c9d
child 461	c4b6906068a9