lexing: comparison ChengsongTanPhdThesis/Chapters/Introduction.tex

equal deleted inserted replaced

-:bc5571c38d1f
+:2ad20ba5b178
 %This part is about regular expressions, Brzozowski derivatives,
 %and a bit-coded lexing algorithm with proven correctness and time bounds.
 %TODO: look up snort rules to use here--give readers idea of what regexes look like
+\marginpar{rephrasing using "imprecise words"}
 Regular expressions, since their inception in the 1940s,
 have been subject to extensive study and implementation.
 Their primary application lies in text processing--finding
 matches and identifying patterns in a string.
 %It is often used to match strings that comprises of numerous fields,
 %where certain fields may recur or be omitted.
 For example, a simple regular expression that tries
 to recognise email addresses is
 \marginpar{rephrased from "the regex for recognising" to "a simple regex that tries to match email"}
 \begin{center}
-$[a-z0-9.\_]^\backslash+@[a-z0-9.-]^\backslash+\.\{a-z\}\{2,6\}$
+\verb|[a-z0-9._]^+@[a-z0-9.-]^+\.\{a-z\}\{2,6\}|
 %$[a-z0-9._]^+@[a-z0-9.-]^+\.[a-z]{2,6}$.
 \end{center}
 \marginpar{Simplified example, but the distinction between . and escaped . is correct
-and therefore left unchanged.}
+and therefore left unchanged. Also verbatim package does not straightforwardly support superscripts so + kept as they are.}
 %Using this, regular expression matchers and lexers are able to extract
 %the domain names by the use of \verb|[a-zA-Z0-9.-]+|.
 \marginpar{Rewrote explanation for the expression.}
 The bracketed sub-expressions are used to extract specific parts of an email address.
 The local part is recognised by the expression enclosed in

changeset 654	2ad20ba5b178
parent 653	bc5571c38d1f
child 664	ba44144875b1