regexp: comparison csupp.tex

-:093e45c44d91
+:86a4182c73e7
 \begin{center}
 \begin{tabular}{c}
 \\[-5mm]
-\LARGE\bf Certified Parsing\\[-10mm]
+\LARGE\bf Novel Certified Parsers\\[-10mm]
 \mbox{}
 \end{tabular}
 \end{center}
 \thispagestyle{empty}
 \mbox{}\\[-5mm]
 Parsers transform plain text into some abstract structure that can be analyzed by
 computers for further processing.  One might think that parsers have been
 studied to death, and after \emph{yacc} and \emph{lex} no new results can be
 obtained in this area.  However, recent developments and novel approaches make
 it increasingly clear that this is not true anymore~\cite{Might11}. And
-there is a practical need for new results: for example the future HTML 5
+there is a real practical need for new results: for example the future HTML 5
 Standard abandons a well-defined grammar specification, in favour of a bespoke
 parser given as pseudo code.
 This work targets parsers from a certification point of view. Increasingly,
 parsers are part of certified compilers, like
 correctness of the compiler. In this project, we aim to change this situation
 with the help of the theorem prover Isabelle/HOL.
 Only in the last few years have theorem provers become powerful enough for
 establishing the correctness of some standard lexing and parsing
-algorithms. For this, the algorithms still need to be formulated in way so
+algorithms. For this, the algorithms still need to be formulated in a way so
 that it is easy to reason about them. In our earlier work about lexing and
 regular languages, we showed that this precludes well-known algorithms based
 on automata~\cite{WuZhangUrban11}. However, we also showed that regular languages can be formulated and
 reasoned about entirely in terms of regular expressions, which can be easily
 represented in theorem provers. This work uses the device of derivatives of
 algorithm or inventing a new one. For this we will first devise a
 fixed-point semantics of PEGs, against which we can certify a parser. For this
 semantics we take as starting point the paper~\cite{Ford04}, which does not
 treat left-recursion, but gives an operational semantics for PEG
 parsing. There are also good indications that we can adapt work on Boolean
-Grammars~\cite{Okhotin04}, which are similar to PEGs, and for which the
+Grammars~\cite{Okhotin04}, which are similar to PEGs and for which the
 paper~\cite{KountouriotisNR09} gives a fixed-point semantics
 to negation operators, but not to the Kleene star.
-For the parsing algorithm, we might be able to draw inspiration from parsers
+For the parsing algorithm, we might be able to build upon
-based on the classic Cocke-Younger-Kasami (CYK)
+the classic Cocke-Younger-Kasami (CYK)
 algorithms~\cite{KountouriotisNR09} and
 Earley~\cite{AycHor02, Earley70} parsers. The defect of CYK algorithms, however,
 is that the grammar specifications given by the user need to be transformed
 into a normal form. This transformation may potentially lead to rule explosion
 and hence inefficient parsing. We will investigate whether this transformation
 ~\cite{Brzozowski64,Might11,OwensReppyTuron09}
 can be generalised to PEG parsing. In earlier work, we showed that lexing based on
 derivatives gives rise to very elegant regular expression matchers that can be
 certified in a theorem prover with ease.  We will study whether the idea of
 taking a derivative of a regular expression can be extended to rules in
-grammars. The problem that needs to be overcome is again how to deal with possible
+grammars. The problem that needs to be addressed is again how to deal with
-left-recursion in grammar rules.
+left-recursive grammar rules.
 \bibliography{Journal/document/root}
 \bibliographystyle{abbrv}

changeset 243	86a4182c73e7
parent 242	093e45c44d91
child 244	a9598a206c41