regexp: comparison Journal/Paper.thy

equal deleted inserted replaced

-:e5e32faa2446
+:92ca56c1a199
 @{text "A\<^sub>1 \<uplus> A\<^sub>2 \<equiv> {(1, x) | x \<in> A\<^sub>1} \<union> {(2, y) | y \<in> A\<^sub>2}"}
 \end{equation}
 \noindent
 changes the type---the disjoint union is not a set, but a set of
-pairs. Using this definition for disjoint union means we do not have a
+pairs. Using this definition for disjoint union means we do not have
-single type for the states of automata. As a result we will not be able to
+a single type for the states of automata. As a result we will not be
-define a regular language as one for which there exists
+able to define a regular language as one for which there exists an
-an automaton that recognises all its strings (Definition~\ref{baddef}). This
+automaton that recognises all its strings
-is because we cannot make a definition in HOL that is only polymorphic in
+(Definition~\ref{baddef}). This is because we cannot make a
-the state type, but not in the predicate for regularity; and there is no
+definition in HOL that is only polymorphic in the state type, but
-type quantification available in HOL (unlike in Coq, for
+not in the predicate for regularity; and there is no type
-example).\footnote{Slind already pointed out this problem in an email to the
+quantification available in HOL.\footnote{Slind already pointed out
-HOL4 mailing list on 21st April 2005.}
+this problem in an email to the HOL4 mailing list on 21st April
-%$^,$\footnote{While in Coq one can avoid
+2005.} Coq, for example, has quantification over types and thus can
-%this particular problem, all other difficulties we point out below still apply.}
+state such a definition.  This has been recently exploited in a
+slick formalisation of the Myhill-Nerode theorem in Coq by
+\citeN{XXX}.
 An alternative, which provides us with a single type for states of automata,
 is to give every state node an identity, for example a natural number, and
 then be careful to rename these identities apart whenever connecting two
 automata. This results in clunky proofs establishing that properties are
 We presented two proofs for the second direction of the
 Myhill-Nerode Theorem. One direct proof using tagging-functions and
 another using partial derivatives. This part of our work is where
 our method using regular expressions shines, because we can perform
-an induction on the structure of refular expressions. However, it is
+an induction on the structure of regular expressions. However, it is
 also the direction where we had to spend most of the `conceptual'
 time, as our first proof based on tagging-functions is new for
 establishing the Myhill-Nerode Theorem. All standard proofs of this
 direction proceed by arguments over automata.
 from all these comparisons is that if one is interested in formalising
 results from regular language theory, not results from automata theory,
 then regular expressions are easier to work with formally.
 The code of
 our formalisation \cite{myhillnerodeafp11} can be found in the Archive of Formal Proofs at
-\mbox{\url{http://afp.sourceforge.net/entries/Myhill-Nerode.shtml}}.\smallskip
+\mbox{\url{http://afp.sourceforge.net/entries/Myhill-Nerode.shtml}}.
+Our future work will focus on formalising the regular expression matchers
+developed by \citeN{Sulzmann12} which generate variable assignments for
+regular expression submatching.\smallskip
 \noindent
 {\bf Acknowledgements:}
 We are grateful for the comments we received from Larry Paulson.  Tobias
 Nipkow made us aware of the properties in Theorem~\ref{subseqreg} and Tjark
 Weber helped us with proving them. Christian Sternagel provided us with a

changeset 386	92ca56c1a199
parent 385	e5e32faa2446
child 387	288637d9dcde