regexp: comparison Journal/Paper.thy

equal deleted inserted replaced

-:813e7257c7c3
+:d47c2143ab8a
 begin
 declare [[show_question_marks = false]]
 consts
-REL :: "(string \<times> string) \<Rightarrow> bool"
+REL :: "(string \<times> string) set"
 UPLUS :: "'a set \<Rightarrow> 'a set \<Rightarrow> (nat \<times> 'a) set"
 abbreviation
 "EClass x R \<equiv> R `` {x}"
 an automaton that recognises all its strings (Definition~\ref{baddef}). This
 is because we cannot make a definition in HOL that is only polymorphic in
 the state type, but not in the predicate for regularity; and there is no
 type quantification available in HOL (unlike in Coq, for
 example).\footnote{Slind already pointed out this problem in an email to the
-HOL4 mailing list on 21st April 2005.}
+HOL4 mailing list on 21st April 2005.}$^,$\footnote{While in Coq one can avoid
+this particular problem, all other difficulties we point out below still apply.}
 An alternative, which provides us with a single type for states of automata,
 is to give every state node an identity, for example a natural number, and
 then be careful to rename these identities apart whenever connecting two
 automata. This results in clunky proofs establishing that properties are
 Kozen's proof of the Myhill-Nerode Theorem requires that automata do not
 have inaccessible states \cite{Kozen97}. Another subtle side-condition is
 completeness of automata, that is automata need to have total transition
 functions and at most one `sink' state from which there is no connection to
 a final state (Brzozowski mentions this side-condition in the context of
-state complexity of automata \cite{Brzozowski10}). Such side-conditions mean
+state complexity of automata \cite{Brzozowski10}, but it is also needed
+in the usual construction of the complement automaton). Such side-conditions mean
 that if we define a regular language as one for which there exists \emph{a}
 finite automaton that recognises all its strings (see
 Definition~\ref{baddef}), then we need a lemma which ensures that another
 equivalent one can be found satisfying the side-condition, and also need to
 make sure our operations on automata preserve them. Unfortunately, such
 \noindent
 In what follows we shall use this convenient short-hand notation for images of sets
 also with other functions.
 *}
 section {* The Myhill-Nerode Theorem, First Part *}
 text {*
 \noindent
 Nipkow of the Gauss-Jordan algorithm \cite{Nipkow11}.
 While our formalisation might appear large, it should be seen
 in the context of the work done by Constable at al \cite{Constable00} who
 formalised the Myhill-Nerode Theorem in Nuprl using automata. They write
-that their four-member team needed something on the magnitude of 18 months
+that their four-member team would need something on the magnitude of 18 months
-for their formalisation. It is hard to gauge the size of a
+for their formalisation of the first eleven chapters of \cite{HopcroftUllman69},
+which includes the Myhill-Nerode theorem. It is hard to gauge the size of a
 formalisation in Nurpl, but from what is shown in the Nuprl Math Library
 about their development it seems substantially larger than ours. We attribute
 this to our use of regular expressions, which meant we did not need to `fight'
 the theorem prover.
 Also, Filli\^atre reports that his formalisation in

changeset 334	d47c2143ab8a
parent 259	aad64c63960e
child 338	e7504bfdbd50