regexp: comparison Paper/Paper.thy

equal deleted inserted replaced

-:2335fcb96052
+:d63baacbdb16
 abbreviation
 "EClass x R \<equiv> R `` {x}"
 notation (latex output)
 str_eq_rel ("\<approx>\<^bsub>_\<^esub>") and
+str_eq ("_ \<approx>\<^bsub>_\<^esub> _") and
 Seq (infixr "\<cdot>" 100) and
 Star ("_\<^bsup>\<star>\<^esup>") and
 pow ("_\<^bsup>_\<^esup>" [100, 100] 100) and
 Suc ("_+1" [100] 100) and
 quotient ("_ \<^raw:\ensuremath{\!\sslash\!}> _" [90, 90] 90) and
 REL ("\<approx>") and
 UPLUS ("_ \<^raw:\ensuremath{\uplus}> _" [90, 90] 90) and
-L ("L '(_')" [0] 101) and
+L ("L'(_')" [0] 101) and
+Lam ("\<lambda>'(_')" [100] 100) and
+Trn ("_, _" [100, 100] 100) and
 EClass ("\<lbrakk>_\<rbrakk>\<^bsub>_\<^esub>" [100, 100] 100) and
-transition ("_ \<^raw:\ensuremath{\stackrel{\text{>_\<^raw:}}{\Longrightarrow}}> _" [100, 100, 100] 100)
+transition ("_ \<^raw:\ensuremath{\stackrel{\text{>_\<^raw:}}{\Longmapsto}}> _" [100, 100, 100] 100)
 (*>*)
 section {* Introduction *}
 @{text "\<lbrakk>x\<rbrakk>\<^isub>\<approx>"} for the equivalence class defined
 as @{text "{y | y \<approx> x}"}.
 Central to our proof will be the solution of equational systems
-involving regular expressions. For this we will use Arden's lemma \cite{}
+involving regular expressions. For this we will use Arden's lemma \cite{Brzozowski64}
 which solves equations of the form @{term "X = A ;; X \<union> B"} provided
 @{term "[] \<notin> A"}. However we will need the following ``reverse''
 version of Arden's lemma.
-\begin{lemma}[Reverse Arden's Lemma]\mbox{}\\
+\begin{lemma}[Reverse Arden's Lemma]\label{arden}\mbox{}\\
 If @{thm (prem 1) ardens_revised} then
 @{thm (lhs) ardens_revised} has the unique solution
 @{thm (rhs) ardens_revised}.
 \end{lemma}
 \begin{proof}
 For the right-to-left direction we assume @{thm (rhs) ardens_revised} and show
-that @{thm (lhs) ardens_revised} holds. From Prop.~\ref{langprops}$(i)$
+that @{thm (lhs) ardens_revised} holds. From Prop.~\ref{langprops}@{text "(i)"}
 we have @{term "A\<star> = {[]} \<union> A ;; A\<star>"},
 which is equal to @{term "A\<star> = {[]} \<union> A\<star> ;; A"}. Adding @{text B} to both
 sides gives @{term "B ;; A\<star> = B ;; ({[]} \<union> A\<star> ;; A)"}, whose right-hand side
 is equal to @{term "(B ;; A\<star>) ;; A \<union> B"}. This completes this direction.
 Using this property we can show that @{term "B ;; (A \<up> n) \<subseteq> X"} holds for
 all @{text n}. From this we can infer @{term "B ;; A\<star> \<subseteq> X"} using the definition
 of @{text "\<star>"}.
 For the inclusion in the other direction we assume a string @{text s}
 with length @{text k} is element in @{text X}. Since @{thm (prem 1) ardens_revised}
-we know by Prop.~\ref{langprops}$(ii)$ that
+we know by Prop.~\ref{langprops}@{text "(ii)"} that
 @{term "s \<notin> X ;; (A \<up> Suc k)"} since its length is only @{text k}
 (the strings in @{term "X ;; (A \<up> Suc k)"} are all longer).
 From @{text "(*)"} it follows then that
 @{term s} must be element in @{term "(\<Union>m\<in>{0..k}. B ;; (A \<up> m))"}. This in turn
-implies that @{term s} is in @{term "(\<Union>n. B ;; (A \<up> n))"}. Using Prop.~\ref{langprops}$(iii)$
+implies that @{term s} is in @{term "(\<Union>n. B ;; (A \<up> n))"}. Using Prop.~\ref{langprops}@{text "(iii)"}
 this is equal to @{term "B ;; A\<star>"}, as we needed to show.\qed
 \end{proof}
 \noindent
 Regular expressions are defined as the following inductive datatype
 *}
 section {* Finite Partitions Imply Regularity of a Language *}
 text {*
+The central definition in the Myhill-Nerode theorem is the
+\emph{Myhill-Nerode relation}, which states that w.r.t.~a language two
+strings are related, provided there is no distinguishing extension in this
+language. This can be defined as:
 \begin{definition}[Myhill-Nerode Relation]\mbox{}\\
-@{thm str_eq_rel_def[simplified]}
+@{thm str_eq_def[simplified str_eq_rel_def Pair_Collect]}
 \end{definition}
 \noindent
-It is easy to see that @{term "\<approx>A"} is an equivalence relation, which partitions
+It is easy to see that @{term "\<approx>A"} is an equivalence relation, which
-the set of all string, @{text "UNIV"}, into a set of equivalence classed. We define
+partitions the set of all strings, @{text "UNIV"}, into a set of disjoint
-the set @{term "finals A"} as those equivalence classes that contain strings of
+equivalence classes. One direction of the Myhill-Nerode theorem establishes
-@{text A}, namely
+that if there are finitely many equivalence classes, then the language is
+regular. In our setting we have therefore
+\begin{theorem}\label{myhillnerodeone}
+@{thm[mode=IfThen] hard_direction}
+\end{theorem}
+\noindent
+To prove this theorem, we define the set @{term "finals A"} as those equivalence
+classes that contain strings of @{text A}, namely
+%
 \begin{equation}
 @{thm finals_def}
 \end{equation}
 \noindent
-It is easy to show that @{thm lang_is_union_of_finals} holds. We can also define
+It is straightforward to show that @{thm lang_is_union_of_finals} holds.
-a notion of \emph{transition} between equivalence classes as
+Therefore if we know that there exists a regular expression for every
+equivalence class in @{term "finals A"} (which by assumption must be
+a finite set), then we can just combine these regular expressions with @{const ALT}
+and obtain a regular expression that matches every string in @{text A}.
+We prove Thm.~\ref{myhillnerodeone} by calculating a regular expression for
+\emph{all} equivalence classes, not just the ones in @{term "finals A"}. We
+first define a notion of \emph{transition} between equivalence classes
+%
 \begin{equation}
 @{thm transition_def}
 \end{equation}
 \noindent
-which means if we add the character @{text c} to all strings in the equivalence
+which means that if we concatenate all strings matching the regular expression @{text r}
-class @{text Y} HERE
+to the end of all strings in the equivalence class @{text Y}, we obtain a subset of
+@{text X}. Note that we do not define any automaton here, we merely relate two sets
-\begin{theorem}
+w.r.t.~a regular expression.
-Given a language @{text A}.
-@{thm[mode=IfThen] hard_direction}
+Next we build an equational system that
-\end{theorem}
+contains an equation for each equivalence class. Suppose we have
+the equivalence classes @{text "X\<^isub>1,\<dots>,X\<^isub>n"}, there must be one and only one that
+contains the empty string @{text "[]"} (since equivalence classes are disjoint).
+Let us assume @{text "[] \<in> X\<^isub>1"}. We can build the following equational system
+\begin{center}
+\begin{tabular}{rcl}
+@{text "X\<^isub>1"} & @{text "="} & @{text "(Y\<^isub>1\<^isub>1, CHAR c\<^isub>1\<^isub>1) + \<dots> + (Y\<^isub>1\<^isub>p, CHAR c\<^isub>1\<^isub>p) + \<lambda>(EMPTY)"} \\
+@{text "X\<^isub>2"} & @{text "="} & @{text "(Y\<^isub>2\<^isub>1, CHAR c\<^isub>2\<^isub>1) + \<dots> + (Y\<^isub>2\<^isub>o, CHAR c\<^isub>2\<^isub>o)"} \\
+& $\vdots$ \\
+@{text "X\<^isub>n"} & @{text "="} & @{text "(Y\<^isub>n\<^isub>1, CHAR c\<^isub>n\<^isub>1) + \<dots> + (Y\<^isub>n\<^isub>q, CHAR c\<^isub>n\<^isub>q)"}\\
+\end{tabular}
+\end{center}
+\noindent
+where the pairs @{text "(Y\<^isub>i\<^isub>j, r\<^isub>i\<^isub>j)"} stand for all transitions
+@{term "Y\<^isub>i\<^isub>j \<Turnstile>r\<^isub>i\<^isub>j\<Rightarrow> X\<^isub>i"}.  The term @{text "\<lambda>(EMPTY)"} acts as a marker for the equivalence
+class containing @{text "[]"}. (Note that we mark, roughly speaking, the
+single ``initial'' state in the equational system, which is different from
+the method by Brzozowski \cite{Brzozowski64}, which marks the ``terminal''
+states.) Overloading the function @{text L} for the two kinds of terms in the
+equational system as follows
+\begin{center}
+@{thm L_rhs_e.simps(1)[where r="r", THEN eq_reflection]}\hspace{20mm}
+@{thm L_rhs_e.simps(2)[where X="X" and r="r", THEN eq_reflection]}
+\end{center}
+\noindent
+we can prove for @{text "X\<^isub>2\<^isub>.\<^isub>.\<^isub>n"} that the following equations
+%
+\begin{equation}\label{inv1}
+@{text "X\<^isub>i = L(Y\<^isub>i\<^isub>1, CHAR c\<^isub>i\<^isub>1) \<union> \<dots> \<union> L(Y\<^isub>i\<^isub>q, CHAR c\<^isub>i\<^isub>q)"}.
+\end{equation}
+\noindent
+hold. Similarly for @{text "X\<^isub>1"} we can show the following equation
+%
+\begin{equation}\label{inv2}
+@{text "X\<^isub>1 = L(Y\<^isub>i\<^isub>1, CHAR c\<^isub>i\<^isub>1) \<union> \<dots> \<union> L(Y\<^isub>i\<^isub>p, CHAR c\<^isub>i\<^isub>p) \<union> L(\<lambda>(EMPTY))"}.
+\end{equation}
+\noindent
+hold. The reason for adding the @{text \<lambda>}-marker to our equational system is
+to obtain this equations, which only holds in this form since none of
+the other terms contain the empty string. Our proof of Thm.~\ref{myhillnerodeone}
+will be by transforming the equational system into a \emph{solved form}
+maintaining the invariants \eqref{inv1} and \eqref{inv2}. From the solved
+form we will be able to read off the regular expressions using our
+variant of Arden's Lemma (Lem.~\ref{arden}).
 *}
 section {* Regular Expressions Generate Finitely Many Partitions *}
 text {*

changeset 75	d63baacbdb16
parent 71	426070e68b21
child 77	63bc9f9d96ba