regexp: Journal/Paper.thy@2c56b20032a7 (annotated)

24 f72c82bf59e5 added paper urbanc parents: diff changeset	1	(<)
f72c82bf59e5 added paper urbanc parents: diff changeset	2	theory Paper
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	3	imports "../Closures2" "../Attic/Prefix_subtract"
24 f72c82bf59e5 added paper urbanc parents: diff changeset	4	begin
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	5
a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	6	declare [[show_question_marks = false]]
a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	7
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	8	consts
334 d47c2143ab8a partially updated conference paper; slightly tuned journal paper urbanc parents: 259 diff changeset	9	REL :: "(string \<times> string) set"
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	10	UPLUS :: "'a set \<Rightarrow> 'a set \<Rightarrow> (nat \<times> 'a) set"
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	11
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	12	abbreviation
8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	13	"EClass x R \<equiv> R `` {x}"
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	14
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	15	abbreviation
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	16	"Append_rexp2 r_itm r \<equiv> Append_rexp r r_itm"
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	17
a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	18
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	19	abbreviation
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	20	"pow" (infixl "\<up>" 100)
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	21	where
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	22	"A \<up> n \<equiv> A ^^ n"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	23
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	24	syntax (latex output)
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	25	"_Collect" :: "pttrn => bool => 'a set" ("(1{_ \| _})")
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	26	"_CollectIn" :: "pttrn => 'a set => bool => 'a set" ("(1{_ \<in> _ \| _})")
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	27	translations
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	28	"_Collect p P" <= "{p. P}"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	29	"_Collect p P" <= "{p\|xs. P}"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	30	"_CollectIn p A P" <= "{p : A. P}"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	31
242 093e45c44d91 polished proposal urbanc parents: 240 diff changeset	32	syntax (latex output)
093e45c44d91 polished proposal urbanc parents: 240 diff changeset	33	"_UNION_le" :: "'a \<Rightarrow> 'a => 'b set => 'b set" ("(3\<Union>(00\<^bsub>_ \<le> _\<^esub>)/ _)" [0, 0, 10] 10)
093e45c44d91 polished proposal urbanc parents: 240 diff changeset	34
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	35	abbreviation "ZERO \<equiv> Zero"
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	36	abbreviation "ONE \<equiv> One"
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	37	abbreviation "ATOM \<equiv> Atom"
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	38	abbreviation "PLUS \<equiv> Plus"
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	39	abbreviation "TIMES \<equiv> Times"
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	40	abbreviation "TIMESS \<equiv> Timess"
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	41	abbreviation "STAR \<equiv> Star"
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	42	abbreviation "ALLS \<equiv> Star Allreg"
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	43
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	44	definition
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	45	Delta :: "'a lang \<Rightarrow> 'a lang"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	46	where
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	47	"Delta A = (if [] \<in> A then {[]} else {})"
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	48
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	49	notation (latex output)
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	50	str_eq ("\<approx>\<^bsub>_\<^esub>") and
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	51	str_eq_applied ("_ \<approx>\<^bsub>_\<^esub> _") and
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	52	conc (infixr "\<cdot>" 100) and
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	53	star ("_\<^bsup>\<star>\<^esup>" [101] 200) and
50 32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	54	pow ("_\<^bsup>_\<^esup>" [100, 100] 100) and
58 0d4d5bb321dc a little bit in the introduction urbanc parents: 54 diff changeset	55	Suc ("_+1" [100] 100) and
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	56	quotient ("_ \<^raw:\ensuremath{\!\sslash\!}> _" [90, 90] 90) and
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	57	REL ("\<approx>") and
67 7478be786f87 more intro urbanc parents: 66 diff changeset	58	UPLUS ("_ \<^raw:\ensuremath{\uplus}> _" [90, 90] 90) and
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	59	lang ("\<^raw:\ensuremath{\cal{L}}>" 101) and
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	60	lang ("\<^raw:\ensuremath{\cal{L}}>'(_')" [0] 101) and
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	61	lang_trm ("\<^raw:\ensuremath{\cal{L}}>'(_')" [0] 101) and
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	62	Lam ("\<lambda>'(_')" [100] 100) and
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	63	Trn ("'(_, _')" [100, 100] 100) and
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	64	EClass ("\<lbrakk>_\<rbrakk>\<^bsub>_\<^esub>" [100, 100] 100) and
88 1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	65	transition ("_ \<^raw:\ensuremath{\stackrel{\text{>_\<^raw:}}{\Longmapsto}}> _" [100, 100, 100] 100) and
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	66	Setalt ("\<^raw:\ensuremath{\bigplus}>_" [1000] 999) and
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	67	Append_rexp2 ("_ \<^raw:\ensuremath{\triangleleft}> _" [100, 100] 100) and
e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	68	Append_rexp_rhs ("_ \<^raw:\ensuremath{\triangleleft}> _" [100, 100] 50) and
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	69
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	70	uminus ("\<^raw:\ensuremath{\overline{\isa{>_\<^raw:}}}>" [100] 100) and
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	71	tag_Plus ("+tag _ _" [100, 100] 100) and
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	72	tag_Plus ("+tag _ _ _" [100, 100, 100] 100) and
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	73	tag_Times ("\<times>tag _ _" [100, 100] 100) and
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	74	tag_Times ("\<times>tag _ _ _" [100, 100, 100] 100) and
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	75	tag_Star ("\<star>tag _" [100] 100) and
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	76	tag_Star ("\<star>tag _ _" [100, 100] 100) and
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	77	tag_eq ("\<^raw:$\threesim$>\<^bsub>_\<^esub>" [100] 100) and
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	78	Delta ("\<Delta>'(_')") and
180 b755090d0f3d added a picture urbanc parents: 179 diff changeset	79	nullable ("\<delta>'(_')") and
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	80	Cons ("_ :: _" [100, 100] 100) and
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	81	Rev ("Rev _" [1000] 100) and
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	82	Deriv ("Der _ _" [1000, 1000] 100) and
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	83	Derivs ("Ders") and
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	84	ONE ("ONE" 999) and
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	85	ZERO ("ZERO" 999) and
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	86	pderivs_lang ("pdersl") and
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	87	UNIV1 ("UNIV\<^isup>+") and
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	88	Deriv_lang ("Dersl") and
217 05da74214979 a few bits on the journal paper urbanc parents: 203 diff changeset	89	Derivss ("Derss") and
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	90	deriv ("der") and
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	91	derivs ("ders") and
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	92	pderiv ("pder") and
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	93	pderivs ("pders") and
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	94	pderivs_set ("pderss") and
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	95	SUBSEQ ("Sub") and
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	96	SUPSEQ ("Sup") and
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	97	UP ("'(_')\<up>") and
13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	98	ALLS ("ALL")
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	99
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	100
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	101	lemmas Deriv_simps = Deriv_empty Deriv_epsilon Deriv_char Deriv_union
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	102
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	103	definition
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	104	Der :: "'a \<Rightarrow> 'a lang \<Rightarrow> 'a lang"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	105	where
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	106	"Der c A \<equiv> {s. [c] @ s \<in> A}"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	107
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	108	definition
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	109	Ders :: "'a list \<Rightarrow> 'a lang \<Rightarrow> 'a lang"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	110	where
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	111	"Ders s A \<equiv> {s'. s @ s' \<in> A}"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	112
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	113
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	114	lemma meta_eq_app:
ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	115	shows "f \<equiv> \<lambda>x. g x \<Longrightarrow> f x \<equiv> g x"
ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	116	by auto
ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	117
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	118	lemma str_eq_def':
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	119	shows "x \<approx>A y \<equiv> (\<forall>z. x @ z \<in> A \<longleftrightarrow> y @ z \<in> A)"
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	120	unfolding str_eq_def by simp
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	121
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	122	lemma conc_def':
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	123	"A \<cdot> B = {s\<^isub>1 @ s\<^isub>2 \| s\<^isub>1 s\<^isub>2. s\<^isub>1 \<in> A \<and> s\<^isub>2 \<in> B}"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	124	unfolding conc_def by simp
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	125
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	126	lemma conc_Union_left:
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	127	shows "B \<cdot> (\<Union>n. A \<up> n) = (\<Union>n. B \<cdot> (A \<up> n))"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	128	unfolding conc_def by auto
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	129
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	130	lemma test:
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	131	assumes X_in_eqs: "(X, rhs) \<in> Init (UNIV // \<approx>A)"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	132	shows "X = \<Union> (lang_trm ` rhs)"
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	133	using assms l_eq_r_in_eqs by (simp)
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	134
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	135	abbreviation
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	136	notprec ("_ \<^raw:\mbox{$\not\preceq$}>_")
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	137	where
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	138	"notprec x y \<equiv> \<not>(x \<preceq> y)"
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	139
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	140	abbreviation
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	141	minimal_syn ("min\<^bsub>_\<^esub> _")
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	142	where
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	143	"minimal_syn A x \<equiv> minimal x A"
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	144
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	145
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	146	(* THEOREMS *)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	147
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	148	notation (Rule output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	149	"==>" ("\<^raw:\mbox{}\inferrule{\mbox{>_\<^raw:}}>\<^raw:{\mbox{>_\<^raw:}}>")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	150
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	151	syntax (Rule output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	152	"_bigimpl" :: "asms \<Rightarrow> prop \<Rightarrow> prop"
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	153	("\<^raw:\mbox{}\inferrule{>_\<^raw:}>\<^raw:{\mbox{>_\<^raw:}}>")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	154
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	155	"_asms" :: "prop \<Rightarrow> asms \<Rightarrow> asms"
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	156	("\<^raw:\mbox{>_\<^raw:}\\>/ _")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	157
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	158	"_asm" :: "prop \<Rightarrow> asms" ("\<^raw:\mbox{>_\<^raw:}>")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	159
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	160	notation (Axiom output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	161	"Trueprop" ("\<^raw:\mbox{}\inferrule{\mbox{}}{\mbox{>_\<^raw:}}>")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	162
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	163	notation (IfThen output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	164	"==>" ("\<^raw:{\normalsize{}>If\<^raw:\,}> _/ \<^raw:{\normalsize \,>then\<^raw:\,}>/ _.")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	165	syntax (IfThen output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	166	"_bigimpl" :: "asms \<Rightarrow> prop \<Rightarrow> prop"
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	167	("\<^raw:{\normalsize{}>If\<^raw:\,}> _ /\<^raw:{\normalsize \,>then\<^raw:\,}>/ _.")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	168	"_asms" :: "prop \<Rightarrow> asms \<Rightarrow> asms" ("\<^raw:\mbox{>_\<^raw:}> /\<^raw:{\normalsize \,>and\<^raw:\,}>/ _")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	169	"_asm" :: "prop \<Rightarrow> asms" ("\<^raw:\mbox{>_\<^raw:}>")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	170
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	171	notation (IfThenNoBox output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	172	"==>" ("\<^raw:{\normalsize{}>If\<^raw:\,}> _/ \<^raw:{\normalsize \,>then\<^raw:\,}>/ _.")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	173	syntax (IfThenNoBox output)
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	174	"_bigimpl" :: "asms \<Rightarrow> prop \<Rightarrow> prop"
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	175	("\<^raw:{\normalsize{}>If\<^raw:\,}> _ /\<^raw:{\normalsize \,>then\<^raw:\,}>/ _.")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	176	"_asms" :: "prop \<Rightarrow> asms \<Rightarrow> asms" ("_ /\<^raw:{\normalsize \,>and\<^raw:\,}>/ _")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	177	"_asm" :: "prop \<Rightarrow> asms" ("_")
61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	178
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	179	lemma pow_length:
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	180	assumes a: "[] \<notin> A"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	181	and b: "s \<in> A \<up> Suc n"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	182	shows "n < length s"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	183	using b
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	184	proof (induct n arbitrary: s)
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	185	case 0
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	186	have "s \<in> A \<up> Suc 0" by fact
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	187	with a have "s \<noteq> []" by auto
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	188	then show "0 < length s" by auto
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	189	next
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	190	case (Suc n)
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	191	have ih: "\<And>s. s \<in> A \<up> Suc n \<Longrightarrow> n < length s" by fact
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	192	have "s \<in> A \<up> Suc (Suc n)" by fact
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	193	then obtain s1 s2 where eq: "s = s1 @ s2" and : "s1 \<in> A" and *: "s2 \<in> A \<up> Suc n"
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	194	by (auto simp add: Seq_def)
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	195	from ih ** have "n < length s2" by simp
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	196	moreover have "0 < length s1" using * a by auto
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	197	ultimately show "Suc n < length s" unfolding eq
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	198	by (simp only: length_append)
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	199	qed
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	200
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	201
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	202
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	203
24 f72c82bf59e5 added paper urbanc parents: diff changeset	204	(>)
f72c82bf59e5 added paper urbanc parents: diff changeset	205
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	206
24 f72c82bf59e5 added paper urbanc parents: diff changeset	207	section {* Introduction *}
f72c82bf59e5 added paper urbanc parents: diff changeset	208
f72c82bf59e5 added paper urbanc parents: diff changeset	209	text {*
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	210	\noindent
58 0d4d5bb321dc a little bit in the introduction urbanc parents: 54 diff changeset	211	Regular languages are an important and well-understood subject in Computer
60 fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	212	Science, with many beautiful theorems and many useful algorithms. There is a
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	213	wide range of textbooks on this subject, many of which are aimed at students
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	214	and contain very detailed `pencil-and-paper' proofs
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	215	(e.g.~the textbooks by \citeN{HopcroftUllman69} and by \citeN{Kozen97}).
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	216	It seems natural to exercise theorem provers by
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	217	formalising the theorems and by verifying formally the algorithms.
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	218
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	219	A popular choice for a theorem prover would be one based on Higher-Order
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	220	Logic (HOL), for example HOL4, HOLlight or Isabelle/HOL. For the development
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	221	presented in this paper we will use the Isabelle/HOL. HOL is a predicate calculus
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	222	that allows quantification over predicate variables. Its type system is
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	223	based on the Simple Theory of Types by \citeN{Church40}. Although many
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	224	mathematical concepts can be conveniently expressed in HOL, there are some
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	225	limitations that hurt when attempting a simple-minded formalisation
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	226	of regular languages in it.
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	227
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	228	The typical approach to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	229	regular languages, taken for example by \citeN{HopcroftUllman69}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	230	and by \citeN{Kozen97}, is to introduce finite deterministic automata and then
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	231	define most notions in terms of them. For example, a regular language is
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	232	normally defined as:
59 fc35eb54fdc9 more on the intro urbanc parents: 58 diff changeset	233
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	234	\begin{dfntn}\label{baddef}
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	235	A language @{text A} is \emph{regular}, provided there is a
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	236	finite deterministic automaton that recognises all strings of @{text "A"}.
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	237	\end{dfntn}
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	238
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	239	\noindent
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	240	This approach has many benefits. Among them is the fact that it is easy to
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	241	convince oneself that regular languages are closed under complementation:
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	242	one just has to exchange the accepting and non-accepting states in the
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	243	corresponding automaton to obtain an automaton for the complement language.
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	244	The problem, however, lies with formalising such reasoning in a
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	245	theorem prover. Automata are built up from states and transitions that are
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	246	commonly represented as graphs, matrices or functions, none of which, unfortunately,
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	247	can be defined as an inductive datatype.
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	248
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	249	In case of graphs and matrices, this means we have to build our own
14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	250	reasoning infrastructure for them, as neither Isabelle/HOL nor HOL4 nor
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	251	HOLlight support them with libraries. Also, reasoning about graphs and
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	252	matrices can be a hassle in theorem provers, because
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	253	we have to be able to combine automata. Consider for
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	254	example the operation of sequencing two automata, say $A_1$ and $A_2$, by
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	255	connecting the accepting states of $A_1$ to the initial state of $A_2$:
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	256
60 fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	257	\begin{center}
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	258	\begin{tabular}{ccc}
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	259	\begin{tikzpicture}[scale=1]
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	260	%\draw[step=2mm] (-1,-1) grid (1,1);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	261
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	262	\draw[rounded corners=1mm, very thick] (-1.0,-0.3) rectangle (-0.2,0.3);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	263	\draw[rounded corners=1mm, very thick] ( 0.2,-0.3) rectangle ( 1.0,0.3);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	264
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	265	\node (A) at (-1.0,0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	266	\node (B) at ( 0.2,0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	267
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	268	\node (C) at (-0.2, 0.13) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	269	\node (D) at (-0.2,-0.13) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	270
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	271	\node (E) at (1.0, 0.2) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	272	\node (F) at (1.0,-0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	273	\node (G) at (1.0,-0.2) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	274
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	275	\draw (-0.6,0.0) node {\small$A_1$};
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	276	\draw ( 0.6,0.0) node {\small$A_2$};
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	277	\end{tikzpicture}
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	278
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	279	&
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	280
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	281	\raisebox{2.1mm}{\bf\Large$\;\;\;\Rightarrow\,\;\;$}
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	282
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	283	&
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	284
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	285	\begin{tikzpicture}[scale=1]
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	286	%\draw[step=2mm] (-1,-1) grid (1,1);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	287
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	288	\draw[rounded corners=1mm, very thick] (-1.0,-0.3) rectangle (-0.2,0.3);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	289	\draw[rounded corners=1mm, very thick] ( 0.2,-0.3) rectangle ( 1.0,0.3);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	290
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	291	\node (A) at (-1.0,0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	292	\node (B) at ( 0.2,0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	293
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	294	\node (C) at (-0.2, 0.13) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	295	\node (D) at (-0.2,-0.13) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	296
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	297	\node (E) at (1.0, 0.2) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	298	\node (F) at (1.0,-0.0) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	299	\node (G) at (1.0,-0.2) [circle, very thick, draw, fill=white, inner sep=0.4mm] {};
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	300
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	301	\draw (C) to [very thick, bend left=45] (B);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	302	\draw (D) to [very thick, bend right=45] (B);
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	303
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	304	\draw (-0.6,0.0) node {\small$A_1$};
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	305	\draw ( 0.6,0.0) node {\small$A_2$};
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	306	\end{tikzpicture}
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	307
828ea293b61f more on the introduction urbanc parents: 61 diff changeset	308	\end{tabular}
60 fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	309	\end{center}
fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	310
fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	311	\noindent
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	312	On `paper' we can define the corresponding
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	313	graph in terms of the disjoint
88 1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	314	union of the state nodes. Unfortunately in HOL, the standard definition for disjoint
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	315	union, namely
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	316	%
14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	317	\begin{equation}\label{disjointunion}
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	318	@{text "A\<^isub>1 \<uplus> A\<^isub>2 \<equiv> {(1, x) \| x \<in> A\<^isub>1} \<union> {(2, y) \| y \<in> A\<^isub>2}"}
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	319	\end{equation}
60 fb08f41ca33d a bit more tuning on the introduction urbanc parents: 59 diff changeset	320
61 070f543e2560 more to the intro urbanc parents: 60 diff changeset	321	\noindent
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	322	changes the type---the disjoint union is not a set, but a set of
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	323	pairs. Using this definition for disjoint union means we do not have a
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	324	single type for the states of automata. As a result we will not be able to
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	325	define a regular language as one for which there exists
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	326	an automaton that recognises all its strings (Definition~\ref{baddef}). This
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	327	is because we cannot make a definition in HOL that is only polymorphic in
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	328	the state type, but not in the predicate for regularity; and there is no
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	329	type quantification available in HOL (unlike in Coq, for
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	330	example).\footnote{Slind already pointed out this problem in an email to the
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	331	HOL4 mailing list on 21st April 2005.}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	332	%$^,$\footnote{While in Coq one can avoid
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	333	%this particular problem, all other difficulties we point out below still apply.}
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	334
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	335	An alternative, which provides us with a single type for states of automata,
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	336	is to give every state node an identity, for example a natural number, and
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	337	then be careful to rename these identities apart whenever connecting two
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	338	automata. This results in clunky proofs establishing that properties are
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	339	invariant under renaming. Similarly, connecting two automata represented as
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	340	matrices results in messy constructions, which are not pleasant to
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	341	formally reason about. \citeN[Page 67]{Braibant12}, for example, writes that there are no
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	342	problems with reasoning about matrices, but that there is an
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	343	``intrinsic difficulty of working with rectangular matrices'' in some
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	344	parts of his formalisation of formal languages in Coq.
66 828ea293b61f more on the introduction urbanc parents: 61 diff changeset	345
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	346	Functions are much better supported in Isabelle/HOL, but they still lead to
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	347	similar problems as with graphs. Composing, for example, two
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	348	non-deterministic automata in parallel requires also the formalisation of
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	349	disjoint unions. \citeN{Nipkow98} dismisses for this the option of
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	350	using identities, because it leads according to him to ``messy
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	351	proofs''. Since he does not need to define what regular languages are,
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	352	Nipkow opts for a variant of \eqref{disjointunion} using bit lists, but
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	353	writes\smallskip
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	354
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	355
14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	356	\begin{quote}
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	357	\it%
2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	358	\begin{tabular}{@ {}l@ {}p{0.88\textwidth}@ {}}
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	359	`` & All lemmas appear obvious given a picture of the composition of automata\ldots
d3fe0597080a updated paper urbanc parents: 100 diff changeset	360	Yet their proofs require a painful amount of detail.''
d3fe0597080a updated paper urbanc parents: 100 diff changeset	361	\end{tabular}
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	362	\end{quote}\smallskip
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	363
d3fe0597080a updated paper urbanc parents: 100 diff changeset	364	\noindent
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	365	and\smallskip
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	366
d3fe0597080a updated paper urbanc parents: 100 diff changeset	367	\begin{quote}
d3fe0597080a updated paper urbanc parents: 100 diff changeset	368	\it%
d3fe0597080a updated paper urbanc parents: 100 diff changeset	369	\begin{tabular}{@ {}l@ {}p{0.88\textwidth}@ {}}
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	370	`` & If the reader finds the above treatment in terms of bit lists revoltingly
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	371	concrete, I cannot disagree. A more abstract approach is clearly desirable.''
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	372	\end{tabular}
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	373	\end{quote}\smallskip
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	374
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	375	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	376	%Moreover, it is not so clear how to conveniently impose a finiteness
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	377	%condition upon functions in order to represent \emph{finite} automata. The
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	378	%best is probably to resort to more advanced reasoning frameworks, such as
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	379	%\emph{locales} or \emph{type classes}, which are \emph{not} available in all
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	380	%HOL-based theorem provers.
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	381
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	382	Because of these problems to do with representing automata, formalising
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	383	automata theory is surprisingly not as easy as one might think, despite the
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	384	sometimes very detailed, but informal, textbook proofs. \citeN{LammichTuerk12}
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	385	formalised Hopcroft's algorithm using an automata library of 27 kloc in
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	386	Isabelle/HOL. There they use matrices for representing automata.
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	387	Functions are used by \citeN{Nipkow98}, who establishes
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	388	the link between regular expressions and automata in the context of
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	389	lexing. \citeN{BerghoferReiter09} use functions as well for formalising automata
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	390	working over bit strings in the context of Presburger arithmetic.
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	391	A Larger formalisation of automata theory, including the Myhill-Nerode theorem,
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	392	was carried out in Nuprl by \citeN{Constable00} which also uses functions.
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	393	Other large formailsations of automata theory in Coq are by
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	394	\citeN{Filliatre97} who essentially uses graphs and by \citeN{Almeidaetal10}
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	395	and \citeN{Braibant12}, who both use matrices. Many of these works,
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	396	like \citeN{Nipkow98} or \citeN{Braibant12}, mention intrinsic problems with
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	397	their representation of
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	398	automata which made them `fight' their respective theorem prover.
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	399
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	400	%Also, one might consider automata as just convenient `vehicles' for
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	401	%establishing properties about regular languages. However, paper proofs
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	402	%about automata often involve subtle side-conditions which are easily
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	403	%overlooked, but which make formal reasoning rather painful. For example
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	404	%Kozen's proof of the Myhill-Nerode Theorem requires that automata do not
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	405	%have inaccessible states \cite{Kozen97}. Another subtle side-condition is
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	406	%completeness of automata, that is automata need to have total transition
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	407	%functions and at most one `sink' state from which there is no connection to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	408	%a final state (Brzozowski mentions this side-condition in the context of
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	409	%state complexity of automata \cite{Brzozowski10}, but it is also needed
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	410	%in the usual construction of the complement automaton). Such side-conditions mean
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	411	%that if we define a regular language as one for which there exists \emph{a}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	412	%finite automaton that recognises all its strings (see
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	413	%Definition~\ref{baddef}), then we need a lemma which ensures that another
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	414	%equivalent one can be found satisfying the side-condition, and also need to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	415	%make sure our operations on automata preserve them. Unfortunately, such
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	416	%`little' and `obvious' lemmas make formalisations of automata theory
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	417	%hair-pulling experiences.
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	418
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	419	In this paper, we will not attempt to formalise automata theory in
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	420	Isabelle/HOL nor will we attempt to formalise automata proofs from the
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	421	literature, but take a different approach to regular languages than is
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	422	usually taken. Instead of defining a regular language as one where there
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	423	exists an automaton that recognises all its strings, we define a
82 14b12b5de6d3 added coments about functions urbanc parents: 79 diff changeset	424	regular language as:
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	425
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	426	\begin{dfntn}\label{regular}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	427	A language @{text A} is \emph{regular}, provided there is a regular expression
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	428	that matches all strings of @{text "A"}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	429	\end{dfntn}
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	430
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	431	\noindent
258 1abf8586ee6b added slides for a talk in St Andrews urbanc parents: 257 diff changeset	432	And then `forget' automata completely.
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	433	The reason is that regular expressions %, unlike graphs, matrices and
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	434	%functions,
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	435	can be defined as an inductive datatype and a reasoning
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	436	infrastructure for them (like induction and recursion) comes for free in
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	437	HOL. %Moreover, no side-conditions will be needed for regular expressions,
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	438	%like we need for automata.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	439	This convenience of regular expressions has
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	440	recently been exploited in HOL4 with a formalisation of regular expression
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	441	matching based on derivatives by \citeN{OwensSlind08}, and with an equivalence
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	442	checker for regular expressions in Isabelle/HOL by \citeN{KraussNipkow11}
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	443	and in Matida by \citeN{Asperti12} and in Coq by \citeN{CoquandSiles12}. The
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	444	main purpose of this paper is to show that a central result about regular
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	445	languages---the Myhill-Nerode Theorem---can be recreated by only using
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	446	regular expressions. This theorem gives necessary and sufficient conditions
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	447	for when a language is regular. As a corollary of this theorem we can easily
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	448	establish the usual closure properties, including complementation, for
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	449	regular languages. We use the Continuation Lemma, which is also a corollary
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	450	of the Myhill-Nerode Theorem, for establishing
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	451	the non-regularity of the language @{text "a\<^isup>nb\<^isup>n"}.\medskip
61 070f543e2560 more to the intro urbanc parents: 60 diff changeset	452
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	453	\noindent
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	454	{\bf Contributions:} There is an extensive literature on regular languages.
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	455	To our best knowledge, our proof of the Myhill-Nerode Theorem is the first
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	456	that is based on regular expressions, only. The part of this theorem stating
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	457	that finitely many partitions imply regularity of the language is proved by
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	458	an argument about solving equational systems. This argument appears to be
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	459	folklore. For the other part, we give two proofs: one direct proof using
edc642266a82 polished the introduction urbanc parents: 174 diff changeset	460	certain tagging-functions, and another indirect proof using Antimirov's
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	461	partial derivatives \citeyear{Antimirov95}. Again to our best knowledge, the
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	462	tagging-functions have not been used before for establishing the Myhill-Nerode
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	463	Theorem. Derivatives of regular expressions have been used recently quite
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	464	widely in the literature; partial derivatives, in contrast, attract much
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	465	less attention. However, partial derivatives are more suitable in the
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	466	context of the Myhill-Nerode Theorem, since it is easier to establish
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	467	formally their finiteness result. We are not aware of any proof that uses
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	468	either of them for proving the Myhill-Nerode Theorem.
24 f72c82bf59e5 added paper urbanc parents: diff changeset	469	*}
f72c82bf59e5 added paper urbanc parents: diff changeset	470
50 32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	471	section {* Preliminaries *}
32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	472
32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	473	text {*
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	474	\noindent
67 7478be786f87 more intro urbanc parents: 66 diff changeset	475	Strings in Isabelle/HOL are lists of characters with the \emph{empty string}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	476	being represented by the empty list, written @{term "[]"}. We assume there
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	477	are only finitely many different characters. \emph{Languages} are sets of
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	478	strings. The language containing all strings is written in Isabelle/HOL as
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	479	@{term "UNIV::string set"}. The concatenation of two languages is written
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	480	@{term "A \<cdot> B"} and a language raised to the power @{text n} is written
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	481	@{term "A \<up> n"}. They are defined as usual
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	482
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	483	\begin{center}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	484	\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	485	@{thm (lhs) conc_def'[THEN eq_reflection, where A1="A" and B1="B"]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	486	& @{text "\<equiv>"} & @{thm (rhs) conc_def'[THEN eq_reflection, where A1="A" and B1="B"]}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	487
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	488	@{thm (lhs) lang_pow.simps(1)[THEN eq_reflection, where A1="A"]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	489	& @{text "\<equiv>"} & @{thm (rhs) lang_pow.simps(1)[THEN eq_reflection, where A1="A"]}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	490
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	491	@{thm (lhs) lang_pow.simps(2)[THEN eq_reflection, where A1="A" and n1="n"]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	492	& @{text "\<equiv>"} & @{thm (rhs) lang_pow.simps(2)[THEN eq_reflection, where A1="A" and n1="n"]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	493	\end{tabular}
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	494	\end{center}
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	495
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	496	\noindent
113 ec774952190c polished everything urbanc parents: 112 diff changeset	497	where @{text "@"} is the list-append operation. The Kleene-star of a language @{text A}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	498	is defined as the union over all powers, namely @{thm star_def[where A="A", THEN eq_reflection]}.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	499	In the paper
88 1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	500	we will make use of the following properties of these constructions.
58 0d4d5bb321dc a little bit in the introduction urbanc parents: 54 diff changeset	501
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	502	\begin{prpstn}\label{langprops}\mbox{}\\
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	503	\begin{tabular}{@ {}lp{10cm}}
180 b755090d0f3d added a picture urbanc parents: 179 diff changeset	504	(i) & @{thm star_unfold_left} \\
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	505	(ii) & @{thm[mode=IfThen] pow_length}\\
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	506	(iii) & @{thm conc_Union_left} \\
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	507	(iv) & If @{thm (prem 1) star_decom} and @{thm (prem 2) star_decom} then
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	508	there exists an @{text "x\<^isub>p"} and @{text "x\<^isub>s"} with @{text "x = x\<^isub>p @ x\<^isub>s"}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	509	and @{term "x\<^isub>p \<noteq> []"} such that @{term "x\<^isub>p \<in> A"} and @{term "x\<^isub>s \<in> A\<star>"}.
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	510	\end{tabular}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	511	\end{prpstn}
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	512
426070e68b21 more on the paper urbanc parents: 70 diff changeset	513	\noindent
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	514	In @{text "(ii)"} we use the notation @{term "length s"} for the length of a
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	515	string; this property states that if \mbox{@{term "[] \<notin> A"}} then the lengths of
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	516	the strings in @{term "A \<up> (Suc n)"} must be longer than @{text n}.
b73478aaf33e more on paper urbanc parents: 187 diff changeset	517	Property @{text "(iv)"} states that a non-empty string in @{term "A\<star>"} can
b73478aaf33e more on paper urbanc parents: 187 diff changeset	518	always be split up into a non-empty prefix belonging to @{text "A"} and the
b73478aaf33e more on paper urbanc parents: 187 diff changeset	519	rest being in @{term "A\<star>"}. We omit
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	520	the proofs for these properties, but invite the reader to consult our
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	521	formalisation.\footnote{Available under \citeN{myhillnerodeafp11} in the Archive of Formal Proofs at\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	522	\url{http://afp.sourceforge.net/entries/Myhill-Nerode.shtml}.}
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	523
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	524	The notation in Isabelle/HOL for the quotient of a language @{text A}
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	525	according to an equivalence relation @{term REL} is @{term "A // REL"}. We
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	526	will write @{text "\<lbrakk>x\<rbrakk>\<^isub>\<approx>"} for the equivalence class defined as
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	527	\mbox{@{text "{y \| y \<approx> x}"}}, and have @{text "x \<approx> y"} if and only if @{text
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	528	"\<lbrakk>x\<rbrakk>\<^isub>\<approx> = \<lbrakk>y\<rbrakk>\<^isub>\<approx>"}.
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	529
426070e68b21 more on the paper urbanc parents: 70 diff changeset	530
51 6cfb92de4654 some tuning of the paper urbanc parents: 50 diff changeset	531	Central to our proof will be the solution of equational systems
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	532	involving equivalence classes of languages. For this we will use Arden's Lemma
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	533	(see for example \cite[Page 100]{Sakarovitch09}),
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	534	which solves equations of the form @{term "X = A \<cdot> X \<union> B"} provided
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	535	@{term "[] \<notin> A"}. However we will need the following `reversed'
9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	536	version of Arden's Lemma (`reversed' in the sense of changing the order of @{term "A \<cdot> X"} to
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	537	\mbox{@{term "X \<cdot> A"}}).\footnote{The details of the proof for the reversed Arden's Lemma
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	538	are given in the Appendix.}
50 32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	539
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	540	\begin{lmm}[(Reversed Arden's Lemma)]\label{arden}\mbox{}\\
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	541	If @{thm (prem 1) reversed_Arden} then
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	542	@{thm (lhs) reversed_Arden} if and only if
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	543	@{thm (rhs) reversed_Arden}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	544	\end{lmm}
50 32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	545
67 7478be786f87 more intro urbanc parents: 66 diff changeset	546	\noindent
88 1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	547	Regular expressions are defined as the inductive datatype
67 7478be786f87 more intro urbanc parents: 66 diff changeset	548
7478be786f87 more intro urbanc parents: 66 diff changeset	549	\begin{center}
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	550	\begin{tabular}{rcl}
6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	551	@{text r} & @{text "::="} & @{term ZERO}\\
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	552	& @{text"\|"} & @{term One}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	553	& @{text"\|"} & @{term "Atom c"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	554	& @{text"\|"} & @{term "Times r r"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	555	& @{text"\|"} & @{term "Plus r r"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	556	& @{text"\|"} & @{term "Star r"}
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	557	\end{tabular}
67 7478be786f87 more intro urbanc parents: 66 diff changeset	558	\end{center}
7478be786f87 more intro urbanc parents: 66 diff changeset	559
7478be786f87 more intro urbanc parents: 66 diff changeset	560	\noindent
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	561	and the language matched by a regular expression is defined by recursion as
67 7478be786f87 more intro urbanc parents: 66 diff changeset	562
7478be786f87 more intro urbanc parents: 66 diff changeset	563	\begin{center}
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	564	\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l}
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	565	@{thm (lhs) lang.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) lang.simps(1)}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	566	@{thm (lhs) lang.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) lang.simps(2)}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	567	@{thm (lhs) lang.simps(3)[where a="c"]} & @{text "\<equiv>"} & @{thm (rhs) lang.simps(3)[where a="c"]}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	568	@{thm (lhs) lang.simps(4)[where ?r="r\<^isub>1" and ?s="r\<^isub>2"]} & @{text "\<equiv>"} &
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	569	@{thm (rhs) lang.simps(4)[where ?r="r\<^isub>1" and ?s="r\<^isub>2"]}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	570	@{thm (lhs) lang.simps(5)[where ?r="r\<^isub>1" and ?s="r\<^isub>2"]} & @{text "\<equiv>"} &
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	571	@{thm (rhs) lang.simps(5)[where ?r="r\<^isub>1" and ?s="r\<^isub>2"]}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	572	@{thm (lhs) lang.simps(6)[where r="r"]} & @{text "\<equiv>"} &
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	573	@{thm (rhs) lang.simps(6)[where r="r"]}\\
67 7478be786f87 more intro urbanc parents: 66 diff changeset	574	\end{tabular}
7478be786f87 more intro urbanc parents: 66 diff changeset	575	\end{center}
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	576
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	577	Given a finite set of regular expressions @{text rs}, we will make use of the operation of generating
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	578	a regular expression that matches the union of all languages of @{text rs}.
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	579	This definion is not trivial in a theorem prover, because @{text rs} (being a set) is unordered,
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	580	but the regular expression needs an order. Since
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	581	we only need to know the
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	582	existence
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	583	of such a regular expression, we can use Isabelle/HOL's @{const "fold_graph"} and Hilbert's
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	584	choice operator, written @{text "SOME"} in Isabelle/HOL, for defining @{term "\<Uplus>rs"}.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	585	This operation, roughly speaking, folds @{const PLUS} over the
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	586	set @{text rs} with @{const ZERO} for the empty set. We can prove that for a finite set @{text rs}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	587	%
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	588	\begin{equation}\label{uplus}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	589	\mbox{@{thm (lhs) folds_plus_simp} @{text "= \<Union> (\<calL> ` rs)"}}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	590	\end{equation}
88 1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	591
1436fc451bb9 added something about Setalt and folds urbanc parents: 86 diff changeset	592	\noindent
90 97b783438316 added an example urbanc parents: 89 diff changeset	593	holds, whereby @{text "\<calL> ` rs"} stands for the
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	594	image of the set @{text rs} under function @{text "\<calL>"} defined as
b73478aaf33e more on paper urbanc parents: 187 diff changeset	595
b73478aaf33e more on paper urbanc parents: 187 diff changeset	596	\begin{center}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	597	@{term "lang ` rs \<equiv> {lang r \| r. r \<in> rs}"}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	598	\end{center}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	599
b73478aaf33e more on paper urbanc parents: 187 diff changeset	600	\noindent
b73478aaf33e more on paper urbanc parents: 187 diff changeset	601	In what follows we shall use this convenient short-hand notation for images of sets
b73478aaf33e more on paper urbanc parents: 187 diff changeset	602	also with other functions.
50 32bff8310071 revised proof of Ardens lemma urbanc parents: 39 diff changeset	603	*}
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	604
133 3ab755a96cef minor change urbanc parents: 132 diff changeset	605	section {* The Myhill-Nerode Theorem, First Part *}
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	606
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	607	text {*
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	608	\noindent
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	609	The key definition in the Myhill-Nerode Theorem is the
47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	610	\emph{Myhill-Nerode Relation}, which states that w.r.t.~a language two
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	611	strings are related, provided there is no distinguishing extension in this
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	612	language. This can be defined as a ternary relation.
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	613
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	614	\begin{dfntn}[(Myhill-Nerode Relation)]\label{myhillneroderel}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	615	Given a language @{text A}, two strings @{text x} and
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	616	@{text y} are Myhill-Nerode related provided
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	617	\begin{center}
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	618	@{thm str_eq_def'}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	619	\end{center}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	620	\end{dfntn}
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	621
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	622	\noindent
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	623	It is easy to see that @{term "\<approx>A"} is an equivalence relation, which
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	624	partitions the set of all strings, @{text "UNIV"}, into a set of disjoint
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	625	equivalence classes. To illustrate this quotient construction, let us give a simple
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	626	example: consider the regular language containing just
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	627	the string @{text "[c]"}. The relation @{term "\<approx>({[c]})"} partitions @{text UNIV}
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	628	into three equivalence classes @{text "X\<^isub>1"}, @{text "X\<^isub>2"} and @{text "X\<^isub>3"}
90 97b783438316 added an example urbanc parents: 89 diff changeset	629	as follows
97b783438316 added an example urbanc parents: 89 diff changeset	630
97b783438316 added an example urbanc parents: 89 diff changeset	631	\begin{center}
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	632	\begin{tabular}{l}
6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	633	@{text "X\<^isub>1 = {[]}"}\\
6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	634	@{text "X\<^isub>2 = {[c]}"}\\
90 97b783438316 added an example urbanc parents: 89 diff changeset	635	@{text "X\<^isub>3 = UNIV - {[], [c]}"}
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	636	\end{tabular}
90 97b783438316 added an example urbanc parents: 89 diff changeset	637	\end{center}
97b783438316 added an example urbanc parents: 89 diff changeset	638
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	639	One direction of the Myhill-Nerode Theorem establishes
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	640	that if there are finitely many equivalence classes, like in the example above, then
2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	641	the language is regular. In our setting we therefore have to show:
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	642
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	643	\begin{thrm}\label{myhillnerodeone}
96 3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	644	@{thm[mode=IfThen] Myhill_Nerode1}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	645	\end{thrm}
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	646
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	647	\noindent
90 97b783438316 added an example urbanc parents: 89 diff changeset	648	To prove this theorem, we first define the set @{term "finals A"} as those equivalence
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	649	classes from @{term "UNIV // \<approx>A"} that contain strings of @{text A}, namely
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	650	%
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	651	\begin{equation}
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	652	@{thm finals_def}
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	653	\end{equation}
426070e68b21 more on the paper urbanc parents: 70 diff changeset	654
426070e68b21 more on the paper urbanc parents: 70 diff changeset	655	\noindent
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	656	In our running example, @{text "X\<^isub>2"} is the only
f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	657	equivalence class in @{term "finals {[c]}"}.
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	658	It is straightforward to show that in general
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	659	%
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	660	\begin{equation}\label{finalprops}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	661	@{thm lang_is_union_of_finals}\hspace{15mm}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	662	@{thm finals_in_partitions}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	663	\end{equation}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	664
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	665	\noindent
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	666	hold.
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	667	Therefore if we know that there exists a regular expression for every
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	668	equivalence class in \mbox{@{term "finals A"}} (which by assumption must be
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	669	a finite set), then we can use @{text "\<bigplus>"} to obtain a regular expression
98 36f9d19be0e6 included comments by Xingyuan urbanc parents: 96 diff changeset	670	that matches every string in @{text A}.
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	671
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	672
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	673	Our proof of Theorem~\ref{myhillnerodeone} relies on a method that can calculate a
79 bba9c80735f9 started to define things more directly urbanc parents: 77 diff changeset	674	regular expression for \emph{every} equivalence class, not just the ones
77 63bc9f9d96ba small additions urbanc parents: 75 diff changeset	675	in @{term "finals A"}. We
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	676	first define the notion of \emph{one-character-transition} between
2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	677	two equivalence classes
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	678	%
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	679	\begin{equation}
426070e68b21 more on the paper urbanc parents: 70 diff changeset	680	@{thm transition_def}
426070e68b21 more on the paper urbanc parents: 70 diff changeset	681	\end{equation}
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	682
71 426070e68b21 more on the paper urbanc parents: 70 diff changeset	683	\noindent
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	684	which means that if we append the character @{text c} to the end of all
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	685	strings in the equivalence class @{text Y}, we obtain a subset of
77 63bc9f9d96ba small additions urbanc parents: 75 diff changeset	686	@{text X}. Note that we do not define an automaton here, we merely relate two sets
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	687	(with the help of a character). In our concrete example we have
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	688	@{term "X\<^isub>1 \<Turnstile>c\<Rightarrow> X\<^isub>2"}, @{term "X\<^isub>1 \<Turnstile>d\<^isub>i\<Rightarrow> X\<^isub>3"} with @{text "d\<^isub>i"} being any
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	689	other character than @{text c}, and @{term "X\<^isub>3 \<Turnstile>c\<^isub>j\<Rightarrow> X\<^isub>3"} for any
194 5347d7556487 some typos urbanc parents: 193 diff changeset	690	character @{text "c\<^isub>j"}.
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	691
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	692	Next we construct an \emph{initial equational system} that
fd39492b187c a few more changes urbanc parents: 154 diff changeset	693	contains an equation for each equivalence class. We first give
fd39492b187c a few more changes urbanc parents: 154 diff changeset	694	an informal description of this construction. Suppose we have
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	695	the equivalence classes @{text "X\<^isub>1,\<dots>,X\<^isub>n"}, there must be one and only one that
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	696	contains the empty string @{text "[]"} (since equivalence classes are disjoint).
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	697	Let us assume @{text "[] \<in> X\<^isub>1"}. We build the following initial equational system
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	698
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	699	\begin{center}
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	700	\begin{tabular}{rcl}
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	701	@{text "X\<^isub>1"} & @{text "="} & @{text "(Y\<^isub>1\<^isub>1, ATOM c\<^isub>1\<^isub>1) + \<dots> + (Y\<^isub>1\<^isub>p, ATOM c\<^isub>1\<^isub>p) + \<lambda>(ONE)"} \\
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	702	@{text "X\<^isub>2"} & @{text "="} & @{text "(Y\<^isub>2\<^isub>1, ATOM c\<^isub>2\<^isub>1) + \<dots> + (Y\<^isub>2\<^isub>o, ATOM c\<^isub>2\<^isub>o)"} \\
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	703	& $\vdots$ \\
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	704	@{text "X\<^isub>n"} & @{text "="} & @{text "(Y\<^isub>n\<^isub>1, ATOM c\<^isub>n\<^isub>1) + \<dots> + (Y\<^isub>n\<^isub>q, ATOM c\<^isub>n\<^isub>q)"}\\
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	705	\end{tabular}
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	706	\end{center}
70 8ab3a06577cf slightly more on the paper urbanc parents: 67 diff changeset	707
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	708	\noindent
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	709	where the terms @{text "(Y\<^isub>i\<^isub>j, ATOM c\<^isub>i\<^isub>j)"} are pairs consiting of an equivalence class and
e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	710	a regular expression. In the initial equational system, they
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	711	stand for all transitions @{term "Y\<^isub>i\<^isub>j \<Turnstile>c\<^isub>i\<^isub>j\<Rightarrow>
159 990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	712	X\<^isub>i"}.
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	713	%The intuition behind the equational system is that every
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	714	%equation @{text "X\<^isub>i = rhs\<^isub>i"} in this system
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	715	%corresponds roughly to a state of an automaton whose name is @{text X\<^isub>i} and its predecessor states
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	716	%are the @{text "Y\<^isub>i\<^isub>j"}; the @{text "c\<^isub>i\<^isub>j"} are the labels of the transitions from these
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	717	%predecessor states to @{text X\<^isub>i}.
990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	718	There can only be
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	719	finitely many terms of the form @{text "(Y\<^isub>i\<^isub>j, ATOM c\<^isub>i\<^isub>j)"} in a right-hand side
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	720	since by assumption there are only finitely many
159 990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	721	equivalence classes and only finitely many characters.
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	722	The term @{text "\<lambda>(ONE)"} in the first equation acts as a marker for the initial state, that
159 990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	723	is the equivalence class
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	724	containing the empty string @{text "[]"}.\footnote{Note that we mark, roughly speaking, the
115 c5f138b5fc88 added comment from Larry urbanc parents: 114 diff changeset	725	single `initial' state in the equational system, which is different from
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	726	the method by \citeN{Brzozowski64}, where he marks the
115 c5f138b5fc88 added comment from Larry urbanc parents: 114 diff changeset	727	`terminal' states. We are forced to set up the equational system in our
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	728	way, because the Myhill-Nerode Relation determines the `direction' of the
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	729	transitions---the successor `state' of an equivalence class @{text Y} can
23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	730	be reached by adding a character to the end of @{text Y}. This is also the
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	731	reason why we have to use our reversed version of Arden's Lemma.}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	732	In our running example we have the initial equational system
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	733	%
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	734	\begin{equation}\label{exmpcs}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	735	\mbox{\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	736	@{term "X\<^isub>1"} & @{text "="} & @{text "\<lambda>(ONE)"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	737	@{term "X\<^isub>2"} & @{text "="} & @{text "(X\<^isub>1, ATOM c)"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	738	@{term "X\<^isub>3"} & @{text "="} & @{text "(X\<^isub>1, ATOM d\<^isub>1) + \<dots> + (X\<^isub>1, ATOM d\<^isub>n)"}\\
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	739	& & \mbox{}\hspace{10mm}@{text "+ (X\<^isub>3, ATOM c\<^isub>1) + \<dots> + (X\<^isub>3, ATOM c\<^isub>m)"}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	740	\end{tabular}}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	741	\end{equation}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	742
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	743	\noindent
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	744	where @{text "d\<^isub>1\<dots>d\<^isub>n"} is the sequence of all characters
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	745	but not containing @{text c}, and @{text "c\<^isub>1\<dots>c\<^isub>m"} is the sequence of all
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	746	characters.
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	747
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	748	Overloading the function @{text \<calL>} for the two kinds of terms in the
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	749	equational system, we have
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	750
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	751	\begin{center}
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	752	@{text "\<calL>(Y, r) \<equiv>"} %
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	753	@{thm (rhs) lang_trm.simps(2)[where X="Y" and r="r", THEN eq_reflection]}\hspace{10mm}
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	754	@{thm lang_trm.simps(1)[where r="r", THEN eq_reflection]}
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	755	\end{center}
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	756
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	757	\noindent
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	758	and we can prove for @{text "X\<^isub>2\<^isub>.\<^isub>.\<^isub>n"} that the following equations
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	759	%
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	760	\begin{equation}\label{inv1}
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	761	@{text "X\<^isub>i = \<calL>(Y\<^isub>i\<^isub>1, ATOM c\<^isub>i\<^isub>1) \<union> \<dots> \<union> \<calL>(Y\<^isub>i\<^isub>q, ATOM c\<^isub>i\<^isub>q)"}.
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	762	\end{equation}
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	763
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	764	\noindent
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	765	hold. Similarly for @{text "X\<^isub>1"} we can show the following equation
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	766	%
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	767	\begin{equation}\label{inv2}
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	768	@{text "X\<^isub>1 = \<calL>(Y\<^isub>1\<^isub>1, ATOM c\<^isub>1\<^isub>1) \<union> \<dots> \<union> \<calL>(Y\<^isub>1\<^isub>p, ATOM c\<^isub>1\<^isub>p) \<union> \<calL>(\<lambda>(ONE))"}.
75 d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	769	\end{equation}
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	770
d63baacbdb16 parts of the 3 section urbanc parents: 71 diff changeset	771	\noindent
160 ea2e5acbfe4a added comments from Chunhan urbanc parents: 159 diff changeset	772	holds. The reason for adding the @{text \<lambda>}-marker to our initial equational system is
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	773	to obtain this equation: it only holds with the marker, since none of
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	774	the other terms contain the empty string. The point of the initial equational system is
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	775	that solving it means we will be able to extract a regular expression for every equivalence class.
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	776
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	777	Our representation for the equations in Isabelle/HOL are pairs,
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	778	where the first component is an equivalence class (a set of strings)
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	779	and the second component
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	780	is a set of terms. Given a set of equivalence
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	781	classes @{text CS}, our initial equational system @{term "Init CS"} is thus
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	782	formally defined as
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	783	%
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	784	\begin{equation}\label{initcs}
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	785	\mbox{\begin{tabular}{rcl}
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	786	@{thm (lhs) Init_rhs_def} & @{text "\<equiv>"} &
2409827d8eb8 updated urbanc parents: 98 diff changeset	787	@{text "if"}~@{term "[] \<in> X"}\\
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	788	& & @{text "then"}~@{term "{Trn Y (ATOM c) \| Y c. Y \<in> CS \<and> Y \<Turnstile>c\<Rightarrow> X} \<union> {Lam ONE}"}\\
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	789	& & @{text "else"}~@{term "{Trn Y (ATOM c)\| Y c. Y \<in> CS \<and> Y \<Turnstile>c\<Rightarrow> X}"}\\
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	790	@{thm (lhs) Init_def} & @{text "\<equiv>"} & @{thm (rhs) Init_def}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	791	\end{tabular}}
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	792	\end{equation}
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	793
2409827d8eb8 updated urbanc parents: 98 diff changeset	794	\noindent
2409827d8eb8 updated urbanc parents: 98 diff changeset	795	Because we use sets of terms
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	796	for representing the right-hand sides of equations, we can
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	797	prove \eqref{inv1} and \eqref{inv2} more concisely as
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	798	%
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	799	\begin{lmm}\label{inv}
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	800	If @{thm (prem 1) test} then @{text "X = \<Union> \<calL> ` rhs"}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	801	\end{lmm}
77 63bc9f9d96ba small additions urbanc parents: 75 diff changeset	802
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	803	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	804	Our proof of Theorem~\ref{myhillnerodeone} will proceed by transforming the
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	805	initial equational system into one in \emph{solved form} maintaining the invariant
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	806	in Lemma~\ref{inv}. From the solved form we will be able to read
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	807	off the regular expressions.
42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	808
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	809	In order to transform an equational system into solved form, we have two
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	810	operations: one that takes an equation of the form @{text "X = rhs"} and removes
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	811	any recursive occurrences of @{text X} in the @{text rhs} using our variant of Arden's
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	812	Lemma. The other operation takes an equation @{text "X = rhs"}
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	813	and substitutes @{text X} throughout the rest of the equational system
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	814	adjusting the remaining regular expressions appropriately. To define this adjustment
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	815	we define the \emph{append-operation} taking a term and a regular expression as argument
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	816
42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	817	\begin{center}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	818	\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	819	@{thm (lhs) Append_rexp.simps(2)[where X="Y" and r="r\<^isub>1" and rexp="r\<^isub>2", THEN eq_reflection]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	820	& @{text "\<equiv>"} &
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	821	@{thm (rhs) Append_rexp.simps(2)[where X="Y" and r="r\<^isub>1" and rexp="r\<^isub>2", THEN eq_reflection]}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	822	@{thm (lhs) Append_rexp.simps(1)[where r="r\<^isub>1" and rexp="r\<^isub>2", THEN eq_reflection]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	823	& @{text "\<equiv>"} &
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	824	@{thm (rhs) Append_rexp.simps(1)[where r="r\<^isub>1" and rexp="r\<^isub>2", THEN eq_reflection]}
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	825	\end{tabular}
89 42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	826	\end{center}
42af13d194c9 a bit more on the paper urbanc parents: 88 diff changeset	827
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	828	\noindent
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	829	We lift this operation to entire right-hand sides of equations, written as
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	830	@{thm (lhs) Append_rexp_rhs_def[where rexp="r"]}. With this we can define
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	831	the \emph{arden-operation} for an equation of the form @{text "X = rhs"} as:
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	832	%
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	833	\begin{equation}\label{arden_def}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	834	\mbox{\begin{tabular}{rc@ {\hspace{2mm}}r@ {\hspace{1mm}}l}
94 5b12cd0a3b3c latest on the paper urbanc parents: 93 diff changeset	835	@{thm (lhs) Arden_def} & @{text "\<equiv>"}~~\mbox{} & \multicolumn{2}{@ {\hspace{-2mm}}l}{@{text "let"}}\\
5b12cd0a3b3c latest on the paper urbanc parents: 93 diff changeset	836	& & @{text "rhs' ="} & @{term "rhs - {Trn X r \| r. Trn X r \<in> rhs}"} \\
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	837	& & @{text "r' ="} & @{term "Star (\<Uplus> {r. Trn X r \<in> rhs})"}\\
50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	838	& & \multicolumn{2}{@ {\hspace{-2mm}}l}{@{text "in"}~~@{term "Append_rexp_rhs rhs' r'"}}\\
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	839	\end{tabular}}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	840	\end{equation}
93 2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	841
2aa3756dcc9f more on the paper urbanc parents: 92 diff changeset	842	\noindent
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	843	In this definition, we first delete all terms of the form @{text "(X, r)"} from @{text rhs};
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	844	then we calculate the combined regular expressions for all @{text r} coming
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	845	from the deleted @{text "(X, r)"}, and take the @{const Star} of it;
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	846	finally we append this regular expression to @{text rhs'}. If we apply this
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	847	operation to the right-hand side of @{text "X\<^isub>3"} in \eqref{exmpcs}, we obtain
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	848	the equation:
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	849
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	850	\begin{center}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	851	\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	852	@{term "X\<^isub>3"} & @{text "="} &
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	853	@{text "(X\<^isub>1, TIMES (ATOM d\<^isub>1) (STAR \<^raw:\ensuremath{\bigplus}>{ATOM c\<^isub>1,\<dots>, ATOM c\<^isub>m})) + \<dots> "}\\
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	854	& & \mbox{}\hspace{13mm}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	855	@{text "\<dots> + (X\<^isub>1, TIMES (ATOM d\<^isub>n) (STAR \<^raw:\ensuremath{\bigplus}>{ATOM c\<^isub>1,\<dots>, ATOM c\<^isub>m}))"}
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	856	\end{tabular}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	857	\end{center}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	858
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	859
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	860	\noindent
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	861	That means we eliminated the recursive occurrence of @{text "X\<^isub>3"} on the
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	862	right-hand side. Note we used the abbreviation
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	863	@{text "\<^raw:\ensuremath{\bigplus}>{ATOM c\<^isub>1,\<dots>, ATOM c\<^isub>m}"}
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	864	to stand for a regular expression that matches with every character. In
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	865	our algorithm we are only interested in the existence of such a regular expression
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	866	and do not specify it any further.
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	867
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	868	It can be easily seen that the @{text "Arden"}-operation mimics Arden's
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	869	Lemma on the level of equations. To ensure the non-emptiness condition of
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	870	Arden's Lemma we say that a right-hand side is @{text ardenable} provided
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	871
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	872	\begin{center}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	873	@{thm ardenable_def}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	874	\end{center}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	875
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	876	\noindent
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	877	This allows us to prove a version of Arden's Lemma on the level of equations.
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	878
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	879	\begin{lmm}\label{ardenable}
113 ec774952190c polished everything urbanc parents: 112 diff changeset	880	Given an equation @{text "X = rhs"}.
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	881	If @{text "X = \<Union>\<calL> ` rhs"},
179 edacc141060f small improvements urbanc parents: 178 diff changeset	882	@{thm (prem 2) Arden_preserves_soundness}, and
edacc141060f small improvements urbanc parents: 178 diff changeset	883	@{thm (prem 3) Arden_preserves_soundness}, then
135 604518f0127f final polished urbanc parents: 134 diff changeset	884	@{text "X = \<Union>\<calL> ` (Arden X rhs)"}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	885	\end{lmm}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	886
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	887	\noindent
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	888	Our @{text ardenable} condition is slightly stronger than needed for applying Arden's Lemma,
194 5347d7556487 some typos urbanc parents: 193 diff changeset	889	but we can still ensure that it holds throughout our algorithm of transforming equations
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	890	into solved form.
e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	891
e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	892	The \emph{substitution-operation} takes an equation
95 9540c2f2ea77 more things urbanc parents: 94 diff changeset	893	of the form @{text "X = xrhs"} and substitutes it into the right-hand side @{text rhs}.
94 5b12cd0a3b3c latest on the paper urbanc parents: 93 diff changeset	894
5b12cd0a3b3c latest on the paper urbanc parents: 93 diff changeset	895	\begin{center}
95 9540c2f2ea77 more things urbanc parents: 94 diff changeset	896	\begin{tabular}{rc@ {\hspace{2mm}}r@ {\hspace{1mm}}l}
9540c2f2ea77 more things urbanc parents: 94 diff changeset	897	@{thm (lhs) Subst_def} & @{text "\<equiv>"}~~\mbox{} & \multicolumn{2}{@ {\hspace{-2mm}}l}{@{text "let"}}\\
9540c2f2ea77 more things urbanc parents: 94 diff changeset	898	& & @{text "rhs' ="} & @{term "rhs - {Trn X r \| r. Trn X r \<in> rhs}"} \\
9540c2f2ea77 more things urbanc parents: 94 diff changeset	899	& & @{text "r' ="} & @{term "\<Uplus> {r. Trn X r \<in> rhs}"}\\
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	900	& & \multicolumn{2}{@ {\hspace{-2mm}}l}{@{text "in"}~~@{term "rhs' \<union> Append_rexp_rhs xrhs r'"}}\\
95 9540c2f2ea77 more things urbanc parents: 94 diff changeset	901	\end{tabular}
94 5b12cd0a3b3c latest on the paper urbanc parents: 93 diff changeset	902	\end{center}
95 9540c2f2ea77 more things urbanc parents: 94 diff changeset	903
9540c2f2ea77 more things urbanc parents: 94 diff changeset	904	\noindent
134 08afbed1c8c7 chunhan's comments urbanc parents: 133 diff changeset	905	We again delete first all occurrences of @{text "(X, r)"} in @{text rhs}; we then calculate
95 9540c2f2ea77 more things urbanc parents: 94 diff changeset	906	the regular expression corresponding to the deleted terms; finally we append this
9540c2f2ea77 more things urbanc parents: 94 diff changeset	907	regular expression to @{text "xrhs"} and union it up with @{text rhs'}. When we use
9540c2f2ea77 more things urbanc parents: 94 diff changeset	908	the substitution operation we will arrange it so that @{text "xrhs"} does not contain
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	909	any occurrence of @{text X}. For example substituting the first equation in
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	910	\eqref{exmpcs} into the right-hand side of the second, thus eliminating the equivalence
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	911	class @{text "X\<^isub>1"}, gives us the equation
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	912	%
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	913	\begin{equation}\label{exmpresult}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	914	\mbox{\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	915	@{term "X\<^isub>2"} & @{text "="} & @{text "\<lambda>(TIMES ONE (ATOM c))"}
178 c6ebfe052109 added more examles urbanc parents: 177 diff changeset	916	\end{tabular}}
c6ebfe052109 added more examles urbanc parents: 177 diff changeset	917	\end{equation}
96 3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	918
134 08afbed1c8c7 chunhan's comments urbanc parents: 133 diff changeset	919	With these two operations in place, we can define the operation that removes one equation
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	920	from an equational systems @{text ES}. The operation @{const Subst_all}
96 3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	921	substitutes an equation @{text "X = xrhs"} throughout an equational system @{text ES};
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	922	@{const Remove} then completely removes such an equation from @{text ES} by substituting
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	923	it to the rest of the equational system, but first eliminating all recursive occurrences
96 3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	924	of @{text X} by applying @{const Arden} to @{text "xrhs"}.
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	925
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	926	\begin{center}
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	927	\begin{tabular}{rcl}
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	928	@{thm (lhs) Subst_all_def} & @{text "\<equiv>"} & @{thm (rhs) Subst_all_def}\\
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	929	@{thm (lhs) Remove_def} & @{text "\<equiv>"} & @{thm (rhs) Remove_def}
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	930	\end{tabular}
3b9deda4f459 simplified a bit the proof urbanc parents: 95 diff changeset	931	\end{center}
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	932
2409827d8eb8 updated urbanc parents: 98 diff changeset	933	\noindent
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	934	Finally, we can define how an equational system should be solved. For this
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	935	we will need to iterate the process of eliminating equations until only one equation
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	936	will be left in the system. However, we do not just want to have any equation
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	937	as being the last one, but the one involving the equivalence class for
6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	938	which we want to calculate the regular
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	939	expression. Let us suppose this equivalence class is @{text X}.
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	940	Since @{text X} is the one to be solved, in every iteration step we have to pick an
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	941	equation to be eliminated that is different from @{text X}. In this way
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	942	@{text X} is kept to the final step. The choice is implemented using Hilbert's choice
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	943	operator, written @{text SOME} in the definition below.
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	944
2409827d8eb8 updated urbanc parents: 98 diff changeset	945	\begin{center}
2409827d8eb8 updated urbanc parents: 98 diff changeset	946	\begin{tabular}{rc@ {\hspace{4mm}}r@ {\hspace{1mm}}l}
2409827d8eb8 updated urbanc parents: 98 diff changeset	947	@{thm (lhs) Iter_def} & @{text "\<equiv>"}~~\mbox{} & \multicolumn{2}{@ {\hspace{-4mm}}l}{@{text "let"}}\\
2409827d8eb8 updated urbanc parents: 98 diff changeset	948	& & @{text "(Y, yrhs) ="} & @{term "SOME (Y, yrhs). (Y, yrhs) \<in> ES \<and> X \<noteq> Y"} \\
2409827d8eb8 updated urbanc parents: 98 diff changeset	949	& & \multicolumn{2}{@ {\hspace{-4mm}}l}{@{text "in"}~~@{term "Remove ES Y yrhs"}}\\
2409827d8eb8 updated urbanc parents: 98 diff changeset	950	\end{tabular}
2409827d8eb8 updated urbanc parents: 98 diff changeset	951	\end{center}
2409827d8eb8 updated urbanc parents: 98 diff changeset	952
2409827d8eb8 updated urbanc parents: 98 diff changeset	953	\noindent
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	954	The last definition we need applies @{term Iter} over and over until a condition
159 990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	955	@{text Cond} is \emph{not} satisfied anymore. This condition states that there
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	956	are more than one equation left in the equational system @{text ES}. To solve
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	957	an equational system we use Isabelle/HOL's @{text while}-operator as follows:
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	958
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	959	\begin{center}
2409827d8eb8 updated urbanc parents: 98 diff changeset	960	@{thm Solve_def}
2409827d8eb8 updated urbanc parents: 98 diff changeset	961	\end{center}
2409827d8eb8 updated urbanc parents: 98 diff changeset	962
101 d3fe0597080a updated paper urbanc parents: 100 diff changeset	963	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	964	We are not concerned here with the definition of this operator (see
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	965	\cite{BerghoferNipkow00} for example), but note that we
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	966	eliminate in each @{const Iter}-step a single equation, and therefore have a
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	967	well-founded termination order by taking the cardinality of the equational
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	968	system @{text ES}. This enables us to prove properties about our definition
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	969	of @{const Solve} when we `call' it with the equivalence class @{text X} and
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	970	the initial equational system @{term "Init (UNIV // \<approx>A)"} from
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	971	\eqref{initcs} using the principle:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	972	%
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	973	\begin{equation}\label{whileprinciple}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	974	\mbox{\begin{tabular}{l}
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	975	@{term "invariant (Init (UNIV // \<approx>A))"} \\
f460d5f75cb5 updated urbanc parents: 101 diff changeset	976	@{term "\<forall>ES. invariant ES \<and> Cond ES \<longrightarrow> invariant (Iter X ES)"}\\
f460d5f75cb5 updated urbanc parents: 101 diff changeset	977	@{term "\<forall>ES. invariant ES \<and> Cond ES \<longrightarrow> card (Iter X ES) < card ES"}\\
f460d5f75cb5 updated urbanc parents: 101 diff changeset	978	@{term "\<forall>ES. invariant ES \<and> \<not> Cond ES \<longrightarrow> P ES"}\\
f460d5f75cb5 updated urbanc parents: 101 diff changeset	979	\hline
f460d5f75cb5 updated urbanc parents: 101 diff changeset	980	\multicolumn{1}{c}{@{term "P (Solve X (Init (UNIV // \<approx>A)))"}}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	981	\end{tabular}}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	982	\end{equation}
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	983
f460d5f75cb5 updated urbanc parents: 101 diff changeset	984	\noindent
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	985	This principle states that given an invariant (which we will specify below)
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	986	we can prove a property
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	987	@{text "P"} involving @{const Solve}. For this we have to discharge the following
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	988	proof obligations: first the
113 ec774952190c polished everything urbanc parents: 112 diff changeset	989	initial equational system satisfies the invariant; second the iteration
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	990	step @{text "Iter"} preserves the invariant as long as the condition @{term Cond} holds;
113 ec774952190c polished everything urbanc parents: 112 diff changeset	991	third @{text "Iter"} decreases the termination order, and fourth that
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	992	once the condition does not hold anymore then the property @{text P} must hold.
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	993
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	994	The property @{term P} in our proof will state that @{term "Solve X (Init (UNIV // \<approx>A))"}
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	995	returns with a single equation @{text "X = xrhs"} for some @{text "xrhs"}, and
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	996	that this equational system still satisfies the invariant. In order to get
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	997	the proof through, the invariant is composed of the following six properties:
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	998
f460d5f75cb5 updated urbanc parents: 101 diff changeset	999	\begin{center}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1000	\begin{tabular}{@ {}rcl@ {\hspace{-13mm}}l @ {}}
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1001	@{text "invariant ES"} & @{text "\<equiv>"} &
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	1002	@{term "finite ES"} & @{text "(finiteness)"}\\
f460d5f75cb5 updated urbanc parents: 101 diff changeset	1003	& @{text "\<and>"} & @{thm (rhs) finite_rhs_def} & @{text "(finiteness rhs)"}\\
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1004	& @{text "\<and>"} & @{text "\<forall>(X, rhs)\<in>ES. X = \<Union>\<calL> ` rhs"} & @{text "(soundness)"}\\
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	1005	& @{text "\<and>"} & @{thm (rhs) distinctness_def}\\
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1006	& & & @{text "(distinctness)"}\\
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1007	& @{text "\<and>"} & @{thm (rhs) ardenable_all_def} & @{text "(ardenable)"}\\
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	1008	& @{text "\<and>"} & @{thm (rhs) validity_def} & @{text "(validity)"}\\
103 f460d5f75cb5 updated urbanc parents: 101 diff changeset	1009	\end{tabular}
f460d5f75cb5 updated urbanc parents: 101 diff changeset	1010	\end{center}
f460d5f75cb5 updated urbanc parents: 101 diff changeset	1011
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1012	\noindent
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1013	The first two ensure that the equational system is always finite (number of equations
160 ea2e5acbfe4a added comments from Chunhan urbanc parents: 159 diff changeset	1014	and number of terms in each equation); the third makes sure the `meaning' of the
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1015	equations is preserved under our transformations. The other properties are a bit more
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1016	technical, but are needed to get our proof through. Distinctness states that every
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	1017	equation in the system is distinct. @{text Ardenable} ensures that we can always
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	1018	apply the @{text Arden} operation.
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1019	The last property states that every @{text rhs} can only contain equivalence classes
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1020	for which there is an equation. Therefore @{text lhss} is just the set containing
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1021	the first components of an equational system,
212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1022	while @{text "rhss"} collects all equivalence classes @{text X} in the terms of the
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1023	form @{term "Trn X r"}. That means formally @{thm (lhs) lhss_def}~@{text "\<equiv> {X \| (X, rhs) \<in> ES}"}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1024	and @{thm (lhs) rhss_def}~@{text "\<equiv> {X \| (X, r) \<in> rhs}"}.
108 212bfa431fa5 filled details in one place urbanc parents: 107 diff changeset	1025
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1026
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1027	It is straightforward to prove that the initial equational system satisfies the
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1028	invariant.
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1029
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1030	\begin{lmm}\label{invzero}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1031	@{thm[mode=IfThen] Init_ES_satisfies_invariant}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1032	\end{lmm}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1033
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1034	\begin{proof}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1035	Finiteness is given by the assumption and the way how we set up the
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1036	initial equational system. Soundness is proved in Lemma~\ref{inv}. Distinctness
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	1037	follows from the fact that the equivalence classes are disjoint. The @{text ardenable}
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1038	property also follows from the setup of the initial equational system, as does
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1039	validity.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1040	\end{proof}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1041
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1042	\noindent
ec774952190c polished everything urbanc parents: 112 diff changeset	1043	Next we show that @{text Iter} preserves the invariant.
ec774952190c polished everything urbanc parents: 112 diff changeset	1044
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1045	\begin{lmm}\label{iterone} If
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1046	@{thm (prem 1) iteration_step_invariant[where xrhs="rhs"]},
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1047	@{thm (prem 2) iteration_step_invariant[where xrhs="rhs"]} and
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1048	@{thm (prem 3) iteration_step_invariant[where xrhs="rhs"]}, then
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1049	@{thm (concl) iteration_step_invariant[where xrhs="rhs"]}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1050	\end{lmm}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1051
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1052	\begin{proof}
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	1053	The argument boils down to choosing an equation @{text "Y = yrhs"} to be eliminated
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1054	and to show that @{term "Subst_all (ES - {(Y, yrhs)}) Y (Arden Y yrhs)"}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1055	preserves the invariant.
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1056	We prove this as follows:
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1057
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1058	\begin{center}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1059	\begin{tabular}{@ {}l@ {}}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1060	@{text "\<forall>ES."}~@{thm (prem 1) Subst_all_satisfies_invariant} implies
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1061	@{thm (concl) Subst_all_satisfies_invariant}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1062	\end{tabular}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1063	\end{center}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1064
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1065	\noindent
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	1066	Finiteness is straightforward, as the @{const Subst} and @{const Arden} operations
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1067	keep the equational system finite. These operations also preserve soundness
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1068	and distinctness (we proved soundness for @{const Arden} in Lemma~\ref{ardenable}).
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	1069	The property @{text ardenable} is clearly preserved because the append-operation
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1070	cannot make a regular expression to match the empty string. Validity is
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1071	given because @{const Arden} removes an equivalence class from @{text yrhs}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1072	and then @{const Subst_all} removes @{text Y} from the equational system.
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1073	Having proved the implication above, we can instantiate @{text "ES"} with @{text "ES - {(Y, yrhs)}"}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1074	which matches with our proof-obligation of @{const "Subst_all"}. Since
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1075	\mbox{@{term "ES = ES - {(Y, yrhs)} \<union> {(Y, yrhs)}"}}, we can use the assumption
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1076	to complete the proof.
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1077	\end{proof}
6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1078
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1079	\noindent
ec774952190c polished everything urbanc parents: 112 diff changeset	1080	We also need the fact that @{text Iter} decreases the termination measure.
ec774952190c polished everything urbanc parents: 112 diff changeset	1081
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1082	\begin{lmm}\label{itertwo} If
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1083	@{thm (prem 1) iteration_step_measure[simplified (no_asm), where xrhs="rhs"]},
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1084	@{thm (prem 2) iteration_step_measure[simplified (no_asm), where xrhs="rhs"]} and
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1085	@{thm (prem 3) iteration_step_measure[simplified (no_asm), where xrhs="rhs"]}, then\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1086	\mbox{@{thm (concl) iteration_step_measure[simplified (no_asm), where xrhs="rhs"]}}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1087	\end{lmm}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1088
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1089	\begin{proof}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1090	By assumption we know that @{text "ES"} is finite and has more than one element.
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1091	Therefore there must be an element @{term "(Y, yrhs) \<in> ES"} with
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1092	@{term "(Y, yrhs) \<noteq> (X, rhs)"}. Using the distinctness property we can infer
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1093	that @{term "Y \<noteq> X"}. We further know that @{text "Remove ES Y yrhs"}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1094	removes the equation @{text "Y = yrhs"} from the system, and therefore
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1095	the cardinality of @{const Iter} strictly decreases.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1096	\end{proof}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1097
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1098	\noindent
134 08afbed1c8c7 chunhan's comments urbanc parents: 133 diff changeset	1099	This brings us to our property we want to establish for @{text Solve}.
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1100
ec774952190c polished everything urbanc parents: 112 diff changeset	1101
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1102	\begin{lmm}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1103	If @{thm (prem 1) Solve} and @{thm (prem 2) Solve} then there exists
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1104	a @{text rhs} such that @{term "Solve X (Init (UNIV // \<approx>A)) = {(X, rhs)}"}
5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1105	and @{term "invariant {(X, rhs)}"}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1106	\end{lmm}
104 5bd73aa805a7 updated paper urbanc parents: 103 diff changeset	1107
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1108	\begin{proof}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1109	In order to prove this lemma using \eqref{whileprinciple}, we have to use a slightly
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1110	stronger invariant since Lemma~\ref{iterone} and \ref{itertwo} have the precondition
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1111	that @{term "(X, rhs) \<in> ES"} for some @{text rhs}. This precondition is needed
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1112	in order to choose in the @{const Iter}-step an equation that is not \mbox{@{term "X = rhs"}}.
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1113	Therefore our invariant cannot be just @{term "invariant ES"}, but must be
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1114	@{term "invariant ES \<and> (\<exists>rhs. (X, rhs) \<in> ES)"}. By assumption
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1115	@{thm (prem 2) Solve} and Lemma~\ref{invzero}, the more general invariant holds for
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1116	the initial equational system. This is premise 1 of~\eqref{whileprinciple}.
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1117	Premise 2 is given by Lemma~\ref{iterone} and the fact that @{const Iter} might
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1118	modify the @{text rhs} in the equation @{term "X = rhs"}, but does not remove it.
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1119	Premise 3 of~\eqref{whileprinciple} is by Lemma~\ref{itertwo}. Now in premise 4
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1120	we like to show that there exists a @{text rhs} such that @{term "ES = {(X, rhs)}"}
e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1121	and that @{text "invariant {(X, rhs)}"} holds, provided the condition @{text "Cond"}
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1122	does not holds. By the stronger invariant we know there exists such a @{text "rhs"}
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1123	with @{term "(X, rhs) \<in> ES"}. Because @{text Cond} is not true, we know the cardinality
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1124	of @{text ES} is @{text 1}. This means @{text "ES"} must actually be the set @{text "{(X, rhs)}"},
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1125	for which the invariant holds. This allows us to conclude that
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1126	@{term "Solve X (Init (UNIV // \<approx>A)) = {(X, rhs)}"} and @{term "invariant {(X, rhs)}"} hold,
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1127	as needed.
107 6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1128	\end{proof}
6f4f9b7b9891 updated paper urbanc parents: 106 diff changeset	1129
106 91dc591de63f updated paper urbanc parents: 105 diff changeset	1130	\noindent
91dc591de63f updated paper urbanc parents: 105 diff changeset	1131	With this lemma in place we can show that for every equivalence class in @{term "UNIV // \<approx>A"}
91dc591de63f updated paper urbanc parents: 105 diff changeset	1132	there exists a regular expression.
91dc591de63f updated paper urbanc parents: 105 diff changeset	1133
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1134	\begin{lmm}\label{every_eqcl_has_reg}
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1135	@{thm[mode=IfThen] every_eqcl_has_reg}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1136	\end{lmm}
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1137
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1138	\begin{proof}
138 2dfe13bc1443 three typos urbanc parents: 137 diff changeset	1139	By the preceding lemma, we know that there exists a @{text "rhs"} such
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1140	that @{term "Solve X (Init (UNIV // \<approx>A))"} returns the equation @{text "X = rhs"},
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1141	and that the invariant holds for this equation. That means we
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1142	know @{text "X = \<Union>\<calL> ` rhs"}. We further know that
109 79b37ef9505f minor updated urbanc parents: 108 diff changeset	1143	this is equal to \mbox{@{text "\<Union>\<calL> ` (Arden X rhs)"}} using the properties of the
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1144	invariant and Lemma~\ref{ardenable}. Using the validity property for the equation @{text "X = rhs"},
156 fd39492b187c a few more changes urbanc parents: 154 diff changeset	1145	we can infer that @{term "rhss rhs \<subseteq> {X}"} and because the @{text Arden} operation
106 91dc591de63f updated paper urbanc parents: 105 diff changeset	1146	removes that @{text X} from @{text rhs}, that @{term "rhss (Arden X rhs) = {}"}.
113 ec774952190c polished everything urbanc parents: 112 diff changeset	1147	This means the right-hand side @{term "Arden X rhs"} can only consist of terms of the form @{term "Lam r"}.
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	1148	So we can collect those (finitely many) regular expressions @{text rs} and have @{term "X = lang (\<Uplus>rs)"}.
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1149	With this we can conclude the proof.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1150	\end{proof}
ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1151
106 91dc591de63f updated paper urbanc parents: 105 diff changeset	1152	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1153	Lemma~\ref{every_eqcl_has_reg} allows us to finally give a proof for the first direction
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1154	of the Myhill-Nerode Theorem.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1155
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1156	\begin{proof}[of Theorem~\ref{myhillnerodeone}]
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1157	By Lemma~\ref{every_eqcl_has_reg} we know that there exists a regular expression for
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1158	every equivalence class in @{term "UNIV // \<approx>A"}. Since @{text "finals A"} is
110 e500cab16be4 completed first direction urbanc parents: 109 diff changeset	1159	a subset of @{term "UNIV // \<approx>A"}, we also know that for every equivalence class
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1160	in @{term "finals A"} there exists a regular expression. Moreover by assumption
106 91dc591de63f updated paper urbanc parents: 105 diff changeset	1161	we know that @{term "finals A"} must be finite, and therefore there must be a finite
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1162	set of regular expressions @{text "rs"} such that
176 6969de1eb96b latest version of the journal paper urbanc parents: 175 diff changeset	1163	@{term "\<Union>(finals A) = lang (\<Uplus>rs)"}.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1164	Since the left-hand side is equal to @{text A}, we can use @{term "\<Uplus>rs"}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1165	as the regular expression that is needed in the theorem.
105 ae6ad1363eb9 updated paper urbanc parents: 104 diff changeset	1166	\end{proof}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	1167
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	1168	\noindent
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1169	Note that our algorithm for solving equational systems provides also a method for
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1170	calculating a regular expression for the complement of a regular language:
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1171	if we combine all regular
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	1172	expressions corresponding to equivalence classes not in @{term "finals A"},
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1173	then we obtain a regular expression for the complement language @{term "- A"}.
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1174	This is similar to the usual construction of a `complement automaton'.
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	1175	*}
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	1176
100 2409827d8eb8 updated urbanc parents: 98 diff changeset	1177	section {* Myhill-Nerode, Second Part *}
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1178
a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1179	text {*
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	1180	\noindent
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1181	In this section we will give a proof for establishing the second
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1182	part of the Myhill-Nerode Theorem. It can be formulated in our setting as follows:
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1183
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1184	\begin{thrm}\label{myhillnerodetwo}
135 604518f0127f final polished urbanc parents: 134 diff changeset	1185	Given @{text "r"} is a regular expression, then @{thm Myhill_Nerode2}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1186	\end{thrm}
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1187
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1188	\noindent
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1189	The proof will be by induction on the structure of @{text r}. It turns out
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1190	the base cases are straightforward.
342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1191
342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1192
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1193	\begin{proof}[(Base Cases)]
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	1194	The cases for @{const ZERO}, @{const ONE} and @{const ATOM} are routine, because
149 e122cb146ecc added the most current versions of the theories. urbanc parents: 145 diff changeset	1195	we can easily establish that
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1196
114 c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1197	\begin{center}
c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1198	\begin{tabular}{l}
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	1199	@{thm quot_zero_eq}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	1200	@{thm quot_one_subset}\\
21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	1201	@{thm quot_atom_subset}
114 c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1202	\end{tabular}
c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1203	\end{center}
c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1204
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1205	\noindent
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1206	hold, which shows that @{term "UNIV // \<approx>(lang r)"} must be finite.
114 c5eb5f3065ae updated bib urbanc parents: 113 diff changeset	1207	\end{proof}
109 79b37ef9505f minor updated urbanc parents: 108 diff changeset	1208
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	1209	\noindent
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1210	Much more interesting, however, are the inductive cases. They seem hard to be solved
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	1211	directly. The reader is invited to try.\footnote{The induction hypothesis is not strong enough
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	1212	to make any progress with the TIME and STAR cases.}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1213
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1214	In order to see how our proof proceeds consider the following suggestive picture
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1215	given by \citeN{Constable00}:
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1216	%
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1217	\begin{equation}\label{pics}
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1218	\mbox{\begin{tabular}{c@ {\hspace{10mm}}c@ {\hspace{10mm}}c}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1219	\begin{tikzpicture}[scale=0.95]
180 b755090d0f3d added a picture urbanc parents: 179 diff changeset	1220	%Circle
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1221	\draw[thick] (0,0) circle (1.1);
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1222	\end{tikzpicture}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1223	&
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1224	\begin{tikzpicture}[scale=0.95]
180 b755090d0f3d added a picture urbanc parents: 179 diff changeset	1225	%Circle
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1226	\draw[thick] (0,0) circle (1.1);
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1227	%Main rays
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1228	\foreach \a in {0, 90,...,359}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1229	\draw[very thick] (0, 0) -- (\a:1.1);
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1230	\foreach \a / \l in {45/1, 135/2, 225/3, 315/4}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1231	\draw (\a: 0.65) node {$a_\l$};
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1232	\end{tikzpicture}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1233	&
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1234	\begin{tikzpicture}[scale=0.95]
180 b755090d0f3d added a picture urbanc parents: 179 diff changeset	1235	%Circle
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1236	\draw[thick] (0,0) circle (1.1);
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1237	%Main rays
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1238	\foreach \a in {0, 45,...,359}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1239	\draw[very thick] (0, 0) -- (\a:1.1);
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1240	\foreach \a / \l in {22.5/1.1, 67.5/1.2, 112.5/2.1, 157.5/2.2, 202.4/3.1, 247.5/3.2, 292.5/4.1, 337.5/4.2}
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1241	\draw (\a: 0.77) node {$a_{\l}$};
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1242	\end{tikzpicture}\\
b755090d0f3d added a picture urbanc parents: 179 diff changeset	1243	@{term UNIV} & @{term "UNIV // (\<approx>(lang r))"} & @{term "UNIV // R"}
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1244	\end{tabular}}
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1245	\end{equation}
179 edacc141060f small improvements urbanc parents: 178 diff changeset	1246
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1247	\noindent
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1248	The relation @{term "\<approx>(lang r)"} partitions the set of all strings, @{term UNIV}, into some
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1249	equivalence classes. To show that there are only finitely many of them, it
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1250	suffices to show in each induction step that another relation, say @{text
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1251	R}, has finitely many equivalence classes and refines @{term "\<approx>(lang r)"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1252
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1253	\begin{dfntn}
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1254	A relation @{text "R\<^isub>1"} \emph{refines} @{text "R\<^isub>2"}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1255	provided @{text "R\<^isub>1 \<subseteq> R\<^isub>2"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1256	\end{dfntn}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1257
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1258	\noindent
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1259	For constructing @{text R}, we will rely on some \emph{tagging-functions}
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1260	defined over strings. Given the inductive hypothesis, it will be easy to
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1261	prove that the \emph{range} of these tagging-functions is finite. The range
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1262	of a function @{text f} is defined as
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1263
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1264	\begin{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1265	@{text "range f \<equiv> f ` UNIV"}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1266	\end{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1267
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1268	\noindent
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1269	that means we take the image of @{text f} w.r.t.~all elements in the
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1270	domain. With this we will be able to infer that the tagging-functions, seen
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1271	as relations, give rise to finitely many equivalence classes.
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1272	Finally we will show that the tagging-relations are more refined than
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1273	@{term "\<approx>(lang r)"}, which implies that @{term "UNIV // \<approx>(lang r)"} must
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1274	also be finite. We formally define the notion of a \emph{tagging-relation}
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1275	as follows.
97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1276
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1277
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1278	\begin{dfntn}[(Tagging-Relation)] Given a tagging-function @{text tag}, then two strings @{text x}
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1279	and @{text y} are \emph{tag-related} provided
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1280	\begin{center}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1281	@{text "x \<^raw:$\threesim$>\<^bsub>tag\<^esub> y \<equiv> tag x = tag y"}\;.
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1282	\end{center}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1283	\end{dfntn}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1284
145 099e20f25b25 corrected small typo urbanc parents: 143 diff changeset	1285
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1286	In order to establish finiteness of a set @{text A}, we shall use the following powerful
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1287	principle from Isabelle/HOL's library.
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1288	%
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1289	\begin{equation}\label{finiteimageD}
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1290	@{thm[mode=IfThen] finite_imageD}
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1291	\end{equation}
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1292
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1293	\noindent
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1294	It states that if an image of a set under an injective function @{text f} (injective over this set)
131 6b4c20714b4f chunhan's comments urbanc parents: 130 diff changeset	1295	is finite, then the set @{text A} itself must be finite. We can use it to establish the following
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1296	two lemmas.
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1297
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1298	\begin{lmm}\label{finone}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1299	@{thm[mode=IfThen] finite_eq_tag_rel}
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1300	\end{lmm}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1301
22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1302	\begin{proof}
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1303	We set in \eqref{finiteimageD}, @{text f} to be @{text "X \<mapsto> tag ` X"}. We have
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1304	@{text "range f"} to be a subset of @{term "Pow (range tag)"}, which we know must be
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	1305	finite by assumption. Now @{term "f ` (UNIV // =tag=)"} is a subset of @{text "range f"},
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1306	and so also finite. Injectivity amounts to showing that @{text "X = Y"} under the
ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1307	assumptions that @{text "X, Y \<in> "}~@{term "UNIV // =tag="} and @{text "f X = f Y"}.
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1308	From the assumptions we obtain \mbox{@{text "x \<in> X"}} and @{text "y \<in> Y"} with
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1309	@{text "tag x = tag y"}. Since @{text x} and @{text y} are tag-related, this in
23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1310	turn means that the equivalence classes @{text X}
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1311	and @{text Y} must be equal. Therefore \eqref{finiteimageD} allows us to conclude
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1312	with @{thm (concl) finite_eq_tag_rel}.
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1313	\end{proof}
22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1314
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1315	\begin{lmm}\label{fintwo}
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1316	Given two equivalence relations @{text "R\<^isub>1"} and @{text "R\<^isub>2"}, whereby
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1317	@{text "R\<^isub>1"} refines @{text "R\<^isub>2"}.
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1318	If @{thm (prem 1) refined_partition_finite[where ?R1.0="R\<^isub>1" and ?R2.0="R\<^isub>2"]}
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1319	then @{thm (concl) refined_partition_finite[where ?R1.0="R\<^isub>1" and ?R2.0="R\<^isub>2"]}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1320	\end{lmm}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1321
22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1322	\begin{proof}
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1323	We prove this lemma again using \eqref{finiteimageD}. This time we set @{text f} to
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1324	be @{text "X \<mapsto>"}~@{term "{R\<^isub>1 `` {x} \| x. x \<in> X}"}. It is easy to see that
135 604518f0127f final polished urbanc parents: 134 diff changeset	1325	@{term "finite (f ` (UNIV // R\<^isub>2))"} because it is a subset of @{term "Pow (UNIV // R\<^isub>1)"},
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1326	which must be finite by assumption. What remains to be shown is that @{text f} is injective
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1327	on @{term "UNIV // R\<^isub>2"}. This is equivalent to showing that two equivalence
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1328	classes, say @{text "X"} and @{text Y}, in @{term "UNIV // R\<^isub>2"} are equal, provided
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1329	@{text "f X = f Y"}. For @{text "X = Y"} to be equal, we have to find two elements
c3fa11ee776e first proof urbanc parents: 117 diff changeset	1330	@{text "x \<in> X"} and @{text "y \<in> Y"} such that they are @{text R\<^isub>2} related.
135 604518f0127f final polished urbanc parents: 134 diff changeset	1331	We know there exists a @{text "x \<in> X"} with \mbox{@{term "X = R\<^isub>2 `` {x}"}}.
604518f0127f final polished urbanc parents: 134 diff changeset	1332	From the latter fact we can infer that @{term "R\<^isub>1 ``{x} \<in> f X"}
123 23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1333	and further @{term "R\<^isub>1 ``{x} \<in> f Y"}. This means we can obtain a @{text y}
23c0e6f2929d polished everywhere...two cases still missing urbanc parents: 122 diff changeset	1334	such that @{term "R\<^isub>1 `` {x} = R\<^isub>1 `` {y}"} holds. Consequently @{text x} and @{text y}
118 c3fa11ee776e first proof urbanc parents: 117 diff changeset	1335	are @{text "R\<^isub>1"}-related. Since by assumption @{text "R\<^isub>1"} refines @{text "R\<^isub>2"},
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1336	they must also be @{text "R\<^isub>2"}-related, as we need to show.
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1337	\end{proof}
22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1338
22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1339	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1340	Chaining Lemma~\ref{finone} and \ref{fintwo} together, means in order to show
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1341	that @{term "UNIV // \<approx>(lang r)"} is finite, we have to construct a tagging-function whose
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1342	range can be shown to be finite and whose tagging-relation refines @{term "\<approx>(lang r)"}.
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1343	Let us attempt the @{const PLUS}-case first. We take as tagging-function
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1344
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1345	\begin{center}
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1346	@{thm tag_Plus_def[where A="A" and B="B", THEN meta_eq_app]}
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1347	\end{center}
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	1348
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1349	\noindent
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1350	where @{text "A"} and @{text "B"} are some arbitrary languages. The reason for this choice
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1351	is that we need to establish that @{term "=(tag_Plus A B)="} refines @{term "\<approx>(A \<union> B)"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1352	This amounts to showing @{term "x \<approx>A y"} or @{term "x \<approx>B y"} under the assumption
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1353	@{term "x"}~@{term "=(tag_Plus A B)="}~@{term y}. As we shall see, this definition will
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1354	provide us with just the right assumptions in order to get the proof through.
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1355
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1356	\begin{proof}[(@{const "PLUS"}-Case)]
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1357	We can show in general, if @{term "finite (UNIV // \<approx>A)"} and @{term "finite
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1358	(UNIV // \<approx>B)"} then @{term "finite ((UNIV // \<approx>A) \<times> (UNIV // \<approx>B))"}
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1359	holds. The range of @{term "tag_Plus A B"} is a subset of this product
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1360	set---so finite. For the refinement proof-obligation, we know that @{term
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1361	"(\<approx>A `` {x}, \<approx>B `` {x}) = (\<approx>A `` {y}, \<approx>B `` {y})"} holds by assumption. Then
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1362	clearly either @{term "x \<approx>A y"} or @{term "x \<approx>B y"}, as we needed to
183 c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1363	show. Finally we can discharge this case by setting @{text A} to @{term
c4893e84c88e cleaned up the proofs in Myhill_2 urbanc parents: 182 diff changeset	1364	"lang r\<^isub>1"} and @{text B} to @{term "lang r\<^isub>2"}.
119 ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1365	\end{proof}
ece3f197b92b first two proofs in 2 direction urbanc parents: 118 diff changeset	1366
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1367	\noindent
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1368	The @{const TIMES}-case is slightly more complicated. We first prove the
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1369	following lemma, which will aid the proof about refinement.
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1370
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1371	\begin{lmm}\label{refinement}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1372	The relation @{text "\<^raw:$\threesim$>\<^bsub>tag\<^esub>"} refines @{term "\<approx>A"}, provided for
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1373	all strings @{text x}, @{text y} and @{text z} we have that \mbox{@{text "x \<^raw:$\threesim$>\<^bsub>tag\<^esub> y"}}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1374	and @{term "x @ z \<in> A"} imply @{text "y @ z \<in> A"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1375	\end{lmm}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1376
109 79b37ef9505f minor updated urbanc parents: 108 diff changeset	1377
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1378	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1379	We therefore can analyse how the strings @{text "x @ z"} are in the language
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1380	@{text A} and then construct an appropriate tagging-function to infer that
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1381	@{term "y @ z"} are also in @{text A}. For this we will use the notion of
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1382	the set of all possible \emph{partitions} of a string:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1383	%
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1384	\begin{equation}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1385	@{thm Partitions_def}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1386	\end{equation}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1387
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1388	\noindent
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1389	If we know that @{text "(x\<^isub>p, x\<^isub>s) \<in> Partitions x"}, we will
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1390	refer to @{text "x\<^isub>p"} as the \emph{prefix} of the string @{text x},
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1391	and respectively to @{text "x\<^isub>s"} as the \emph{suffix}.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1392
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1393
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1394	Now assuming @{term "x @ z \<in> A \<cdot> B"}, there are only two possible ways of how to `split'
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1395	this string to be in @{term "A \<cdot> B"}:
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1396	%
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1397	\begin{center}
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1398	\begin{tabular}{c}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1399	\scalebox{1}{
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1400	\begin{tikzpicture}[scale=0.8,fill=gray!20]
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1401	\node[draw,minimum height=3.8ex, fill] (x) { $\hspace{4.8em}@{text x}\hspace{4.8em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1402	\node[draw,minimum height=3.8ex, right=-0.03em of x, fill] (za) { $\hspace{0.6em}@{text "z\<^isub>p"}\hspace{0.6em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1403	\node[draw,minimum height=3.8ex, right=-0.03em of za, fill] (zza) { $\hspace{2.6em}@{text "z\<^isub>s"}\hspace{2.6em}$ };
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1404
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1405	\draw[decoration={brace,transform={yscale=3}},decorate]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1406	(x.north west) -- ($(za.north west)+(0em,0em)$)
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1407	node[midway, above=0.5em]{@{text x}};
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1408
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1409	\draw[decoration={brace,transform={yscale=3}},decorate]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1410	($(za.north west)+(0em,0ex)$) -- ($(zza.north east)+(0em,0ex)$)
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1411	node[midway, above=0.5em]{@{text z}};
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1412
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1413	\draw[decoration={brace,transform={yscale=3}},decorate]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1414	($(x.north west)+(0em,3ex)$) -- ($(zza.north east)+(0em,3ex)$)
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1415	node[midway, above=0.8em]{@{term "x @ z \<in> A \<cdot> B"}};
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1416
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1417	\draw[decoration={brace,transform={yscale=3}},decorate]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1418	($(za.south east)+(0em,0ex)$) -- ($(x.south west)+(0em,0ex)$)
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1419	node[midway, below=0.5em]{@{text "x @ z\<^isub>p \<in> A"}};
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1420
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1421	\draw[decoration={brace,transform={yscale=3}},decorate]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1422	($(zza.south east)+(0em,0ex)$) -- ($(za.south east)+(0em,0ex)$)
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1423	node[midway, below=0.5em]{@{text "z\<^isub>s \<in> B"}};
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1424	\end{tikzpicture}}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1425	\\[2mm]
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1426	\scalebox{1}{
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1427	\begin{tikzpicture}[scale=0.8,fill=gray!20]
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1428	\node[draw,minimum height=3.8ex, fill] (xa) { $\hspace{3em}@{text "x\<^isub>p"}\hspace{3em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1429	\node[draw,minimum height=3.8ex, right=-0.03em of xa, fill] (xxa) { $\hspace{0.2em}@{text "x\<^isub>s"}\hspace{0.2em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1430	\node[draw,minimum height=3.8ex, right=-0.03em of xxa, fill] (z) { $\hspace{5em}@{text z}\hspace{5em}$ };
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1431
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1432	\draw[decoration={brace,transform={yscale=3}},decorate]
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1433	(xa.north west) -- ($(xxa.north east)+(0em,0em)$)
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1434	node[midway, above=0.5em]{@{text x}};
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1435
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1436	\draw[decoration={brace,transform={yscale=3}},decorate]
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1437	(z.north west) -- ($(z.north east)+(0em,0em)$)
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1438	node[midway, above=0.5em]{@{text z}};
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1439
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1440	\draw[decoration={brace,transform={yscale=3}},decorate]
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1441	($(xa.north west)+(0em,3ex)$) -- ($(z.north east)+(0em,3ex)$)
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	1442	node[midway, above=0.8em]{@{term "x @ z \<in> A \<cdot> B"}};
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1443
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1444	\draw[decoration={brace,transform={yscale=3}},decorate]
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1445	($(z.south east)+(0em,0ex)$) -- ($(xxa.south west)+(0em,0ex)$)
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1446	node[midway, below=0.5em]{@{term "x\<^isub>s @ z \<in> B"}};
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1447
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1448	\draw[decoration={brace,transform={yscale=3}},decorate]
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1449	($(xa.south east)+(0em,0ex)$) -- ($(xa.south west)+(0em,0ex)$)
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1450	node[midway, below=0.5em]{@{term "x\<^isub>p \<in> A"}};
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1451	\end{tikzpicture}}
159 990c12ab1562 edits; sqeezed to 16 pages urbanc parents: 157 diff changeset	1452	\end{tabular}
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1453	\end{center}
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1454	%
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1455	\noindent
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1456	Either @{text x} and a prefix of @{text "z"} is in @{text A} and the rest in @{text B}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1457	(first picture) or there is a prefix of @{text x} in @{text A} and the rest is in @{text B}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1458	(second picture). In both cases we have to show that @{term "y @ z \<in> A \<cdot> B"}. The first case
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1459	we will only go through if we know that @{term "x \<approx>A y"} holds @{text "(*)"}. Because then
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1460	we can infer from @{term "x @ z\<^isub>p \<in> A"} that @{term "y @ z\<^isub>p \<in> A"} holds for all @{text "z\<^isub>p"}.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1461	In the second case we only know that @{text "x\<^isub>p"} and @{text "x\<^isub>s"} is one possible partition
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1462	of the string @{text x}. We have to know that both @{text "x\<^isub>p"} and the
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1463	corresponding partition @{text "y\<^isub>p"} are in @{text "A"}, and that @{text "x\<^isub>s"} is `@{text B}-related'
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1464	to @{text "y\<^isub>s"} @{text "(**)"}. From the latter fact we can infer that @{text "y\<^isub>s @ z \<in> B"}.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1465	This will solve the second case.
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1466	Taking the two requirements, @{text "()"} and @{text "(*)"}, together we define the
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1467	tagging-function in the @{const Times}-case as:
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1468
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1469	\begin{center}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1470	@{thm (lhs) tag_Times_def[where ?A="A" and ?B="B"]}~@{text "\<equiv>"}~
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1471	@{text "(\<lbrakk>x\<rbrakk>\<^bsub>\<approx>A\<^esub>, {\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>B\<^esub> \| x\<^isub>p \<in> A \<and> (x\<^isub>p, x\<^isub>s) \<in> Partitions x})"}
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1472	\end{center}
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1473
62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1474	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1475	Note that we have to make the assumption for all suffixes @{text "x\<^isub>s"}, since we do
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1476	not know anything about how the string @{term x} is partitioned.
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1477	With this definition in place, let us prove the @{const "Times"}-case.
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1478
125 62925473bf6b added pictures for seq-case urbanc parents: 124 diff changeset	1479
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1480	\begin{proof}[(@{const TIMES}-Case)]
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1481	If @{term "finite (UNIV // \<approx>A)"} and @{term "finite (UNIV // \<approx>B)"}
8440863a9900 seq case finished urbanc parents: 126 diff changeset	1482	then @{term "finite ((UNIV // \<approx>A) \<times> (Pow (UNIV // \<approx>B)))"} holds. The range of
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1483	@{term "tag_Times A B"} is a subset of this product set, and therefore finite.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1484	For the refinement of @{term "\<approx>(A \<cdot> B)"} and @{text "\<^raw:$\threesim$>\<^bsub>\<times>tag A B\<^esub>"},
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1485	we have by Lemma \ref{refinement}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1486
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1487	\begin{center}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1488	@{term "tag_Times A B x = tag_Times A B y"}
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1489	\end{center}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1490
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1491	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1492	and @{term "x @ z \<in> A \<cdot> B"}, and have to establish @{term "y @ z \<in> A \<cdot>
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1493	B"}. As shown in the pictures above, there are two cases to be
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1494	considered. First, there exists a @{text "z\<^isub>p"} and @{text
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1495	"z\<^isub>s"} such that @{term "x @ z\<^isub>p \<in> A"} and @{text "z\<^isub>s
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1496	\<in> B"}. By the assumption about @{term "tag_Times A B"} we have @{term "\<approx>A
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1497	`` {x} = \<approx>A `` {y}"} and thus @{term "x \<approx>A y"}. Hence by the Myhill-Nerode
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1498	Relation @{term "y @ z\<^isub>p \<in> A"} holds. Using @{text "z\<^isub>s \<in> B"},
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1499	we can conclude in this case with @{term "y @ z \<in> A \<cdot> B"} (recall @{text
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1500	"z\<^isub>p @ z\<^isub>s = z"}).
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1501
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1502	Second there exists a partition @{text "x\<^isub>p"} and @{text "x\<^isub>s"} with
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1503	@{text "x\<^isub>p \<in> A"} and @{text "x\<^isub>s @ z \<in> B"}. We therefore have
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1504
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1505	\begin{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1506	@{text "\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>B\<^esub> \<in> {\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>B\<^esub> \| x\<^isub>p \<in> A \<and> (x\<^isub>p, x\<^isub>s) \<in> Partitions x}"}
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1507	\end{center}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1508
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1509	\noindent
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1510	and by the assumption about @{term "tag_Times A B"} also
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1511
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1512	\begin{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1513	@{text "\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>B\<^esub> \<in> {\<lbrakk>y\<^isub>s\<rbrakk>\<^bsub>\<approx>B\<^esub> \| y\<^isub>p \<in> A \<and> (y\<^isub>p, y\<^isub>s) \<in> Partitions y}"}
127 8440863a9900 seq case finished urbanc parents: 126 diff changeset	1514	\end{center}
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1515
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1516	\noindent
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1517	This means there must be a partition @{text "y\<^isub>p"} and @{text "y\<^isub>s"}
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1518	such that @{term "y\<^isub>p \<in> A"} and @{term "\<approx>B `` {x\<^isub>s} = \<approx>B ``
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1519	{y\<^isub>s}"}. Unfolding the Myhill-Nerode Relation and together with the
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1520	facts that @{text "x\<^isub>p \<in> A"} and \mbox{@{text "x\<^isub>s @ z \<in> B"}}, we
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1521	obtain @{term "y\<^isub>p \<in> A"} and @{text "y\<^isub>s @ z \<in> B"}, as needed in
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1522	this case. We again can complete the @{const TIMES}-case by setting @{text
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1523	A} to @{term "lang r\<^isub>1"} and @{text B} to @{term "lang r\<^isub>2"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1524	\end{proof}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1525
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1526	\noindent
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1527	The case for @{const Star} is similar to @{const TIMES}, but poses a few
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1528	extra challenges. To deal with them, we define first the notion of a \emph{string
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1529	prefix} and a \emph{strict string prefix}:
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1530
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1531	\begin{center}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1532	\begin{tabular}{l}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1533	@{text "x \<le> y \<equiv> \<exists>z. y = x @ z"}\\
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1534	@{text "x < y \<equiv> x \<le> y \<and> x \<noteq> y"}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1535	\end{tabular}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1536	\end{center}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1537
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1538	When analysing the case of @{text "x @ z"} being an element in @{term "A\<star>"}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1539	and @{text x} is not the empty string, we have the following picture:
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1540
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1541	\begin{center}
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1542	\scalebox{1}{
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1543	\begin{tikzpicture}[scale=0.8,fill=gray!20]
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1544	\node[draw,minimum height=3.8ex, fill] (xa) { $\hspace{4em}@{text "x\<^bsub>pmax\<^esub>"}\hspace{4em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1545	\node[draw,minimum height=3.8ex, right=-0.03em of xa, fill] (xxa) { $\hspace{0.5em}@{text "x\<^bsub>s\<^esub>"}\hspace{0.5em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1546	\node[draw,minimum height=3.8ex, right=-0.03em of xxa, fill] (za) { $\hspace{2em}@{text "z\<^isub>a"}\hspace{2em}$ };
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1547	\node[draw,minimum height=3.8ex, right=-0.03em of za, fill] (zb) { $\hspace{7em}@{text "z\<^isub>b"}\hspace{7em}$ };
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1548
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1549	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1550	(xa.north west) -- ($(xxa.north east)+(0em,0em)$)
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1551	node[midway, above=0.5em]{@{text x}};
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1552
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1553	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1554	(za.north west) -- ($(zb.north east)+(0em,0em)$)
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1555	node[midway, above=0.5em]{@{text z}};
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1556
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1557	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1558	($(xa.north west)+(0em,3ex)$) -- ($(zb.north east)+(0em,3ex)$)
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1559	node[midway, above=0.8em]{@{term "x @ z \<in> A\<star>"}};
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1560
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1561	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1562	($(za.south east)+(0em,0ex)$) -- ($(xxa.south west)+(0em,0ex)$)
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1563	node[midway, below=0.5em]{@{term "x\<^isub>s @ z\<^isub>a \<in> A"}};
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1564
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1565	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1566	($(xa.south east)+(0em,0ex)$) -- ($(xa.south west)+(0em,0ex)$)
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1567	node[midway, below=0.5em]{@{text "x\<^bsub>pmax\<^esub> \<in> A\<^isup>\<star>"}};
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1568
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1569	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1570	($(zb.south east)+(0em,0ex)$) -- ($(zb.south west)+(0em,0ex)$)
136 13b0f3dac9a2 final final polishing urbanc parents: 135 diff changeset	1571	node[midway, below=0.5em]{@{term "z\<^isub>b \<in> A\<star>"}};
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1572
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1573	\draw[decoration={brace,transform={yscale=3}},decorate]
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1574	($(zb.south east)+(0em,-4ex)$) -- ($(xxa.south west)+(0em,-4ex)$)
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1575	node[midway, below=0.5em]{@{term "x\<^isub>s @ z \<in> A\<star>"}};
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1576	\end{tikzpicture}}
6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1577	\end{center}
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1578	%
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1579	\noindent
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1580	We can find a strict prefix @{text "x\<^isub>p"} of @{text x} such that @{term "x\<^isub>p \<in> A\<star>"},
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1581	@{text "x\<^isub>p < x"} and the rest @{term "x\<^isub>s @ z \<in> A\<star>"}. For example the empty string
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1582	@{text "[]"} would do (recall @{term "x \<noteq> []"}).
135 604518f0127f final polished urbanc parents: 134 diff changeset	1583	There are potentially many such prefixes, but there can only be finitely many of them (the
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1584	string @{text x} is finite). Let us therefore choose the longest one and call it
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1585	@{text "x\<^bsub>pmax\<^esub>"}. Now for the rest of the string @{text "x\<^isub>s @ z"} we
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1586	know it is in @{term "A\<star>"} and cannot be the empty string. By Property~\ref{langprops}@{text "(iv)"},
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1587	we can separate
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1588	this string into two parts, say @{text "a"} and @{text "b"}, such that @{text "a \<noteq> []"}, @{text "a \<in> A"}
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1589	and @{term "b \<in> A\<star>"}. Now @{text a} must be strictly longer than @{text "x\<^isub>s"},
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1590	otherwise @{text "x\<^bsub>pmax\<^esub>"} is not the longest prefix. That means @{text a}
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1591	`overlaps' with @{text z}, splitting it into two components @{text "z\<^isub>a"} and
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1592	@{text "z\<^isub>b"}. For this we know that @{text "x\<^isub>s @ z\<^isub>a \<in> A"} and
135 604518f0127f final polished urbanc parents: 134 diff changeset	1593	@{term "z\<^isub>b \<in> A\<star>"}. To cut a story short, we have divided @{term "x @ z \<in> A\<star>"}
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1594	such that we have a string @{text a} with @{text "a \<in> A"} that lies just on the
184 2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1595	`border' of @{text x} and @{text z}. This string is @{text "x\<^isub>s @ z\<^isub>a"}.
2455db3b06ac more on the paper urbanc parents: 183 diff changeset	1596
135 604518f0127f final polished urbanc parents: 134 diff changeset	1597	In order to show that @{term "x @ z \<in> A\<star>"} implies @{term "y @ z \<in> A\<star>"}, we use
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1598	the following tagging-function:
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	1599	%
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1600	\begin{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1601	@{thm (lhs) tag_Star_def[where ?A="A", THEN meta_eq_app]}~@{text "\<equiv>"}~
338 e7504bfdbd50 made the changes thes 2nd referee suggested and made it to compile again urbanc parents: 334 diff changeset	1602	@{text "{\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>A\<^esub> \| x\<^isub>p < x \<and> x\<^isub>p \<in> A\<^isup>\<star> \<and> (x\<^isub>p, x\<^isub>s) \<in> Partitions x}"}
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1603	\end{center}
128 6d2693c78c37 finished picture urbanc parents: 127 diff changeset	1604
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1605	\begin{proof}[(@{const Star}-Case)]
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1606	If @{term "finite (UNIV // \<approx>A)"}
3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1607	then @{term "finite (Pow (UNIV // \<approx>A))"} holds. The range of
181 97090fc7aa9f some experiments with the proofs in Myhill_2 urbanc parents: 180 diff changeset	1608	@{term "tag_Star A"} is a subset of this set, and therefore finite.
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1609	Again we have to show under the assumption @{term "x"}~@{term "=(tag_Star A)="}~@{term y}
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1610	that @{term "x @ z \<in> A\<star>"} implies @{term "y @ z \<in> A\<star>"}.
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1611
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1612	We first need to consider the case that @{text x} is the empty string.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1613	From the assumption about strict prefixes in @{text "\<^raw:$\threesim$>\<^bsub>\<star>tag A\<^esub>"}, we
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1614	can infer @{text y} is the empty string and
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1615	then clearly have @{term "y @ z \<in> A\<star>"}. In case @{text x} is not the empty
134 08afbed1c8c7 chunhan's comments urbanc parents: 133 diff changeset	1616	string, we can divide the string @{text "x @ z"} as shown in the picture
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1617	above. By the tagging-function and the facts @{text "x\<^bsub>pmax\<^esub> \<in> A\<^isup>\<star>"} and @{text "x\<^bsub>pmax\<^esub> < x"},
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1618	we have
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1619
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1620	\begin{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1621	@{text "\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>A\<^esub> \<in> {\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>A\<^esub> \| x\<^bsub>pmax\<^esub> < x \<and> x\<^bsub>pmax\<^esub> \<in> A\<^isup>\<star> \<and> (x\<^bsub>pmax\<^esub>, x\<^isub>s) \<in> Partitions x}"}
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1622	\end{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1623
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1624	\noindent
3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1625	which by assumption is equal to
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1626
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1627	\begin{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1628	@{text "\<lbrakk>x\<^isub>s\<rbrakk>\<^bsub>\<approx>A\<^esub> \<in> {\<lbrakk>y\<^isub>s\<rbrakk>\<^bsub>\<approx>A\<^esub> \| y\<^bsub>p\<^esub> < y \<and> y\<^bsub>p\<^esub> \<in> A\<^isup>\<star> \<and> (y\<^bsub>p\<^esub>, y\<^isub>s) \<in> Partitions y}"}
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1629	\end{center}
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1630
130 3e4ad22193e7 pre-final version urbanc parents: 129 diff changeset	1631	\noindent
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1632	From this we know there exist a partition @{text "y\<^isub>p"} and @{text
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1633	"y\<^isub>s"} with @{term "y\<^isub>p \<in> A\<star>"} and also @{term "x\<^isub>s \<approx>A
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1634	y\<^isub>s"}. Unfolding the Myhill-Nerode Relation we know @{term
185 8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1635	"y\<^isub>s @ z\<^isub>a \<in> A"}. We also know that @{term "z\<^isub>b \<in> A\<star>"}.
8749db46d5e6 completed the taging-function section urbanc parents: 184 diff changeset	1636	Therefore @{term "y\<^isub>p @ (y\<^isub>s @ z\<^isub>a) @ z\<^isub>b \<in>
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1637	A\<star>"}, which means @{term "y @ z \<in> A\<star>"}. The last step is to set
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1638	@{text "A"} to @{term "lang r"} and thus complete the proof.
121 1cf12a107b03 added directory with the small files and numbers of lines urbanc parents: 120 diff changeset	1639	\end{proof}
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1640	*}
a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	1641
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1642	section {* Second Part proved using Partial Derivatives *}
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	1643
e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	1644	text {*
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1645	\label{derivatives}
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	1646	\noindent
d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	1647	As we have seen in the previous section, in order to establish
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1648	the second direction of the Myhill-Nerode Theorem, it is sufficient to find
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1649	a more refined relation than @{term "\<approx>(lang r)"} for which we can
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1650	show that there are only finitely many equivalence classes. So far we
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1651	showed this directly by induction on @{text "r"} using tagging-functions.
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1652	However, there is also an indirect method to come up with such a refined
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1653	relation by using derivatives of regular expressions introduced by \citeN{Brzozowski64}.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1654
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1655	Assume the following two definitions for the \emph{left-quotient} of a language,
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1656	which we write as @{term "Deriv c A"} and @{term "Derivs s A"} where @{text c}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1657	is a character and @{text s} a string, respectively:
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1658
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1659	\begin{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1660	\begin{tabular}{r@ {\hspace{1mm}}c@ {\hspace{2mm}}l}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1661	@{thm (lhs) Der_def} & @{text "\<equiv>"} & @{thm (rhs) Der_def}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1662	@{thm (lhs) Ders_def} & @{text "\<equiv>"} & @{thm (rhs) Ders_def}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1663	\end{tabular}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1664	\end{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1665
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1666	\noindent
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1667	In order to aid readability, we shall make use of the following abbreviation
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1668
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1669	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1670	@{abbrev "Derivss s As"}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1671	\end{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1672
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1673	\noindent
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1674	where we apply the left-quotient to a set of languages and then combine the results.
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1675	Clearly we have the following equivalence between the Myhill-Nerode Relation
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1676	(Definition~\ref{myhillneroderel}) and left-quotients
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1677	%
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1678	\begin{equation}\label{mhders}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1679	@{term "x \<approx>A y"} \hspace{4mm}\text{if and only if}\hspace{4mm} @{term "Derivs x A = Derivs y A"}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1680	\end{equation}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1681
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1682	\noindent
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1683	It is also straightforward to establish the following properties of left-quotients
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1684	%
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1685	\begin{equation}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1686	\mbox{\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{2mm}}l}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1687	@{thm (lhs) Deriv_simps(1)} & $=$ & @{thm (rhs) Deriv_simps(1)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1688	@{thm (lhs) Deriv_simps(2)} & $=$ & @{thm (rhs) Deriv_simps(2)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1689	@{thm (lhs) Deriv_simps(3)} & $=$ & @{thm (rhs) Deriv_simps(3)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1690	@{thm (lhs) Deriv_simps(4)} & $=$ & @{thm (rhs) Deriv_simps(4)}\\
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1691	@{thm (lhs) Der_conc} & $=$ & @{thm (rhs) Der_conc}\\
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1692	@{thm (lhs) Deriv_star} & $=$ & @{thm (rhs) Deriv_star}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1693	@{thm (lhs) Derivs_simps(1)} & $=$ & @{thm (rhs) Derivs_simps(1)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1694	@{thm (lhs) Derivs_simps(2)} & $=$ & @{thm (rhs) Derivs_simps(2)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1695	%@{thm (lhs) Derivs_simps(3)[where ?s1.0="s\<^isub>1" and ?s2.0="s\<^isub>2"]} & $=$
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1696	% & @{thm (rhs) Derivs_simps(3)[where ?s1.0="s\<^isub>1" and ?s2.0="s\<^isub>2"]}\\
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1697	\end{tabular}}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1698	\end{equation}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1699
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1700	\noindent
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1701	Note that in the last equation we use the list-cons operator written
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	1702	\mbox{@{text "_ :: _"}}. The only interesting case is the case of @{term "A\<star>"}
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1703	where we use Property~\ref{langprops}@{text "(i)"} in order to infer that
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1704	@{term "Deriv c (A\<star>) = Deriv c (A \<cdot> A\<star>)"}. We can then complete the proof by
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1705	using the fifth equation and noting that @{term "Deriv c (A\<star>) \<subseteq> (Deriv
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1706	c A) \<cdot> A\<star>"} provided @{text "[] \<in> A"}.
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1707
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1708	\citeN{Brzozowski64} observed that the left-quotients for languages of
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1709	regular expressions can be calculated directly using the notion of
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1710	\emph{derivatives of a regular expression}. We define
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1711	this notion in Isabelle/HOL as follows:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1712	%
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1713	\begin{center}
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1714	\begin{longtable}{@ {}l@ {\hspace{1mm}}c@ {\hspace{1.5mm}}l@ {}}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1715	@{thm (lhs) deriv.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) deriv.simps(1)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1716	@{thm (lhs) deriv.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) deriv.simps(2)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1717	@{thm (lhs) deriv.simps(3)[where c'="d"]} & @{text "\<equiv>"} & @{thm (rhs) deriv.simps(3)[where c'="d"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1718	@{thm (lhs) deriv.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1719	& @{text "\<equiv>"} & @{thm (rhs) deriv.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1720	@{thm (lhs) deriv.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1721	& @{text "\<equiv>"} & @{text "if"}~@{term "nullable r\<^isub>1"}~@{text "then"}~%
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1722	@{term "Plus (Times (deriv c r\<^isub>1) r\<^isub>2) (deriv c r\<^isub>2)"}\\
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1723	& & \phantom{@{text "if"}~@{term "nullable r\<^isub>1"}~}@{text "else"}~%
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1724	@{term "Times (deriv c r\<^isub>1) r\<^isub>2"}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1725	@{thm (lhs) deriv.simps(6)} & @{text "\<equiv>"} & @{thm (rhs) deriv.simps(6)}\smallskip\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1726	@{thm (lhs) derivs.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) derivs.simps(1)}\\
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1727	@{thm (lhs) derivs.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) derivs.simps(2)}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1728	\end{longtable}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1729	\end{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1730
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1731	\noindent
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1732	The last two clauses extend derivatives from characters to strings. The
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1733	boolean function @{term "nullable r"} needed in the @{const Times}-case tests
197 cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1734	whether a regular expression can recognise the empty string. It can be defined as
cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1735	follows.
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1736	%
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1737	\begin{center}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1738	\begin{longtable}{@ {}l@ {\hspace{1mm}}c@ {\hspace{1.5mm}}l@ {}}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1739	@{thm (lhs) nullable.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) nullable.simps(1)}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1740	@{thm (lhs) nullable.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) nullable.simps(2)}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1741	@{thm (lhs) nullable.simps(3)} & @{text "\<equiv>"} & @{thm (rhs) nullable.simps(3)}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1742	@{thm (lhs) nullable.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1743	& @{text "\<equiv>"} & @{thm (rhs) nullable.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1744	@{thm (lhs) nullable.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1745	& @{text "\<equiv>"} & @{thm (rhs) nullable.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1746	@{thm (lhs) nullable.simps(6)} & @{text "\<equiv>"} & @{thm (rhs) nullable.simps(6)}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1747	\end{longtable}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1748	\end{center}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1749
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1750	\noindent
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1751	By induction on the regular expression @{text r}, respectively on the string @{text s},
197 cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1752	one can easily show that left-quotients and derivatives of regular expressions
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1753	relate as follows (see for example~\cite{Sakarovitch09}):
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1754	%
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1755	\begin{equation}\label{Dersders}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1756	\mbox{\begin{tabular}{c}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1757	@{thm Deriv_deriv}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1758	@{thm Derivs_derivs}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1759	\end{tabular}}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1760	\end{equation}
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1761
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1762	\noindent
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1763	The importance of this fact in the context of the Myhill-Nerode Theorem is that
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1764	we can use \eqref{mhders} and \eqref{Dersders} in order to
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1765	establish that
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1766
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1767	\begin{center}
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1768	@{term "x \<approx>(lang r) y"} \hspace{4mm}if and only if\hspace{4mm}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1769	@{term "lang (derivs x r) = lang (derivs y r)"}.
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1770	\end{center}
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1771
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1772	\noindent
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1773	holds and hence
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1774	%
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1775	\begin{equation}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1776	@{term "x \<approx>(lang r) y"}\hspace{4mm}\mbox{provided}\hspace{4mm}@{term "derivs x r = derivs y r"}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1777	\end{equation}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1778
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1779
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1780	\noindent
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1781	This means the right-hand side (seen as a relation) refines the Myhill-Nerode
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1782	Relation. Consequently, we can use @{text
197 cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1783	"\<^raw:$\threesim$>\<^bsub>(\<lambda>x. ders x r)\<^esub>"} as a
cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1784	tagging-relation. However, in order to be useful for the second part of the
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1785	Myhill-Nerode Theorem, we have to be able to establish that for the
197 cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1786	corresponding language there are only finitely many derivatives---thus
cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1787	ensuring that there are only finitely many equivalence
cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1788	classes. Unfortunately, this is not true in general. Sakarovitch gives an
cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1789	example where a regular expression has infinitely many derivatives
218 28e98ede8599 added a few points urbanc parents: 217 diff changeset	1790	w.r.t.~the language @{text "(ab)\<^isup>\<star> \<union> (ab)\<^isup>\<star>a"}, which is formally
28e98ede8599 added a few points urbanc parents: 217 diff changeset	1791	written in our notation as \mbox{@{text "{[a,b]}\<^isup>\<star> \<union> ({[a,b]}\<^isup>\<star> \<cdot> {[a]})"}}
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1792	(see \cite[Page~141]{Sakarovitch09}).
197 cf1c17431dab a bit more polishing urbanc parents: 196 diff changeset	1793
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1794
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1795	What \citeN{Brzozowski64} established is that for every language there
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	1796	\emph{are} only finitely `dissimilar' derivatives for a regular
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1797	expression. Two regular expressions are said to be \emph{similar} provided
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1798	they can be identified using the using the @{text "ACI"}-identities:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1799	%
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1800	\begin{equation}\label{ACI}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1801	\mbox{\begin{tabular}{cl}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1802	(@{text A}) & @{term "Plus (Plus r\<^isub>1 r\<^isub>2) r\<^isub>3"} $\equiv$ @{term "Plus r\<^isub>1 (Plus r\<^isub>2 r\<^isub>3)"}\\
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1803	(@{text C}) & @{term "Plus r\<^isub>1 r\<^isub>2"} $\equiv$ @{term "Plus r\<^isub>2 r\<^isub>1"}\\
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1804	(@{text I}) & @{term "Plus r r"} $\equiv$ @{term "r"}\\
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1805	\end{tabular}}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1806	\end{equation}
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1807
2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1808	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1809	Carrying this idea through, we must not consider the set of all derivatives,
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	1810	but the one modulo @{text "ACI"}. In principle, this can be done formally,
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1811	but it is very painful in a theorem prover (since there is no
194 5347d7556487 some typos urbanc parents: 193 diff changeset	1812	direct characterisation of the set of dissimilar derivatives).
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1813
174 2b414a8a7132 more on the section about derivatives urbanc parents: 173 diff changeset	1814
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1815	Fortunately, there is a much simpler approach using \emph{partial
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1816	derivatives}. They were introduced by \citeN{Antimirov95} and can be defined
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1817	in Isabelle/HOL as follows:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1818	%
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	1819	\begin{center}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1820	\begin{longtable}{@ {}l@ {\hspace{1mm}}c@ {\hspace{1.5mm}}l@ {}}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1821	@{thm (lhs) pderiv.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) pderiv.simps(1)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1822	@{thm (lhs) pderiv.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) pderiv.simps(2)}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1823	@{thm (lhs) pderiv.simps(3)[where c'="d"]} & @{text "\<equiv>"} & @{thm (rhs) pderiv.simps(3)[where c'="d"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1824	@{thm (lhs) pderiv.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1825	& @{text "\<equiv>"} & @{thm (rhs) pderiv.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1826	@{thm (lhs) pderiv.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1827	& @{text "\<equiv>"} & @{text "if"}~@{term "nullable r\<^isub>1"}~@{text "then"}~%
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1828	@{term "(Timess (pderiv c r\<^isub>1) r\<^isub>2) \<union> (pderiv c r\<^isub>2)"}\\
177 50cc1a39c990 more one the paper urbanc parents: 176 diff changeset	1829	& & \phantom{@{text "if"}~@{term "nullable r\<^isub>1"}~}@{text "else"}~%
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1830	@{term "Timess (pderiv c r\<^isub>1) r\<^isub>2"}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1831	@{thm (lhs) pderiv.simps(6)} & @{text "\<equiv>"} & @{thm (rhs) pderiv.simps(6)}\smallskip\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1832	@{thm (lhs) pderivs.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) pderivs.simps(1)}\\
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1833	@{thm (lhs) pderivs.simps(2)} & @{text "\<equiv>"} & @{text "\<Union> (pders s) ` (pder c r)"}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1834	\end{longtable}
175 edc642266a82 polished the introduction urbanc parents: 174 diff changeset	1835	\end{center}
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	1836
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1837	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1838	Again the last two clauses extend partial derivatives from characters to strings.
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1839	Unlike `simple' derivatives, the functions for partial derivatives return sets of regular
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1840	expressions. In the @{const Times} and @{const Star} cases we therefore use the
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1841	auxiliary definition
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1842
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1843	\begin{center}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1844	@{text "TIMESS rs r \<equiv> {TIMES r' r \| r' \<in> rs}"}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1845	\end{center}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1846
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1847	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1848	in order to `sequence' a regular expression with a set of regular
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1849	expressions. Note that in the last clause we first build the set of partial
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1850	derivatives w.r.t~the character @{text c}, then build the image of this set under the
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1851	function @{term "pderivs s"} and finally `union up' all resulting sets. It will be
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1852	convenient to introduce for this the following abbreviation
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1853
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1854	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1855	@{abbrev "pderivs_set s rs"}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1856	\end{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1857
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1858	\noindent
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1859	which simplifies the last clause of @{const "pderivs"} to
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1860
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1861	\begin{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1862	\begin{tabular}{@ {}l@ {\hspace{1mm}}c@ {\hspace{1.5mm}}l@ {}}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1863	@{thm (lhs) pderivs.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) pderivs.simps(2)}\\
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1864	\end{tabular}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1865	\end{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1866
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1867	Partial derivatives can be seen as having the @{text "ACI"}-identities already built in:
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1868	taking the partial derivatives of the
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1869	regular expressions in \eqref{ACI} gives us in each case
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1870	equal sets. \citeN{Antimirov95} showed a similar result to
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1871	\eqref{Dersders} for partial derivatives, namely
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1872	%
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1873	\begin{equation}\label{Derspders}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1874	\mbox{\begin{tabular}{lc}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1875	@{text "(i)"} & @{thm Deriv_pderiv}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1876	@{text "(ii)"} & @{thm Derivs_pderivs}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1877	\end{tabular}}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1878	\end{equation}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1879
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1880	\begin{proof}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1881	The first fact is by a simple induction on @{text r}. For the second we slightly
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1882	modify Antimirov's proof by performing an induction on @{text s} where we
194 5347d7556487 some typos urbanc parents: 193 diff changeset	1883	generalise over all @{text r}. That means in the @{text "cons"}-case the
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1884	induction hypothesis is
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1885
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1886	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1887	@{text "(IH)"}\hspace{3mm}@{term "\<forall>r. Derivs s (lang r) = \<Union> lang ` (pderivs s r)"}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1888	\end{center}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1889
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	1890	\noindent
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1891	With this we can establish
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1892
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1893	\begin{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1894	\begin{tabular}{r@ {\hspace{1.5mm}}c@ {\hspace{1.5mm}}ll}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1895	@{term "Derivs (c # s) (lang r)"}
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1896	& @{text "="} & @{term "Derivs s (Deriv c (lang r))"} & by def.\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1897	& @{text "="} & @{term "Derivs s (\<Union> lang ` (pderiv c r))"} & by @{text "("}\ref{Derspders}@{text ".i)"}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1898	& @{text "="} & @{term "\<Union> (Derivs s) ` (lang ` (pderiv c r))"} & by def.~of @{text "Ders"}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1899	& @{text "="} & @{term "\<Union> lang ` (\<Union> pderivs s ` (pderiv c r))"} & by IH\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1900	& @{text "="} & @{term "\<Union> lang ` (pderivs (c # s) r)"} & by def.\\
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1901	\end{tabular}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1902	\end{center}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1903
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1904	\noindent
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1905	Note that in order to apply the induction hypothesis in the fourth equation, we
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1906	need the generalisation over all regular expressions @{text r}. The case for
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1907	the empty string is routine and omitted.
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1908	\end{proof}
9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	1909
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1910	\noindent
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1911	Taking \eqref{Dersders} and \eqref{Derspders} together gives the relationship
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1912	between languages of derivatives and partial derivatives
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1913	%
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1914	\begin{equation}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1915	\mbox{\begin{tabular}{lc}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1916	@{text "(i)"} & @{thm deriv_pderiv[symmetric]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1917	@{text "(ii)"} & @{thm derivs_pderivs[symmetric]}
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1918	\end{tabular}}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1919	\end{equation}
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1920
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1921	\noindent
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1922	These two properties confirm the observation made earlier
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1923	that by using sets, partial derivatives have the @{text "ACI"}-identities
190 b73478aaf33e more on paper urbanc parents: 187 diff changeset	1924	of derivatives already built in.
b73478aaf33e more on paper urbanc parents: 187 diff changeset	1925
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	1926	Antimirov also proved that for every language and every regular expression
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1927	there are only finitely many partial derivatives, whereby the set of partial
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1928	derivatives of @{text r} w.r.t.~a language @{text A} is defined as
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1929	%
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1930	\begin{equation}\label{Pdersdef}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1931	@{thm pderivs_lang_def}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1932	\end{equation}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1933
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1934	\begin{thrm}[\cite{Antimirov95}]\label{antimirov}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1935	For every language @{text A} and every regular expression @{text r},
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1936	\mbox{@{thm finite_pderivs_lang}}.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1937	\end{thrm}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1938
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1939	\noindent
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1940	Antimirov's proof first establishes this theorem for the language @{term UNIV1},
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1941	which is the set of all non-empty strings. For this he proves:
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	1942	%
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	1943	\begin{equation}\label{pdersunivone}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1944	\mbox{\begin{tabular}{l}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1945	@{thm pderivs_lang_Zero}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1946	@{thm pderivs_lang_One}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1947	@{thm pderivs_lang_Atom}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1948	@{thm pderivs_lang_Plus[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1949	@{thm pderivs_lang_Times[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1950	@{thm pderivs_lang_Star}\\
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1951	\end{tabular}}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1952	\end{equation}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1953
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1954	\noindent
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1955	from which one can deduce by induction on @{text r} that
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1956
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1957	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1958	@{thm finite_pderivs_lang_UNIV1}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1959	\end{center}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1960
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1961	\noindent
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1962	holds. Now Antimirov's theorem follows because
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1963
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1964	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1965	@{thm pderivs_lang_UNIV}\\
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1966	\end{center}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1967
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1968	\noindent
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1969	and for all languages @{text "A"}, @{term "pderivs_lang A r"} is a subset of
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1970	@{term "pderivs_lang UNIV r"}. Since we follow Antimirov's proof quite
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	1971	closely in our formalisation (only the last two cases of
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1972	\eqref{pdersunivone} involve some non-routine induction arguments), we omit
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	1973	the details.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1974
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	1975	Let us now return to our proof for the second direction in the Myhill-Nerode
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	1976	Theorem. The point of the above calculations is to use
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	1977	@{text "\<^raw:$\threesim$>\<^bsub>(\<lambda>x. pders x r)\<^esub>"} as tagging-relation.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1978
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1979
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	1980	\begin{proof}[of Theorem~\ref{myhillnerodetwo} (second version)]
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1981	Using \eqref{mhders}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1982	and \eqref{Derspders} we can easily infer that
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1983
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1984	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1985	@{term "x \<approx>(lang r) y"}\hspace{4mm}\mbox{provided}\hspace{4mm}@{term "pderivs x r = pderivs y r"}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1986	\end{center}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1987
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1988	\noindent
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	1989	which means the tagging-relation @{text "\<^raw:$\threesim$>\<^bsub>(\<lambda>x. pders x r)\<^esub>"} refines @{term "\<approx>(lang r)"}.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1990	So we know by Lemma~\ref{fintwo}, \mbox{@{term "finite (UNIV // (\<approx>(lang r)))"}}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1991	holds if @{term "finite (UNIV // (=(\<lambda>x. pderivs x r)=))"}. In order to establish
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1992	the latter, we can use Lemma~\ref{finone} and show that the range of the
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1993	tagging-function \mbox{@{term "\<lambda>x. pderivs x r"}} is finite. For this recall Definition
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1994	\ref{Pdersdef}, which gives us that
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1995
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1996	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	1997	@{thm pderivs_lang_def[where A="UNIV", simplified]}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1998	\end{center}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	1999
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2000	\noindent
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2001	Now the range of @{term "\<lambda>x. pderivs x r"} is a subset of @{term "Pow (pderivs_lang UNIV r)"},
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2002	which we know is finite by Theorem~\ref{antimirov}. Consequently there
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2003	are only finitely many equivalence classes of @{text "\<^raw:$\threesim$>\<^bsub>(\<lambda>x. pders x r)\<^esub>"}.
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2004	This relation refines @{term "\<approx>(lang r)"}, and therefore we can again conclude the
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2005	second part of the Myhill-Nerode Theorem.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2006	\end{proof}
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	2007	*}
e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	2008
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2009	section {* Closure Properties of Regular Languages *}
39 a59473f0229d tuned a little bit the section about finite partitions urbanc parents: 37 diff changeset	2010
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2011	text {*
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2012	\label{closure}
187 9f46a9571e37 more on the derivatives section urbanc parents: 186 diff changeset	2013	\noindent
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2014	The beauty of regular languages is that they are closed under many set
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2015	operations. Closure under union, concatenation and Kleene-star are trivial
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2016	to establish given our definition of regularity (recall Definition~\ref{regular}).
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2017	More interesting in our setting is the closure under complement, because it seems difficult
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2018	to construct a regular expression for the complement language by direct
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2019	means. However the existence of such a regular expression can now be easily
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2020	proved using both parts of the Myhill-Nerode Theorem, since
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2021
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2022	\begin{center}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2023	@{term "s\<^isub>1 \<approx>A s\<^isub>2"} if and only if @{term "s\<^isub>1 \<approx>(-A) s\<^isub>2"}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2024	\end{center}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2025
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2026	\noindent
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2027	holds for any strings @{text "s\<^isub>1"} and @{text
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2028	"s\<^isub>2"}. Therefore @{text A} and the complement language @{term "-A"}
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2029	give rise to the same partitions. So if one is finite, the other is too, and
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2030	vice versa. As noted earlier, our algorithm for solving equational systems
250 b1946e131ce8 small typo urbanc parents: 249 diff changeset	2031	actually calculates a regular expression for the complement language.
b1946e131ce8 small typo urbanc parents: 249 diff changeset	2032	Calculating such a regular expression via
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2033	automata using the standard method would be quite involved. It includes the
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2034	steps: regular expression @{text "\<Rightarrow>"} non-deterministic automaton @{text
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2035	"\<Rightarrow>"} deterministic automaton @{text "\<Rightarrow>"} complement automaton @{text "\<Rightarrow>"}
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2036	regular expression. Clearly not something you want to formalise in a theorem
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2037	prover if it is cumbersome to reason about automata.
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2038
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2039	%Once closure under complement is established, closure under intersection
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2040	%and set difference is also easy, because
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2041	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2042	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2043	%\begin{tabular}{c}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2044	%@{term "A \<inter> B = - (- A \<union> - B)"}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2045	%@{term "A - B = - (- A \<union> B)"}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2046	%\end{tabular}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2047	%\end{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2048	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2049	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2050	%Since all finite languages are regular, then by closure under complement also
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2051	%all co-finite languages.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2052	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2053	%Closure of regular languages under reversal, that is
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2054	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2055	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2056	%@{text "A\<^bsup>-1\<^esup> \<equiv> {s\<^bsup>-1\<^esup> \| s \<in> A}"}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2057	%\end{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2058	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2059	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2060	%can be shown with the help of the following operation defined recursively over
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2061	%regular expressions
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2062	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2063	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2064	%\begin{tabular}{r@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2065	%@{thm (lhs) Rev.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) Rev.simps(1)}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2066	%@{thm (lhs) Rev.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) Rev.simps(2)}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2067	%@{thm (lhs) Rev.simps(3)} & @{text "\<equiv>"} & @{thm (rhs) Rev.simps(3)}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2068	%@{thm (lhs) Rev.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]} & @{text "\<equiv>"} &
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2069	% @{thm (rhs) Rev.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2070	%@{thm (lhs) Rev.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]} & @{text "\<equiv>"} &
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2071	% @{thm (rhs) Rev.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2072	%@{thm (lhs) Rev.simps(6)} & @{text "\<equiv>"} & @{thm (rhs) Rev.simps(6)}\\
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2073	%\end{tabular}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2074	%\end{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2075	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2076	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2077	%For this operation we can show
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2078	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2079	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2080	%@{text "(\<calL>(r))\<^bsup>-1\<^esup>"}~@{text "="}~@{thm (rhs) rev_lang}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2081	%\end{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2082	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2083	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2084	%from which closure under reversal of regular languages follows.
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2085
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2086	A perhaps surprising fact is that regular languages are closed under any
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2087	left-quotient. Define
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2088
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2089	\begin{center}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2090	@{abbrev "Deriv_lang B A"}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2091	\end{center}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2092
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2093	\noindent
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2094	and assume @{text B} is any language and @{text A} is regular, then @{term
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2095	"Deriv_lang B A"} is regular. To see this consider the following argument
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2096	using partial derivatives (which we used in Section~\ref{derivatives}): From @{text A}
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2097	being regular we know there exists
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2098	a regular expression @{text r} such that @{term "A = lang r"}. We also know
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2099	that @{term "pderivs_lang B r"} is finite for every language @{text B} and
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2100	regular expression @{text r} (recall Theorem~\ref{antimirov}). By definition
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2101	and \eqref{Derspders} we have
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2102	%
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2103	\begin{equation}\label{eqq}
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2104	@{term "Deriv_lang B (lang r) = (\<Union> lang ` (pderivs_lang B r))"}
193 2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2105	\end{equation}
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2106
2a5ac68db24b finished section about derivatives and closure properties urbanc parents: 190 diff changeset	2107	\noindent
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2108	Since there are only finitely many regular expressions in @{term "pderivs_lang
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2109	B r"}, we know by \eqref{uplus} that there exists a regular expression so that
203 5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2110	the right-hand side of \eqref{eqq} is equal to the language \mbox{@{term "lang (\<Uplus>(pderivs_lang B
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2111	r))"}}. Thus the regular expression @{term "\<Uplus>(pderivs_lang B r)"} verifies that
5d724fe0e096 changes according to afp-submission urbanc parents: 201 diff changeset	2112	@{term "Deriv_lang B A"} is regular.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2113
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2114	Even more surprising is the fact given first by \citeN{Haines69} stating
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2115	that for \emph{every} language @{text A}, the language
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2116	consisting of all (scattered) substrings of @{text A} is regular (see also
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2117	\cite{Shallit08,Gasarch09}).
258 1abf8586ee6b added slides for a talk in St Andrews urbanc parents: 257 diff changeset	2118	A \emph{(scattered) substring} can be obtained
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2119	by striking out zero or more characters from a string. This can be defined
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2120	inductively in Isabelle/HOL by the following three rules:
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2121
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2122	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2123	%@ {thm[mode=Axiom] emb0[where bs="x"]}\hspace{10mm}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2124	%@ {thm[mode=Rule] emb1[where as="x" and b="c" and bs="y"]}\hspace{10mm}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2125	%@ {thm[mode=Rule] emb2[where as="x" and a="c" and bs="y"]}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2126	%\end{center}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2127	\begin{center}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2128	@{thm[mode=Axiom] emb0}\hspace{10mm}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2129	@{thm[mode=Rule] emb1}\hspace{10mm}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2130	@{thm[mode=Rule] emb2}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2131	\end{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2132
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2133	\noindent
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2134	It is straightforward to prove that @{text "\<preceq>"} is a partial order. Now define the
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2135	\emph{language of substrings} and \emph{superstrings} of a language @{text A}
e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2136	respectively as
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2137
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2138	\begin{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2139	\begin{tabular}{l}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2140	@{thm SUBSEQ_def}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2141	@{thm SUPSEQ_def}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2142	\end{tabular}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2143	\end{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2144
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2145	\noindent
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2146	We like to establish
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2147
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2148	\begin{thrm}[\cite{Haines69}]\label{subseqreg}
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2149	For every language @{text A}, the languages @{text "(i)"} @{term "SUBSEQ A"} and
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2150	@{text "(ii)"} @{term "SUPSEQ A"}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2151	are regular.
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2152	\end{thrm}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2153
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2154	\noindent
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2155	Our proof follows the one given by \citeN[Pages 92--95]{Shallit08}, except that we use
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2156	Higman's Lemma, which is already proved in the Isabelle/HOL library by
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2157	Sternagel.
256 acbae3a11fb5 set -> language urbanc parents: 254 diff changeset	2158	Higman's Lemma allows us to infer that every language @{text A} of antichains, satisfying
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2159	%
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2160	\begin{equation}\label{higman}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2161	@{text "\<forall>x, y \<in> A."}~@{term "x \<noteq> y \<longrightarrow> \<not>(x \<preceq> y) \<and> \<not>(y \<preceq> x)"}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2162	\end{equation}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2163
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2164	\noindent
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2165	is finite.
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2166
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2167	The first step in our proof of Theorem~\ref{subseqreg} is to establish the
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2168	following simple properties for @{term SUPSEQ}
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2169	%
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2170	\begin{equation}\label{supseqprops}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2171	\mbox{\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2172	@{thm (lhs) SUPSEQ_simps(1)} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_simps(1)}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2173	@{thm (lhs) SUPSEQ_simps(2)} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_simps(2)}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2174	@{thm (lhs) SUPSEQ_atom} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_atom}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2175	@{thm (lhs) SUPSEQ_union} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_union}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2176	@{thm (lhs) SUPSEQ_conc} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_conc}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2177	@{thm (lhs) SUPSEQ_star} & @{text "\<equiv>"} & @{thm (rhs) SUPSEQ_star}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2178	\end{tabular}}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2179	\end{equation}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2180
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2181	\noindent
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2182	whereby the last equation follows from the fact that @{term "A\<star>"} contains the
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2183	empty string. With these properties at our disposal we can establish the lemma
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2184
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2185	\begin{lmm}
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2186	If @{text A} is regular, then also @{term "SUPSEQ A"}.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2187	\end{lmm}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2188
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2189	\begin{proof}
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	2190	Since our alphabet is finite, we have a regular expression, written @{text ALL}, that
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2191	matches every string. Using this regular expression we can inductively define
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	2192	the operation @{text "r\<up>"}
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2193
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2194	\begin{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2195	\begin{tabular}{l@ {\hspace{1mm}}c@ {\hspace{1mm}}l}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2196	@{thm (lhs) UP.simps(1)} & @{text "\<equiv>"} & @{thm (rhs) UP.simps(1)}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2197	@{thm (lhs) UP.simps(2)} & @{text "\<equiv>"} & @{thm (rhs) UP.simps(2)}\\
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	2198	@{thm (lhs) UP.simps(3)} & @{text "\<equiv>"} & @{thm (rhs) UP.simps(3)}\\
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2199	@{thm (lhs) UP.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]} &
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2200	@{text "\<equiv>"} & @{thm (rhs) UP.simps(4)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2201	@{thm (lhs) UP.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]} &
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2202	@{text "\<equiv>"} & @{thm (rhs) UP.simps(5)[where ?r1.0="r\<^isub>1" and ?r2.0="r\<^isub>2"]}\\
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2203	@{thm (lhs) UP.simps(6)} & @{text "\<equiv>"} & @{thm (rhs) UP.simps(6)}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2204	\end{tabular}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2205	\end{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2206
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2207	\noindent
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	2208	and use \eqref{supseqprops} to establish that @{thm lang_UP} holds. This shows
251 821ff177a478 corrected typo found by Xingyuan urbanc parents: 250 diff changeset	2209	that @{term "SUPSEQ A"} is regular, provided @{text A} is.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2210	\end{proof}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2211
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2212	\noindent
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2213	Now we can prove the main lemma w.r.t.~@{const "SUPSEQ"}, namely
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2214
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2215	\begin{lmm}\label{mset}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2216	For every language @{text A}, there exists a finite language @{text M} such that
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2217	\begin{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2218	\mbox{@{term "SUPSEQ M = SUPSEQ A"}}\;.
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2219	\end{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2220	\end{lmm}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2221
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2222	\begin{proof}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2223	For @{text M} we take the set of all minimal elements of @{text A}. An element @{text x}
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2224	is said to be \emph{minimal} in @{text A} provided
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2225
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2226	\begin{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2227	@{thm minimal_def}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2228	\end{center}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2229
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2230	\noindent
239 13de6a49294e polished SUBSEQ urbanc parents: 238 diff changeset	2231	By Higman's Lemma \eqref{higman} we know
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2232	that @{term "M \<equiv> {x \<in> A. minimal x A}"} is finite, since every minimal element is incomparable,
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2233	except with itself.
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2234	It is also straightforward to show that @{term "SUPSEQ M \<subseteq> SUPSEQ A"}. For
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2235	the other direction we have @{term "x \<in> SUPSEQ A"}. From this we obtain
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2236	a @{text y} such that @{term "y \<in> A"} and @{term "y \<preceq> x"}. Since we have that
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2237	the relation \mbox{@{term "{(y, x). y \<preceq> x \<and> x \<noteq> y}"}} is well-founded, there must
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2238	be a minimal element @{text "z"} such that @{term "z \<in> A"} and @{term "z \<preceq> y"},
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2239	and hence by transitivity also \mbox{@{term "z \<preceq> x"}} (here we deviate from the argument
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2240	given by \citeN{Shallit08}, because Isabelle/HOL provides already an extensive infrastructure
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2241	for reasoning about well-foundedness). Since @{term "z"} is
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2242	minimal and an element in @{term A}, we also know that @{term z} is in @{term M}.
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2243	From this together with \mbox{@{term "z \<preceq> x"}}, we can infer that @{term x} is in
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2244	@{term "SUPSEQ M"}, as required.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2245	\end{proof}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2246
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2247	\noindent
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2248	This lemma allows us to establish the second part of Theorem~\ref{subseqreg}.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2249
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2250	\begin{proof}[of the Second Part of Theorem~\ref{subseqreg}]
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2251	Given any language @{text A}, by Lemma~\ref{mset} we know there exists
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2252	a finite, and thus regular, language @{text M}. We further have @{term "SUPSEQ M = SUPSEQ A"},
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2253	which establishes the second part.
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2254	\end{proof}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2255
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2256	\noindent
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2257	In order to establish the first part of this theorem, we use the
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2258	property proved by \citeN{Shallit08}, namely that
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2259	%
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2260	\begin{equation}\label{compl}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2261	@{thm SUBSEQ_complement}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2262	\end{equation}
e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2263
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2264	\noindent
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2265	holds. Now the first part of Theorem~\ref{subseqreg} is a simple consequence of the second part.
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2266
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2267	\begin{proof}[of the First Part of Theorem~\ref{subseqreg}]
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2268	By the second part, we know the right-hand side of \eqref{compl}
237 e02155fe8136 slight polishing to SUBSEQ urbanc parents: 233 diff changeset	2269	is regular, which means @{term "- SUBSEQ A"} is regular. But since
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2270	we established already that regularity is preserved under complement (using Myhill-Nerode),
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2271	also @{term "SUBSEQ A"} must be regular.
233 e2dc11e12e0b added section about SUBSEQ and SUPSEQ urbanc parents: 218 diff changeset	2272	\end{proof}
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2273
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2274	Finally we like to show that the Myhill-Nerode Theorem is also convenient for establishing
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2275	the non-regularity of languages. For this we use the following version of the Continuation
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2276	Lemma (see for example~\cite{Rosenberg06}).
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2277
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2278	\begin{lmm}[(Continuation Lemma)]
257 f512026d5d6e small change urbanc parents: 256 diff changeset	2279	If a language @{text A} is regular and a set of strings @{text B} is infinite,
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2280	then there exist two distinct strings @{text x} and @{text y} in @{text B}
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2281	such that @{term "x \<approx>A y"}.
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2282	\end{lmm}
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2283
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2284	\noindent
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2285	This lemma can be easily deduced from the Myhill-Nerode Theorem and the Pigeonhole
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2286	Principle: Since @{text A} is regular, there can be only finitely many
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2287	equivalence classes. Hence an infinite set must contain
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2288	at least two strings that are in the same equivalence class, that is
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2289	they need to be related by the Myhill-Nerode Relation.
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2290
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2291	Using this lemma, it is straightforward to establish that the language
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2292	\mbox{@{text "A \<equiv> \<Union>\<^isub>n a\<^sup>n @ b\<^sup>n"}} is not regular (@{text "a\<^sup>n"} stands
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2293	for the strings consisting of @{text n} times the character a; similarly for
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2294	@{text "b\<^isup>n"}). For this consider the infinite set @{text "B \<equiv> \<Union>\<^isub>n a\<^sup>n"}.
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2295
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2296	\begin{lmm}
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2297	No two distinct strings in set @{text "B"} are Myhill-Nerode related by language @{text A}.
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2298	\end{lmm}
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2299
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2300	\begin{proof}
252 8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2301	After unfolding the definition of @{text "B"}, we need to establish that given @{term "i \<noteq> j"},
8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2302	the strings @{text "a\<^sup>i"} and @{text "a\<^sup>j"} are not Myhill-Nerode related by @{text "A"}.
8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2303	That means we have to show that \mbox{@{text "\<forall>z. a\<^sup>i @ z \<in> A = a\<^sup>j @ z \<in> A"}} leads to
8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2304	a contradiction. Let us take @{text "b\<^sup>i"} for @{text "z"}. Then we know @{text "a\<^sup>i @ b\<^sup>i \<in> A"}.
8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2305	But since @{term "i \<noteq> j"}, @{text "a\<^sup>j @ b\<^sup>i \<notin> A"}. Therefore @{text "a\<^sup>i"} and @{text "a\<^sup>j"}
254 c21aaf7723a0 added paper that recently appeared about rexps and pegs urbanc parents: 253 diff changeset	2306	cannot be Myhill-Nerode related by @{text "A"}, and we are done.
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2307	\end{proof}
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2308
17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2309	\noindent
252 8e2c497d699e clarified proof about non-regularity urbanc parents: 251 diff changeset	2310	To conclude the proof of non-regularity for the language @{text A}, the
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2311	Continuation Lemma and the lemma above lead to a contradiction assuming
47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2312	@{text A} is regular. Therefore the language @{text A} is not regular, as we
47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2313	wanted to show.
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2314	*}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2315
117 22ba25b808c8 updated second direction urbanc parents: 116 diff changeset	2316
240 17aa8c8fbe7d added section about non-regularity urbanc parents: 239 diff changeset	2317
54 c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	2318	section {* Conclusion and Related Work *}
c19d2fc2cc69 a bit more on the paper urbanc parents: 53 diff changeset	2319
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	2320	text {*
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2321	\noindent
112 62fdb4bf7239 more on the conclusion urbanc parents: 111 diff changeset	2322	In this paper we took the view that a regular language is one where there
115 c5f138b5fc88 added comment from Larry urbanc parents: 114 diff changeset	2323	exists a regular expression that matches all of its strings. Regular
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2324	expressions can conveniently be defined as a datatype in theorem
145 099e20f25b25 corrected small typo urbanc parents: 143 diff changeset	2325	provers. For us it was therefore interesting to find out how far we can push
154 7c68b9ad4486 implemented most suggestions from the reviewers urbanc parents: 149 diff changeset	2326	this point of view. We have established in Isabelle/HOL both directions
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2327	of the Myhill-Nerode Theorem.
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	2328	%
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2329	\begin{thrm}[(Myhill-Nerode Theorem)]\mbox{}\\
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	2330	A language @{text A} is regular if and only if @{thm (rhs) Myhill_Nerode}.
167 61d0a412a3ae added a journal version urbanc parents: 162 diff changeset	2331	\end{thrm}
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2332
132 f77a7138f791 comments by Xingyuan urbanc parents: 131 diff changeset	2333	\noindent
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2334	Having formalised this theorem means we pushed our point of view quite
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2335	far. Using this theorem we can obviously prove when a language is \emph{not}
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2336	regular---by establishing that it has infinitely many equivalence classes
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2337	generated by the Myhill-Nerode Relation (this is usually the purpose of the
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2338	Pumping Lemma). We can also use it to establish the standard
186 07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2339	textbook results about closure properties of regular languages. Interesting
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2340	is the case of closure under complement, because it seems difficult to
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2341	construct a regular expression for the complement language by direct
07a269d9642b added more to the derivatives section urbanc parents: 185 diff changeset	2342	means. However the existence of such a regular expression can be easily
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2343	proved using the Myhill-Nerode Theorem.
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2344
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2345	%Our insistence on regular expressions for proving the Myhill-Nerode Theorem
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2346	%arose from the problem of defining formally the regularity of a language.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2347	%In order to guarantee consistency,
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2348	%formalisations in HOL can only extend the logic with definitions that introduce a new concept in
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2349	%terms of already existing notions. A convenient definition for automata
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2350	%(based on graphs) uses a polymorphic type for the state nodes. This allows
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2351	%us to use the standard operation for disjoint union whenever we need to compose two
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2352	%automata. Unfortunately, we cannot use such a polymorphic definition
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2353	%in HOL as part of the definition for regularity of a language (a predicate
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2354	%over sets of strings). Consider for example the following attempt:
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2355	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2356	%\begin{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2357	%@{text "is_regular A \<equiv> \<exists>M(\<alpha>). is_dfa (M) \<and> \<calL>(M) = A"}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2358	%\end{center}
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2359	%
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2360	%\noindent
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2361	%In this definifion, the definiens is polymorphic in the type of the automata
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2362	%@{text "M"} (indicated by dependency on the type-variable @{text "\<alpha>"}), but the definiendum
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2363	%@{text "is_regular"} is not. Such definitions are excluded from HOL, because
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2364	%they can lead easily to inconsistencies (see \cite{PittsHOL4} for a simple
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2365	%example). Also HOL does not contain type-quantifiers which would allow us to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2366	%get rid of the polymorphism by quantifying over the type-variable
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2367	%@{text "\<alpha>"}. Therefore when defining regularity in terms of automata, the
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2368	%natural way out in HOL is to resort to state nodes with an identity, for
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2369	%example a natural number. Unfortunatly, the consequence is that we have to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2370	%be careful when combining two automata so that there is no clash between two
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2371	%such states. This makes formalisations quite fiddly and rather
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2372	%unpleasant. Regular expressions proved much more convenient for reasoning in
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2373	%HOL since they can be defined as inductive datatype and a reasoning
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2374	%infrastructure comes for free. The definition of regularity in terms of
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2375	%regular expressions poses no problem at all for HOL. We showed in this
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2376	%paper that they can be used for establishing the central result in regular
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2377	%language theory---the Myhill-Nerode Theorem.
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2378
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2379	While regular expressions are convenient, they have some limitations. One is
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2380	that there seems to be no method of calculating a minimal regular expression
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2381	(for example in terms of length) for a regular language, like there is for
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2382	automata. On the other hand, efficient regular expression matching, without
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2383	using automata, poses no problem as shown by \citeN{OwensReppyTuron09}. For an
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2384	implementation of a simple regular expression matcher, whose correctness has
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2385	been formally established, we refer the reader to
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2386	\citeN{OwensSlind08}. In our opinion, their formalisation is considerably
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2387	slicker than for example the approach to regular expression matching taken
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2388	by \citeN{Harper99} and by \citeN{Yi06}.
116 342983676c8f included comments by Chunhan urbanc parents: 115 diff changeset	2389
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2390	Our proof of the first direction is very much inspired by \emph{Brzozowski's
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2391	algebraic method} \citeyear{Brzozowski64} used to convert a finite automaton to a regular expression.
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2392	The close connection can be seen by considering the
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2393	equivalence classes as the states of the minimal automaton for the regular
201 9fbf6d9f85ae added comments by Xingyuan urbanc parents: 200 diff changeset	2394	language. However there are some subtle differences. Because our equivalence
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2395	classes (or correspondingly states) arise from the Myhill-Nerode Relation, the most natural
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2396	choice is to characterise each state with the set of strings starting from
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2397	the initial state leading up to that state. Usually, however, the states are
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2398	characterised as the strings starting from that state leading to the
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2399	terminal states. The first choice has consequences about how the initial
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2400	equational system is set up. We have the $\lambda$-term on our `initial
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2401	state', while Brzozowski has it on the terminal states. This means we also
fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2402	need to reverse the direction of Arden's Lemma. We have not found anything
249 061b32d78471 a final polishing before submitting later this week urbanc parents: 248 diff changeset	2403	in the `pencil-and-paper-reasoning' literature about our way of proving the
061b32d78471 a final polishing before submitting later this week urbanc parents: 248 diff changeset	2404	first direction of the Myhill-Nerode Theorem, but it appears to be folklore.
112 62fdb4bf7239 more on the conclusion urbanc parents: 111 diff changeset	2405
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2406	We presented two proofs for the second direction of the Myhill-Nerode
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2407	Theorem. One direct proof using tagging-functions and another using partial
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2408	derivatives. This part of our work is where our method using regular
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2409	expressions shines, because we can completely side-step the standard
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2410	argument (for example used by \citeN{Kozen97}) where automata need to be composed. However, it is
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2411	also the direction where we had to spend most of the `conceptual' time, as
b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2412	our first proof based on tagging-functions is new for establishing the
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2413	Myhill-Nerode Theorem. All standard proofs of this direction proceed by
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2414	arguments over automata.
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2415
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2416	The indirect proof for the second direction arose from our interest in
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2417	Brzozowski's derivatives for regular expression matching. While \citeN{Brzozowski64}
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2418	already established that there are only
196 fa8d33d13cb6 polishing of the closure section and conclusion urbanc parents: 194 diff changeset	2419	finitely many dissimilar derivatives for every regular expression, this
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2420	result is not as straightforward to formalise in a theorem prover as one
11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2421	might wish. The reason is that the set of dissimilar derivatives is not
11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2422	defined inductively, but in terms of an ACI-equivalence relation. This
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2423	difficulty prevented for example \citeN{KraussNipkow11} to prove termination of
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2424	their equivalence checker for regular expressions. Their checker is based on Brzozowski's derivatives
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2425	and for their argument the lack of a formal proof of termination is not
11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2426	crucial (it merely lets them ``sleep better'' \cite{KraussNipkow11}). We
11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2427	expect that their development simplifies by using partial derivatives,
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2428	instead of derivatives, and that the termination of the algorithm can be
40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2429	formally established (the main ingredient is
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2430	Theorem~\ref{antimirov}). However, since partial derivatives use sets of
11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2431	regular expressions, one needs to carefully analyse whether the resulting
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2432	algorithm is still executable. Given the infrastructure for
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2433	executable sets introduced by \citeN{Haftmann09} in Isabelle/HOL, it should.
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2434
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2435	We started out by claiming that in a theorem prover it is eaiser to reason
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2436	about regular expressions than about automta. Here are some numbers:
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2437	Our formalisation of the Myhill-Nerode Theorem consists of 780 lines of
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2438	Isabelle/Isar code for the first direction and 460 for the second (the one
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2439	based on tagging-functions), plus around 300 lines of standard material
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2440	about regular languages. The formalisation of derivatives and partial
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2441	derivatives shown in Section~\ref{derivatives} consists of 390 lines of
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2442	code. The closure properties in Section~\ref{closure} (except Theorem~\ref{subseqreg})
087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2443	can be established in
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2444	100 lines of code. The Continuation Lemma and the non-regularity of @{text "a\<^sup>n b\<^sup>n"}
253 bcef7669f55a polished the non-regularity proof urbanc parents: 252 diff changeset	2445	require 70 lines of code.
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2446	The algorithm for solving equational systems, which we
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2447	used in the first direction, is conceptually relatively simple. Still the
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2448	use of sets over which the algorithm operates means it is not as easy to
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2449	formalise as one might wish. However, it seems sets cannot be avoided since
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2450	the `input' of the algorithm consists of equivalence classes and we cannot
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2451	see how to reformulate the theory so that we can use lists or matrices. Lists would be
200 204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2452	much easier to reason about, since we can define functions over them by
204856ef5573 added an example for non-regularity and continuation lemma (the example does not yet work) urbanc parents: 199 diff changeset	2453	recursion. For sets we have to use set-comprehensions, which is slightly
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2454	unwieldy. Matrices would allow us to use the slick formalisation by
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2455	\citeN{Nipkow11} of the Gauss-Jordan algorithm.
172 21ee3a852a02 more on the journal paper urbanc parents: 170 diff changeset	2456
199 11c3c302fa2e a little tuning urbanc parents: 198 diff changeset	2457	While our formalisation might appear large, it should be seen
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2458	in the context of the work done by \citeN{Constable00} who
248 47446f111550 one more itteration on the paper urbanc parents: 247 diff changeset	2459	formalised the Myhill-Nerode Theorem in Nuprl using automata. They write
334 d47c2143ab8a partially updated conference paper; slightly tuned journal paper urbanc parents: 259 diff changeset	2460	that their four-member team would need something on the magnitude of 18 months
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2461	for their formalisation of the first eleven chapters of the textbook by \citeN{HopcroftUllman69},
334 d47c2143ab8a partially updated conference paper; slightly tuned journal paper urbanc parents: 259 diff changeset	2462	which includes the Myhill-Nerode theorem. It is hard to gauge the size of a
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2463	formalisation in Nurpl, but from what is shown in the Nuprl Math Library
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2464	about their development it seems \emph{substantially} larger than ours. We attribute
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2465	this to our use of regular expressions, which meant we did not need to `fight'
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2466	the theorem prover. Recently, \citeN{LammichTuerk12} formalised Hopcroft's
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2467	algorithm in Isabelle/HOL (in 7000 lines of code) using an automata
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2468	library of 27000 lines of code.
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2469	Also, \citeN{Filliatre97} reports that his formalisation in
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2470	Coq of automata theory and Kleene's theorem is ``rather big''.
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2471	\citeN{Almeidaetal10} reported about another
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2472	formalisation of regular languages in Coq. Their
218 28e98ede8599 added a few points urbanc parents: 217 diff changeset	2473	main result is the
28e98ede8599 added a few points urbanc parents: 217 diff changeset	2474	correctness of Mirkin's construction of an automaton from a regular
28e98ede8599 added a few points urbanc parents: 217 diff changeset	2475	expression using partial derivatives. This took approximately 10600 lines
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2476	of code. \citeN{Braibant12} formalised a large part of regular language
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2477	theory and Kleene algebras in Coq. While he is mainly interested
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2478	in implementing decision procedures for Kleene algebras, his library
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2479	includes a proof of the Myhill-Nerode theorem. He reckons that our
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2480	``development is more concise'' than his one based on matrices \cite[Page 67]{Braibant12}.
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2481	He writes that there is no conceptual problems with formally reasoning
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2482	about matrices for automata, but notes ``intrinsic difficult[ies]'' when working
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2483	with matrices in Coq, which is the sort of `fighting' one would encounter also
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2484	in other theorem provers.
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2485
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2486	In terms of time, the estimate for our formalisation is that we
198 b300f2c5d51d final(?) version of the paper urbanc parents: 197 diff changeset	2487	needed approximately 3 months and this included the time to find our proof
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2488	arguments. Unlike \citeN{Constable00}, who were able to follow the Myhill-Nerode
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2489	proof by \citeN{HopcroftUllman69}, we had to find our own arguments. So for us the
245 40b8d485ce8d polished a bit the journal paper urbanc parents: 242 diff changeset	2490	formalisation was not the bottleneck. The code of
372 2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2491	our formalisation \cite{myhillnerodeafp11} can be found in the Archive of Formal Proofs at
2c56b20032a7 made changes and updates to the journal paper Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 350 diff changeset	2492	\mbox{\url{http://afp.sourceforge.net/entries/Myhill-Nerode.shtml}}.\smallskip
162 e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	2493
e93760534354 added directory for journal version; took uptodate version of the theory files urbanc parents: 160 diff changeset	2494	\noindent
173 d371536861bc more on the introduction of the journal paper urbanc parents: 172 diff changeset	2495	{\bf Acknowledgements:}
242 093e45c44d91 polished proposal urbanc parents: 240 diff changeset	2496	We are grateful for the comments we received from Larry Paulson. Tobias
247 087e6c255e33 some more polishing and a link to Haines urbanc parents: 245 diff changeset	2497	Nipkow made us aware of the properties in Theorem~\ref{subseqreg} and Tjark
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2498	Weber helped us with proving them. Christian Sternagel provided us with a
8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2499	version of Higman's Lemma that applies to arbtrary, but finite alphabets.
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2500
350 8ce9a432680b made changes for another journal submission of the MN-paper urbanc parents: 348 diff changeset	2501	\bibliographystyle{acmtrans}
348 bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2502	\bibliography{root}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2503
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2504	\newpage
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2505	\begin{appendix}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2506	\section{Appendix$^\star$}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2507
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2508	\renewcommand{\thefootnote}{\mbox{$\star$}}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2509	\footnotetext{If the reviewers deem more suitable, the authors are
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2510	prepared to drop material or move it to an electronic appendix.}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2511
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2512	\begin{proof}[of Lemma~\ref{arden}]
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2513	For the right-to-left direction we assume @{thm (rhs) reversed_Arden} and show
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2514	that @{thm (lhs) reversed_Arden} holds. From Property~\ref{langprops}@{text "(i)"}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2515	we have @{term "A\<star> = A \<cdot> A\<star> \<union> {[]}"},
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2516	which is equal to @{term "A\<star> = A\<star> \<cdot> A \<union> {[]}"}. Adding @{text B} to both
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2517	sides gives @{term "B \<cdot> A\<star> = B \<cdot> (A\<star> \<cdot> A \<union> {[]})"}, whose right-hand side
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2518	is equal to @{term "(B \<cdot> A\<star>) \<cdot> A \<union> B"}. Applying the assumed equation
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2519	completes this direction.
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2520
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2521	For the other direction we assume @{thm (lhs) reversed_Arden}. By a simple induction
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2522	on @{text n}, we can establish the property
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2523
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2524	\begin{center}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2525	@{text "(*)"}\hspace{5mm} @{thm (concl) reversed_arden_helper}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2526	\end{center}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2527
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2528	\noindent
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2529	Using this property we can show that @{term "B \<cdot> (A \<up> n) \<subseteq> X"} holds for
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2530	all @{text n}. From this we can infer @{term "B \<cdot> A\<star> \<subseteq> X"} using the definition
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2531	of @{text "\<star>"}.
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2532	For the inclusion in the other direction we assume a string @{text s}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2533	with length @{text k} is an element in @{text X}. Since @{thm (prem 1) reversed_Arden}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2534	we know by Property~\ref{langprops}@{text "(ii)"} that
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2535	@{term "s \<notin> X \<cdot> (A \<up> Suc k)"} since its length is only @{text k}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2536	(the strings in @{term "X \<cdot> (A \<up> Suc k)"} are all longer).
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2537	From @{text "(*)"} it follows then that
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2538	@{term s} must be an element in @{term "(\<Union>m\<le>k. B \<cdot> (A \<up> m))"}. This in turn
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2539	implies that @{term s} is in @{term "(\<Union>n. B \<cdot> (A \<up> n))"}. Using Property~\ref{langprops}@{text "(iii)"}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2540	this is equal to @{term "B \<cdot> A\<star>"}, as we needed to show.
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2541	\end{proof}
bea94f1e6771 made changes for another journal submission of the MH-paper urbanc parents: 338 diff changeset	2542	\end{appendix}
92 a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	2543	*}
a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	2544
a9ebc410a5c8 more on paper urbanc parents: 90 diff changeset	2545
24 f72c82bf59e5 added paper urbanc parents: diff changeset	2546	(<)
f72c82bf59e5 added paper urbanc parents: diff changeset	2547	end
f72c82bf59e5 added paper urbanc parents: diff changeset	2548	(>)

author	Christian Urban <christian dot urban at kcl dot ac dot uk>
	Mon, 03 Dec 2012 08:16:58 +0000
changeset 372	2c56b20032a7
parent 350	8ce9a432680b
child 374	01d223421ba0
permissions	-rw-r--r--