cst_tests: comparison corr_pr

equal deleted inserted replaced

-:2c02d27ec0a3
+:2b95bcd2ac73
 \usepackage[english]{babel}
 \usepackage{listings}
 \usepackage{amsthm}
 \usepackage{hyperref}
 \usepackage[margin=0.5in]{geometry}
+\usepackage{pmboxdraw}
 \theoremstyle{theorem}
 \newtheorem{theorem}{Theorem}
 \theoremstyle{lemma}
 \newtheorem{lemma}{Lemma}
 We might as well omit the prime in each rs for simplicty of notation and prove $d ALTS(bs, rs1++rs2) \sim_{m\epsilon} d ALTS(bs, rs1++(rs2 -- rs1) )   $.\\
 We know that the result of derivative is nullable, so there must exist an r in rs1++rs2 s.t. r is nullable.\\
 If $r \in rs1$, then equivalence holds. If $r \in rs2 \wedge  r \notin rs1$, equivalence holds as well. This completes the proof.
 \end{proof}
+\begin{lemma}{nullability relation between a regex and its simplified version}\label{lma9}\\
+$r\ nullable \iff sr\ nullable $
+\end{lemma}
+\begin{lemma}{concatenation + simp invariance of mkepsBC}\label{lma10}\\
+$mkepsBC r1 \cdot sr2 = mkepsBC r1 \cdot r2$ if both r1 and r2 are nullable.
+\end{lemma}
 \begin{theorem}{Correctness Result}
 \begin{itemize}
 Now we proceed to prove that $sdr \sim_{m\epsilon} dsr$. This can be reduced to proving $dr \sim_{m\epsilon} dsr$ as we know that $dr \sim_{m\epsilon} sdr$ by \autoref{lma2}.\\
 we use an induction proof. Base cases are omitted. Here are the 3 inductive cases.
 \begin{itemize}
-\item{$r_1+r_2$}
+\item{$r_1+r_2$}\\
-$r_1+r_2$\\
 The most difficult case is when $sr1$ and $sr2$ are both ALTS, so that they will be opened up in the flats function and some terms in sr2 might be deleted. Or otherwise we can use the argument that $d(r_1+r_2) = dr_1 + dr_2  \sim_{m\epsilon} dsr_1 + dsr_2 \sim_{m\epsilon} ds(r_1+r_2)$,
 the last equivalence being established by \autoref{lma3}.
 When $s(r_1), s(r_2)$ are both ALTS, we have to be more careful for the last equivalence step, namelly, $dsr_1 + dsr_2 \sim_{m\epsilon} ds(r_1+r_2)$. \\
 We have that $LHS = dsr_1 + dsr_2 = d(sr_1 +sr_2)$. Since $sr_1 = ALTS(bs1, rs1)$ and $sr_2 = ALTS(bs2, rs2)$ we have  $  d(sr_1 +sr_2 ) \sim_{m\epsilon} d(ALTS(bs, bs1>>rs1 ++ bs2>>rs2))$ by \autoref{lma4}.
 On the other hand, $RHS = ds(ALTS(bs, r1, r2)) = d Co(bs, dB(flats(s(r1), s(r2)) ) ) = d Co(bs, dB(bs1>>rs1 ++ bs2>>rs2))$ by definition of bsimp and flats.\\
 $d Co(bs, dB(bs1>>rs1 ++ bs2>>rs2)) = d Co(bs, (bs1>>rs1 ++ ((bs2>>rs2)--rs1))) $ by \autoref{lma5}.\\ $d Co(bs, (bs1>>rs1 ++ ((bs2>>rs2)--rs1)        )) = d(ALTS(bs, bs1>>rs1 ++ (bs2>>rs2)--rs1))$ by \autoref{lma6}. \\
 Using \autoref{lma8}, we have  $d(ALTS(bs, bs1>>rs1 ++ (bs2>>rs2)--rs1)) \sim_{m\epsilon} d(ALTS(bs, bs1>>rs1 ++ bs2>>rs2)) \sim_{m\epsilon}  RHS$.\\
 %only the last part we don't have an equality here. We use mkeps equivalence instead because these two regexes are indeed different syntactically. However, they have even more in common than just mkeps equal. We might later augment this proof so that this equivalence relation is changed to something like regular expression equivalence.
 This completes the proof.
 \item{$r*$}\\
-s(r*) = s(r).
+$s(r*) = r*$. Our goal is trivially achieved.
-\item{$r1.r2$}\\
+\item{$r1 \cdot r2$}\\
-using previous.
+When r1 is nullable, $ds r1r2 = dsr1 \cdot sr2 + dsr2  \sim_{m\epsilon}  dr1 \cdot sr2 + dr2 = dr1 \cdot r2 + dr2 $. The last step uses \autoref{lma10}.
+When r1 is not nullable, $ds r1r2 = dsr1 \cdot sr2  \sim_{m\epsilon}  dr1 \cdot sr2  \sim_{m\epsilon}  dr1 \cdot r2 $
 \end{itemize}
 \item
 Proof of second part of the theorem: use a similar structure of argument as in the first part.
+\item
+This proof has a major flaw: it assumes all dr is nullable along the path of deriving r by s. But it could be the case that $s\in L(r)$ but $ \exists s' \in Pref(s) \ s.t.\ ders(s', r)$ is not nullable (or equivalently, $s'\notin L(r)$). One remedy for this is to replace the mkepsBC equivalence relation into some other transitive relation that entails mkepsBC equivalence.
 \end{itemize}
 \end{proof}
+\begin{theorem}{
+This is a very strong claim that has yet to be more carefully examined and proved. However, experiments suggest a very good hope for this.}\\
+Define pushbits as the following:\\
+$pushbits(r) = if( r == ALTS(bs, rs) )\ then\ ALTS(Nil, rs.map(fuse(bs,\_)))  \ else\ r$.\\
+Then we have \mbox{\boldmath$pushbits(ders\_simp(ar, s) ) == simp(ders(ar,s)) \ or\ ders\_simp(ar, s) == simp(ders(ar, s))$}.\\
+Unfortunately this does not hold. A counterexample is\\
+\begin{verbatim}
+r = ASTAR(List(),ASEQ(List(),AALTS(List(),List(ACHAR(List(Z),c), ACHAR(List(S),b))),ASEQ(List(),ASTAR(List(),ACHAR(List(),a)),AALTS(List(),List(ACHAR(List(Z),a), ACHAR(List(S),a))))))
+regex after ders simp
+SEQ
+└-ALT List(S, S, C(b))
+|  └-SEQ
+|  |  └-STA List(S, C(a), S, C(a))
+|  |  |  └-a
+|  |  └-a List(Z)
+|  └-ONE List(S, C(a), Z, Z, C(a))
+└-STA
+└-SEQ
+└-ALT
+|  └-c List(Z)
+|  └-b List(S)
+└-SEQ
+└-STA
+|  └-a
+└-ALT
+└-a List(Z)
+└-a List(S)
+regex after ders and then a single simp
+SEQ
+└-ALT List(S)
+|  └-SEQ List(S, C(b))
+|  |  └-STA List(S, C(a), S, C(a))
+|  |  |  └-a
+|  |  └-a List(Z)
+|  └-ONE List(S, C(b), S, C(a), Z, Z, C(a))
+└-STA
+└-SEQ
+└-ALT
+|  └-c List(Z)
+|  └-b List(S)
+└-SEQ
+└-STA
+|  └-a
+└-ALT
+└-a List(Z)
+└-a List(S)
+\end{verbatim}
+\end{theorem}
 \end{document}
 %The second part might still need some more development.
 %When s is not in the set L(ar), we have that s = s1@s2 s.t. s1 $\in$ L(ar), and any longer string that is a prefix of s does not belong to L(ar).\\

changeset 10	2b95bcd2ac73
parent 9	2c02d27ec0a3
child 11	9c1ca6d6e190