cst_tests: comparison ninems/ninems.tex

equal deleted inserted replaced

-:a67aff8fb06a
+:481c8000de6d
 Figure 6 yields POSIX parse trees. We have tested this claim
 extensively by using the method in Figure~3 as a reference but yet
 have to work out all proof details.''
 \end{quote}
-\noindent
+\noindent We would settle this correctness claim. It is relatively
-We would settle this correctness claim. It is relatively straightforward
+straightforward to establish that after one simplification step, the part of a
-to establish that after one simplification step, the part of a nullable
+nullable derivative that corresponds to a POSIX value remains intact and can
-derivative that corresponds to a POSIX value remains intact and can
+still be collected, in other words, we can show that\comment{Double-check....I
-still be collected, in other words, we can show that\comment{Double-check....I think this  is not the case}
+think this  is not the case}
 \begin{center}
 $\textit{bmkeps} \; r = \textit{bmkeps} \; \textit{simp} \; r\;( r\; \textit{nullable})$
 \end{center}
 as this basically comes down to proving actions like removing the
 additional $r$ in $r+r$  does not delete important POSIX information in
 a regular expression. The hard part of this proof is to establish that
 \begin{center}
-$\textit{bmkeps} \; \textit{blexer}\_{simp} \; r = \textit{bmkeps} \; \textit{blexer} \; \textit{simp} \; r$
+	$\textit{bmkeps} \; \textit{blexer}\_{simp}(s, \; r) = \textit{bmkeps} \; \textit{blexer} \; \textit{simp}(s, \; r)$
 \end{center}
-\noindent\comment{OK from here on you still need to work. Did not read.}
+\noindent That is, if we do derivative on regular expression $r$ and then
-That is, if we do derivative on regular expression r and the simplified version,
+simplify it, and repeat this process until we exhaust the string, we get a
-they can still provide the same POSIX value if there is one .
+regular expression $r''$ that provides the POSIX matching as the result $r'$ of
-This is not as straightforward as the previous proposition, as the two regular expressions $r$ and $\textit{simp}\; r$
+the normal derivative algorithm which only does derivative operation repeatedly
-might become very different regular expressions after repeated application of $\textit{simp}$ and derivative.
+and has no simplification at all.  This might seem at first glance very
-The crucial point is to find the indispensable information of
+unintuitive, as the $r'$ is exponentially larger than $r''$. But this can be
-a regular expression and how it is kept intact during simplification so that it performs
+explained in the following way: we are pruning away the possible matches that
-as good as a regular expression that has not been simplified in the subsequent derivative operations.
+are not POSIX. Since there are exponentially non-POSIX matchings and only 1
-To aid this, we use the helping function retrieve described by Sulzmann and Lu:
+POSIX matching, it is understandable that our $r''$ can be a lot smaller.  we
-\\definition of retrieve\\
+can still provide the same POSIX value if there is one.  This is not as
+straightforward as the previous proposition, as the two regular expressions $r$
+and $\textit{simp}\; r$ might become very different regular expressions after
+repeated application of $\textit{simp}$ and derivative.  The crucial point is
+to find the indispensable information of a regular expression and how it is
+kept intact during simplification so that it performs as good as a regular
+expression that has not been simplified in the subsequent derivative
+operations.  To aid this, we use the helping function retrieve described by
+Sulzmann and Lu: \\definition of retrieve\\
 This function assembled the bitcode that corresponds to a parse tree for
 how the current derivative matches the suffix of the string(the
 characters that have not yet appeared, but is stored in the value).
 Sulzmann and Lu used this to connect the bitcoded algorithm to the older
 algorithm by the following equation:
-\begin{center}
+\begin{center} $inj \;a\; c \; v = \textit{decode} \; (\textit{retrieve}\;
-$inj \;a\; c \; v = \textit{decode} \; (\textit{retrieve}\; ((\textit{internalise}\; r)\backslash_{simp} c) v)$
+	 ((\textit{internalise}\; r)\backslash_{simp} c) v)$ \end{center} A
-\end{center}
+	 little fact that needs to be stated to help comprehension:
-A little fact that needs to be stated to help comprehension:
+	 \begin{center} $r^\uparrow = a$($a$ stands for $\textit{annotated}).$
-\begin{center}
+	 \end{center} Ausaf and Urban also used this fact to prove  the
-$r^\uparrow = a$($a$ stands for $\textit{annotated}).$
+	 correctness of bitcoded algorithm without simplification.  Our purpose
-\end{center}
+	 of using this, however, is try to establish \\ $ \textit{retrieve} \;
-Ausaf and Urban also used this fact to prove  the correctness of bitcoded algorithm without simplification.
+	 a \; v \;=\; \textit{retrieve}  \; \textit{simp}(a) \; v'.$\\ The idea
-Our purpose of using this, however, is try to establish \\
+	 is that using $v'$, a simplified version of $v$ that possibly had gone
-$ \textit{retrieve} \; a \; v \;=\; \textit{retrieve}  \; \textit{simp}(a) \; v'.$\\
+	 through the same simplification step as $\textit{simp}(a)$ we are
-The idea is that using $v'$,
+	 still  able to extract the bit-sequence that gives the same parsing
-a simplified version of $v$ that possibly had gone through the same simplification step as $\textit{simp}(a)$ we are still  able to extract the bit-sequence that gives the same parsing information as the unsimplified one.
+	 information as the unsimplified one.  After establishing this, we
-After establishing this, we might be able to finally bridge the gap of proving\\
+	 might be able to finally bridge the gap of proving\\
-$\textit{retrieve} \; r   \backslash  s \; v = \;\textit{retrieve} \; \textit{simp}(r)  \backslash  s \; v'$\\
+	 $\textit{retrieve} \; r   \backslash  s \; v = \;\textit{retrieve} \;
-and subsequently\\
+	 \textit{simp}(r)  \backslash  s \; v'$\\ and subsequently\\
-$\textit{retrieve} \; r \backslash  s \; v\; = \; \textit{retrieve} \; r  \backslash_{simp}   s \; v'$.\\
+	 $\textit{retrieve} \; r \backslash  s \; v\; = \; \textit{retrieve} \;
-This proves that our simplified version of regular expression still contains all the bitcodes needed.
+	 r  \backslash_{simp}   s \; v'$.\\ This proves that our simplified
+	 version of regular expression still contains all the bitcodes needed.
 The second task is to speed up the more aggressive simplification.
 Currently it is slower than a naive simplification(the naive version as
 implemented in ADU of course can explode in some cases). So it needs to

changeset 79	481c8000de6d
parent 77	058133a9ffe0
child 80	d9d61a648292