cst_tests: comparison ninems/ninems.tex

-:8c1195dd6136
+:de50a65d1b15
 %the current derivative matches the suffix of the string(the characters that
 %have not yet appeared, but will appear as the successive derivatives go on.
 %How do we get this "future" information? By the value $v$, which is
 %computed by a pass of the algorithm that uses
 %$inj$ as described in the previous section).
-using information from both the derivative regular expression and the value.
+using information from both the derivative regular expression and the
-Sulzmann and Lu used this
+value. Sulzmann and Lu proposed this function, but did not prove
-to connect the bitcoded algorithm to the older algorithm by the following
+anything about it. Ausaf and Urban used it to connect the bitcoded
-equation:
+algorithm to the older algorithm by the following equation:
 \begin{center} $inj \;a\; c \; v = \textit{decode} \; (\textit{retrieve}\;
-	 ((\textit{internalise}\; r)\backslash_{simp} c) v)$
+	 ((r^\uparrow)\backslash_{simp} \,c)\,v)$
 \end{center}
-A little fact that needs to be stated to aid comprehension:
-\begin{center}
+\noindent
-	$r^\uparrow = a$($a$ stands for $\textit{annotated}).$
+whereby $r^\uparrow$ stands for the internalised version of $r$. Ausaf
-\end{center}
+and Urban also used this fact to prove the correctness of the bitcoded
-Ausaf and Urban also used this fact to prove  the
+algorithm without simplification.  Our purpose of using this, however,
-correctness of bitcoded algorithm without simplification.  Our purpose
+is to establish
-of using this, however, is to establish
 \begin{center}
 $ \textit{retrieve} \;
-a \; v \;=\; \textit{retrieve}  \; \textit{simp}(a) \; v'.$
+a \; v \;=\; \textit{retrieve}  \; (\textit{simp}\,a) \; v'.$
 \end{center}
-The idea
+The idea is that using $v'$, a simplified version of $v$ that had gone
-is that using $v'$, a simplified version of $v$ that had gone
+through the same simplification step as $\textit{simp}(a)$, we are able
-through the same simplification step as $\textit{simp}(a)$, we are
+to extract the bitcode that gives the same parsing information as the
-able to extract the bit-sequence that gives the same parsing
+unsimplified one. However, we noticed that constructing such a  $v'$
-information as the unsimplified one.  After establishing this, we
+from $v$ is not so straightforward. The point of this is that  we might
-might be able to finally bridge the gap of proving
+be able to finally bridge the gap by proving
-\begin{center}
-$\textit{retrieve} \; r^\uparrow   \backslash  s \; v = \;\textit{retrieve} \;
+\begin{center}
-\textit{bsimp}(r^\uparrow)  \backslash  s \; v'$
+$\textit{retrieve} \; (r^\uparrow   \backslash  s) \; v = \;\textit{retrieve} \;
-\end{center}
+(\textit{simp}(r^\uparrow)  \backslash  s) \; v'$
+\end{center}
+\noindent
 and subsequently
-\begin{center}
-$\textit{retrieve} \; r^\uparrow \backslash  s \; v\; = \; \textit{retrieve} \;
+\begin{center}
-r^\uparrow  \backslash_{simp}   s \; v'$.
+$\textit{retrieve} \; (r^\uparrow \backslash  s) \; v\; = \; \textit{retrieve} \;
-\end{center}
+(r^\uparrow  \backslash_{simp}  \, s) \; v'$.
-The $\textit{LHS}$ of the above equation is the bitcode we want.
+\end{center}
-This proves that our simplified
-version of regular expression still contains all the bitcodes needed.
+\noindent
-The task here is to find a way to compute the correct $v'$.
+The $\textit{LHS}$ of the above equation is the bitcode we want. This
+would prove that our simplified version of the regular expression still
+contains all the bitcodes needed. The task here is to find a way to
+compute the correct $v'$.
 The second task is to speed up the more aggressive simplification.
-Currently it is slower than a naive simplification(the naive version as
+Currently it is slower than the original naive simplification by Ausaf
-implemented in ADU of course can explode in some cases). So it needs to
+and Urban (the naive version as implemented by Ausaf   and Urban of
-be explored how to make it faster. Our possibility would be to explore
+course can ``explode'' in some cases). So it needs to be explored how to
-again the connection to DFAs. This is very much work in progress.
+make our algorithm faster on all inputs. One possibility would be to
+explore again the connection to DFAs. This is very much work in
+progress.
 \section{Conclusion}
 In this PhD-project we are interested in fast algorithms for regular
 expression matching. While this seems to be a ``settled'' area, in

changeset 84	de50a65d1b15
parent 83	8c1195dd6136
child 85	ba40ab3658ca