lexing: comparison ChengsongTanPhdThesis/Chapters/Finite.tex

equal deleted inserted replaced

-:fa92124d1fb7
+:b306628a0eab
 It tests if all the elements of a list are unique.\\
 With $\textit{rdistinct}$,
 and the flatten function for $\rrexp$s:
 \begin{center}
 \begin{tabular}{@{}lcl@{}}
-$\textit{rflts} \; (_{bs}\sum \textit{as}) :: \textit{as'}$ & $\dn$ & $(\textit{map} \;
+$\textit{rflts} \; (\sum \textit{as}) :: \textit{as'}$ & $\dn$ & $as \; @ \; \textit{rflts} \; as' $ \\
-(\textit{fuse}\;bs)\; \textit{as}) \; @ \; \textit{rflts} \; as' $ \\
 $\textit{rflts} \; \ZERO :: as'$ & $\dn$ & $ \textit{rflts} \;  \textit{as'} $ \\
 $\textit{rflts} \; a :: as'$ & $\dn$ & $a :: \textit{rflts} \; \textit{as'}$ \quad(otherwise)
 \end{tabular}
 \end{center}
 \noindent
 one can chain together all the other modules
-of $\bsimp{\_}$ (removing the functionalities related to bit-sequences)
+such as $\rsimpalts$:
-and get $\textit{rsimp}$ and $\rderssimp{\_}{\_}$.
+\begin{center}
+\begin{tabular}{@{}lcl@{}}
+	  $\rsimpalts \;\; nil$ & $\dn$ & $\RZERO$\\
+	  $\rsimpalts \;\; r::nil$ & $\dn$ & $r$\\
+	  $\rsimpalts \;\; rs$ & $\dn$ & $\sum rs$\\
+\end{tabular}
+\end{center}
+\noindent
+and $\rsimpseq$:
+\begin{center}
+\begin{tabular}{@{}lcl@{}}
+	  $\rsimpseq \;\; \RZERO \; \_ $ &   $=$ &   $\RZERO$\\
+	  $\rsimpseq \;\; \_ \; \RZERO $ &   $=$ &   $\RZERO$\\
+	  $\rsimpseq \;\; \RONE \cdot r_2$ & $\dn$ & $r_2$\\
+	  $\rsimpseq \;\; r_1 r_2$ & $\dn$ & $r_1 \cdot r_2$\\
+\end{tabular}
+\end{center}
+and get $\textit{rsimp}$ and $\rderssimp{\_}{\_}$:
 \begin{center}
 \begin{tabular}{@{}lcl@{}}
-	  $\textit{rsimp} \; (_{bs}a_1\cdot a_2)$ & $\dn$ & $ \textit{rsimp}_{ASEQ} \; bs \;(\textit{rsimp} \; a_1) \; (\textit{rsimp}  \; a_2)  $ \\
+	  $\textit{rsimp} \; (r_1\cdot r_2)$ & $\dn$ & $ \textit{rsimp}_{SEQ} \; bs \;(\textit{rsimp} \; r_1) \; (\textit{rsimp}  \; r_2)  $ \\
-	  $\textit{rsimp} \; (_{bs}\sum \textit{as})$ & $\dn$ & $\textit{rsimp}_{ALTS} \; \textit{bs} \; (\textit{rdistinct} \; ( \textit{rflts} ( \textit{map} \; rsimp \; as)) \; \rerases \; \varnothing) $ \\
+	  $\textit{rsimp} \; (_{bs}\sum \textit{rs})$ & $\dn$ & $\textit{rsimp}_{ALTS} \; \textit{bs} \; (\textit{rdistinct} \; ( \textit{rflts} ( \textit{map} \; rsimp \; rs)) \; \rerases \; \varnothing) $ \\
-$\textit{rsimp} \; a$ & $\dn$ & $\textit{a} \qquad \textit{otherwise}$
+$\textit{rsimp} \; r$ & $\dn$ & $\textit{r} \qquad \textit{otherwise}$
 \end{tabular}
 \end{center}
-We omit these functions, as they are routine. Please refer to the formalisation
+\begin{center}
-(in file BasicIdentities.thy) for the exact definition.
+	\begin{tabular}{@{}lcl@{}}
+		$r\backslash_{rsimp} \, c$ & $\dn$ & $\rsimp \; (r\backslash_r \, c)$
+	\end{tabular}
+\end{center}
+\begin{center}
+	\begin{tabular}{@{}lcl@{}}
+$r \backslash_{rsimps} (c\!::\!s) $ & $\dn$ & $(r \backslash_{rsimp}\, c) \backslash_{rsimps}\, s$ \\
+$r \backslash_{rsimps} [\,] $ & $\dn$ & $r$
+	\end{tabular}
+\end{center}
+\noindent
 With $\rrexp$ the size caclulation of annotated regular expressions'
 simplification and derivatives can be done by the size of their unlifted
 counterpart with the unlifted version of simplification and derivatives applied.
 \begin{lemma}\label{sizeRelations}
 	The following equalities hold:
 	\begin{itemize}
 		\item
 			$\asize{\bsimp{a}} = \rsize{\rsimp{\rerase{a}}}$
 		\item
-			$\asize{\bderssimp{r}{s}} =  \rsize{\rderssimp{\rerase{r}}{s}}$
+			$\asize{\bderssimp{a}{s}} =  \rsize{\rderssimp{\rerase{a}}{s}}$
 	\end{itemize}
 \end{lemma}
 \noindent
+With lemma \ref{sizeRelations},
+a bound on $\rsize{\rderssimp{\rerase{a}}{s}}$
+\[
+	\llbracket \rderssimp{a}{s} \rrbracket_r \leq N_r
+\]
+\noindent
+would apply to $\asize{\bderssimp{a}{s}}$ as well.
+\[
+	\llbracket a \backslash_{bsimps} s \rrbracket \leq N_r.
+\]
 In the following content, we will focus on $\rrexp$'s size bound.
-We will piece together this bound and show the same bound for annotated regular
+Whatever bounds we proved for $ \rsize{\rderssimp{\rerase{r}}{s}}$
-expressions in the end.
+will automatically apply to $\asize{\bderssimp{r}{s}}$.\\
-Unless stated otherwise in this chapter all $\textit{rexp}$s without
+Unless stated otherwise in this chapter all regular expressions without
 bitcodes are seen as $\rrexp$s.
 We also use $r_1 + r_2$ and $\RALTS{[r_1, r_2]}$ interchageably
 as the former suits people's intuitive way of stating a binary alternative
 regular expression.
 %-----------------------------------
-%	SECTION ?
+%	SUB SECTION ROADMAP RREXP BOUND
 %-----------------------------------
-\subsection{Finiteness Proof Using $\rrexp$s}
-Now that we have defined the $\rrexp$ datatype, and proven that its size changes
+%\subsection{Roadmap to a Bound for $\textit{Rrexp}$}
-w.r.t derivatives and simplifications mirrors precisely those of annotated regular expressions,
-we aim to bound the size of $r \backslash s$ for any $\rrexp$  $r$.
+%The way we obtain the bound for $\rrexp$s is by two steps:
-Once we have a bound like:
+%\begin{itemize}
-\[
+%	\item
-	\llbracket r \backslash_{rsimp} s \rrbracket_r \leq N_r
+%		First, we rewrite $r\backslash s$ into something else that is easier
-\]
+%		to bound. This step is especially important for the inductive case
-\noindent
+%		$r_1 \cdot r_2$ and $r^*$, where the derivative can grow and bloat in a wild way,
-we could easily extend that to
+%		but after simplification they will always be equal or smaller to a form consisting of an alternative
-\[
+%		list of regular expressions $f \; (g\; (\sum rs))$ with some functions applied to it, where each element will be distinct after the function application.
-	\llbracket a \backslash_{bsimps} s \rrbracket \leq N_r.
+%	\item
-\]
+%		Then, for such a sum  list of regular expressions $f\; (g\; (\sum rs))$, we can control its size
+%		by estimation, since $\distinctBy$ and $\flts$ are well-behaved and working together would only
-\subsection{Roadmap to a Bound for $\textit{Rrexp}$}
+%		reduce the size of a regular expression, not adding to it.
+%\end{itemize}
-The way we obtain the bound for $\rrexp$s is by two steps:
+%
-\begin{itemize}
+%\section{Step One: Closed Forms}
-	\item
+%We transform the function application $\rderssimp{r}{s}$
-		First, we rewrite $r\backslash s$ into something else that is easier
+%into an equivalent
-		to bound. This step is especially important for the inductive case
+%form $f\; (g \; (\sum rs))$.
-		$r_1 \cdot r_2$ and $r^*$, where the derivative can grow and bloat in a wild way,
+%The functions $f$ and $g$ can be anything from $\flts$, $\distinctBy$ and other helper functions from $\bsimp{\_}$.
-		but after simplification they will always be equal or smaller to a form consisting of an alternative
+%This way we get a different but equivalent way of expressing : $r\backslash s = f \; (g\; (\sum rs))$, we call the
-		list of regular expressions $f \; (g\; (\sum rs))$ with some functions applied to it, where each element will be distinct after the function application.
+%right hand side the "closed form" of $r\backslash s$.
-	\item
+%
-		Then, for such a sum  list of regular expressions $f\; (g\; (\sum rs))$, we can control its size
+%\begin{quote}\it
-		by estimation, since $\distinctBy$ and $\flts$ are well-behaved and working together would only
+%	Claim: For regular expressions $r_1 \cdot r_2$, we claim that
-		reduce the size of a regular expression, not adding to it.
+%	\begin{center}
-\end{itemize}
+%		$ \rderssimp{r_1 \cdot r_2}{s} =
+%		\rsimp{(\sum (r_1 \backslash s \cdot r_2 ) \; :: \;(\map \; \rderssimp{r2}{\_} \;(\vsuf{s}{r_1})))}$
-\section{Step One: Closed Forms}
+%	\end{center}
-We transform the function application $\rderssimp{r}{s}$
+%\end{quote}
-into an equivalent
+%\noindent
-form $f\; (g \; (\sum rs))$.
+%We explain in detail how we reached those claims.
-The functions $f$ and $g$ can be anything from $\flts$, $\distinctBy$ and other helper functions from $\bsimp{\_}$.
-This way we get a different but equivalent way of expressing : $r\backslash s = f \; (g\; (\sum rs))$, we call the
-right hand side the "closed form" of $r\backslash s$.
-\begin{quote}\it
-	Claim: For regular expressions $r_1 \cdot r_2$, we claim that
-	\begin{center}
-		$ \rderssimp{r_1 \cdot r_2}{s} =
-		\rsimp{(\sum (r_1 \backslash s \cdot r_2 ) \; :: \;(\map \; \rderssimp{r2}{\_} \;(\vsuf{s}{r_1})))}$
-	\end{center}
-\end{quote}
-\noindent
-We explain in detail how we reached those claims.
 \subsection{Basic Properties needed for Closed Forms}
+The next steps involves getting a closed form for
+$\rderssimp{r}{s}$ and then obtaining
+an estimate for the closed form.
 \subsubsection{$\textit{rdistinct}$'s Deduplicates Successfully}
 The $\textit{rdistinct}$ function, as its name suggests, will
 remove duplicates in an \emph{r}$\textit{rexp}$ list,
 according to the accumulator
 and leave only one of each different element in a list:

changeset 596	b306628a0eab
parent 595	fa92124d1fb7
child 601	ce4e5151a836