% !TEX program = xelatex
% author: Christian Urban <christian.urban@kcl.ac.uk>
% date: Sun, 01 Oct 2023 15:25:22 +0100
% changeset 934 (114d6638a3fc)
\documentclass{article}
\usepackage{../style}

\begin{document}

\section*{Homework 1}

\HEADER

\begin{enumerate}

\item {\bf (Optional)} If you want to run the code presented
  in the lectures, install the Scala programming language
  available (for free) from

      \begin{center}
        \url{http://www.scala-lang.org}
      \end{center}

      and the Ammonite REPL from

       \begin{center}
       \url{https://ammonite.io}
       \end{center}

      If you want to follow the code I present during the
      lectures, read the handout about Scala. Make sure Ammonite
      uses the Scala 3 compiler.

%\item {\bf (Optional)} Have a look at the crawler programs.
% Can you find a usage for them in your daily programming
% life? Can you improve them? For example in cases there
% are links that appear on different recursion levels, the
% crawlers visit such web-pages several times. Can this be
% avoided? Also, the crawlers flag as problematic any page
% that gives an error, but probably only 404 Not Found
% errors should be flagged. Can you change that?)

\item {\bf (Optional)} Have a look at the catastrophic backtracking
  programs uploaded on KEATS. Convince yourself that they really require
  a lot of computation time. If you have similar examples in your own
  favourite programming language, I am happy to hear about them.


\item Read the handout of the first lecture and the handout
  about notation. Make sure you understand the concepts of
  strings and languages. In the context of the CFL-course,
  what is meant by the term \emph{language}?

  \solution{A language, in this context, is just a set of
    strings. Some of these sets cannot be described by
    regular expressions; only regular languages can. This is
    something for lecture 3.}

\item Give the definition of regular expressions (this is an
  inductive datatype). What is the
  meaning of a regular expression? (Hint: The meaning is
  defined recursively.)

  \solution{Here I would also expect the grammar for basic regular
    expressions and the definition of the recursive L-function. Discuss
    differences between $r_1 + r_2$ and $r^+$. Discuss differences between
    ``real-life regexes'' and regexes in this module.}
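
  \solution{One possible way to write this down (a sketch; the exact notation should be checked against the handout) is the grammar
    \[
    r ::= \ZERO \mid \ONE \mid c \mid r_1 + r_2 \mid r_1 \cdot r_2 \mid r^*
    \]
    together with the recursive function $L$:
    \[
    \begin{array}{lcl}
    L(\ZERO) & = & \{\}\\
    L(\ONE) & = & \{[]\}\\
    L(c) & = & \{[c]\}\\
    L(r_1 + r_2) & = & L(r_1) \cup L(r_2)\\
    L(r_1 \cdot r_2) & = & L(r_1) \,@\, L(r_2)\\
    L(r^*) & = & \bigcup_{n \geq 0} L(r)^n
    \end{array}
    \]
    The operator $r^+$ is not part of this grammar; it can be seen as an abbreviation for $r \cdot r^*$.}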

\item Assume the concatenation operation of two strings is
  written as $s_1 @ s_2$. Define the operation of
  \emph{concatenating} two sets of strings. This operation
  is also written as $\_ \,@\, \_$. According to
  this definition, what is $A \,@\, \{\}$ equal to?
  Is $A\,@\,B$ in general equal to $B\,@\,A$?

  \solution{What is $A \,@\, \{[]\}$? Are there special cases
    where $A \,@\, B = B \,@\, A$?}
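
  \solution{A sketch of the expected definition, assuming the notation from the handout about notation:
    \[
    A \,@\, B = \{ s_1 \,@\, s_2 \mid s_1 \in A \wedge s_2 \in B \}
    \]
    With this definition $A \,@\, \{\} = \{\}$, because there is no $s_2$ that could be chosen from the empty set, whereas $A \,@\, \{[]\} = A$. In general the operation is not commutative: for $A = \{[a]\}$ and $B = \{[b]\}$ the set $A \,@\, B$ contains only the string $ab$, but $B \,@\, A$ contains only $ba$. Special cases where both sides agree include $A = B$, $B = \{\}$ and $B = \{[]\}$.}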

\item Assume a set $A$ contains 4 strings and a set $B$
  contains 7 strings. None of the strings is the empty
  string. How many strings are in $A \,@\, B$?

  \solution{28, but there are corner cases where there are fewer
    than 28 elements. Can students think of such corner cases?
    For example $A = \{a, ab, \ldots\}$, $B = \{bc, c,\ldots\}$ }
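
  \solution{To spell out the corner case hinted at above (a sketch using only the strings named in the example): the pairs $(a, bc)$ and $(ab, c)$ both produce the string $abc$, since $a \,@\, bc = abc = ab \,@\, c$. The $4 \times 7 = 28$ pairs then yield at most $27$ different strings, so $28$ is only an upper bound.}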

\item How is the power of a language defined? (Hint: There are two
  rules, one for $\_^0$ and one for $\_^{n+1}$.)

  \solution{Two rules: the $0$-case and the $n+1$-case.}
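
  \solution{Written out, the two rules might look as follows (a sketch; $A$ stands for an arbitrary language):
    \[
    \begin{array}{lcl}
    A^0 & = & \{[]\}\\
    A^{n+1} & = & A \,@\, A^n
    \end{array}
    \]
    For instance $A^1 = A \,@\, \{[]\} = A$ and $A^2 = A \,@\, A$.}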

\item Let $A = \{[a], [b], [c], [d]\}$. (1) How many strings
  are in $A^4$? (2) Consider also the case of $A^4$ where one of
  the strings in $A$ is the empty string, for example $A =
  \{[a], [b], [c], []\}$.

  \solution{121 is correct. But make sure you understand why it is 121
    in case you do not have a computer at your fingertips.}
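
  \solution{A possible way to arrive at the numbers by hand: in case (1) all strings in $A$ have length one, so every string in $A^4$ has length four and there are $4^4 = 256$ of them. In case (2) the empty string can be chosen in any of the four positions, so $A^4$ contains all strings over $a$, $b$, $c$ of length at most four:
    \[
    3^0 + 3^1 + 3^2 + 3^3 + 3^4 = 1 + 3 + 9 + 27 + 81 = 121
    \]
  }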

\item (1) How many basic regular expressions are there to match
  \textbf{only} the string $abcd$? (2) How many if they cannot include
  $\ONE$ and $\ZERO$? (3) How many if they are also not
  allowed to contain stars? (4) How many if they are also
  not allowed to contain $\_ + \_$?

  \solution{1-3 are infinite (tell the idea why, with examples); 4 is five (remember regexes are trees).}
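
  \solution{Some illustrative examples (a sketch, not the only possible ones): for (1) there are $abcd$, $abcd \cdot \ONE$, $(abcd \cdot \ONE) \cdot \ONE$ and so on; for (2) and (3) there are $abcd$, $abcd + abcd$, $(abcd + abcd) + abcd$ and so on. For (4) only sequences of the four characters remain, and the five regular expressions are the five ways of bracketing them:
    \[
    \begin{array}{l}
    ((a \cdot b) \cdot c) \cdot d\\
    (a \cdot (b \cdot c)) \cdot d\\
    (a \cdot b) \cdot (c \cdot d)\\
    a \cdot ((b \cdot c) \cdot d)\\
    a \cdot (b \cdot (c \cdot d))
    \end{array}
    \]
  }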

\item When are two regular expressions equivalent? Can you
  think of instances where two regular expressions match
  the same strings, but it is not so obvious that they do?
  For example $a + b$ and $b + a$ do not count\ldots they
  obviously match the same strings, namely $[a]$ and
  $[b]$.

  \solution{For example $r^* = \ONE + r \cdot r^*$ for any regular expression $r$.
    Can students think about why this is the case?}
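
  \solution{One way to see this, sketched with the meaning function $L$ and the power operation on languages (both assumed to be defined as in the handouts):
    \[
    \begin{array}{lcl}
    L(r^*) & = & \bigcup_{n \geq 0} L(r)^n\\
     & = & L(r)^0 \cup \bigcup_{n \geq 0} L(r)^{n+1}\\
     & = & \{[]\} \cup \bigcup_{n \geq 0} \bigl(L(r) \,@\, L(r)^n\bigr)\\
     & = & \{[]\} \cup \bigl(L(r) \,@\, \bigcup_{n \geq 0} L(r)^n\bigr)\\
     & = & L(\ONE) \cup \bigl(L(r) \,@\, L(r^*)\bigr)\\
     & = & L(\ONE + r \cdot r^*)
    \end{array}
    \]
    The fourth step uses that concatenation distributes over unions.}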

\item What is meant by the notions of \emph{evil regular expressions}
  and \emph{catastrophic backtracking}?

  \solution{Catastrophic backtracking also applies to other regexes, not just $(a^*)^*b$.}

\item Given the regular expression $(a + b)^* \cdot b \cdot (a + b)^*$,
  which of the following regular expressions are equivalent?

  \begin{center}
  \begin{tabular}{ll}
  1) & $(ab + bb)^* \cdot (a + b)^*$\\ % no
  2) & $(a + b)^* \cdot (ba + bb + b) \cdot (a + b)^*$\\ % yes
  3) & $(a + b)^* \cdot (a + b) \cdot (a + b)^*$ % no
  \end{tabular}
  \end{center}

  \solution{no, yes (why?), no.}
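
  \solution{A sketch of the reasoning: the given regular expression matches exactly the strings over $a$ and $b$ that contain at least one $b$. Expression 1) also matches the empty string and expression 3) also matches the string $a$, so neither is equivalent. For 2), every matched string contains a $b$ (each of $ba$, $bb$ and $b$ does), and conversely any string containing a $b$ is matched by choosing the alternative $b$ in the middle.}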


\item Given the extended regular expression \texttt{[b-d]a?e+},
  what does the equivalent basic regular expression look like?

  \solution{$(b + c + d) \cdot (a + \ONE) \cdot (e \cdot e^*)$}
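
  \solution{The translation can be read off piece by piece (a sketch): the character range \texttt{[b-d]} becomes the alternative $b + c + d$, the optional \texttt{a?} becomes $a + \ONE$, and \texttt{e+} (one or more times) becomes $e \cdot e^*$.}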


\item \POSTSCRIPT
\end{enumerate}

\end{document}

%%% Local Variables:
%%% mode: latex
%%% TeX-master: t
%%% End: