% !TEX program = xelatex
% cws/cw05.tex, changeset 853 (851d8c00f033)
% Christian Urban <christian.urban@kcl.ac.uk>, Fri, 03 Dec 2021 17:45:11 +0000
\documentclass{article}
\usepackage{../style}
\usepackage{../graphics}
\usepackage{../langs}

\begin{document}

\section*{Coursework 5}

\noindent This coursework is worth 25\% and is due on \cwFIVE{} at
18:00. You are asked to implement a compiler targeting the LLVM-IR.
Be aware that this coursework needs some material about the LLVM-IR
that has not been covered in the lectures, so you might need to carry
out your own experiments. You can find information about the LLVM-IR at

\begin{itemize}
\item \url{https://bit.ly/3rheZYr}
\item \url{https://llvm.org/docs/LangRef.html}
\end{itemize}

\noindent
You can implement your compiler in any programming
language you like, but you need to submit the source code with which
you generated the LLVM-IR files; otherwise a mark of 0\% will be
awarded. You are asked to submit not only the code of your compiler, but also
the generated \texttt{.ll} files. You should use the lexer and parser
from the previous courseworks, but you need to make some modifications
to them for the `typed' Fun-language. I will award up to 5\% if a
lexer and a parser are correctly implemented. At the end, please
package everything(!) in a zip-file that creates a directory with the
name

\begin{center}
\texttt{YournameYourFamilyname}
\end{center}

\noindent
on my end.

\subsection*{Disclaimer\alert}

It should be understood that the work you submit represents your own
effort. You have not copied from anyone else. An exception is the
Scala code I showed during the lectures or uploaded to KEATS, both of
which you can use. You can also use your own code from CW~1 --
CW~4.


\subsection*{Task}

The main goal is to lex and parse 4 Fun-programs, including the
Mandelbrot program shown in Figure~\ref{mand}, and generate
corresponding code for the LLVM-IR. Unfortunately, the calculations for
the Mandelbrot Set require floating point arithmetic and therefore we
cannot be as simple-minded about types as we have been so far
(remember the LLVM-IR is a fully-typed language and needs to know the
exact type of each expression). The idea is to deal appropriately
with three types, namely \texttt{Int}, \texttt{Double} and
\texttt{Void} (they are represented in the LLVM-IR as \texttt{i32},
\texttt{double} and \texttt{void}). You need to extend the lexer and
parser accordingly in order to deal with type annotations. The
Fun-language includes global constants, such as

\begin{lstlisting}[numbers=none]
  val Ymin: Double = -1.3;
  val Maxiters: Int = 1000;
\end{lstlisting}

\noindent
where you can assume that they are `normal' identifiers, just
starting with a capital letter; all other identifiers should start
with a lower-case letter. Function definitions can take arguments of
type \texttt{Int} or \texttt{Double}, and need to specify a return
type, which can be \texttt{Void}, for example

\begin{lstlisting}[numbers=none]
  def foo(n: Int, x: Double) : Double = ...
  def id(n: Int) : Int = ...
  def bar() : Void = ...
\end{lstlisting}

\noindent
The idea is to record all typing information that is given
in the Fun-program, but then delay any further type inference until
after the CPS-translation. That means the parser should
generate ASTs given by the Scala datatypes:

\begin{lstlisting}[numbers=none,language=Scala]
abstract class Exp
abstract class BExp
abstract class Decl

case class Def(name: String, args: List[(String, String)],
               ty: String, body: Exp) extends Decl
case class Main(e: Exp) extends Decl
case class Const(name: String, v: Int) extends Decl
case class FConst(name: String, x: Float) extends Decl

case class Call(name: String, args: List[Exp]) extends Exp
case class If(a: BExp, e1: Exp, e2: Exp) extends Exp
case class Var(s: String) extends Exp
case class Num(i: Int) extends Exp     // integer numbers
case class FNum(i: Float) extends Exp  // floating numbers
case class ChConst(c: Int) extends Exp // char constant
case class Aop(o: String, a1: Exp, a2: Exp) extends Exp
case class Sequence(e1: Exp, e2: Exp) extends Exp
case class Bop(o: String, a1: Exp, a2: Exp) extends BExp
\end{lstlisting}

\noindent
This datatype distinguishes whether a global constant is an integer
constant or a floating-point constant. Also, a function definition needs to
record the return type of the function, namely the argument
\texttt{ty} in \texttt{Def}, and the arguments consist of pairs of
identifier names and types (\texttt{Int} or \texttt{Double}). The hard
part of the CW is to design the K-intermediate language and infer all
necessary types in order to generate LLVM-IR code. You can check
your LLVM-IR code by running it with the interpreter \texttt{lli}.
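
\noindent
As a rough illustration of these datatypes, the following sketch shows
the AST one might expect for a tiny program. The three-line Fun-program
in the comment is made up for this example and is not one of the four
test programs. Representing a whole program as a list of declarations
is also only an assumption. Only the constructors that are needed are
repeated, so that the snippet can be run on its own.

\begin{lstlisting}[numbers=none,language=Scala]
// Made-up Fun-program (an assumption, not one of the test programs):
//   val Maxiters: Int = 1000;
//   def id(n: Int) : Int = n;
//   id(Maxiters)

abstract class Exp
abstract class Decl

case class Def(name: String, args: List[(String, String)],
               ty: String, body: Exp) extends Decl
case class Main(e: Exp) extends Decl
case class Const(name: String, v: Int) extends Decl
case class Call(name: String, args: List[Exp]) extends Exp
case class Var(s: String) extends Exp

// one possible parser result: the declarations in source order,
// with the final expression wrapped in Main
val prog: List[Decl] = List(
  Const("Maxiters", 1000),
  Def("id", List(("n", "Int")), "Int", Var("n")),
  Main(Call("id", List(Var("Maxiters")))))
\end{lstlisting}

\noindent
Note that the types are kept as plain strings at this stage. Nothing is
checked or inferred yet. That work is left to the K-intermediate language.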

\begin{figure}[t]
\lstinputlisting[language=Scala]{../progs/fun2/mand.fun}
\caption{The Mandelbrot program in the `typed' Fun-language.\label{mand}}
\end{figure}

\begin{figure}[t]
\includegraphics[scale=0.35]{../progs/fun2/out.png}
\caption{ASCII output of the Mandelbrot program.\label{mandout}}
\end{figure}

Also note that the second version of the Mandelbrot program and also
the Tower of Hanoi program use character constants, like \texttt{'a'},
\texttt{'1'}, \texttt{'$\backslash$n'} and so on. When they are tokenised,
such characters should be interpreted as the corresponding ASCII code (an
integer), such that we can use them in calculations like \texttt{'a' + 10}
where the result should be 107. As usual, the character \texttt{'$\backslash$n'} has the
ASCII code 10.
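
\noindent
One way to obtain these codes during tokenising is sketched below. The
helper \texttt{char\_code} is a hypothetical name and not part of the
lexer from the lectures. It assumes that the lexer hands over the
matched text including the two quotes, and that
\texttt{'$\backslash$n'} is the only escape sequence that needs to be
supported.

\begin{lstlisting}[numbers=none,language=Scala]
// Hypothetical helper: maps the text of a character literal
// (including its quotes) to the corresponding ASCII code.
def char_code(lit: String): Int =
  if (lit == "'\\n'") 10   // the escape sequence \n
  else if (lit.length == 3 && lit.startsWith("'") && lit.endsWith("'"))
    lit.charAt(1).toInt    // an ordinary character such as 'a'
  else
    throw new IllegalArgumentException("not a character literal: " + lit)

val a_plus_ten = char_code("'a'") + 10   // 97 + 10 = 107
\end{lstlisting}

\noindent
The resulting integer can then be stored in a \texttt{ChConst} node,
which keeps the information that the number originated from a character.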


\subsection*{LLVM-IR}

There are some subtleties in the LLVM-IR you need to be aware of:

\begin{itemize}
\item \textbf{Global constants}: While global constants such as

\begin{lstlisting}[numbers=none]
val Max : Int = 10;
\end{lstlisting}

\noindent
can be easily defined in the LLVM-IR as follows

\begin{lstlisting}[numbers=none]
@Max = global i32 10
\end{lstlisting}

\noindent
they cannot easily be referenced. If you want to use
this constant then you need to generate code such as

\begin{lstlisting}[numbers=none]
%tmp_22 = load i32, i32* @Max
\end{lstlisting}

\noindent
first, which treats \texttt{@Max} as an Integer-pointer (type
\texttt{i32*}) that needs to be loaded into a local variable,
here \texttt{\%tmp\_22}.

\item \textbf{Void-Functions}: While integer and double functions
  can easily be called and their results can be assigned to a
  temporary variable:

  \begin{lstlisting}[numbers=none]
   %tmp_23 = call i32 @sqr (i32 %n)
  \end{lstlisting}

  the result of a void-function cannot be assigned to a variable.
  Void-functions need to be called just as

  \begin{lstlisting}[numbers=none]
  call void @print_int (i32 %tmp_23)
\end{lstlisting}

\item \textbf{Floating-Point Operations}: While integer operations
  are specified in the LLVM-IR as

  \begin{lstlisting}[numbers=none,language=Scala]
  def compile_op(op: String) = op match {
    case "+" => "add i32 "
    case "*" => "mul i32 "
    case "-" => "sub i32 "
    case "==" => "icmp eq i32 "
    case "!=" => "icmp ne i32 "
    case "<=" => "icmp sle i32 " // signed less or equal
    case "<"  => "icmp slt i32 " // signed less than
  }\end{lstlisting}

  the corresponding operations on doubles are

  \begin{lstlisting}[numbers=none,language=Scala]
  def compile_dop(op: String) = op match {
    case "+" => "fadd double "
    case "*" => "fmul double "
    case "-" => "fsub double "
    case "==" => "fcmp oeq double "
    case "!=" => "fcmp one double "
    case "<=" => "fcmp ole double "
    case "<"  => "fcmp olt double "
  }\end{lstlisting}

\item \textbf{Typing}: In order to leave the CPS-translations
  as is, it makes sense to defer the full type-inference to the
  K-intermediate-language. For this it is good to define
  the \texttt{KVar} constructor as

\begin{lstlisting}[numbers=none,language=Scala]
case class KVar(s: String, ty: Ty = "UNDEF") extends KVal\end{lstlisting}

  where first a default type, for example \texttt{UNDEF}, is
  given. Then you need to define two typing functions

  \begin{lstlisting}[numbers=none,language=Scala]
    def typ_val(v: KVal, ts: TyEnv) = ???
    def typ_exp(a: KExp, ts: TyEnv) = ???
  \end{lstlisting}

  Both functions require a typing-environment that records
  what type each variable, operation
  and so on receives. Once the types are inferred, the
  LLVM-IR code can be generated. Since we are dealing only
  with simple first-order functions, nothing on the scale
  of the `Hindley-Milner' typing-algorithm is needed. I suggest
  you just look at what data is available and generate all
  missing information by ``simple means''\ldots rather than
  looking at the literature which solves the problem
  with much heavier machinery.

\item \textbf{Built-In Functions}: The `prelude' comes
  with several built-in functions: \texttt{new\_line()},
  \texttt{skip}, \texttt{print\_int(n)}, \texttt{print\_space()},
  \texttt{print\_star()} and \texttt{print\_char(n)}. You can find the `prelude',
  for example, in the file \texttt{sqr.ll}.
\end{itemize}
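
\noindent
To make the remarks about typing and floating-point operations a bit
more concrete, here is a minimal sketch of how a typing environment
could drive the choice between the integer and the double
instructions. Everything beyond \texttt{KVar}, \texttt{typ\_val},
\texttt{compile\_op} and \texttt{compile\_dop} is an assumption made
for this example: \texttt{Ty} is taken to be a string, \texttt{TyEnv} a
map from variable names to types, and \texttt{KNum}, \texttt{KFNum} and
\texttt{compile\_arith} are hypothetical names. Only three arithmetic
operations are shown. Your own K-language will probably look different.

\begin{lstlisting}[numbers=none,language=Scala]
type Ty = String                 // assumption: types are strings
type TyEnv = Map[String, Ty]     // assumption: variable name -> type

abstract class KVal
case class KVar(s: String, ty: Ty = "UNDEF") extends KVal
case class KNum(i: Int) extends KVal      // hypothetical constructor
case class KFNum(x: Float) extends KVal   // hypothetical constructor

// looks up an untyped variable in the environment; numbers
// carry their type in the constructor
def typ_val(v: KVal, ts: TyEnv): Ty = v match {
  case KVar(s, "UNDEF") => ts.getOrElse(s, "UNDEF")
  case KVar(_, ty) => ty
  case KNum(_) => "Int"
  case KFNum(_) => "Double"
}

def compile_op(op: String) = op match {
  case "+" => "add i32 "
  case "*" => "mul i32 "
  case "-" => "sub i32 "
}

def compile_dop(op: String) = op match {
  case "+" => "fadd double "
  case "*" => "fmul double "
  case "-" => "fsub double "
}

// picks the instruction according to the type of the first operand,
// assuming that both operands have the same type
def compile_arith(op: String, v: KVal, ts: TyEnv): String =
  typ_val(v, ts) match {
    case "Int" => compile_op(op)
    case "Double" => compile_dop(op)
    case ty => throw new Exception("cannot type operand: " + ty)
  }

val ts: TyEnv = Map("x" -> "Double", "n" -> "Int")
val instr = compile_arith("+", KVar("x"), ts)   // "fadd double "
\end{lstlisting}

\noindent
The environment \texttt{ts} would be filled from the information the
parser recorded, for example from the argument lists in \texttt{Def}
and from the global constants.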

\end{document}

%%% Local Variables:
%%% mode: latex
%%% TeX-master: t
%%% End: