afl-material: cws/cw05.tex@568671822d52 (annotated)

630 9b1c15c3eb6f updated Christian Urban <urbanc@in.tum.de> parents: 567 diff changeset	1	% !TEX program = xelatex
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	2	\documentclass{article}
299 6322922aa990 update Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 298 diff changeset	3	\usepackage{../style}
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	4	\usepackage{../graphics}
216 f5ec7c597c5b updated Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 214 diff changeset	5	\usepackage{../langs}
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	6
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	7	\begin{document}
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	8
836 a3418ee8c404 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 821 diff changeset	9	\section*{Coursework 5}
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	10
722 14914b57e207 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 719 diff changeset	11
14914b57e207 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 719 diff changeset	12
836 a3418ee8c404 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 821 diff changeset	13	\noindent This coursework is worth 25\% and is due on \cwFIVE{} at
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	14	18:00. You are asked to implement a compiler targeting the LLVM-IR.
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	15	Be careful that this CW needs some material about the LLVM-IR
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	16	that has not been shown in the lectures and your own experiments
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	17	might be required. You can find information about the LLVM-IR at
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	18
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	19	\begin{itemize}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	20	\item \url{https://bit.ly/3rheZYr}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	21	\item \url{https://llvm.org/docs/LangRef.html}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	22	\end{itemize}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	23
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	24	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	25	You can do the implementation of your compiler in any programming
748 383f2a5952ce updated Christian Urban <christian.urban@kcl.ac.uk> parents: 722 diff changeset	26	language you like, but you need to submit the source code with which
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	27	you generated the LLVM-IR files, otherwise a mark of 0\% will be
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	28	awarded. You are asked to submit the code of your compiler, but also
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	29	the generated \texttt{.ll} files. You should use the lexer and parser
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	30	from the previous courseworks, but you need to make some modifications
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	31	to them for the `typed' fun-language. I will award up to 5\% if a
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	32	lexer and a parser are correctly implemented. At the end, please
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	33	package everything(!) in a zip-file that creates a directory with the
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	34	name
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	35
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	36	\begin{center}
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	37	\texttt{YournameYourFamilyname}
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	38	\end{center}
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	39
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	40	\noindent
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	41	on my end.
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	42
750 e93a9e74ca8e updated Christian Urban <christian.urban@kcl.ac.uk> parents: 748 diff changeset	43	\subsection*{Disclaimer\alert}
358 b3129cff41e9 updated Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 333 diff changeset	44
750 e93a9e74ca8e updated Christian Urban <christian.urban@kcl.ac.uk> parents: 748 diff changeset	45	It should be understood that the work you submit represents your own
e93a9e74ca8e updated Christian Urban <christian.urban@kcl.ac.uk> parents: 748 diff changeset	46	effort. You have not copied from anyone else. An exception is the
e93a9e74ca8e updated Christian Urban <christian.urban@kcl.ac.uk> parents: 748 diff changeset	47	Scala code I showed during the lectures or uploaded to KEATS, which
751 4b208d81e002 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 750 diff changeset	48	you can both use. You can also use your own code from the CW~1 --
4b208d81e002 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 750 diff changeset	49	CW~4.
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	50
299 6322922aa990 update Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 298 diff changeset	51
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	52	\subsection*{Task}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	53
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	54	The main goal is to lex and parse 4 Fun-programs, including the
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	55	Mandelbrot program shown in Figure~\ref{mand}, and generate
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	56	corresponding code for the LLVM-IR. Unfortunately the calculations for
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	57	the Mandelbrot Set require floating point arithmetic and therefore we
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	58	cannot be as simple-minded about types as we have been so far
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	59	(remember the LLVM-IR is a fully-typed language and needs to know the
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	60	exact types of each expression). The idea is to deal appropriately
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	61	with three types, namely \texttt{Int}, \texttt{Double} and
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	62	\texttt{Void} (they are represented in the LLVM-IR as \texttt{i32},
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	63	\texttt{double} and \texttt{void}). You need to extend the lexer and
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	64	parser accordingly in order to deal with type annotations. The
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	65	Fun-language includes global constants, such as
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	66
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	67	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	68	val Ymin: Double = -1.3;
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	69	val Maxiters: Int = 1000;
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	70	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	71
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	72	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	73	where you want to assume that they are `normal' identifiers, just
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	74	starting with a capital letter---all other identifiers should have
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	75	lower-case letters. Function definitions can take arguments of
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	76	type \texttt{Int} or \texttt{Double}, and need to specify a return
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	77	type, which can be \texttt{Void}, for example
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	78
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	79	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	80	def foo(n: Int, x: Double) : Double = ...
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	81	def id(n: Int) : Int = ...
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	82	def bar() : Void = ...
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	83	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	84
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	85	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	86	The idea is to record all typing information that is given
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	87	in the Fun-program, but then delay any further typing inference to
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	88	after the CPS-translation. That means the parser should
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	89	generate ASTs given by the Scala dataypes:
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	90
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	91	\begin{lstlisting}[numbers=none,language=Scala]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	92	abstract class Exp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	93	abstract class BExp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	94	abstract class Decl
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	95
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	96	case class Def(name: String, args: List[(String, String)],
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	97	ty: String, body: Exp) extends Decl
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	98	case class Main(e: Exp) extends Decl
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	99	case class Const(name: String, v: Int) extends Decl
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	100	case class FConst(name: String, x: Float) extends Decl
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	101
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	102	case class Call(name: String, args: List[Exp]) extends Exp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	103	case class If(a: BExp, e1: Exp, e2: Exp) extends Exp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	104	case class Var(s: String) extends Exp
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	105	case class Num(i: Int) extends Exp // integer numbers
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	106	case class FNum(i: Float) extends Exp // floating numbers
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	107	case class ChConst(c: Int) extends Exp // char constant
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	108	case class Aop(o: String, a1: Exp, a2: Exp) extends Exp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	109	case class Sequence(e1: Exp, e2: Exp) extends Exp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	110	case class Bop(o: String, a1: Exp, a2: Exp) extends BExp
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	111	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	112
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	113	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	114	This datatype distinguishes whether the global constant is an integer
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	115	constant or floating constant. Also a function definition needs to
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	116	record the return type of the function, namely the argument
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	117	\texttt{ty} in \texttt{Def}, and the arguments consist of an pairs of
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	118	identifier names and types (\texttt{Int} or \texttt{Double}). The hard
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	119	part of the CW is to design the K-intermediate language and infer all
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	120	necessary types in order to generate LLVM-IR code. You can check
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	121	your LLVM-IR code by running it with the interpreter \texttt{lli}.
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	122
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	123	\begin{figure}[t]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	124	\lstinputlisting[language=Scala]{../progs/fun2/mand.fun}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	125	\caption{The Mandelbrot program in the `typed' Fun-language.\label{mand}}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	126	\end{figure}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	127
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	128	\begin{figure}[t]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	129	\includegraphics[scale=0.35]{../progs/fun2/out.png}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	130	\caption{Ascii output of the Mandelbrot program.\label{mand}}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	131	\end{figure}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	132
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	133	Also note that the second version of the Mandelbrot program and also
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	134	the Tower of Hanoi program uses character constants, like \texttt{'a'},
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	135	\texttt{'1'}, \texttt{'$\backslash$n'} and so on. When they are tokenised,
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	136	such characters should be interpreted as the corresponding ASCII code (an
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	137	integer), such that we can use them in calculations like \texttt{'a' + 10}
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	138	where the result should be 107. As usual, the character \texttt{'$\backslash$n'} is the
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	139	ASCII code 10.
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	140
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	141
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	142	\subsection*{LLVM-IR}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	143
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	144	There are some subtleties in the LLVM-IR you need to be aware of:
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	145
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	146	\begin{itemize}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	147	\item \textbf{Global constants}: While global constants such as
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	148
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	149	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	150	val Max : Int = 10;
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	151	\end{lstlisting}
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	152
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	153	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	154	can be easily defined in the LLVM-IR as follows
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	155
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	156	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	157	@Max = global i32 10
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	158	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	159
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	160	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	161	they cannot easily be referenced. If you want to use
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	162	this constant then you need to generate code such as
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	163
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	164	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	165	%tmp_22 = load i32, i32* @Max
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	166	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	167
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	168	\noindent
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	169	first, which treats \texttt{@Max} as an Integer-pointer (type
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	170	\texttt{i32*}) that needs to be loaded into a local variable,
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	171	here \texttt{\%tmp\_22}.
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	172
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	173	\item \textbf{Void-Functions}: While integer and double functions
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	174	can easily be called and their results can be allocated to a
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	175	temporary variable:
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	176
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	177	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	178	%tmp_23 = call i32 @sqr (i32 %n)
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	179	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	180
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	181	void-functions cannot be allocated to a variable. They need to be
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	182	called just as
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	183
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	184	\begin{lstlisting}[numbers=none]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	185	call void @print_int (i32 %tmp_23)
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	186	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	187
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	188	\item \textbf{Floating-Point Operations}: While integer operations
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	189	are specified in the LLVM-IR as
201 c813506e0ee8 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 200 diff changeset	190
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	191	\begin{lstlisting}[numbers=none,language=Scala]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	192	def compile_op(op: String) = op match {
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	193	case "+" => "add i32 "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	194	case "*" => "mul i32 "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	195	case "-" => "sub i32 "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	196	case "==" => "icmp eq i32 "
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	197	case "!=" => "icmp ne i32 "
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	198	case "<=" => "icmp sle i32 " // signed less or equal
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	199	case "<" => "icmp slt i32 " // signed less than
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	200	}\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	201
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	202	the corresponding operations on doubles are
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	203
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	204	\begin{lstlisting}[numbers=none,language=Scala]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	205	def compile_dop(op: String) = op match {
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	206	case "+" => "fadd double "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	207	case "*" => "fmul double "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	208	case "-" => "fsub double "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	209	case "==" => "fcmp oeq double "
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	210	case "!=" => "fcmp one double "
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	211	case "<=" => "fcmp ole double "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	212	case "<" => "fcmp olt double "
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	213	}\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	214
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	215	\item \textbf{Typing}: In order to leave the CPS-translations
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	216	as is, it makes sense to defer the full type-inference to the
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	217	K-intermediate-language. For this it is good to define
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	218	the \texttt{KVar} constructor as
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	219
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	220	\begin{lstlisting}[numbers=none,language=Scala]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	221	case class KVar(s: String, ty: Ty = "UNDEF") extends KVal\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	222
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	223	where first a default type, for example \texttt{UNDEF}, is
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	224	given. Then you need to define two typing functions
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	225
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	226	\begin{lstlisting}[numbers=none,language=Scala]
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	227	def typ_val(v: KVal, ts: TyEnv) = ???
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	228	def typ_exp(a: KExp, ts: TyEnv) = ???
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	229	\end{lstlisting}
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	230
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	231	Both functions require a typing-environment that updates
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	232	the information about what type each variable, operation
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	233	and so on receives. Once the types are inferred, the
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	234	LLVM-IR code can be generated. Since we are dealing only
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	235	with simple first-order functions, nothing on the scale
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	236	as the `Hindley-Milner' typing-algorithm is needed. I suggest
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	237	to just look at what data is avaliable and generate all
836 a3418ee8c404 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 821 diff changeset	238	missing information by ``simple means''\ldots rather than
a3418ee8c404 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 821 diff changeset	239	looking at the literature which solves the problem
a3418ee8c404 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 821 diff changeset	240	with much heavier machinery.
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	241
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	242	\item \textbf{Build-In Functions}: The `prelude' comes
7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	243	with several build-in functions: \texttt{new\_line()},
853 568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	244	\texttt{skip}, \texttt{print\_int(n)}, \texttt{print\_space()},
568671822d52 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 836 diff changeset	245	\texttt{print\_star()} and \texttt{print\_char(n)}. You can find the `prelude' for
821 f914b9476dc7 updated Christian Urban <christian.urban@kcl.ac.uk> parents: 820 diff changeset	246	example in the file \texttt{sqr.ll}.
820 7fd1f611c21d updated Christian Urban <christian.urban@kcl.ac.uk> parents: 752 diff changeset	247	\end{itemize}
205 0b59588d28d2 updated Christian Urban <christian dot urban at kcl dot ac dot uk> parents: 204 diff changeset	248
200 7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	249	\end{document}
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	250
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	251	%%% Local Variables:
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	252	%%% mode: latex
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	253	%%% TeX-master: t
7415871b1ef5 added Christian Urban <christian dot urban at kcl dot ac dot uk> parents: diff changeset	254	%%% End:

author	Christian Urban <christian.urban@kcl.ac.uk>
	Fri, 03 Dec 2021 17:45:11 +0000
changeset 853	568671822d52
parent 836	a3418ee8c404
child 855	1c0a684567d7
permissions	-rw-r--r--