% !TEX program = xelatex
\documentclass{article}
\usepackage{../style}
\usepackage{../langs}
\usepackage{../graphics}
\usepackage{../grammar}
%%\usepackage{multicol}

%%\newcommand{\dn}{\stackrel{\mbox{\scriptsize def}}{=}}

\begin{document}
\fnote{\copyright{} Christian Urban, King's College London, 2019}


% CPS translations
% https://felleisen.org/matthias/4400-s20/lecture17.html

%% pattern matching resources
%% https://www.reddit.com/r/ProgrammingLanguages/comments/g1vno3/beginner_resources_for_compiling_pattern_matching/

\section*{Handout 9 (LLVM, SSA and CPS)}

Reflecting on our two tiny compilers targeting the JVM, the code
generation part was actually not so hard, no? Pretty much just some
post-traversal of the abstract syntax tree, yes? One of the reasons for
this ease is that the JVM is a stack-based virtual machine and it is
therefore not hard to translate deeply-nested arithmetic expressions
into a sequence of instructions manipulating the stack. The problem is
that ``real'' CPUs, although supporting stack operations, are not really
designed to be \emph{stack machines}. The design of CPUs is more like:
here is a chunk of memory---compiler, or better compiler writers, do
something with it. Consequently, modern compilers need to go the extra
mile in order to generate code that is much easier and faster for CPUs
to process. To make this all tractable for this module, we target the
LLVM Intermediate Language. In this way we can take advantage of the
tools coming with LLVM. For example, we do not have to worry about
things like register allocation.\bigskip 

\noindent LLVM\footnote{\url{http://llvm.org}} is a beautiful example
of how projects from academia can make a difference in the world. LLVM
started in 2000 as a project by two researchers at the University of
Illinois at Urbana-Champaign. At the time the behemoth of compilers was
gcc with its myriad of front-ends for other languages (C++, Fortran,
Ada, Go, Objective-C, Pascal etc). The problem was that gcc morphed over
time into a gigantic monolithic piece of m\ldots ehm software, which you
could not easily mess about with in an afternoon. In contrast, LLVM is
designed to be a modular suite of tools with which you can play around
easily and try out something new. LLVM became a big player once Apple
hired one of the original developers (I cannot remember the reason why
Apple did not want to use gcc, but maybe they were also just disgusted
by gcc's big monolithic codebase). Anyway, LLVM is now the big player
and gcc is more or less legacy. This does not mean that programming
languages like C and C++ are dying out any time soon---they are nicely
supported by LLVM.

We will target the LLVM Intermediate Language, or LLVM Intermediate
Representation (LLVM-IR for short). The LLVM-IR looks very similar to
the assembly language of Jasmin and Krakatau. It will also allow us to
benefit from the modular structure of the LLVM compiler and, for
example, let the compiler generate code for different CPUs, like X86 or
ARM. That means we can be agnostic about where our code actually runs.
We can also be ignorant about optimising code and allocating memory
efficiently. 

However, what we have to do for LLVM is to generate code in \emph{Static
Single-Assignment} format (SSA for short), because that is what the
LLVM-IR expects from us. A reason why LLVM uses the SSA format, rather
than JVM-like stack instructions, is that stack instructions are
difficult to optimise---you cannot just re-arrange instructions without
messing about with what is calculated on the stack. Also, it is hard to
find out whether all the calculations on the stack are actually
necessary or whether some of them are just dead code. The JVM has
sophisticated machinery to make such ``high-level'' code run fast
despite all these obstacles, but let's say that for the sake of argument
we do not want to rely on it. We want to generate fast code ourselves.
This means we have to work around the intricacies of what instructions
CPUs can actually process fast. This is what the SSA format is designed
for.


The main idea behind the SSA format is to use very simple variable
assignments where every tmp-variable is assigned only once. The
assignments also need to be primitive in the sense that they can only be
simple operations like addition, multiplication, jumps, comparisons and
so on. Say we have the expression $((1 + a) + (3 + (b * 5)))$, then the
corresponding SSA format is

\begin{lstlisting}[language=LLVMIR,numbers=left]
let tmp0 = add 1 a in   
let tmp1 = mul b 5 in 
let tmp2 = add 3 tmp1 in 
let tmp3 = add tmp0 tmp2 in tmp3 
\end{lstlisting}

\noindent where every variable is assigned only once (we could not write
\texttt{tmp1 = add 3 tmp1} in Line 3, for example).

There are sophisticated algorithms for imperative languages, like C,
that efficiently transform a high-level program into SSA format. But
we can ignore them here. We want to compile a functional language and
there things get much more interesting than just sophisticated. We
will need to have a look at CPS translations, where CPS stands for
Continuation-Passing-Style---basically black programming art or
abracadabra programming. So sit tight.

\subsection*{LLVM-IR}

Before we start, let's first have a look at the \emph{LLVM Intermediate
Representation} in more detail. The LLVM-IR sits between the frontends
and backends of the LLVM framework. It allows compilation of multiple
source languages to multiple targets. It is also the place where most of
the target-independent optimisations are performed. 

What is good about our toy Fun language is that it basically only
contains expressions (be they arithmetic expressions, boolean
expressions or if-expressions). The exceptions are function definitions.
Luckily, for them we can use the mechanism of defining functions in the
LLVM-IR (this is similar to using JVM methods for functions in our
earlier compiler). For example, the simple Fun program 


\begin{lstlisting}[language=Scala,numbers=none]
def sqr(x) = x * x
\end{lstlisting}

\noindent
can be compiled to the following LLVM-IR function:

\begin{lstlisting}[language=LLVM]
define i32 @sqr(i32 %x) {
   %tmp = mul i32 %x, %x
   ret i32 %tmp
}    
\end{lstlisting}

\noindent First notice that all variable names, in this case \texttt{x}
and \texttt{tmp}, are prefixed with \texttt{\%} in the LLVM-IR.
Temporary variables can be named with an identifier, such as
\texttt{tmp}, or with numbers. In contrast, function names, since they
are ``global'', need to be prefixed with an @-symbol. Also, the LLVM-IR
is a fully typed language. The \texttt{i32} type stands for 32-bit
integers. There are also types for 64-bit integers (\texttt{i64}), chars
(\texttt{i8}), floats, arrays and even pointer types. In the code above,
\texttt{sqr} takes an argument of type \texttt{i32} and produces a
result of type \texttt{i32} (the result type is in front of the function
name, like in C). Each arithmetic operation, for example addition and
multiplication, is also annotated with the type it operates on.
Obviously these types need to match up\ldots{} but since our programs
contain only integers, for the moment \texttt{i32} everywhere will do.
We do not have to generate any other types, but obviously this is a
limitation of our Fun language.

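As a small illustration of these type annotations, an expression like $x
* x + 1$ could be broken up in the LLVM-IR as follows (the temporary
names \texttt{\%t1} and \texttt{\%t2} are just made up for this sketch):

\begin{lstlisting}[language=LLVM]
%t1 = mul i32 %x, %x     ; 32-bit multiplication
%t2 = add i32 %t1, 1     ; the operand types must match
\end{lstlisting}
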
There are a few interesting instructions in the LLVM-IR which are quite
different from what we have seen in the JVM. Can you remember the
kerfuffle we had to go through with boolean expressions and negating the
condition? In the LLVM-IR, branching on if-conditions is implemented
differently: there is a separate \texttt{br}-instruction as follows:

\begin{lstlisting}[language=LLVM]
br i1 %var, label %if_br, label %else_br
\end{lstlisting}

\noindent
The type \texttt{i1} stands for booleans. If the variable is true, then
this instruction jumps to the if-branch, which needs an explicit label;
otherwise it jumps to the else-branch, again with its own label. This
allows us to keep the meaning of the boolean expression ``as is'' when
compiling if's---thank god, no more negating the boolean.
A value of type boolean is generated in the LLVM-IR by the
\texttt{icmp}-instruction. This instruction is for integers (hence the
\texttt{i}) and takes the comparison operation as argument. For example

\begin{lstlisting}[language=LLVM]
icmp eq  i32 %x, %y     ; for equal
icmp sle i32 %x, %y     ; signed less or equal
icmp slt i32 %x, %y     ; signed less than
icmp ult i32 %x, %y     ; unsigned less than 
\end{lstlisting}

\noindent
Note that for some operations the LLVM-IR distinguishes between signed
and unsigned representations of integers.

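Putting \texttt{icmp} and \texttt{br} together, the following sketch
shows a complete function (called \texttt{@max} here; it is not part of
the handout's running example) that returns the larger of its two
arguments:

\begin{lstlisting}[language=LLVM]
define i32 @max(i32 %x, i32 %y) {
   %cond = icmp sle i32 %x, %y     ; %cond has type i1
   br i1 %cond, label %if_br, label %else_br
if_br:
   ret i32 %y
else_br:
   ret i32 %x
}
\end{lstlisting}
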
It is also easy to call another function in the LLVM-IR: as can be 
seen from Figure~\ref{lli}, we can just call a function with the 
instruction \texttt{call} and can also assign the result to 
a variable. The syntax is as follows:

\begin{lstlisting}[language=LLVM]
%var = call i32 @foo(...args...)
\end{lstlisting}

\noindent
where the arguments can only be simple variables, not compound
expressions.

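For example, a call to the \texttt{sqr} function from above could look
like this, assuming the argument already sits in a variable (here called
\texttt{\%t}):

\begin{lstlisting}[language=LLVM]
%res = call i32 @sqr(i32 %t)
\end{lstlisting}
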
Conveniently, you can use the program \texttt{lli}, which comes with
LLVM, to interpret programs written in the LLVM-IR. So you can easily
check whether the code you produced actually works. To get a running
program that does something interesting, you need to add some
boilerplate about printing out numbers and a main-function that is the
entry point for the program (see Figure~\ref{lli} for a complete
listing). Again this is very similar to the boilerplate we needed to add
in our JVM compiler. 

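Assuming the program from Figure~\ref{lli} is saved in a file called
\texttt{sqr.ll} (the same file name as used in the compilation steps
below), it can be run directly with:

\begin{lstlisting}[language=bash,numbers=none]
lli sqr.ll
\end{lstlisting}
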
You can generate a binary for the program in Figure~\ref{lli} by using
the \texttt{llc}-compiler and then \texttt{gcc}, whereby \texttt{llc}
generates an object file and \texttt{gcc} (that is clang) generates the
executable binary:

\begin{lstlisting}[language=bash,numbers=none]
llc -filetype=obj sqr.ll
gcc sqr.o -o a.out
./a.out
> 25
\end{lstlisting}


\begin{figure}[t]\small 
\lstinputlisting[language=LLVM,numbers=left]{../progs/sqr.ll}
\caption{An LLVM-IR program for calculating the square function. It
calls this function in \texttt{@main} with the argument \texttt{5}. The
code for the \texttt{sqr} function is in Lines 13 -- 16. The main
function calls \texttt{sqr} and then prints out the result. The other
code is boilerplate for printing out integers.\label{lli}}
\end{figure}   


\subsection*{Our Own Intermediate Language}

Remember that compilers have to solve the problem of bridging the gap
between ``high-level'' programs and ``low-level'' hardware. If the gap
is too wide for one step, then a good strategy is to lay a stepping
stone somewhere in between. The LLVM-IR itself is such a stepping stone
to make the task of generating and optimising code easier. Like a real
compiler, we will use our own stepping stone, which I call the
\emph{K-language}. For what follows, recall the various kinds of
expressions in the Fun language. For convenience, the Scala code of the
corresponding abstract syntax trees is shown at the top of
Figure~\ref{absfun}. Below it is the code for the abstract syntax trees
of the K-language. In K, there are two kinds of syntactic entities,
namely \emph{K-values} and \emph{K-expressions}. The central constructor
of the K-language is \texttt{KLet}. For this, recall that in SSA
arithmetic expressions such as $((1 + a) + (3 + (b * 5)))$ need to be
broken up into smaller ``atomic'' steps, like so:

\begin{lstlisting}[language=LLVMIR,numbers=none]
let tmp0 = add 1 a in   
let tmp1 = mul b 5 in 
let tmp2 = add 3 tmp1 in 
let tmp3 = add tmp0 tmp2 in
  tmp3 
\end{lstlisting}

\noindent
Here \texttt{tmp3} will contain the result of what the whole expression
stands for. In each individual step we can only perform an ``atomic''
operation, like addition or multiplication of a number and a variable.
We are not allowed to have, for example, an if-condition on the
right-hand side of an equals sign. Such constraints are enforced upon us
because of how the SSA format works in the LLVM-IR. By having
\texttt{KLet} take first a string (standing for an intermediate result)
and second a value, we fulfil this constraint ``by construction''---there
is no way we could write anything other than a value. 

To sum up, K-values are the atomic operations that can appear on the
right-hand side of equal-signs. The K-language is restricted such that
it is easy to generate the SSA format for the LLVM-IR. 

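To see how this fits together, the SSA sequence above corresponds to the
following nest of \texttt{KLet}s (a sketch built from the constructors
in Figure~\ref{absfun}; how exactly the operators are represented as
strings is an assumption here):

\begin{lstlisting}[language=Scala,numbers=none]
KLet("tmp0", Kop("+", KNum(1), KVar("a")),
  KLet("tmp1", Kop("*", KVar("b"), KNum(5)),
    KLet("tmp2", Kop("+", KNum(3), KVar("tmp1")),
      KLet("tmp3", Kop("+", KVar("tmp0"), KVar("tmp2")),
        KReturn(KVar("tmp3"))))))
\end{lstlisting}
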



\begin{figure}[p]\small
\begin{lstlisting}[language=Scala,numbers=none]
// Fun language (expressions)
abstract class Exp 
abstract class BExp 

case class Call(name: String, args: List[Exp]) extends Exp
case class If(a: BExp, e1: Exp, e2: Exp) extends Exp
case class Write(e: Exp) extends Exp
case class Var(s: String) extends Exp
case class Num(i: Int) extends Exp
case class Aop(o: String, a1: Exp, a2: Exp) extends Exp
case class Sequence(e1: Exp, e2: Exp) extends Exp
case class Bop(o: String, a1: Exp, a2: Exp) extends BExp  



// K-language (K-expressions, K-values)
abstract class KExp
abstract class KVal

case class KVar(s: String) extends KVal
case class KNum(i: Int) extends KVal
case class Kop(o: String, v1: KVal, v2: KVal) extends KVal
case class KCall(o: String, vrs: List[KVal]) extends KVal
case class KWrite(v: KVal) extends KVal

case class KIf(x1: String, e1: KExp, e2: KExp) extends KExp
case class KLet(x: String, v: KVal, e: KExp) extends KExp
case class KReturn(v: KVal) extends KExp
\end{lstlisting}
\caption{Abstract syntax trees for the Fun language and the
  K-language.\label{absfun}}
\end{figure}




\subsection*{CPS-Translations}

CPS stands for Continuation-Passing-Style. It is a kind of programming
technique often used in advanced functional programming. Before we delve
into the CPS-translation for our Fun language, let us look at 
CPS-versions of some well-known functions. Consider

\begin{lstlisting}[language=Scala, numbers=none]
def fact(n: Int) : Int = 
  if (n == 0) 1 else n * fact(n - 1) 
\end{lstlisting}

\noindent 
This is clearly the usual factorial function. But now consider the
following version of the factorial function:

\begin{lstlisting}[language=Scala, numbers=none]
def factC(n: Int, ret: Int => Int) : Int = 
  if (n == 0) ret(1) 
  else factC(n - 1, x => ret(n * x)) 

factC(3, identity)
\end{lstlisting}

\noindent
This function is called with the number, in this case 3, and the
identity-function (which returns just its input). The recursive
calls are:

\begin{lstlisting}[language=Scala, numbers=none]
factC(2, x => identity(3 * x))
factC(1, x => identity(3 * (2 * x)))
factC(0, x => identity(3 * (2 * (1 * x))))
\end{lstlisting}

\noindent
Having reached 0, we get out of the recursion and apply 1 to the
continuation (see if-branch above). This gives

\begin{lstlisting}[language=Scala, numbers=none]
identity(3 * (2 * (1 * 1)))
= 3 * (2 * (1 * 1))
= 6
\end{lstlisting}

\noindent
which is the expected result. If this looks somewhat familiar to you,
then this is because functions with continuations can be seen as a kind
of generalisation of tail-recursive functions. Anyway, notice how the
continuation is ``stacked up'' during the recursion and then
``unrolled'' when we apply 1 to the continuation. Interestingly, we can
do something similar with the Fibonacci function, where the traditional
version has two recursive calls. Consider the following function:

\begin{lstlisting}[language=Scala, numbers=none]
def fibC(n: Int, ret: Int => Int) : Int = 
  if (n == 0 || n == 1) ret(1) 
  else fibC(n - 1,
             r1 => fibC(n - 2,
               r2 => ret(r1 + r2)))
\end{lstlisting}

\noindent
Here the continuation is a nested function essentially wrapping up 
the second recursive call. Let us check how the recursion unfolds
when called with 3 and the identity function:

\begin{lstlisting}[language=Scala, numbers=none]
fibC(3, id)
fibC(2, r1 => fibC(1, r2 => id(r1 + r2)))
fibC(1, r1 => 
   fibC(0, r2 => fibC(1, r2a => id((r1 + r2) + r2a))))
fibC(0, r2 => fibC(1, r2a => id((1 + r2) + r2a)))
fibC(1, r2a => id((1 + 1) + r2a))
id((1 + 1) + 1)
(1 + 1) + 1
3
\end{lstlisting}

Let us now come back to the CPS-translations for the Fun language. The
main difficulty of generating instructions in SSA format is that large
compound expressions need to be broken up into smaller pieces and
intermediate results need to be chained into later instructions. To do
this conveniently, CPS-translations have been developed. They use
functions (``continuations'') to represent what is coming next in a
sequence of instructions. In our case, continuations are functions from
\code{KVal} to \code{KExp}. They can be seen as a sequence of
\code{KLet}s where there is a ``hole'' that still needs to be filled.
Consider for example:

\begin{lstlisting}[language=LLVMIR,numbers=left,escapeinside={(*@}{@*)}]
let tmp0 = add 1 a in   
let tmp1 = mul (*@$\Box$@*) 5 in 
let tmp2 = add 3 tmp1 in 
let tmp3 = add tmp0 tmp2 in
  tmp3 
\end{lstlisting}

\noindent
where in the second line there is a $\Box$ which still expects a
\code{KVal} to be filled in before it becomes a ``proper'' \code{KExp}.
When we apply an argument to the continuation (remember, continuations
are functions) we essentially fill something into the corresponding
hole. The code of the CPS-translation is 

\begin{lstlisting}[language=Scala,numbers=none]
def CPS(e: Exp)(k: KVal => KExp) : KExp = 
  e match { ... }
\end{lstlisting}

\noindent 
where \code{k} is the continuation and \code{e} is the expression 
to be compiled. In case we have numbers or variables, we can just
apply the continuation like 

\begin{center}
\code{k(KNum(n))} \qquad \code{k(KVar(x))}
\end{center}

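Spelled out as Scala code, these two cases of the \code{CPS}-function
might look as follows (a sketch using the constructor names from
Figure~\ref{absfun}):

\begin{lstlisting}[language=Scala,numbers=none]
case Num(i) => k(KNum(i))   // a number is already a K-value
case Var(s) => k(KVar(s))   // ...and so is a variable
\end{lstlisting}
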
\noindent This would just fill in the $\Box$ in a \code{KLet}-expression.
More interesting is the case for an arithmetic expression.

\begin{lstlisting}[language=Scala,numbers=none]
case Aop(o, e1, e2) => {
    val z = Fresh("tmp")
    CPS(e1)(y1 => 
      CPS(e2)(y2 => KLet(z, Kop(o, y1, y2), k(KVar(z)))))
}
\end{lstlisting}

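Here \code{Fresh("tmp")} generates a fresh name (something like
\code{tmp0}, \code{tmp1} and so on): we first translate \code{e1}, then
\code{e2}, and only when both intermediate results \code{y1} and
\code{y2} are available do we emit a \code{KLet} for the operation and
pass the new variable on to the continuation \code{k}. The remaining
cases follow the same pattern. For instance, a sequence of two
expressions might be translated roughly like this (only a sketch of a
plausible clause, not necessarily the exact code of the fun-llvm
compiler):

\begin{lstlisting}[language=Scala,numbers=none]
case Sequence(e1, e2) => 
  // compile e1 only for its effect, then continue with e2
  CPS(e1)(_ => CPS(e2)(y2 => k(y2)))
\end{lstlisting}
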
\noindent
For more such rules, have a look at the code of the fun-llvm
compiler.

\end{document}


%%% Local Variables: 
%%% mode: latex
%%% TeX-master: t
%%% End: 
 |