1 |
|
2 (*<*) |
|
3 theory Paper |
|
4 imports "../Nominal/Nominal2" |
|
5 "~~/src/HOL/Library/LaTeXsugar" |
|
6 begin |
|
7 |
|
8 consts |
|
9 fv :: "'a \<Rightarrow> 'b" |
|
10 abs_set :: "'a \<Rightarrow> 'b \<Rightarrow> 'c" |
|
11 alpha_bn :: "'a \<Rightarrow> 'a \<Rightarrow> bool" |
|
12 abs_set2 :: "'a \<Rightarrow> perm \<Rightarrow> 'b \<Rightarrow> 'c" |
|
13 Abs_dist :: "'a \<Rightarrow> 'b \<Rightarrow> 'c" |
|
14 Abs_print :: "'a \<Rightarrow> 'b \<Rightarrow> 'c" |
|
15 |
|
16 definition |
|
17 "equal \<equiv> (op =)" |
|
18 |
|
19 notation (latex output) |
|
20 swap ("'(_ _')" [1000, 1000] 1000) and |
|
21 fresh ("_ # _" [51, 51] 50) and |
|
22 fresh_star ("_ #\<^sup>* _" [51, 51] 50) and |
|
23 supp ("supp _" [78] 73) and |
|
24 uminus ("-_" [78] 73) and |
|
25 If ("if _ then _ else _" 10) and |
|
26 alpha_set ("_ \<approx>\<^raw:\,\raisebox{-1pt}{\makebox[0mm][l]{$_{\textit{set}}$}}>\<^bsup>_, _, _\<^esup> _") and |
|
27 alpha_lst ("_ \<approx>\<^raw:\,\raisebox{-1pt}{\makebox[0mm][l]{$_{\textit{list}}$}}>\<^bsup>_, _, _\<^esup> _") and |
|
28 alpha_res ("_ \<approx>\<^raw:\,\raisebox{-1pt}{\makebox[0mm][l]{$_{\textit{set+}}$}}>\<^bsup>_, _, _\<^esup> _") and |
|
29 abs_set ("_ \<approx>\<^raw:{$\,_{\textit{abs\_set}}$}> _") and |
|
30 abs_set2 ("_ \<approx>\<^raw:\raisebox{-1pt}{\makebox[0mm][l]{$\,_{\textit{list}}$}}>\<^bsup>_\<^esup> _") and |
|
31 fv ("fa'(_')" [100] 100) and |
|
32 equal ("=") and |
|
33 alpha_abs_set ("_ \<approx>\<^raw:{$\,_{\textit{abs\_set}}$}> _") and |
|
34 Abs_set ("[_]\<^bsub>set\<^esub>._" [20, 101] 999) and |
|
35 Abs_lst ("[_]\<^bsub>list\<^esub>._") and |
|
36 Abs_dist ("[_]\<^bsub>#list\<^esub>._") and |
|
37 Abs_res ("[_]\<^bsub>set+\<^esub>._") and |
|
38 Abs_print ("_\<^bsub>set\<^esub>._") and |
|
39 Cons ("_::_" [78,77] 73) and |
|
40 supp_set ("aux _" [1000] 10) and |
|
41 alpha_bn ("_ \<approx>bn _") |
|
42 |
|
43 consts alpha_trm ::'a |
|
44 consts fa_trm :: 'a |
|
45 consts alpha_trm2 ::'a |
|
46 consts fa_trm2 :: 'a |
|
47 consts ast :: 'a |
|
48 consts ast' :: 'a |
|
49 notation (latex output) |
|
50 alpha_trm ("\<approx>\<^bsub>trm\<^esub>") and |
|
51 fa_trm ("fa\<^bsub>trm\<^esub>") and |
|
52 alpha_trm2 ("'(\<approx>\<^bsub>assn\<^esub>, \<approx>\<^bsub>trm\<^esub>')") and |
|
53 fa_trm2 ("'(fa\<^bsub>assn\<^esub>, fa\<^bsub>trm\<^esub>')") and |
|
54 ast ("'(as, t')") and |
|
55 ast' ("'(as', t\<PRIME> ')") |
|
56 |
|
57 (*>*) |
|
58 |
|
59 |
|
60 section {* Introduction *} |
|
61 |
|
62 text {* |
|
63 |
|
64 So far, Nominal Isabelle provided a mechanism for constructing |
|
65 $\alpha$-equated terms, for example lambda-terms, |
|
66 @{text "t ::= x | t t | \<lambda>x. t"}, |
|
67 where free and bound variables have names. For such $\alpha$-equated terms, |
|
68 Nominal Isabelle derives automatically a reasoning infrastructure that has |
|
69 been used successfully in formalisations of an equivalence checking |
|
70 algorithm for LF \cite{UrbanCheneyBerghofer08}, Typed |
|
71 Scheme~\cite{TobinHochstadtFelleisen08}, several calculi for concurrency |
|
72 \cite{BengtsonParow09} and a strong normalisation result for cut-elimination |
|
73 in classical logic \cite{UrbanZhu08}. It has also been used by Pollack for |
|
74 formalisations in the locally-nameless approach to binding |
|
75 \cite{SatoPollack10}. |
|
76 |
|
77 However, Nominal Isabelle has fared less well in a formalisation of |
|
78 the algorithm W \cite{UrbanNipkow09}, where types and type-schemes are, |
|
79 respectively, of the form |
|
80 % |
|
81 \begin{equation}\label{tysch} |
|
82 \begin{array}{l} |
|
83 @{text "T ::= x | T \<rightarrow> T"}\hspace{9mm} |
|
84 @{text "S ::= \<forall>{x\<^isub>1,\<dots>, x\<^isub>n}. T"} |
|
85 \end{array} |
|
86 \end{equation} |
|
87 % |
|
88 \noindent |
|
89 and the @{text "\<forall>"}-quantification binds a finite (possibly empty) set of |
|
90 type-variables. While it is possible to implement this kind of more general |
|
91 binders by iterating single binders, this leads to a rather clumsy |
|
92 formalisation of W. |
|
93 %The need of iterating single binders is also one reason |
|
94 %why Nominal Isabelle |
|
95 % and similar theorem provers that only provide |
|
96 %mechanisms for binding single variables |
|
97 %has not fared extremely well with the |
|
98 %more advanced tasks in the POPLmark challenge \cite{challenge05}, because |
|
99 %also there one would like to bind multiple variables at once. |
|
100 |
|
101 Binding multiple variables has interesting properties that cannot be captured |
|
102 easily by iterating single binders. For example in the case of type-schemes we do not |
|
103 want to make a distinction about the order of the bound variables. Therefore |
|
104 we would like to regard the first pair of type-schemes as $\alpha$-equivalent, |
|
105 but assuming that @{text x}, @{text y} and @{text z} are distinct variables, |
|
106 the second pair should \emph{not} be $\alpha$-equivalent: |
|
107 % |
|
108 \begin{equation}\label{ex1} |
|
109 @{text "\<forall>{x, y}. x \<rightarrow> y \<approx>\<^isub>\<alpha> \<forall>{y, x}. y \<rightarrow> x"}\hspace{10mm} |
|
110 @{text "\<forall>{x, y}. x \<rightarrow> y \<notapprox>\<^isub>\<alpha> \<forall>{z}. z \<rightarrow> z"} |
|
111 \end{equation} |
|
112 % |
|
113 \noindent |
|
114 Moreover, we like to regard type-schemes as $\alpha$-equivalent, if they differ |
|
115 only on \emph{vacuous} binders, such as |
|
116 % |
|
117 \begin{equation}\label{ex3} |
|
118 @{text "\<forall>{x}. x \<rightarrow> y \<approx>\<^isub>\<alpha> \<forall>{x, z}. x \<rightarrow> y"} |
|
119 \end{equation} |
|
120 % |
|
121 \noindent |
|
122 where @{text z} does not occur freely in the type. In this paper we will |
|
123 give a general binding mechanism and associated notion of $\alpha$-equivalence |
|
124 that can be used to faithfully represent this kind of binding in Nominal |
|
125 Isabelle. |
|
126 %The difficulty of finding the right notion for $\alpha$-equivalence |
|
127 %can be appreciated in this case by considering that the definition given by |
|
128 %Leroy in \cite{Leroy92} is incorrect (it omits a side-condition). |
|
129 |
|
130 However, the notion of $\alpha$-equivalence that is preserved by vacuous |
|
131 binders is not always wanted. For example in terms like |
|
132 % |
|
133 \begin{equation}\label{one} |
|
134 @{text "\<LET> x = 3 \<AND> y = 2 \<IN> x - y \<END>"} |
|
135 \end{equation} |
|
136 |
|
137 \noindent |
|
138 we might not care in which order the assignments @{text "x = 3"} and |
|
139 \mbox{@{text "y = 2"}} are given, but it would be often unusual to regard |
|
140 \eqref{one} as $\alpha$-equivalent with |
|
141 % |
|
142 \begin{center} |
|
143 @{text "\<LET> x = 3 \<AND> y = 2 \<AND> z = foo \<IN> x - y \<END>"} |
|
144 \end{center} |
|
145 % |
|
146 \noindent |
|
147 Therefore we will also provide a separate binding mechanism for cases in |
|
148 which the order of binders does not matter, but the ``cardinality'' of the |
|
149 binders has to agree. |
|
150 |
|
151 However, we found that this is still not sufficient for dealing with |
|
152 language constructs frequently occurring in programming language |
|
153 research. For example in @{text "\<LET>"}s containing patterns like |
|
154 % |
|
155 \begin{equation}\label{two} |
|
156 @{text "\<LET> (x, y) = (3, 2) \<IN> x - y \<END>"} |
|
157 \end{equation} |
|
158 % |
|
159 \noindent |
|
160 we want to bind all variables from the pattern inside the body of the |
|
161 $\mathtt{let}$, but we also care about the order of these variables, since |
|
162 we do not want to regard \eqref{two} as $\alpha$-equivalent with |
|
163 % |
|
164 \begin{center} |
|
165 @{text "\<LET> (y, x) = (3, 2) \<IN> x - y \<END>"} |
|
166 \end{center} |
|
167 % |
|
168 \noindent |
|
169 As a result, we provide three general binding mechanisms each of which binds |
|
170 multiple variables at once, and let the user chose which one is intended |
|
171 in a formalisation. |
|
172 %%when formalising a term-calculus. |
|
173 |
|
174 By providing these general binding mechanisms, however, we have to work |
|
175 around a problem that has been pointed out by Pottier \cite{Pottier06} and |
|
176 Cheney \cite{Cheney05}: in @{text "\<LET>"}-constructs of the form |
|
177 % |
|
178 \begin{center} |
|
179 @{text "\<LET> x\<^isub>1 = t\<^isub>1 \<AND> \<dots> \<AND> x\<^isub>n = t\<^isub>n \<IN> s \<END>"} |
|
180 \end{center} |
|
181 % |
|
182 \noindent |
|
183 we care about the |
|
184 information that there are as many bound variables @{text |
|
185 "x\<^isub>i"} as there are @{text "t\<^isub>i"}. We lose this information if |
|
186 we represent the @{text "\<LET>"}-constructor by something like |
|
187 % |
|
188 \begin{center} |
|
189 @{text "\<LET> (\<lambda>x\<^isub>1\<dots>x\<^isub>n . s) [t\<^isub>1,\<dots>,t\<^isub>n]"} |
|
190 \end{center} |
|
191 % |
|
192 \noindent |
|
193 where the notation @{text "\<lambda>_ . _"} indicates that the list of @{text |
|
194 "x\<^isub>i"} becomes bound in @{text s}. In this representation the term |
|
195 \mbox{@{text "\<LET> (\<lambda>x . s) [t\<^isub>1, t\<^isub>2]"}} is a perfectly legal |
|
196 instance, but the lengths of the two lists do not agree. To exclude such |
|
197 terms, additional predicates about well-formed terms are needed in order to |
|
198 ensure that the two lists are of equal length. This can result in very messy |
|
199 reasoning (see for example~\cite{BengtsonParow09}). To avoid this, we will |
|
200 allow type specifications for @{text "\<LET>"}s as follows |
|
201 % |
|
202 \begin{center} |
|
203 \begin{tabular}{r@ {\hspace{2mm}}r@ {\hspace{2mm}}cl} |
|
204 @{text trm} & @{text "::="} & @{text "\<dots>"} |
|
205 & @{text "|"} @{text "\<LET> as::assn s::trm"}\hspace{2mm} |
|
206 \isacommand{bind} @{text "bn(as)"} \isacommand{in} @{text "s"}\\%%%[1mm] |
|
207 @{text assn} & @{text "::="} & @{text "\<ANIL>"} |
|
208 & @{text "|"} @{text "\<ACONS> name trm assn"} |
|
209 \end{tabular} |
|
210 \end{center} |
|
211 % |
|
212 \noindent |
|
213 where @{text assn} is an auxiliary type representing a list of assignments |
|
214 and @{text bn} an auxiliary function identifying the variables to be bound |
|
215 by the @{text "\<LET>"}. This function can be defined by recursion over @{text |
|
216 assn} as follows |
|
217 % |
|
218 \begin{center} |
|
219 @{text "bn(\<ANIL>) ="} @{term "{}"} \hspace{5mm} |
|
220 @{text "bn(\<ACONS> x t as) = {x} \<union> bn(as)"} |
|
221 \end{center} |
|
222 % |
|
223 \noindent |
|
224 The scope of the binding is indicated by labels given to the types, for |
|
225 example @{text "s::trm"}, and a binding clause, in this case |
|
226 \isacommand{bind} @{text "bn(as)"} \isacommand{in} @{text "s"}. This binding |
|
227 clause states that all the names the function @{text |
|
228 "bn(as)"} returns should be bound in @{text s}. This style of specifying terms and bindings is heavily |
|
229 inspired by the syntax of the Ott-tool \cite{ott-jfp}. |
|
230 |
|
231 %Though, Ott |
|
232 %has only one binding mode, namely the one where the order of |
|
233 %binders matters. Consequently, type-schemes with binding sets |
|
234 %of names cannot be modelled in Ott. |
|
235 |
|
236 However, we will not be able to cope with all specifications that are |
|
237 allowed by Ott. One reason is that Ott lets the user specify ``empty'' |
|
238 types like @{text "t ::= t t | \<lambda>x. t"} |
|
239 where no clause for variables is given. Arguably, such specifications make |
|
240 some sense in the context of Coq's type theory (which Ott supports), but not |
|
241 at all in a HOL-based environment where every datatype must have a non-empty |
|
242 set-theoretic model. % \cite{Berghofer99}. |
|
243 |
|
244 Another reason is that we establish the reasoning infrastructure |
|
245 for $\alpha$-\emph{equated} terms. In contrast, Ott produces a reasoning |
|
246 infrastructure in Isabelle/HOL for |
|
247 \emph{non}-$\alpha$-equated, or ``raw'', terms. While our $\alpha$-equated terms |
|
248 and the raw terms produced by Ott use names for bound variables, |
|
249 there is a key difference: working with $\alpha$-equated terms means, for example, |
|
250 that the two type-schemes |
|
251 |
|
252 \begin{center} |
|
253 @{text "\<forall>{x}. x \<rightarrow> y = \<forall>{x, z}. x \<rightarrow> y"} |
|
254 \end{center} |
|
255 |
|
256 \noindent |
|
257 are not just $\alpha$-equal, but actually \emph{equal}! As a result, we can |
|
258 only support specifications that make sense on the level of $\alpha$-equated |
|
259 terms (offending specifications, which for example bind a variable according |
|
260 to a variable bound somewhere else, are not excluded by Ott, but we have |
|
261 to). |
|
262 |
|
263 %Our insistence on reasoning with $\alpha$-equated terms comes from the |
|
264 %wealth of experience we gained with the older version of Nominal Isabelle: |
|
265 %for non-trivial properties, reasoning with $\alpha$-equated terms is much |
|
266 %easier than reasoning with raw terms. The fundamental reason for this is |
|
267 %that the HOL-logic underlying Nominal Isabelle allows us to replace |
|
268 %``equals-by-equals''. In contrast, replacing |
|
269 %``$\alpha$-equals-by-$\alpha$-equals'' in a representation based on raw terms |
|
270 %requires a lot of extra reasoning work. |
|
271 |
|
272 Although in informal settings a reasoning infrastructure for $\alpha$-equated |
|
273 terms is nearly always taken for granted, establishing it automatically in |
|
274 Isabelle/HOL is a rather non-trivial task. For every |
|
275 specification we will need to construct type(s) containing as elements the |
|
276 $\alpha$-equated terms. To do so, we use the standard HOL-technique of defining |
|
277 a new type by identifying a non-empty subset of an existing type. The |
|
278 construction we perform in Isabelle/HOL can be illustrated by the following picture: |
|
279 % |
|
280 \begin{center} |
|
281 \begin{tikzpicture}[scale=0.89] |
|
282 %\draw[step=2mm] (-4,-1) grid (4,1); |
|
283 |
|
284 \draw[very thick] (0.7,0.4) circle (4.25mm); |
|
285 \draw[rounded corners=1mm, very thick] ( 0.0,-0.8) rectangle ( 1.8, 0.9); |
|
286 \draw[rounded corners=1mm, very thick] (-1.95,0.85) rectangle (-2.85,-0.05); |
|
287 |
|
288 \draw (-2.0, 0.845) -- (0.7,0.845); |
|
289 \draw (-2.0,-0.045) -- (0.7,-0.045); |
|
290 |
|
291 \draw ( 0.7, 0.4) node {\footnotesize\begin{tabular}{@ {}c@ {}}$\alpha$-\\[-1mm]clas.\end{tabular}}; |
|
292 \draw (-2.4, 0.4) node {\footnotesize\begin{tabular}{@ {}c@ {}}$\alpha$-eq.\\[-1mm]terms\end{tabular}}; |
|
293 \draw (1.8, 0.48) node[right=-0.1mm] |
|
294 {\footnotesize\begin{tabular}{@ {}l@ {}}existing\\[-1mm] type\\ (sets of raw terms)\end{tabular}}; |
|
295 \draw (0.9, -0.35) node {\footnotesize\begin{tabular}{@ {}l@ {}}non-empty\\[-1mm]subset\end{tabular}}; |
|
296 \draw (-3.25, 0.55) node {\footnotesize\begin{tabular}{@ {}l@ {}}new\\[-1mm]type\end{tabular}}; |
|
297 |
|
298 \draw[<->, very thick] (-1.8, 0.3) -- (-0.1,0.3); |
|
299 \draw (-0.95, 0.3) node[above=0mm] {\footnotesize{}isomorphism}; |
|
300 |
|
301 \end{tikzpicture} |
|
302 \end{center} |
|
303 % |
|
304 \noindent |
|
305 We take as the starting point a definition of raw terms (defined as a |
|
306 datatype in Isabelle/HOL); then identify the $\alpha$-equivalence classes in |
|
307 the type of sets of raw terms according to our $\alpha$-equivalence relation, |
|
308 and finally define the new type as these $\alpha$-equivalence classes |
|
309 (non-emptiness is satisfied whenever the raw terms are definable as datatype |
|
310 in Isabelle/HOL and our relation for $\alpha$-equivalence is |
|
311 an equivalence relation). |
|
312 |
|
313 %The fact that we obtain an isomorphism between the new type and the |
|
314 %non-empty subset shows that the new type is a faithful representation of |
|
315 %$\alpha$-equated terms. That is not the case for example for terms using the |
|
316 %locally nameless representation of binders \cite{McKinnaPollack99}: in this |
|
317 %representation there are ``junk'' terms that need to be excluded by |
|
318 %reasoning about a well-formedness predicate. |
|
319 |
|
320 The problem with introducing a new type in Isabelle/HOL is that in order to |
|
321 be useful, a reasoning infrastructure needs to be ``lifted'' from the |
|
322 underlying subset to the new type. This is usually a tricky and arduous |
|
323 task. To ease it, we re-implemented in Isabelle/HOL \cite{KaliszykUrban11} the quotient package |
|
324 described by Homeier \cite{Homeier05} for the HOL4 system. This package |
|
325 allows us to lift definitions and theorems involving raw terms to |
|
326 definitions and theorems involving $\alpha$-equated terms. For example if we |
|
327 define the free-variable function over raw lambda-terms |
|
328 |
|
329 \begin{center} |
|
330 @{text "fv(x) = {x}"}\hspace{8mm} |
|
331 @{text "fv(t\<^isub>1 t\<^isub>2) = fv(t\<^isub>1) \<union> fv(t\<^isub>2)"}\hspace{8mm} |
|
332 @{text "fv(\<lambda>x.t) = fv(t) - {x}"} |
|
333 \end{center} |
|
334 |
|
335 \noindent |
|
336 then with the help of the quotient package we can obtain a function @{text "fv\<^sup>\<alpha>"} |
|
337 operating on quotients, or $\alpha$-equivalence classes of lambda-terms. This |
|
338 lifted function is characterised by the equations |
|
339 |
|
340 \begin{center} |
|
341 @{text "fv\<^sup>\<alpha>(x) = {x}"}\hspace{8mm} |
|
342 @{text "fv\<^sup>\<alpha>(t\<^isub>1 t\<^isub>2) = fv\<^sup>\<alpha>(t\<^isub>1) \<union> fv\<^sup>\<alpha>(t\<^isub>2)"}\hspace{8mm} |
|
343 @{text "fv\<^sup>\<alpha>(\<lambda>x.t) = fv\<^sup>\<alpha>(t) - {x}"} |
|
344 \end{center} |
|
345 |
|
346 \noindent |
|
347 (Note that this means also the term-constructors for variables, applications |
|
348 and lambda are lifted to the quotient level.) This construction, of course, |
|
349 only works if $\alpha$-equivalence is indeed an equivalence relation, and the |
|
350 ``raw'' definitions and theorems are respectful w.r.t.~$\alpha$-equivalence. |
|
351 %For example, we will not be able to lift a bound-variable function. Although |
|
352 %this function can be defined for raw terms, it does not respect |
|
353 %$\alpha$-equivalence and therefore cannot be lifted. |
|
354 To sum up, every lifting |
|
355 of theorems to the quotient level needs proofs of some respectfulness |
|
356 properties (see \cite{Homeier05}). In the paper we show that we are able to |
|
357 automate these proofs and as a result can automatically establish a reasoning |
|
358 infrastructure for $\alpha$-equated terms.\smallskip |
|
359 |
|
360 %The examples we have in mind where our reasoning infrastructure will be |
|
361 %helpful includes the term language of Core-Haskell. This term language |
|
362 %involves patterns that have lists of type-, coercion- and term-variables, |
|
363 %all of which are bound in @{text "\<CASE>"}-expressions. In these |
|
364 %patterns we do not know in advance how many variables need to |
|
365 %be bound. Another example is the specification of SML, which includes |
|
366 %includes bindings as in type-schemes.\medskip |
|
367 |
|
368 \noindent |
|
369 {\bf Contributions:} We provide three new definitions for when terms |
|
370 involving general binders are $\alpha$-equivalent. These definitions are |
|
371 inspired by earlier work of Pitts \cite{Pitts04}. By means of automatic |
|
372 proofs, we establish a reasoning infrastructure for $\alpha$-equated |
|
373 terms, including properties about support, freshness and equality |
|
374 conditions for $\alpha$-equated terms. We are also able to derive strong |
|
375 induction principles that have the variable convention already built in. |
|
376 The method behind our specification of general binders is taken |
|
377 from the Ott-tool, but we introduce crucial restrictions, and also extensions, so |
|
378 that our specifications make sense for reasoning about $\alpha$-equated terms. |
|
379 The main improvement over Ott is that we introduce three binding modes |
|
380 (only one is present in Ott), provide formalised definitions for $\alpha$-equivalence and |
|
381 for free variables of our terms, and also derive a reasoning infrastructure |
|
382 for our specifications from ``first principles''. |
|
383 |
|
384 |
|
385 %\begin{figure} |
|
386 %\begin{boxedminipage}{\linewidth} |
|
387 %%\begin{center} |
|
388 %\begin{tabular}{r@ {\hspace{1mm}}r@ {\hspace{2mm}}l} |
|
389 %\multicolumn{3}{@ {}l}{Type Kinds}\\ |
|
390 %@{text "\<kappa>"} & @{text "::="} & @{text "\<star> | \<kappa>\<^isub>1 \<rightarrow> \<kappa>\<^isub>2"}\smallskip\\ |
|
391 %\multicolumn{3}{@ {}l}{Coercion Kinds}\\ |
|
392 %@{text "\<iota>"} & @{text "::="} & @{text "\<sigma>\<^isub>1 \<sim> \<sigma>\<^isub>2"}\smallskip\\ |
|
393 %\multicolumn{3}{@ {}l}{Types}\\ |
|
394 %@{text "\<sigma>"} & @{text "::="} & @{text "a | T | \<sigma>\<^isub>1 \<sigma>\<^isub>2 | S\<^isub>n"}$\;\overline{@{text "\<sigma>"}}$@{text "\<^sup>n"} |
|
395 %@{text "| \<forall>a:\<kappa>. \<sigma> | \<iota> \<Rightarrow> \<sigma>"}\smallskip\\ |
|
396 %\multicolumn{3}{@ {}l}{Coercion Types}\\ |
|
397 %@{text "\<gamma>"} & @{text "::="} & @{text "c | C | \<gamma>\<^isub>1 \<gamma>\<^isub>2 | S\<^isub>n"}$\;\overline{@{text "\<gamma>"}}$@{text "\<^sup>n"} |
|
398 %@{text "| \<forall>c:\<iota>. \<gamma> | \<iota> \<Rightarrow> \<gamma> "}\\ |
|
399 %& @{text "|"} & @{text "refl \<sigma> | sym \<gamma> | \<gamma>\<^isub>1 \<circ> \<gamma>\<^isub>2 | \<gamma> @ \<sigma> | left \<gamma> | right \<gamma>"}\\ |
|
400 %& @{text "|"} & @{text "\<gamma>\<^isub>1 \<sim> \<gamma>\<^isub>2 | rightc \<gamma> | leftc \<gamma> | \<gamma>\<^isub>1 \<triangleright> \<gamma>\<^isub>2"}\smallskip\\ |
|
401 %\multicolumn{3}{@ {}l}{Terms}\\ |
|
402 %@{text "e"} & @{text "::="} & @{text "x | K | \<Lambda>a:\<kappa>. e | \<Lambda>c:\<iota>. e | e \<sigma> | e \<gamma>"}\\ |
|
403 %& @{text "|"} & @{text "\<lambda>x:\<sigma>. e | e\<^isub>1 e\<^isub>2 | \<LET> x:\<sigma> = e\<^isub>1 \<IN> e\<^isub>2"}\\ |
|
404 %& @{text "|"} & @{text "\<CASE> e\<^isub>1 \<OF>"}$\;\overline{@{text "p \<rightarrow> e\<^isub>2"}}$ @{text "| e \<triangleright> \<gamma>"}\smallskip\\ |
|
405 %\multicolumn{3}{@ {}l}{Patterns}\\ |
|
406 %@{text "p"} & @{text "::="} & @{text "K"}$\;\overline{@{text "a:\<kappa>"}}\;\overline{@{text "c:\<iota>"}}\;\overline{@{text "x:\<sigma>"}}$\smallskip\\ |
|
407 %\multicolumn{3}{@ {}l}{Constants}\\ |
|
408 %& @{text C} & coercion constants\\ |
|
409 %& @{text T} & value type constructors\\ |
|
410 %& @{text "S\<^isub>n"} & n-ary type functions (which need to be fully applied)\\ |
|
411 %& @{text K} & data constructors\smallskip\\ |
|
412 %\multicolumn{3}{@ {}l}{Variables}\\ |
|
413 %& @{text a} & type variables\\ |
|
414 %& @{text c} & coercion variables\\ |
|
415 %& @{text x} & term variables\\ |
|
416 %\end{tabular} |
|
417 %\end{center} |
|
418 %\end{boxedminipage} |
|
419 %\caption{The System @{text "F\<^isub>C"} |
|
420 %\cite{CoreHaskell}, also often referred to as \emph{Core-Haskell}. In this |
|
421 %version of @{text "F\<^isub>C"} we made a modification by separating the |
|
422 %grammars for type kinds and coercion kinds, as well as for types and coercion |
|
423 %types. For this paper the interesting term-constructor is @{text "\<CASE>"}, |
|
424 %which binds multiple type-, coercion- and term-variables.\label{corehas}} |
|
425 %\end{figure} |
|
426 *} |
|
427 |
|
428 section {* A Short Review of the Nominal Logic Work *} |
|
429 |
|
430 text {* |
|
431 At its core, Nominal Isabelle is an adaption of the nominal logic work by |
|
432 Pitts \cite{Pitts03}. This adaptation for Isabelle/HOL is described in |
|
433 \cite{HuffmanUrban10} (including proofs). We shall briefly review this work |
|
434 to aid the description of what follows. |
|
435 |
|
436 Two central notions in the nominal logic work are sorted atoms and |
|
437 sort-respecting permutations of atoms. We will use the letters @{text "a, |
|
438 b, c, \<dots>"} to stand for atoms and @{text "p, q, \<dots>"} to stand for |
|
439 permutations. The purpose of atoms is to represent variables, be they bound or free. |
|
440 %The sorts of atoms can be used to represent different kinds of |
|
441 %variables, such as the term-, coercion- and type-variables in Core-Haskell. |
|
442 It is assumed that there is an infinite supply of atoms for each |
|
443 sort. In the interest of brevity, we shall restrict ourselves |
|
444 in what follows to only one sort of atoms. |
|
445 |
|
446 Permutations are bijective functions from atoms to atoms that are |
|
447 the identity everywhere except on a finite number of atoms. There is a |
|
448 two-place permutation operation written |
|
449 @{text "_ \<bullet> _ :: perm \<Rightarrow> \<beta> \<Rightarrow> \<beta>"} |
|
450 where the generic type @{text "\<beta>"} is the type of the object |
|
451 over which the permutation |
|
452 acts. In Nominal Isabelle, the identity permutation is written as @{term "0::perm"}, |
|
453 the composition of two permutations @{term p} and @{term q} as \mbox{@{term "p + q"}}, |
|
454 and the inverse permutation of @{term p} as @{text "- p"}. The permutation |
|
455 operation is defined over the type-hierarchy \cite{HuffmanUrban10}; |
|
456 for example permutations acting on products, lists, sets, functions and booleans are |
|
457 given by: |
|
458 % |
|
459 %\begin{equation}\label{permute} |
|
460 %\mbox{\begin{tabular}{@ {}c@ {\hspace{10mm}}c@ {}} |
|
461 %\begin{tabular}{@ {}l@ {}} |
|
462 %@{thm permute_prod.simps[no_vars, THEN eq_reflection]}\\[2mm] |
|
463 %@{thm permute_list.simps(1)[no_vars, THEN eq_reflection]}\\ |
|
464 %@{thm permute_list.simps(2)[no_vars, THEN eq_reflection]}\\ |
|
465 %\end{tabular} & |
|
466 %\begin{tabular}{@ {}l@ {}} |
|
467 %@{thm permute_set_eq[no_vars, THEN eq_reflection]}\\ |
|
468 %@{text "p \<bullet> f \<equiv> \<lambda>x. p \<bullet> (f (- p \<bullet> x))"}\\ |
|
469 %@{thm permute_bool_def[no_vars, THEN eq_reflection]} |
|
470 %\end{tabular} |
|
471 %\end{tabular}} |
|
472 %\end{equation} |
|
473 % |
|
474 \begin{center} |
|
475 \mbox{\begin{tabular}{@ {}c@ {\hspace{4mm}}c@ {\hspace{4mm}}c@ {}} |
|
476 \begin{tabular}{@ {}l@ {}} |
|
477 @{thm permute_prod.simps[no_vars, THEN eq_reflection]}\\ |
|
478 @{thm permute_bool_def[no_vars, THEN eq_reflection]} |
|
479 \end{tabular} & |
|
480 \begin{tabular}{@ {}l@ {}} |
|
481 @{thm permute_list.simps(1)[no_vars, THEN eq_reflection]}\\ |
|
482 @{thm permute_list.simps(2)[no_vars, THEN eq_reflection]}\\ |
|
483 \end{tabular} & |
|
484 \begin{tabular}{@ {}l@ {}} |
|
485 @{thm permute_set_eq[no_vars, THEN eq_reflection]}\\ |
|
486 @{text "p \<bullet> f \<equiv> \<lambda>x. p \<bullet> (f (- p \<bullet> x))"}\\ |
|
487 \end{tabular} |
|
488 \end{tabular}} |
|
489 \end{center} |
|
490 |
|
491 \noindent |
|
492 Concrete permutations in Nominal Isabelle are built up from swappings, |
|
493 written as \mbox{@{text "(a b)"}}, which are permutations that behave |
|
494 as follows: |
|
495 % |
|
496 \begin{center} |
|
497 @{text "(a b) = \<lambda>c. if a = c then b else if b = c then a else c"} |
|
498 \end{center} |
|
499 |
|
500 The most original aspect of the nominal logic work of Pitts is a general |
|
501 definition for the notion of the ``set of free variables of an object @{text |
|
502 "x"}''. This notion, written @{term "supp x"}, is general in the sense that |
|
503 it applies not only to lambda-terms ($\alpha$-equated or not), but also to lists, |
|
504 products, sets and even functions. The definition depends only on the |
|
505 permutation operation and on the notion of equality defined for the type of |
|
506 @{text x}, namely: |
|
507 % |
|
508 \begin{equation}\label{suppdef} |
|
509 @{thm supp_def[no_vars, THEN eq_reflection]} |
|
510 \end{equation} |
|
511 |
|
512 \noindent |
|
513 There is also the derived notion for when an atom @{text a} is \emph{fresh} |
|
514 for an @{text x}, defined as @{thm fresh_def[no_vars]}. |
|
515 We use for sets of atoms the abbreviation |
|
516 @{thm (lhs) fresh_star_def[no_vars]}, defined as |
|
517 @{thm (rhs) fresh_star_def[no_vars]}. |
|
518 A striking consequence of these definitions is that we can prove |
|
519 without knowing anything about the structure of @{term x} that |
|
520 swapping two fresh atoms, say @{text a} and @{text b}, leaves |
|
521 @{text x} unchanged, namely if @{text "a \<FRESH> x"} and @{text "b \<FRESH> x"} |
|
522 then @{term "(a \<rightleftharpoons> b) \<bullet> x = x"}. |
|
523 % |
|
524 %\begin{myproperty}\label{swapfreshfresh} |
|
525 %@{thm[mode=IfThen] swap_fresh_fresh[no_vars]} |
|
526 %\end{myproperty} |
|
527 % |
|
528 %While often the support of an object can be relatively easily |
|
529 %described, for example for atoms, products, lists, function applications, |
|
530 %booleans and permutations as follows |
|
531 %% |
|
532 %\begin{center} |
|
533 %\begin{tabular}{c@ {\hspace{10mm}}c} |
|
534 %\begin{tabular}{rcl} |
|
535 %@{term "supp a"} & $=$ & @{term "{a}"}\\ |
|
536 %@{term "supp (x, y)"} & $=$ & @{term "supp x \<union> supp y"}\\ |
|
537 %@{term "supp []"} & $=$ & @{term "{}"}\\ |
|
538 %@{term "supp (x#xs)"} & $=$ & @{term "supp x \<union> supp xs"}\\ |
|
539 %\end{tabular} |
|
540 %& |
|
541 %\begin{tabular}{rcl} |
|
542 %@{text "supp (f x)"} & @{text "\<subseteq>"} & @{term "supp f \<union> supp x"}\\ |
|
543 %@{term "supp b"} & $=$ & @{term "{}"}\\ |
|
544 %@{term "supp p"} & $=$ & @{term "{a. p \<bullet> a \<noteq> a}"} |
|
545 %\end{tabular} |
|
546 %\end{tabular} |
|
547 %\end{center} |
|
548 % |
|
549 %\noindent |
|
550 %in some cases it can be difficult to characterise the support precisely, and |
|
551 %only an approximation can be established (as for functions above). |
|
552 % |
|
553 %Reasoning about |
|
554 %such approximations can be simplified with the notion \emph{supports}, defined |
|
555 %as follows: |
|
556 % |
|
557 %\begin{definition} |
|
558 %A set @{text S} \emph{supports} @{text x} if for all atoms @{text a} and @{text b} |
|
559 %not in @{text S} we have @{term "(a \<rightleftharpoons> b) \<bullet> x = x"}. |
|
560 %\end{definition} |
|
561 % |
|
562 %\noindent |
|
563 %The main point of @{text supports} is that we can establish the following |
|
564 %two properties. |
|
565 % |
|
566 %\begin{myproperty}\label{supportsprop} |
|
567 %Given a set @{text "as"} of atoms. |
|
568 %{\it (i)} @{thm[mode=IfThen] supp_is_subset[where S="as", no_vars]} |
|
569 %{\it (ii)} @{thm supp_supports[no_vars]}. |
|
570 %\end{myproperty} |
|
571 % |
|
572 %Another important notion in the nominal logic work is \emph{equivariance}. |
|
573 %For a function @{text f}, say of type @{text "\<alpha> \<Rightarrow> \<beta>"}, to be equivariant |
|
574 %it is required that every permutation leaves @{text f} unchanged, that is |
|
575 %% |
|
576 %\begin{equation}\label{equivariancedef} |
|
577 %@{term "\<forall>p. p \<bullet> f = f"} |
|
578 %\end{equation} |
|
579 % |
|
580 %\noindent or equivalently that a permutation applied to the application |
|
581 %@{text "f x"} can be moved to the argument @{text x}. That means for equivariant |
|
582 %functions @{text f}, we have for all permutations @{text p}: |
|
583 %% |
|
584 %\begin{equation}\label{equivariance} |
|
585 %@{text "p \<bullet> f = f"} \;\;\;\textit{if and only if}\;\;\; |
|
586 %@{text "p \<bullet> (f x) = f (p \<bullet> x)"} |
|
587 %\end{equation} |
|
588 % |
|
589 %\noindent |
|
590 %From property \eqref{equivariancedef} and the definition of @{text supp}, we |
|
591 %can easily deduce that equivariant functions have empty support. There is |
|
592 %also a similar notion for equivariant relations, say @{text R}, namely the property |
|
593 %that |
|
594 %% |
|
595 %\begin{center} |
|
596 %@{text "x R y"} \;\;\textit{implies}\;\; @{text "(p \<bullet> x) R (p \<bullet> y)"} |
|
597 %\end{center} |
|
598 % |
|
599 %Using freshness, the nominal logic work provides us with general means for renaming |
|
600 %binders. |
|
601 % |
|
602 %\noindent |
|
603 While in the older version of Nominal Isabelle, we used extensively |
|
604 %Property~\ref{swapfreshfresh} |
|
605 this property to rename single binders, it %%this property |
|
606 proved too unwieldy for dealing with multiple binders. For such binders the |
|
607 following generalisations turned out to be easier to use. |
|
608 |
|
609 \begin{myproperty}\label{supppermeq} |
|
610 @{thm[mode=IfThen] supp_perm_eq[no_vars]} |
|
611 \end{myproperty} |
|
612 |
|
613 \begin{myproperty}\label{avoiding} |
|
614 For a finite set @{text as} and a finitely supported @{text x} with |
|
615 @{term "as \<sharp>* x"} and also a finitely supported @{text c}, there |
|
616 exists a permutation @{text p} such that @{term "(p \<bullet> as) \<sharp>* c"} and |
|
617 @{term "supp x \<sharp>* p"}. |
|
618 \end{myproperty} |
|
619 |
|
620 \noindent |
|
621 The idea behind the second property is that given a finite set @{text as} |
|
622 of binders (being bound, or fresh, in @{text x} is ensured by the |
|
623 assumption @{term "as \<sharp>* x"}), then there exists a permutation @{text p} such that |
|
624 the renamed binders @{term "p \<bullet> as"} avoid @{text c} (which can be arbitrarily chosen |
|
625 as long as it is finitely supported) and also @{text "p"} does not affect anything |
|
626 in the support of @{text x} (that is @{term "supp x \<sharp>* p"}). The last |
|
627 fact and Property~\ref{supppermeq} allow us to ``rename'' just the binders |
|
628 @{text as} in @{text x}, because @{term "p \<bullet> x = x"}. |
|
629 |
|
630 Most properties given in this section are described in detail in \cite{HuffmanUrban10} |
|
631 and all are formalised in Isabelle/HOL. In the next sections we will make |
|
632 extensive use of these properties in order to define $\alpha$-equivalence in |
|
633 the presence of multiple binders. |
|
634 *} |
|
635 |
|
636 |
|
637 section {* General Bindings\label{sec:binders} *} |
|
638 |
|
639 text {* |
|
640 In Nominal Isabelle, the user is expected to write down a specification of a |
|
641 term-calculus and then a reasoning infrastructure is automatically derived |
|
642 from this specification (remember that Nominal Isabelle is a definitional |
|
643 extension of Isabelle/HOL, which does not introduce any new axioms). |
|
644 |
|
645 In order to keep our work with deriving the reasoning infrastructure |
|
646 manageable, we will wherever possible state definitions and perform proofs |
|
647 on the ``user-level'' of Isabelle/HOL, as opposed to write custom ML-code. % that |
|
648 %generates them anew for each specification. |
|
649 To that end, we will consider |
|
650 first pairs @{text "(as, x)"} of type @{text "(atom set) \<times> \<beta>"}. These pairs |
|
651 are intended to represent the abstraction, or binding, of the set of atoms @{text |
|
652 "as"} in the body @{text "x"}. |
|
653 |
|
654 The first question we have to answer is when two pairs @{text "(as, x)"} and |
|
655 @{text "(bs, y)"} are $\alpha$-equivalent? (For the moment we are interested in |
|
656 the notion of $\alpha$-equivalence that is \emph{not} preserved by adding |
|
657 vacuous binders.) To answer this question, we identify four conditions: {\it (i)} |
|
658 given a free-atom function @{text "fa"} of type \mbox{@{text "\<beta> \<Rightarrow> atom |
|
659 set"}}, then @{text x} and @{text y} need to have the same set of free |
|
660 atoms; moreover there must be a permutation @{text p} such that {\it |
|
661 (ii)} @{text p} leaves the free atoms of @{text x} and @{text y} unchanged, but |
|
662 {\it (iii)} ``moves'' their bound names so that we obtain modulo a relation, |
|
663 say \mbox{@{text "_ R _"}}, two equivalent terms. We also require that {\it (iv)} |
|
664 @{text p} makes the sets of abstracted atoms @{text as} and @{text bs} equal. The |
|
665 requirements {\it (i)} to {\it (iv)} can be stated formally as the conjunction of: |
|
666 % |
|
667 \begin{equation}\label{alphaset} |
|
668 \begin{array}{@ {\hspace{10mm}}l@ {\hspace{5mm}}l@ {\hspace{10mm}}l@ {\hspace{5mm}}l} |
|
669 \multicolumn{4}{l}{@{term "(as, x) \<approx>set R fa p (bs, y)"}\hspace{2mm}@{text "\<equiv>"}}\\[1mm] |
|
670 \mbox{\it (i)} & @{term "fa(x) - as = fa(y) - bs"} & |
|
671 \mbox{\it (iii)} & @{text "(p \<bullet> x) R y"} \\ |
|
672 \mbox{\it (ii)} & @{term "(fa(x) - as) \<sharp>* p"} & |
|
673 \mbox{\it (iv)} & @{term "(p \<bullet> as) = bs"} \\ |
|
674 \end{array} |
|
675 \end{equation} |
|
676 % |
|
677 \noindent |
|
678 Note that this relation depends on the permutation @{text |
|
679 "p"}; $\alpha$-equivalence between two pairs is then the relation where we |
|
680 existentially quantify over this @{text "p"}. Also note that the relation is |
|
681 dependent on a free-atom function @{text "fa"} and a relation @{text |
|
682 "R"}. The reason for this extra generality is that we will use |
|
683 $\approx_{\,\textit{set}}$ for both ``raw'' terms and $\alpha$-equated terms. In |
|
684 the latter case, @{text R} will be replaced by equality @{text "="} and we |
|
685 will prove that @{text "fa"} is equal to @{text "supp"}. |
|
686 |
|
687 The definition in \eqref{alphaset} does not make any distinction between the |
|
688 order of abstracted atoms. If we want this, then we can define $\alpha$-equivalence |
|
689 for pairs of the form \mbox{@{text "(as, x)"}} with type @{text "(atom list) \<times> \<beta>"} |
|
690 as follows |
|
691 % |
|
692 \begin{equation}\label{alphalist} |
|
693 \begin{array}{@ {\hspace{10mm}}l@ {\hspace{5mm}}l@ {\hspace{10mm}}l@ {\hspace{5mm}}l} |
|
694 \multicolumn{4}{l}{@{term "(as, x) \<approx>lst R fa p (bs, y)"}\hspace{2mm}@{text "\<equiv>"}}\\[1mm] |
|
695 \mbox{\it (i)} & @{term "fa(x) - (set as) = fa(y) - (set bs)"} & |
|
696 \mbox{\it (iii)} & @{text "(p \<bullet> x) R y"}\\ |
|
697 \mbox{\it (ii)} & @{term "(fa(x) - set as) \<sharp>* p"} & |
|
698 \mbox{\it (iv)} & @{term "(p \<bullet> as) = bs"}\\ |
|
699 \end{array} |
|
700 \end{equation} |
|
701 % |
|
702 \noindent |
|
703 where @{term set} is the function that coerces a list of atoms into a set of atoms. |
|
704 Now the last clause ensures that the order of the binders matters (since @{text as} |
|
705 and @{text bs} are lists of atoms). |
|
706 |
|
707 If we do not want to make any difference between the order of binders \emph{and} |
|
708 also allow vacuous binders, that means \emph{restrict} names, then we keep sets of binders, but drop |
|
709 condition {\it (iv)} in \eqref{alphaset}: |
|
710 % |
|
711 \begin{equation}\label{alphares} |
|
712 \begin{array}{@ {\hspace{10mm}}l@ {\hspace{5mm}}l@ {\hspace{10mm}}l@ {\hspace{5mm}}l} |
|
713 \multicolumn{2}{l}{@{term "(as, x) \<approx>res R fa p (bs, y)"}\hspace{2mm}@{text "\<equiv>"}}\\[1mm] |
|
714 \mbox{\it (i)} & @{term "fa(x) - as = fa(y) - bs"} & |
|
715 \mbox{\it (iii)} & @{text "(p \<bullet> x) R y"}\\ |
|
716 \mbox{\it (ii)} & @{term "(fa(x) - as) \<sharp>* p"}\\ |
|
717 \end{array} |
|
718 \end{equation} |
|
719 |
|
720 It might be useful to consider first some examples how these definitions |
|
721 of $\alpha$-equivalence pan out in practice. For this consider the case of |
|
722 abstracting a set of atoms over types (as in type-schemes). We set |
|
723 @{text R} to be the usual equality @{text "="} and for @{text "fa(T)"} we |
|
724 define |
|
725 % |
|
726 \begin{center} |
|
727 @{text "fa(x) = {x}"} \hspace{5mm} @{text "fa(T\<^isub>1 \<rightarrow> T\<^isub>2) = fa(T\<^isub>1) \<union> fa(T\<^isub>2)"} |
|
728 \end{center} |
|
729 |
|
730 \noindent |
|
731 Now recall the examples shown in \eqref{ex1} and |
|
732 \eqref{ex3}. It can be easily checked that @{text "({x, y}, x \<rightarrow> y)"} and |
|
733 @{text "({y, x}, y \<rightarrow> x)"} are $\alpha$-equivalent according to |
|
734 $\approx_{\,\textit{set}}$ and $\approx_{\,\textit{set+}}$ by taking @{text p} to |
|
735 be the swapping @{term "(x \<rightleftharpoons> y)"}. In case of @{text "x \<noteq> y"}, then @{text |
|
736 "([x, y], x \<rightarrow> y)"} $\not\approx_{\,\textit{list}}$ @{text "([y, x], x \<rightarrow> y)"} |
|
737 since there is no permutation that makes the lists @{text "[x, y]"} and |
|
738 @{text "[y, x]"} equal, and also leaves the type \mbox{@{text "x \<rightarrow> y"}} |
|
739 unchanged. Another example is @{text "({x}, x)"} $\approx_{\,\textit{set+}}$ |
|
740 @{text "({x, y}, x)"} which holds by taking @{text p} to be the identity |
|
741 permutation. However, if @{text "x \<noteq> y"}, then @{text "({x}, x)"} |
|
742 $\not\approx_{\,\textit{set}}$ @{text "({x, y}, x)"} since there is no |
|
743 permutation that makes the sets @{text "{x}"} and @{text "{x, y}"} equal |
|
744 (similarly for $\approx_{\,\textit{list}}$). It can also relatively easily be |
|
745 shown that all three notions of $\alpha$-equivalence coincide, if we only |
|
746 abstract a single atom. |
|
747 |
|
748 In the rest of this section we are going to introduce three abstraction |
|
749 types. For this we define |
|
750 % |
|
751 \begin{equation} |
|
752 @{term "abs_set (as, x) (bs, x) \<equiv> \<exists>p. alpha_set (as, x) equal supp p (bs, x)"} |
|
753 \end{equation} |
|
754 |
|
755 \noindent |
|
756 (similarly for $\approx_{\,\textit{abs\_set+}}$ |
|
757 and $\approx_{\,\textit{abs\_list}}$). We can show that these relations are equivalence |
|
758 relations. %% and equivariant. |
|
759 |
|
760 \begin{lemma}\label{alphaeq} |
|
761 The relations $\approx_{\,\textit{abs\_set}}$, $\approx_{\,\textit{abs\_list}}$ |
|
762 and $\approx_{\,\textit{abs\_set+}}$ are equivalence relations. %, and if |
|
763 %@{term "abs_set (as, x) (bs, y)"} then also |
|
764 %@{term "abs_set (p \<bullet> as, p \<bullet> x) (p \<bullet> bs, p \<bullet> y)"} (similarly for the other two relations). |
|
765 \end{lemma} |
|
766 |
|
767 \begin{proof} |
|
768 Reflexivity is by taking @{text "p"} to be @{text "0"}. For symmetry we have |
|
769 a permutation @{text p} and for the proof obligation take @{term "-p"}. In case |
|
770 of transitivity, we have two permutations @{text p} and @{text q}, and for the |
|
771 proof obligation use @{text "q + p"}. All conditions are then by simple |
|
772 calculations. |
|
773 \end{proof} |
|
774 |
|
775 \noindent |
|
776 This lemma allows us to use our quotient package for introducing |
|
777 new types @{text "\<beta> abs_set"}, @{text "\<beta> abs_set+"} and @{text "\<beta> abs_list"} |
|
778 representing $\alpha$-equivalence classes of pairs of type |
|
779 @{text "(atom set) \<times> \<beta>"} (in the first two cases) and of type @{text "(atom list) \<times> \<beta>"} |
|
780 (in the third case). |
|
781 The elements in these types will be, respectively, written as |
|
782 % |
|
783 %\begin{center} |
|
784 @{term "Abs_set as x"}, %\hspace{5mm} |
|
785 @{term "Abs_res as x"} and %\hspace{5mm} |
|
786 @{term "Abs_lst as x"}, |
|
787 %\end{center} |
|
788 % |
|
789 %\noindent |
|
790 indicating that a set (or list) of atoms @{text as} is abstracted in @{text x}. We will |
|
791 call the types \emph{abstraction types} and their elements |
|
792 \emph{abstractions}. The important property we need to derive is the support of |
|
793 abstractions, namely: |
|
794 |
|
795 \begin{theorem}[Support of Abstractions]\label{suppabs} |
|
796 Assuming @{text x} has finite support, then |
|
797 |
|
798 \begin{center} |
|
799 \begin{tabular}{l} |
|
800 @{thm (lhs) supp_Abs(1)[no_vars]} $\;\;=\;\;$ |
|
801 @{thm (lhs) supp_Abs(2)[no_vars]} $\;\;=\;\;$ @{thm (rhs) supp_Abs(2)[no_vars]}, and\\ |
|
802 @{thm (lhs) supp_Abs(3)[where bs="bs", no_vars]} $\;\;=\;\;$ |
|
803 @{thm (rhs) supp_Abs(3)[where bs="bs", no_vars]} |
|
804 \end{tabular} |
|
805 \end{center} |
|
806 \end{theorem} |
|
807 |
|
808 \noindent |
|
809 This theorem states that the bound names do not appear in the support. |
|
810 For brevity we omit the proof and again refer the reader to |
|
811 our formalisation in Isabelle/HOL. |
|
812 |
|
813 %\noindent |
|
814 %Below we will show the first equation. The others |
|
815 %follow by similar arguments. By definition of the abstraction type @{text "abs_set"} |
|
816 %we have |
|
817 %% |
|
818 %\begin{equation}\label{abseqiff} |
|
819 %@{thm (lhs) Abs_eq_iff(1)[where bs="as" and bs'="bs", no_vars]} \;\;\text{if and only if}\;\; |
|
820 %@{thm (rhs) Abs_eq_iff(1)[where bs="as" and bs'="bs", no_vars]} |
|
821 %\end{equation} |
|
822 % |
|
823 %\noindent |
|
824 %and also |
|
825 % |
|
826 %\begin{equation}\label{absperm} |
|
827 %%@%{%thm %permute_Abs[no_vars]}% |
|
828 %\end{equation} |
|
829 |
|
830 %\noindent |
|
831 %The second fact derives from the definition of permutations acting on pairs |
|
832 %\eqref{permute} and $\alpha$-equivalence being equivariant |
|
833 %(see Lemma~\ref{alphaeq}). With these two facts at our disposal, we can show |
|
834 %the following lemma about swapping two atoms in an abstraction. |
|
835 % |
|
836 %\begin{lemma} |
|
837 %@{thm[mode=IfThen] Abs_swap1(1)[where bs="as", no_vars]} |
|
838 %\end{lemma} |
|
839 % |
|
840 %\begin{proof} |
|
841 %This lemma is straightforward using \eqref{abseqiff} and observing that |
|
842 %the assumptions give us @{term "(a \<rightleftharpoons> b) \<bullet> (supp x - as) = (supp x - as)"}. |
|
843 %Moreover @{text supp} and set difference are equivariant (see \cite{HuffmanUrban10}). |
|
844 %\end{proof} |
|
845 % |
|
846 %\noindent |
|
847 %Assuming that @{text "x"} has finite support, this lemma together |
|
848 %with \eqref{absperm} allows us to show |
|
849 % |
|
850 %\begin{equation}\label{halfone} |
|
851 %@{thm Abs_supports(1)[no_vars]} |
|
852 %\end{equation} |
|
853 % |
|
854 %\noindent |
|
855 %which by Property~\ref{supportsprop} gives us ``one half'' of |
|
856 %Theorem~\ref{suppabs}. The ``other half'' is a bit more involved. To establish |
|
857 %it, we use a trick from \cite{Pitts04} and first define an auxiliary |
|
858 %function @{text aux}, taking an abstraction as argument: |
|
859 %@{thm supp_set.simps[THEN eq_reflection, no_vars]}. |
|
860 % |
|
861 %Using the second equation in \eqref{equivariance}, we can show that |
|
862 %@{text "aux"} is equivariant (since @{term "p \<bullet> (supp x - as) = (supp (p \<bullet> x)) - (p \<bullet> as)"}) |
|
863 %and therefore has empty support. |
|
864 %This in turn means |
|
865 % |
|
866 %\begin{center} |
|
867 %@{term "supp (supp_gen (Abs_set as x)) \<subseteq> supp (Abs_set as x)"} |
|
868 %\end{center} |
|
869 % |
|
870 %\noindent |
|
871 %using \eqref{suppfun}. Assuming @{term "supp x - as"} is a finite set, |
|
872 %we further obtain |
|
873 % |
|
874 %\begin{equation}\label{halftwo} |
|
875 %@{thm (concl) Abs_supp_subset1(1)[no_vars]} |
|
876 %\end{equation} |
|
877 % |
|
878 %\noindent |
|
879 %since for finite sets of atoms, @{text "bs"}, we have |
|
880 %@{thm (concl) supp_finite_atom_set[where S="bs", no_vars]}. |
|
881 %Finally, taking \eqref{halfone} and \eqref{halftwo} together establishes |
|
882 %Theorem~\ref{suppabs}. |
|
883 |
|
884 The method of first considering abstractions of the |
|
885 form @{term "Abs_set as x"} etc is motivated by the fact that |
|
886 we can conveniently establish at the Isabelle/HOL level |
|
887 properties about them. It would be |
|
888 laborious to write custom ML-code that derives automatically such properties |
|
889 for every term-constructor that binds some atoms. Also the generality of |
|
890 the definitions for $\alpha$-equivalence will help us in the next sections. |
|
891 *} |
|
892 |
|
893 section {* Specifying General Bindings\label{sec:spec} *} |
|
894 |
|
895 text {* |
|
896 Our choice of syntax for specifications is influenced by the existing |
|
897 datatype package of Isabelle/HOL %\cite{Berghofer99} |
|
898 and by the syntax of the |
|
899 Ott-tool \cite{ott-jfp}. For us a specification of a term-calculus is a |
|
900 collection of (possibly mutual recursive) type declarations, say @{text |
|
901 "ty\<AL>\<^isub>1, \<dots>, ty\<AL>\<^isub>n"}, and an associated collection of |
|
902 binding functions, say @{text "bn\<AL>\<^isub>1, \<dots>, bn\<AL>\<^isub>m"}. The |
|
903 syntax in Nominal Isabelle for such specifications is roughly as follows: |
|
904 % |
|
905 \begin{equation}\label{scheme} |
|
906 \mbox{\begin{tabular}{@ {}p{2.5cm}l} |
|
907 type \mbox{declaration part} & |
|
908 $\begin{cases} |
|
909 \mbox{\small\begin{tabular}{l} |
|
910 \isacommand{nominal\_datatype} @{text "ty\<AL>\<^isub>1 = \<dots>"}\\ |
|
911 \isacommand{and} @{text "ty\<AL>\<^isub>2 = \<dots>"}\\ |
|
912 \raisebox{2mm}{$\ldots$}\\[-2mm] |
|
913 \isacommand{and} @{text "ty\<AL>\<^isub>n = \<dots>"}\\ |
|
914 \end{tabular}} |
|
915 \end{cases}$\\ |
|
916 binding \mbox{function part} & |
|
917 $\begin{cases} |
|
918 \mbox{\small\begin{tabular}{l} |
|
919 \isacommand{binder} @{text "bn\<AL>\<^isub>1"} \isacommand{and} \ldots \isacommand{and} @{text "bn\<AL>\<^isub>m"}\\ |
|
920 \isacommand{where}\\ |
|
921 \raisebox{2mm}{$\ldots$}\\[-2mm] |
|
922 \end{tabular}} |
|
923 \end{cases}$\\ |
|
924 \end{tabular}} |
|
925 \end{equation} |
|
926 |
|
927 \noindent |
|
928 Every type declaration @{text ty}$^\alpha_{1..n}$ consists of a collection of |
|
929 term-constructors, each of which comes with a list of labelled |
|
930 types that stand for the types of the arguments of the term-constructor. |
|
931 For example a term-constructor @{text "C\<^sup>\<alpha>"} might be specified with |
|
932 |
|
933 \begin{center} |
|
934 @{text "C\<^sup>\<alpha> label\<^isub>1::ty"}$'_1$ @{text "\<dots> label\<^isub>l::ty"}$'_l\;\;$ @{text "binding_clauses"} |
|
935 \end{center} |
|
936 |
|
937 \noindent |
|
938 whereby some of the @{text ty}$'_{1..l}$ %%(or their components) |
|
939 can be contained |
|
940 in the collection of @{text ty}$^\alpha_{1..n}$ declared in |
|
941 \eqref{scheme}. |
|
942 In this case we will call the corresponding argument a |
|
943 \emph{recursive argument} of @{text "C\<^sup>\<alpha>"}. |
|
944 %The types of such recursive |
|
945 %arguments need to satisfy a ``positivity'' |
|
946 %restriction, which ensures that the type has a set-theoretic semantics |
|
947 %\cite{Berghofer99}. |
|
948 The labels |
|
949 annotated on the types are optional. Their purpose is to be used in the |
|
950 (possibly empty) list of \emph{binding clauses}, which indicate the binders |
|
951 and their scope in a term-constructor. They come in three \emph{modes}: |
|
952 % |
|
953 \begin{center} |
|
954 \begin{tabular}{@ {}l@ {}} |
|
955 \isacommand{bind} {\it binders} \isacommand{in} {\it bodies}\;\;\;\, |
|
956 \isacommand{bind (set)} {\it binders} \isacommand{in} {\it bodies}\;\;\;\, |
|
957 \isacommand{bind (set+)} {\it binders} \isacommand{in} {\it bodies} |
|
958 \end{tabular} |
|
959 \end{center} |
|
960 % |
|
961 \noindent |
|
962 The first mode is for binding lists of atoms (the order of binders matters); |
|
963 the second is for sets of binders (the order does not matter, but the |
|
964 cardinality does) and the last is for sets of binders (with vacuous binders |
|
965 preserving $\alpha$-equivalence). As indicated, the labels in the ``\isacommand{in}-part'' of a binding |
|
966 clause will be called \emph{bodies}; the |
|
967 ``\isacommand{bind}-part'' will be called \emph{binders}. In contrast to |
|
968 Ott, we allow multiple labels in binders and bodies. |
|
969 |
|
970 %For example we allow |
|
971 %binding clauses of the form: |
|
972 % |
|
973 %\begin{center} |
|
974 %\begin{tabular}{@ {}ll@ {}} |
|
975 %@{text "Foo\<^isub>1 x::name y::name t::trm s::trm"} & |
|
976 % \isacommand{bind} @{text "x y"} \isacommand{in} @{text "t s"}\\ |
|
977 %@{text "Foo\<^isub>2 x::name y::name t::trm s::trm"} & |
|
978 % \isacommand{bind} @{text "x y"} \isacommand{in} @{text "t"}, |
|
979 % \isacommand{bind} @{text "x y"} \isacommand{in} @{text "s"}\\ |
|
980 %\end{tabular} |
|
981 %\end{center} |
|
982 |
|
983 \noindent |
|
984 %Similarly for the other binding modes. |
|
985 %Interestingly, in case of \isacommand{bind (set)} |
|
986 %and \isacommand{bind (set+)} the binding clauses above will make a difference to the semantics |
|
987 %of the specifications (the corresponding $\alpha$-equivalence will differ). We will |
|
988 %show this later with an example. |
|
989 |
|
990 There are also some restrictions we need to impose on our binding clauses in comparison to |
|
991 the ones of Ott. The |
|
992 main idea behind these restrictions is that we obtain a sensible notion of |
|
993 $\alpha$-equivalence where it is ensured that within a given scope an |
|
994 atom occurrence cannot be both bound and free at the same time. The first |
|
995 restriction is that a body can only occur in |
|
996 \emph{one} binding clause of a term constructor (this ensures that the bound |
|
997 atoms of a body cannot be free at the same time by specifying an |
|
998 alternative binder for the same body). |
|
999 |
|
1000 For binders we distinguish between |
|
1001 \emph{shallow} and \emph{deep} binders. Shallow binders are just |
|
1002 labels. The restriction we need to impose on them is that in case of |
|
1003 \isacommand{bind (set)} and \isacommand{bind (set+)} the labels must either |
|
1004 refer to atom types or to sets of atom types; in case of \isacommand{bind} |
|
1005 the labels must refer to atom types or lists of atom types. Two examples for |
|
1006 the use of shallow binders are the specification of lambda-terms, where a |
|
1007 single name is bound, and type-schemes, where a finite set of names is |
|
1008 bound: |
|
1009 |
|
1010 \begin{center}\small |
|
1011 \begin{tabular}{@ {}c@ {\hspace{7mm}}c@ {}} |
|
1012 \begin{tabular}{@ {}l} |
|
1013 \isacommand{nominal\_datatype} @{text lam} $=$\\ |
|
1014 \hspace{2mm}\phantom{$\mid$}~@{text "Var name"}\\ |
|
1015 \hspace{2mm}$\mid$~@{text "App lam lam"}\\ |
|
1016 \hspace{2mm}$\mid$~@{text "Lam x::name t::lam"}~~\isacommand{bind} @{text x} \isacommand{in} @{text t}\\ |
|
1017 \end{tabular} & |
|
1018 \begin{tabular}{@ {}l@ {}} |
|
1019 \isacommand{nominal\_datatype}~@{text ty} $=$\\ |
|
1020 \hspace{5mm}\phantom{$\mid$}~@{text "TVar name"}\\ |
|
1021 \hspace{5mm}$\mid$~@{text "TFun ty ty"}\\ |
|
1022 \isacommand{and}~@{text "tsc = All xs::(name fset) T::ty"}~~% |
|
1023 \isacommand{bind (set+)} @{text xs} \isacommand{in} @{text T}\\ |
|
1024 \end{tabular} |
|
1025 \end{tabular} |
|
1026 \end{center} |
|
1027 |
|
1028 \noindent |
|
1029 In these specifications @{text "name"} refers to an atom type, and @{text |
|
1030 "fset"} to the type of finite sets. |
|
1031 Note that for @{text lam} it does not matter which binding mode we use. The |
|
1032 reason is that we bind only a single @{text name}. However, having |
|
1033 \isacommand{bind (set)} or \isacommand{bind} in the second case makes a |
|
1034 difference to the semantics of the specification (which we will define in the next section). |
|
1035 |
|
1036 |
|
1037 A \emph{deep} binder uses an auxiliary binding function that ``picks'' out |
|
1038 the atoms in one argument of the term-constructor, which can be bound in |
|
1039 other arguments and also in the same argument (we will call such binders |
|
1040 \emph{recursive}, see below). The binding functions are |
|
1041 expected to return either a set of atoms (for \isacommand{bind (set)} and |
|
1042 \isacommand{bind (set+)}) or a list of atoms (for \isacommand{bind}). They can |
|
1043 be defined by recursion over the corresponding type; the equations |
|
1044 must be given in the binding function part of the scheme shown in |
|
1045 \eqref{scheme}. For example a term-calculus containing @{text "Let"}s with |
|
1046 tuple patterns might be specified as: |
|
1047 % |
|
1048 \begin{equation}\label{letpat} |
|
1049 \mbox{\small% |
|
1050 \begin{tabular}{l} |
|
1051 \isacommand{nominal\_datatype} @{text trm} $=$\\ |
|
1052 \hspace{5mm}\phantom{$\mid$}~@{term "Var name"}\\ |
|
1053 \hspace{5mm}$\mid$~@{term "App trm trm"}\\ |
|
1054 \hspace{5mm}$\mid$~@{text "Lam x::name t::trm"} |
|
1055 \;\;\isacommand{bind} @{text x} \isacommand{in} @{text t}\\ |
|
1056 \hspace{5mm}$\mid$~@{text "Let p::pat trm t::trm"} |
|
1057 \;\;\isacommand{bind} @{text "bn(p)"} \isacommand{in} @{text t}\\ |
|
1058 \isacommand{and} @{text pat} $=$ |
|
1059 @{text PNil} |
|
1060 $\mid$~@{text "PVar name"} |
|
1061 $\mid$~@{text "PTup pat pat"}\\ |
|
1062 \isacommand{binder}~@{text "bn::pat \<Rightarrow> atom list"}\\ |
|
1063 \isacommand{where}~@{text "bn(PNil) = []"}\\ |
|
1064 \hspace{5mm}$\mid$~@{text "bn(PVar x) = [atom x]"}\\ |
|
1065 \hspace{5mm}$\mid$~@{text "bn(PTup p\<^isub>1 p\<^isub>2) = bn(p\<^isub>1) @ bn(p\<^isub>2)"}\smallskip\\ |
|
1066 \end{tabular}} |
|
1067 \end{equation} |
|
1068 % |
|
1069 \noindent |
|
1070 In this specification the function @{text "bn"} determines which atoms of |
|
1071 the pattern @{text p} are bound in the argument @{text "t"}. Note that in the |
|
1072 second-last @{text bn}-clause the function @{text "atom"} coerces a name |
|
1073 into the generic atom type of Nominal Isabelle \cite{HuffmanUrban10}. This |
|
1074 allows us to treat binders of different atom type uniformly. |
|
1075 |
|
1076 As said above, for deep binders we allow binding clauses such as |
|
1077 % |
|
1078 %\begin{center} |
|
1079 %\begin{tabular}{ll} |
|
1080 @{text "Bar p::pat t::trm"} %%%& |
|
1081 \isacommand{bind} @{text "bn(p)"} \isacommand{in} @{text "p t"} %%\\ |
|
1082 %\end{tabular} |
|
1083 %\end{center} |
|
1084 % |
|
1085 %\noindent |
|
1086 where the argument of the deep binder also occurs in the body. We call such |
|
1087 binders \emph{recursive}. To see the purpose of such recursive binders, |
|
1088 compare ``plain'' @{text "Let"}s and @{text "Let_rec"}s in the following |
|
1089 specification: |
|
1090 % |
|
1091 \begin{equation}\label{letrecs} |
|
1092 \mbox{\small% |
|
1093 \begin{tabular}{@ {}l@ {}} |
|
1094 \isacommand{nominal\_datatype}~@{text "trm ="}~\ldots\\ |
|
1095 \hspace{5mm}$\mid$~@{text "Let as::assn t::trm"} |
|
1096 \;\;\isacommand{bind} @{text "bn(as)"} \isacommand{in} @{text t}\\ |
|
1097 \hspace{5mm}$\mid$~@{text "Let_rec as::assn t::trm"} |
|
1098 \;\;\isacommand{bind} @{text "bn(as)"} \isacommand{in} @{text "as t"}\\ |
|
1099 \isacommand{and} @{text "assn"} $=$ |
|
1100 @{text "ANil"} |
|
1101 $\mid$~@{text "ACons name trm assn"}\\ |
|
1102 \isacommand{binder} @{text "bn::assn \<Rightarrow> atom list"}\\ |
|
1103 \isacommand{where}~@{text "bn(ANil) = []"}\\ |
|
1104 \hspace{5mm}$\mid$~@{text "bn(ACons a t as) = [atom a] @ bn(as)"}\\ |
|
1105 \end{tabular}} |
|
1106 \end{equation} |
|
1107 % |
|
1108 \noindent |
|
1109 The difference is that with @{text Let} we only want to bind the atoms @{text |
|
1110 "bn(as)"} in the term @{text t}, but with @{text "Let_rec"} we also want to bind the atoms |
|
1111 inside the assignment. This difference has consequences for the associated |
|
1112 notions of free-atoms and $\alpha$-equivalence. |
|
1113 |
|
1114 To make sure that atoms bound by deep binders cannot be free at the |
|
1115 same time, we cannot have more than one binding function for a deep binder. |
|
1116 Consequently we exclude specifications such as |
|
1117 % |
|
1118 \begin{center}\small |
|
1119 \begin{tabular}{@ {}l@ {\hspace{2mm}}l@ {}} |
|
1120 @{text "Baz\<^isub>1 p::pat t::trm"} & |
|
1121 \isacommand{bind} @{text "bn\<^isub>1(p) bn\<^isub>2(p)"} \isacommand{in} @{text t}\\ |
|
1122 @{text "Baz\<^isub>2 p::pat t\<^isub>1::trm t\<^isub>2::trm"} & |
|
1123 \isacommand{bind} @{text "bn\<^isub>1(p)"} \isacommand{in} @{text "t\<^isub>1"}, |
|
1124 \isacommand{bind} @{text "bn\<^isub>2(p)"} \isacommand{in} @{text "t\<^isub>2"}\\ |
|
1125 \end{tabular} |
|
1126 \end{center} |
|
1127 |
|
1128 \noindent |
|
1129 Otherwise it is possible that @{text "bn\<^isub>1"} and @{text "bn\<^isub>2"} pick |
|
1130 out different atoms to become bound, respectively be free, in @{text "p"}. |
|
1131 (Since the Ott-tool does not derive a reasoning infrastructure for |
|
1132 $\alpha$-equated terms with deep binders, it can permit such specifications.) |
|
1133 |
|
1134 We also need to restrict the form of the binding functions in order |
|
1135 to ensure the @{text "bn"}-functions can be defined for $\alpha$-equated |
|
1136 terms. The main restriction is that we cannot return an atom in a binding function that is also |
|
1137 bound in the corresponding term-constructor. That means in \eqref{letpat} |
|
1138 that the term-constructors @{text PVar} and @{text PTup} may |
|
1139 not have a binding clause (all arguments are used to define @{text "bn"}). |
|
1140 In contrast, in case of \eqref{letrecs} the term-constructor @{text ACons} |
|
1141 may have a binding clause involving the argument @{text trm} (the only one that |
|
1142 is \emph{not} used in the definition of the binding function). This restriction |
|
1143 is sufficient for lifting the binding function to $\alpha$-equated terms. |
|
1144 |
|
1145 In the version of |
|
1146 Nominal Isabelle described here, we also adopted the restriction from the |
|
1147 Ott-tool that binding functions can only return: the empty set or empty list |
|
1148 (as in case @{text PNil}), a singleton set or singleton list containing an |
|
1149 atom (case @{text PVar}), or unions of atom sets or appended atom lists |
|
1150 (case @{text PTup}). This restriction will simplify some automatic definitions and proofs |
|
1151 later on. |
|
1152 |
|
1153 In order to simplify our definitions of free atoms and $\alpha$-equivalence, |
|
1154 we shall assume specifications |
|
1155 of term-calculi are implicitly \emph{completed}. By this we mean that |
|
1156 for every argument of a term-constructor that is \emph{not} |
|
1157 already part of a binding clause given by the user, we add implicitly a special \emph{empty} binding |
|
1158 clause, written \isacommand{bind}~@{term "{}"}~\isacommand{in}~@{text "labels"}. In case |
|
1159 of the lambda-terms, the completion produces |
|
1160 |
|
1161 \begin{center}\small |
|
1162 \begin{tabular}{@ {}l@ {\hspace{-1mm}}} |
|
1163 \isacommand{nominal\_datatype} @{text lam} =\\ |
|
1164 \hspace{5mm}\phantom{$\mid$}~@{text "Var x::name"} |
|
1165 \;\;\isacommand{bind}~@{term "{}"}~\isacommand{in}~@{text "x"}\\ |
|
1166 \hspace{5mm}$\mid$~@{text "App t\<^isub>1::lam t\<^isub>2::lam"} |
|
1167 \;\;\isacommand{bind}~@{term "{}"}~\isacommand{in}~@{text "t\<^isub>1 t\<^isub>2"}\\ |
|
1168 \hspace{5mm}$\mid$~@{text "Lam x::name t::lam"} |
|
1169 \;\;\isacommand{bind}~@{text x} \isacommand{in} @{text t}\\ |
|
1170 \end{tabular} |
|
1171 \end{center} |
|
1172 |
|
1173 \noindent |
|
1174 The point of completion is that we can make definitions over the binding |
|
1175 clauses and be sure to have captured all arguments of a term constructor. |
|
1176 *} |
|
1177 |
|
1178 section {* Alpha-Equivalence and Free Atoms\label{sec:alpha} *} |
|
1179 |
|
1180 text {* |
|
1181 Having dealt with all syntax matters, the problem now is how we can turn |
|
1182 specifications into actual type definitions in Isabelle/HOL and then |
|
1183 establish a reasoning infrastructure for them. As |
|
1184 Pottier and Cheney pointed out \cite{Pottier06,Cheney05}, just |
|
1185 re-arranging the arguments of |
|
1186 term-constructors so that binders and their bodies are next to each other will |
|
1187 result in inadequate representations in cases like @{text "Let x\<^isub>1 = t\<^isub>1\<dots>x\<^isub>n = t\<^isub>n in s"}. |
|
1188 Therefore we will first |
|
1189 extract ``raw'' datatype definitions from the specification and then define |
|
1190 explicitly an $\alpha$-equivalence relation over them. We subsequently |
|
1191 construct the quotient of the datatypes according to our $\alpha$-equivalence. |
|
1192 |
|
1193 The ``raw'' datatype definition can be obtained by stripping off the |
|
1194 binding clauses and the labels from the types. We also have to invent |
|
1195 new names for the types @{text "ty\<^sup>\<alpha>"} and term-constructors @{text "C\<^sup>\<alpha>"} |
|
1196 given by the user. In our implementation we just use the affix ``@{text "_raw"}''. |
|
1197 But for the purpose of this paper, we use the superscript @{text "_\<^sup>\<alpha>"} to indicate |
|
1198 that a notion is given for $\alpha$-equivalence classes and leave it out |
|
1199 for the corresponding notion given on the ``raw'' level. So for example |
|
1200 we have @{text "ty\<^sup>\<alpha> \<mapsto> ty"} and @{text "C\<^sup>\<alpha> \<mapsto> C"} |
|
1201 where @{term ty} is the type used in the quotient construction for |
|
1202 @{text "ty\<^sup>\<alpha>"} and @{text "C"} is the term-constructor on the ``raw'' type @{text "ty"}. |
|
1203 |
|
1204 %The resulting datatype definition is legal in Isabelle/HOL provided the datatypes are |
|
1205 %non-empty and the types in the constructors only occur in positive |
|
1206 %position (see \cite{Berghofer99} for an in-depth description of the datatype package |
|
1207 %in Isabelle/HOL). |
|
1208 We subsequently define each of the user-specified binding |
|
1209 functions @{term "bn"}$_{1..m}$ by recursion over the corresponding |
|
1210 raw datatype. We can also easily define permutation operations by |
|
1211 recursion so that for each term constructor @{text "C"} we have that |
|
1212 % |
|
1213 \begin{equation}\label{ceqvt} |
|
1214 @{text "p \<bullet> (C z\<^isub>1 \<dots> z\<^isub>n) = C (p \<bullet> z\<^isub>1) \<dots> (p \<bullet> z\<^isub>n)"} |
|
1215 \end{equation} |
|
1216 |
|
1217 The first non-trivial step we have to perform is the generation of |
|
1218 free-atom functions from the specification. For the |
|
1219 \emph{raw} types @{text "ty"}$_{1..n}$ we define the free-atom functions |
|
1220 % |
|
1221 %\begin{equation}\label{fvars} |
|
1222 @{text "fa_ty\<^isub>"}$_{1..n}$ |
|
1223 %\end{equation} |
|
1224 % |
|
1225 %\noindent |
|
1226 by recursion. |
|
1227 We define these functions together with auxiliary free-atom functions for |
|
1228 the binding functions. Given raw binding functions @{text "bn"}$_{1..m}$ |
|
1229 we define |
|
1230 % |
|
1231 %\begin{center} |
|
1232 @{text "fa_bn\<^isub>"}$_{1..m}$. |
|
1233 %\end{center} |
|
1234 % |
|
1235 %\noindent |
|
1236 The reason for this setup is that in a deep binder not all atoms have to be |
|
1237 bound, as we saw in the example with ``plain'' @{text Let}s. We need therefore a function |
|
1238 that calculates those free atoms in a deep binder. |
|
1239 |
|
1240 While the idea behind these free-atom functions is clear (they just |
|
1241 collect all atoms that are not bound), because of our rather complicated |
|
1242 binding mechanisms their definitions are somewhat involved. Given |
|
1243 a term-constructor @{text "C"} of type @{text ty} and some associated |
|
1244 binding clauses @{text "bc\<^isub>1\<dots>bc\<^isub>k"}, the result of @{text |
|
1245 "fa_ty (C z\<^isub>1 \<dots> z\<^isub>n)"} will be the union @{text |
|
1246 "fa(bc\<^isub>1) \<union> \<dots> \<union> fa(bc\<^isub>k)"} where we will define below what @{text "fa"} for a binding |
|
1247 clause means. We only show the details for the mode \isacommand{bind (set)} (the other modes are similar). |
|
1248 Suppose the binding clause @{text bc\<^isub>i} is of the form |
|
1249 % |
|
1250 %\begin{center} |
|
1251 \mbox{\isacommand{bind (set)} @{text "b\<^isub>1\<dots>b\<^isub>p"} \isacommand{in} @{text "d\<^isub>1\<dots>d\<^isub>q"}} |
|
1252 %\end{center} |
|
1253 % |
|
1254 %\noindent |
|
1255 in which the body-labels @{text "d"}$_{1..q}$ refer to types @{text ty}$_{1..q}$, |
|
1256 and the binders @{text b}$_{1..p}$ |
|
1257 either refer to labels of atom types (in case of shallow binders) or to binding |
|
1258 functions taking a single label as argument (in case of deep binders). Assuming |
|
1259 @{text "D"} stands for the set of free atoms of the bodies, @{text B} for the |
|
1260 set of binding atoms in the binders and @{text "B'"} for the set of free atoms in |
|
1261 non-recursive deep binders, |
|
1262 then the free atoms of the binding clause @{text bc\<^isub>i} are\\[-2mm] |
|
1263 % |
|
1264 \begin{equation}\label{fadef} |
|
1265 \mbox{@{text "fa(bc\<^isub>i) \<equiv> (D - B) \<union> B'"}}. |
|
1266 \end{equation} |
|
1267 % |
|
1268 \noindent |
|
1269 The set @{text D} is formally defined as |
|
1270 % |
|
1271 %\begin{center} |
|
1272 @{text "D \<equiv> fa_ty\<^isub>1 d\<^isub>1 \<union> ... \<union> fa_ty\<^isub>q d\<^isub>q"} |
|
1273 %\end{center} |
|
1274 % |
|
1275 %\noindent |
|
1276 where in case @{text "d\<^isub>i"} refers to one of the raw types @{text "ty"}$_{1..n}$ from the |
|
1277 specification, the function @{text "fa_ty\<^isub>i"} is the corresponding free-atom function |
|
1278 we are defining by recursion; |
|
1279 %(see \eqref{fvars}); |
|
1280 otherwise we set @{text "fa_ty\<^isub>i d\<^isub>i = supp d\<^isub>i"}. |
|
1281 |
|
1282 In order to formally define the set @{text B} we use the following auxiliary @{text "bn"}-functions |
|
1283 for atom types to which shallow binders may refer\\[-4mm] |
|
1284 % |
|
1285 %\begin{center} |
|
1286 %\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l} |
|
1287 %@{text "bn_atom a"} & @{text "\<equiv>"} & @{text "{atom a}"}\\ |
|
1288 %@{text "bn_atom_set as"} & @{text "\<equiv>"} & @{text "atoms as"}\\ |
|
1289 %@{text "bn_atom_list as"} & @{text "\<equiv>"} & @{text "atoms (set as)"} |
|
1290 %\end{tabular} |
|
1291 %\end{center} |
|
1292 % |
|
1293 \begin{center} |
|
1294 @{text "bn\<^bsub>atom\<^esub> a \<equiv> {atom a}"}\hfill |
|
1295 @{text "bn\<^bsub>atom_set\<^esub> as \<equiv> atoms as"}\hfill |
|
1296 @{text "bn\<^bsub>atom_list\<^esub> as \<equiv> atoms (set as)"} |
|
1297 \end{center} |
|
1298 % |
|
1299 \noindent |
|
1300 Like the function @{text atom}, the function @{text "atoms"} coerces |
|
1301 a set of atoms to a set of the generic atom type. |
|
1302 %It is defined as @{text "atoms as \<equiv> {atom a | a \<in> as}"}. |
|
1303 The set @{text B} is then formally defined as\\[-4mm] |
|
1304 % |
|
1305 \begin{center} |
|
1306 @{text "B \<equiv> bn_ty\<^isub>1 b\<^isub>1 \<union> ... \<union> bn_ty\<^isub>p b\<^isub>p"} |
|
1307 \end{center} |
|
1308 % |
|
1309 \noindent |
|
1310 where we use the auxiliary binding functions for shallow binders. |
|
1311 The set @{text "B'"} collects all free atoms in non-recursive deep |
|
1312 binders. Let us assume these binders in @{text "bc\<^isub>i"} are |
|
1313 % |
|
1314 %\begin{center} |
|
1315 \mbox{@{text "bn\<^isub>1 l\<^isub>1, \<dots>, bn\<^isub>r l\<^isub>r"}} |
|
1316 %\end{center} |
|
1317 % |
|
1318 %\noindent |
|
1319 with @{text "l"}$_{1..r}$ $\subseteq$ @{text "b"}$_{1..p}$ and none of the |
|
1320 @{text "l"}$_{1..r}$ being among the bodies @{text |
|
1321 "d"}$_{1..q}$. The set @{text "B'"} is defined as\\[-5mm] |
|
1322 % |
|
1323 \begin{center} |
|
1324 @{text "B' \<equiv> fa_bn\<^isub>1 l\<^isub>1 \<union> ... \<union> fa_bn\<^isub>r l\<^isub>r"}\\[-9mm] |
|
1325 \end{center} |
|
1326 % |
|
1327 \noindent |
|
1328 This completes the definition of the free-atom functions @{text "fa_ty"}$_{1..n}$. |
|
1329 |
|
1330 Note that for non-recursive deep binders, we have to add in \eqref{fadef} |
|
1331 the set of atoms that are left unbound by the binding functions @{text |
|
1332 "bn"}$_{1..m}$. We used for the definition of |
|
1333 this set the functions @{text "fa_bn"}$_{1..m}$, which are also defined by mutual |
|
1334 recursion. Assume the user specified a @{text bn}-clause of the form |
|
1335 % |
|
1336 %\begin{center} |
|
1337 @{text "bn (C z\<^isub>1 \<dots> z\<^isub>s) = rhs"} |
|
1338 %\end{center} |
|
1339 % |
|
1340 %\noindent |
|
1341 where the @{text "z"}$_{1..s}$ are of types @{text "ty"}$_{1..s}$. For each of |
|
1342 the arguments we calculate the free atoms as follows: |
|
1343 % |
|
1344 \begin{center} |
|
1345 \begin{tabular}{c@ {\hspace{2mm}}p{0.9\textwidth}} |
|
1346 $\bullet$ & @{term "fa_ty\<^isub>i z\<^isub>i"} provided @{text "z\<^isub>i"} does not occur in @{text "rhs"} |
|
1347 (that means nothing is bound in @{text "z\<^isub>i"} by the binding function),\\ |
|
1348 $\bullet$ & @{term "fa_bn\<^isub>i z\<^isub>i"} provided @{text "z\<^isub>i"} occurs in @{text "rhs"} |
|
1349 with the recursive call @{text "bn\<^isub>i z\<^isub>i"}, and\\ |
|
1350 $\bullet$ & @{term "{}"} provided @{text "z\<^isub>i"} occurs in @{text "rhs"}, |
|
1351 but without a recursive call. |
|
1352 \end{tabular} |
|
1353 \end{center} |
|
1354 % |
|
1355 \noindent |
|
1356 For defining @{text "fa_bn (C z\<^isub>1 \<dots> z\<^isub>n)"} we just union up all these sets. |
|
1357 |
|
1358 To see how these definitions work in practice, let us reconsider the |
|
1359 term-constructors @{text "Let"} and @{text "Let_rec"} shown in |
|
1360 \eqref{letrecs} together with the term-constructors for assignments @{text |
|
1361 "ANil"} and @{text "ACons"}. Since there is a binding function defined for |
|
1362 assignments, we have three free-atom functions, namely @{text |
|
1363 "fa\<^bsub>trm\<^esub>"}, @{text "fa\<^bsub>assn\<^esub>"} and @{text |
|
1364 "fa\<^bsub>bn\<^esub>"} as follows: |
|
1365 % |
|
1366 \begin{center}\small |
|
1367 \begin{tabular}{@ {}l@ {\hspace{1mm}}c@ {\hspace{1mm}}l@ {}} |
|
1368 @{text "fa\<^bsub>trm\<^esub> (Let as t)"} & @{text "="} & @{text "(fa\<^bsub>trm\<^esub> t - set (bn as)) \<union> fa\<^bsub>bn\<^esub> as"}\\ |
|
1369 @{text "fa\<^bsub>trm\<^esub> (Let_rec as t)"} & @{text "="} & @{text "(fa\<^bsub>assn\<^esub> as \<union> fa\<^bsub>trm\<^esub> t) - set (bn as)"}\\[1mm] |
|
1370 |
|
1371 @{text "fa\<^bsub>assn\<^esub> (ANil)"} & @{text "="} & @{term "{}"}\\ |
|
1372 @{text "fa\<^bsub>assn\<^esub> (ACons a t as)"} & @{text "="} & @{text "(supp a) \<union> (fa\<^bsub>trm\<^esub> t) \<union> (fa\<^bsub>assn\<^esub> as)"}\\[1mm] |
|
1373 |
|
1374 @{text "fa\<^bsub>bn\<^esub> (ANil)"} & @{text "="} & @{term "{}"}\\ |
|
1375 @{text "fa\<^bsub>bn\<^esub> (ACons a t as)"} & @{text "="} & @{text "(fa\<^bsub>trm\<^esub> t) \<union> (fa\<^bsub>bn\<^esub> as)"} |
|
1376 \end{tabular} |
|
1377 \end{center} |
|
1378 |
|
1379 \noindent |
|
1380 Recall that @{text ANil} and @{text "ACons"} have no |
|
1381 binding clause in the specification. The corresponding free-atom |
|
1382 function @{text "fa\<^bsub>assn\<^esub>"} therefore returns all free atoms |
|
1383 of an assignment (in case of @{text "ACons"}, they are given in |
|
1384 terms of @{text supp}, @{text "fa\<^bsub>trm\<^esub>"} and @{text "fa\<^bsub>assn\<^esub>"}). |
|
1385 The binding only takes place in @{text Let} and |
|
1386 @{text "Let_rec"}. In case of @{text "Let"}, the binding clause specifies |
|
1387 that all atoms given by @{text "set (bn as)"} have to be bound in @{text |
|
1388 t}. Therefore we have to subtract @{text "set (bn as)"} from @{text |
|
1389 "fa\<^bsub>trm\<^esub> t"}. However, we also need to add all atoms that are |
|
1390 free in @{text "as"}. This is |
|
1391 in contrast with @{text "Let_rec"} where we have a recursive |
|
1392 binder to bind all occurrences of the atoms in @{text |
|
1393 "set (bn as)"} also inside @{text "as"}. Therefore we have to subtract |
|
1394 @{text "set (bn as)"} from both @{text "fa\<^bsub>trm\<^esub> t"} and @{text "fa\<^bsub>assn\<^esub> as"}. |
|
1395 %Like the function @{text "bn"}, the function @{text "fa\<^bsub>bn\<^esub>"} traverses the |
|
1396 %list of assignments, but instead returns the free atoms, which means in this |
|
1397 %example the free atoms in the argument @{text "t"}. |
|
1398 |
|
1399 An interesting point in this |
|
1400 example is that a ``naked'' assignment (@{text "ANil"} or @{text "ACons"}) does not bind any |
|
1401 atoms, even if the binding function is specified over assignments. |
|
1402 Only in the context of a @{text Let} or @{text "Let_rec"}, where the binding clauses are given, will |
|
1403 some atoms actually become bound. This is a phenomenon that has also been pointed |
|
1404 out in \cite{ott-jfp}. For us this observation is crucial, because we would |
|
1405 not be able to lift the @{text "bn"}-functions to $\alpha$-equated terms if they act on |
|
1406 atoms that are bound. In that case, these functions would \emph{not} respect |
|
1407 $\alpha$-equivalence. |
|
1408 |
|
1409 Next we define the $\alpha$-equivalence relations for the raw types @{text |
|
1410 "ty"}$_{1..n}$ from the specification. We write them as |
|
1411 % |
|
1412 %\begin{center} |
|
1413 @{text "\<approx>ty"}$_{1..n}$. |
|
1414 %\end{center} |
|
1415 % |
|
1416 %\noindent |
|
1417 Like with the free-atom functions, we also need to |
|
1418 define auxiliary $\alpha$-equivalence relations |
|
1419 % |
|
1420 %\begin{center} |
|
1421 @{text "\<approx>bn\<^isub>"}$_{1..m}$ |
|
1422 %\end{center} |
|
1423 % |
|
1424 %\noindent |
|
1425 for the binding functions @{text "bn"}$_{1..m}$, |
|
1426 To simplify our definitions we will use the following abbreviations for |
|
1427 \emph{compound equivalence relations} and \emph{compound free-atom functions} acting on tuples. |
|
1428 % |
|
1429 \begin{center} |
|
1430 \begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l} |
|
1431 @{text "(x\<^isub>1,\<dots>, x\<^isub>n) (R\<^isub>1,\<dots>, R\<^isub>n) (x\<PRIME>\<^isub>1,\<dots>, x\<PRIME>\<^isub>n)"} & @{text "\<equiv>"} & |
|
1432 @{text "x\<^isub>1 R\<^isub>1 x\<PRIME>\<^isub>1 \<and> \<dots> \<and> x\<^isub>n R\<^isub>n x\<PRIME>\<^isub>n"}\\ |
|
1433 @{text "(fa\<^isub>1,\<dots>, fa\<^isub>n) (x\<^isub>1,\<dots>, x\<^isub>n)"} & @{text "\<equiv>"} & @{text "fa\<^isub>1 x\<^isub>1 \<union> \<dots> \<union> fa\<^isub>n x\<^isub>n"}\\ |
|
1434 \end{tabular} |
|
1435 \end{center} |
|
1436 |
|
1437 |
|
1438 The $\alpha$-equivalence relations are defined as inductive predicates |
|
1439 having a single clause for each term-constructor. Assuming a |
|
1440 term-constructor @{text C} is of type @{text ty} and has the binding clauses |
|
1441 @{term "bc"}$_{1..k}$, then the $\alpha$-equivalence clause has the form |
|
1442 % |
|
1443 \begin{center} |
|
1444 \mbox{\infer{@{text "C z\<^isub>1 \<dots> z\<^isub>n \<approx>ty C z\<PRIME>\<^isub>1 \<dots> z\<PRIME>\<^isub>n"}} |
|
1445 {@{text "prems(bc\<^isub>1) \<dots> prems(bc\<^isub>k)"}}} |
|
1446 \end{center} |
|
1447 |
|
1448 \noindent |
|
1449 The task below is to specify what the premises of a binding clause are. As a |
|
1450 special instance, we first treat the case where @{text "bc\<^isub>i"} is the |
|
1451 empty binding clause of the form |
|
1452 % |
|
1453 \begin{center} |
|
1454 \mbox{\isacommand{bind (set)} @{term "{}"} \isacommand{in} @{text "d\<^isub>1\<dots>d\<^isub>q"}.} |
|
1455 \end{center} |
|
1456 |
|
1457 \noindent |
|
1458 In this binding clause no atom is bound and we only have to $\alpha$-relate the bodies. For this |
|
1459 we build first the tuples @{text "D \<equiv> (d\<^isub>1,\<dots>, d\<^isub>q)"} and @{text "D' \<equiv> (d\<PRIME>\<^isub>1,\<dots>, d\<PRIME>\<^isub>q)"} |
|
1460 whereby the labels @{text "d"}$_{1..q}$ refer to arguments @{text "z"}$_{1..n}$ and |
|
1461 respectively @{text "d\<PRIME>"}$_{1..q}$ to @{text "z\<PRIME>"}$_{1..n}$. In order to relate |
|
1462 two such tuples we define the compound $\alpha$-equivalence relation @{text "R"} as follows |
|
1463 % |
|
1464 \begin{equation}\label{rempty} |
|
1465 \mbox{@{text "R \<equiv> (R\<^isub>1,\<dots>, R\<^isub>q)"}} |
|
1466 \end{equation} |
|
1467 |
|
1468 \noindent |
|
1469 with @{text "R\<^isub>i"} being @{text "\<approx>ty\<^isub>i"} if the corresponding labels @{text "d\<^isub>i"} and |
|
1470 @{text "d\<PRIME>\<^isub>i"} refer |
|
1471 to a recursive argument of @{text C} with type @{text "ty\<^isub>i"}; otherwise |
|
1472 we take @{text "R\<^isub>i"} to be the equality @{text "="}. This lets us define |
|
1473 the premise for an empty binding clause succinctly as @{text "prems(bc\<^isub>i) \<equiv> D R D'"}, |
|
1474 which can be unfolded to the series of premises |
|
1475 % |
|
1476 %\begin{center} |
|
1477 @{text "d\<^isub>1 R\<^isub>1 d\<PRIME>\<^isub>1 \<dots> d\<^isub>q R\<^isub>q d\<PRIME>\<^isub>q"}. |
|
1478 %\end{center} |
|
1479 % |
|
1480 %\noindent |
|
1481 We will use the unfolded version in the examples below. |
|
1482 |
|
1483 Now suppose the binding clause @{text "bc\<^isub>i"} is of the general form |
|
1484 % |
|
1485 \begin{equation}\label{nonempty} |
|
1486 \mbox{\isacommand{bind (set)} @{text "b\<^isub>1\<dots>b\<^isub>p"} \isacommand{in} @{text "d\<^isub>1\<dots>d\<^isub>q"}.} |
|
1487 \end{equation} |
|
1488 |
|
1489 \noindent |
|
1490 In this case we define a premise @{text P} using the relation |
|
1491 $\approx_{\,\textit{set}}$ given in Section~\ref{sec:binders} (similarly |
|
1492 $\approx_{\,\textit{set+}}$ and $\approx_{\,\textit{list}}$ for the other |
|
1493 binding modes). This premise defines $\alpha$-equivalence of two abstractions |
|
1494 involving multiple binders. As above, we first build the tuples @{text "D"} and |
|
1495 @{text "D'"} for the bodies @{text "d"}$_{1..q}$, and the corresponding |
|
1496 compound $\alpha$-relation @{text "R"} (shown in \eqref{rempty}). |
|
1497 For $\approx_{\,\textit{set}}$ we also need |
|
1498 a compound free-atom function for the bodies defined as |
|
1499 % |
|
1500 \begin{center} |
|
1501 \mbox{@{text "fa \<equiv> (fa_ty\<^isub>1,\<dots>, fa_ty\<^isub>q)"}} |
|
1502 \end{center} |
|
1503 |
|
1504 \noindent |
|
1505 with the assumption that the @{text "d"}$_{1..q}$ refer to arguments of types @{text "ty"}$_{1..q}$. |
|
1506 The last ingredient we need are the sets of atoms bound in the bodies. |
|
1507 For this we take |
|
1508 |
|
1509 \begin{center} |
|
1510 @{text "B \<equiv> bn_ty\<^isub>1 b\<^isub>1 \<union> \<dots> \<union> bn_ty\<^isub>p b\<^isub>p"}\;.\\ |
|
1511 \end{center} |
|
1512 |
|
1513 \noindent |
|
1514 Similarly for @{text "B'"} using the labels @{text "b\<PRIME>"}$_{1..p}$. This |
|
1515 lets us formally define the premise @{text P} for a non-empty binding clause as: |
|
1516 % |
|
1517 \begin{center} |
|
1518 \mbox{@{term "P \<equiv> \<exists>p. (B, D) \<approx>set R fa p (B', D')"}}\;. |
|
1519 \end{center} |
|
1520 |
|
1521 \noindent |
|
1522 This premise accounts for $\alpha$-equivalence of the bodies of the binding |
|
1523 clause. |
|
1524 However, in case the binders have non-recursive deep binders, this premise |
|
1525 is not enough: |
|
1526 we also have to ``propagate'' $\alpha$-equivalence inside the structure of |
|
1527 these binders. An example is @{text "Let"} where we have to make sure the |
|
1528 right-hand sides of assignments are $\alpha$-equivalent. For this we use |
|
1529 relations @{text "\<approx>bn"}$_{1..m}$ (which we will formally define shortly). |
|
1530 Let us assume the non-recursive deep binders in @{text "bc\<^isub>i"} are |
|
1531 % |
|
1532 %\begin{center} |
|
1533 @{text "bn\<^isub>1 l\<^isub>1, \<dots>, bn\<^isub>r l\<^isub>r"}. |
|
1534 %\end{center} |
|
1535 % |
|
1536 %\noindent |
|
1537 The tuple @{text L} is then @{text "(l\<^isub>1,\<dots>,l\<^isub>r)"} (similarly @{text "L'"}) |
|
1538 and the compound equivalence relation @{text "R'"} is @{text "(\<approx>bn\<^isub>1,\<dots>,\<approx>bn\<^isub>r)"}. |
|
1539 All premises for @{text "bc\<^isub>i"} are then given by |
|
1540 % |
|
1541 \begin{center} |
|
1542 @{text "prems(bc\<^isub>i) \<equiv> P \<and> L R' L'"} |
|
1543 \end{center} |
|
1544 |
|
1545 \noindent |
|
1546 The auxiliary $\alpha$-equivalence relations @{text "\<approx>bn"}$_{1..m}$ |
|
1547 in @{text "R'"} are defined as follows: assuming a @{text bn}-clause is of the form |
|
1548 % |
|
1549 %\begin{center} |
|
1550 @{text "bn (C z\<^isub>1 \<dots> z\<^isub>s) = rhs"} |
|
1551 %\end{center} |
|
1552 % |
|
1553 %\noindent |
|
1554 where the @{text "z"}$_{1..s}$ are of types @{text "ty"}$_{1..s}$, |
|
1555 then the corresponding $\alpha$-equivalence clause for @{text "\<approx>bn"} has the form |
|
1556 % |
|
1557 \begin{center} |
|
1558 \mbox{\infer{@{text "C z\<^isub>1 \<dots> z\<^isub>s \<approx>bn C z\<PRIME>\<^isub>1 \<dots> z\<PRIME>\<^isub>s"}} |
|
1559 {@{text "z\<^isub>1 R\<^isub>1 z\<PRIME>\<^isub>1 \<dots> z\<^isub>s R\<^isub>s z\<PRIME>\<^isub>s"}}} |
|
1560 \end{center} |
|
1561 |
|
1562 \noindent |
|
1563 In this clause the relations @{text "R"}$_{1..s}$ are given by |
|
1564 |
|
1565 \begin{center} |
|
1566 \begin{tabular}{c@ {\hspace{2mm}}p{0.9\textwidth}} |
|
1567 $\bullet$ & @{text "z\<^isub>i \<approx>ty z\<PRIME>\<^isub>i"} provided @{text "z\<^isub>i"} does not occur in @{text rhs} and |
|
1568 is a recursive argument of @{text C},\\ |
|
1569 $\bullet$ & @{text "z\<^isub>i = z\<PRIME>\<^isub>i"} provided @{text "z\<^isub>i"} does not occur in @{text rhs} |
|
1570 and is a non-recursive argument of @{text C},\\ |
|
1571 $\bullet$ & @{text "z\<^isub>i \<approx>bn\<^isub>i z\<PRIME>\<^isub>i"} provided @{text "z\<^isub>i"} occurs in @{text rhs} |
|
1572 with the recursive call @{text "bn\<^isub>i x\<^isub>i"} and\\ |
|
1573 $\bullet$ & @{text True} provided @{text "z\<^isub>i"} occurs in @{text rhs} but without a |
|
1574 recursive call. |
|
1575 \end{tabular} |
|
1576 \end{center} |
|
1577 |
|
1578 \noindent |
|
1579 This completes the definition of $\alpha$-equivalence. As a sanity check, we can show |
|
1580 that the premises of empty binding clauses are a special case of the clauses for |
|
1581 non-empty ones (we just have to unfold the definition of $\approx_{\,\textit{set}}$ and take @{text "0"} |
|
1582 for the existentially quantified permutation). |
|
1583 |
|
1584 Again let us take a look at a concrete example for these definitions. For \eqref{letrecs} |
|
1585 we have three relations $\approx_{\textit{trm}}$, $\approx_{\textit{assn}}$ and |
|
1586 $\approx_{\textit{bn}}$ with the following clauses: |
|
1587 |
|
1588 \begin{center}\small |
|
1589 \begin{tabular}{@ {}c @ {}} |
|
1590 \infer{@{text "Let as t \<approx>\<^bsub>trm\<^esub> Let as' t'"}} |
|
1591 {@{term "\<exists>p. (bn as, t) \<approx>lst alpha_trm fa_trm p (bn as', t')"} & @{text "as \<approx>\<^bsub>bn\<^esub> as'"}}\smallskip\\ |
|
1592 \makebox[0mm]{\infer{@{text "Let_rec as t \<approx>\<^bsub>trm\<^esub> Let_rec as' t'"}} |
|
1593 {@{term "\<exists>p. (bn as, ast) \<approx>lst alpha_trm2 fa_trm2 p (bn as', ast')"}}} |
|
1594 \end{tabular} |
|
1595 \end{center} |
|
1596 |
|
1597 \begin{center}\small |
|
1598 \begin{tabular}{@ {}c @ {}} |
|
1599 \infer{@{text "ANil \<approx>\<^bsub>assn\<^esub> ANil"}}{}\hspace{9mm} |
|
1600 \infer{@{text "ACons a t as \<approx>\<^bsub>assn\<^esub> ACons a' t' as"}} |
|
1601 {@{text "a = a'"} & @{text "t \<approx>\<^bsub>trm\<^esub> t'"} & @{text "as \<approx>\<^bsub>assn\<^esub> as'"}} |
|
1602 \end{tabular} |
|
1603 \end{center} |
|
1604 |
|
1605 \begin{center}\small |
|
1606 \begin{tabular}{@ {}c @ {}} |
|
1607 \infer{@{text "ANil \<approx>\<^bsub>bn\<^esub> ANil"}}{}\hspace{9mm} |
|
1608 \infer{@{text "ACons a t as \<approx>\<^bsub>bn\<^esub> ACons a' t' as"}} |
|
1609 {@{text "t \<approx>\<^bsub>trm\<^esub> t'"} & @{text "as \<approx>\<^bsub>bn\<^esub> as'"}} |
|
1610 \end{tabular} |
|
1611 \end{center} |
|
1612 |
|
1613 \noindent |
|
1614 Note the difference between $\approx_{\textit{assn}}$ and |
|
1615 $\approx_{\textit{bn}}$: the latter only ``tracks'' $\alpha$-equivalence of |
|
1616 the components in an assignment that are \emph{not} bound. This is needed in the |
|
1617 clause for @{text "Let"} (which has |
|
1618 a non-recursive binder). |
|
1619 %The underlying reason is that the terms inside an assignment are not meant |
|
1620 %to be ``under'' the binder. Such a premise is \emph{not} needed in @{text "Let_rec"}, |
|
1621 %because there all components of an assignment are ``under'' the binder. |
|
1622 *} |
|
1623 |
|
1624 section {* Establishing the Reasoning Infrastructure *} |
|
1625 |
|
1626 text {* |
|
1627 Having made all necessary definitions for raw terms, we can start |
|
1628 with establishing the reasoning infrastructure for the $\alpha$-equated types |
|
1629 @{text "ty\<AL>"}$_{1..n}$, that is the types the user originally specified. We sketch |
|
1630 in this section the proofs we need for establishing this infrastructure. One |
|
1631 main point of our work is that we have completely automated these proofs in Isabelle/HOL. |
|
1632 |
|
1633 First we establish that the |
|
1634 $\alpha$-equivalence relations defined in the previous section are |
|
1635 equivalence relations. |
|
1636 |
|
1637 \begin{lemma}\label{equiv} |
|
1638 Given the raw types @{text "ty"}$_{1..n}$ and binding functions |
|
1639 @{text "bn"}$_{1..m}$, the relations @{text "\<approx>ty"}$_{1..n}$ and |
|
1640 @{text "\<approx>bn"}$_{1..m}$ are equivalence relations.%% and equivariant. |
|
1641 \end{lemma} |
|
1642 |
|
1643 \begin{proof} |
|
1644 The proof is by mutual induction over the definitions. The non-trivial |
|
1645 cases involve premises built up by $\approx_{\textit{set}}$, |
|
1646 $\approx_{\textit{set+}}$ and $\approx_{\textit{list}}$. They |
|
1647 can be dealt with as in Lemma~\ref{alphaeq}. |
|
1648 \end{proof} |
|
1649 |
|
1650 \noindent |
|
1651 We can feed this lemma into our quotient package and obtain new types @{text |
|
1652 "ty"}$^\alpha_{1..n}$ representing $\alpha$-equated terms of types @{text "ty"}$_{1..n}$. |
|
1653 We also obtain definitions for the term-constructors @{text |
|
1654 "C"}$^\alpha_{1..k}$ from the raw term-constructors @{text |
|
1655 "C"}$_{1..k}$, and similar definitions for the free-atom functions @{text |
|
1656 "fa_ty"}$^\alpha_{1..n}$ and @{text "fa_bn"}$^\alpha_{1..m}$ as well as the binding functions @{text |
|
1657 "bn"}$^\alpha_{1..m}$. However, these definitions are not really useful to the |
|
1658 user, since they are given in terms of the isomorphisms we obtained by |
|
1659 creating new types in Isabelle/HOL (recall the picture shown in the |
|
1660 Introduction). |
|
1661 |
|
1662 The first useful property for the user is the fact that distinct |
|
1663 term-constructors are not |
|
1664 equal, that is |
|
1665 % |
|
1666 \begin{equation}\label{distinctalpha} |
|
1667 \mbox{@{text "C"}$^\alpha$~@{text "x\<^isub>1 \<dots> x\<^isub>r"}~@{text "\<noteq>"}~% |
|
1668 @{text "D"}$^\alpha$~@{text "y\<^isub>1 \<dots> y\<^isub>s"}} |
|
1669 \end{equation} |
|
1670 |
|
1671 \noindent |
|
1672 whenever @{text "C"}$^\alpha$~@{text "\<noteq>"}~@{text "D"}$^\alpha$. |
|
1673 In order to derive this fact, we use the definition of $\alpha$-equivalence |
|
1674 and establish that |
|
1675 % |
|
1676 \begin{equation}\label{distinctraw} |
|
1677 \mbox{@{text "C x\<^isub>1 \<dots> x\<^isub>r"}\;$\not\approx$@{text ty}\;@{text "D y\<^isub>1 \<dots> y\<^isub>s"}} |
|
1678 \end{equation} |
|
1679 |
|
1680 \noindent |
|
1681 holds for the corresponding raw term-constructors. |
|
1682 In order to deduce \eqref{distinctalpha} from \eqref{distinctraw}, our quotient |
|
1683 package needs to know that the raw term-constructors @{text "C"} and @{text "D"} |
|
1684 are \emph{respectful} w.r.t.~the $\alpha$-equivalence relations (see \cite{Homeier05}). |
|
1685 Assuming, for example, @{text "C"} is of type @{text "ty"} with argument types |
|
1686 @{text "ty"}$_{1..r}$, respectfulness amounts to showing that |
|
1687 % |
|
1688 \begin{center} |
|
1689 @{text "C x\<^isub>1 \<dots> x\<^isub>r \<approx>ty C x\<PRIME>\<^isub>1 \<dots> x\<PRIME>\<^isub>r"} |
|
1690 \end{center} |
|
1691 |
|
1692 \noindent |
|
1693 holds under the assumptions that we have \mbox{@{text |
|
1694 "x\<^isub>i \<approx>ty\<^isub>i x\<PRIME>\<^isub>i"}} whenever @{text "x\<^isub>i"} |
|
1695 and @{text "x\<PRIME>\<^isub>i"} are recursive arguments of @{text C} and |
|
1696 @{text "x\<^isub>i = x\<PRIME>\<^isub>i"} whenever they are non-recursive arguments. We can prove this |
|
1697 implication by applying the corresponding rule in our $\alpha$-equivalence |
|
1698 definition and by establishing the following auxiliary implications %facts |
|
1699 % |
|
1700 \begin{equation}\label{fnresp} |
|
1701 \mbox{% |
|
1702 \begin{tabular}{ll@ {\hspace{7mm}}ll} |
|
1703 \mbox{\it (i)} & @{text "x \<approx>ty\<^isub>i x\<PRIME>"}~~@{text "\<Rightarrow>"}~~@{text "fa_ty\<^isub>i x = fa_ty\<^isub>i x\<PRIME>"} & |
|
1704 \mbox{\it (iii)} & @{text "x \<approx>ty\<^isub>j x\<PRIME>"}~~@{text "\<Rightarrow>"}~~@{text "bn\<^isub>j x = bn\<^isub>j x\<PRIME>"}\\ |
|
1705 |
|
1706 \mbox{\it (ii)} & @{text "x \<approx>ty\<^isub>j x\<PRIME>"}~~@{text "\<Rightarrow>"}~~@{text "fa_bn\<^isub>j x = fa_bn\<^isub>j x\<PRIME>"} & |
|
1707 \mbox{\it (iv)} & @{text "x \<approx>ty\<^isub>j x\<PRIME>"}~~@{text "\<Rightarrow>"}~~@{text "x \<approx>bn\<^isub>j x\<PRIME>"}\\ |
|
1708 \end{tabular}} |
|
1709 \end{equation} |
|
1710 |
|
1711 \noindent |
|
1712 They can be established by induction on @{text "\<approx>ty"}$_{1..n}$. Whereas the first, |
|
1713 second and last implication are true by how we stated our definitions, the |
|
1714 third \emph{only} holds because of our restriction |
|
1715 imposed on the form of the binding functions---namely \emph{not} returning |
|
1716 any bound atoms. In Ott, in contrast, the user may |
|
1717 define @{text "bn"}$_{1..m}$ so that they return bound |
|
1718 atoms and in this case the third implication is \emph{not} true. A |
|
1719 result is that the lifting of the corresponding binding functions in Ott to $\alpha$-equated |
|
1720 terms is impossible. |
|
1721 |
|
1722 Having established respectfulness for the raw term-constructors, the |
|
1723 quotient package is able to automatically deduce \eqref{distinctalpha} from |
|
1724 \eqref{distinctraw}. Having the facts \eqref{fnresp} at our disposal, we can |
|
1725 also lift properties that characterise when two raw terms of the form |
|
1726 % |
|
1727 \begin{center} |
|
1728 @{text "C x\<^isub>1 \<dots> x\<^isub>r \<approx>ty C x\<PRIME>\<^isub>1 \<dots> x\<PRIME>\<^isub>r"} |
|
1729 \end{center} |
|
1730 |
|
1731 \noindent |
|
1732 are $\alpha$-equivalent. This gives us conditions when the corresponding |
|
1733 $\alpha$-equated terms are \emph{equal}, namely |
|
1734 % |
|
1735 %\begin{center} |
|
1736 @{text "C\<^sup>\<alpha> x\<^isub>1 \<dots> x\<^isub>r = C\<^sup>\<alpha> x\<PRIME>\<^isub>1 \<dots> x\<PRIME>\<^isub>r"}. |
|
1737 %\end{center} |
|
1738 % |
|
1739 %\noindent |
|
1740 We call these conditions as \emph{quasi-injectivity}. They correspond to |
|
1741 the premises in our $\alpha$-equivalence relations. |
|
1742 |
|
1743 Next we can lift the permutation |
|
1744 operations defined in \eqref{ceqvt}. In order to make this |
|
1745 lifting to go through, we have to show that the permutation operations are respectful. |
|
1746 This amounts to showing that the |
|
1747 $\alpha$-equivalence relations are equivariant \cite{HuffmanUrban10}. |
|
1748 %, which we already established |
|
1749 %in Lemma~\ref{equiv}. |
|
1750 As a result we can add the equations |
|
1751 % |
|
1752 \begin{equation}\label{calphaeqvt} |
|
1753 @{text "p \<bullet> (C\<^sup>\<alpha> x\<^isub>1 \<dots> x\<^isub>r) = C\<^sup>\<alpha> (p \<bullet> x\<^isub>1) \<dots> (p \<bullet> x\<^isub>r)"} |
|
1754 \end{equation} |
|
1755 |
|
1756 \noindent |
|
1757 to our infrastructure. In a similar fashion we can lift the defining equations |
|
1758 of the free-atom functions @{text "fn_ty\<AL>"}$_{1..n}$ and |
|
1759 @{text "fa_bn\<AL>"}$_{1..m}$ as well as of the binding functions @{text |
|
1760 "bn\<AL>"}$_{1..m}$ and the size functions @{text "size_ty\<AL>"}$_{1..n}$. |
|
1761 The latter are defined automatically for the raw types @{text "ty"}$_{1..n}$ |
|
1762 by the datatype package of Isabelle/HOL. |
|
1763 |
|
1764 Finally we can add to our infrastructure a cases lemma (explained in the next section) |
|
1765 and a structural induction principle |
|
1766 for the types @{text "ty\<AL>"}$_{1..n}$. The conclusion of the induction principle is |
|
1767 of the form |
|
1768 % |
|
1769 %\begin{equation}\label{weakinduct} |
|
1770 \mbox{@{text "P\<^isub>1 x\<^isub>1 \<and> \<dots> \<and> P\<^isub>n x\<^isub>n "}} |
|
1771 %\end{equation} |
|
1772 % |
|
1773 %\noindent |
|
1774 whereby the @{text P}$_{1..n}$ are predicates and the @{text x}$_{1..n}$ |
|
1775 have types @{text "ty\<AL>"}$_{1..n}$. This induction principle has for each |
|
1776 term constructor @{text "C"}$^\alpha$ a premise of the form |
|
1777 % |
|
1778 \begin{equation}\label{weakprem} |
|
1779 \mbox{@{text "\<forall>x\<^isub>1\<dots>x\<^isub>r. P\<^isub>i x\<^isub>i \<and> \<dots> \<and> P\<^isub>j x\<^isub>j \<Rightarrow> P (C\<^sup>\<alpha> x\<^isub>1 \<dots> x\<^isub>r)"}} |
|
1780 \end{equation} |
|
1781 |
|
1782 \noindent |
|
1783 in which the @{text "x"}$_{i..j}$ @{text "\<subseteq>"} @{text "x"}$_{1..r}$ are |
|
1784 the recursive arguments of @{text "C\<AL>"}. |
|
1785 |
|
1786 By working now completely on the $\alpha$-equated level, we |
|
1787 can first show that the free-atom functions and binding functions are |
|
1788 equivariant, namely |
|
1789 % |
|
1790 \begin{center} |
|
1791 \begin{tabular}{rcl@ {\hspace{10mm}}rcl} |
|
1792 @{text "p \<bullet> (fa_ty\<AL>\<^isub>i x)"} & $=$ & @{text "fa_ty\<AL>\<^isub>i (p \<bullet> x)"} & |
|
1793 @{text "p \<bullet> (bn\<AL>\<^isub>j x)"} & $=$ & @{text "bn\<AL>\<^isub>j (p \<bullet> x)"}\\ |
|
1794 @{text "p \<bullet> (fa_bn\<AL>\<^isub>j x)"} & $=$ & @{text "fa_bn\<AL>\<^isub>j (p \<bullet> x)"}\\ |
|
1795 \end{tabular} |
|
1796 \end{center} |
|
1797 % |
|
1798 \noindent |
|
1799 These properties can be established using the induction principle for the types @{text "ty\<AL>"}$_{1..n}$. |
|
1800 %%in \eqref{weakinduct}. |
|
1801 Having these equivariant properties established, we can |
|
1802 show that the support of term-constructors @{text "C\<^sup>\<alpha>"} is included in |
|
1803 the support of its arguments, that means |
|
1804 |
|
1805 \begin{center} |
|
1806 @{text "supp (C\<^sup>\<alpha> x\<^isub>1 \<dots> x\<^isub>r) \<subseteq> (supp x\<^isub>1 \<union> \<dots> \<union> supp x\<^isub>r)"} |
|
1807 \end{center} |
|
1808 |
|
1809 \noindent |
|
1810 holds. This allows us to prove by induction that |
|
1811 every @{text x} of type @{text "ty\<AL>"}$_{1..n}$ is finitely supported. |
|
1812 %This can be again shown by induction |
|
1813 %over @{text "ty\<AL>"}$_{1..n}$. |
|
1814 Lastly, we can show that the support of |
|
1815 elements in @{text "ty\<AL>"}$_{1..n}$ is the same as @{text "fa_ty\<AL>"}$_{1..n}$. |
|
1816 This fact is important in a nominal setting, but also provides evidence |
|
1817 that our notions of free-atoms and $\alpha$-equivalence are correct. |
|
1818 |
|
1819 \begin{theorem} |
|
1820 For @{text "x"}$_{1..n}$ with type @{text "ty\<AL>"}$_{1..n}$, we have |
|
1821 @{text "supp x\<^isub>i = fa_ty\<AL>\<^isub>i x\<^isub>i"}. |
|
1822 \end{theorem} |
|
1823 |
|
1824 \begin{proof} |
|
1825 The proof is by induction. In each case |
|
1826 we unfold the definition of @{text "supp"}, move the swapping inside the |
|
1827 term-constructors and then use the quasi-injectivity lemmas in order to complete the |
|
1828 proof. For the abstraction cases we use the facts derived in Theorem~\ref{suppabs}. |
|
1829 \end{proof} |
|
1830 |
|
1831 \noindent |
|
1832 To sum up this section, we can establish automatically a reasoning infrastructure |
|
1833 for the types @{text "ty\<AL>"}$_{1..n}$ |
|
1834 by first lifting definitions from the raw level to the quotient level and |
|
1835 then by establishing facts about these lifted definitions. All necessary proofs |
|
1836 are generated automatically by custom ML-code. |
|
1837 |
|
1838 %This code can deal with |
|
1839 %specifications such as the one shown in Figure~\ref{nominalcorehas} for Core-Haskell. |
|
1840 |
|
1841 %\begin{figure}[t!] |
|
1842 %\begin{boxedminipage}{\linewidth} |
|
1843 %\small |
|
1844 %\begin{tabular}{l} |
|
1845 %\isacommand{atom\_decl}~@{text "var cvar tvar"}\\[1mm] |
|
1846 %\isacommand{nominal\_datatype}~@{text "tkind ="}\\ |
|
1847 %\phantom{$|$}~@{text "KStar"}~$|$~@{text "KFun tkind tkind"}\\ |
|
1848 %\isacommand{and}~@{text "ckind ="}\\ |
|
1849 %\phantom{$|$}~@{text "CKSim ty ty"}\\ |
|
1850 %\isacommand{and}~@{text "ty ="}\\ |
|
1851 %\phantom{$|$}~@{text "TVar tvar"}~$|$~@{text "T string"}~$|$~@{text "TApp ty ty"}\\ |
|
1852 %$|$~@{text "TFun string ty_list"}~% |
|
1853 %$|$~@{text "TAll tv::tvar tkind ty::ty"} \isacommand{bind}~@{text "tv"}~\isacommand{in}~@{text ty}\\ |
|
1854 %$|$~@{text "TArr ckind ty"}\\ |
|
1855 %\isacommand{and}~@{text "ty_lst ="}\\ |
|
1856 %\phantom{$|$}~@{text "TNil"}~$|$~@{text "TCons ty ty_lst"}\\ |
|
1857 %\isacommand{and}~@{text "cty ="}\\ |
|
1858 %\phantom{$|$}~@{text "CVar cvar"}~% |
|
1859 %$|$~@{text "C string"}~$|$~@{text "CApp cty cty"}~$|$~@{text "CFun string co_lst"}\\ |
|
1860 %$|$~@{text "CAll cv::cvar ckind cty::cty"} \isacommand{bind}~@{text "cv"}~\isacommand{in}~@{text cty}\\ |
|
1861 %$|$~@{text "CArr ckind cty"}~$|$~@{text "CRefl ty"}~$|$~@{text "CSym cty"}~$|$~@{text "CCirc cty cty"}\\ |
|
1862 %$|$~@{text "CAt cty ty"}~$|$~@{text "CLeft cty"}~$|$~@{text "CRight cty"}~$|$~@{text "CSim cty cty"}\\ |
|
1863 %$|$~@{text "CRightc cty"}~$|$~@{text "CLeftc cty"}~$|$~@{text "Coerce cty cty"}\\ |
|
1864 %\isacommand{and}~@{text "co_lst ="}\\ |
|
1865 %\phantom{$|$}@{text "CNil"}~$|$~@{text "CCons cty co_lst"}\\ |
|
1866 %\isacommand{and}~@{text "trm ="}\\ |
|
1867 %\phantom{$|$}~@{text "Var var"}~$|$~@{text "K string"}\\ |
|
1868 %$|$~@{text "LAM_ty tv::tvar tkind t::trm"} \isacommand{bind}~@{text "tv"}~\isacommand{in}~@{text t}\\ |
|
1869 %$|$~@{text "LAM_cty cv::cvar ckind t::trm"} \isacommand{bind}~@{text "cv"}~\isacommand{in}~@{text t}\\ |
|
1870 %$|$~@{text "App_ty trm ty"}~$|$~@{text "App_cty trm cty"}~$|$~@{text "App trm trm"}\\ |
|
1871 %$|$~@{text "Lam v::var ty t::trm"} \isacommand{bind}~@{text "v"}~\isacommand{in}~@{text t}\\ |
|
1872 %$|$~@{text "Let x::var ty trm t::trm"} \isacommand{bind}~@{text x}~\isacommand{in}~@{text t}\\ |
|
1873 %$|$~@{text "Case trm assoc_lst"}~$|$~@{text "Cast trm co"}\\ |
|
1874 %\isacommand{and}~@{text "assoc_lst ="}\\ |
|
1875 %\phantom{$|$}~@{text ANil}~% |
|
1876 %$|$~@{text "ACons p::pat t::trm assoc_lst"} \isacommand{bind}~@{text "bv p"}~\isacommand{in}~@{text t}\\ |
|
1877 %\isacommand{and}~@{text "pat ="}\\ |
|
1878 %\phantom{$|$}~@{text "Kpat string tvtk_lst tvck_lst vt_lst"}\\ |
|
1879 %\isacommand{and}~@{text "vt_lst ="}\\ |
|
1880 %\phantom{$|$}~@{text VTNil}~$|$~@{text "VTCons var ty vt_lst"}\\ |
|
1881 %\isacommand{and}~@{text "tvtk_lst ="}\\ |
|
1882 %\phantom{$|$}~@{text TVTKNil}~$|$~@{text "TVTKCons tvar tkind tvtk_lst"}\\ |
|
1883 %\isacommand{and}~@{text "tvck_lst ="}\\ |
|
1884 %\phantom{$|$}~@{text TVCKNil}~$|$ @{text "TVCKCons cvar ckind tvck_lst"}\\ |
|
1885 %\isacommand{binder}\\ |
|
1886 %@{text "bv :: pat \<Rightarrow> atom list"}~\isacommand{and}~% |
|
1887 %@{text "bv1 :: vt_lst \<Rightarrow> atom list"}~\isacommand{and}\\ |
|
1888 %@{text "bv2 :: tvtk_lst \<Rightarrow> atom list"}~\isacommand{and}~% |
|
1889 %@{text "bv3 :: tvck_lst \<Rightarrow> atom list"}\\ |
|
1890 %\isacommand{where}\\ |
|
1891 %\phantom{$|$}~@{text "bv (K s tvts tvcs vs) = (bv3 tvts) @ (bv2 tvcs) @ (bv1 vs)"}\\ |
|
1892 %$|$~@{text "bv1 VTNil = []"}\\ |
|
1893 %$|$~@{text "bv1 (VTCons x ty tl) = (atom x)::(bv1 tl)"}\\ |
|
1894 %$|$~@{text "bv2 TVTKNil = []"}\\ |
|
1895 %$|$~@{text "bv2 (TVTKCons a ty tl) = (atom a)::(bv2 tl)"}\\ |
|
1896 %$|$~@{text "bv3 TVCKNil = []"}\\ |
|
1897 %$|$~@{text "bv3 (TVCKCons c cty tl) = (atom c)::(bv3 tl)"}\\ |
|
1898 %\end{tabular} |
|
1899 %\end{boxedminipage} |
|
1900 %\caption{The nominal datatype declaration for Core-Haskell. For the moment we |
|
1901 %do not support nested types; therefore we explicitly have to unfold the |
|
1902 %lists @{text "co_lst"}, @{text "assoc_lst"} and so on. This will be improved |
|
1903 %in a future version of Nominal Isabelle. Apart from that, the |
|
1904 %declaration follows closely the original in Figure~\ref{corehas}. The |
|
1905 %point of our work is that having made such a declaration in Nominal Isabelle, |
|
1906 %one obtains automatically a reasoning infrastructure for Core-Haskell. |
|
1907 %\label{nominalcorehas}} |
|
1908 %\end{figure} |
|
1909 *} |
|
1910 |
|
1911 |
|
1912 section {* Strong Induction Principles *} |
|
1913 |
|
1914 text {* |
|
1915 In the previous section we derived induction principles for $\alpha$-equated terms. |
|
1916 We call such induction principles \emph{weak}, because for a |
|
1917 term-constructor \mbox{@{text "C\<^sup>\<alpha> x\<^isub>1\<dots>x\<^isub>r"}} |
|
1918 the induction hypothesis requires us to establish the implications \eqref{weakprem}. |
|
1919 The problem with these implications is that in general they are difficult to establish. |
|
1920 The reason is that we cannot make any assumption about the bound atoms that might be in @{text "C\<^sup>\<alpha>"}. |
|
1921 %%(for example we cannot assume the variable convention for them). |
|
1922 |
|
1923 In \cite{UrbanTasson05} we introduced a method for automatically |
|
1924 strengthening weak induction principles for terms containing single |
|
1925 binders. These stronger induction principles allow the user to make additional |
|
1926 assumptions about bound atoms. |
|
1927 %These additional assumptions amount to a formal |
|
1928 %version of the informal variable convention for binders. |
|
1929 To sketch how this strengthening extends to the case of multiple binders, we use as |
|
1930 running example the term-constructors @{text "Lam"} and @{text "Let"} |
|
1931 from example \eqref{letpat}. Instead of establishing @{text " P\<^bsub>trm\<^esub> t \<and> P\<^bsub>pat\<^esub> p"}, |
|
1932 the stronger induction principle for \eqref{letpat} establishes properties @{text " P\<^bsub>trm\<^esub> c t \<and> P\<^bsub>pat\<^esub> c p"} |
|
1933 where the additional parameter @{text c} controls |
|
1934 which freshness assumptions the binders should satisfy. For the two term constructors |
|
1935 this means that the user has to establish in inductions the implications |
|
1936 % |
|
1937 \begin{center} |
|
1938 \begin{tabular}{l} |
|
1939 @{text "\<forall>a t c. {atom a} \<FRESH>\<^sup>* c \<and> (\<forall>d. P\<^bsub>trm\<^esub> d t) \<Rightarrow> P\<^bsub>trm\<^esub> c (Lam a t)"}\\ |
|
1940 @{text "\<forall>p t c. (set (bn p)) \<FRESH>\<^sup>* c \<and> (\<forall>d. P\<^bsub>pat\<^esub> d p) \<and> (\<forall>d. P\<^bsub>trm\<^esub> d t) \<and> \<Rightarrow> P\<^bsub>trm\<^esub> c (Let p t)"}\\%[-0mm] |
|
1941 \end{tabular} |
|
1942 \end{center} |
|
1943 |
|
1944 In \cite{UrbanTasson05} we showed how the weaker induction principles imply |
|
1945 the stronger ones. This was done by some quite complicated, nevertheless automated, |
|
1946 induction proof. In this paper we simplify this work by leveraging the automated proof |
|
1947 methods from the function package of Isabelle/HOL. |
|
1948 The reasoning principle these methods employ is well-founded induction. |
|
1949 To use them in our setting, we have to discharge |
|
1950 two proof obligations: one is that we have |
|
1951 well-founded measures (for each type @{text "ty"}$^\alpha_{1..n}$) that decrease in |
|
1952 every induction step and the other is that we have covered all cases. |
|
1953 As measures we use the size functions |
|
1954 @{text "size_ty"}$^\alpha_{1..n}$, which we lifted in the previous section and which are |
|
1955 all well-founded. %It is straightforward to establish that these measures decrease |
|
1956 %in every induction step. |
|
1957 |
|
1958 What is left to show is that we covered all cases. To do so, we use |
|
1959 a \emph{cases lemma} derived for each type. For the terms in \eqref{letpat} |
|
1960 this lemma is of the form |
|
1961 % |
|
1962 \begin{equation}\label{weakcases} |
|
1963 \infer{@{text "P\<^bsub>trm\<^esub>"}} |
|
1964 {\begin{array}{l@ {\hspace{9mm}}l} |
|
1965 @{text "\<forall>x. t = Var x \<Rightarrow> P\<^bsub>trm\<^esub>"} & @{text "\<forall>a t'. t = Lam a t' \<Rightarrow> P\<^bsub>trm\<^esub>"}\\ |
|
1966 @{text "\<forall>t\<^isub>1 t\<^isub>2. t = App t\<^isub>1 t\<^isub>2 \<Rightarrow> P\<^bsub>trm\<^esub>"} & @{text "\<forall>p t'. t = Let p t' \<Rightarrow> P\<^bsub>trm\<^esub>"}\\ |
|
1967 \end{array}}\\[-1mm] |
|
1968 \end{equation} |
|
1969 % |
|
1970 where we have a premise for each term-constructor. |
|
1971 The idea behind such cases lemmas is that we can conclude with a property @{text "P\<^bsub>trm\<^esub>"}, |
|
1972 provided we can show that this property holds if we substitute for @{text "t"} all |
|
1973 possible term-constructors. |
|
1974 |
|
1975 The only remaining difficulty is that in order to derive the stronger induction |
|
1976 principles conveniently, the cases lemma in \eqref{weakcases} is too weak. For this note that |
|
1977 in order to apply this lemma, we have to establish @{text "P\<^bsub>trm\<^esub>"} for \emph{all} @{text Lam}- and |
|
1978 \emph{all} @{text Let}-terms. |
|
1979 What we need instead is a cases lemma where we only have to consider terms that have |
|
1980 binders that are fresh w.r.t.~a context @{text "c"}. This gives the implications |
|
1981 % |
|
1982 \begin{center} |
|
1983 \begin{tabular}{l} |
|
1984 @{text "\<forall>a t'. t = Lam a t' \<and> {atom a} \<FRESH>\<^sup>* c \<Rightarrow> P\<^bsub>trm\<^esub>"}\\ |
|
1985 @{text "\<forall>p t'. t = Let p t' \<and> (set (bn p)) \<FRESH>\<^sup>* c \<Rightarrow> P\<^bsub>trm\<^esub>"}\\%[-2mm] |
|
1986 \end{tabular} |
|
1987 \end{center} |
|
1988 % |
|
1989 \noindent |
|
1990 which however can be relatively easily be derived from the implications in \eqref{weakcases} |
|
1991 by a renaming using Properties \ref{supppermeq} and \ref{avoiding}. In the first case we know |
|
1992 that @{text "{atom a} \<FRESH>\<^sup>* Lam a t"}. Property \eqref{avoiding} provides us therefore with |
|
1993 a permutation @{text q}, such that @{text "{atom (q \<bullet> a)} \<FRESH>\<^sup>* c"} and |
|
1994 @{text "supp (Lam a t) \<FRESH>\<^sup>* q"} hold. |
|
1995 By using Property \ref{supppermeq}, we can infer from the latter |
|
1996 that @{text "Lam (q \<bullet> a) (q \<bullet> t) = Lam a t"} |
|
1997 and we are done with this case. |
|
1998 |
|
1999 The @{text Let}-case involving a (non-recursive) deep binder is a bit more complicated. |
|
2000 The reason is that the we cannot apply Property \ref{avoiding} to the whole term @{text "Let p t"}, |
|
2001 because @{text p} might contain names bound by @{text bn}, but also some that are |
|
2002 free. To solve this problem we have to introduce a permutation function that only |
|
2003 permutes names bound by @{text bn} and leaves the other names unchanged. We do this again |
|
2004 by lifting. For a |
|
2005 clause @{text "bn (C x\<^isub>1 \<dots> x\<^isub>r) = rhs"}, we define |
|
2006 % |
|
2007 \begin{center} |
|
2008 @{text "p \<bullet>\<^bsub>bn\<^esub> (C x\<^isub>1 \<dots> x\<^isub>r) \<equiv> C y\<^isub>1 \<dots> y\<^isub>r"} with |
|
2009 $\begin{cases} |
|
2010 \text{@{text "y\<^isub>i \<equiv> x\<^isub>i"} provided @{text "x\<^isub>i"} does not occur in @{text "rhs"}}\\ |
|
2011 \text{@{text "y\<^isub>i \<equiv> p \<bullet>\<^bsub>bn'\<^esub> x\<^isub>i"} provided @{text "bn' x\<^isub>i"} is in @{text "rhs"}}\\ |
|
2012 \text{@{text "y\<^isub>i \<equiv> p \<bullet> x\<^isub>i"} otherwise} |
|
2013 \end{cases}$ |
|
2014 \end{center} |
|
2015 % |
|
2016 %\noindent |
|
2017 %with @{text "y\<^isub>i"} determined as follows: |
|
2018 % |
|
2019 %\begin{center} |
|
2020 %\begin{tabular}{c@ {\hspace{2mm}}p{0.9\textwidth}} |
|
2021 %$\bullet$ & @{text "y\<^isub>i \<equiv> x\<^isub>i"} provided @{text "x\<^isub>i"} does not occur in @{text "rhs"}\\ |
|
2022 %$\bullet$ & @{text "y\<^isub>i \<equiv> p \<bullet>\<^bsub>bn'\<^esub> x\<^isub>i"} provided @{text "bn' x\<^isub>i"} is in @{text "rhs"}\\ |
|
2023 %$\bullet$ & @{text "y\<^isub>i \<equiv> p \<bullet> x\<^isub>i"} otherwise |
|
2024 %\end{tabular} |
|
2025 %\end{center} |
|
2026 % |
|
2027 \noindent |
|
2028 Now Properties \ref{supppermeq} and \ref{avoiding} give us a permutation @{text q} such that |
|
2029 @{text "(set (bn (q \<bullet>\<^bsub>bn\<^esub> p)) \<FRESH>\<^sup>* c"} holds and such that @{text "[q \<bullet>\<^bsub>bn\<^esub> p]\<^bsub>list\<^esub>.(q \<bullet> t)"} |
|
2030 is equal to @{text "[p]\<^bsub>list\<^esub>. t"}. We can also show that @{text "(q \<bullet>\<^bsub>bn\<^esub> p) \<approx>\<^bsub>bn\<^esub> p"}. |
|
2031 These facts establish that @{text "Let (q \<bullet>\<^bsub>bn\<^esub> p) (p \<bullet> t) = Let p t"}, as we need. This |
|
2032 completes the non-trivial cases in \eqref{letpat} for strengthening the corresponding induction |
|
2033 principle. |
|
2034 |
|
2035 |
|
2036 |
|
2037 %A natural question is |
|
2038 %whether we can also strengthen the weak induction principles involving |
|
2039 %the general binders presented here. We will indeed be able to so, but for this we need an |
|
2040 %additional notion for permuting deep binders. |
|
2041 |
|
2042 %Given a binding function @{text "bn"} we define an auxiliary permutation |
|
2043 %operation @{text "_ \<bullet>\<^bsub>bn\<^esub> _"} which permutes only bound arguments in a deep binder. |
|
2044 %Assuming a clause of @{text bn} is given as |
|
2045 % |
|
2046 %\begin{center} |
|
2047 %@{text "bn (C x\<^isub>1 \<dots> x\<^isub>r) = rhs"}, |
|
2048 %\end{center} |
|
2049 |
|
2050 %\noindent |
|
2051 %then we define |
|
2052 % |
|
2053 %\begin{center} |
|
2054 %@{text "p \<bullet>\<^bsub>bn\<^esub> (C x\<^isub>1 \<dots> x\<^isub>r) \<equiv> C y\<^isub>1 \<dots> y\<^isub>r"} |
|
2055 %\end{center} |
|
2056 |
|
2057 %\noindent |
|
2058 %with @{text "y\<^isub>i"} determined as follows: |
|
2059 % |
|
2060 %\begin{center} |
|
2061 %\begin{tabular}{c@ {\hspace{2mm}}p{7cm}} |
|
2062 %$\bullet$ & @{text "y\<^isub>i \<equiv> x\<^isub>i"} provided @{text "x\<^isub>i"} does not occur in @{text "rhs"}\\ |
|
2063 %$\bullet$ & @{text "y\<^isub>i \<equiv> p \<bullet>\<^bsub>bn'\<^esub> x\<^isub>i"} provided @{text "bn' x\<^isub>i"} is in @{text "rhs"}\\ |
|
2064 %$\bullet$ & @{text "y\<^isub>i \<equiv> p \<bullet> x\<^isub>i"} otherwise |
|
2065 %\end{tabular} |
|
2066 %\end{center} |
|
2067 |
|
2068 %\noindent |
|
2069 %Using again the quotient package we can lift the @{text "_ \<bullet>\<^bsub>bn\<^esub> _"} function to |
|
2070 %$\alpha$-equated terms. We can then prove the following two facts |
|
2071 |
|
2072 %\begin{lemma}\label{permutebn} |
|
2073 %Given a binding function @{text "bn\<^sup>\<alpha>"} then for all @{text p} |
|
2074 %{\it (i)} @{text "p \<bullet> (bn\<^sup>\<alpha> x) = bn\<^sup>\<alpha> (p \<bullet>\<AL>\<^bsub>bn\<^esub> x)"} and {\it (ii)} |
|
2075 % @{text "fa_bn\<^isup>\<alpha> x = fa_bn\<^isup>\<alpha> (p \<bullet>\<AL>\<^bsub>bn\<^esub> x)"}. |
|
2076 %\end{lemma} |
|
2077 |
|
2078 %\begin{proof} |
|
2079 %By induction on @{text x}. The equations follow by simple unfolding |
|
2080 %of the definitions. |
|
2081 %\end{proof} |
|
2082 |
|
2083 %\noindent |
|
2084 %The first property states that a permutation applied to a binding function is |
|
2085 %equivalent to first permuting the binders and then calculating the bound |
|
2086 %atoms. The second amounts to the fact that permuting the binders has no |
|
2087 %effect on the free-atom function. The main point of this permutation |
|
2088 %function, however, is that if we have a permutation that is fresh |
|
2089 %for the support of an object @{text x}, then we can use this permutation |
|
2090 %to rename the binders in @{text x}, without ``changing'' @{text x}. In case of the |
|
2091 %@{text "Let"} term-constructor from the example shown |
|
2092 %in \eqref{letpat} this means for a permutation @{text "r"} |
|
2093 %% |
|
2094 %\begin{equation}\label{renaming} |
|
2095 %\begin{array}{l} |
|
2096 %\mbox{if @{term "supp (Abs_lst (bn p) t\<^isub>2) \<sharp>* r"}}\\ |
|
2097 %\qquad\mbox{then @{text "Let p t\<^isub>1 t\<^isub>2 = Let (r \<bullet>\<AL>\<^bsub>bn_pat\<^esub> p) t\<^isub>1 (r \<bullet> t\<^isub>2)"}} |
|
2098 %\end{array} |
|
2099 %\end{equation} |
|
2100 |
|
2101 %\noindent |
|
2102 %This fact will be crucial when establishing the strong induction principles below. |
|
2103 |
|
2104 |
|
2105 %In our running example about @{text "Let"}, the strong induction |
|
2106 %principle means that instead |
|
2107 %of establishing the implication |
|
2108 % |
|
2109 %\begin{center} |
|
2110 %@{text "\<forall>p t\<^isub>1 t\<^isub>2. P\<^bsub>pat\<^esub> p \<and> P\<^bsub>trm\<^esub> t\<^isub>1 \<and> P\<^bsub>trm\<^esub> t\<^isub>2 \<Rightarrow> P\<^bsub>trm\<^esub> (Let p t\<^isub>1 t\<^isub>2)"} |
|
2111 %\end{center} |
|
2112 % |
|
2113 %\noindent |
|
2114 %it is sufficient to establish the following implication |
|
2115 % |
|
2116 %\begin{equation}\label{strong} |
|
2117 %\mbox{\begin{tabular}{l} |
|
2118 %@{text "\<forall>p t\<^isub>1 t\<^isub>2 c."}\\ |
|
2119 %\hspace{5mm}@{text "set (bn p) #\<^sup>* c \<and>"}\\ |
|
2120 %\hspace{5mm}@{text "(\<forall>d. P\<^bsub>pat\<^esub> d p) \<and> (\<forall>d. P\<^bsub>trm\<^esub> d t\<^isub>1) \<and> (\<forall>d. P\<^bsub>trm\<^esub> d t\<^isub>2)"}\\ |
|
2121 %\hspace{15mm}@{text "\<Rightarrow> P\<^bsub>trm\<^esub> c (Let p t\<^isub>1 t\<^isub>2)"} |
|
2122 %\end{tabular}} |
|
2123 %\end{equation} |
|
2124 % |
|
2125 %\noindent |
|
2126 %While this implication contains an additional argument, namely @{text c}, and |
|
2127 %also additional universal quantifications, it is usually easier to establish. |
|
2128 %The reason is that we have the freshness |
|
2129 %assumption @{text "set (bn\<^sup>\<alpha> p) #\<^sup>* c"}, whereby @{text c} can be arbitrarily |
|
2130 %chosen by the user as long as it has finite support. |
|
2131 % |
|
2132 %Let us now show how we derive the strong induction principles from the |
|
2133 %weak ones. In case of the @{text "Let"}-example we derive by the weak |
|
2134 %induction the following two properties |
|
2135 % |
|
2136 %\begin{equation}\label{hyps} |
|
2137 %@{text "\<forall>q c. P\<^bsub>trm\<^esub> c (q \<bullet> t)"} \hspace{4mm} |
|
2138 %@{text "\<forall>q\<^isub>1 q\<^isub>2 c. P\<^bsub>pat\<^esub> (q\<^isub>1 \<bullet>\<AL>\<^bsub>bn\<^esub> (q\<^isub>2 \<bullet> p))"} |
|
2139 %\end{equation} |
|
2140 % |
|
2141 %\noindent |
|
2142 %For the @{text Let} term-constructor we therefore have to establish @{text "P\<^bsub>trm\<^esub> c (q \<bullet> Let p t\<^isub>1 t\<^isub>2)"} |
|
2143 %assuming \eqref{hyps} as induction hypotheses (the first for @{text t\<^isub>1} and @{text t\<^isub>2}). |
|
2144 %By Property~\ref{avoiding} we |
|
2145 %obtain a permutation @{text "r"} such that |
|
2146 % |
|
2147 %\begin{equation}\label{rprops} |
|
2148 %@{term "(r \<bullet> set (bn (q \<bullet> p))) \<sharp>* c "}\hspace{4mm} |
|
2149 %@{term "supp (Abs_lst (bn (q \<bullet> p)) (q \<bullet> t\<^isub>2)) \<sharp>* r"} |
|
2150 %\end{equation} |
|
2151 % |
|
2152 %\noindent |
|
2153 %hold. The latter fact and \eqref{renaming} give us |
|
2154 %% |
|
2155 %\begin{center} |
|
2156 %\begin{tabular}{l} |
|
2157 %@{text "Let (q \<bullet> p) (q \<bullet> t\<^isub>1) (q \<bullet> t\<^isub>2) ="} \\ |
|
2158 %\hspace{15mm}@{text "Let (r \<bullet>\<AL>\<^bsub>bn\<^esub> (q \<bullet> p)) (q \<bullet> t\<^isub>1) (r \<bullet> (q \<bullet> t\<^isub>2))"} |
|
2159 %\end{tabular} |
|
2160 %\end{center} |
|
2161 % |
|
2162 %\noindent |
|
2163 %So instead of proving @{text "P\<^bsub>trm\<^esub> c (q \<bullet> Let p t\<^isub>1 t\<^isub>2)"}, we can equally |
|
2164 %establish @{text "P\<^bsub>trm\<^esub> c (Let (r \<bullet>\<AL>\<^bsub>bn\<^esub> (q \<bullet> p)) (q \<bullet> t\<^isub>1) (r \<bullet> (q \<bullet> t\<^isub>2)))"}. |
|
2165 %To do so, we will use the implication \eqref{strong} of the strong induction |
|
2166 %principle, which requires us to discharge |
|
2167 %the following four proof obligations: |
|
2168 %% |
|
2169 %\begin{center} |
|
2170 %\begin{tabular}{rl} |
|
2171 %{\it (i)} & @{text "set (bn (r \<bullet>\<AL>\<^bsub>bn\<^esub> (q \<bullet> p))) #\<^sup>* c"}\\ |
|
2172 %{\it (ii)} & @{text "\<forall>d. P\<^bsub>pat\<^esub> d (r \<bullet>\<AL>\<^bsub>bn\<^esub> (q \<bullet> p))"}\\ |
|
2173 %{\it (iii)} & @{text "\<forall>d. P\<^bsub>trm\<^esub> d (q \<bullet> t\<^isub>1)"}\\ |
|
2174 %{\it (iv)} & @{text "\<forall>d. P\<^bsub>trm\<^esub> d (r \<bullet> (q \<bullet> t\<^isub>2))"}\\ |
|
2175 %\end{tabular} |
|
2176 %\end{center} |
|
2177 % |
|
2178 %\noindent |
|
2179 %The first follows from \eqref{rprops} and Lemma~\ref{permutebn}.{\it (i)}; the |
|
2180 %others from the induction hypotheses in \eqref{hyps} (in the fourth case |
|
2181 %we have to use the fact that @{term "(r \<bullet> (q \<bullet> t\<^isub>2)) = (r + q) \<bullet> t\<^isub>2"}). |
|
2182 % |
|
2183 %Taking now the identity permutation @{text 0} for the permutations in \eqref{hyps}, |
|
2184 %we can establish our original goals, namely @{text "P\<^bsub>trm\<^esub> c t"} and \mbox{@{text "P\<^bsub>pat\<^esub> c p"}}. |
|
2185 %This completes the proof showing that the weak induction principles imply |
|
2186 %the strong induction principles. |
|
2187 *} |
|
2188 |
|
2189 |
|
2190 section {* Related Work\label{related} *} |
|
2191 |
|
2192 text {* |
|
2193 To our knowledge the earliest usage of general binders in a theorem prover |
|
2194 is described in \cite{NaraschewskiNipkow99} about a formalisation of the |
|
2195 algorithm W. This formalisation implements binding in type-schemes using a |
|
2196 de-Bruijn indices representation. Since type-schemes in W contain only a single |
|
2197 place where variables are bound, different indices do not refer to different binders (as in the usual |
|
2198 de-Bruijn representation), but to different bound variables. A similar idea |
|
2199 has been recently explored for general binders in the locally nameless |
|
2200 approach to binding \cite{chargueraud09}. There, de-Bruijn indices consist |
|
2201 of two numbers, one referring to the place where a variable is bound, and the |
|
2202 other to which variable is bound. The reasoning infrastructure for both |
|
2203 representations of bindings comes for free in theorem provers like Isabelle/HOL or |
|
2204 Coq, since the corresponding term-calculi can be implemented as ``normal'' |
|
2205 datatypes. However, in both approaches it seems difficult to achieve our |
|
2206 fine-grained control over the ``semantics'' of bindings (i.e.~whether the |
|
2207 order of binders should matter, or vacuous binders should be taken into |
|
2208 account). %To do so, one would require additional predicates that filter out |
|
2209 %unwanted terms. Our guess is that such predicates result in rather |
|
2210 %intricate formal reasoning. |
|
2211 |
|
2212 Another technique for representing binding is higher-order abstract syntax |
|
2213 (HOAS). %, which for example is implemented in the Twelf system. |
|
2214 This %%representation |
|
2215 technique supports very elegantly many aspects of \emph{single} binding, and |
|
2216 impressive work has been done that uses HOAS for mechanising the metatheory |
|
2217 of SML~\cite{LeeCraryHarper07}. We are, however, not aware how multiple |
|
2218 binders of SML are represented in this work. Judging from the submitted |
|
2219 Twelf-solution for the POPLmark challenge, HOAS cannot easily deal with |
|
2220 binding constructs where the number of bound variables is not fixed. %For example |
|
2221 In the second part of this challenge, @{text "Let"}s involve |
|
2222 patterns that bind multiple variables at once. In such situations, HOAS |
|
2223 seems to have to resort to the iterated-single-binders-approach with |
|
2224 all the unwanted consequences when reasoning about the resulting terms. |
|
2225 |
|
2226 %Two formalisations involving general binders have been |
|
2227 %performed in older |
|
2228 %versions of Nominal Isabelle (one about Psi-calculi and one about algorithm W |
|
2229 %\cite{BengtsonParow09,UrbanNipkow09}). Both |
|
2230 %use the approach based on iterated single binders. Our experience with |
|
2231 %the latter formalisation has been disappointing. The major pain arose from |
|
2232 %the need to ``unbind'' variables. This can be done in one step with our |
|
2233 %general binders described in this paper, but needs a cumbersome |
|
2234 %iteration with single binders. The resulting formal reasoning turned out to |
|
2235 %be rather unpleasant. The hope is that the extension presented in this paper |
|
2236 %is a substantial improvement. |
|
2237 |
|
2238 The most closely related work to the one presented here is the Ott-tool |
|
2239 \cite{ott-jfp} and the C$\alpha$ml language \cite{Pottier06}. Ott is a nifty |
|
2240 front-end for creating \LaTeX{} documents from specifications of |
|
2241 term-calculi involving general binders. For a subset of the specifications |
|
2242 Ott can also generate theorem prover code using a raw representation of |
|
2243 terms, and in Coq also a locally nameless representation. The developers of |
|
2244 this tool have also put forward (on paper) a definition for |
|
2245 $\alpha$-equivalence of terms that can be specified in Ott. This definition is |
|
2246 rather different from ours, not using any nominal techniques. To our |
|
2247 knowledge there is no concrete mathematical result concerning this |
|
2248 notion of $\alpha$-equivalence. Also the definition for the |
|
2249 notion of free variables |
|
2250 is work in progress. |
|
2251 |
|
2252 Although we were heavily inspired by the syntax of Ott, |
|
2253 its definition of $\alpha$-equi\-valence is unsuitable for our extension of |
|
2254 Nominal Isabelle. First, it is far too complicated to be a basis for |
|
2255 automated proofs implemented on the ML-level of Isabelle/HOL. Second, it |
|
2256 covers cases of binders depending on other binders, which just do not make |
|
2257 sense for our $\alpha$-equated terms. Third, it allows empty types that have no |
|
2258 meaning in a HOL-based theorem prover. We also had to generalise slightly Ott's |
|
2259 binding clauses. In Ott you specify binding clauses with a single body; we |
|
2260 allow more than one. We have to do this, because this makes a difference |
|
2261 for our notion of $\alpha$-equivalence in case of \isacommand{bind (set)} and |
|
2262 \isacommand{bind (set+)}. |
|
2263 % |
|
2264 %Consider the examples |
|
2265 % |
|
2266 %\begin{center} |
|
2267 %\begin{tabular}{@ {}l@ {\hspace{2mm}}l@ {}} |
|
2268 %@{text "Foo\<^isub>1 xs::name fset t::trm s::trm"} & |
|
2269 % \isacommand{bind (set)} @{text "xs"} \isacommand{in} @{text "t s"}\\ |
|
2270 %@{text "Foo\<^isub>2 xs::name fset t::trm s::trm"} & |
|
2271 % \isacommand{bind (set)} @{text "xs"} \isacommand{in} @{text "t"}, |
|
2272 % \isacommand{bind (set)} @{text "xs"} \isacommand{in} @{text "s"}\\ |
|
2273 %\end{tabular} |
|
2274 %\end{center} |
|
2275 % |
|
2276 %\noindent |
|
2277 %In the first term-constructor we have a single |
|
2278 %body that happens to be ``spread'' over two arguments; in the second term-constructor we have |
|
2279 %two independent bodies in which the same variables are bound. As a result we |
|
2280 %have |
|
2281 % |
|
2282 %\begin{center} |
|
2283 %\begin{tabular}{r@ {\hspace{1.5mm}}c@ {\hspace{1.5mm}}l} |
|
2284 %@{text "Foo\<^isub>1 {a, b} (a, b) (a, b)"} & $\not=$ & |
|
2285 %@{text "Foo\<^isub>1 {a, b} (a, b) (b, a)"}\\ |
|
2286 %@{text "Foo\<^isub>2 {a, b} (a, b) (a, b)"} & $=$ & |
|
2287 %@{text "Foo\<^isub>2 {a, b} (a, b) (b, a)"}\\ |
|
2288 %\end{tabular} |
|
2289 %\end{center} |
|
2290 % |
|
2291 %\noindent |
|
2292 %and therefore need the extra generality to be able to distinguish between |
|
2293 %both specifications. |
|
2294 Because of how we set up our definitions, we also had to impose some restrictions |
|
2295 (like a single binding function for a deep binder) that are not present in Ott. |
|
2296 %Our |
|
2297 %expectation is that we can still cover many interesting term-calculi from |
|
2298 %programming language research, for example Core-Haskell. |
|
2299 |
|
2300 Pottier presents in \cite{Pottier06} a language, called C$\alpha$ml, for |
|
2301 representing terms with general binders inside OCaml. This language is |
|
2302 implemented as a front-end that can be translated to OCaml with the help of |
|
2303 a library. He presents a type-system in which the scope of general binders |
|
2304 can be specified using special markers, written @{text "inner"} and |
|
2305 @{text "outer"}. It seems our and his specifications can be |
|
2306 inter-translated as long as ours use the binding mode |
|
2307 \isacommand{bind} only. |
|
2308 However, we have not proved this. Pottier gives a definition for |
|
2309 $\alpha$-equivalence, which also uses a permutation operation (like ours). |
|
2310 Still, this definition is rather different from ours and he only proves that |
|
2311 it defines an equivalence relation. A complete |
|
2312 reasoning infrastructure is well beyond the purposes of his language. |
|
2313 Similar work for Haskell with similar results was reported by Cheney \cite{Cheney05a}. |
|
2314 |
|
2315 In a slightly different domain (programming with dependent types), the |
|
2316 paper \cite{Altenkirch10} presents a calculus with a notion of |
|
2317 $\alpha$-equivalence related to our binding mode \isacommand{bind (set+)}. |
|
2318 The definition in \cite{Altenkirch10} is similar to the one by Pottier, except that it |
|
2319 has a more operational flavour and calculates a partial (renaming) map. |
|
2320 In this way, the definition can deal with vacuous binders. However, to our |
|
2321 best knowledge, no concrete mathematical result concerning this |
|
2322 definition of $\alpha$-equivalence has been proved.\\[-7mm] |
|
2323 *} |
|
2324 |
|
2325 section {* Conclusion *} |
|
2326 |
|
2327 text {* |
|
2328 We have presented an extension of Nominal Isabelle for dealing with |
|
2329 general binders, that is term-constructors having multiple bound |
|
2330 variables. For this extension we introduced new definitions of |
|
2331 $\alpha$-equivalence and automated all necessary proofs in Isabelle/HOL. |
|
2332 To specify general binders we used the specifications from Ott, but extended them |
|
2333 in some places and restricted |
|
2334 them in others so that they make sense in the context of $\alpha$-equated terms. |
|
2335 We also introduced two binding modes (set and set+) that do not |
|
2336 exist in Ott. |
|
2337 We have tried out the extension with calculi such as Core-Haskell, type-schemes |
|
2338 and approximately a dozen of other typical examples from programming |
|
2339 language research~\cite{SewellBestiary}. |
|
2340 %The code |
|
2341 %will eventually become part of the next Isabelle distribution.\footnote{For the moment |
|
2342 %it can be downloaded from the Mercurial repository linked at |
|
2343 %\href{http://isabelle.in.tum.de/nominal/download} |
|
2344 %{http://isabelle.in.tum.de/nominal/download}.} |
|
2345 |
|
2346 We have left out a discussion about how functions can be defined over |
|
2347 $\alpha$-equated terms involving general binders. In earlier versions of Nominal |
|
2348 Isabelle this turned out to be a thorny issue. We |
|
2349 hope to do better this time by using the function package that has recently |
|
2350 been implemented in Isabelle/HOL and also by restricting function |
|
2351 definitions to equivariant functions (for them we can |
|
2352 provide more automation). |
|
2353 |
|
2354 %There are some restrictions we imposed in this paper that we would like to lift in |
|
2355 %future work. One is the exclusion of nested datatype definitions. Nested |
|
2356 %datatype definitions allow one to specify, for instance, the function kinds |
|
2357 %in Core-Haskell as @{text "TFun string (ty list)"} instead of the unfolded |
|
2358 %version @{text "TFun string ty_list"} (see Figure~\ref{nominalcorehas}). To |
|
2359 %achieve this, we need a slightly more clever implementation than we have at the moment. |
|
2360 |
|
2361 %A more interesting line of investigation is whether we can go beyond the |
|
2362 %simple-minded form of binding functions that we adopted from Ott. At the moment, binding |
|
2363 %functions can only return the empty set, a singleton atom set or unions |
|
2364 %of atom sets (similarly for lists). It remains to be seen whether |
|
2365 %properties like |
|
2366 %% |
|
2367 %\begin{center} |
|
2368 %@{text "fa_ty x = bn x \<union> fa_bn x"}. |
|
2369 %\end{center} |
|
2370 % |
|
2371 %\noindent |
|
2372 %allow us to support more interesting binding functions. |
|
2373 % |
|
2374 %We have also not yet played with other binding modes. For example we can |
|
2375 %imagine that there is need for a binding mode |
|
2376 %where instead of lists, we abstract lists of distinct elements. |
|
2377 %Once we feel confident about such binding modes, our implementation |
|
2378 %can be easily extended to accommodate them. |
|
2379 % |
|
2380 \smallskip |
|
2381 \noindent |
|
2382 {\bf Acknowledgements:} %We are very grateful to Andrew Pitts for |
|
2383 %many discussions about Nominal Isabelle. |
|
2384 We thank Peter Sewell for |
|
2385 making the informal notes \cite{SewellBestiary} available to us and |
|
2386 also for patiently explaining some of the finer points of the Ott-tool.\\[-7mm] |
|
2387 %Stephanie Weirich suggested to separate the subgrammars |
|
2388 %of kinds and types in our Core-Haskell example. \\[-6mm] |
|
2389 *} |
|
2390 |
|
2391 |
|
2392 (*<*) |
|
2393 end |
|
2394 (*>*) |
|