4
|
1 |
theory Parsing
|
38
|
2 |
imports Base
|
4
|
3 |
|
|
4 |
begin
|
|
5 |
|
42
|
6 |
|
4
|
7 |
chapter {* Parsing *}
|
|
8 |
|
|
9 |
text {*
|
|
10 |
|
38
|
11 |
Isabelle distinguishes between \emph{outer} and \emph{inner} syntax.
|
|
12 |
Theory commands, such as \isacommand{definition}, \isacommand{inductive} and so
|
|
13 |
on, belong to the outer syntax, whereas items inside double quotation marks, such
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
14 |
as terms, types and so on, belong to the inner syntax. For parsing inner syntax,
|
38
|
15 |
Isabelle uses a rather general and sophisticated algorithm due to Earley, which
|
|
16 |
is driven by priority grammars. Parsers for outer syntax are built up by functional
|
|
17 |
parsing combinators. These combinators are a well-established technique for parsing,
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
18 |
which has, for example, been described in Paulson's classic ML-book \cite{paulson-ml2}.
|
42
|
19 |
Isabelle developers are usually concerned with writing these outer syntax parsers,
|
|
20 |
either for new definitional packages or for calling tactics with specific arguments.
|
|
21 |
|
|
22 |
\begin{readmore}
|
|
23 |
The library
|
50
|
24 |
for writing parser combinators is split up, roughly, into two parts.
|
38
|
25 |
The first part consists of a collection of generic parser combinators defined
|
|
26 |
in the structure @{ML_struct Scan} in the file
|
|
27 |
@{ML_file "Pure/General/scan.ML"}. The second part of the library consists of
|
|
28 |
combinators for dealing with specific token types, which are defined in the
|
|
29 |
structure @{ML_struct OuterParse} in the file
|
|
30 |
@{ML_file "Pure/Isar/outer_parse.ML"}.
|
42
|
31 |
\end{readmore}
|
38
|
32 |
|
|
33 |
*}
|
|
34 |
|
49
|
35 |
section {* Building Generic Parsers *}
|
38
|
36 |
|
|
37 |
text {*
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
38 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
39 |
Let us first have a look at parsing strings using generic parsing combinators.
|
50
|
40 |
The function @{ML "(op $$)"} takes a string as argument and will ``consume'' this string from
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
41 |
a given input list of strings. ``Consume'' in this context means that it will
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
42 |
return a pair consisting of this string and the rest of the input list.
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
43 |
For example:
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
44 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
45 |
@{ML_response [display] "($$ \"h\") (explode \"hello\")" "(\"h\", [\"e\", \"l\", \"l\", \"o\"])"}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
46 |
@{ML_response [display] "($$ \"w\") (explode \"world\")" "(\"w\", [\"o\", \"r\", \"l\", \"d\"])"}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
47 |
|
52
|
48 |
This function will either succeed (as in the two examples above) or raise the exception
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
49 |
@{ML_text "FAIL"} if no string can be consumed. For example trying to parse
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
50 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
51 |
@{ML_response_fake [display] "($$ \"x\") (explode \"world\")"
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
52 |
"Exception FAIL raised"}
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
53 |
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
54 |
will raise the exception @{ML_text "FAIL"}.
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
55 |
There are three exceptions used in the parsing combinators:
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
56 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
57 |
\begin{itemize}
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
58 |
\item @{ML_text "FAIL"} is used to indicate that alternative routes of parsing
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
59 |
might be explored.
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
60 |
\item @{ML_text "MORE"} indicates that there is not enough input for the parser. For example
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
61 |
in @{ML_text "($$ \"h\") []"}.
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
62 |
\item @{ML_text "ABORT"} is the exception which is raised when a dead end is reached.
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
63 |
It is used for example in the function @{ML "(op !!)"} (see below).
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
64 |
\end{itemize}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
65 |
|
50
|
66 |
However, note that these exceptions are private to the parser and cannot be accessed
|
49
|
67 |
by the programmer (for example to handle them).
|
|
68 |
|
|
69 |
Slightly more general than the parser @{ML "(op $$)"} is the function @{ML
|
|
70 |
Scan.one}, in that it takes a predicate as argument and then parses exactly
|
52
|
71 |
one item from the input list satisfying this predicate. For example the
|
49
|
72 |
following parser either consumes an @{ML_text [quotes] "h"} or a @{ML_text
|
|
73 |
[quotes] "w"}:
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
74 |
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
75 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
76 |
@{ML_response [display]
|
40
|
77 |
"let
|
|
78 |
val hw = Scan.one (fn x => x = \"h\" orelse x = \"w\")
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
79 |
val input1 = (explode \"hello\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
80 |
val input2 = (explode \"world\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
81 |
in
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
82 |
(hw input1, hw input2)
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
83 |
end"
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
84 |
"((\"h\", [\"e\", \"l\", \"l\", \"o\"]),(\"w\", [\"o\", \"r\", \"l\", \"d\"]))"}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
85 |
|
52
|
86 |
Two parser can be connected in sequence by using the function @{ML "(op --)"}.
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
87 |
For example parsing @{ML_text "h"}, @{ML_text "e"} and @{ML_text "l"} in this
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
88 |
sequence can be achieved by
|
38
|
89 |
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
90 |
@{ML_response [display] "(($$ \"h\") -- ($$ \"e\") -- ($$ \"l\")) (explode \"hello\")"
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
91 |
"(((\"h\", \"e\"), \"l\"), [\"l\", \"o\"])"}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
92 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
93 |
Note how the result of consumed strings builds up on the left as nested pairs.
|
38
|
94 |
|
|
95 |
Parsers that explore
|
|
96 |
alternatives can be constructed using the function @{ML "(op ||)"}. For example, the
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
97 |
parser @{ML "(p || q)" for p q} returns the result of @{ML_text "p"}, in case it succeeds,
|
38
|
98 |
otherwise it returns the result of @{ML_text "q"}. For example
|
|
99 |
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
100 |
@{ML_response [display]
|
40
|
101 |
"let
|
|
102 |
val hw = ($$ \"h\") || ($$ \"w\")
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
103 |
val input1 = (explode \"hello\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
104 |
val input2 = (explode \"world\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
105 |
in
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
106 |
(hw input1, hw input2)
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
107 |
end"
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
108 |
"((\"h\", [\"e\", \"l\", \"l\", \"o\"]), (\"w\", [\"o\", \"r\", \"l\", \"d\"]))"}
|
38
|
109 |
|
52
|
110 |
The functions @{ML "(op |--)"} and @{ML "(op --|)"} work like the sequencing function
|
50
|
111 |
for parsers, except that they discard the item being parsed by the first (respectively second)
|
38
|
112 |
parser. For example
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
113 |
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
114 |
@{ML_response [display]
|
40
|
115 |
"let
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
116 |
val just_e = ($$ \"h\") |-- ($$ \"e\")
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
117 |
val just_h = ($$ \"h\") --| ($$ \"e\")
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
118 |
val input = (explode \"hello\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
119 |
in
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
120 |
(just_e input, just_h input)
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
121 |
end"
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
122 |
"((\"e\", [\"l\", \"l\", \"o\"]),(\"h\", [\"l\", \"l\", \"o\"]))"}
|
38
|
123 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
124 |
The parser @{ML "Scan.optional p x" for p x} returns the result of the parser
|
38
|
125 |
@{ML_text "p"}, if it succeeds; otherwise it returns
|
|
126 |
the default value @{ML_text "x"}. For example
|
|
127 |
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
128 |
@{ML_response [display]
|
40
|
129 |
"let
|
|
130 |
val p = Scan.optional ($$ \"h\") \"x\"
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
131 |
val input1 = (explode \"hello\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
132 |
val input2 = (explode \"world\")
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
133 |
in
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
134 |
(p input1, p input2)
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
135 |
end"
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
136 |
"((\"h\", [\"e\", \"l\", \"l\", \"o\"]), (\"x\", [\"w\", \"o\", \"r\", \"l\", \"d\"]))"}
|
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
137 |
|
49
|
138 |
The function @{ML Scan.option} works similarly, except no default value can
|
50
|
139 |
be given. Instead, the result is wrapped as an @{text "option"}-type. For example:
|
|
140 |
|
|
141 |
@{ML_response [display]
|
|
142 |
"let
|
|
143 |
val p = Scan.option ($$ \"h\")
|
|
144 |
val input1 = (explode \"hello\")
|
|
145 |
val input2 = (explode \"world\")
|
|
146 |
in
|
|
147 |
(p input1, p input2)
|
|
148 |
end" "((SOME \"h\", [\"e\", \"l\", \"l\", \"o\"]), (NONE, [\"w\", \"o\", \"r\", \"l\", \"d\"]))"}
|
49
|
149 |
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
150 |
The function @{ML "(op !!)"} helps to produce appropriate error messages
|
43
02f76f1b6e7b
added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
151 |
during parsing. For example if one wants to parse that @{ML_text p} is immediately
|
40
|
152 |
followed by @{ML_text q}, or start a completely different parser @{ML_text r},
|
|
153 |
one might write
|
|
154 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
155 |
@{ML [display] "(p -- q) || r" for p q r}
|
40
|
156 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
157 |
However, this parser is problematic for producing an appropriate error message, in case
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
158 |
the parsing of @{ML "(p -- q)" for p q} fails. Because in that case one loses with the parser
|
42
|
159 |
above the information
|
|
160 |
that @{ML_text p} should be followed by @{ML_text q}. To see this consider the case in
|
|
161 |
which @{ML_text p}
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
162 |
is present in the input, but not @{ML_text q}. That means @{ML "(p -- q)" for p q} will fail
|
42
|
163 |
and the
|
52
|
164 |
alternative parser @{ML_text r} will be tried. However in many circumstance this will be the wrong
|
43
02f76f1b6e7b
added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
165 |
parser for the input ``p-followed-by-q'' and therefore will also fail. The error message is then
|
02f76f1b6e7b
added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
166 |
caused by the
|
52
|
167 |
failure of @{ML_text r}, not by the absence of @{ML_text q} in the input. This kind of situation
|
|
168 |
can be avoided by using the function @{ML "(op !!)"}. This function aborts the whole process of
|
50
|
169 |
parsing in case of a failure and invokes an error message. For example if we invoke the parser
|
40
|
170 |
|
|
171 |
@{ML [display] "(!! (fn _ => \"foo\") ($$ \"h\"))"}
|
|
172 |
|
42
|
173 |
on @{ML_text [quotes] "hello"}, the parsing succeeds
|
39
631d12c25bde
substantial changes to the antiquotations (preliminary version)
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
174 |
|
40
|
175 |
@{ML_response [display]
|
|
176 |
"(!! (fn _ => \"foo\") ($$ \"h\")) (explode \"hello\")"
|
|
177 |
"(\"h\", [\"e\", \"l\", \"l\", \"o\"])"}
|
|
178 |
|
42
|
179 |
but if we invoke it on @{ML_text [quotes] "world"}
|
40
|
180 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
181 |
@{ML_response_fake [display] "(!! (fn _ => \"foo\") ($$ \"h\")) (explode \"world\")"
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
182 |
"Exception ABORT raised"}
|
40
|
183 |
|
|
184 |
the parsing aborts and the error message @{ML_text "foo"} is printed out. In order to
|
|
185 |
see the error message properly, we need to prefix the parser with the function
|
|
186 |
@{ML "Scan.error"}. For example
|
|
187 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
188 |
@{ML_response_fake [display] "Scan.error ((!! (fn _ => \"foo\") ($$ \"h\")))"
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
189 |
"Exception Error \"foo\" raised"}
|
40
|
190 |
|
|
191 |
This ``prefixing'' is usually done by wrappers such as @{ML "OuterSyntax.command"}
|
42
|
192 |
(FIXME: give reference to later place).
|
40
|
193 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
194 |
Returning to our example of parsing @{ML "(p -- q) || r" for p q r}. If we want
|
42
|
195 |
to generate the correct error message for p-followed-by-q, then
|
50
|
196 |
we have to write:
|
38
|
197 |
*}
|
|
198 |
|
42
|
199 |
ML {*
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
200 |
fun p_followed_by_q p q r =
|
40
|
201 |
let
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
202 |
val err = (fn _ => p ^ " is not followed by " ^ q)
|
40
|
203 |
in
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
204 |
(($$ p) -- (!! err ($$ q))) || (($$ r) -- ($$ r))
|
40
|
205 |
end
|
38
|
206 |
*}
|
|
207 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
208 |
|
40
|
209 |
text {*
|
|
210 |
Running this parser with
|
|
211 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
212 |
@{ML_response_fake [display] "Scan.error (p_followed_by_q \"h\" \"e\" \"w\") (explode \"holle\")"
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
213 |
"Exception ERROR \"h is not followed by e\" raised"}
|
40
|
214 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
215 |
gives the correct error message. Running it with
|
40
|
216 |
|
|
217 |
@{ML_response [display] "Scan.error (p_followed_by_q \"h\" \"e\" \"w\") (explode \"wworld\")"
|
|
218 |
"((\"w\", \"w\"), [\"o\", \"r\", \"l\", \"d\"])"}
|
|
219 |
|
|
220 |
yields the expected parsing.
|
38
|
221 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
222 |
The function @{ML "Scan.repeat p" for p} will apply a parser @{ML_text p} as
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
223 |
often as it succeeds. For example
|
40
|
224 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
225 |
@{ML_response [display] "Scan.repeat ($$ \"h\") (explode \"hhhhello\")"
|
40
|
226 |
"([\"h\", \"h\", \"h\", \"h\"], [\"e\", \"l\", \"l\", \"o\"])"}
|
|
227 |
|
|
228 |
Note that @{ML "Scan.repeat"} stores the parsed items in a list. The function
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
229 |
@{ML "Scan.repeat1"} is similar, but requires that the parser @{ML_text "p"}
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
230 |
succeeds at least once.
|
48
|
231 |
|
49
|
232 |
Also note that the parser would have aborted with the exception @{ML_text MORE}, if
|
50
|
233 |
we had run it only on just @{ML_text [quotes] "hhhh"}. This can be avoided using
|
49
|
234 |
the wrapper @{ML Scan.finite} and the ``stopper-token'' @{ML Symbol.stopper}. With
|
|
235 |
them we can write
|
|
236 |
|
|
237 |
@{ML_response [display] "Scan.finite Symbol.stopper (Scan.repeat ($$ \"h\")) (explode \"hhhh\")"
|
|
238 |
"([\"h\", \"h\", \"h\", \"h\"], [])"}
|
|
239 |
|
50
|
240 |
However, this kind of manually wrapping needs to be done only very rarely
|
|
241 |
in practise, because it is already done by the infrastructure for you.
|
49
|
242 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
243 |
After parsing succeeded, one nearly always wants to apply a function on the parsed
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
244 |
items. This is done using the function @{ML "(p >> f)" for p f} which runs
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
245 |
first the parser @{ML_text p} and upon successful completion applies the
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
246 |
function @{ML_text f} to the result. For example
|
38
|
247 |
|
40
|
248 |
@{ML_response [display]
|
|
249 |
"let
|
|
250 |
fun double (x,y) = (x^x,y^y)
|
|
251 |
in
|
|
252 |
(($$ \"h\") -- ($$ \"e\") >> double) (explode \"hello\")
|
|
253 |
end"
|
|
254 |
"((\"hh\", \"ee\"), [\"l\", \"l\", \"o\"])"}
|
|
255 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
256 |
doubles the two parsed input strings.
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
257 |
|
40
|
258 |
The function @{ML Scan.lift} takes a parser and a pair as arguments. This function applies
|
|
259 |
the given parser to the second component of the pair and leaves the first component
|
|
260 |
untouched. For example
|
38
|
261 |
|
40
|
262 |
@{ML_response [display]
|
|
263 |
"Scan.lift (($$ \"h\") -- ($$ \"e\")) (1,(explode \"hello\"))"
|
|
264 |
"((\"h\", \"e\"), (1, [\"l\", \"l\", \"o\"]))"}
|
|
265 |
|
43
02f76f1b6e7b
added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
266 |
(FIXME: In which situations is this useful? Give examples.)
|
40
|
267 |
*}
|
|
268 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
269 |
section {* Parsing Theory Syntax *}
|
38
|
270 |
|
40
|
271 |
text {*
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
272 |
Most of the time, however, Isabelle developers have to deal with parsing tokens, not strings.
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
273 |
This is because the parsers for the theory syntax, as well as the parsers for the
|
43
02f76f1b6e7b
added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
274 |
argument syntax of proof methods and attributes use the type
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
275 |
@{ML_type OuterLex.token} (which is identical to the type @{ML_type OuterParse.token}).
|
42
|
276 |
|
|
277 |
\begin{readmore}
|
40
|
278 |
The parser functions for the theory syntax are contained in the structure
|
42
|
279 |
@{ML_struct OuterParse} defined in the file @{ML_file "Pure/Isar/outer_parse.ML"}.
|
|
280 |
The definition for tokens is in the file @{ML_file "Pure/Isar/outer_lex.ML"}.
|
|
281 |
\end{readmore}
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
282 |
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
283 |
The structure @{ML_struct OuterLex} defines several kinds of token (for example
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
284 |
@{ML "Ident" in OuterLex} for identifiers, @{ML "Keyword" in OuterLex} for keywords and
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
285 |
@{ML "Command" in OuterLex} for commands). Some token parsers take into account the
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
286 |
kind of token.
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
287 |
*}
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
288 |
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
289 |
text {*
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
290 |
For the examples below, we can generate a token list out of a string using
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
291 |
the function @{ML "OuterSyntax.scan"}, which we give below @{ML
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
292 |
"Position.none"} as argument since, at the moment, we are not interested in
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
293 |
generating precise error messages. The following
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
294 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
295 |
|
49
|
296 |
@{ML_response_fake [display] "OuterSyntax.scan Position.none \"hello world\""
|
50
|
297 |
"[Token (\<dots>,(Ident, \"hello\"),\<dots>),
|
|
298 |
Token (\<dots>,(Space, \" \"),\<dots>),
|
|
299 |
Token (\<dots>,(Ident, \"world\"),\<dots>)]"}
|
|
300 |
|
|
301 |
produces three tokens where the first and the last are identifiers, since
|
|
302 |
@{ML_text [quotes] "hello"} and @{ML_text [quotes] "world"} do not match any
|
|
303 |
other syntactic category.\footnote{Note that because of a possible a bug in
|
|
304 |
the PolyML runtime system the result is printed as @{text "?"}, instead of
|
|
305 |
the token.} The second indicates a space.
|
|
306 |
|
|
307 |
Many parsing functions later on will require spaces, comments and the like
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
308 |
to have already been filtered out. So from now on we are going to use the
|
50
|
309 |
functions @{ML filter} and @{ML OuterLex.is_proper} do this. For example
|
|
310 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
311 |
|
50
|
312 |
@{ML_response_fake [display]
|
|
313 |
"let
|
|
314 |
val input = OuterSyntax.scan Position.none \"hello world\"
|
|
315 |
in
|
|
316 |
filter OuterLex.is_proper input
|
|
317 |
end"
|
|
318 |
"[Token (\<dots>,(Ident, \"hello\"), \<dots>), Token (\<dots>,(Ident, \"world\"), \<dots>)]"}
|
|
319 |
|
|
320 |
For convenience we are going to use the function
|
|
321 |
|
|
322 |
*}
|
|
323 |
|
|
324 |
ML {*
|
|
325 |
fun filtered_input str =
|
|
326 |
filter OuterLex.is_proper (OuterSyntax.scan Position.none str)
|
|
327 |
*}
|
|
328 |
|
|
329 |
text {*
|
|
330 |
|
48
|
331 |
If we parse
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
332 |
|
50
|
333 |
@{ML_response_fake [display]
|
|
334 |
"filtered_input \"inductive | for\""
|
|
335 |
"[Token (\<dots>,(Command, \"inductive\"),\<dots>),
|
|
336 |
Token (\<dots>,(Keyword, \"|\"),\<dots>),
|
|
337 |
Token (\<dots>,(Keyword, \"for\"),\<dots>)]"}
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
338 |
|
52
|
339 |
we obtain a list consisting of only a command and two keyword tokens.
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
340 |
If you want to see which keywords and commands are currently known, use
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
341 |
the following (you might have to adjust the @{ML print_depth} in order to
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
342 |
see the complete list):
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
343 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
344 |
@{ML_response_fake [display]
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
345 |
"let
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
346 |
val (keywords, commands) = OuterKeyword.get_lexicons ()
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
347 |
in
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
348 |
(Scan.dest_lexicon commands, Scan.dest_lexicon keywords)
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
349 |
end"
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
350 |
"([\"}\",\"{\",\<dots>],[\"\<rightleftharpoons>\",\"\<leftharpoondown>\",\<dots>])"}
|
42
|
351 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
352 |
Now the parser @{ML "OuterParse.$$$"} parses a single keyword. For example
|
50
|
353 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
354 |
@{ML_response [display]
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
355 |
"let
|
50
|
356 |
val input1 = filtered_input \"where for\"
|
|
357 |
val input2 = filtered_input \"| in\"
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
358 |
in
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
359 |
(OuterParse.$$$ \"where\" input1, OuterParse.$$$ \"|\" input2)
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
360 |
end"
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
361 |
"((\"where\",\<dots>),(\"|\",\<dots>))"}
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
362 |
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
363 |
Like before, we can sequentially connect parsers with @{ML "(op --)"}. For example
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
364 |
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
365 |
@{ML_response [display]
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
366 |
"let
|
50
|
367 |
val input = filtered_input \"| in\"
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
368 |
in
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
369 |
(OuterParse.$$$ \"|\" -- OuterParse.$$$ \"in\") input
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
370 |
end"
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
371 |
"((\"|\",\"in\"),[])"}
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
372 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
373 |
The parser @{ML "OuterParse.enum s p" for s p} parses a possibly empty
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
374 |
list of items recognised by the parser @{ML_text p}, where the items being parsed
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
375 |
are separated by the string @{ML_text s}. For example
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
376 |
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
377 |
@{ML_response [display]
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
378 |
"let
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
379 |
val input = filtered_input \"in | in | in foo\"
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
380 |
in
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
381 |
(OuterParse.enum \"|\" (OuterParse.$$$ \"in\")) input
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
382 |
end"
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
383 |
"([\"in\",\"in\",\"in\"],[\<dots>])"}
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
384 |
|
50
|
385 |
@{ML "OuterParse.enum1"} works similarly, except that the parsed list must
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
386 |
be non-empty. Note that we had to add an @{ML_text [quotes] "foo"} at the end
|
50
|
387 |
of the parsed string, otherwise the parser would have consumed all tokens
|
|
388 |
and then failed with the exception @{ML_text "MORE"}. Like in the previous
|
|
389 |
section, we can avoid this exception using the wrapper @{ML
|
|
390 |
Scan.finite}. This time, however, we have to use the ``stopper-token'' @{ML
|
|
391 |
OuterLex.stopper}. We can write
|
49
|
392 |
|
|
393 |
@{ML_response [display]
|
|
394 |
"let
|
50
|
395 |
val input = filtered_input \"in | in | in\"
|
49
|
396 |
in
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
397 |
Scan.finite OuterLex.stopper
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
398 |
(OuterParse.enum \"|\" (OuterParse.$$$ \"in\")) input
|
49
|
399 |
end"
|
|
400 |
"([\"in\",\"in\",\"in\"],[])"}
|
|
401 |
|
54
|
402 |
The following function will help us later to run examples
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
403 |
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
404 |
*}
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
405 |
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
406 |
ML {*
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
407 |
fun parse p input = Scan.finite OuterLex.stopper (Scan.error p) input
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
408 |
*}
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
409 |
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
410 |
text {*
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
411 |
|
49
|
412 |
The function @{ML "OuterParse.!!!"} can be used to force termination of the
|
|
413 |
parser in case of a dead end, just like @{ML "Scan.!!"} (see previous section),
|
|
414 |
except that the error message is fixed to be @{text [quotes] "Outer syntax error"}
|
|
415 |
with a relatively precise description of the failure. For example:
|
|
416 |
|
|
417 |
@{ML_response_fake [display]
|
|
418 |
"let
|
50
|
419 |
val input = filtered_input \"in |\"
|
49
|
420 |
val parse_bar_then_in = OuterParse.$$$ \"|\" -- OuterParse.$$$ \"in\"
|
|
421 |
in
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
422 |
parse (OuterParse.!!! parse_bar_then_in) input
|
49
|
423 |
end"
|
|
424 |
"Exception ERROR \"Outer syntax error: keyword \"|\" expected,
|
|
425 |
but keyword in was found\" raised"
|
|
426 |
}
|
42
|
427 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
428 |
\begin{exercise}
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
429 |
A type-identifier, for example @{typ "'a"}, is a token of
|
54
|
430 |
kind @{ML "Keyword" in OuterLex}. It can be parsed using
|
|
431 |
the function @{ML OuterParse.type_ident}.
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
432 |
\end{exercise}
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
433 |
|
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
434 |
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
435 |
*}
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
436 |
|
53
0c3580c831a4
removed the @{ML ...} antiquotation in favour of @{ML_open ...x}
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
437 |
|
49
|
438 |
section {* Positional Information *}
|
|
439 |
|
|
440 |
text {*
|
|
441 |
|
|
442 |
@{ML OuterParse.position}
|
|
443 |
|
|
444 |
*}
|
|
445 |
|
|
446 |
ML {*
|
|
447 |
OuterParse.position
|
|
448 |
*}
|
|
449 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
450 |
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
451 |
section {* Parsing Inner Syntax *}
|
42
|
452 |
|
44
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
453 |
ML {*
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
454 |
let
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
455 |
val input = OuterSyntax.scan Position.none "0"
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
456 |
in
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
457 |
OuterParse.prop input
|
dee4b3e66dfe
added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
458 |
end
|
42
|
459 |
|
|
460 |
*}
|
|
461 |
|
50
|
462 |
ML {*
|
|
463 |
OuterParse.opt_target
|
|
464 |
*}
|
|
465 |
|
|
466 |
ML {*
|
|
467 |
OuterParse.opt_target --
|
|
468 |
OuterParse.fixes --
|
|
469 |
OuterParse.for_fixes --
|
|
470 |
Scan.optional (OuterParse.$$$ "where" |--
|
|
471 |
OuterParse.!!! (OuterParse.enum1 "|" (SpecParse.opt_thm_name ":" -- OuterParse.prop))) []
|
|
472 |
|
|
473 |
*}
|
|
474 |
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
475 |
text {* (FIXME funny output for a proposition) *}
|
41
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
476 |
|
b11653b11bd3
further progress on the parsing section and tuning on the antiqu's
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
477 |
|
38
|
478 |
|
|
479 |
chapter {* Parsing *}
|
|
480 |
|
|
481 |
text {*
|
|
482 |
|
4
|
483 |
Lots of Standard ML code is given in this document, for various reasons,
|
|
484 |
including:
|
|
485 |
\begin{itemize}
|
|
486 |
\item direct quotation of code found in the Isabelle source files,
|
|
487 |
or simplified versions of such code
|
|
488 |
\item identifiers found in the Isabelle source code, with their types
|
|
489 |
(or specialisations of their types)
|
|
490 |
\item code examples, which can be run by the reader, to help illustrate the
|
|
491 |
behaviour of functions found in the Isabelle source code
|
|
492 |
\item ancillary functions, not from the Isabelle source code,
|
|
493 |
which enable the reader to run relevant code examples
|
|
494 |
\item type abbreviations, which help explain the uses of certain functions
|
|
495 |
\end{itemize}
|
|
496 |
|
|
497 |
*}
|
|
498 |
|
|
499 |
section {* Parsing Isar input *}
|
|
500 |
|
|
501 |
text {*
|
|
502 |
|
|
503 |
The typical parsing function has the type
|
|
504 |
\texttt{'src -> 'res * 'src}, with input
|
|
505 |
of type \texttt{'src}, returning a result
|
|
506 |
of type \texttt{'res}, which is (or is derived from) the first part of the
|
|
507 |
input, and also returning the remainder of the input.
|
|
508 |
(In the common case, when it is clear what the ``remainder of the input''
|
|
509 |
means, we will just say that the functions ``returns'' the
|
|
510 |
value of type \texttt{'res}).
|
|
511 |
An exception is raised if an appropriate value
|
|
512 |
cannot be produced from the input.
|
|
513 |
A range of exceptions can be used to identify different reasons
|
|
514 |
for the failure of a parse.
|
|
515 |
|
|
516 |
This contrasts the standard parsing function in Standard ML,
|
|
517 |
which is of type
|
|
518 |
\texttt{type ('res, 'src) reader = 'src -> ('res * 'src) option};
|
|
519 |
(for example, \texttt{List.getItem} and \texttt{Substring.getc}).
|
|
520 |
However, much of the discussion at
|
|
521 |
FIX file:/home/jeremy/html/ml/SMLBasis/string-cvt.html
|
|
522 |
is relevant.
|
|
523 |
|
|
524 |
Naturally one may convert between the two different sorts of parsing functions
|
|
525 |
as follows:
|
|
526 |
\begin{verbatim}
|
|
527 |
open StringCvt ;
|
|
528 |
type ('res, 'src) ex_reader = 'src -> 'res * 'src
|
|
529 |
(* ex_reader : ('res, 'src) reader -> ('res, 'src) ex_reader *)
|
|
530 |
fun ex_reader rdr src = Option.valOf (rdr src) ;
|
|
531 |
(* reader : ('res, 'src) ex_reader -> ('res, 'src) reader *)
|
|
532 |
fun reader exrdr src = SOME (exrdr src) handle _ => NONE ;
|
|
533 |
\end{verbatim}
|
|
534 |
|
|
535 |
*}
|
|
536 |
|
|
537 |
section{* The \texttt{Scan} structure *}
|
|
538 |
|
|
539 |
text {*
|
|
540 |
The source file is \texttt{src/General/scan.ML}.
|
|
541 |
This structure provides functions for using and combining parsing functions
|
|
542 |
of the type \texttt{'src -> 'res * 'src}.
|
|
543 |
Three exceptions are used:
|
|
544 |
\begin{verbatim}
|
|
545 |
exception MORE of string option; (*need more input (prompt)*)
|
|
546 |
exception FAIL of string option; (*try alternatives (reason of failure)*)
|
|
547 |
exception ABORT of string; (*dead end*)
|
|
548 |
\end{verbatim}
|
|
549 |
Many functions in this structure (generally those with names composed of
|
|
550 |
symbols) are declared as infix.
|
|
551 |
|
|
552 |
Some functions from that structure are
|
|
553 |
\begin{verbatim}
|
|
554 |
|-- : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
|
|
555 |
'src -> 'res2 * 'src''
|
|
556 |
--| : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
|
|
557 |
'src -> 'res1 * 'src''
|
|
558 |
-- : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
|
|
559 |
'src -> ('res1 * 'res2) * 'src''
|
|
560 |
^^ : ('src -> string * 'src') * ('src' -> string * 'src'') ->
|
|
561 |
'src -> string * 'src''
|
|
562 |
\end{verbatim}
|
|
563 |
These functions parse a result off the input source twice.
|
|
564 |
|
|
565 |
\texttt{|--} and \texttt{--|}
|
|
566 |
return the first result and the second result, respectively.
|
|
567 |
|
|
568 |
\texttt{--} returns both.
|
|
569 |
|
|
570 |
\verb|^^| returns the result of concatenating the two results
|
|
571 |
(which must be strings).
|
|
572 |
|
|
573 |
Note how, although the types
|
|
574 |
\texttt{'src}, \texttt{'src'} and \texttt{'src''} will normally be the same,
|
|
575 |
the types as shown help suggest the behaviour of the functions.
|
|
576 |
\begin{verbatim}
|
|
577 |
:-- : ('src -> 'res1 * 'src') * ('res1 -> 'src' -> 'res2 * 'src'') ->
|
|
578 |
'src -> ('res1 * 'res2) * 'src''
|
|
579 |
:|-- : ('src -> 'res1 * 'src') * ('res1 -> 'src' -> 'res2 * 'src'') ->
|
|
580 |
'src -> 'res2 * 'src''
|
|
581 |
\end{verbatim}
|
|
582 |
These are similar to \texttt{|--} and \texttt{--|},
|
|
583 |
except that the second parsing function can depend on the result of the first.
|
|
584 |
\begin{verbatim}
|
|
585 |
>> : ('src -> 'res1 * 'src') * ('res1 -> 'res2) -> 'src -> 'res2 * 'src'
|
|
586 |
|| : ('src -> 'res_src) * ('src -> 'res_src) -> 'src -> 'res_src
|
|
587 |
\end{verbatim}
|
|
588 |
\texttt{p >> f} applies a function \texttt{f} to the result of a parse.
|
|
589 |
|
|
590 |
\texttt{||} tries a second parsing function if the first one
|
|
591 |
fails by raising an exception of the form \texttt{FAIL \_}.
|
|
592 |
|
|
593 |
\begin{verbatim}
|
|
594 |
succeed : 'res -> ('src -> 'res * 'src) ;
|
|
595 |
fail : ('src -> 'res_src) ;
|
|
596 |
!! : ('src * string option -> string) ->
|
|
597 |
('src -> 'res_src) -> ('src -> 'res_src) ;
|
|
598 |
\end{verbatim}
|
|
599 |
\texttt{succeed r} returns \texttt{r}, with the input unchanged.
|
|
600 |
\texttt{fail} always fails, raising exception \texttt{FAIL NONE}.
|
|
601 |
\texttt{!! f} only affects the failure mode, turning a failure that
|
|
602 |
raises \texttt{FAIL \_} into a failure that raises \texttt{ABORT ...}.
|
|
603 |
This is used to prevent recovery from the failure ---
|
|
604 |
thus, in \texttt{!! parse1 || parse2}, if \texttt{parse1} fails,
|
|
605 |
it won't recover by trying \texttt{parse2}.
|
|
606 |
|
|
607 |
\begin{verbatim}
|
|
608 |
one : ('si -> bool) -> ('si list -> 'si * 'si list) ;
|
|
609 |
some : ('si -> 'res option) -> ('si list -> 'res * 'si list) ;
|
|
610 |
\end{verbatim}
|
|
611 |
These require the input to be a list of items:
|
|
612 |
they fail, raising \texttt{MORE NONE} if the list is empty.
|
|
613 |
On other failures they raise \texttt{FAIL NONE}
|
|
614 |
|
|
615 |
\texttt{one p} takes the first
|
|
616 |
item from the list if it satisfies \texttt{p}, otherwise fails.
|
|
617 |
|
|
618 |
\texttt{some f} takes the first
|
|
619 |
item from the list and applies \texttt{f} to it, failing if this returns
|
|
620 |
\texttt{NONE}.
|
|
621 |
|
|
622 |
\begin{verbatim}
|
|
623 |
many : ('si -> bool) -> 'si list -> 'si list * 'si list ;
|
|
624 |
\end{verbatim}
|
|
625 |
\texttt{many p} takes items from the input until it encounters one
|
|
626 |
which does not satisfy \texttt{p}. If it reaches the end of the input
|
|
627 |
it fails, raising \texttt{MORE NONE}.
|
|
628 |
|
|
629 |
\texttt{many1} (with the same type) fails if the first item
|
|
630 |
does not satisfy \texttt{p}.
|
|
631 |
|
|
632 |
\begin{verbatim}
|
|
633 |
option : ('src -> 'res * 'src) -> ('src -> 'res option * 'src)
|
|
634 |
optional : ('src -> 'res * 'src) -> 'res -> ('src -> 'res * 'src)
|
|
635 |
\end{verbatim}
|
|
636 |
\texttt{option}:
|
|
637 |
where the parser \texttt{f} succeeds with result \texttt{r}
|
|
638 |
or raises \texttt{FAIL \_},
|
|
639 |
\texttt{option f} gives the result \texttt{SOME r} or \texttt{NONE}.
|
|
640 |
|
|
641 |
\texttt{optional}: if parser \texttt{f} fails by raising \texttt{FAIL \_},
|
|
642 |
\texttt{optional f default} provides the result \texttt{default}.
|
|
643 |
|
|
644 |
\begin{verbatim}
|
|
645 |
repeat : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
|
|
646 |
repeat1 : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
|
|
647 |
bulk : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
|
|
648 |
\end{verbatim}
|
|
649 |
\texttt{repeat f} repeatedly parses an item off the remaining input until
|
|
650 |
\texttt{f} fails with \texttt{FAIL \_}
|
|
651 |
|
|
652 |
\texttt{repeat1} is as for \texttt{repeat}, but requires at least one
|
|
653 |
successful parse.
|
|
654 |
|
|
655 |
\begin{verbatim}
|
|
656 |
lift : ('src -> 'res * 'src) -> ('ex * 'src -> 'res * ('ex * 'src))
|
|
657 |
\end{verbatim}
|
|
658 |
\texttt{lift} changes the source type of a parser by putting in an extra
|
|
659 |
component \texttt{'ex}, which is ignored in the parsing.
|
|
660 |
|
|
661 |
The \texttt{Scan} structure also provides the type \texttt{lexicon},
|
|
662 |
HOW DO THEY WORK ?? TO BE COMPLETED
|
|
663 |
\begin{verbatim}
|
|
664 |
dest_lexicon: lexicon -> string list ;
|
|
665 |
make_lexicon: string list list -> lexicon ;
|
|
666 |
empty_lexicon: lexicon ;
|
|
667 |
extend_lexicon: string list list -> lexicon -> lexicon ;
|
|
668 |
merge_lexicons: lexicon -> lexicon -> lexicon ;
|
|
669 |
is_literal: lexicon -> string list -> bool ;
|
|
670 |
literal: lexicon -> string list -> string list * string list ;
|
|
671 |
\end{verbatim}
|
|
672 |
Two lexicons, for the commands and keywords, are stored and can be retrieved
|
|
673 |
by:
|
|
674 |
\begin{verbatim}
|
|
675 |
val (command_lexicon, keyword_lexicon) = OuterSyntax.get_lexicons () ;
|
|
676 |
val commands = Scan.dest_lexicon command_lexicon ;
|
|
677 |
val keywords = Scan.dest_lexicon keyword_lexicon ;
|
|
678 |
\end{verbatim}
|
|
679 |
*}
|
|
680 |
|
|
681 |
section{* The \texttt{OuterLex} structure *}
|
|
682 |
|
|
683 |
text {*
|
|
684 |
The source file is @{text "src/Pure/Isar/outer_lex.ML"}.
|
|
685 |
In some other source files its name is abbreviated:
|
|
686 |
\begin{verbatim}
|
|
687 |
structure T = OuterLex;
|
|
688 |
\end{verbatim}
|
|
689 |
This structure defines the type \texttt{token}.
|
|
690 |
(The types
|
|
691 |
\texttt{OuterLex.token},
|
|
692 |
\texttt{OuterParse.token} and
|
|
693 |
\texttt{SpecParse.token} are all the same).
|
|
694 |
|
|
695 |
Input text is split up into tokens, and the input source type for many parsing
|
|
696 |
functions is \texttt{token list}.
|
|
697 |
|
|
698 |
The datatype definition (which is not published in the signature) is
|
|
699 |
\begin{verbatim}
|
|
700 |
datatype token = Token of Position.T * (token_kind * string);
|
|
701 |
\end{verbatim}
|
|
702 |
but here are some runnable examples for viewing tokens:
|
|
703 |
|
|
704 |
*}
|
|
705 |
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
706 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
707 |
|
4
|
708 |
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
709 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
710 |
val toks = OuterSyntax.scan Position.none
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
711 |
"theory,imports;begin x.y.z apply ?v1 ?'a 'a -- || 44 simp (* xx *) { * fff * }" ;
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
712 |
*}
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
713 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
714 |
ML {*
|
4
|
715 |
print_depth 20 ;
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
716 |
*}
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
717 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
718 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
719 |
map OuterLex.text_of toks ;
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
720 |
*}
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
721 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
722 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
723 |
val proper_toks = filter OuterLex.is_proper toks ;
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
724 |
*}
|
4
|
725 |
|
47
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
726 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
727 |
map OuterLex.kind_of proper_toks
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
728 |
*}
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
729 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
730 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
731 |
map OuterLex.unparse proper_toks ;
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
732 |
*}
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
733 |
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
734 |
ML {*
|
4daf913fdbe1
hakked latex so that it does not display ML {* *}; general tuning
Christian Urban <urbanc@in.tum.de>
diff
changeset
|
735 |
OuterLex.stopper
|
4
|
736 |
*}
|
|
737 |
|
|
738 |
text {*
|
|
739 |
|
|
740 |
The function \texttt{is\_proper : token -> bool} identifies tokens which are
|
|
741 |
not white space or comments: many parsing functions assume require spaces or
|
|
742 |
comments to have been filtered out.
|
|
743 |
|
|
744 |
There is a special end-of-file token:
|
|
745 |
\begin{verbatim}
|
|
746 |
val (tok_eof : token, is_eof : token -> bool) = T.stopper ;
|
|
747 |
(* end of file token *)
|
|
748 |
\end{verbatim}
|
|
749 |
|
|
750 |
*}
|
|
751 |
|
|
752 |
section {* The \texttt{OuterParse} structure *}
|
|
753 |
|
|
754 |
text {*
|
|
755 |
The source file is \texttt{src/Pure/Isar/outer\_parse.ML}.
|
|
756 |
In some other source files its name is abbreviated:
|
|
757 |
\begin{verbatim}
|
|
758 |
structure P = OuterParse;
|
|
759 |
\end{verbatim}
|
|
760 |
Here the parsers use \texttt{token list} as the input source type.
|
|
761 |
|
|
762 |
Some of the parsers simply select the first token, provided that it is of the
|
|
763 |
right kind (as returned by \texttt{T.kind\_of}): these are
|
|
764 |
\texttt{ command, keyword, short\_ident, long\_ident, sym\_ident, term\_var,
|
|
765 |
type\_ident, type\_var, number, string, alt\_string, verbatim, sync, eof}
|
|
766 |
Others select the first token, provided that it is one of several kinds,
|
|
767 |
(eg, \texttt{name, xname, text, typ}).
|
|
768 |
|
|
769 |
\begin{verbatim}
|
|
770 |
type 'a tlp = token list -> 'a * token list ; (* token list parser *)
|
|
771 |
$$$ : string -> string tlp
|
|
772 |
nat : int tlp ;
|
|
773 |
maybe : 'a tlp -> 'a option tlp ;
|
|
774 |
\end{verbatim}
|
|
775 |
|
|
776 |
\texttt{\$\$\$ s} returns the first token,
|
|
777 |
if it equals \texttt{s} \emph{and} \texttt{s} is a keyword.
|
|
778 |
|
|
779 |
\texttt{nat} returns the first token, if it is a number, and evaluates it.
|
|
780 |
|
|
781 |
\texttt{maybe}: if \texttt{p} returns \texttt{r},
|
|
782 |
then \texttt{maybe p} returns \texttt{SOME r} ;
|
|
783 |
if the first token is an underscore, it returns \texttt{NONE}.
|
|
784 |
|
|
785 |
A few examples:
|
|
786 |
\begin{verbatim}
|
|
787 |
P.list : 'a tlp -> 'a list tlp ; (* likewise P.list1 *)
|
|
788 |
P.and_list : 'a tlp -> 'a list tlp ; (* likewise P.and_list1 *)
|
|
789 |
val toks : token list = OuterSyntax.scan "44 ,_, 66,77" ;
|
|
790 |
val proper_toks = List.filter T.is_proper toks ;
|
|
791 |
P.list P.nat toks ; (* OK, doesn't recognize white space *)
|
|
792 |
P.list P.nat proper_toks ; (* fails, doesn't recognize what follows ',' *)
|
|
793 |
P.list (P.maybe P.nat) proper_toks ; (* fails, end of input *)
|
|
794 |
P.list (P.maybe P.nat) (proper_toks @ [tok_eof]) ; (* OK *)
|
|
795 |
val toks : token list = OuterSyntax.scan "44 and 55 and 66 and 77" ;
|
|
796 |
P.and_list P.nat (List.filter T.is_proper toks @ [tok_eof]) ; (* ??? *)
|
|
797 |
\end{verbatim}
|
|
798 |
|
|
799 |
The following code helps run examples:
|
|
800 |
\begin{verbatim}
|
|
801 |
fun parse_str tlp str =
|
|
802 |
let val toks : token list = OuterSyntax.scan str ;
|
|
803 |
val proper_toks = List.filter T.is_proper toks @ [tok_eof] ;
|
|
804 |
val (res, rem_toks) = tlp proper_toks ;
|
|
805 |
val rem_str = String.concat
|
|
806 |
(Library.separate " " (List.map T.unparse rem_toks)) ;
|
|
807 |
in (res, rem_str) end ;
|
|
808 |
\end{verbatim}
|
|
809 |
|
|
810 |
Some examples from \texttt{src/Pure/Isar/outer\_parse.ML}
|
|
811 |
\begin{verbatim}
|
|
812 |
val type_args =
|
|
813 |
type_ident >> Library.single ||
|
|
814 |
$$$ "(" |-- !!! (list1 type_ident --| $$$ ")") ||
|
|
815 |
Scan.succeed [];
|
|
816 |
\end{verbatim}
|
|
817 |
There are three ways parsing a list of type arguments can succeed.
|
|
818 |
The first line reads a single type argument, and turns it into a singleton
|
|
819 |
list.
|
|
820 |
The second line reads "(", and then the remainder, ignoring the "(" ;
|
|
821 |
the remainder consists of a list of type identifiers (at least one),
|
|
822 |
and then a ")" which is also ignored.
|
|
823 |
The \texttt{!!!} ensures that if the parsing proceeds this far and then fails,
|
|
824 |
it won't try the third line (see the description of \texttt{Scan.!!}).
|
|
825 |
The third line consumes no input and returns the empty list.
|
|
826 |
|
|
827 |
\begin{verbatim}
|
|
828 |
fun triple2 (x, (y, z)) = (x, y, z);
|
|
829 |
val arity = xname -- ($$$ "::" |-- !!! (
|
|
830 |
Scan.optional ($$$ "(" |-- !!! (list1 sort --| $$$ ")")) []
|
|
831 |
-- sort)) >> triple2;
|
|
832 |
\end{verbatim}
|
|
833 |
The parser \texttt{arity} reads a typename $t$, then ``\texttt{::}'' (which is
|
|
834 |
ignored), then optionally a list $ss$ of sorts and then another sort $s$.
|
|
835 |
The result $(t, (ss, s))$ is transformed by \texttt{triple2} to $(t, ss, s)$.
|
|
836 |
The second line reads the optional list of sorts:
|
|
837 |
it reads first ``\texttt{(}'' and last ``\texttt{)}'', which are both ignored,
|
|
838 |
and between them a comma-separated list of sorts.
|
|
839 |
If this list is absent, the default \texttt{[]} provides the list of sorts.
|
|
840 |
|
|
841 |
\begin{verbatim}
|
|
842 |
parse_str P.type_args "('a, 'b) ntyp" ;
|
|
843 |
parse_str P.type_args "'a ntyp" ;
|
|
844 |
parse_str P.type_args "ntyp" ;
|
|
845 |
parse_str P.arity "ty :: tycl" ;
|
|
846 |
parse_str P.arity "ty :: (tycl1, tycl2) tycl" ;
|
|
847 |
\end{verbatim}
|
|
848 |
|
|
849 |
*}
|
|
850 |
|
|
851 |
section {* The \texttt{SpecParse} structure *}
|
|
852 |
|
|
853 |
text {*
|
|
854 |
The source file is \texttt{src/Pure/Isar/spec\_parse.ML}.
|
|
855 |
This structure contains token list parsers for more complicated values.
|
|
856 |
For example,
|
|
857 |
\begin{verbatim}
|
|
858 |
open SpecParse ;
|
|
859 |
attrib : Attrib.src tok_rdr ;
|
|
860 |
attribs : Attrib.src list tok_rdr ;
|
|
861 |
opt_attribs : Attrib.src list tok_rdr ;
|
|
862 |
xthm : (thmref * Attrib.src list) tok_rdr ;
|
|
863 |
xthms1 : (thmref * Attrib.src list) list tok_rdr ;
|
|
864 |
|
|
865 |
parse_str attrib "simp" ;
|
|
866 |
parse_str opt_attribs "hello" ;
|
|
867 |
val (ass, "") = parse_str attribs "[standard, xxxx, simp, intro, OF sym]" ;
|
|
868 |
map Args.dest_src ass ;
|
|
869 |
val (asrc, "") = parse_str attrib "THEN trans [THEN sym]" ;
|
|
870 |
|
|
871 |
parse_str xthm "mythm [attr]" ;
|
|
872 |
parse_str xthms1 "thm1 [attr] thms2" ;
|
|
873 |
\end{verbatim}
|
|
874 |
|
|
875 |
As you can see, attributes are described using types of the \texttt{Args}
|
|
876 |
structure, described below.
|
|
877 |
*}
|
|
878 |
|
|
879 |
section{* The \texttt{Args} structure *}
|
|
880 |
|
|
881 |
text {*
|
|
882 |
The source file is \texttt{src/Pure/Isar/args.ML}.
|
|
883 |
The primary type of this structure is the \texttt{src} datatype;
|
|
884 |
the single constructors not published in the signature, but
|
|
885 |
\texttt{Args.src} and \texttt{Args.dest\_src}
|
|
886 |
are in fact the constructor and destructor functions.
|
|
887 |
Note that the types \texttt{Attrib.src} and \texttt{Method.src}
|
|
888 |
are in fact \texttt{Args.src}.
|
|
889 |
|
|
890 |
\begin{verbatim}
|
|
891 |
src : (string * Args.T list) * Position.T -> Args.src ;
|
|
892 |
dest_src : Args.src -> (string * Args.T list) * Position.T ;
|
|
893 |
Args.pretty_src : Proof.context -> Args.src -> Pretty.T ;
|
|
894 |
fun pr_src ctxt src = Pretty.string_of (Args.pretty_src ctxt src) ;
|
|
895 |
|
|
896 |
val thy = ML_Context.the_context () ;
|
|
897 |
val ctxt = ProofContext.init thy ;
|
|
898 |
map (pr_src ctxt) ass ;
|
|
899 |
\end{verbatim}
|
|
900 |
|
|
901 |
So an \texttt{Args.src} consists of the first word, then a list of further
|
|
902 |
``arguments'', of type \texttt{Args.T}, with information about position in the
|
|
903 |
input.
|
|
904 |
\begin{verbatim}
|
|
905 |
(* how an Args.src is parsed *)
|
|
906 |
P.position : 'a tlp -> ('a * Position.T) tlp ;
|
|
907 |
P.arguments : Args.T list tlp ;
|
|
908 |
|
|
909 |
val parse_src : Args.src tlp =
|
|
910 |
P.position (P.xname -- P.arguments) >> Args.src ;
|
|
911 |
\end{verbatim}
|
|
912 |
|
|
913 |
\begin{verbatim}
|
|
914 |
val ((first_word, args), pos) = Args.dest_src asrc ;
|
|
915 |
map Args.string_of args ;
|
|
916 |
\end{verbatim}
|
|
917 |
|
|
918 |
The \texttt{Args} structure contains more parsers and parser transformers
|
|
919 |
for which the input source type is \texttt{Args.T list}. For example,
|
|
920 |
\begin{verbatim}
|
|
921 |
type 'a atlp = Args.T list -> 'a * Args.T list ;
|
|
922 |
open Args ;
|
|
923 |
nat : int atlp ; (* also Args.int *)
|
|
924 |
thm_sel : PureThy.interval list atlp ;
|
|
925 |
list : 'a atlp -> 'a list atlp ;
|
|
926 |
attribs : (string -> string) -> Args.src list atlp ;
|
|
927 |
opt_attribs : (string -> string) -> Args.src list atlp ;
|
|
928 |
|
|
929 |
(* parse_atl_str : 'a atlp -> (string -> 'a * string) ;
|
|
930 |
given an Args.T list parser, to get a string parser *)
|
|
931 |
fun parse_atl_str atlp str =
|
|
932 |
let val (ats, rem_str) = parse_str P.arguments str ;
|
|
933 |
val (res, rem_ats) = atlp ats ;
|
|
934 |
in (res, String.concat (Library.separate " "
|
|
935 |
(List.map Args.string_of rem_ats @ [rem_str]))) end ;
|
|
936 |
|
|
937 |
parse_atl_str Args.int "-1-," ;
|
|
938 |
parse_atl_str (Scan.option Args.int) "x1-," ;
|
|
939 |
parse_atl_str Args.thm_sel "(1-,4,13-22)" ;
|
|
940 |
|
|
941 |
val (ats as atsrc :: _, "") = parse_atl_str (Args.attribs I)
|
|
942 |
"[THEN trans [THEN sym], simp, OF sym]" ;
|
|
943 |
\end{verbatim}
|
|
944 |
|
|
945 |
From here, an attribute is interpreted using \texttt{Attrib.attribute}.
|
|
946 |
|
|
947 |
\texttt{Args} has a large number of functions which parse an \texttt{Args.src}
|
|
948 |
and also refer to a generic context.
|
|
949 |
Note the use of \texttt{Scan.lift} for this.
|
|
950 |
(as does \texttt{Attrib} - RETHINK THIS)
|
|
951 |
|
|
952 |
(\texttt{Args.syntax} shown below has type specialised)
|
|
953 |
|
|
954 |
\begin{verbatim}
|
|
955 |
type ('res, 'src) parse_fn = 'src -> 'res * 'src ;
|
|
956 |
type 'a cgatlp = ('a, Context.generic * Args.T list) parse_fn ;
|
|
957 |
Scan.lift : 'a atlp -> 'a cgatlp ;
|
|
958 |
term : term cgatlp ;
|
|
959 |
typ : typ cgatlp ;
|
|
960 |
|
|
961 |
Args.syntax : string -> 'res cgatlp -> src -> ('res, Context.generic) parse_fn ;
|
|
962 |
Attrib.thm : thm cgatlp ;
|
|
963 |
Attrib.thms : thm list cgatlp ;
|
|
964 |
Attrib.multi_thm : thm list cgatlp ;
|
|
965 |
|
|
966 |
(* parse_cgatl_str : 'a cgatlp -> (string -> 'a * string) ;
|
|
967 |
given a (Context.generic * Args.T list) parser, to get a string parser *)
|
|
968 |
fun parse_cgatl_str cgatlp str =
|
|
969 |
let
|
|
970 |
(* use the current generic context *)
|
|
971 |
val generic = Context.Theory thy ;
|
|
972 |
val (ats, rem_str) = parse_str P.arguments str ;
|
|
973 |
(* ignore any change to the generic context *)
|
|
974 |
val (res, (_, rem_ats)) = cgatlp (generic, ats) ;
|
|
975 |
in (res, String.concat (Library.separate " "
|
|
976 |
(List.map Args.string_of rem_ats @ [rem_str]))) end ;
|
|
977 |
\end{verbatim}
|
|
978 |
*}
|
|
979 |
|
|
980 |
section{* Attributes, and the \texttt{Attrib} structure *}
|
|
981 |
|
|
982 |
text {*
|
|
983 |
The type \texttt{attribute} is declared in \texttt{src/Pure/thm.ML}.
|
|
984 |
The source file for the \texttt{Attrib} structure is
|
|
985 |
\texttt{src/Pure/Isar/attrib.ML}.
|
|
986 |
Most attributes use a theorem to change a generic context (for example,
|
|
987 |
by declaring that the theorem should be used, by default, in simplification),
|
|
988 |
or change a theorem (which most often involves referring to the current
|
|
989 |
theory).
|
|
990 |
The functions \texttt{Thm.rule\_attribute} and
|
|
991 |
\texttt{Thm.declaration\_attribute} create attributes of these kinds.
|
|
992 |
|
|
993 |
\begin{verbatim}
|
|
994 |
type attribute = Context.generic * thm -> Context.generic * thm;
|
|
995 |
type 'a trf = 'a -> 'a ; (* transformer of a given type *)
|
|
996 |
Thm.rule_attribute : (Context.generic -> thm -> thm) -> attribute ;
|
|
997 |
Thm.declaration_attribute : (thm -> Context.generic trf) -> attribute ;
|
|
998 |
|
|
999 |
Attrib.print_attributes : theory -> unit ;
|
|
1000 |
Attrib.pretty_attribs : Proof.context -> src list -> Pretty.T list ;
|
|
1001 |
|
|
1002 |
List.app Pretty.writeln (Attrib.pretty_attribs ctxt ass) ;
|
|
1003 |
\end{verbatim}
|
|
1004 |
|
|
1005 |
An attribute is stored in a theory as indicated by:
|
|
1006 |
\begin{verbatim}
|
|
1007 |
Attrib.add_attributes :
|
|
1008 |
(bstring * (src -> attribute) * string) list -> theory trf ;
|
|
1009 |
(*
|
|
1010 |
Attrib.add_attributes [("THEN", THEN_att, "resolution with rule")] ;
|
|
1011 |
*)
|
|
1012 |
\end{verbatim}
|
|
1013 |
where the first and third arguments are name and description of the attribute,
|
|
1014 |
and the second is a function which parses the attribute input text
|
|
1015 |
(including the attribute name, which has necessarily already been parsed).
|
|
1016 |
Here, \texttt{THEN\_att} is a function declared in the code for the
|
|
1017 |
structure \texttt{Attrib}, but not published in its signature.
|
|
1018 |
The source file \texttt{src/Pure/Isar/attrib.ML} shows the use of
|
|
1019 |
\texttt{Attrib.add\_attributes} to add a number of attributes.
|
|
1020 |
|
|
1021 |
\begin{verbatim}
|
|
1022 |
FullAttrib.THEN_att : src -> attribute ;
|
|
1023 |
FullAttrib.THEN_att atsrc (generic, ML_Context.thm "sym") ;
|
|
1024 |
FullAttrib.THEN_att atsrc (generic, ML_Context.thm "all_comm") ;
|
|
1025 |
\end{verbatim}
|
|
1026 |
|
|
1027 |
\begin{verbatim}
|
|
1028 |
Attrib.syntax : attribute cgatlp -> src -> attribute ;
|
|
1029 |
Attrib.no_args : attribute -> src -> attribute ;
|
|
1030 |
\end{verbatim}
|
|
1031 |
When this is called as \texttt{syntax scan src (gc, th)}
|
|
1032 |
the generic context \texttt{gc} is used
|
|
1033 |
(and potentially changed to \texttt{gc'})
|
|
1034 |
by \texttt{scan} in parsing to obtain an attribute \texttt{attr} which would
|
|
1035 |
then be applied to \texttt{(gc', th)}.
|
|
1036 |
The source for parsing the attribute is the arguments part of \texttt{src},
|
|
1037 |
which must all be consumed by the parse.
|
|
1038 |
|
|
1039 |
For example, for \texttt{Attrib.no\_args attr src}, the attribute parser
|
|
1040 |
simply returns \texttt{attr}, requiring that the arguments part of
|
|
1041 |
\texttt{src} must be empty.
|
|
1042 |
|
|
1043 |
Some examples from \texttt{src/Pure/Isar/attrib.ML}, modified:
|
|
1044 |
\begin{verbatim}
|
|
1045 |
fun rot_att_n n (gc, th) = (gc, rotate_prems n th) ;
|
|
1046 |
rot_att_n : int -> attribute ;
|
|
1047 |
val rot_arg = Scan.lift (Scan.optional Args.int 1 : int atlp) : int cgatlp ;
|
|
1048 |
val rotated_att : src -> attribute =
|
|
1049 |
Attrib.syntax (rot_arg >> rot_att_n : attribute cgatlp) ;
|
|
1050 |
|
|
1051 |
val THEN_arg : int cgatlp = Scan.lift
|
|
1052 |
(Scan.optional (Args.bracks Args.nat : int atlp) 1 : int atlp) ;
|
|
1053 |
|
|
1054 |
Attrib.thm : thm cgatlp ;
|
|
1055 |
|
|
1056 |
THEN_arg -- Attrib.thm : (int * thm) cgatlp ;
|
|
1057 |
|
|
1058 |
fun THEN_att_n (n, tht) (gc, th) = (gc, th RSN (n, tht)) ;
|
|
1059 |
THEN_att_n : int * thm -> attribute ;
|
|
1060 |
|
|
1061 |
val THEN_att : src -> attribute = Attrib.syntax
|
|
1062 |
(THEN_arg -- Attrib.thm >> THEN_att_n : attribute cgatlp);
|
|
1063 |
\end{verbatim}
|
|
1064 |
The functions I've called \texttt{rot\_arg} and \texttt{THEN\_arg}
|
|
1065 |
read an optional argument, which for \texttt{rotated} is an integer,
|
|
1066 |
and for \texttt{THEN} is a natural enclosed in square brackets;
|
|
1067 |
the default, if the argument is absent, is 1 in each case.
|
|
1068 |
Functions \texttt{rot\_att\_n} and \texttt{THEN\_att\_n} turn these into
|
|
1069 |
attributes, where \texttt{THEN\_att\_n} also requires a theorem, which is
|
|
1070 |
parsed by \texttt{Attrib.thm}.
|
|
1071 |
Infix operators \texttt{--} and \texttt{>>} are in the structure \texttt{Scan}.
|
|
1072 |
|
|
1073 |
*}
|
|
1074 |
|
|
1075 |
section{* Methods, and the \texttt{Method} structure *}
|
|
1076 |
|
|
1077 |
text {*
|
|
1078 |
The source file is \texttt{src/Pure/Isar/method.ML}.
|
|
1079 |
The type \texttt{method} is defined by the datatype declaration
|
|
1080 |
\begin{verbatim}
|
|
1081 |
(* datatype method = Meth of thm list -> cases_tactic; *)
|
|
1082 |
RuleCases.NO_CASES : tactic -> cases_tactic ;
|
|
1083 |
\end{verbatim}
|
|
1084 |
In fact \texttt{RAW\_METHOD\_CASES} (below) is exactly the constructor
|
|
1085 |
\texttt{Meth}.
|
|
1086 |
A \texttt{cases\_tactic} is an elaborated version of a tactic.
|
|
1087 |
\texttt{NO\_CASES tac} is a \texttt{cases\_tactic} which consists of a
|
|
1088 |
\texttt{cases\_tactic} without any further case information.
|
|
1089 |
For further details see the description of structure \texttt{RuleCases} below.
|
|
1090 |
The list of theorems to be passed to a method consists of the current
|
|
1091 |
\emph{facts} in the proof.
|
|
1092 |
|
|
1093 |
\begin{verbatim}
|
|
1094 |
RAW_METHOD : (thm list -> tactic) -> method ;
|
|
1095 |
METHOD : (thm list -> tactic) -> method ;
|
|
1096 |
|
|
1097 |
SIMPLE_METHOD : tactic -> method ;
|
|
1098 |
SIMPLE_METHOD' : (int -> tactic) -> method ;
|
|
1099 |
SIMPLE_METHOD'' : ((int -> tactic) -> tactic) -> (int -> tactic) -> method ;
|
|
1100 |
|
|
1101 |
RAW_METHOD_CASES : (thm list -> cases_tactic) -> method ;
|
|
1102 |
METHOD_CASES : (thm list -> cases_tactic) -> method ;
|
|
1103 |
\end{verbatim}
|
|
1104 |
A method is, in its simplest form, a tactic; applying the method is to apply
|
|
1105 |
the tactic to the current goal state.
|
|
1106 |
|
|
1107 |
Applying \texttt{RAW\_METHOD tacf} creates a tactic by applying
|
|
1108 |
\texttt{tacf} to the current {facts}, and applying that tactic to the
|
|
1109 |
goal state.
|
|
1110 |
|
|
1111 |
\texttt{METHOD} is similar but also first applies
|
|
1112 |
\texttt{Goal.conjunction\_tac} to all subgoals.
|
|
1113 |
|
|
1114 |
\texttt{SIMPLE\_METHOD tac} inserts the facts into all subgoals and then
|
|
1115 |
applies \texttt{tacf}.
|
|
1116 |
|
|
1117 |
\texttt{SIMPLE\_METHOD' tacf} inserts the facts and then
|
|
1118 |
applies \texttt{tacf} to subgoal 1.
|
|
1119 |
|
|
1120 |
\texttt{SIMPLE\_METHOD'' quant tacf} does this for subgoal(s) selected by
|
|
1121 |
\texttt{quant}, which may be, for example,
|
|
1122 |
\texttt{ALLGOALS} (all subgoals),
|
|
1123 |
\texttt{TRYALL} (try all subgoals, failure is OK),
|
|
1124 |
\texttt{FIRSTGOAL} (try subgoals until it succeeds once),
|
|
1125 |
\texttt{(fn tacf => tacf 4)} (subgoal 4), etc
|
16
|
1126 |
(see the \texttt{Tactical} structure, FIXME) %%\cite[Chapter 4]{ref}).
|
4
|
1127 |
|
|
1128 |
A method is stored in a theory as indicated by:
|
|
1129 |
\begin{verbatim}
|
|
1130 |
Method.add_method :
|
|
1131 |
(bstring * (src -> Proof.context -> method) * string) -> theory trf ;
|
|
1132 |
( *
|
|
1133 |
* )
|
|
1134 |
\end{verbatim}
|
|
1135 |
where the first and third arguments are name and description of the method,
|
|
1136 |
and the second is a function which parses the method input text
|
|
1137 |
(including the method name, which has necessarily already been parsed).
|
|
1138 |
|
|
1139 |
Here, \texttt{xxx} is a function declared in the code for the
|
|
1140 |
structure \texttt{Method}, but not published in its signature.
|
|
1141 |
The source file \texttt{src/Pure/Isar/method.ML} shows the use of
|
|
1142 |
\texttt{Method.add\_method} to add a number of methods.
|
|
1143 |
|
|
1144 |
|
|
1145 |
*}
|
|
1146 |
|
|
1147 |
|
|
1148 |
end |