isabelle-cookbook: CookBook/Parsing.thy@1783211b3494 (annotated)

4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1	theory Parsing
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	2	imports Base
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	3
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	4	begin
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	5
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	6
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	7	chapter {* Parsing *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	8
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	9	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	10
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	11	Isabelle distinguishes between \emph{outer} and \emph{inner} syntax.
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	12	Theory commands, such as \isacommand{definition}, \isacommand{inductive} and so
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	13	on, belong to the outer syntax, whereas items inside double quotation marks, such
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	14	as terms, types and so on, belong to the inner syntax. For parsing inner syntax,
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	15	Isabelle uses a rather general and sophisticated algorithm due to Earley, which
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	16	is driven by priority grammars. Parsers for outer syntax are built up by functional
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	17	parsing combinators. These combinators are a well-established technique for parsing,
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	18	which has, for example, been described in Paulson's classic ML-book \cite{paulson-ml2}.
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	19	Isabelle developers are usually concerned with writing these outer syntax parsers,
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	20	either for new definitional packages or for calling tactics with specific arguments.
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	21
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	22	\begin{readmore}
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	23	The library
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	24	for writing parser combinators is split up, roughly, into two parts.
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	25	The first part consists of a collection of generic parser combinators defined
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	26	in the structure @{ML_struct Scan} in the file
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	27	@{ML_file "Pure/General/scan.ML"}. The second part of the library consists of
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	28	combinators for dealing with specific token types, which are defined in the
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	29	structure @{ML_struct OuterParse} in the file
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	30	@{ML_file "Pure/Isar/outer_parse.ML"}.
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	31	\end{readmore}
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	32
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	33	*}
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	34
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	35	section {* Building Generic Parsers *}
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	36
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	37	text {*
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	38
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	39	Let us first have a look at parsing strings using generic parsing combinators.
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	40	The function @{ML "(op $$)"} takes a string as argument and will ``consume'' this string from
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	41	a given input list of strings. ``Consume'' in this context means that it will
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	42	return a pair consisting of this string and the rest of the input list.
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	43	For example:
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	44
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	45	@{ML_response [display] "($$ \"h\") (explode \"hello\")" "(\"h\", [\"e\", \"l\", \"l\", \"o\"])"}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	46	@{ML_response [display] "($$ \"w\") (explode \"world\")" "(\"w\", [\"o\", \"r\", \"l\", \"d\"])"}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	47
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	48	This function will either succeed (as in the two examples above) or raise the exception
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	49	@{ML_text "FAIL"} if no string can be consumed. For example trying to parse
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	50
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	51	@{ML_response_fake [display] "($$ \"x\") (explode \"world\")"
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	52	"Exception FAIL raised"}
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	53
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	54	will raise the exception @{ML_text "FAIL"}.
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	55	There are three exceptions used in the parsing combinators:
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	56
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	57	\begin{itemize}
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	58	\item @{ML_text "FAIL"} is used to indicate that alternative routes of parsing
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	59	might be explored.
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	60	\item @{ML_text "MORE"} indicates that there is not enough input for the parser. For example
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	61	in @{ML_text "($$ \"h\") []"}.
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	62	\item @{ML_text "ABORT"} is the exception which is raised when a dead end is reached.
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	63	It is used for example in the function @{ML "(op !!)"} (see below).
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	64	\end{itemize}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	65
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	66	However, note that these exceptions are private to the parser and cannot be accessed
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	67	by the programmer (for example to handle them).
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	68
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	69	Slightly more general than the parser @{ML "(op $$)"} is the function @{ML
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	70	Scan.one}, in that it takes a predicate as argument and then parses exactly
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	71	one item from the input list satisfying this predicate. For example the
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	72	following parser either consumes an @{ML_text [quotes] "h"} or a @{ML_text
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	73	[quotes] "w"}:
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	74
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	75
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	76	@{ML_response [display]
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	77	"let
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	78	val hw = Scan.one (fn x => x = \"h\" orelse x = \"w\")
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	79	val input1 = (explode \"hello\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	80	val input2 = (explode \"world\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	81	in
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	82	(hw input1, hw input2)
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	83	end"
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	84	"((\"h\", [\"e\", \"l\", \"l\", \"o\"]),(\"w\", [\"o\", \"r\", \"l\", \"d\"]))"}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	85
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	86	Two parser can be connected in sequence by using the function @{ML "(op --)"}.
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	87	For example parsing @{ML_text "h"}, @{ML_text "e"} and @{ML_text "l"} in this
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	88	sequence can be achieved by
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	89
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	90	@{ML_response [display] "(($$ \"h\") -- ($$ \"e\") -- ($$ \"l\")) (explode \"hello\")"
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	91	"(((\"h\", \"e\"), \"l\"), [\"l\", \"o\"])"}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	92
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	93	Note how the result of consumed strings builds up on the left as nested pairs.
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	94
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	95	Parsers that explore
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	96	alternatives can be constructed using the function @{ML "(op \|\|)"}. For example, the
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	97	parser @{ML "(p \|\| q)" for p q} returns the result of @{ML_text "p"}, in case it succeeds,
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	98	otherwise it returns the result of @{ML_text "q"}. For example
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	99
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	100	@{ML_response [display]
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	101	"let
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	102	val hw = ($$ \"h\") \|\| ($$ \"w\")
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	103	val input1 = (explode \"hello\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	104	val input2 = (explode \"world\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	105	in
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	106	(hw input1, hw input2)
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	107	end"
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	108	"((\"h\", [\"e\", \"l\", \"l\", \"o\"]), (\"w\", [\"o\", \"r\", \"l\", \"d\"]))"}
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	109
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	110	The functions @{ML "(op \|--)"} and @{ML "(op --\|)"} work like the sequencing function
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	111	for parsers, except that they discard the item being parsed by the first (respectively second)
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	112	parser. For example
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	113
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	114	@{ML_response [display]
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	115	"let
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	116	val just_e = ($$ \"h\") \|-- ($$ \"e\")
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	117	val just_h = ($$ \"h\") --\| ($$ \"e\")
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	118	val input = (explode \"hello\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	119	in
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	120	(just_e input, just_h input)
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	121	end"
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	122	"((\"e\", [\"l\", \"l\", \"o\"]),(\"h\", [\"l\", \"l\", \"o\"]))"}
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	123
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	124	The parser @{ML "Scan.optional p x" for p x} returns the result of the parser
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	125	@{ML_text "p"}, if it succeeds; otherwise it returns
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	126	the default value @{ML_text "x"}. For example
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	127
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	128	@{ML_response [display]
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	129	"let
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	130	val p = Scan.optional ($$ \"h\") \"x\"
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	131	val input1 = (explode \"hello\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	132	val input2 = (explode \"world\")
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	133	in
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	134	(p input1, p input2)
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	135	end"
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	136	"((\"h\", [\"e\", \"l\", \"l\", \"o\"]), (\"x\", [\"w\", \"o\", \"r\", \"l\", \"d\"]))"}
631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	137
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	138	The function @{ML Scan.option} works similarly, except no default value can
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	139	be given. Instead, the result is wrapped as an @{text "option"}-type. For example:
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	140
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	141	@{ML_response [display]
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	142	"let
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	143	val p = Scan.option ($$ \"h\")
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	144	val input1 = (explode \"hello\")
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	145	val input2 = (explode \"world\")
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	146	in
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	147	(p input1, p input2)
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	148	end" "((SOME \"h\", [\"e\", \"l\", \"l\", \"o\"]), (NONE, [\"w\", \"o\", \"r\", \"l\", \"d\"]))"}
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	149
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	150	The function @{ML "(op !!)"} helps to produce appropriate error messages
43 02f76f1b6e7b added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit Christian Urban <urbanc@in.tum.de> parents: 42 diff changeset	151	during parsing. For example if one wants to parse that @{ML_text p} is immediately
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	152	followed by @{ML_text q}, or start a completely different parser @{ML_text r},
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	153	one might write
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	154
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	155	@{ML [display] "(p -- q) \|\| r" for p q r}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	156
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	157	However, this parser is problematic for producing an appropriate error message, in case
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	158	the parsing of @{ML "(p -- q)" for p q} fails. Because in that case one loses with the parser
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	159	above the information
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	160	that @{ML_text p} should be followed by @{ML_text q}. To see this consider the case in
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	161	which @{ML_text p}
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	162	is present in the input, but not @{ML_text q}. That means @{ML "(p -- q)" for p q} will fail
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	163	and the
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	164	alternative parser @{ML_text r} will be tried. However in many circumstance this will be the wrong
43 02f76f1b6e7b added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit Christian Urban <urbanc@in.tum.de> parents: 42 diff changeset	165	parser for the input ``p-followed-by-q'' and therefore will also fail. The error message is then
02f76f1b6e7b added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit Christian Urban <urbanc@in.tum.de> parents: 42 diff changeset	166	caused by the
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	167	failure of @{ML_text r}, not by the absence of @{ML_text q} in the input. This kind of situation
a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	168	can be avoided by using the function @{ML "(op !!)"}. This function aborts the whole process of
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	169	parsing in case of a failure and invokes an error message. For example if we invoke the parser
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	170
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	171	@{ML [display] "(!! (fn _ => \"foo\") ($$ \"h\"))"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	172
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	173	on @{ML_text [quotes] "hello"}, the parsing succeeds
39 631d12c25bde substantial changes to the antiquotations (preliminary version) Christian Urban <urbanc@in.tum.de> parents: 38 diff changeset	174
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	175	@{ML_response [display]
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	176	"(!! (fn _ => \"foo\") ($$ \"h\")) (explode \"hello\")"
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	177	"(\"h\", [\"e\", \"l\", \"l\", \"o\"])"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	178
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	179	but if we invoke it on @{ML_text [quotes] "world"}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	180
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	181	@{ML_response_fake [display] "(!! (fn _ => \"foo\") ($$ \"h\")) (explode \"world\")"
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	182	"Exception ABORT raised"}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	183
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	184	the parsing aborts and the error message @{ML_text "foo"} is printed out. In order to
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	185	see the error message properly, we need to prefix the parser with the function
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	186	@{ML "Scan.error"}. For example
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	187
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	188	@{ML_response_fake [display] "Scan.error ((!! (fn _ => \"foo\") ($$ \"h\")))"
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	189	"Exception Error \"foo\" raised"}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	190
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	191	This ``prefixing'' is usually done by wrappers such as @{ML "OuterSyntax.command"}
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	192	(FIXME: give reference to later place).
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	193
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	194	Returning to our example of parsing @{ML "(p -- q) \|\| r" for p q r}. If we want
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	195	to generate the correct error message for p-followed-by-q, then
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	196	we have to write:
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	197	*}
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	198
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	199	ML {*
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	200	fun p_followed_by_q p q r =
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	201	let
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	202	val err = (fn _ => p ^ " is not followed by " ^ q)
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	203	in
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	204	(($$ p) -- (!! err ($$ q))) \|\| (($$ r) -- ($$ r))
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	205	end
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	206	*}
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	207
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	208
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	209	text {*
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	210	Running this parser with
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	211
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	212	@{ML_response_fake [display] "Scan.error (p_followed_by_q \"h\" \"e\" \"w\") (explode \"holle\")"
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	213	"Exception ERROR \"h is not followed by e\" raised"}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	214
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	215	gives the correct error message. Running it with
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	216
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	217	@{ML_response [display] "Scan.error (p_followed_by_q \"h\" \"e\" \"w\") (explode \"wworld\")"
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	218	"((\"w\", \"w\"), [\"o\", \"r\", \"l\", \"d\"])"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	219
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	220	yields the expected parsing.
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	221
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	222	The function @{ML "Scan.repeat p" for p} will apply a parser @{ML_text p} as
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	223	often as it succeeds. For example
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	224
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	225	@{ML_response [display] "Scan.repeat ($$ \"h\") (explode \"hhhhello\")"
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	226	"([\"h\", \"h\", \"h\", \"h\"], [\"e\", \"l\", \"l\", \"o\"])"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	227
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	228	Note that @{ML "Scan.repeat"} stores the parsed items in a list. The function
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	229	@{ML "Scan.repeat1"} is similar, but requires that the parser @{ML_text "p"}
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	230	succeeds at least once.
48 609f9ef73494 fixed FIXME's in fake responses Christian Urban <urbanc@in.tum.de> parents: 47 diff changeset	231
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	232	Also note that the parser would have aborted with the exception @{ML_text MORE}, if
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	233	we had run it only on just @{ML_text [quotes] "hhhh"}. This can be avoided using
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	234	the wrapper @{ML Scan.finite} and the ``stopper-token'' @{ML Symbol.stopper}. With
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	235	them we can write
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	236
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	237	@{ML_response [display] "Scan.finite Symbol.stopper (Scan.repeat ($$ \"h\")) (explode \"hhhh\")"
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	238	"([\"h\", \"h\", \"h\", \"h\"], [])"}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	239
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	240	However, this kind of manually wrapping needs to be done only very rarely
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	241	in practise, because it is already done by the infrastructure for you.
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	242
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	243	After parsing succeeded, one nearly always wants to apply a function on the parsed
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	244	items. This is done using the function @{ML "(p >> f)" for p f} which runs
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	245	first the parser @{ML_text p} and upon successful completion applies the
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	246	function @{ML_text f} to the result. For example
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	247
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	248	@{ML_response [display]
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	249	"let
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	250	fun double (x,y) = (x^x,y^y)
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	251	in
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	252	(($$ \"h\") -- ($$ \"e\") >> double) (explode \"hello\")
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	253	end"
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	254	"((\"hh\", \"ee\"), [\"l\", \"l\", \"o\"])"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	255
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	256	doubles the two parsed input strings.
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	257
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	258	The function @{ML Scan.lift} takes a parser and a pair as arguments. This function applies
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	259	the given parser to the second component of the pair and leaves the first component
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	260	untouched. For example
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	261
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	262	@{ML_response [display]
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	263	"Scan.lift (($$ \"h\") -- ($$ \"e\")) (1,(explode \"hello\"))"
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	264	"((\"h\", \"e\"), (1, [\"l\", \"l\", \"o\"]))"}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	265
43 02f76f1b6e7b added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit Christian Urban <urbanc@in.tum.de> parents: 42 diff changeset	266	(FIXME: In which situations is this useful? Give examples.)
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	267	*}
35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	268
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	269	section {* Parsing Theory Syntax *}
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	270
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	271	text {*
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	272	Most of the time, however, Isabelle developers have to deal with parsing tokens, not strings.
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	273	This is because the parsers for the theory syntax, as well as the parsers for the
43 02f76f1b6e7b added positions to anti-quotations; removed old antiquotation_setup; tuned the text a bit Christian Urban <urbanc@in.tum.de> parents: 42 diff changeset	274	argument syntax of proof methods and attributes use the type
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	275	@{ML_type OuterLex.token} (which is identical to the type @{ML_type OuterParse.token}).
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	276
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	277	\begin{readmore}
40 35e1dff0d9bb more on the parsing section Christian Urban <urbanc@in.tum.de> parents: 39 diff changeset	278	The parser functions for the theory syntax are contained in the structure
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	279	@{ML_struct OuterParse} defined in the file @{ML_file "Pure/Isar/outer_parse.ML"}.
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	280	The definition for tokens is in the file @{ML_file "Pure/Isar/outer_lex.ML"}.
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	281	\end{readmore}
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	282
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	283	The structure @{ML_struct OuterLex} defines several kinds of token (for example
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	284	@{ML "Ident" in OuterLex} for identifiers, @{ML "Keyword" in OuterLex} for keywords and
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	285	@{ML "Command" in OuterLex} for commands). Some token parsers take into account the
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	286	kind of token.
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	287	*}
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	288
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	289	text {*
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	290	For the examples below, we can generate a token list out of a string using
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	291	the function @{ML "OuterSyntax.scan"}, which we give below @{ML
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	292	"Position.none"} as argument since, at the moment, we are not interested in
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	293	generating precise error messages. The following
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	294
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	295
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	296	@{ML_response_fake [display] "OuterSyntax.scan Position.none \"hello world\""
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	297	"[Token (\<dots>,(Ident, \"hello\"),\<dots>),
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	298	Token (\<dots>,(Space, \" \"),\<dots>),
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	299	Token (\<dots>,(Ident, \"world\"),\<dots>)]"}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	300
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	301	produces three tokens where the first and the last are identifiers, since
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	302	@{ML_text [quotes] "hello"} and @{ML_text [quotes] "world"} do not match any
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	303	other syntactic category.\footnote{Note that because of a possible a bug in
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	304	the PolyML runtime system the result is printed as @{text "?"}, instead of
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	305	the token.} The second indicates a space.
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	306
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	307	Many parsing functions later on will require spaces, comments and the like
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	308	to have already been filtered out. So from now on we are going to use the
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	309	functions @{ML filter} and @{ML OuterLex.is_proper} do this. For example
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	310
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	311
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	312	@{ML_response_fake [display]
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	313	"let
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	314	val input = OuterSyntax.scan Position.none \"hello world\"
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	315	in
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	316	filter OuterLex.is_proper input
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	317	end"
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	318	"[Token (\<dots>,(Ident, \"hello\"), \<dots>), Token (\<dots>,(Ident, \"world\"), \<dots>)]"}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	319
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	320	For convenience we are going to use the function
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	321
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	322	*}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	323
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	324	ML {*
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	325	fun filtered_input str =
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	326	filter OuterLex.is_proper (OuterSyntax.scan Position.none str)
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	327	*}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	328
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	329	text {*
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	330
48 609f9ef73494 fixed FIXME's in fake responses Christian Urban <urbanc@in.tum.de> parents: 47 diff changeset	331	If we parse
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	332
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	333	@{ML_response_fake [display]
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	334	"filtered_input \"inductive \| for\""
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	335	"[Token (\<dots>,(Command, \"inductive\"),\<dots>),
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	336	Token (\<dots>,(Keyword, \"\|\"),\<dots>),
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	337	Token (\<dots>,(Keyword, \"for\"),\<dots>)]"}
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	338
52 a04bdee4fb1e tuned Christian Urban <urbanc@in.tum.de> parents: 50 diff changeset	339	we obtain a list consisting of only a command and two keyword tokens.
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	340	If you want to see which keywords and commands are currently known, use
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	341	the following (you might have to adjust the @{ML print_depth} in order to
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	342	see the complete list):
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	343
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	344	@{ML_response_fake [display]
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	345	"let
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	346	val (keywords, commands) = OuterKeyword.get_lexicons ()
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	347	in
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	348	(Scan.dest_lexicon commands, Scan.dest_lexicon keywords)
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	349	end"
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	350	"([\"}\",\"{\",\<dots>],[\"\<rightleftharpoons>\",\"\<leftharpoondown>\",\<dots>])"}
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	351
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	352	Now the parser @{ML "OuterParse.$$$"} parses a single keyword. For example
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	353
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	354	@{ML_response [display]
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	355	"let
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	356	val input1 = filtered_input \"where for\"
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	357	val input2 = filtered_input \"\| in\"
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	358	in
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	359	(OuterParse.$$$ \"where\" input1, OuterParse.$$$ \"\|\" input2)
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	360	end"
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	361	"((\"where\",\<dots>),(\"\|\",\<dots>))"}
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	362
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	363	Like before, we can sequentially connect parsers with @{ML "(op --)"}. For example
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	364
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	365	@{ML_response [display]
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	366	"let
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	367	val input = filtered_input \"\| in\"
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	368	in
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	369	(OuterParse.$$$ \"\|\" -- OuterParse.$$$ \"in\") input
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	370	end"
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	371	"((\"\|\",\"in\"),[])"}
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	372
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	373	The parser @{ML "OuterParse.enum s p" for s p} parses a possibly empty
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	374	list of items recognised by the parser @{ML_text p}, where the items being parsed
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	375	are separated by the string @{ML_text s}. For example
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	376
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	377	@{ML_response [display]
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	378	"let
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	379	val input = filtered_input \"in \| in \| in foo\"
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	380	in
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	381	(OuterParse.enum \"\|\" (OuterParse.$$$ \"in\")) input
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	382	end"
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	383	"([\"in\",\"in\",\"in\"],[\<dots>])"}
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	384
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	385	@{ML "OuterParse.enum1"} works similarly, except that the parsed list must
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	386	be non-empty. Note that we had to add an @{ML_text [quotes] "foo"} at the end
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	387	of the parsed string, otherwise the parser would have consumed all tokens
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	388	and then failed with the exception @{ML_text "MORE"}. Like in the previous
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	389	section, we can avoid this exception using the wrapper @{ML
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	390	Scan.finite}. This time, however, we have to use the ``stopper-token'' @{ML
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	391	OuterLex.stopper}. We can write
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	392
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	393	@{ML_response [display]
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	394	"let
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	395	val input = filtered_input \"in \| in \| in\"
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	396	in
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	397	Scan.finite OuterLex.stopper
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	398	(OuterParse.enum \"\|\" (OuterParse.$$$ \"in\")) input
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	399	end"
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	400	"([\"in\",\"in\",\"in\"],[])"}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	401
54 1783211b3494 tuned; added document antiquotation ML_response_fake_both Christian Urban <urbanc@in.tum.de> parents: 53 diff changeset	402	The following function will help us later to run examples
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	403
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	404	*}
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	405
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	406	ML {*
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	407	fun parse p input = Scan.finite OuterLex.stopper (Scan.error p) input
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	408	*}
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	409
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	410	text {*
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	411
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	412	The function @{ML "OuterParse.!!!"} can be used to force termination of the
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	413	parser in case of a dead end, just like @{ML "Scan.!!"} (see previous section),
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	414	except that the error message is fixed to be @{text [quotes] "Outer syntax error"}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	415	with a relatively precise description of the failure. For example:
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	416
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	417	@{ML_response_fake [display]
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	418	"let
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	419	val input = filtered_input \"in \|\"
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	420	val parse_bar_then_in = OuterParse.$$$ \"\|\" -- OuterParse.$$$ \"in\"
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	421	in
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	422	parse (OuterParse.!!! parse_bar_then_in) input
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	423	end"
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	424	"Exception ERROR \"Outer syntax error: keyword \"\|\" expected,
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	425	but keyword in was found\" raised"
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	426	}
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	427
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	428	\begin{exercise}
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	429	A type-identifier, for example @{typ "'a"}, is a token of
54 1783211b3494 tuned; added document antiquotation ML_response_fake_both Christian Urban <urbanc@in.tum.de> parents: 53 diff changeset	430	kind @{ML "Keyword" in OuterLex}. It can be parsed using
1783211b3494 tuned; added document antiquotation ML_response_fake_both Christian Urban <urbanc@in.tum.de> parents: 53 diff changeset	431	the function @{ML OuterParse.type_ident}.
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	432	\end{exercise}
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	433
0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	434
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	435	*}
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	436
53 0c3580c831a4 removed the @{ML ...} antiquotation in favour of @{ML_open ...x} Christian Urban <urbanc@in.tum.de> parents: 52 diff changeset	437
49 a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	438	section {* Positional Information *}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	439
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	440	text {*
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	441
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	442	@{ML OuterParse.position}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	443
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	444	*}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	445
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	446	ML {*
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	447	OuterParse.position
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	448	*}
a0edabf14457 added more material Christian Urban <urbanc@in.tum.de> parents: 48 diff changeset	449
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	450
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	451	section {* Parsing Inner Syntax *}
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	452
44 dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	453	ML {*
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	454	let
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	455	val input = OuterSyntax.scan Position.none "0"
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	456	in
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	457	OuterParse.prop input
dee4b3e66dfe added a readme chapter for prospective authors; added commands for referring to the Isar Reference Manual Christian Urban <urbanc@in.tum.de> parents: 43 diff changeset	458	end
42 cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	459
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	460	*}
cd612b489504 tuned mostly antiquotation and text Christian Urban <urbanc@in.tum.de> parents: 41 diff changeset	461
50 3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	462	ML {*
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	463	OuterParse.opt_target
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	464	*}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	465
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	466	ML {*
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	467	OuterParse.opt_target --
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	468	OuterParse.fixes --
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	469	OuterParse.for_fixes --
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	470	Scan.optional (OuterParse.$$$ "where" \|--
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	471	OuterParse.!!! (OuterParse.enum1 "\|" (SpecParse.opt_thm_name ":" -- OuterParse.prop))) []
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	472
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	473	*}
3d4b49921cdb tuned Christian Urban <urbanc@in.tum.de> parents: 49 diff changeset	474
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	475	text {* (FIXME funny output for a proposition) *}
41 b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	476
b11653b11bd3 further progress on the parsing section and tuning on the antiqu's Christian Urban <urbanc@in.tum.de> parents: 40 diff changeset	477
38 e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	478
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	479	chapter {* Parsing *}
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	480
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	481	text {*
e21b2f888fa2 added a preliminary section about parsing Christian Urban <urbanc@in.tum.de> parents: 16 diff changeset	482
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	483	Lots of Standard ML code is given in this document, for various reasons,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	484	including:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	485	\begin{itemize}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	486	\item direct quotation of code found in the Isabelle source files,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	487	or simplified versions of such code
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	488	\item identifiers found in the Isabelle source code, with their types
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	489	(or specialisations of their types)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	490	\item code examples, which can be run by the reader, to help illustrate the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	491	behaviour of functions found in the Isabelle source code
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	492	\item ancillary functions, not from the Isabelle source code,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	493	which enable the reader to run relevant code examples
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	494	\item type abbreviations, which help explain the uses of certain functions
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	495	\end{itemize}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	496
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	497	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	498
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	499	section {* Parsing Isar input *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	500
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	501	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	502
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	503	The typical parsing function has the type
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	504	\texttt{'src -> 'res * 'src}, with input
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	505	of type \texttt{'src}, returning a result
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	506	of type \texttt{'res}, which is (or is derived from) the first part of the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	507	input, and also returning the remainder of the input.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	508	(In the common case, when it is clear what the ``remainder of the input''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	509	means, we will just say that the functions ``returns'' the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	510	value of type \texttt{'res}).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	511	An exception is raised if an appropriate value
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	512	cannot be produced from the input.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	513	A range of exceptions can be used to identify different reasons
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	514	for the failure of a parse.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	515
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	516	This contrasts the standard parsing function in Standard ML,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	517	which is of type
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	518	\texttt{type ('res, 'src) reader = 'src -> ('res * 'src) option};
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	519	(for example, \texttt{List.getItem} and \texttt{Substring.getc}).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	520	However, much of the discussion at
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	521	FIX file:/home/jeremy/html/ml/SMLBasis/string-cvt.html
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	522	is relevant.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	523
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	524	Naturally one may convert between the two different sorts of parsing functions
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	525	as follows:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	526	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	527	open StringCvt ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	528	type ('res, 'src) ex_reader = 'src -> 'res * 'src
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	529	(* ex_reader : ('res, 'src) reader -> ('res, 'src) ex_reader *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	530	fun ex_reader rdr src = Option.valOf (rdr src) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	531	(* reader : ('res, 'src) ex_reader -> ('res, 'src) reader *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	532	fun reader exrdr src = SOME (exrdr src) handle _ => NONE ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	533	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	534
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	535	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	536
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	537	section{* The \texttt{Scan} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	538
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	539	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	540	The source file is \texttt{src/General/scan.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	541	This structure provides functions for using and combining parsing functions
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	542	of the type \texttt{'src -> 'res * 'src}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	543	Three exceptions are used:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	544	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	545	exception MORE of string option; (need more input (prompt))
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	546	exception FAIL of string option; (try alternatives (reason of failure))
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	547	exception ABORT of string; (dead end)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	548	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	549	Many functions in this structure (generally those with names composed of
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	550	symbols) are declared as infix.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	551
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	552	Some functions from that structure are
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	553	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	554	\|-- : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	555	'src -> 'res2 * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	556	--\| : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	557	'src -> 'res1 * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	558	-- : ('src -> 'res1 * 'src') * ('src' -> 'res2 * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	559	'src -> ('res1 * 'res2) * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	560	^^ : ('src -> string * 'src') * ('src' -> string * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	561	'src -> string * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	562	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	563	These functions parse a result off the input source twice.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	564
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	565	\texttt{\|--} and \texttt{--\|}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	566	return the first result and the second result, respectively.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	567
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	568	\texttt{--} returns both.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	569
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	570	\verb\|^^\| returns the result of concatenating the two results
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	571	(which must be strings).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	572
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	573	Note how, although the types
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	574	\texttt{'src}, \texttt{'src'} and \texttt{'src''} will normally be the same,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	575	the types as shown help suggest the behaviour of the functions.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	576	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	577	:-- : ('src -> 'res1 * 'src') * ('res1 -> 'src' -> 'res2 * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	578	'src -> ('res1 * 'res2) * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	579	:\|-- : ('src -> 'res1 * 'src') * ('res1 -> 'src' -> 'res2 * 'src'') ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	580	'src -> 'res2 * 'src''
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	581	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	582	These are similar to \texttt{\|--} and \texttt{--\|},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	583	except that the second parsing function can depend on the result of the first.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	584	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	585	>> : ('src -> 'res1 * 'src') * ('res1 -> 'res2) -> 'src -> 'res2 * 'src'
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	586	\|\| : ('src -> 'res_src) * ('src -> 'res_src) -> 'src -> 'res_src
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	587	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	588	\texttt{p >> f} applies a function \texttt{f} to the result of a parse.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	589
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	590	\texttt{\|\|} tries a second parsing function if the first one
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	591	fails by raising an exception of the form \texttt{FAIL \_}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	592
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	593	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	594	succeed : 'res -> ('src -> 'res * 'src) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	595	fail : ('src -> 'res_src) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	596	!! : ('src * string option -> string) ->
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	597	('src -> 'res_src) -> ('src -> 'res_src) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	598	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	599	\texttt{succeed r} returns \texttt{r}, with the input unchanged.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	600	\texttt{fail} always fails, raising exception \texttt{FAIL NONE}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	601	\texttt{!! f} only affects the failure mode, turning a failure that
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	602	raises \texttt{FAIL \_} into a failure that raises \texttt{ABORT ...}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	603	This is used to prevent recovery from the failure ---
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	604	thus, in \texttt{!! parse1 \|\| parse2}, if \texttt{parse1} fails,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	605	it won't recover by trying \texttt{parse2}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	606
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	607	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	608	one : ('si -> bool) -> ('si list -> 'si * 'si list) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	609	some : ('si -> 'res option) -> ('si list -> 'res * 'si list) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	610	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	611	These require the input to be a list of items:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	612	they fail, raising \texttt{MORE NONE} if the list is empty.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	613	On other failures they raise \texttt{FAIL NONE}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	614
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	615	\texttt{one p} takes the first
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	616	item from the list if it satisfies \texttt{p}, otherwise fails.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	617
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	618	\texttt{some f} takes the first
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	619	item from the list and applies \texttt{f} to it, failing if this returns
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	620	\texttt{NONE}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	621
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	622	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	623	many : ('si -> bool) -> 'si list -> 'si list * 'si list ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	624	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	625	\texttt{many p} takes items from the input until it encounters one
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	626	which does not satisfy \texttt{p}. If it reaches the end of the input
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	627	it fails, raising \texttt{MORE NONE}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	628
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	629	\texttt{many1} (with the same type) fails if the first item
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	630	does not satisfy \texttt{p}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	631
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	632	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	633	option : ('src -> 'res * 'src) -> ('src -> 'res option * 'src)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	634	optional : ('src -> 'res * 'src) -> 'res -> ('src -> 'res * 'src)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	635	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	636	\texttt{option}:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	637	where the parser \texttt{f} succeeds with result \texttt{r}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	638	or raises \texttt{FAIL \_},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	639	\texttt{option f} gives the result \texttt{SOME r} or \texttt{NONE}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	640
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	641	\texttt{optional}: if parser \texttt{f} fails by raising \texttt{FAIL \_},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	642	\texttt{optional f default} provides the result \texttt{default}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	643
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	644	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	645	repeat : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	646	repeat1 : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	647	bulk : ('src -> 'res * 'src) -> 'src -> 'res list * 'src
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	648	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	649	\texttt{repeat f} repeatedly parses an item off the remaining input until
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	650	\texttt{f} fails with \texttt{FAIL \_}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	651
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	652	\texttt{repeat1} is as for \texttt{repeat}, but requires at least one
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	653	successful parse.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	654
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	655	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	656	lift : ('src -> 'res * 'src) -> ('ex * 'src -> 'res * ('ex * 'src))
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	657	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	658	\texttt{lift} changes the source type of a parser by putting in an extra
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	659	component \texttt{'ex}, which is ignored in the parsing.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	660
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	661	The \texttt{Scan} structure also provides the type \texttt{lexicon},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	662	HOW DO THEY WORK ?? TO BE COMPLETED
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	663	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	664	dest_lexicon: lexicon -> string list ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	665	make_lexicon: string list list -> lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	666	empty_lexicon: lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	667	extend_lexicon: string list list -> lexicon -> lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	668	merge_lexicons: lexicon -> lexicon -> lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	669	is_literal: lexicon -> string list -> bool ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	670	literal: lexicon -> string list -> string list * string list ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	671	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	672	Two lexicons, for the commands and keywords, are stored and can be retrieved
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	673	by:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	674	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	675	val (command_lexicon, keyword_lexicon) = OuterSyntax.get_lexicons () ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	676	val commands = Scan.dest_lexicon command_lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	677	val keywords = Scan.dest_lexicon keyword_lexicon ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	678	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	679	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	680
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	681	section{* The \texttt{OuterLex} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	682
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	683	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	684	The source file is @{text "src/Pure/Isar/outer_lex.ML"}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	685	In some other source files its name is abbreviated:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	686	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	687	structure T = OuterLex;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	688	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	689	This structure defines the type \texttt{token}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	690	(The types
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	691	\texttt{OuterLex.token},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	692	\texttt{OuterParse.token} and
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	693	\texttt{SpecParse.token} are all the same).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	694
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	695	Input text is split up into tokens, and the input source type for many parsing
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	696	functions is \texttt{token list}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	697
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	698	The datatype definition (which is not published in the signature) is
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	699	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	700	datatype token = Token of Position.T * (token_kind * string);
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	701	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	702	but here are some runnable examples for viewing tokens:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	703
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	704	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	705
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	706
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	707
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	708
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	709	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	710	val toks = OuterSyntax.scan Position.none
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	711	"theory,imports;begin x.y.z apply ?v1 ?'a 'a -- \|\| 44 simp (* xx ) { fff * }" ;
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	712	*}
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	713
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	714	ML {*
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	715	print_depth 20 ;
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	716	*}
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	717
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	718	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	719	map OuterLex.text_of toks ;
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	720	*}
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	721
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	722	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	723	val proper_toks = filter OuterLex.is_proper toks ;
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	724	*}
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	725
47 4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	726	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	727	map OuterLex.kind_of proper_toks
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	728	*}
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	729
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	730	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	731	map OuterLex.unparse proper_toks ;
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	732	*}
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	733
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	734	ML {*
4daf913fdbe1 hakked latex so that it does not display ML {* }; general tuning Christian Urban <urbanc@in.tum.de>* parents: 44 diff changeset	735	OuterLex.stopper
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	736	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	737
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	738	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	739
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	740	The function \texttt{is\_proper : token -> bool} identifies tokens which are
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	741	not white space or comments: many parsing functions assume require spaces or
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	742	comments to have been filtered out.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	743
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	744	There is a special end-of-file token:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	745	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	746	val (tok_eof : token, is_eof : token -> bool) = T.stopper ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	747	(* end of file token *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	748	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	749
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	750	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	751
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	752	section {* The \texttt{OuterParse} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	753
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	754	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	755	The source file is \texttt{src/Pure/Isar/outer\_parse.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	756	In some other source files its name is abbreviated:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	757	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	758	structure P = OuterParse;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	759	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	760	Here the parsers use \texttt{token list} as the input source type.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	761
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	762	Some of the parsers simply select the first token, provided that it is of the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	763	right kind (as returned by \texttt{T.kind\_of}): these are
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	764	\texttt{ command, keyword, short\_ident, long\_ident, sym\_ident, term\_var,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	765	type\_ident, type\_var, number, string, alt\_string, verbatim, sync, eof}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	766	Others select the first token, provided that it is one of several kinds,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	767	(eg, \texttt{name, xname, text, typ}).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	768
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	769	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	770	type 'a tlp = token list -> 'a * token list ; (* token list parser *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	771	$$$ : string -> string tlp
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	772	nat : int tlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	773	maybe : 'a tlp -> 'a option tlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	774	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	775
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	776	\texttt{\$\$\$ s} returns the first token,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	777	if it equals \texttt{s} \emph{and} \texttt{s} is a keyword.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	778
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	779	\texttt{nat} returns the first token, if it is a number, and evaluates it.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	780
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	781	\texttt{maybe}: if \texttt{p} returns \texttt{r},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	782	then \texttt{maybe p} returns \texttt{SOME r} ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	783	if the first token is an underscore, it returns \texttt{NONE}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	784
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	785	A few examples:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	786	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	787	P.list : 'a tlp -> 'a list tlp ; (* likewise P.list1 *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	788	P.and_list : 'a tlp -> 'a list tlp ; (* likewise P.and_list1 *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	789	val toks : token list = OuterSyntax.scan "44 ,_, 66,77" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	790	val proper_toks = List.filter T.is_proper toks ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	791	P.list P.nat toks ; (* OK, doesn't recognize white space *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	792	P.list P.nat proper_toks ; (* fails, doesn't recognize what follows ',' *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	793	P.list (P.maybe P.nat) proper_toks ; (* fails, end of input *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	794	P.list (P.maybe P.nat) (proper_toks @ [tok_eof]) ; (* OK *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	795	val toks : token list = OuterSyntax.scan "44 and 55 and 66 and 77" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	796	P.and_list P.nat (List.filter T.is_proper toks @ [tok_eof]) ; (* ??? *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	797	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	798
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	799	The following code helps run examples:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	800	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	801	fun parse_str tlp str =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	802	let val toks : token list = OuterSyntax.scan str ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	803	val proper_toks = List.filter T.is_proper toks @ [tok_eof] ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	804	val (res, rem_toks) = tlp proper_toks ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	805	val rem_str = String.concat
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	806	(Library.separate " " (List.map T.unparse rem_toks)) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	807	in (res, rem_str) end ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	808	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	809
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	810	Some examples from \texttt{src/Pure/Isar/outer\_parse.ML}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	811	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	812	val type_args =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	813	type_ident >> Library.single \|\|
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	814	$$$ "(" \|-- !!! (list1 type_ident --\| $$$ ")") \|\|
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	815	Scan.succeed [];
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	816	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	817	There are three ways parsing a list of type arguments can succeed.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	818	The first line reads a single type argument, and turns it into a singleton
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	819	list.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	820	The second line reads "(", and then the remainder, ignoring the "(" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	821	the remainder consists of a list of type identifiers (at least one),
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	822	and then a ")" which is also ignored.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	823	The \texttt{!!!} ensures that if the parsing proceeds this far and then fails,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	824	it won't try the third line (see the description of \texttt{Scan.!!}).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	825	The third line consumes no input and returns the empty list.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	826
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	827	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	828	fun triple2 (x, (y, z)) = (x, y, z);
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	829	val arity = xname -- ($$$ "::" \|-- !!! (
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	830	Scan.optional ($$$ "(" \|-- !!! (list1 sort --\| $$$ ")")) []
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	831	-- sort)) >> triple2;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	832	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	833	The parser \texttt{arity} reads a typename $t$, then ``\texttt{::}'' (which is
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	834	ignored), then optionally a list $ss$ of sorts and then another sort $s$.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	835	The result $(t, (ss, s))$ is transformed by \texttt{triple2} to $(t, ss, s)$.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	836	The second line reads the optional list of sorts:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	837	it reads first ``\texttt{(}'' and last ``\texttt{)}'', which are both ignored,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	838	and between them a comma-separated list of sorts.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	839	If this list is absent, the default \texttt{[]} provides the list of sorts.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	840
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	841	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	842	parse_str P.type_args "('a, 'b) ntyp" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	843	parse_str P.type_args "'a ntyp" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	844	parse_str P.type_args "ntyp" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	845	parse_str P.arity "ty :: tycl" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	846	parse_str P.arity "ty :: (tycl1, tycl2) tycl" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	847	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	848
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	849	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	850
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	851	section {* The \texttt{SpecParse} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	852
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	853	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	854	The source file is \texttt{src/Pure/Isar/spec\_parse.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	855	This structure contains token list parsers for more complicated values.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	856	For example,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	857	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	858	open SpecParse ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	859	attrib : Attrib.src tok_rdr ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	860	attribs : Attrib.src list tok_rdr ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	861	opt_attribs : Attrib.src list tok_rdr ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	862	xthm : (thmref * Attrib.src list) tok_rdr ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	863	xthms1 : (thmref * Attrib.src list) list tok_rdr ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	864
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	865	parse_str attrib "simp" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	866	parse_str opt_attribs "hello" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	867	val (ass, "") = parse_str attribs "[standard, xxxx, simp, intro, OF sym]" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	868	map Args.dest_src ass ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	869	val (asrc, "") = parse_str attrib "THEN trans [THEN sym]" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	870
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	871	parse_str xthm "mythm [attr]" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	872	parse_str xthms1 "thm1 [attr] thms2" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	873	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	874
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	875	As you can see, attributes are described using types of the \texttt{Args}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	876	structure, described below.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	877	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	878
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	879	section{* The \texttt{Args} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	880
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	881	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	882	The source file is \texttt{src/Pure/Isar/args.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	883	The primary type of this structure is the \texttt{src} datatype;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	884	the single constructors not published in the signature, but
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	885	\texttt{Args.src} and \texttt{Args.dest\_src}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	886	are in fact the constructor and destructor functions.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	887	Note that the types \texttt{Attrib.src} and \texttt{Method.src}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	888	are in fact \texttt{Args.src}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	889
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	890	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	891	src : (string * Args.T list) * Position.T -> Args.src ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	892	dest_src : Args.src -> (string * Args.T list) * Position.T ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	893	Args.pretty_src : Proof.context -> Args.src -> Pretty.T ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	894	fun pr_src ctxt src = Pretty.string_of (Args.pretty_src ctxt src) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	895
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	896	val thy = ML_Context.the_context () ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	897	val ctxt = ProofContext.init thy ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	898	map (pr_src ctxt) ass ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	899	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	900
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	901	So an \texttt{Args.src} consists of the first word, then a list of further
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	902	``arguments'', of type \texttt{Args.T}, with information about position in the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	903	input.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	904	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	905	(* how an Args.src is parsed *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	906	P.position : 'a tlp -> ('a * Position.T) tlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	907	P.arguments : Args.T list tlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	908
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	909	val parse_src : Args.src tlp =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	910	P.position (P.xname -- P.arguments) >> Args.src ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	911	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	912
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	913	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	914	val ((first_word, args), pos) = Args.dest_src asrc ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	915	map Args.string_of args ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	916	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	917
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	918	The \texttt{Args} structure contains more parsers and parser transformers
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	919	for which the input source type is \texttt{Args.T list}. For example,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	920	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	921	type 'a atlp = Args.T list -> 'a * Args.T list ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	922	open Args ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	923	nat : int atlp ; (* also Args.int *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	924	thm_sel : PureThy.interval list atlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	925	list : 'a atlp -> 'a list atlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	926	attribs : (string -> string) -> Args.src list atlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	927	opt_attribs : (string -> string) -> Args.src list atlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	928
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	929	(* parse_atl_str : 'a atlp -> (string -> 'a * string) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	930	given an Args.T list parser, to get a string parser *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	931	fun parse_atl_str atlp str =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	932	let val (ats, rem_str) = parse_str P.arguments str ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	933	val (res, rem_ats) = atlp ats ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	934	in (res, String.concat (Library.separate " "
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	935	(List.map Args.string_of rem_ats @ [rem_str]))) end ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	936
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	937	parse_atl_str Args.int "-1-," ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	938	parse_atl_str (Scan.option Args.int) "x1-," ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	939	parse_atl_str Args.thm_sel "(1-,4,13-22)" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	940
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	941	val (ats as atsrc :: _, "") = parse_atl_str (Args.attribs I)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	942	"[THEN trans [THEN sym], simp, OF sym]" ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	943	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	944
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	945	From here, an attribute is interpreted using \texttt{Attrib.attribute}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	946
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	947	\texttt{Args} has a large number of functions which parse an \texttt{Args.src}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	948	and also refer to a generic context.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	949	Note the use of \texttt{Scan.lift} for this.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	950	(as does \texttt{Attrib} - RETHINK THIS)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	951
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	952	(\texttt{Args.syntax} shown below has type specialised)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	953
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	954	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	955	type ('res, 'src) parse_fn = 'src -> 'res * 'src ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	956	type 'a cgatlp = ('a, Context.generic * Args.T list) parse_fn ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	957	Scan.lift : 'a atlp -> 'a cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	958	term : term cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	959	typ : typ cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	960
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	961	Args.syntax : string -> 'res cgatlp -> src -> ('res, Context.generic) parse_fn ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	962	Attrib.thm : thm cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	963	Attrib.thms : thm list cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	964	Attrib.multi_thm : thm list cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	965
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	966	(* parse_cgatl_str : 'a cgatlp -> (string -> 'a * string) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	967	given a (Context.generic * Args.T list) parser, to get a string parser *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	968	fun parse_cgatl_str cgatlp str =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	969	let
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	970	(* use the current generic context *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	971	val generic = Context.Theory thy ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	972	val (ats, rem_str) = parse_str P.arguments str ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	973	(* ignore any change to the generic context *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	974	val (res, (_, rem_ats)) = cgatlp (generic, ats) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	975	in (res, String.concat (Library.separate " "
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	976	(List.map Args.string_of rem_ats @ [rem_str]))) end ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	977	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	978	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	979
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	980	section{* Attributes, and the \texttt{Attrib} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	981
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	982	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	983	The type \texttt{attribute} is declared in \texttt{src/Pure/thm.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	984	The source file for the \texttt{Attrib} structure is
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	985	\texttt{src/Pure/Isar/attrib.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	986	Most attributes use a theorem to change a generic context (for example,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	987	by declaring that the theorem should be used, by default, in simplification),
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	988	or change a theorem (which most often involves referring to the current
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	989	theory).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	990	The functions \texttt{Thm.rule\_attribute} and
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	991	\texttt{Thm.declaration\_attribute} create attributes of these kinds.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	992
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	993	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	994	type attribute = Context.generic * thm -> Context.generic * thm;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	995	type 'a trf = 'a -> 'a ; (* transformer of a given type *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	996	Thm.rule_attribute : (Context.generic -> thm -> thm) -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	997	Thm.declaration_attribute : (thm -> Context.generic trf) -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	998
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	999	Attrib.print_attributes : theory -> unit ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1000	Attrib.pretty_attribs : Proof.context -> src list -> Pretty.T list ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1001
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1002	List.app Pretty.writeln (Attrib.pretty_attribs ctxt ass) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1003	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1004
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1005	An attribute is stored in a theory as indicated by:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1006	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1007	Attrib.add_attributes :
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1008	(bstring * (src -> attribute) * string) list -> theory trf ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1009	(*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1010	Attrib.add_attributes [("THEN", THEN_att, "resolution with rule")] ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1011	*)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1012	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1013	where the first and third arguments are name and description of the attribute,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1014	and the second is a function which parses the attribute input text
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1015	(including the attribute name, which has necessarily already been parsed).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1016	Here, \texttt{THEN\_att} is a function declared in the code for the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1017	structure \texttt{Attrib}, but not published in its signature.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1018	The source file \texttt{src/Pure/Isar/attrib.ML} shows the use of
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1019	\texttt{Attrib.add\_attributes} to add a number of attributes.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1020
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1021	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1022	FullAttrib.THEN_att : src -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1023	FullAttrib.THEN_att atsrc (generic, ML_Context.thm "sym") ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1024	FullAttrib.THEN_att atsrc (generic, ML_Context.thm "all_comm") ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1025	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1026
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1027	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1028	Attrib.syntax : attribute cgatlp -> src -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1029	Attrib.no_args : attribute -> src -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1030	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1031	When this is called as \texttt{syntax scan src (gc, th)}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1032	the generic context \texttt{gc} is used
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1033	(and potentially changed to \texttt{gc'})
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1034	by \texttt{scan} in parsing to obtain an attribute \texttt{attr} which would
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1035	then be applied to \texttt{(gc', th)}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1036	The source for parsing the attribute is the arguments part of \texttt{src},
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1037	which must all be consumed by the parse.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1038
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1039	For example, for \texttt{Attrib.no\_args attr src}, the attribute parser
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1040	simply returns \texttt{attr}, requiring that the arguments part of
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1041	\texttt{src} must be empty.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1042
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1043	Some examples from \texttt{src/Pure/Isar/attrib.ML}, modified:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1044	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1045	fun rot_att_n n (gc, th) = (gc, rotate_prems n th) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1046	rot_att_n : int -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1047	val rot_arg = Scan.lift (Scan.optional Args.int 1 : int atlp) : int cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1048	val rotated_att : src -> attribute =
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1049	Attrib.syntax (rot_arg >> rot_att_n : attribute cgatlp) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1050
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1051	val THEN_arg : int cgatlp = Scan.lift
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1052	(Scan.optional (Args.bracks Args.nat : int atlp) 1 : int atlp) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1053
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1054	Attrib.thm : thm cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1055
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1056	THEN_arg -- Attrib.thm : (int * thm) cgatlp ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1057
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1058	fun THEN_att_n (n, tht) (gc, th) = (gc, th RSN (n, tht)) ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1059	THEN_att_n : int * thm -> attribute ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1060
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1061	val THEN_att : src -> attribute = Attrib.syntax
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1062	(THEN_arg -- Attrib.thm >> THEN_att_n : attribute cgatlp);
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1063	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1064	The functions I've called \texttt{rot\_arg} and \texttt{THEN\_arg}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1065	read an optional argument, which for \texttt{rotated} is an integer,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1066	and for \texttt{THEN} is a natural enclosed in square brackets;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1067	the default, if the argument is absent, is 1 in each case.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1068	Functions \texttt{rot\_att\_n} and \texttt{THEN\_att\_n} turn these into
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1069	attributes, where \texttt{THEN\_att\_n} also requires a theorem, which is
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1070	parsed by \texttt{Attrib.thm}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1071	Infix operators \texttt{--} and \texttt{>>} are in the structure \texttt{Scan}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1072
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1073	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1074
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1075	section{* Methods, and the \texttt{Method} structure *}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1076
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1077	text {*
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1078	The source file is \texttt{src/Pure/Isar/method.ML}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1079	The type \texttt{method} is defined by the datatype declaration
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1080	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1081	(* datatype method = Meth of thm list -> cases_tactic; *)
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1082	RuleCases.NO_CASES : tactic -> cases_tactic ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1083	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1084	In fact \texttt{RAW\_METHOD\_CASES} (below) is exactly the constructor
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1085	\texttt{Meth}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1086	A \texttt{cases\_tactic} is an elaborated version of a tactic.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1087	\texttt{NO\_CASES tac} is a \texttt{cases\_tactic} which consists of a
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1088	\texttt{cases\_tactic} without any further case information.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1089	For further details see the description of structure \texttt{RuleCases} below.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1090	The list of theorems to be passed to a method consists of the current
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1091	\emph{facts} in the proof.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1092
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1093	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1094	RAW_METHOD : (thm list -> tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1095	METHOD : (thm list -> tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1096
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1097	SIMPLE_METHOD : tactic -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1098	SIMPLE_METHOD' : (int -> tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1099	SIMPLE_METHOD'' : ((int -> tactic) -> tactic) -> (int -> tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1100
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1101	RAW_METHOD_CASES : (thm list -> cases_tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1102	METHOD_CASES : (thm list -> cases_tactic) -> method ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1103	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1104	A method is, in its simplest form, a tactic; applying the method is to apply
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1105	the tactic to the current goal state.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1106
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1107	Applying \texttt{RAW\_METHOD tacf} creates a tactic by applying
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1108	\texttt{tacf} to the current {facts}, and applying that tactic to the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1109	goal state.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1110
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1111	\texttt{METHOD} is similar but also first applies
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1112	\texttt{Goal.conjunction\_tac} to all subgoals.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1113
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1114	\texttt{SIMPLE\_METHOD tac} inserts the facts into all subgoals and then
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1115	applies \texttt{tacf}.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1116
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1117	\texttt{SIMPLE\_METHOD' tacf} inserts the facts and then
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1118	applies \texttt{tacf} to subgoal 1.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1119
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1120	\texttt{SIMPLE\_METHOD'' quant tacf} does this for subgoal(s) selected by
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1121	\texttt{quant}, which may be, for example,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1122	\texttt{ALLGOALS} (all subgoals),
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1123	\texttt{TRYALL} (try all subgoals, failure is OK),
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1124	\texttt{FIRSTGOAL} (try subgoals until it succeeds once),
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1125	\texttt{(fn tacf => tacf 4)} (subgoal 4), etc
16 5045dec52d2b polished Christian Urban <urbanc@in.tum.de> parents: 4 diff changeset	1126	(see the \texttt{Tactical} structure, FIXME) %%\cite[Chapter 4]{ref}).
4 2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1127
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1128	A method is stored in a theory as indicated by:
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1129	\begin{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1130	Method.add_method :
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1131	(bstring * (src -> Proof.context -> method) * string) -> theory trf ;
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1132	( *
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1133	* )
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1134	\end{verbatim}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1135	where the first and third arguments are name and description of the method,
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1136	and the second is a function which parses the method input text
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1137	(including the method name, which has necessarily already been parsed).
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1138
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1139	Here, \texttt{xxx} is a function declared in the code for the
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1140	structure \texttt{Method}, but not published in its signature.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1141	The source file \texttt{src/Pure/Isar/method.ML} shows the use of
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1142	\texttt{Method.add\_method} to add a number of methods.
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1143
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1144
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1145	*}
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1146
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1147
2a69b119cdee added verbatim the notes by Jeremy Christian Urban <urbanc@in.tum.de> parents: diff changeset	1148	end

author	Christian Urban <urbanc@in.tum.de>
	Sat, 13 Dec 2008 01:33:22 +0000 (2008-12-13)
changeset 54	1783211b3494
parent 53	0c3580c831a4
child 56	126646f2aa88
permissions	-rw-r--r--