author | urbanc |
Tue, 14 Feb 2012 00:11:17 +0000 | |
changeset 323 | eee031cc9634 |
parent 322 | c37b387110d0 |
child 324 | 41e4b331ce08 |
permissions | -rwxr-xr-x |
(*<*)
theory Paper
imports "../CpsG" "../ExtGG" "~~/src/HOL/Library/LaTeXsugar"
begin
ML {*
  open Printer;
  show_question_marks_default := false;
  *}

notation (latex output)
  Cons ("_::_" [78,77] 73) and
  vt ("valid'_state") and
  runing ("running") and
  birthtime ("last'_set") and
  If ("(\<^raw:\textrm{>if\<^raw:}> (_)/ \<^raw:\textrm{>then\<^raw:}> (_)/ \<^raw:\textrm{>else\<^raw:}> (_))" 10) and
  Prc ("'(_, _')") and
  holding ("holds") and
  waiting ("waits") and
  Th ("T") and
  Cs ("C") and
  readys ("ready") and
  depend ("RAG") and
  preced ("prec") and
  cpreced ("cprec") and
  dependents ("dependants") and
  cp ("cprec") and
  holdents ("resources") and
  original_priority ("priority") and
  DUMMY ("\<^raw:\mbox{$\_\!\_$}>")
(*>*)

section {* Introduction *}

text {*
  Many real-time systems need to support threads involving priorities and
  locking of resources. Locking of resources ensures mutual exclusion
  when accessing shared data or devices that cannot be
  preempted. Priorities allow scheduling of threads that need to
  finish their work within deadlines. Unfortunately, both features
  can interact in subtle ways leading to a problem, called
  \emph{Priority Inversion}. Suppose three threads having priorities
  $H$(igh), $M$(edium) and $L$(ow). We would expect that the thread
  $H$ blocks any other thread with lower priority and itself cannot
  be blocked by any thread with lower priority. Alas, in a naive
  implementation of resource locking and priorities this property can
  be violated. Even worse, $H$ can be delayed indefinitely by
  threads with lower priorities. For this let $L$ be in the
  possession of a lock for a resource that $H$ also needs. $H$ must
  therefore wait for $L$ to exit the critical section and release this
  lock. The problem is that $L$ might in turn be blocked by any
  thread with priority $M$, and so $H$ sits there potentially waiting
  indefinitely. Since $H$ is blocked by threads with lower
  priorities, the problem is called Priority Inversion. It was first
  described in \cite{Lampson80} in the context of the
  Mesa programming language designed for concurrent programming.
|
  If the problem of Priority Inversion is ignored, real-time systems
  can become unpredictable and resulting bugs can be hard to diagnose.
  The classic example where this happened is the software that
  controlled the Mars Pathfinder mission in 1997 \cite{Reeves98}.
  Once the spacecraft landed, the software shut down at irregular
  intervals, leading to loss of project time as normal operation of the
  craft could only resume the next day (the mission and data already
  collected were fortunately not lost, because of a clever system
  design). The reason for the shutdowns was that the scheduling
  software fell victim to Priority Inversion: a low priority thread
  locking a resource prevented a high priority thread from running in
  time, leading to a system reset. Once the problem was found, it was
  rectified by enabling the \emph{Priority Inheritance Protocol} (PIP)
  \cite{Sha90}\footnote{Sha et al.~call it the \emph{Basic Priority
  Inheritance Protocol} \cite{Sha90} and others sometimes also call it
  \emph{Priority Boosting}.} in the scheduling software.
|
  The idea behind PIP is to let the thread $L$ temporarily inherit
  the high priority from $H$ until $L$ leaves the critical section
  and unlocks the resource. This solves the problem of $H$ having to
  wait indefinitely, because $L$ can no longer be blocked by threads having
  priority $M$. While a few other solutions exist for the Priority
  Inversion problem, PIP is one that is widely deployed and
  implemented. This includes VxWorks (a proprietary real-time OS used
  in the Mars Pathfinder mission, in Boeing's 787 Dreamliner, Honda's
  ASIMO robot, etc.), but also the POSIX 1003.1c Standard realised for
  example in libraries for FreeBSD, Solaris and Linux.
|
  One advantage of PIP is that increasing the priority of a thread
  can be dynamically calculated by the scheduler. This is in contrast
  to, for example, \emph{Priority Ceiling} \cite{Sha90}, another
  solution to the Priority Inversion problem, which requires static
  analysis of the program in order to prevent Priority
  Inversion. However, there has also been strong criticism of
  PIP. For instance, PIP cannot prevent deadlocks when lock
  dependencies are circular, and blocking times can also be
  substantial (more than just the duration of a critical section).
  Most criticism of PIP, though, centres around unreliable
  implementations and PIP being too complicated and too inefficient.
  For example, Yodaiken writes in \cite{Yodaiken02}:
|
  \begin{quote}
  \it{}``Priority inheritance is neither efficient nor reliable. Implementations
  are either incomplete (and unreliable) or surprisingly complex and intrusive.''
  \end{quote}
|
  \noindent
  He suggests avoiding PIP altogether by not allowing critical
  sections to be preempted. Unfortunately, this solution does not
  help in real-time systems with hard deadlines for high-priority
  threads.
|
  In our opinion, there is clearly a need for investigating correct
  algorithms for PIP. A few specifications for PIP exist (in English)
  and also a few high-level descriptions of implementations (e.g.~in
  the textbook \cite[Section 5.6.5]{Vahalia96}), but they help little
  with actual implementations. That this is a problem in practice is
  shown by an email from Baker, who wrote on 13 July 2009 on the Linux
  Kernel mailing list:
|
  \begin{quote}
  \it{}``I observed in the kernel code (to my disgust), the Linux PIP
  implementation is a nightmare: extremely heavy weight, involving
  maintenance of a full wait-for graph, and requiring updates for a
  range of events, including priority changes and interruptions of
  wait operations.''
  \end{quote}

  \noindent
  The criticism by Yodaiken, Baker and others suggests that we look
  again at PIP at a more abstract level (but still concrete enough
  to inform an implementation), and makes PIP an ideal candidate for a
  formal verification. One reason, of course, is that the original
  presentation of PIP~\cite{Sha90}, despite being informally
  ``proved'' correct, is actually \emph{flawed}.

  Yodaiken \cite{Yodaiken02} points to a subtlety that had been
  overlooked in the informal proof by Sha et al. They specify in
  \cite{Sha90} that after the thread (whose priority has been raised)
  completes its critical section and releases the lock, it ``returns
  to its original priority level.'' This leads them to believe that an
  implementation of PIP is ``rather straightforward''~\cite{Sha90}.
  Unfortunately, as Yodaiken points out, this behaviour is too
  simplistic. Consider the case where the low priority thread $L$
  locks \emph{two} resources, and two high-priority threads $H$ and
  $H'$ each wait for one of them. If $L$ releases one resource
  so that $H$, say, can proceed, then we still have Priority Inversion
  with $H'$ (which waits for the other resource). The correct
  behaviour for $L$ is to revert to the highest remaining priority of
  the threads that it blocks. The advantage of formalising the
  correctness of a high-level specification of PIP in a theorem prover
  is that such issues clearly show up and cannot be overlooked as in
  informal reasoning (since we have to analyse all possible behaviours
  of threads, i.e.~\emph{traces}, that could possibly happen).\medskip
|
  \noindent
  {\bf Contributions:} There have been earlier formal investigations
  into PIP \cite{Faria08,Jahier09,Wellings07}, but they employ model
  checking techniques. This paper presents a formalised and
  mechanically checked proof for the correctness of PIP (to our
  knowledge the first one; the earlier informal proof by Sha et
  al.~\cite{Sha90} is flawed). In contrast to model checking, our
  formalisation provides insight into why PIP is correct and allows us
  to prove stronger properties that, as we will show, can inform an
  efficient implementation. For example, we found by ``playing'' with the formalisation
  that the choice of the next thread to take over a lock when a
  resource is released is irrelevant for PIP being correct, something
  which has not been mentioned in the relevant literature.
*}
|
section {* Formal Model of the Priority Inheritance Protocol *}
|
text {*
  The Priority Inheritance Protocol, short PIP, is a scheduling
  algorithm for a single-processor system.\footnote{We shall come back
  later to the case of PIP on multi-processor systems.} Our model of
  PIP is based on Paulson's inductive approach to protocol
  verification \cite{Paulson98}, where the \emph{state} of a system is
  given by a list of events that happened so far. \emph{Events} of PIP fall
  into five categories defined as the datatype:
|
  \begin{isabelle}\ \ \ \ \ %%%
  \mbox{\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l@ {\hspace{7mm}}l}
  \isacommand{datatype} event
  & @{text "="} & @{term "Create thread priority"}\\
  & @{text "|"} & @{term "Exit thread"} \\
  & @{text "|"} & @{term "Set thread priority"} & {\rm reset of the priority for} @{text thread}\\
  & @{text "|"} & @{term "P thread cs"} & {\rm request of resource} @{text "cs"} {\rm by} @{text "thread"}\\
  & @{text "|"} & @{term "V thread cs"} & {\rm release of resource} @{text "cs"} {\rm by} @{text "thread"}
  \end{tabular}}
  \end{isabelle}

  \noindent
  whereby threads, priorities and (critical) resources are represented
  as natural numbers. The event @{term Set} models the situation that
  a thread obtains a new priority given by the programmer or
  user (for example via the {\tt nice} utility under UNIX). As in Paulson's work, we
  need to define functions that allow us to make some observations
  about states. One, called @{term threads}, calculates the set of
  ``live'' threads that we have seen so far:
|
  \begin{isabelle}\ \ \ \ \ %%%
  \mbox{\begin{tabular}{lcl}
  @{thm (lhs) threads.simps(1)} & @{text "\<equiv>"} &
    @{thm (rhs) threads.simps(1)}\\
  @{thm (lhs) threads.simps(2)[where thread="th"]} & @{text "\<equiv>"} &
    @{thm (rhs) threads.simps(2)[where thread="th"]}\\
  @{thm (lhs) threads.simps(3)[where thread="th"]} & @{text "\<equiv>"} &
    @{thm (rhs) threads.simps(3)[where thread="th"]}\\
  @{term "threads (DUMMY#s)"} & @{text "\<equiv>"} & @{term "threads s"}\\
  \end{tabular}}
  \end{isabelle}

  \noindent
  In this definition @{term "DUMMY # DUMMY"} stands for list-cons.
  Another function calculates the priority for a thread @{text "th"}, which is
  defined as
|
  \begin{isabelle}\ \ \ \ \ %%%
  \mbox{\begin{tabular}{lcl}
  @{thm (lhs) original_priority.simps(1)[where thread="th"]} & @{text "\<equiv>"} &
    @{thm (rhs) original_priority.simps(1)[where thread="th"]}\\
  @{thm (lhs) original_priority.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
    @{thm (rhs) original_priority.simps(2)[where thread="th" and thread'="th'"]}\\
  @{thm (lhs) original_priority.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
    @{thm (rhs) original_priority.simps(3)[where thread="th" and thread'="th'"]}\\
  @{term "original_priority th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "original_priority th s"}\\
  \end{tabular}}
  \end{isabelle}

  \noindent
  In this definition we set @{text 0} as the default priority for
  threads that have not (yet) been created. The last function we need
  calculates the ``time'', or index, at which a thread had its
  priority last set.
|
  \begin{isabelle}\ \ \ \ \ %%%
  \mbox{\begin{tabular}{lcl}
  @{thm (lhs) birthtime.simps(1)[where thread="th"]} & @{text "\<equiv>"} &
    @{thm (rhs) birthtime.simps(1)[where thread="th"]}\\
  @{thm (lhs) birthtime.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
    @{thm (rhs) birthtime.simps(2)[where thread="th" and thread'="th'"]}\\
  @{thm (lhs) birthtime.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
    @{thm (rhs) birthtime.simps(3)[where thread="th" and thread'="th'"]}\\
  @{term "birthtime th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "birthtime th s"}\\
  \end{tabular}}
  \end{isabelle}
|
  \noindent
  In this definition @{term "length s"} stands for the length of the list
  of events @{text s}. Again the default value in this function is @{text 0}
  for threads that have not been created yet. The \emph{precedence} of a thread @{text th} in a
  state @{text s} is the pair of natural numbers defined as
|
  \begin{isabelle}\ \ \ \ \ %%%
  @{thm preced_def[where thread="th"]}
  \end{isabelle}

  \noindent
  The point of precedences is to schedule threads not according to priorities (because what should
  we do in case two threads have the same priority?), but according to precedences.
  Precedences allow us to always discriminate between two threads with equal priority by
  taking into account the time when the priority was last set. We order precedences so
  that threads with the same priority get a higher precedence if their priority has been
  set earlier, since for such threads it is more urgent to finish their work. In an implementation
  this choice would translate to a quite natural FIFO-scheduling of processes with
  the same priority.

  Next, we introduce the concept of \emph{waiting queues}. They are
  lists of threads associated with every resource. The first thread in
  this list (i.e.~the head, or short @{term hd}) is chosen to be the one
  that is in possession of the
  ``lock'' of the corresponding resource. We model waiting queues as
  functions, below abbreviated as @{text wq}. They take a resource as
  argument and return a list of threads. This allows us to define
  when a thread \emph{holds}, respectively \emph{waits} for, a
  resource @{text cs} given a waiting queue function @{text wq}.
|
  \begin{isabelle}\ \ \ \ \ %%%
  \begin{tabular}{@ {}l}
  @{thm cs_holding_def[where thread="th"]}\\
  @{thm cs_waiting_def[where thread="th"]}
  \end{tabular}
  \end{isabelle}

  \noindent
  In this definition we assume @{text "set"} converts a list into a set.
  At the beginning, that is in the state where no thread is created yet,
  the waiting queue function will be the function that returns the
  empty list for every resource.

  \begin{isabelle}\ \ \ \ \ %%%
  @{abbrev all_unlocked}\hfill\numbered{allunlocked}
  \end{isabelle}
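
  As an informal illustration of these definitions, the sketch below
  models waiting queue functions as Python functions from resources to
  lists of threads. The reading of holding and waiting (a thread holds a
  resource if it is the head of the queue, and waits if it is in the
  queue but not at its head) follows the description above; the concrete
  names and the example queue are our own assumptions.

```python
def all_unlocked(cs):
    """Initial waiting queue function: every resource has an empty queue."""
    return []

def holds(wq, th, cs):
    """th holds cs if it is at the head of the waiting queue of cs."""
    queue = wq(cs)
    return th in queue and th == queue[0]

def waits(wq, th, cs):
    """th waits for cs if it is in the queue of cs but not at its head."""
    queue = wq(cs)
    return th in queue and th != queue[0]

# Hypothetical example: resource 1 is held by thread 0; threads 1 and 2 wait.
queues = {1: [0, 1, 2]}

def wq(cs):
    return queues.get(cs, [])

print(holds(wq, 0, 1), waits(wq, 2, 1), holds(all_unlocked, 0, 1))
```

  With all resources unlocked no thread holds or waits for anything,
  which matches the initial waiting queue function.
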

  \noindent
  Using @{term "holding"} and @{term waiting}, we can introduce \emph{Resource Allocation Graphs}
  (RAG), which represent the dependencies between threads and resources.
  We represent RAGs as relations using pairs of the form

  \begin{isabelle}\ \ \ \ \ %%%
  @{term "(Th th, Cs cs)"} \hspace{5mm}{\rm and}\hspace{5mm}
  @{term "(Cs cs, Th th)"}
  \end{isabelle}

  \noindent
  where the first stands for a \emph{waiting edge} and the second for a
  \emph{holding edge} (@{term Cs} and @{term Th} are constructors of a
  datatype for vertices). Given a waiting queue function, a RAG is defined
  as the union of the sets of waiting and holding edges, namely
|
  \begin{isabelle}\ \ \ \ \ %%%
  @{thm cs_depend_def}
  \end{isabelle}

  \noindent
  Given four threads and three resources, an instance of a RAG can be pictured
  as follows:
|
  \begin{center}
  \newcommand{\fnt}{\fontsize{7}{8}\selectfont}
  \begin{tikzpicture}[scale=1]
  %%\draw[step=2mm] (-3,2) grid (1,-1);

  \node (A) at (0,0) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>0"}};
  \node (B) at (2,0) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>1"}};
  \node (C) at (4,0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>1"}};
  \node (D) at (4,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>2"}};
  \node (E) at (6,-0.7) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>2"}};
  \node (E1) at (6, 0.2) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>3"}};
  \node (F) at (8,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>3"}};

  \draw [<-,line width=0.6mm] (A) to node [pos=0.54,sloped,above=-0.5mm] {\fnt{}holding} (B);
  \draw [->,line width=0.6mm] (C) to node [pos=0.4,sloped,above=-0.5mm] {\fnt{}waiting} (B);
  \draw [->,line width=0.6mm] (D) to node [pos=0.4,sloped,below=-0.5mm] {\fnt{}waiting} (B);
  \draw [<-,line width=0.6mm] (D) to node [pos=0.54,sloped,below=-0.5mm] {\fnt{}holding} (E);
  \draw [<-,line width=0.6mm] (D) to node [pos=0.54,sloped,above=-0.5mm] {\fnt{}holding} (E1);
  \draw [->,line width=0.6mm] (F) to node [pos=0.45,sloped,below=-0.5mm] {\fnt{}waiting} (E);
  \end{tikzpicture}
  \end{center}

  \noindent
  The use of relations for representing RAGs allows us to conveniently define
  the notion of the \emph{dependants} of a thread using the transitive closure
  operation for relations. This gives
|
  \begin{isabelle}\ \ \ \ \ %%%
  @{thm cs_dependents_def}
  \end{isabelle}

  \noindent
  This definition needs to account for all threads that wait for a thread to
  release a resource. This means we need to include threads that transitively
  wait for a resource being released (in the picture above this means the dependants
  of @{text "th\<^isub>0"} are @{text "th\<^isub>1"} and @{text "th\<^isub>2"}, which wait for resource @{text "cs\<^isub>1"},
  but also @{text "th\<^isub>3"},
  which cannot make any progress unless @{text "th\<^isub>2"} makes progress, which
  in turn needs to wait for @{text "th\<^isub>0"} to finish). If there is a cycle in a RAG, then clearly
  we have a deadlock. Therefore when a thread requests a resource,
  we must ensure that the resulting RAG is not circular.
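
  The RAG pictured above and its dependants can be reproduced with a
  small Python sketch. It is an illustration under our own assumptions:
  instead of a total waiting queue function we use a finite dictionary
  from resources to thread lists, vertices are encoded as tagged pairs,
  and the transitive closure is computed by a backward search. The
  function names are hypothetical.

```python
def rag(queues):
    """Edges of the RAG for a finite map from resources to thread lists."""
    edges = set()
    for cs, queue in queues.items():
        for th in queue:
            if th == queue[0]:
                edges.add((("C", cs), ("T", th)))  # holding edge
            else:
                edges.add((("T", th), ("C", cs)))  # waiting edge
    return edges

def dependants(queues, th):
    """Threads th' with a non-empty path from T th' to T th in the RAG."""
    edges = rag(queues)
    reached, todo = set(), [("T", th)]
    while todo:
        node = todo.pop()
        for src, dst in edges:
            if dst == node and src not in reached:
                reached.add(src)
                todo.append(src)
    return {name for kind, name in reached if kind == "T"}

def is_circular(queues):
    """A thread that depends on itself witnesses a cycle, hence a deadlock."""
    all_threads = {th for queue in queues.values() for th in queue}
    return any(th in dependants(queues, th) for th in all_threads)

# The pictured RAG: thread 0 holds resource 1, threads 1 and 2 wait for it;
# thread 2 holds resources 2 and 3, and thread 3 waits for resource 2.
pictured = {1: [0, 1, 2], 2: [2, 3], 3: [2]}
print(dependants(pictured, 0))
print(is_circular(pictured))
print(is_circular({1: [0, 1], 2: [1, 0]}))
```

  For the pictured graph the dependants of thread 0 are the threads 1, 2
  and 3, and the graph is not circular. The last call shows two threads
  that each hold one resource and wait for the other, which is reported
  as circular.
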

  Next we introduce the notion of the \emph{current precedence} of a thread @{text th} in a
  state @{text s}. It is defined as
|
  \begin{isabelle}\ \ \ \ \ %%%
  @{thm cpreced_def2}\hfill\numbered{cpreced}
  \end{isabelle}

  \noindent
  where the dependants of @{text th} are given by the waiting queue function.
  While the precedence @{term prec} of a thread is determined by the programmer
  (for example when the thread is
  created), the point of the current precedence is to let the scheduler increase this
  precedence, if needed according to PIP. Therefore the current precedence of @{text th} is
  given as the maximum of the precedence @{text th} has in state @{text s} \emph{and} the
  precedences of all threads that are dependants of @{text th}. Since the notion @{term "dependants"} is
  defined as the transitive closure of all dependent threads, we deal correctly with the
  problem in the informal algorithm by Sha et al.~\cite{Sha90} where the priority of a thread is
  lowered prematurely.
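
  The effect of taking the maximum over all dependants can be seen in
  the scenario from the introduction, where $L$ holds two resources and
  $H$ and $H'$ each wait for one of them. The Python sketch below is an
  illustration only: it takes the precedences as a given dictionary
  instead of computing them from a state, uses a finite dictionary for
  the waiting queues, and all names and numbers are our own assumptions.

```python
def dependants(queues, th):
    """Threads that wait, directly or transitively, for a resource held by th."""
    found, todo = set(), [th]
    while todo:
        holder = todo.pop()
        for queue in queues.values():
            if queue and queue[0] == holder:
                for waiter in queue[1:]:
                    if waiter != holder and waiter not in found:
                        found.add(waiter)
                        todo.append(waiter)
    return found

def cprec(queues, prec, th):
    """Maximum precedence among th and its dependants."""
    candidates = [prec[th]] + [prec[d] for d in dependants(queues, th)]
    # Higher priority wins; for equal priorities the earlier time wins.
    return max(candidates, key=lambda pr: (pr[0], -pr[1]))

# Precedences as (priority, time last set) for L = 0, H = 1 and H' = 2.
prec = {0: (1, 0), 1: (5, 1), 2: (4, 2)}

# L holds resources 10 and 11; H waits for 10 and H' waits for 11.
before = {10: [0, 1], 11: [0, 2]}
# L has released resource 10, which H now holds; H' still waits for 11.
after = {10: [1], 11: [0, 2]}

print(cprec(before, prec, 0))
print(cprec(after, prec, 0))
```

  Before the release $L$ runs with the precedence of $H$. After the
  release it does not drop back to its own precedence, but to the one of
  $H'$, which it still blocks.
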

  The next function, called @{term schs}, defines the behaviour of the scheduler. It will be defined
  by recursion on the state (a list of events); this function returns a \emph{schedule state}, which
  we represent as a record consisting of two
  functions:
|
  \begin{isabelle}\ \ \ \ \ %%%
  @{text "\<lparr>wq_fun, cprec_fun\<rparr>"}
  \end{isabelle}

  \noindent
  The first function is a waiting queue function (that is, it takes a
  resource @{text "cs"} and returns the corresponding list of threads
  that lock, respectively wait for, it); the second is a function that
  takes a thread and returns its current precedence (see
  \eqref{cpreced}). We assume the usual getter and setter methods for
  such records.
|
  In the initial state, the scheduler starts with all resources unlocked (the corresponding
  function is defined in \eqref{allunlocked}) and the
  current precedence of every thread is initialised with @{term "Prc 0 0"}; that means
  \mbox{@{abbrev initial_cprec}}. Therefore
  we have for the initial state
|
  \begin{isabelle}\ \ \ \ \ %%%
  \begin{tabular}{@ {}l}
  @{thm (lhs) schs.simps(1)} @{text "\<equiv>"}\\
  \hspace{5mm}@{term "(|wq_fun = all_unlocked, cprec_fun = (\<lambda>_::thread. Prc 0 0)|)"}
  \end{tabular}
  \end{isabelle}

  \noindent
  The cases for @{term Create}, @{term Exit} and @{term Set} are also straightforward:
  we calculate the waiting queue function of the (previous) state @{text s};
  this waiting queue function @{text wq} is unchanged in the next schedule state, because
  none of these events locks or releases any resource;
  for calculating the next @{term "cprec_fun"}, we use @{text wq} and
  @{term cpreced}. This gives the following three clauses for @{term schs}:
|
  \begin{isabelle}\ \ \ \ \ %%%
  \begin{tabular}{@ {}l}
  @{thm (lhs) schs.simps(2)} @{text "\<equiv>"}\\
  \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
  \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Create th prio # s)|)"}\smallskip\\
  @{thm (lhs) schs.simps(3)} @{text "\<equiv>"}\\
  \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
  \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Exit th # s)|)"}\smallskip\\
  @{thm (lhs) schs.simps(4)} @{text "\<equiv>"}\\
  \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
  \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Set th prio # s)|)"}
  \end{tabular}
  \end{isabelle}
5ef9f6ebe827
more on paper; modified schs functions; it is still compatible with the old definition
urbanc
parents:
290
diff
changeset
|
428 |
|
\noindent
More interesting are the cases where a resource, say @{text cs}, is locked or released. In these cases
we need to calculate a new waiting queue function. For the event @{term "P th cs"}, we have to update
the function so that the new thread list for @{text cs} is the old thread list with the thread @{text th}
appended to its end (remember that the thread at the head of this list is the one in possession of the
resource). This gives the clause

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(5)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{5mm}@{text "let"} @{text "new_wq = wq(cs := (wq cs @ [th]))"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (P th cs # s)|)"}
\end{tabular}
\end{isabelle}
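
The queue update in the clause for @{term "P th cs"} can be illustrated outside Isabelle. The following is only a minimal sketch in Python, assuming that the waiting queue function is represented as a dictionary from resources to thread lists; the name @{text "lock_update"} and the string identifiers for threads and resources are hypothetical and not part of the formalisation.

```python
def lock_update(wq, th, cs):
    """Return a new waiting-queue map in which th is appended to the queue of cs.

    Mirrors the function update wq(cs := wq cs @ [th]); the input map is left
    untouched, as with a function update in HOL.
    """
    new_wq = dict(wq)                     # copy, so the previous state stays intact
    new_wq[cs] = wq.get(cs, []) + [th]    # resources never requested have an empty queue
    return new_wq


wq0 = {}
wq1 = lock_update(wq0, "th1", "cs")   # th1 is at the head of the list, so it holds cs
wq2 = lock_update(wq1, "th2", "cs")   # th2 is queued behind th1 and has to wait
```

Returning a fresh dictionary instead of mutating the old one keeps every earlier state available, which matches the view of a state as a list of events.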

\noindent
The clause for event @{term "V th cs"} is similar, except that we need to update the waiting queue function
so that the thread that possessed the lock is deleted from the corresponding thread list. For this
list transformation, we use
the auxiliary function @{term release}. A simple version of @{term release} would
just delete this thread and return the remaining threads, namely

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}lcl}
@{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
@{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "qs"}\\
\end{tabular}
\end{isabelle}

\noindent
In practice, however, the thread with the highest precedence in the list will often get the
lock next. We have implemented this choice, but later found out that the choice
of which thread is chosen next is actually irrelevant for the correctness of PIP.
Therefore we prove the stronger result where @{term release} is defined as

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}lcl}
@{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
@{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "SOME qs'. distinct qs' \<and> set qs' = set qs"}\\
\end{tabular}
\end{isabelle}

\noindent
where @{text "SOME"} stands for Hilbert's epsilon and implements an arbitrary
choice for the next waiting list. It just has to be a list of distinct threads and
contain the same elements as @{text "qs"}. This gives for @{term V} the clause:

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(6)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{5mm}@{text "let"} @{text "new_wq = wq(cs := release (wq cs))"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (V th cs # s)|)"}
\end{tabular}
\end{isabelle}
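
The two versions of @{term release} can be contrasted in a small sketch. It is written in Python and rests on assumptions that are not part of the formalisation: waiting queues are plain lists, the waiting queue function is a dictionary, and the arbitrary choice made by @{text "SOME"} is imitated by a shuffle (Hilbert's epsilon is not random; the shuffle merely produces one admissible list). All function names are hypothetical.

```python
import random


def release_simple(queue):
    """Drop the head (the thread that held the lock); keep the others in order."""
    return queue[1:]


def release_any(queue, rng=random):
    """Drop the head and return the remaining threads in some arbitrary order.

    Any duplicate-free list with the same elements as the tail is acceptable,
    which is all the second definition of release demands.
    """
    rest = list(dict.fromkeys(queue[1:]))   # same elements, duplicates removed
    rng.shuffle(rest)                       # one possible choice among many
    return rest


def unlock_update(wq, cs, release=release_simple):
    """Waiting-queue map after the current holder of cs has released it."""
    new_wq = dict(wq)
    new_wq[cs] = release(wq.get(cs, []))
    return new_wq
```

Whichever thread ends up at the head of the returned list becomes the new holder of the resource.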

Having the scheduler function @{term schs} at our disposal, we can ``lift'', or
overload, the notions
@{term waiting}, @{term holding}, @{term depend} and @{term cp} to operate on states only.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}rcl}
@{thm (lhs) s_holding_abv} & @{text "\<equiv>"} & @{thm (rhs) s_holding_abv}\\
@{thm (lhs) s_waiting_abv} & @{text "\<equiv>"} & @{thm (rhs) s_waiting_abv}\\
@{thm (lhs) s_depend_abv} & @{text "\<equiv>"} & @{thm (rhs) s_depend_abv}\\
@{thm (lhs) cp_def} & @{text "\<equiv>"} & @{thm (rhs) cp_def}
\end{tabular}
\end{isabelle}

\noindent
With these abbreviations we can introduce
the notion of threads being @{term readys} in a state (i.e.~threads
that do not wait for any resource) and the running thread.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm readys_def}\\
@{thm runing_def}\\
\end{tabular}
\end{isabelle}

\noindent
In this definition @{term "DUMMY ` DUMMY"} stands for the image of a set under a function.
Note that in the initial state, that is where the list of events is empty, the set
@{term threads} is empty and therefore there is neither a thread ready nor running.
If there are one or more threads ready, then there can only be \emph{one} thread
running, namely the one whose current precedence is equal to the maximum over all ready
threads. We use sets to capture both possibilities.
We can now also conveniently define the set of resources that are locked by a thread in a
given state.

\begin{isabelle}\ \ \ \ \ %%%
@{thm holdents_def}
\end{isabelle}
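
To make the three notions concrete, here is a hedged sketch in Python. It assumes that a thread holds a resource exactly when it is at the head of the corresponding waiting list and waits for it when it occurs further back, that the waiting queue function is a dictionary, and that current precedences are supplied as a dictionary of numbers (in the model they are pairs computed by @{term cpreced}). The function names are hypothetical.

```python
def holds(wq, th, cs):
    """th holds cs if it is at the head of the waiting list of cs."""
    queue = wq.get(cs, [])
    return bool(queue) and queue[0] == th


def waits(wq, th, cs):
    """th waits for cs if it is in the waiting list of cs, but not at its head."""
    queue = wq.get(cs, [])
    return th in queue and queue[0] != th


def ready(wq, threads):
    """Threads that do not wait for any resource."""
    return {th for th in threads if not any(waits(wq, th, cs) for cs in wq)}


def running(wq, threads, cp):
    """Ready threads whose current precedence is maximal among the ready threads."""
    candidates = ready(wq, threads)
    if not candidates:
        return set()        # e.g. the initial state: nothing is ready, nothing runs
    top = max(cp[th] for th in candidates)
    return {th for th in candidates if cp[th] == top}


def resources(wq, th):
    """Resources currently locked by th."""
    return {cs for cs in wq if holds(wq, th, cs)}
```

For instance, with threads @{text "a"}, @{text "b"} and @{text "c"}, the waiting list @{text "[a, b]"} for a single resource and current precedences 3, 3 and 1, the ready set consists of @{text "a"} and @{text "c"}, and only @{text "a"} is running.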

Finally we can define what a \emph{valid state} is in our model of PIP. For
example we cannot expect to be able to exit a thread, if it has not been
created yet. These validity constraints on states are characterised by the
inductive predicates @{term "step"} and @{term vt}. We first give five inference rules
for @{term step} relating a state and an event that can happen next.

\begin{center}
\begin{tabular}{c}
@{thm[mode=Rule] thread_create[where thread=th]}\hspace{1cm}
@{thm[mode=Rule] thread_exit[where thread=th]}
\end{tabular}
\end{center}

\noindent
The first rule states that a thread can only be created, if it does not yet exist.
Similarly, the second rule states that a thread can only be terminated if it was
running and does not lock any resources anymore (this slightly simplifies our model;
in practice we would expect the operating system to release all locks held by a
thread that is about to exit). The event @{text Set} can happen
if the corresponding thread is running.

\begin{center}
@{thm[mode=Rule] thread_set[where thread=th]}
\end{center}

\noindent
If a thread wants to lock a resource, then the thread needs to be
running and also we have to make sure that the resource lock does
not lead to a cycle in the RAG. In practice, ensuring the latter is
the responsibility of the programmer. In our formal
model we brush aside these problematic cases in order to be able to make
some meaningful statements about PIP.\footnote{This situation is
similar to the infamous occurs check in Prolog: in order to say
anything meaningful about unification, one needs to perform an occurs
check. But in practice the occurs check is omitted and the
responsibility for avoiding problems rests with the programmer.}

\begin{center}
@{thm[mode=Rule] thread_P[where thread=th]}
\end{center}
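
The cycle condition can be illustrated by a small reachability test. The sketch below is in Python and makes assumptions beyond the text above: the RAG has an edge from a waiting thread to the resource it waits for and from a resource to the thread holding it, nodes are tagged tuples, and a lock request is rejected when the requesting thread is already reachable from the requested resource. The function names are hypothetical.

```python
def rag_edges(wq):
    """Edges of the RAG derived from a waiting-queue dictionary."""
    edges = set()
    for cs, queue in wq.items():
        if queue:
            edges.add((("C", cs), ("T", queue[0])))    # resource -> holding thread
            for th in queue[1:]:
                edges.add((("T", th), ("C", cs)))      # waiting thread -> resource
    return edges


def reachable(edges, src, dst):
    """True if dst can be reached from src along one or more edges."""
    seen, todo = set(), [src]
    while todo:
        node = todo.pop()
        for a, b in edges:
            if a == node and b not in seen:
                if b == dst:
                    return True
                seen.add(b)
                todo.append(b)
    return False


def may_lock(wq, th, cs):
    """A request of cs by th is admissible only if it cannot close a cycle."""
    return not reachable(rag_edges(wq), ("C", cs), ("T", th))
```

With @{text "a"} holding @{text "cs1"}, and @{text "b"} waiting for @{text "cs1"} while holding @{text "cs2"}, a request of @{text "cs2"} by @{text "a"} would close a cycle and is therefore excluded, whereas the same request by a fresh thread is admissible.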

\noindent
Similarly, if a thread wants to release a lock on a resource, then
it must be running and in the possession of that lock. This is
formally given by the last inference rule of @{term step}.

\begin{center}
@{thm[mode=Rule] thread_V[where thread=th]}
\end{center}

\noindent
A valid state of PIP can then conveniently be defined as follows:

\begin{center}
\begin{tabular}{c}
@{thm[mode=Axiom] vt_nil}\hspace{1cm}
@{thm[mode=Rule] vt_cons}
\end{tabular}
\end{center}

\noindent
This completes our formal model of PIP. In the next section we present
properties that show our model of PIP is correct.
*}

section {* The Correctness Proof *}

(*<*)
context extend_highest_gen
begin
print_locale extend_highest_gen
thm extend_highest_gen_def
thm extend_highest_gen_axioms_def
thm highest_gen_def
(*>*)
text {*
Sha et al.~\cite[Theorem 6]{Sha90} state their correctness criterion
for PIP in terms of the number of critical resources: if there are
@{text m} critical resources, then a blocked job with high priority
can only be blocked @{text m} times---that is a \emph{bounded} number of
times. This result on its own, strictly speaking, does \emph{not} prevent Priority
Inversion, because if one low-priority thread does not give up its
critical resource (which the high-priority thread is waiting for), then the
high-priority thread can never run. The argument of Sha et al.~is
that \emph{if} threads release locked resources in a finite amount
of time, then Priority Inversion cannot occur---the high-priority
thread is guaranteed to run eventually. The assumption is that
programmers always ensure that this is the case. However, even
taking this assumption into account, their correctness property is \emph{not}
true for their version of PIP. As Yodaiken
\cite{Yodaiken02} pointed out: if a low-priority thread possesses locks to two
resources for which two high-priority threads are waiting, then
lowering the priority prematurely after giving up only one lock can
cause Priority Inversion for one of the high-priority threads, invalidating
their bound.

Even when fixed, their proof idea does not seem to go through for
us, because of the way we have set up our formal model of PIP. The
reason is that we allow critical sections to intersect
(something Sha et al.~explicitly exclude). Therefore we have a
different correctness criterion for PIP. The idea behind our
criterion is as follows: for all states @{text s},
we know the corresponding thread @{text th} with the highest
precedence; we show that in every future state (denoted by
@{text "s' @ s"}) in which @{text th} is still alive, either @{text th} is
running or it is blocked by a thread that was alive in the state
@{text s}. Since in @{text s}, as in every state, the set of alive
threads is finite, @{text th} can only be blocked a finite number of
times. We will actually prove a stricter bound below. However, this
correctness criterion hinges upon a number of assumptions about the
states @{text s} and @{text "s' @ s"}, the thread @{text th} and the
events happening in @{text s'}. We list them next:

\begin{quote}
{\bf Assumptions on the states @{text s} and @{text "s' @ s"}:} In order to make
any meaningful statement, we need to require that @{text "s"} and
@{text "s' @ s"} are valid states, namely
\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{l}
@{term "vt s"}\\
@{term "vt (s' @ s)"}
\end{tabular}
\end{isabelle}
\end{quote}

\begin{quote}
{\bf Assumptions on the thread @{text "th"}:} The thread @{text th} must be alive in @{text s} and
have the highest precedence of all alive threads in @{text s}. Furthermore the
priority of @{text th} is @{text prio} (we need this in the next assumptions).
\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{l}
@{term "th \<in> threads s"}\\
@{term "prec th s = Max (cprec s ` threads s)"}\\
@{term "prec th s = (prio, DUMMY)"}
\end{tabular}
\end{isabelle}
\end{quote}

\begin{quote}
{\bf Assumptions on the events in @{text "s'"}:} We want to prove that @{text th} cannot
be blocked indefinitely. Of course this can happen if threads with higher priority
than @{text th} are continuously created in @{text s'}. Therefore we have to assume that
events in @{text s'} can only create (respectively set) threads with equal or lower
priority than the priority @{text prio} of @{text th}. We also need to assume that the
priority of @{text "th"} does not get reset and also that @{text th} does
not get ``exited'' in @{text "s'"}. This can be ensured by assuming the following three implications.
\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{l}
{If}~~@{text "Create th' prio' \<in> set s'"}~~{then}~~@{text "prio' \<le> prio"}\\
{If}~~@{text "Set th' prio' \<in> set s'"}~~{then}~~@{text "th' \<noteq> th"}~~{and}~~@{text "prio' \<le> prio"}\\
{If}~~@{text "Exit th' \<in> set s'"}~~{then}~~@{text "th' \<noteq> th"}\\
\end{tabular}
\end{isabelle}
\end{quote}
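
The three implications can be read as a simple check over the events in @{text "s'"}. The following Python sketch is only an illustration: it assumes that events are encoded as tuples whose first component names the event, and that priorities are numbers; the function name is hypothetical.

```python
def respects_assumptions(s_ext, th, prio):
    """Check the three implications for every event in the extension s'.

    Events are assumed to be tuples such as ("Create", th', prio'),
    ("Set", th', prio'), ("Exit", th'), ("P", th', cs) or ("V", th', cs).
    """
    for event in s_ext:
        kind = event[0]
        if kind == "Create" and not event[2] <= prio:
            return False                 # a thread with higher priority is created
        if kind == "Set" and not (event[1] != th and event[2] <= prio):
            return False                 # th is reset, or a priority exceeds prio
        if kind == "Exit" and event[1] == th:
            return False                 # th itself exits
    return True
```

Locking and unlocking events are not restricted by these assumptions, so the check ignores them.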

\noindent
Under these assumptions we will prove the following correctness property:

\begin{theorem}\label{mainthm}
Given the assumptions about states @{text "s"} and @{text "s' @ s"},
the thread @{text th} and the events in @{text "s'"},
if @{term "th' \<in> running (s' @ s)"} and @{text "th' \<noteq> th"} then
@{text "th' \<in> threads s"}.
\end{theorem}

\noindent
This theorem ensures that the thread @{text th}, which has the highest
precedence in the state @{text s}, can only be blocked in the state @{text "s' @ s"}
by a thread @{text th'} that already existed in @{text s}. As we shall see shortly,
that means by only finitely many threads. Consequently, an indefinite wait of
@{text th}---which would be Priority Inversion---cannot occur.

In what follows we will describe properties of PIP that allow us to prove
Theorem~\ref{mainthm}. It is relatively easy to see that

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{text "running s \<subseteq> ready s \<subseteq> threads s"}\\
@{thm[mode=IfThen] finite_threads}
\end{tabular}
\end{isabelle}

\noindent
where the second property is by induction on @{term vt}. The next three
properties are

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm[mode=IfThen] waiting_unique[of _ _ "cs\<^isub>1" "cs\<^isub>2"]}\\
@{thm[mode=IfThen] held_unique[of _ "th\<^isub>1" _ "th\<^isub>2"]}\\
@{thm[mode=IfThen] runing_unique[of _ "th\<^isub>1" "th\<^isub>2"]}
\end{tabular}
\end{isabelle}

\noindent
The first one states that every waiting thread can only wait for a single
resource (because it gets suspended after requesting that resource and having
to wait for it); the second that every resource can only be held by a single thread;
the third property establishes that in every given valid state, there is
at most one running thread. We can also show the following properties
about the RAG in @{text "s"}.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{text If}~@{thm (prem 1) acyclic_depend}~@{text "then"}:\\
\hspace{5mm}@{thm (concl) acyclic_depend},
@{thm (concl) finite_depend} and
@{thm (concl) wf_dep_converse},\\
\hspace{5mm}@{text "if"}~@{thm (prem 2) dm_depend_threads}~@{text "then"}~@{thm (concl) dm_depend_threads}\\
\hspace{5mm}@{text "if"}~@{thm (prem 2) range_in}~@{text "then"}~@{thm (concl) range_in}
\end{tabular}
\end{isabelle}

TODO

\noindent
The following lemmas show how RAG is changed with the execution of events:
\begin{enumerate}
\item Execution of @{term "Set"} does not change RAG (@{text "depend_set_unchanged"}):
@{thm[display] depend_set_unchanged}
\item Execution of @{term "Create"} does not change RAG (@{text "depend_create_unchanged"}):
@{thm[display] depend_create_unchanged}
\item Execution of @{term "Exit"} does not change RAG (@{text "depend_exit_unchanged"}):
@{thm[display] depend_exit_unchanged}
\item Execution of @{term "P"} (@{text "step_depend_p"}):
@{thm[display] step_depend_p}
\item Execution of @{term "V"} (@{text "step_depend_v"}):
@{thm[display] step_depend_v}
\end{enumerate}
*}

text {* \noindent
These properties are used to derive the following important results about RAG:
\begin{enumerate}
\item RAG is loop free (@{text "acyclic_depend"}):
@{thm [display] acyclic_depend}
\item RAGs are finite (@{text "finite_depend"}):
@{thm [display] finite_depend}
\item Reverse paths in RAG are well founded (@{text "wf_dep_converse"}):
@{thm [display] wf_dep_converse}
\item The dependence relation represented by RAG has a tree structure (@{text "unique_depend"}):
@{thm [display] unique_depend[of _ _ "n\<^isub>1" "n\<^isub>2"]}
\item All threads in RAG are living threads
(@{text "dm_depend_threads"} and @{text "range_in"}):
@{thm [display] dm_depend_threads range_in}
\end{enumerate}
*}

text {* \noindent
The following lemmas show how every node in RAG can be chased to ready threads:
\begin{enumerate}
\item Every node in RAG can be chased to a ready thread (@{text "chain_building"}):
@{thm [display] chain_building[rule_format]}
\item The ready thread chased to is unique (@{text "dchain_unique"}):
@{thm [display] dchain_unique[of _ _ "th\<^isub>1" "th\<^isub>2"]}
\end{enumerate}
*}

text {* \noindent
Properties about @{term "next_th"}:
\begin{enumerate}
\item The thread taking over is different from the thread which is releasing
(@{text "next_th_neq"}):
@{thm [display] next_th_neq}
\item The thread taking over is unique
(@{text "next_th_unique"}):
@{thm [display] next_th_unique[of _ _ _ "th\<^isub>1" "th\<^isub>2"]}
\end{enumerate}
*}

text {* \noindent
Some deeper results about the system:
\begin{enumerate}
\item The maximum of @{term "cp"} and @{term "preced"} are equal (@{text "max_cp_eq"}):
@{thm [display] max_cp_eq}
\item There must be one ready thread having the max @{term "cp"}-value
(@{text "max_cp_readys_threads"}):
@{thm [display] max_cp_readys_threads}
\end{enumerate}
*}

text {* \noindent
The relationship between the count of @{text "P"} and @{text "V"} and the number of
critical resources held by a thread is given as follows:
\begin{enumerate}
\item The @{term "V"}-operation decreases the number of critical resources
one thread holds (@{text "cntCS_v_dec"}):
@{thm [display] cntCS_v_dec}
\item The number of @{text "V"} never exceeds the number of @{text "P"}
(@{text "cnp_cnv_cncs"}):
@{thm [display] cnp_cnv_cncs}
\item The number of @{text "V"} equals the number of @{text "P"} when
the relevant thread is not living
(@{text "cnp_cnv_eq"}):
@{thm [display] cnp_cnv_eq}
\item When a thread is not living, it does not hold any critical resource
(@{text "not_thread_holdents"}):
@{thm [display] not_thread_holdents}
\item When the number of @{text "P"} equals the number of @{text "V"}, the relevant
thread does not hold any critical resource, therefore no thread can depend on it
(@{text "count_eq_dependents"}):
@{thm [display] count_eq_dependents}
\end{enumerate}
*}

(*<*)
end
(*>*)

subsection {* Proof idea *}

(*<*)
context extend_highest_gen
begin
print_locale extend_highest_gen
thm extend_highest_gen_def
thm extend_highest_gen_axioms_def
thm highest_gen_def
(*>*)

text {*
The reason that only threads which already held some resources
can be running and block @{text "th"} is that a thread which
does not hold any resource may never have its priority raised
and so will not get a chance to run. This fact is supported by
lemma @{text "moment_blocked"}:
@{thm [display] moment_blocked}
When instantiating @{text "i"} to @{text "0"}, the lemma means that threads which did not hold any
resource in state @{text "s"} will not have a chance to run later. Rephrased, it means that
any thread which is running after @{text "th"} became the highest must have already held
some resource in state @{text "s"}.


When instantiating @{text "i"} to a number larger than @{text "0"}, the lemma means that
if a thread releases all its resources at some moment in @{text "t"}, then after that
it may never get a chance to run. If every thread releases its resources in finite duration,
then after a while only thread @{text "th"} is left running. This shows how indefinite
priority inversion can be avoided.

So the key to the proof is to establish the correctness of @{text "moment_blocked"}.
We are going to show how this lemma is proved. At the heart of this proof is
lemma @{text "pv_blocked"}:
@{thm [display] pv_blocked}
This lemma says: for any @{text "s"}-extension @{text "t"}, if thread @{text "th'"}
does not hold any resource, it cannot be running at @{text "t@s"}.
870 |
||
871 |
||
872 |
\noindent Proof: |
|
873 |
\begin{enumerate} |
|
874 |
  \item Since thread @{text "th'"} does not hold any resource, no thread may depend on it,
        so its current precedence @{text "cp (t@s) th'"} equals its own precedence
        @{text "preced th' (t@s)"}. \label{arg_1}
  \item Since @{text "th"} has the highest precedence in the system and
        precedences are distinct among threads, we have
        @{text "preced th' (t@s) < preced th (t@s)"}. From this and item \ref{arg_1},
        we have @{text "cp (t@s) th' < preced th (t@s)"}.
  \item Since @{text "preced th (t@s)"} is already the highest in the system,
        @{text "cp (t@s) th"} cannot be higher than this, and by
        the definition of @{text "cp"} it cannot be lower either. Hence @{text "preced th (t@s) = cp (t@s) th"}.
  \item Finally we have @{text "cp (t@s) th' < cp (t@s) th"}.
  \item By the definition of @{text "running"}, @{text "th'"} cannot be running at
        @{text "t@s"}.
|
887 |
\end{enumerate} |
|
888 |
  Since @{text "th'"} is not able to run in state @{text "t@s"}, it cannot
  perform a @{text "P"} or @{text "V"} action, so if @{text "t@s"} is extended
  one step further, @{text "th'"} still does not hold any resource.
  The situation will not change in further extensions as long as
  @{text "th"} holds the highest precedence. The extension @{text "t"} is arbitrarily chosen,
  constrained only by the predicate @{text "extend_highest_gen"}, and
  this predicate has the property that if it holds for @{text "t"}, it also holds
  for any moment @{text "i"} inside @{text "t"}, as shown by lemma @{text "red_moment"}:
  @{thm [display] "extend_highest_gen.red_moment"}
  Hence @{text "pv_blocked"} can be applied to any @{text "moment i t"}.
  From this, lemma @{text "moment_blocked"} follows.
|
899 |
*} |
|
900 |
||
901 |
(*<*) |
|
902 |
end |
|
903 |
(*>*) |
|
904 |
||
905 |
||
314 | 906 |
section {* Properties for an Implementation\label{implement} *} |
311 | 907 |
|
908 |
text {* |
|
  While a formal correctness proof for our model of PIP is certainly
  attractive (especially in light of the flawed proof by Sha et
  al.~\cite{Sha90}), we found that the formalisation can even help us
  with efficiently implementing PIP.

  For example, Baker complained that calculating the current precedence
  in PIP is quite ``heavy weight'' in Linux (see the Introduction).
  In our model of PIP the current precedence of a thread in a state @{text s}
  depends on all its dependants, a ``global'' transitive notion
  which is indeed heavy weight (see the definition shown in \eqref{cpreced}).
  We can however improve upon this. For this let us define the notion
  of @{term children} of a thread @{text th} in a state @{text s} as
|
312 | 921 |
|
922 |
\begin{isabelle}\ \ \ \ \ %%% |
|
923 |
\begin{tabular}{@ {}l} |
|
924 |
@{thm children_def2} |
|
925 |
\end{tabular} |
|
926 |
\end{isabelle} |
|
927 |
||
928 |
\noindent |
|
  where a child is a thread that is one ``hop'' away from the thread
  @{text th} in the @{term RAG} (and waiting for @{text th} to release
  a resource). We can prove that
|
311 | 932 |
|
312 | 933 |
\begin{lemma}\label{childrenlem} |
934 |
@{text "If"} @{thm (prem 1) cp_rec} @{text "then"} |
|
935 |
\begin{center} |
|
936 |
@{thm (concl) cp_rec}. |
|
937 |
\end{center} |
|
938 |
\end{lemma} |
|
311 | 939 |
|
312 | 940 |
\noindent |
941 |
That means the current precedence of a thread @{text th} can be |
|
942 |
computed locally by considering only the children of @{text th}. In |
|
943 |
effect, it only needs to be recomputed for @{text th} when one of |
|
321 | 944 |
its children changes its current precedence. Once the current |
312 | 945 |
precedence is computed in this more efficient manner, the selection |
946 |
of the thread with highest precedence from a set of ready threads is |
|
947 |
a standard scheduling operation implemented in most operating |
|
948 |
systems. |
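
  \noindent
  To illustrate the local computation, the following is a minimal Python sketch; it is
  not part of our formalisation. It assumes that Lemma~\ref{childrenlem} is read as: the
  current precedence of a thread is the maximum of its own precedence and the current
  precedences of its children. Precedences are simplified to plain integers, and the names
  @{text "current_precedence"}, @{text "preced"} and @{text "children"} are hypothetical.

```python
# Sketch only: precedences are plain integers here (an assumption; the
# model uses pairs of a priority and a time stamp).

def current_precedence(th, preced, children):
    """Maximum of th's own precedence and its children's current precedences."""
    # The recursion terminates only if the child relation is loop free.
    child_cps = [current_precedence(c, preced, children)
                 for c in children.get(th, ())]
    return max([preced[th]] + child_cps)

# Hypothetical RAG: B and D wait for resources held by A, C waits for B.
preced = {"A": 1, "B": 2, "C": 5, "D": 3}
children = {"A": ["B", "D"], "B": ["C"]}
cp = {th: current_precedence(th, preced, children) for th in preced}
print(cp)
```

  An implementation would presumably cache the computed values instead of recursing on
  every query, so that only the children of a thread are inspected when it is updated.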
|
311 | 949 |
|
321 | 950 |
Of course the main implementation work for PIP involves the |
951 |
scheduler and coding how it should react to events. Below we |
|
952 |
outline how our formalisation guides this implementation for each |
|
953 |
kind of event.\smallskip |
|
312 | 954 |
*} |
311 | 955 |
|
956 |
(*<*) |
|
312 | 957 |
context step_create_cps |
958 |
begin |
|
959 |
(*>*) |
|
960 |
text {* |
|
961 |
\noindent |
|
321 | 962 |
\colorbox{mygrey}{@{term "Create th prio"}:} We assume that the current state @{text s'} and |
312 | 963 |
the next state @{term "s \<equiv> Create th prio#s'"} are both valid (meaning the event |
964 |
is allowed to occur). In this situation we can show that |
|
965 |
||
966 |
\begin{isabelle}\ \ \ \ \ %%% |
|
967 |
\begin{tabular}{@ {}l} |
|
321 | 968 |
@{thm eq_dep},\\ |
969 |
@{thm eq_cp_th}, and\\ |
|
312 | 970 |
@{thm[mode=IfThen] eq_cp} |
971 |
\end{tabular} |
|
972 |
\end{isabelle} |
|
973 |
||
974 |
\noindent |
|
975 |
  This means we do not have to recalculate the @{text RAG}, nor any of the
  current precedences of the other threads. The current precedence of the created
  thread @{text th} is just its precedence, namely the pair @{term "(prio, length (s::event list))"}.
312 | 978 |
\smallskip |
979 |
*} |
|
980 |
(*<*) |
|
981 |
end |
|
982 |
context step_exit_cps |
|
983 |
begin |
|
984 |
(*>*) |
|
985 |
text {* |
|
986 |
\noindent |
|
321 | 987 |
\colorbox{mygrey}{@{term "Exit th"}:} We again assume that the current state @{text s'} and |
312 | 988 |
the next state @{term "s \<equiv> Exit th#s'"} are both valid. We can show that |
989 |
||
990 |
\begin{isabelle}\ \ \ \ \ %%% |
|
991 |
\begin{tabular}{@ {}l} |
|
321 | 992 |
@{thm eq_dep}, and\\ |
312 | 993 |
@{thm[mode=IfThen] eq_cp} |
994 |
\end{tabular} |
|
995 |
\end{isabelle} |
|
996 |
||
997 |
\noindent |
|
  This means again we do not have to recalculate the @{text RAG},
  nor the current precedences of the other threads. Since @{term th} is no longer
  alive in state @{term "s"}, there is no need to calculate its
  current precedence.
|
1002 |
\smallskip |
|
1003 |
*} |
|
1004 |
(*<*) |
|
1005 |
end |
|
311 | 1006 |
context step_set_cps |
1007 |
begin |
|
1008 |
(*>*) |
|
312 | 1009 |
text {* |
1010 |
\noindent |
|
321 | 1011 |
\colorbox{mygrey}{@{term "Set th prio"}:} We assume that @{text s'} and |
312 | 1012 |
@{term "s \<equiv> Set th prio#s'"} are both valid. We can show that |
311 | 1013 |
|
312 | 1014 |
\begin{isabelle}\ \ \ \ \ %%% |
1015 |
\begin{tabular}{@ {}l} |
|
321 | 1016 |
@{thm[mode=IfThen] eq_dep}, and\\ |
312 | 1017 |
@{thm[mode=IfThen] eq_cp} |
1018 |
\end{tabular} |
|
1019 |
\end{isabelle} |
|
311 | 1020 |
|
312 | 1021 |
\noindent |
  The first property again tells us that we do not need to change the @{text RAG}. The second,
  however, states that only threads that are \emph{not} dependants of @{text th} have their
  current precedence unchanged. For the others we have to recalculate the current
  precedence. To do this we can start from @{term "th"}
  and follow the @{term "depend"}-chains to recompute the @{term "cp"} of every
  thread encountered on the way using Lemma~\ref{childrenlem}. Since the @{term "depend"} relation
  is loop free, this procedure will always stop. The following two lemmas show, however,
  that this procedure can often stop earlier without having to consider all
  dependants.
|
312 | 1031 |
|
1032 |
\begin{isabelle}\ \ \ \ \ %%% |
|
1033 |
\begin{tabular}{@ {}l} |
|
1034 |
@{thm[mode=IfThen] eq_up_self}\\ |
|
1035 |
@{text "If"} @{thm (prem 1) eq_up}, @{thm (prem 2) eq_up} and @{thm (prem 3) eq_up}\\ |
|
1036 |
@{text "then"} @{thm (concl) eq_up}. |
|
1037 |
\end{tabular} |
|
1038 |
\end{isabelle} |
|
1039 |
||
1040 |
\noindent |
|
1041 |
The first states that if the current precedence of @{text th} is unchanged, |
|
1042 |
then the procedure can stop immediately (all dependent threads have their @{term cp}-value unchanged). |
|
1043 |
The second states that if an intermediate @{term cp}-value does not change, then |
|
1044 |
the procedure can also stop, because none of its dependent threads will |
|
1045 |
have their current precedence changed. |
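
  \noindent
  The recalculation with early stopping can be sketched in Python as follows. This is
  an illustration under simplifying assumptions, not code derived from our lemmas:
  precedences are plain integers, every thread waits for at most one resource, and every
  resource has at most one holder, so the chain starting at @{text th} is a simple path.
  The names @{text "recompute_after_set"} and @{text "waits_for"} are hypothetical.

```python
# Sketch only: integer precedences and a chain-shaped RAG are assumptions.

def recompute_after_set(th, preced, children, waits_for, cp):
    """Update cp in place after preced[th] changed; return the threads visited."""
    visited = []
    current = th
    while current is not None:  # terminates because the RAG is loop free
        visited.append(current)
        new_cp = max([preced[current]] +
                     [cp[c] for c in children.get(current, ())])
        if new_cp == cp[current]:
            break  # early stop: no thread further along the chain changes
        cp[current] = new_cp
        # move on to the thread holding the resource `current` waits for
        current = waits_for.get(current)
    return visited

# Hypothetical chain: C waits for B, B waits for A.
children = {"A": ["B"], "B": ["C"]}
waits_for = {"C": "B", "B": "A"}
preced = {"A": 1, "B": 2, "C": 5}
cp = {"A": 5, "B": 5, "C": 5}

preced["B"] = 4  # priority of B is set; its cp stays 5, so we stop at once
first = recompute_after_set("B", preced, children, waits_for, cp)

preced["C"] = 3  # priority of C is set; the change travels up to A
second = recompute_after_set("C", preced, children, waits_for, cp)
print(first, second, cp)
```

  In the first call only @{text B} is inspected; in the second call the whole chain is
  visited because every intermediate value changes.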
|
1046 |
\smallskip |
|
311 | 1047 |
*} |
1048 |
(*<*) |
|
1049 |
end |
|
1050 |
context step_v_cps_nt |
|
1051 |
begin |
|
1052 |
(*>*) |
|
1053 |
text {* |
|
312 | 1054 |
\noindent |
321 | 1055 |
\colorbox{mygrey}{@{term "V th cs"}:} We assume that @{text s'} and |
312 | 1056 |
@{term "s \<equiv> V th cs#s'"} are both valid. We have to consider two |
1057 |
subcases: one where there is a thread to ``take over'' the released |
|
321 | 1058 |
resource @{text cs}, and one where there is not. Let us consider them |
312 | 1059 |
in turn. Suppose in state @{text s}, the thread @{text th'} takes over |
1060 |
resource @{text cs} from thread @{text th}. We can show |
|
311 | 1061 |
|
1062 |
||
312 | 1063 |
\begin{isabelle}\ \ \ \ \ %%% |
1064 |
@{thm depend_s} |
|
1065 |
\end{isabelle} |
|
1066 |
||
1067 |
\noindent |
|
1068 |
  which shows how the @{text RAG} needs to be changed. This also suggests
  how the current precedences need to be recalculated. For threads other than
  @{text "th"} and @{text "th'"} nothing needs to be changed, since we
  can show
|
1072 |
||
1073 |
\begin{isabelle}\ \ \ \ \ %%% |
|
1074 |
@{thm[mode=IfThen] cp_kept} |
|
1075 |
\end{isabelle} |
|
1076 |
||
1077 |
\noindent |
|
1078 |
  For @{text th} and @{text th'} we need to use Lemma~\ref{childrenlem} to
  recalculate their current precedence since their children have changed. *}(*<*)end context step_v_cps_nnt begin (*>*)text {*
|
1080 |
\noindent |
|
1081 |
In the other case where there is no thread that takes over @{text cs}, we can show how |
|
1082 |
to recalculate the @{text RAG} and also show that no current precedence needs |
|
321 | 1083 |
to be recalculated. |
312 | 1084 |
|
1085 |
\begin{isabelle}\ \ \ \ \ %%% |
|
1086 |
\begin{tabular}{@ {}l} |
|
1087 |
@{thm depend_s}\\ |
|
1088 |
@{thm eq_cp} |
|
1089 |
\end{tabular} |
|
1090 |
\end{isabelle} |
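
  \noindent
  As an illustration, both subcases of the @{text V} event can be sketched in Python.
  The sketch assumes that releasing @{text cs} removes the holding edge of @{text th} and,
  if a thread @{text th'} takes over, replaces the waiting edge of @{text th'} by a holding
  edge. The encoding of nodes as tagged pairs and the name @{text "release"} are
  hypothetical and not taken from our theories.

```python
# Sketch only: nodes are tagged pairs, ("T", name) for threads and
# ("C", name) for resources; this encoding is an assumption.

def release(rag, th, cs, taker=None):
    """Return the RAG after th releases cs; taker is the thread taking over."""
    new_rag = set(rag) - {(("C", cs), ("T", th))}   # drop th's holding edge
    if taker is not None:
        new_rag -= {(("T", taker), ("C", cs))}      # taker no longer waits
        new_rag |= {(("C", cs), ("T", taker))}      # taker now holds cs
    return new_rag

# A holds r and B waits for it; after the release B holds r.
rag = {(("C", "r"), ("T", "A")), (("T", "B"), ("C", "r"))}
with_takeover = release(rag, "A", "r", taker="B")

# Nobody waits for r; after the release the RAG is empty.
without_takeover = release({(("C", "r"), ("T", "A"))}, "A", "r")
print(with_takeover, without_takeover)
```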
|
311 | 1091 |
*} |
1092 |
(*<*) |
|
1093 |
end |
|
1094 |
context step_P_cps_e |
|
1095 |
begin |
|
1096 |
(*>*) |
|
1097 |
||
1098 |
text {* |
|
312 | 1099 |
\noindent |
321 | 1100 |
\colorbox{mygrey}{@{term "P th cs"}:} We assume that @{text s'} and |
  @{term "s \<equiv> P th cs#s'"} are both valid. We again have to analyse two subcases, namely
  the one where @{text cs} is locked, and the one where it is not. We treat the latter case
  first by showing that
|
1104 |
||
1105 |
\begin{isabelle}\ \ \ \ \ %%% |
|
1106 |
\begin{tabular}{@ {}l} |
|
1107 |
@{thm depend_s}\\ |
|
1108 |
@{thm eq_cp} |
|
1109 |
\end{tabular} |
|
1110 |
\end{isabelle} |
|
311 | 1111 |
|
312 | 1112 |
\noindent |
1113 |
  This means we need to add a holding edge to the @{text RAG} and no
  current precedence needs to be recalculated.*}(*<*)end context step_P_cps_ne begin(*>*) text {*
312 | 1115 |
\noindent |
1116 |
  In the remaining case we know that resource @{text cs} is locked. We can show that
|
1117 |
||
1118 |
\begin{isabelle}\ \ \ \ \ %%% |
|
1119 |
\begin{tabular}{@ {}l} |
|
1120 |
@{thm depend_s}\\ |
|
1121 |
@{thm[mode=IfThen] eq_cp} |
|
1122 |
\end{tabular} |
|
1123 |
\end{isabelle} |
|
311 | 1124 |
|
312 | 1125 |
\noindent |
1126 |
  That means we have to add a waiting edge to the @{text RAG}. Furthermore
  the current precedences of all threads that are not dependants of @{text th}
  are unchanged. For the others we need to follow the edges
  in the @{text RAG} and recompute the @{term "cp"}. However, as in the
  case of @{text Set}, this operation can often stop earlier, namely when intermediate
  values do not change.
311 | 1132 |
*} |
1133 |
(*<*) |
|
1134 |
end |
|
1135 |
(*>*) |
|
1136 |
text {* |
|
312 | 1137 |
\noindent |
  A pleasing result of our formalisation is that the properties in
  this section closely inform an implementation of PIP: whether the
  @{text RAG} needs to be reconfigured or current precedences need to
  be recalculated for an event is given by a lemma we proved.
|
311 | 1142 |
*} |
1143 |
||
section {* Conclusion *}
|
300 | 1146 |
text {* |
314 | 1147 |
The Priority Inheritance Protocol (PIP) is a classic textbook |
315 | 1148 |
algorithm used in real-time operating systems in order to avoid the problem of |
1149 |
Priority Inversion. Although classic and widely used, PIP does have |
|
317 | 1150 |
its faults: for example it does not prevent deadlocks in cases where threads |
315 | 1151 |
have circular lock dependencies. |
300 | 1152 |
|
317 | 1153 |
We had two goals in mind with our formalisation of PIP: One is to |
315 | 1154 |
make the notions in the correctness proof by Sha et al.~\cite{Sha90} |
317 | 1155 |
precise so that they can be processed by a theorem prover. The reason is |
1156 |
that a mechanically checked proof avoids the flaws that crept into their |
|
1157 |
informal reasoning. We achieved this goal: The correctness of PIP now |
|
315 | 1158 |
only hinges on the assumptions behind our formal model. The reasoning, which is |
314 | 1159 |
sometimes quite intricate and tedious, has been checked beyond any |
315 | 1160 |
reasonable doubt by Isabelle/HOL. We can also confirm that Paulson's |
321 | 1161 |
inductive method for protocol verification~\cite{Paulson98} is quite |
315 | 1162 |
suitable for our formal model and proof. The traditional application |
1163 |
area of this method is security protocols. The only other |
|
1164 |
application of Paulson's method we know of outside this area is |
|
1165 |
\cite{Wang09}. |
|
301 | 1166 |
|
317 | 1167 |
The second goal of our formalisation is to provide a specification for actually |
1168 |
implementing PIP. Textbooks, for example \cite[Section 5.6.5]{Vahalia96}, |
|
315 | 1169 |
explain how to use various implementations of PIP and abstractly |
317 | 1170 |
discuss their properties, but surprisingly lack most details for a |
1171 |
programmer who wants to implement PIP. That this is an issue in practice is illustrated by the |
|
315 | 1172 |
  email from Baker we cited in the Introduction. We also achieved this
  goal: The formalisation gives the first author enough data to enable
1174 |
his undergraduate students to implement PIP (as part of their OS course) |
|
1175 |
on top of PINTOS, a small operating system for teaching |
|
315 | 1176 |
purposes. A byproduct of our formalisation effort is that nearly all |
314 | 1177 |
design choices for the PIP scheduler are backed up with a proved |
317 | 1178 |
lemma. We were also able to establish the property that the choice of |
1179 |
the next thread which takes over a lock is irrelevant for the correctness |
|
1180 |
of PIP. Earlier model checking approaches which verified implementations |
|
1181 |
of PIP \cite{Faria08,Jahier09,Wellings07} cannot |
|
1182 |
provide this kind of ``deep understanding'' about the principles behind |
|
1183 |
PIP and its correctness. |
|
315 | 1184 |
|
1185 |
PIP is a scheduling algorithm for single-processor systems. We are |
|
316 | 1186 |
now living in a multi-processor world. So the question naturally |
318 | 1187 |
arises whether PIP has any relevance in such a world beyond |
1188 |
teaching. Priority Inversion certainly occurs also in |
|
321 | 1189 |
multi-processor systems. However, the surprising answer, according |
1190 |
to \cite{Steinberg10}, is that except for one unsatisfactory |
|
1191 |
proposal nobody has a good idea for how PIP should be modified to |
|
1192 |
work correctly on multi-processor systems. The difficulties become |
|
1193 |
clear when considering that locking and releasing a resource always |
|
1194 |
requires a small amount of time. If processes work independently, |
|
1195 |
  then a low priority process can ``steal'' in such an unguarded
  moment a lock for a resource that was supposed to allow a high-priority
  process to run next. Thus the problem of Priority Inversion is not
  really prevented. It seems difficult to design a PIP-algorithm with
  a meaningful correctness property on multi-processor systems where
|
1200 |
processes work independently. We can imagine PIP to be of use in |
|
1201 |
situations where processes are \emph{not} independent, but |
|
1202 |
coordinated via a master process that distributes work over some |
|
1203 |
slave processes. However, a formal investigation of this is beyond |
|
1204 |
the scope of this paper. We are not aware of any proofs in this |
|
1205 |
area, not even informal ones. |
|
265 | 1206 |
|
321 | 1207 |
The most closely related work to ours is the formal verification in |
1208 |
PVS for Priority Ceiling done by Dutertre \cite{dutertre99b}. His formalisation |
|
1209 |
consists of 407 lemmas and 2500 lines of ``specification'' (we do not |
|
1210 |
know whether this includes also code for proofs). Our formalisation |
|
1211 |
consists of around 210 lemmas and overall 6950 lines of readable Isabelle/Isar |
|
1212 |
code with a few apply-scripts interspersed. The formal model of PIP |
|
1213 |
is 385 lines long; the formal correctness proof 3800 lines. Some auxiliary |
|
1214 |
definitions and proofs took 770 lines of code. The properties relevant |
|
1215 |
for an implementation took 2000 lines. Our code can be downloaded from |
|
1216 |
... |
|
1217 |
||
1218 |
\bibliographystyle{plain} |
|
1219 |
\bibliography{root} |
|
262 | 1220 |
*} |
1221 |
||
1222 |
section {* Key properties \label{extension} *} |
|
1223 |
||
264 | 1224 |
(*<*) |
1225 |
context extend_highest_gen |
|
1226 |
begin |
|
1227 |
(*>*) |
|
1228 |
||
1229 |
text {* |
|
1230 |
  The essence of {\em Priority Inheritance} is to avoid indefinite priority inversion. For this
  purpose, we need to investigate what happens after one thread takes the highest precedence.
  A locale is used to describe such a situation, which assumes:
|
1233 |
\begin{enumerate} |
|
1234 |
\item @{term "s"} is a valid state (@{text "vt_s"}): |
|
1235 |
@{thm vt_s}. |
|
1236 |
\item @{term "th"} is a living thread in @{term "s"} (@{text "threads_s"}): |
|
1237 |
@{thm threads_s}. |
|
1238 |
\item @{term "th"} has the highest precedence in @{term "s"} (@{text "highest"}): |
|
1239 |
@{thm highest}. |
|
1240 |
\item The precedence of @{term "th"} is @{term "Prc prio tm"} (@{text "preced_th"}): |
|
1241 |
@{thm preced_th}. |
|
1242 |
\end{enumerate} |
|
1243 |
*} |
|
1244 |
||
1245 |
text {* \noindent |
|
1246 |
  Under these assumptions, some basic properties can be derived for @{term "th"}:
|
1247 |
\begin{enumerate} |
|
1248 |
\item The current precedence of @{term "th"} equals its own precedence (@{text "eq_cp_s_th"}): |
|
1249 |
@{thm [display] eq_cp_s_th} |
|
1250 |
\item The current precedence of @{term "th"} is the highest precedence in |
|
1251 |
the system (@{text "highest_cp_preced"}): |
|
1252 |
@{thm [display] highest_cp_preced} |
|
1253 |
\item The precedence of @{term "th"} is the highest precedence |
|
1254 |
in the system (@{text "highest_preced_thread"}): |
|
1255 |
@{thm [display] highest_preced_thread} |
|
1256 |
\item The current precedence of @{term "th"} is the highest current precedence |
|
1257 |
in the system (@{text "highest'"}): |
|
1258 |
@{thm [display] highest'} |
|
1259 |
\end{enumerate} |
|
1260 |
*} |
|
1261 |
||
1262 |
text {* \noindent |
|
1263 |
  To analyse what happens after state @{term "s"}, a sub-locale is defined, which
  assumes:
|
1265 |
\begin{enumerate} |
|
1266 |
\item @{term "t"} is a valid extension of @{term "s"} (@{text "vt_t"}): @{thm vt_t}. |
|
1267 |
  \item Any thread created in @{term "t"} has priority no higher than @{term "prio"}, so
        its precedence cannot be higher than that of @{term "th"}, and
        @{term "th"} remains the thread with the highest precedence
        (@{text "create_low"}):
        @{thm [display] create_low}
  \item Any adjustment of priority in
        @{term "t"} does not happen to @{term "th"} and
        the priority set is no higher than @{term "prio"}, so
        @{term "th"} remains the thread with the highest precedence (@{text "set_diff_low"}):
        @{thm [display] set_diff_low}
|
1277 |
\item Since we are investigating what happens to @{term "th"}, it is assumed |
|
1278 |
@{term "th"} does not exit during @{term "t"} (@{text "exit_diff"}): |
|
1279 |
@{thm [display] exit_diff} |
|
1280 |
\end{enumerate} |
|
1281 |
*} |
|
1282 |
||
1283 |
text {* \noindent |
|
1284 |
All these assumptions are put into a predicate @{term "extend_highest_gen"}. |
|
1285 |
  It can be proved that @{term "extend_highest_gen"} holds
  for any moment @{text "i"} in @{term "t"} (@{text "red_moment"}):
|
1287 |
@{thm [display] red_moment} |
|
1288 |
||
1289 |
From this, an induction principle can be derived for @{text "t"}, so that |
|
1290 |
properties already derived for @{term "t"} can be applied to any prefix |
|
1291 |
of @{text "t"} in the proof of new properties |
|
1292 |
about @{term "t"} (@{text "ind"}): |
|
1293 |
\begin{center} |
|
1294 |
@{thm[display] ind} |
|
1295 |
\end{center} |
|
1296 |
||
1297 |
The following properties can be proved about @{term "th"} in @{term "t"}: |
|
1298 |
\begin{enumerate} |
|
1299 |
\item In @{term "t"}, thread @{term "th"} is kept live and its |
|
1300 |
precedence is preserved as well |
|
1301 |
(@{text "th_kept"}): |
|
1302 |
@{thm [display] th_kept} |
|
1303 |
\item In @{term "t"}, thread @{term "th"}'s precedence is always the maximum among |
|
1304 |
all living threads |
|
1305 |
(@{text "max_preced"}): |
|
1306 |
@{thm [display] max_preced} |
|
1307 |
\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum precedence |
|
1308 |
among all living threads |
|
1309 |
(@{text "th_cp_max_preced"}): |
|
1310 |
@{thm [display] th_cp_max_preced} |
|
1311 |
\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum current |
|
1312 |
precedence among all living threads |
|
1313 |
(@{text "th_cp_max"}): |
|
1314 |
@{thm [display] th_cp_max} |
|
1315 |
\item In @{term "t"}, thread @{term "th"}'s current precedence equals its precedence at moment |
|
1316 |
@{term "s"} |
|
1317 |
(@{text "th_cp_preced"}): |
|
1318 |
@{thm [display] th_cp_preced} |
|
1319 |
\end{enumerate} |
|
1320 |
*} |
|
1321 |
||
1322 |
text {* \noindent |
|
  The main theorem of this part characterises the running thread during @{term "t"}
  (@{text "runing_inversion_2"}):
1325 |
@{thm [display] runing_inversion_2} |
|
1326 |
According to this, if a thread is running, it is either @{term "th"} or was |
|
1327 |
already live and held some resource |
|
1328 |
at moment @{text "s"} (expressed by: @{text "cntV s th' < cntP s th'"}). |
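
  \noindent
  The counting condition can be made concrete with a small Python sketch. It assumes,
  as a reading of the notation rather than a quotation of the definitions, that
  @{text "cntP s th'"} and @{text "cntV s th'"} count the @{text P} and @{text V} events
  issued by @{text "th'"} in the event list @{text s}. The encoding of events as tuples and
  the names @{text "cnt_p"}, @{text "cnt_v"} and @{text "may_block"} are hypothetical.

```python
# Sketch only: events are tuples such as ("P", th, cs); the newest event
# comes first, as in the states s and t@s of the model.

def cnt_p(state, th):
    """Number of P (lock request) events issued by th in state."""
    return sum(1 for e in state if e[0] == "P" and e[1] == th)

def cnt_v(state, th):
    """Number of V (release) events issued by th in state."""
    return sum(1 for e in state if e[0] == "V" and e[1] == th)

def may_block(state, th):
    """The condition cntV s th < cntP s th from the theorem."""
    return cnt_v(state, th) < cnt_p(state, th)

# A locked and released r1; B locked r2 and has not released it yet.
s = [("V", "A", "r1"), ("P", "B", "r2"), ("P", "A", "r1"),
     ("Create", "B", 1), ("Create", "A", 2)]
print(may_block(s, "A"), may_block(s, "B"))
```

  In this example only @{text B} satisfies the condition, so only @{text B} could be
  running in place of @{text th} in an extension of @{text s}.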
|
1329 |
||
1330 |
  Since there are only finitely many threads alive and holding some resource at any moment,
  if every such thread releases all its resources within a finite duration, then after a finite
  duration, none of them can block @{term "th"} anymore. So, no priority inversion can happen
  after that.
|
1334 |
*} |
|
1335 |
||
1336 |
(*<*) |
|
1337 |
end |
|
1338 |
||
262 | 1339 |
end |
1340 |
(*>*) |