author | urbanc |
Sun, 12 Feb 2012 04:45:20 +0000 | |
changeset 298 | f2e0d031a395 |
parent 297 | 0a4be67ea7f8 |
child 299 | 4fcd802eba59 |
permissions | -rwxr-xr-x |
(*<*)
theory Paper
imports CpsG ExtGG "~~/src/HOL/Library/LaTeXsugar"
begin
ML {*
open Printer;
show_question_marks_default := false;
*}

notation (latex output)
Cons ("_::_" [78,77] 73) and
vt ("valid'_state") and
runing ("running") and
birthtime ("last'_set") and
If ("(\<^raw:\textrm{>if\<^raw:}> (_)/ \<^raw:\textrm{>then\<^raw:}> (_)/ \<^raw:\textrm{>else\<^raw:}> (_))" 10) and
Prc ("'(_, _')") and
holding ("holds") and
waiting ("waits") and
Th ("T") and
Cs ("C") and
readys ("ready") and
depend ("RAG") and
preced ("prec") and
cpreced ("cprec") and
dependents ("dependants") and
cp ("cprec") and
holdents ("resources") and
DUMMY ("\<^raw:\mbox{$\_\!\_$}>")
(*>*)

section {* Introduction *}

text {*
Many real-time systems need to support threads involving priorities and
locking of resources. Locking of resources ensures mutual exclusion
when accessing shared data or devices that cannot be
preempted. Priorities allow scheduling of threads that need to
finish their work within deadlines. Unfortunately, both features
can interact in subtle ways leading to a problem, called
\emph{Priority Inversion}. Suppose there are three threads having priorities
$H$(igh), $M$(edium) and $L$(ow). We would expect that the thread
$H$ blocks any other thread with lower priority and itself cannot
be blocked by any thread with lower priority. Alas, in a naive
implementation of resource locking and priorities this property can
be violated. Even worse, $H$ can be delayed indefinitely by
threads with lower priorities. To see this, let $L$ be in the
possession of a lock for a resource that $H$ also needs. $H$ must
therefore wait for $L$ to exit the critical section and release this
lock. The problem is that $L$ might in turn be blocked by any
thread with priority $M$, and so $H$ sits there potentially waiting
indefinitely. Since $H$ is blocked by threads with lower
priorities, the problem is called Priority Inversion. It was first
described in \cite{Lampson80} in the context of the
Mesa programming language designed for concurrent programming.

If the problem of Priority Inversion is ignored, real-time systems
can become unpredictable and resulting bugs can be hard to diagnose.
The classic example where this happened is the software that
controlled the Mars Pathfinder mission in 1997 \cite{Reeves98}.
Once the spacecraft landed, the software shut down at irregular
intervals leading to loss of project time as normal operation of the
craft could only resume the next day (the mission and data already
collected were fortunately not lost, because of a clever system
design). The reason for the shutdowns was that the scheduling
software fell victim to Priority Inversion: a low priority thread
locking a resource prevented a high priority thread from running in
time, leading to a system reset. Once the problem was found, it was
rectified by enabling the \emph{Priority Inheritance Protocol} (PIP)
\cite{Sha90}\footnote{Sha et al.~call it the \emph{Basic Priority
Inheritance Protocol} \cite{Sha90} and others sometimes also call it
\emph{Priority Boosting}.} in the scheduling software.

The idea behind PIP is to let the thread $L$ temporarily inherit
the high priority from $H$ until $L$ leaves the critical section
and unlocks the resource. This solves the problem of $H$ having to
wait indefinitely, because $L$ can no longer be blocked by threads having
priority $M$. While a few other solutions exist for the Priority
Inversion problem, PIP is one that is widely deployed and
implemented. This includes VxWorks (a proprietary real-time OS used
in the Mars Pathfinder mission, in Boeing's 787 Dreamliner, Honda's
ASIMO robot, etc.), but also the POSIX 1003.1c Standard realised for
example in libraries for FreeBSD, Solaris and Linux.

One advantage of PIP is that increasing the priority of a thread
can be dynamically calculated by the scheduler. This is in contrast
to, for example, \emph{Priority Ceiling} \cite{Sha90}, another
solution to the Priority Inversion problem, which requires static
analysis of the program in order to prevent Priority
Inversion. However, there has also been strong criticism against
PIP. For instance, PIP cannot prevent deadlocks when lock
dependencies are circular, and blocking times can be
substantial (more than just the duration of a critical section).
Most criticism against PIP, however, centres around unreliable
implementations and PIP being too complicated and too inefficient.
For example, Yodaiken writes in \cite{Yodaiken02}:

\begin{quote}
\it{}``Priority inheritance is neither efficient nor reliable. Implementations
are either incomplete (and unreliable) or surprisingly complex and intrusive.''
\end{quote}

\noindent
He suggests avoiding PIP altogether by not allowing critical
sections to be preempted. Unfortunately, this solution does not
help in real-time systems with low latency \emph{requirements}.

In our opinion, there is clearly a need for investigating correct
algorithms for PIP. A few specifications for PIP exist (in English)
and also a few high-level descriptions of implementations (e.g.~in
the textbook \cite[Section 5.6.5]{Vahalia96}), but they help little
with actual implementations. That this is a problem in practice is
shown by an email from Baker, who wrote on 13 July 2009 on the Linux
Kernel mailing list:

\begin{quote}
\it{}``I observed in the kernel code (to my disgust), the Linux PIP
implementation is a nightmare: extremely heavy weight, involving
maintenance of a full wait-for graph, and requiring updates for a
range of events, including priority changes and interruptions of
wait operations.''
\end{quote}

\noindent
The criticism by Yodaiken, Baker and others suggests that we look
again at PIP from a more abstract level (but still concrete enough
to inform an implementation), and makes PIP an ideal candidate for a
formal verification. One reason, of course, is that the original
presentation of PIP~\cite{Sha90}, despite being informally
``proved'' correct, is actually \emph{flawed}.

Yodaiken \cite{Yodaiken02} points to a subtlety that had been
overlooked in the informal proof by Sha et al. They specify in
\cite{Sha90} that after the thread (whose priority has been raised)
completes its critical section and releases the lock, it ``returns
to its original priority level.'' This leads them to believe that an
implementation of PIP is ``rather straightforward''~\cite{Sha90}.
Unfortunately, as Yodaiken points out, this behaviour is too
simplistic. Consider the case where the low priority thread $L$
locks \emph{two} resources, and two high-priority threads $H$ and
$H'$ each wait for one of them. If $L$ then releases one resource
so that $H$, say, can proceed, then we still have Priority Inversion
with $H'$ (which waits for the other resource). The correct
behaviour for $L$ is to revert to the highest remaining priority of
the threads that it blocks. The advantage of formalising the
correctness of a high-level specification of PIP in a theorem prover
is that such issues clearly show up and cannot be overlooked as in
informal reasoning (since we have to analyse all possible behaviours
of threads, i.e.~\emph{traces}, that could possibly happen).
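
The two-resource scenario can be replayed in a few lines of Python. This is only an illustrative sketch and not part of the formalisation: the resource names, the numeric priorities (larger means more urgent) and the two functions are assumptions made for the example.

```python
# Illustrative scenario: L holds two resources; H waits for r1 and H' for r2.
# All names and numeric priorities are hypothetical (larger means more urgent).
ORIGINAL_PRIORITY_OF_L = 1
BLOCKED_BY_L = {"r1": 10, "r2": 9}   # resource -> priority of the thread waiting for it


def naive_priority(still_held):
    """Rule quoted from Sha et al.: after a release, L returns to its original priority."""
    return ORIGINAL_PRIORITY_OF_L


def corrected_priority(still_held):
    """L keeps the highest priority among the threads it still blocks."""
    inherited = [BLOCKED_BY_L[r] for r in still_held]
    return max([ORIGINAL_PRIORITY_OF_L] + inherited)


# L releases r1, so H can proceed, but H' is still waiting for r2.
after_release = ["r2"]
medium = 5   # priority of an unrelated thread M
print(naive_priority(after_release) < medium)      # M can preempt L, so H' suffers inversion
print(corrected_priority(after_release) > medium)  # L still runs ahead of M
```

Under the naive rule the thread $M$ can again delay $L$, and hence $H'$, which is exactly the inversion PIP is meant to rule out.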

There have been earlier formal investigations into PIP, but ...\cite{Faria08}

vt (valid trace) was introduced earlier, cite

distributed PIP

Paulson's method has not been used outside security field, except
work by Zhang et al.

no clue about multi-processor case according to \cite{Steinberg10}
*}

section {* Formal Model of the Priority Inheritance Protocol *}

text {*
The Priority Inheritance Protocol, PIP for short, is a scheduling
algorithm for a single-processor system.\footnote{We shall come back
later to the case of PIP on multi-processor systems.} Our model of
PIP is based on Paulson's inductive approach to protocol
verification \cite{Paulson98}, where the \emph{state} of a system is
given by a list of events that happened so far. \emph{Events} of PIP fall
into five categories defined as the datatype:

\begin{isabelle}\ \ \ \ \ %%%
\mbox{\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l@ {\hspace{7mm}}l}
\isacommand{datatype} event
& @{text "="} & @{term "Create thread priority"}\\
& @{text "|"} & @{term "Exit thread"} \\
& @{text "|"} & @{term "Set thread priority"} & {\rm reset of the priority for} @{text thread}\\
& @{text "|"} & @{term "P thread cs"} & {\rm request of resource} @{text "cs"} {\rm by} @{text "thread"}\\
& @{text "|"} & @{term "V thread cs"} & {\rm release of resource} @{text "cs"} {\rm by} @{text "thread"}
\end{tabular}}
\end{isabelle}

\noindent
whereby threads, priorities and (critical) resources are represented
as natural numbers. The event @{term Set} models the situation that
a thread obtains a new priority given by the programmer or
user (for example via the {\tt nice} utility under UNIX). As in Paulson's work, we
need to define functions that allow us to make some observations
about states. One, called @{term threads}, calculates the set of
``live'' threads that we have seen so far:

\begin{isabelle}\ \ \ \ \ %%%
\mbox{\begin{tabular}{lcl}
@{thm (lhs) threads.simps(1)} & @{text "\<equiv>"} &
@{thm (rhs) threads.simps(1)}\\
@{thm (lhs) threads.simps(2)[where thread="th"]} & @{text "\<equiv>"} &
@{thm (rhs) threads.simps(2)[where thread="th"]}\\
@{thm (lhs) threads.simps(3)[where thread="th"]} & @{text "\<equiv>"} &
@{thm (rhs) threads.simps(3)[where thread="th"]}\\
@{term "threads (DUMMY#s)"} & @{text "\<equiv>"} & @{term "threads s"}\\
\end{tabular}}
\end{isabelle}
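
For readers who prefer executable code, the event datatype and the function for live threads can be sketched in Python. The encoding of events as tagged tuples is an assumption made for this illustration, and so is the reading of the clauses (a creation adds a thread, an exit removes it, all other events leave the set unchanged); the authoritative definitions are the Isabelle ones.

```python
# Events as tagged tuples (an illustrative encoding, not the Isabelle datatype):
#   ("Create", th, prio), ("Exit", th), ("Set", th, prio), ("P", th, cs), ("V", th, cs)
# A state is a list of events with the most recent event first.


def threads(s):
    """Return the set of threads that are live in state s."""
    live = set()
    for event in reversed(s):          # replay the events, oldest first
        tag, th = event[0], event[1]
        if tag == "Create":
            live.add(th)
        elif tag == "Exit":
            live.discard(th)
        # Set, P and V do not change the set of live threads
    return live


state = [("Exit", 1), ("P", 2, 7), ("Create", 2, 5), ("Create", 1, 3)]
print(threads(state))
```

Replaying the list from its oldest event gives the same result as a recursion over the list from its head, because each clause only adds or removes a single thread.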

\noindent
Another function calculates the priority for a thread @{text "th"}, which is
defined as

\begin{isabelle}\ \ \ \ \ %%%
\mbox{\begin{tabular}{lcl}
@{thm (lhs) original_priority.simps(1)[where thread="th"]} & @{text "\<equiv>"} &
@{thm (rhs) original_priority.simps(1)[where thread="th"]}\\
@{thm (lhs) original_priority.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
@{thm (rhs) original_priority.simps(2)[where thread="th" and thread'="th'"]}\\
@{thm (lhs) original_priority.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
@{thm (rhs) original_priority.simps(3)[where thread="th" and thread'="th'"]}\\
@{term "original_priority th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "original_priority th s"}\\
\end{tabular}}
\end{isabelle}

\noindent
In this definition we set @{text 0} as the default priority for
threads that have not (yet) been created. The last function we need
calculates the ``time'', or index, at which a thread had its
priority last set.

\begin{isabelle}\ \ \ \ \ %%%
\mbox{\begin{tabular}{lcl}
@{thm (lhs) birthtime.simps(1)[where thread="th"]} & @{text "\<equiv>"} &
@{thm (rhs) birthtime.simps(1)[where thread="th"]}\\
@{thm (lhs) birthtime.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
@{thm (rhs) birthtime.simps(2)[where thread="th" and thread'="th'"]}\\
@{thm (lhs) birthtime.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} &
@{thm (rhs) birthtime.simps(3)[where thread="th" and thread'="th'"]}\\
@{term "birthtime th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "birthtime th s"}\\
\end{tabular}}
\end{isabelle}

\noindent
In this definition @{term "length s"} stands for the length of the list
of events @{text s}. Again the default value in this function is @{text 0}
for threads that have not been created yet. A \emph{precedence} of a thread @{text th} in a
state @{text s} is the pair of natural numbers defined as

\begin{isabelle}\ \ \ \ \ %%%
@{thm preced_def[where thread="th"]}
\end{isabelle}

\noindent
The point of precedences is to schedule threads not according to priorities (what should
we do if two threads have the same priority?), but according to precedences.
Precedences allow us to always discriminate between two threads with equal priority by
taking into account the time when the priority was last set. We order precedences so
that threads with the same priority get a higher precedence if their priority has been
set earlier, since for such threads it is more urgent to finish their work. In an implementation
this choice would translate to a quite natural FIFO-scheduling of threads with
the same priority.
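
The three observations on states and the ordering on precedences can be sketched in Python as follows. The sketch rests on assumptions: events are encoded as tagged tuples with the newest event first, both creating a thread and setting its priority count as ``setting'' the priority, and the time stamp is the length of the state before the event. The helper names `prec_key` and `highest` are hypothetical.

```python
# States are lists of tagged tuples, newest event first, for example
#   ("Create", th, prio), ("Set", th, prio), ("Exit", th), ("P", th, cs), ("V", th, cs)


def priority(th, s):
    """Priority given to th by its most recent Create or Set event; 0 by default."""
    for event in s:
        if event[0] in ("Create", "Set") and event[1] == th:
            return event[2]
    return 0


def last_set(th, s):
    """Index at which the priority of th was last set; 0 by default."""
    for index, event in enumerate(s):
        if event[0] in ("Create", "Set") and event[1] == th:
            return len(s) - index - 1      # length of the state before the event
    return 0


def preced(th, s):
    """Precedence of th in s: the pair (priority, time the priority was last set)."""
    return (priority(th, s), last_set(th, s))


def prec_key(prec):
    """Sort key: higher priority first; for equal priorities the earlier time wins."""
    prio, time = prec
    return (prio, -time)


def highest(ths, s):
    """Thread with the highest precedence among ths."""
    return max(ths, key=lambda th: prec_key(preced(th, s)))


state = [("Set", 1, 7), ("Create", 2, 7), ("Create", 1, 3)]
print(preced(1, state), preced(2, state), highest([1, 2], state))
```

In the example both threads end up with priority 7, but thread 2 received it earlier and therefore has the higher precedence, which corresponds to the FIFO behaviour described above.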

Next, we introduce the concept of \emph{waiting queues}. They are
lists of threads associated with every resource. The first thread in
this list (i.e.~the head, or @{term hd} for short) is chosen to be the one
that is in possession of the
``lock'' of the corresponding resource. We model waiting queues as
functions, below abbreviated as @{text wq}. They take a resource as
argument and return a list of threads. This allows us to define
when a thread \emph{holds}, respectively \emph{waits} for, a
resource @{text cs} given a waiting queue function @{text wq}.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm cs_holding_def[where thread="th"]}\\
@{thm cs_waiting_def[where thread="th"]}
\end{tabular}
\end{isabelle}

\noindent
In this definition we assume @{text "set"} converts a list into a set.
At the beginning, that is in the state where no thread is created yet,
the waiting queue function will be the function that returns the
empty list for every resource.

\begin{isabelle}\ \ \ \ \ %%%
@{abbrev all_unlocked}
\end{isabelle}
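
As an informal illustration, waiting queue functions and the two predicates can be written in Python. The sketch assumes the reading given in the text (the head of a queue holds the resource, every other member waits for it); the helper `wq_from`, which turns a dictionary into a waiting queue function, is hypothetical.

```python
def all_unlocked(cs):
    """Initial waiting queue function: every resource has an empty queue."""
    return []


def wq_from(queues):
    """Build a waiting queue function from a dict mapping resources to lists of threads."""
    return lambda cs: queues.get(cs, [])


def holding(wq, th, cs):
    """th holds cs if it is the head of the waiting queue of cs."""
    queue = wq(cs)
    return th in queue and queue[0] == th


def waiting(wq, th, cs):
    """th waits for cs if it is in the queue of cs but not at its head."""
    queue = wq(cs)
    return th in queue and queue[0] != th


wq = wq_from({1: [10, 11, 12]})
print(holding(wq, 10, 1), waiting(wq, 11, 1), holding(all_unlocked, 10, 1))
```

Representing the queues as a function rather than a table mirrors the model: a resource that has never been requested simply maps to the empty list.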

\noindent
Using @{term "holding"} and @{term waiting}, we can introduce \emph{Resource Allocation Graphs}
(RAG), which represent the dependencies between threads and resources.
We represent RAGs as relations using pairs of the form

\begin{isabelle}\ \ \ \ \ %%%
@{term "(Th th, Cs cs)"} \hspace{5mm}{\rm and}\hspace{5mm}
@{term "(Cs cs, Th th)"}
\end{isabelle}

\noindent
where the first stands for a \emph{waiting edge} and the second for a
\emph{holding edge} (@{term Cs} and @{term Th} are constructors of a
datatype for vertices). Given a waiting queue function, a RAG is defined
as

\begin{isabelle}\ \ \ \ \ %%%
@{thm cs_depend_def}
\end{isabelle}

\noindent
Given four threads and three resources, an instance of a RAG is as follows:

\begin{center}
\newcommand{\fnt}{\fontsize{7}{8}\selectfont}
\begin{tikzpicture}[scale=1]
%%\draw[step=2mm] (-3,2) grid (1,-1);

\node (A) at (0,0) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>0"}};
\node (B) at (2,0) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>1"}};
\node (C) at (4,0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>1"}};
\node (D) at (4,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>2"}};
\node (E) at (6,-0.7) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>2"}};
\node (E1) at (6, 0.2) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>3"}};
\node (F) at (8,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>3"}};

\draw [->,line width=0.6mm] (A) to node [pos=0.45,sloped,above=-0.5mm] {\fnt{}holding} (B);
\draw [->,line width=0.6mm] (C) to node [pos=0.4,sloped,above=-0.5mm] {\fnt{}waiting} (B);
\draw [->,line width=0.6mm] (D) to node [pos=0.4,sloped,below=-0.5mm] {\fnt{}waiting} (B);
\draw [->,line width=0.6mm] (D) to node [pos=0.45,sloped,below=-0.5mm] {\fnt{}holding} (E);
\draw [->,line width=0.6mm] (D) to node [pos=0.45,sloped,above=-0.5mm] {\fnt{}holding} (E1);
\draw [->,line width=0.6mm] (F) to node [pos=0.45,sloped,below=-0.5mm] {\fnt{}waiting} (E);
\end{tikzpicture}
\end{center}

\noindent
The use of relations for representing RAGs allows us to conveniently define
the notion of the \emph{dependants} of a thread. This is defined as

\begin{isabelle}\ \ \ \ \ %%%
@{thm cs_dependents_def}
\end{isabelle}

\noindent
This definition needs to account for all threads that wait for a thread to
release a resource. This means we need to include threads that transitively
wait for a resource being released (in the picture above this means the dependants
of @{text "th\<^isub>0"} are @{text "th\<^isub>1"} and @{text "th\<^isub>2"}, but also @{text "th\<^isub>3"},
which cannot make any progress unless @{text "th\<^isub>2"} makes progress, which
in turn needs to wait for @{text "th\<^isub>0"} to finish). If there is a cycle in a RAG, then clearly
we have a deadlock. Therefore when a thread requests a resource,
we must ensure that the resulting RAG is not circular.
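
The RAG of the picture and the dependants of its threads can be computed with a short Python sketch. It assumes the reading given in the text: a waiting edge goes from a thread to a resource, a holding edge from a resource to its holder, and the dependants of a thread are all threads with a path to it. The function names and the restriction to a finite collection of resources are assumptions of the sketch.

```python
def rag(wq, resources):
    """Edges of the RAG for the waiting queue function wq over the given resources."""
    edges = set()
    for cs in resources:
        for position, th in enumerate(wq(cs)):
            if position == 0:
                edges.add((("Cs", cs), ("Th", th)))    # holding edge
            else:
                edges.add((("Th", th), ("Cs", cs)))    # waiting edge
    return edges


def ancestors(edges, node):
    """All vertices from which node can be reached along one or more edges."""
    found, frontier = set(), [node]
    while frontier:
        current = frontier.pop()
        for source, target in edges:
            if target == current and source not in found:
                found.add(source)
                frontier.append(source)
    return found


def dependants(edges, th):
    """Threads that directly or transitively wait for th."""
    return {name for kind, name in ancestors(edges, ("Th", th)) if kind == "Th"}


def is_circular(edges):
    """A RAG is circular (deadlocked) if some vertex can reach itself."""
    nodes = {node for edge in edges for node in edge}
    return any(node in ancestors(edges, node) for node in nodes)


# The picture: th0 holds cs1; th1 and th2 wait for cs1; th2 holds cs2 and cs3; th3 waits for cs2.
figure = {1: [0, 1, 2], 2: [2, 3], 3: [2]}
edges = rag(lambda cs: figure.get(cs, []), figure.keys())
print(sorted(dependants(edges, 0)), is_circular(edges))
```

The backward search is quadratic in the number of edges, which is fine for an illustration; an implementation would more likely maintain the relation incrementally.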

Next we introduce the notion of the \emph{current precedence} of a thread @{text th} in a
state @{text s}. It is defined as

\begin{isabelle}\ \ \ \ \ %%%
@{thm cpreced_def2}\numbered{permprops}
\end{isabelle}

\noindent
While the precedence @{term prec} of a thread is determined by the programmer
(for example when the thread is
created), the point of the current precedence is to let the scheduler increase this
precedence, if needed according to PIP. Therefore the current precedence of @{text th} is
given as the maximum of the precedence @{text th} has in state @{text s} \emph{and} the
precedences of all threads that are dependants of @{text th}. Since the notion @{term "dependants"} is
defined as the transitive closure of all dependent threads, we deal correctly with the
problem in the algorithm by Sha et al.~\cite{Sha90} where a priority of a thread is
lowered prematurely.
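
The effect of taking the maximum over all dependants can be seen in a small Python sketch. It is a simplification made for illustration: precedences and dependants are passed in as precomputed dictionaries rather than derived from a state, and the ordering on precedences (higher priority first, then the earlier time) is the assumed reading of the ordering described earlier.

```python
def prec_key(prec):
    """Sort key: higher priority first; for equal priorities the earlier time wins."""
    prio, time = prec
    return (prio, -time)


def cpreced(preced, dependants, th):
    """Maximum of the precedence of th and the precedences of its dependants."""
    own = [preced[th]]
    inherited = [preced[d] for d in sorted(dependants.get(th, set()))]
    return max(own + inherited, key=prec_key)


# Hypothetical precedences (priority, time last set) for the threads L, H and H'.
preced = {"L": (1, 0), "H": (10, 1), "H'": (9, 2)}

print(cpreced(preced, {"L": {"H", "H'"}}, "L"))   # L blocks both H and H'
print(cpreced(preced, {"L": {"H'"}}, "L"))        # L has released the resource H waited for
```

After the release, the thread $L$ falls back to the precedence of $H'$ rather than to its own, which is the behaviour the two-resource example in the introduction calls for.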

The next function, called @{term schs}, defines the behaviour of the scheduler. It will be defined
by recursion on the state (a list of events); @{term "schs"} returns a \emph{schedule state}, which
we represent as a record consisting of two
functions:

\begin{isabelle}\ \ \ \ \ %%%
@{text "\<lparr>wq_fun, cprec_fun\<rparr>"}
\end{isabelle}

\noindent
The first function is a waiting queue function (that is, it takes a resource @{text "cs"} and returns the
corresponding list of threads that wait for it), the second is a function that takes
a thread and returns its current precedence (see ???). We assume the usual getter and
setter methods for such records.

In the initial state, the scheduler starts with all resources unlocked and the
current precedence of every thread is initialised with @{term "Prc 0 0"}; that means
@{abbrev initial_cprec}. Therefore
we have

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(1)} @{text "\<equiv>"}\\
\hspace{5mm}@{term "(|wq_fun = all_unlocked, cprec_fun = (\<lambda>_::thread. Prc 0 0)|)"}
\end{tabular}
\end{isabelle}
5ef9f6ebe827
more on paper; modified schs functions; it is still compatible with the old definition
urbanc
parents:
290
diff
changeset
|
393 |
|
5ef9f6ebe827
more on paper; modified schs functions; it is still compatible with the old definition
urbanc
parents:
290
diff
changeset
|
394 |
\noindent |
296 | 395 |
The cases for @{term Create}, @{term Exit} and @{term Set} are also straightforward: |
396 |
we calculate the waiting queue function of the (previous) state @{text s}; |
|
298
f2e0d031a395
completed model section; vt has only state as argument
urbanc
parents:
297
diff
changeset
|
397 |
this waiting queue function @{text wq} is unchanged in the next schedule state---because |
f2e0d031a395
completed model section; vt has only state as argument
urbanc
parents:
297
diff
changeset
|
398 |
none of these events lock or release any resources; |
296 | 399 |
for calculating the next @{term "cprec_fun"}, we use @{text wq} and the function |
298
f2e0d031a395
completed model section; vt has only state as argument
urbanc
parents:
297
diff
changeset
|
400 |
@{term cpreced}. This gives the following three clauses for @{term schs}: |
290 | 401 |
|
402 |
\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(2)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Create th prio # s)|)"}\smallskip\\
@{thm (lhs) schs.simps(3)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Exit th # s)|)"}\smallskip\\
@{thm (lhs) schs.simps(4)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Set th prio # s)|)"}
\end{tabular}
\end{isabelle}

\noindent
More interesting are the cases when a resource, say @{text cs}, is locked or released. In these cases
we need to calculate a new waiting queue function. For the event @{term P}, we have to update
the function so that the new thread list for @{text cs} is the old thread list with the thread @{text th}
appended to its end (remember that the thread at the head of this list is seen as being in possession of the
resource).

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(5)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{5mm}@{text "let"} @{text "new_wq = wq(cs := (wq cs @ [th]))"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (P th cs # s)|)"}
\end{tabular}
\end{isabelle}
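
The queue update in the clause for @{term P} can also be tried out outside Isabelle. The following is a minimal Python sketch, assuming that the waiting queue function is represented by a dictionary from resource names to lists of thread names. The function name, the dictionary representation and the example thread names are illustrative assumptions and not part of the formal model.

```python
def enqueue(wq, cs, th):
    """Return a copy of wq in which th is appended to the queue of cs.

    Assumption: wq is a dict mapping resources to thread lists; a missing
    key stands for the empty queue (an unlocked resource).
    """
    new_wq = dict(wq)                    # the old function stays untouched
    new_wq[cs] = wq.get(cs, []) + [th]   # corresponds to wq(cs := wq cs @ [th])
    return new_wq


wq0 = {}                          # all resources unlocked
wq1 = enqueue(wq0, "cs", "th1")   # th1 is at the head, so it holds cs
wq2 = enqueue(wq1, "cs", "th2")   # th2 is queued behind th1 and waits
```

The sketch copies the dictionary instead of mutating it, which mirrors the functional update in the clause: earlier waiting queue functions remain available unchanged.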

\noindent
The clause for event @{term V} is similar, except that we need to update the waiting queue function
so that the thread that held the lock is deleted from the corresponding thread list. For this we use
the auxiliary function @{term release}. A simple version of @{term release} would
just delete this thread and return the rest, namely

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}lcl}
@{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
@{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "qs"}\\
\end{tabular}
\end{isabelle}

\noindent
In practice, however, the thread with the highest precedence will often get the
lock next. We have implemented this choice, but later found out that the choice
of which thread is chosen next is actually irrelevant for the correctness of PIP.
Therefore we prove the stronger result where @{term release} is defined as

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}lcl}
@{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
@{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "SOME qs'. distinct qs' \<and> set qs' = set qs"}\\
\end{tabular}
\end{isabelle}
|
\noindent
@{text "SOME"} stands for Hilbert's epsilon and implements an arbitrary
choice for the next waiting list. It just has to be a list of distinct threads that
contains the same elements as @{text "qs"}. This gives for @{term V} the clause:

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm (lhs) schs.simps(6)} @{text "\<equiv>"}\\
\hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
\hspace{5mm}@{text "let"} @{text "new_wq = wq(cs := release (wq cs))"} @{text "in"}\\
\hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (V th cs # s)|)"}
\end{tabular}
\end{isabelle}
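
To illustrate the two versions of @{term release} and the queue update for @{term V}, the following Python sketch may help. It assumes the same dictionary representation of the waiting queue function as one would use for a quick experiment; the names of the helper functions are hypothetical, and the random shuffle is only a stand-in for the unspecified choice made by @{text "SOME"}.

```python
import random


def release_simple(queue):
    """Drop the head (the thread that held the lock); keep the rest in order."""
    return queue[1:]


def release(queue, rng=None):
    """Drop the head and return the remaining threads in an arbitrary order.

    Any duplicate-free list with the same elements as the tail is acceptable;
    shuffling is just one way of picking such a list (an assumption of this sketch).
    """
    rest = list(dict.fromkeys(queue[1:]))   # same elements, duplicates removed
    (rng or random).shuffle(rest)
    return rest


def unlock(wq, cs, rng=None):
    """Waiting queue function after the holder of cs has released it."""
    new_wq = dict(wq)                        # leave the old function untouched
    new_wq[cs] = release(wq.get(cs, []), rng)
    return new_wq
```

Only the queue of the released resource changes; whichever thread ends up at the head of the new list is the next holder. That the order of the remaining threads does not matter for correctness is exactly why the arbitrary choice is admissible.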

Having the scheduler function @{term schs} at our disposal, we can ``lift'' the notions
@{term waiting}, @{term holding}, @{term depend} and @{term cp} such that they only depend
on states.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}rcl}
@{thm (lhs) s_holding_abv} & @{text "\<equiv>"} & @{thm (rhs) s_holding_abv}\\
@{thm (lhs) s_waiting_abv} & @{text "\<equiv>"} & @{thm (rhs) s_waiting_abv}\\
@{thm (lhs) s_depend_abv} & @{text "\<equiv>"} & @{thm (rhs) s_depend_abv}\\
@{thm (lhs) cp_def} & @{text "\<equiv>"} & @{thm (rhs) cp_def}
\end{tabular}
\end{isabelle}
|
\noindent
With these we can introduce the notion of a thread being @{term readys} (i.e.~a thread
that does not wait for any resource) and the notion of the running thread.

\begin{isabelle}\ \ \ \ \ %%%
\begin{tabular}{@ {}l}
@{thm readys_def}\\
@{thm runing_def}\\
\end{tabular}
\end{isabelle}
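
The two notions can be made concrete with a small Python sketch. It assumes that a thread waits for a resource when it is in the queue of that resource but not at its head, that the waiting queue function is a dictionary from resources to thread lists, and that current precedences are given as plain integers in a dictionary. All function names and these representations are illustrative assumptions.

```python
def waiting(wq, th, cs):
    """th waits for cs if it is queued for cs but is not at the head."""
    queue = wq.get(cs, [])
    return th in queue and queue[0] != th


def ready(threads, wq):
    """Threads that do not wait for any resource."""
    return {th for th in threads
            if not any(waiting(wq, th, cs) for cs in wq)}


def running(threads, wq, cprec):
    """Ready threads whose current precedence is maximal among the ready threads."""
    rdy = ready(threads, wq)
    if not rdy:
        return set()          # no ready thread, hence no running thread
    top = max(cprec[th] for th in rdy)
    return {th for th in rdy if cprec[th] == top}
```

For example, with threads \texttt{a}, \texttt{b} and \texttt{c}, where \texttt{a} holds a resource that \texttt{b} waits for, only \texttt{a} and \texttt{c} are ready, and the one of them with the larger current precedence is running.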
|
\noindent
Note that in the initial case, that is where the list of events is empty, the set
@{term threads} is empty and therefore there is neither a ready thread nor a running one.
If one or more threads are ready, then there can only be \emph{one} thread
running, namely the one whose current precedence is equal to the maximum over all ready
threads. We can also define the set of resources that are locked by a thread in a
given state.

\begin{isabelle}\ \ \ \ \ %%%
@{thm holdents_def}
\end{isabelle}

\noindent
These resources are given by the holding edges in the RAG.

Finally we can define what a \emph{valid state} is. For example, we cannot expect to
be able to exit a thread if it has not been created yet. These validity constraints
are characterised by the inductive predicate @{term "step"}, which is defined by the following five
inference rules relating a state and an event that can happen next.

\begin{center}
\begin{tabular}{c}
@{thm[mode=Rule] thread_create[where thread=th]}\hspace{1cm}
@{thm[mode=Rule] thread_exit[where thread=th]}
\end{tabular}
\end{center}

\noindent
The first rule states that a thread can only be created if it does not yet exist.
Similarly, the second rule states that a thread can only be terminated if it is
running and no longer holds any resources. The event @{text Set} can happen
if the corresponding thread is running.

\begin{center}
@{thm[mode=Rule] thread_set[where thread=th]}
\end{center}

\noindent
If a thread wants to lock a resource, then the thread needs to be running and
we also have to make sure that locking the resource does not lead to a cycle in the
RAG. Similarly, if a thread wants to release a lock on a resource, then it must
be running and in possession of that lock. This is formally given by the
last two inference rules of @{term step}.

\begin{center}
\begin{tabular}{c}
@{thm[mode=Rule] thread_P[where thread=th]}\medskip\\
@{thm[mode=Rule] thread_V[where thread=th]}\\
\end{tabular}
\end{center}
|
\noindent
A valid state of PIP can then conveniently be defined as follows:

\begin{center}
\begin{tabular}{c}
@{thm[mode=Axiom] vt_nil}\hspace{1cm}
@{thm[mode=Rule] vt_cons}
\end{tabular}
\end{center}
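
The shape of this inductive definition can be illustrated by a short Python sketch: the empty state is valid, and a valid state may be extended by any event that the step relation permits. The sketch keeps events oldest first, whereas the formal model puts the newest event at the head of the list. The step relation used here is a deliberately simplified, hypothetical stand-in that only tracks whether a thread exists; the real rule for exiting additionally demands that the thread is running and holds no resources, and the remaining events are left out.

```python
def valid_state(events, step):
    """Check validity of a state given as a list of events, oldest first.

    Mirrors the two rules: the empty state is valid, and a valid state can be
    extended by an event e whenever step(state, e) holds.
    """
    state = []
    for e in events:
        if not step(state, e):
            return False
        state = state + [e]
    return True


def toy_step(state, event):
    """Simplified stand-in for step: only thread existence is checked."""
    kind, th = event
    alive = set()
    for k, t in state:
        if k == "Create":
            alive.add(t)
        elif k == "Exit":
            alive.discard(t)
    if kind == "Create":
        return th not in alive     # a thread can only be created if it does not exist
    if kind == "Exit":
        return th in alive         # simplification: the real rule is stronger
    return False                   # other events are not modelled in this sketch
```

Passing the step relation as an argument keeps the validity check independent of how the individual rules are formulated.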
|
\noindent
This completes our formal model of PIP. In the next section we present
properties that show that our version of PIP is correct.
*}

section {* Correctness Proof *}

text {* TO DO *}

section {* Properties for an Implementation *}

text {* TO DO *}

section {* Conclusion *}

text {* TO DO *}

text {*
\bigskip
The priority inversion phenomenon was first described in
\cite{Lampson80}. The two protocols widely used to eliminate
priority inversion, namely PI (Priority Inheritance) and PCE
(Priority Ceiling Emulation), were proposed in \cite{Sha90}. PCE is
less convenient to use because it requires static analysis of
programs. Therefore, PI is more commonly used in
practice~\cite{locke-july02}. However, as pointed out in the
literature, the analysis of the priority inheritance protocol is quite
subtle~\cite{yodaiken-july02}. A formal analysis will certainly
help us to understand and correctly implement PI. All
existing formal analyses of PI
\cite{conf/fase/JahierHR09,WellingsBSB07,Faria08} are based on
model checking. Because of the state explosion problem,
model checking is much like exhaustive testing of finite models of
limited size. The results obtained cannot be safely generalized to
models of arbitrarily large size. Worse still, since model
checking is fully automatic, it gives little insight into why the
formal model is correct. It is therefore desirable to
analyze PI using theorem proving, which gives more general results
as well as deeper insight. This is the purpose of this paper,
which gives a formal analysis of PI in the interactive theorem
prover Isabelle using Higher Order Logic (HOL). The formalization
focuses on two issues:

\begin{enumerate}
\item The correctness of the protocol model itself. A series of desirable properties is
derived until we are fully convinced that the formal model of PI does
eliminate priority inversion. A better understanding of PI is obtained
along the way. For example, we find through the formalization that the choice of the
next thread to take hold of a resource when it is released
is irrelevant for the very basic property of PI to hold,
a point never mentioned in the literature.
\item The correctness of the implementation. A series of properties is derived
which can be used as guidelines on how PI can be implemented efficiently and correctly.
\end{enumerate}
|
The rest of the paper is organized as follows: Section \ref{overview} gives an overview
of PI. Section \ref{model} introduces the formal model of PI. Section \ref{general}
discusses a series of basic properties of PI. Section \ref{extension} shows formally
how priority inversion is controlled by PI. Section \ref{implement} gives properties
which can be used as guidelines for an implementation. Section \ref{related} discusses
related work. Section \ref{conclusion} concludes the paper.

The basic priority inheritance protocol has two problems:

It does not prevent a deadlock from happening in a program with circular lock dependencies.

A chain of blocking may be formed; blocking duration can be substantial, though bounded.

Contributions

Despite the wide use of the Priority Inheritance Protocol in real-time operating
systems, its correctness has never been formally proved and mechanically checked.
All existing verifications are based on model checking. Fully automatic
verification gives little help in understanding why the protocol is correct,
and the results so obtained only apply to models of limited size.
This paper presents a formal verification based on theorem proving.
A machine-checked formal proof does help to gain a deeper understanding. We found
a fact that is not mentioned in the literature, namely that the choice of the next
thread to take over when a critical resource is released does not affect the correctness
of the protocol. The paper also shows how formal proof can help to construct
a correct and efficient implementation.\bigskip

*}

section {* An overview of priority inversion and priority inheritance \label{overview} *}

text {*

Priority inversion refers to the phenomenon where a thread with high priority is blocked
by a thread with low priority. Priority inversion happens when the high priority thread requests
some critical resource already taken by the low priority thread. Since the high
priority thread has to wait for the low priority thread to complete, it is said to be
blocked by the low priority thread. Priority inversion might prevent the high priority
thread from fulfilling its task in time if the duration of priority inversion is indefinite
and unpredictable. Indefinite priority inversion happens when an indefinite number
of threads with medium priorities are activated during the period when the high
priority thread is blocked by the low priority thread. Although these medium
priority threads cannot preempt the high priority thread directly, they are able
to preempt the low priority thread and cause it to stay in its critical section for
an indefinitely long duration. In this way, the high priority thread may be blocked indefinitely.

Priority inheritance is one protocol proposed to avoid indefinite priority inversion.
The basic idea is to let the high priority thread donate its priority to the low priority
thread holding the critical resource, so that it will not be preempted by medium priority
threads. The thread with the highest priority will not be blocked unless it is requesting
some critical resource already taken by another thread. Viewed from a different angle,
any thread which is able to block the highest priority thread must already hold some
critical resource. Furthermore, it must have held some critical resource at the
moment the highest priority thread was created; otherwise, it may never get a chance to run and
acquire one. Since the number of such resource-holding lower priority threads is finite,
if every one of them finishes its own critical section within a definite duration,
the duration for which the highest priority thread is blocked is definite as well. The key to
guaranteeing that lower priority threads finish within a definite duration is to donate them the highest
priority. In such cases, the lower priority threads are said to have inherited the
highest priority. This explains the name of the protocol,
{\em Priority Inheritance}, and how Priority Inheritance prevents indefinite delay.

The objectives of this paper are:
\begin{enumerate}
\item Build the above-mentioned idea into a formal model and prove a series of properties
until we are convinced that the formal model does fulfill the original idea.
\item Show how formally derived properties can be used as guidelines for a correct
and efficient implementation.
\end{enumerate}
The proof is totally formal in the sense that every detail is reduced to the
very first principles of Higher Order Logic. The nature of interactive theorem
proving is for the human user to persuade a computer program to accept his or her arguments.
A clear and simple understanding of the problem at hand is both a prerequisite and a
byproduct of such an effort, because everything finally has to be reduced to
first principles in order to be checked mechanically. The preceding intuitive explanation of
Priority Inheritance is just such a byproduct.
*}

section {* Formal model of Priority Inheritance \label{model} *}
text {*
\input{../../generated/PrioGDef}
*}

section {* General properties of Priority Inheritance \label{general} *}

text {*
The following are several very basic properties:
\begin{enumerate}
\item All running threads must be ready (@{text "runing_ready"}):
@{thm[display] "runing_ready"}
\item All ready threads must be living (@{text "readys_threads"}):
@{thm[display] "readys_threads"}
\item There are finitely many living threads at any moment (@{text "finite_threads"}):
@{thm[display] "finite_threads"}
\item No waiting queue contains duplicated elements (@{text "wq_distinct"}):
@{thm[display] "wq_distinct"}
\item All threads in waiting queues are living threads (@{text "wq_threads"}):
@{thm[display] "wq_threads"}
\item An event which gets a thread into a waiting queue must be a @{term "P"}-event
(@{text "block_pre"}):
@{thm[display] "block_pre"}
\item A thread may never wait for two different critical resources
(@{text "waiting_unique"}):
@{thm[display] waiting_unique[of _ _ "cs\<^isub>1" "cs\<^isub>2"]}
\item Every resource can only be held by one thread
(@{text "held_unique"}):
@{thm[display] held_unique[of _ "th\<^isub>1" _ "th\<^isub>2"]}
\item Every living thread has a unique precedence
(@{text "preced_unique"}):
@{thm[display] preced_unique[of "th\<^isub>1" _ "th\<^isub>2"]}
\end{enumerate}
*}

text {* \noindent
The following lemmas show how the RAG changes with the execution of events:
\begin{enumerate}
\item Execution of @{term "Set"} does not change the RAG (@{text "depend_set_unchanged"}):
@{thm[display] depend_set_unchanged}
\item Execution of @{term "Create"} does not change the RAG (@{text "depend_create_unchanged"}):
@{thm[display] depend_create_unchanged}
\item Execution of @{term "Exit"} does not change the RAG (@{text "depend_exit_unchanged"}):
@{thm[display] depend_exit_unchanged}
\item Execution of @{term "P"} (@{text "step_depend_p"}):
@{thm[display] step_depend_p}
\item Execution of @{term "V"} (@{text "step_depend_v"}):
@{thm[display] step_depend_v}
\end{enumerate}
*}

text {* \noindent
These properties are used to derive the following important results about the RAG:
\begin{enumerate}
\item The RAG is loop free (@{text "acyclic_depend"}):
@{thm [display] acyclic_depend}
\item RAGs are finite (@{text "finite_depend"}):
@{thm [display] finite_depend}
\item Reverse paths in the RAG are well founded (@{text "wf_dep_converse"}):
@{thm [display] wf_dep_converse}
\item The dependence relation represented by the RAG has a tree structure (@{text "unique_depend"}):
@{thm [display] unique_depend[of _ _ "n\<^isub>1" "n\<^isub>2"]}
\item All threads in the RAG are living threads
(@{text "dm_depend_threads"} and @{text "range_in"}):
@{thm [display] dm_depend_threads range_in}
\end{enumerate}
*} |
|
758 |
||
759 |
text {* \noindent |
|
760 |
The following lemmas show how every node in RAG can be chased to ready threads: |
|
761 |
\begin{enumerate} |
|
762 |
\item Every node in RAG can be chased to a ready thread (@{text "chain_building"}): |
|
763 |
@{thm [display] chain_building[rule_format]} |
|
764 |
\item The ready thread chased to is unique (@{text "dchain_unique"}): |
|
765 |
@{thm [display] dchain_unique[of _ _ "th\<^isub>1" "th\<^isub>2"]} |
|
766 |
\end{enumerate} |
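
The two lemmas above can be illustrated outside Isabelle. The following Python sketch is not part of the formalisation. It assumes the RAG is given by two hypothetical dictionaries, `waits_for` (a thread mapped to the resource it waits for) and `holder` (a resource mapped to the thread holding it); the dictionary shape already encodes `waiting_unique` and `held_unique`. Chasing then alternates between the two maps until a thread that waits for nothing is reached.

```python
def chase_to_ready(waits_for, holder, th):
    """Follow the RAG from thread `th` to a thread that waits for nothing.

    waits_for: dict thread -> resource the thread is blocked on (hypothetical)
    holder:    dict resource -> thread currently holding it (hypothetical)
    Assumes the RAG is acyclic and every awaited resource has a holder.
    """
    path = [th]
    while th in waits_for:           # th is blocked, so it is not ready
        th = holder[waits_for[th]]   # step: thread -> resource -> holder
        path.append(th)
    return th, path


# C waits for r2 held by B; B waits for r1 held by A; A is ready.
waits_for = {"C": "r2", "B": "r1"}
holder = {"r1": "A", "r2": "B"}
ready, path = chase_to_ready(waits_for, holder, "C")
```

Because each step of the loop is a function application, the ready thread reached from a given start node is necessarily unique in this sketch, which mirrors the statement of `dchain_unique`; termination relies on the acyclicity of the RAG.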
|
767 |
*} |
|
768 |
||
769 |
text {* \noindent |
|
770 |
Properties about @{term "next_th"}: |
|
771 |
\begin{enumerate} |
|
772 |
\item The thread taking over is different from the thread which is releasing |
|
773 |
(@{text "next_th_neq"}): |
|
774 |
@{thm [display] next_th_neq} |
|
775 |
\item The thread taking over is unique |
|
776 |
(@{text "next_th_unique"}): |
|
777 |
@{thm [display] next_th_unique[of _ _ _ "th\<^isub>1" "th\<^isub>2"]} |
|
778 |
\end{enumerate} |
|
779 |
*} |
|
780 |
||
781 |
text {* \noindent |
|
782 |
Some deeper results about the system: |
|
783 |
\begin{enumerate} |
|
784 |
\item There can only be one running thread (@{text "runing_unique"}): |
|
785 |
@{thm [display] runing_unique[of _ "th\<^isub>1" "th\<^isub>2"]} |
|
786 |
\item The maxima of @{term "cp"} and @{term "preced"} are equal (@{text "max_cp_eq"}): |
|
787 |
@{thm [display] max_cp_eq} |
|
788 |
\item There must be a ready thread having the maximal @{term "cp"}-value |
|
789 |
(@{text "max_cp_readys_threads"}): |
|
790 |
@{thm [display] max_cp_readys_threads} |
|
791 |
\end{enumerate} |
|
792 |
*} |
|
793 |
||
794 |
text {* \noindent |
|
795 |
The relationship between the count of @{text "P"} and @{text "V"} and the number of |
|
796 |
critical resources held by a thread is given as follows: |
|
797 |
\begin{enumerate} |
|
798 |
\item The @{term "V"}-operation decreases the number of critical resources |
|
799 |
a thread holds (@{text "cntCS_v_dec"}): |
|
800 |
@{thm [display] cntCS_v_dec} |
|
801 |
\item The number of @{text "V"} never exceeds the number of @{text "P"} |
|
802 |
(@{text "cnp_cnv_cncs"}): |
|
803 |
@{thm [display] cnp_cnv_cncs} |
|
804 |
\item The number of @{text "V"} equals the number of @{text "P"} when |
|
805 |
the relevant thread is not living |
|
806 |
(@{text "cnp_cnv_eq"}): |
|
807 |
@{thm [display] cnp_cnv_eq} |
|
808 |
\item When a thread is not living, it does not hold any critical resource |
|
809 |
(@{text "not_thread_holdents"}): |
|
810 |
@{thm [display] not_thread_holdents} |
|
811 |
\item When the number of @{text "P"} equals the number of @{text "V"}, the relevant |
|
812 |
thread does not hold any critical resource, therefore no thread can depend on it |
|
813 |
(@{text "count_eq_dependents"}): |
|
814 |
@{thm [display] count_eq_dependents} |
|
815 |
\end{enumerate} |
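
The bookkeeping behind these lemmas can be tried out on a toy trace. The Python sketch below is an illustration only and not a transcription of `cnp_cnv_cncs`. It assumes events are listed oldest first, that a released resource is handed to the next thread in FIFO order (the formal model does not fix this choice), and that a thread never issues a `P` while it is already waiting. All names (`replay`, `held`, `waiting`) are hypothetical.

```python
from collections import Counter


def replay(events):
    """Replay P/V events, oldest first, and return (cnt_p, cnt_v, queues).

    events: list of (kind, thread, resource) with kind in {"P", "V"}.
    queues: dict resource -> list of threads; the head holds the resource.
    Assumption: a released resource goes to the next thread in FIFO order.
    """
    cnt_p, cnt_v, queues = Counter(), Counter(), {}
    for kind, th, cs in events:
        queue = queues.setdefault(cs, [])
        if kind == "P":
            cnt_p[th] += 1
            queue.append(th)          # holds cs if first, otherwise waits
        elif kind == "V":
            if not queue or queue[0] != th:
                raise ValueError(f"{th} does not hold {cs}")
            cnt_v[th] += 1
            queue.pop(0)              # the next thread in the queue takes over
    return cnt_p, cnt_v, queues


def held(queues, th):
    """Number of resources whose queue head is `th`."""
    return sum(1 for q in queues.values() if q and q[0] == th)


def waiting(queues, th):
    """True if `th` sits behind the head of some queue."""
    return any(th in q[1:] for q in queues.values())
```

Under the stated assumptions, every `P` of a thread is either matched by a later `V`, or accounts for a resource still held, or is the request the thread is currently blocked on. In the replay this reads `cnt_p[th] == cnt_v[th] + held(queues, th) + (1 if waiting(queues, th) else 0)`, from which it follows that the number of `V` never exceeds the number of `P`.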
|
816 |
*} |
|
262 | 817 |
|
818 |
section {* Key properties \label{extension} *} |
|
819 |
||
264 | 820 |
(*<*) |
821 |
context extend_highest_gen |
|
822 |
begin |
|
823 |
(*>*) |
|
824 |
||
825 |
text {* |
|
826 |
The essence of {\em Priority Inheritance} is to avoid indefinite priority inversion. For this |
|
827 |
purpose, we need to investigate what happens after one thread takes the highest precedence. |
|
828 |
A locale is used to describe such a situation, which assumes: |
|
829 |
\begin{enumerate} |
|
830 |
\item @{term "s"} is a valid state (@{text "vt_s"}): |
|
831 |
@{thm vt_s}. |
|
832 |
\item @{term "th"} is a living thread in @{term "s"} (@{text "threads_s"}): |
|
833 |
@{thm threads_s}. |
|
834 |
\item @{term "th"} has the highest precedence in @{term "s"} (@{text "highest"}): |
|
835 |
@{thm highest}. |
|
836 |
\item The precedence of @{term "th"} is @{term "Prc prio tm"} (@{text "preced_th"}): |
|
837 |
@{thm preced_th}. |
|
838 |
\end{enumerate} |
|
839 |
*} |
|
840 |
||
841 |
text {* \noindent |
|
842 |
Under these assumptions, some basic properties can be derived for @{term "th"}: |
|
843 |
\begin{enumerate} |
|
844 |
\item The current precedence of @{term "th"} equals its own precedence (@{text "eq_cp_s_th"}): |
|
845 |
@{thm [display] eq_cp_s_th} |
|
846 |
\item The current precedence of @{term "th"} is the highest precedence in |
|
847 |
the system (@{text "highest_cp_preced"}): |
|
848 |
@{thm [display] highest_cp_preced} |
|
849 |
\item The precedence of @{term "th"} is the highest precedence |
|
850 |
in the system (@{text "highest_preced_thread"}): |
|
851 |
@{thm [display] highest_preced_thread} |
|
852 |
\item The current precedence of @{term "th"} is the highest current precedence |
|
853 |
in the system (@{text "highest'"}): |
|
854 |
@{thm [display] highest'} |
|
855 |
\end{enumerate} |
|
856 |
*} |
|
857 |
||
858 |
text {* \noindent |
|
859 |
To analyse what happens after state @{term "s"}, a sub-locale is defined, which |
|
860 |
assumes: |
|
861 |
\begin{enumerate} |
|
862 |
\item @{term "t"} is a valid extension of @{term "s"} (@{text "vt_t"}): @{thm vt_t}. |
|
863 |
\item Any thread created in @{term "t"} has priority no higher than @{term "prio"}, therefore |
|
864 |
its precedence cannot be higher than that of @{term "th"}, therefore |
|
865 |
@{term "th"} remains the one with the highest precedence |
|
866 |
(@{text "create_low"}): |
|
867 |
@{thm [display] create_low} |
|
868 |
\item Any adjustment of priority in |
|
869 |
@{term "t"} does not happen to @{term "th"} and |
|
870 |
the priority set is no higher than @{term "prio"}, therefore |
|
871 |
@{term "th"} remains the one with the highest precedence (@{text "set_diff_low"}): |
|
872 |
@{thm [display] set_diff_low} |
|
873 |
\item Since we are investigating what happens to @{term "th"}, it is assumed |
|
874 |
@{term "th"} does not exit during @{term "t"} (@{text "exit_diff"}): |
|
875 |
@{thm [display] exit_diff} |
|
876 |
\end{enumerate} |
|
877 |
*} |
|
878 |
||
879 |
text {* \noindent |
|
880 |
All these assumptions are put into a predicate @{term "extend_highest_gen"}. |
|
881 |
It can be proved that @{term "extend_highest_gen"} holds |
|
882 |
for any moment @{text "i"} in @{term "t"} (@{text "red_moment"}): |
|
883 |
@{thm [display] red_moment} |
|
884 |
||
885 |
From this, an induction principle can be derived for @{text "t"}, so that |
|
886 |
properties already derived for @{term "t"} can be applied to any prefix |
|
887 |
of @{text "t"} in the proof of new properties |
|
888 |
about @{term "t"} (@{text "ind"}): |
|
889 |
\begin{center} |
|
890 |
@{thm[display] ind} |
|
891 |
\end{center} |
|
892 |
||
893 |
The following properties can be proved about @{term "th"} in @{term "t"}: |
|
894 |
\begin{enumerate} |
|
895 |
\item In @{term "t"}, thread @{term "th"} is kept live and its |
|
896 |
precedence is preserved as well |
|
897 |
(@{text "th_kept"}): |
|
898 |
@{thm [display] th_kept} |
|
899 |
\item In @{term "t"}, thread @{term "th"}'s precedence is always the maximum among |
|
900 |
all living threads |
|
901 |
(@{text "max_preced"}): |
|
902 |
@{thm [display] max_preced} |
|
903 |
\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum precedence |
|
904 |
among all living threads |
|
905 |
(@{text "th_cp_max_preced"}): |
|
906 |
@{thm [display] th_cp_max_preced} |
|
907 |
\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum current |
|
908 |
precedence among all living threads |
|
909 |
(@{text "th_cp_max"}): |
|
910 |
@{thm [display] th_cp_max} |
|
911 |
\item In @{term "t"}, thread @{term "th"}'s current precedence equals its precedence at moment |
|
912 |
@{term "s"} |
|
913 |
(@{text "th_cp_preced"}): |
|
914 |
@{thm [display] th_cp_preced} |
|
915 |
\end{enumerate} |
|
916 |
*} |
|
917 |
||
918 |
text {* \noindent |
|
266 | 919 |
The main theorem of this part characterizes the running thread during @{term "t"} |
264 | 920 |
(@{text "runing_inversion_2"}): |
921 |
@{thm [display] runing_inversion_2} |
|
922 |
According to this, if a thread is running, it is either @{term "th"} or was |
|
923 |
already live and held some resource |
|
924 |
at moment @{text "s"} (expressed by: @{text "cntV s th' < cntP s th'"}). |
|
925 |
||
926 |
Since there are only finitely many threads that are live and hold some resource at any moment, |
|
927 |
if every such thread can release all its resources in finite duration, then after finite |
|
928 |
duration, none of them may block @{term "th"} anymore. So, no priority inversion may happen |
|
929 |
then. |
|
930 |
*} |
|
931 |
||
932 |
(*<*) |
|
933 |
end |
|
934 |
(*>*) |
|
935 |
||
262 | 936 |
section {* Properties to guide implementation \label{implement} *} |
937 |
||
264 | 938 |
text {* |
266 | 939 |
The properties (especially @{text "runing_inversion_2"}) convinced us that the model defined |
940 |
in Section \ref{model} does prevent indefinite priority inversion and therefore fulfills |
|
264 | 941 |
the fundamental requirement of the Priority Inheritance protocol. Another purpose of this paper |
266 | 942 |
is to show how this model can be used to guide a concrete implementation. As discussed in |
276 | 943 |
Section 5.6.5 of \cite{Vahalia96}, the implementation of Priority Inheritance in Solaris |
266 | 944 |
uses a sophisticated linking data structure. Apart from two scenarios that show how |
945 |
the data structure should be manipulated, a lot of details of the implementation are missing. |
|
946 |
In \cite{Faria08,conf/fase/JahierHR09,WellingsBSB07} the protocol is described formally |
|
947 |
using different notations, but little information is given on how this protocol can be |
|
948 |
implemented efficiently; in particular, there is no information on how these data structures |
|
949 |
should be manipulated. |
|
950 |
||
951 |
Because the scheduling of threads is based on current precedence, |
|
952 |
the central issue in implementing Priority Inheritance is how to compute this precedence |
|
953 |
correctly and efficiently. As long as the precedence is correct, it is very easy to |
|
954 |
modify the scheduling algorithm to select the correct thread to execute. |
|
955 |
||
956 |
First, it can be proved that the computation of the current precedence @{term "cp"} of a thread |
|
957 |
only involves its children (@{text "cp_rec"}): |
|
958 |
@{thm [display] cp_rec} |
|
959 |
where @{term "children s th"} represents the set of children of @{term "th"} in the current |
|
960 |
RAG: |
|
961 |
\[ |
|
962 |
@{thm (lhs) children_def} @{text "\<equiv>"} @{thm (rhs) children_def} |
|
963 |
\] |
|
964 |
where the definition of @{term "child"} is: |
|
965 |
\[ @{thm (lhs) child_def} @{text "\<equiv>"} @{thm (rhs) child_def} |
|
966 |
\] |
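
The recursion expressed by `cp_rec` can be sketched in Python as follows. This is an illustration under simplifying assumptions, not the Isabelle definition: precedences are modelled as plain comparable numbers (larger means higher) rather than `Prc` values, and the RAG is represented by two hypothetical dictionaries, `waits_for` (thread to awaited resource) and `holder` (resource to holding thread).

```python
def children(waits_for, holder, th):
    """Threads that wait for some resource currently held by `th`."""
    return {t for t, cs in waits_for.items() if holder.get(cs) == th}


def cp(preced, waits_for, holder, th):
    """Current precedence of `th`: the maximum of its own precedence and
    the current precedences of its children (terminates on an acyclic RAG).
    """
    kids = children(waits_for, holder, th)
    return max([preced[th]] + [cp(preced, waits_for, holder, k) for k in kids])


# C waits for r2 held by B; B and D wait for r1 held by A.
waits_for = {"C": "r2", "B": "r1", "D": "r1"}
holder = {"r1": "A", "r2": "B"}
preced = {"A": 1, "B": 2, "C": 3, "D": 0}
```

In this example `A` inherits the precedence of `C` through `B`, although `C` is not a child of `A`; only the children `B` and `D` are inspected directly when computing the value for `A`.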
|
967 |
||
968 |
The aim of this section is to fill in the missing details of how the current precedence should |
|
969 |
be updated as events occur, with each event type treated in its own subsection, |
|
970 |
where the computation of @{term "cp"} uses lemma @{text "cp_rec"}. |
|
971 |
*} |
|
972 |
||
973 |
subsection {* Event @{text "Set th prio"} *} |
|
974 |
||
975 |
(*<*) |
|
976 |
context step_set_cps |
|
977 |
begin |
|
978 |
(*>*) |
|
979 |
||
980 |
text {* |
|
981 |
The context under which event @{text "Set th prio"} happens is formalized as follows: |
|
982 |
\begin{enumerate} |
|
983 |
\item The formation of @{term "s"} (@{text "s_def"}): @{thm s_def}. |
|
984 |
\item State @{term "s"} is a valid state (@{text "vt_s"}): @{thm vt_s}. This implies |
|
985 |
event @{text "Set th prio"} is eligible to happen under state @{term "s'"} and |
|
986 |
state @{term "s'"} is a valid state. |
|
987 |
\end{enumerate} |
|
264 | 988 |
*} |
989 |
||
266 | 990 |
text {* \noindent |
991 |
Under such a context, we investigated how the current precedence @{term "cp"} of |
|
992 |
threads changes from state @{term "s'"} to @{term "s"} and obtained the following |
|
993 |
conclusions: |
|
994 |
\begin{enumerate} |
|
995 |
%% \item The RAG does not change (@{text "eq_dep"}): @{thm "eq_dep"}. |
|
996 |
\item All threads with no dependence relation with thread @{term "th"} have their |
|
997 |
@{term "cp"}-value unchanged (@{text "eq_cp"}): |
|
998 |
@{thm [display] eq_cp} |
|
999 |
This lemma implies the @{term "cp"}-value of @{term "th"} |
|
1000 |
and those threads which have a dependence relation with @{term "th"} might need |
|
1001 |
to be recomputed. The way to do this is to start from @{term "th"} |
|
1002 |
and follow the @{term "depend"}-chain to recompute the @{term "cp"}-value of every |
|
1003 |
encountered thread using lemma @{text "cp_rec"}. |
|
1004 |
Since the @{term "depend"}-relation is loop free, this procedure |
|
1005 |
always terminates. The following lemmas show that this procedure can actually stop earlier. |
|
1006 |
\item The following two lemmas show that, once the re-computation for a thread |
|
1007 |
gives an unchanged @{term "cp"}-value, the procedure described above can stop. |
|
1008 |
\begin{enumerate} |
|
1009 |
\item Lemma @{text "eq_up_self"} shows if the re-computation of |
|
1010 |
@{term "th"}'s @{term "cp"} gives the same result, the procedure can stop: |
|
1011 |
@{thm [display] eq_up_self} |
|
1012 |
\item Lemma @{text "eq_up"} shows if the re-computation at intermediate threads |
|
1013 |
gives an unchanged result, the procedure can stop: |
|
1014 |
@{thm [display] eq_up} |
|
1015 |
\end{enumerate} |
|
1016 |
\end{enumerate} |
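
The re-computation procedure described above, including the early stop, can be sketched in Python. This is a minimal illustration and not derived from the Isabelle sources: precedences are modelled as plain numbers, `cp` is a hypothetical cache of current precedences that was consistent before the event, and `waits_for` and `holder` are hypothetical dictionaries representing the RAG. Since a thread waits for at most one resource and a resource has at most one holder, the chain to follow is linear.

```python
def children(waits_for, holder, th):
    """Threads waiting for a resource that `th` currently holds."""
    return [t for t, cs in waits_for.items() if holder.get(cs) == th]


def recompute_from(cp, preced, waits_for, holder, th):
    """Update the cached `cp` in place after the precedence of `th` changed.

    Returns the threads whose cached value was actually rewritten.
    """
    updated = []
    current = th
    while current is not None:
        kids = children(waits_for, holder, current)
        new_cp = max([preced[current]] + [cp[k] for k in kids])
        if new_cp == cp[current]:
            break                      # unchanged value: stop early
        cp[current] = new_cp
        updated.append(current)
        # Move to the thread holding the resource `current` waits for, if any.
        current = holder.get(waits_for.get(current))
    return updated
```

For instance, with a chain `C`, `B`, `A`, `Z` in which `A` already inherits a higher precedence from another waiting thread, raising the priority of `C` slightly rewrites only the entries of `C` and `B`; the walk stops at `A` and never visits `Z`.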
|
1017 |
*} |
|
1018 |
||
1019 |
(*<*) |
|
1020 |
end |
|
1021 |
(*>*) |
|
264 | 1022 |
|
272 | 1023 |
subsection {* Event @{text "V th cs"} *} |
1024 |
||
1025 |
(*<*) |
|
1026 |
context step_v_cps_nt |
|
1027 |
begin |
|
1028 |
(*>*) |
|
1029 |
||
1030 |
text {* |
|
1031 |
The context under which event @{text "V th cs"} happens is formalized as follows: |
|
1032 |
\begin{enumerate} |
|
1033 |
\item The formation of @{term "s"} (@{text "s_def"}): @{thm s_def}. |
|
1034 |
\item State @{term "s"} is a valid state (@{text "vt_s"}): @{thm vt_s}. This implies |
|
1035 |
event @{text "V th cs"} is eligible to happen under state @{term "s'"} and |
|
1036 |
state @{term "s'"} is a valid state. |
|
1037 |
\end{enumerate} |
|
1038 |
*} |
|
1039 |
||
1040 |
text {* \noindent |
|
1041 |
Under such a context, we investigated how the current precedence @{term "cp"} of |
|
1042 |
threads changes from state @{term "s'"} to @{term "s"}. |
|
1043 |
||
1044 |
||
1045 |
Two sub-cases are considered, |
|
1046 |
where the first is that there exists @{term "th'"} |
|
1047 |
such that |
|
1048 |
@{thm [display] nt} |
|
1049 |
holds, which means there exists a thread @{term "th'"} to take over |
|
1050 |
the resource released by thread @{term "th"}. |
|
1051 |
In this sub-case, the following results are obtained: |
|
1052 |
\begin{enumerate} |
|
1053 |
\item The change of RAG is given by lemma @{text "depend_s"}: |
|
1054 |
@{thm [display] "depend_s"} |
|
1055 |
which shows two edges are removed while one is added. These changes imply how |
|
1056 |
the current precedences should be re-computed. |
|
1057 |
\item First all threads different from @{term "th"} and @{term "th'"} have their |
|
1058 |
@{term "cp"}-value kept, therefore do not need a re-computation |
|
1059 |
(@{text "cp_kept"}): @{thm [display] cp_kept} |
|
1060 |
This lemma also implies that only the @{term "cp"}-values of @{term "th"} and @{term "th'"} |
|
1061 |
need to be recomputed. |
|
1062 |
\end{enumerate} |
|
1063 |
*} |
|
1064 |
||
1065 |
(*<*) |
|
1066 |
end |
|
1067 |
||
1068 |
context step_v_cps_nnt |
|
1069 |
begin |
|
1070 |
(*>*) |
|
1071 |
||
1072 |
text {* |
|
1073 |
The other sub-case is when for all @{text "th'"} |
|
1074 |
@{thm [display] nnt} |
|
1075 |
holds, that is, no such thread exists. The following results can be obtained for this |
|
1076 |
sub-case: |
|
1077 |
\begin{enumerate} |
|
1078 |
\item The change of RAG is given by lemma @{text "depend_s"}: |
|
1079 |
@{thm [display] depend_s} |
|
1080 |
which means only one edge is removed. |
|
1081 |
\item In this case, no re-computation is needed (@{text "eq_cp"}): |
|
1082 |
@{thm [display] eq_cp} |
|
1083 |
\end{enumerate} |
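
The change of the RAG in the two sub-cases of the `V`-event can be sketched as a set operation. The Python fragment below is an illustration under assumptions: the RAG is a set of edges between tagged nodes `("Th", name)` and `("Cs", name)`, and the thread taking over, if any, is passed in as the hypothetical parameter `next_th`, because the model does not prescribe how it is chosen.

```python
def rag_after_v(rag, th, cs, next_th=None):
    """Return the RAG after thread `th` releases resource `cs`.

    rag:     set of edges (source, target), nodes tagged ("Th", x) or ("Cs", x)
    next_th: thread taking over `cs`, or None if no thread is waiting for it
    """
    new_rag = set(rag)
    new_rag.discard((("Cs", cs), ("Th", th)))           # th no longer holds cs
    if next_th is not None:
        new_rag.discard((("Th", next_th), ("Cs", cs)))  # next_th stops waiting
        new_rag.add((("Cs", cs), ("Th", next_th)))      # and now holds cs
    return new_rag


before = {
    (("Cs", "r"), ("Th", "A")),   # A holds r
    (("Th", "B"), ("Cs", "r")),   # B waits for r
    (("Th", "C"), ("Cs", "r")),   # C waits for r
}
after = rag_after_v(before, "A", "r", next_th="B")
```

With a successor, two edges are removed and one is added; without a successor, only the holding edge disappears, which matches the two readings of `depend_s` given above.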
|
1084 |
*} |
|
1085 |
||
1086 |
(*<*) |
|
1087 |
end |
|
1088 |
(*>*) |
|
1089 |
||
1090 |
||
1091 |
subsection {* Event @{text "P th cs"} *} |
|
1092 |
||
1093 |
(*<*) |
|
1094 |
context step_P_cps_e |
|
1095 |
begin |
|
1096 |
(*>*) |
|
1097 |
||
1098 |
text {* |
|
1099 |
The context under which event @{text "P th cs"} happens is formalized as follows: |
|
1100 |
\begin{enumerate} |
|
1101 |
\item The formation of @{term "s"} (@{text "s_def"}): @{thm s_def}. |
|
1102 |
\item State @{term "s"} is a valid state (@{text "vt_s"}): @{thm vt_s}. This implies |
|
1103 |
event @{text "P th cs"} is eligible to happen under state @{term "s'"} and |
|
1104 |
state @{term "s'"} is a valid state. |
|
1105 |
\end{enumerate} |
|
1106 |
||
1107 |
This case is further divided into two sub-cases. The first is when @{thm ee} holds. |
|
1108 |
The following results can be obtained: |
|
1109 |
\begin{enumerate} |
|
1110 |
\item One edge is added to the RAG (@{text "depend_s"}): |
|
1111 |
@{thm [display] depend_s} |
|
1112 |
\item No re-computation is needed (@{text "eq_cp"}): |
|
1113 |
@{thm [display] eq_cp} |
|
1114 |
\end{enumerate} |
|
1115 |
*} |
|
1116 |
||
1117 |
(*<*) |
|
1118 |
end |
|
1119 |
||
1120 |
context step_P_cps_ne |
|
1121 |
begin |
|
1122 |
(*>*) |
|
1123 |
||
1124 |
text {* |
|
1125 |
The second is when @{thm ne} holds. |
|
1126 |
The following results can be obtained: |
|
1127 |
\begin{enumerate} |
|
1128 |
\item One edge is added to the RAG (@{text "depend_s"}): |
|
1129 |
@{thm [display] depend_s} |
|
1130 |
\item Threads with no dependence relation with @{term "th"} do not need a re-computation |
|
1131 |
of their @{term "cp"}-values (@{text "eq_cp"}): |
|
1132 |
@{thm [display] eq_cp} |
|
1133 |
This lemma implies all threads with a dependence relation with @{term "th"} may need |
|
1134 |
re-computation. |
|
1135 |
\item Similar to the case of @{term "Set"}, the computation procedure could stop earlier |
|
1136 |
(@{text "eq_up"}): |
|
1137 |
@{thm [display] eq_up} |
|
1138 |
\end{enumerate} |
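
The edge added by a `P`-event can be sketched in the same style. The Python fragment below is an illustration only; it assumes that the first sub-case corresponds to the requested resource being free and the second to the resource already being held, and it represents the RAG as a set of edges between tagged nodes `("Th", name)` and `("Cs", name)`. The function name `rag_after_p` is hypothetical.

```python
def rag_after_p(rag, th, cs):
    """Return the RAG after thread `th` requests resource `cs`.

    If `cs` is free, `th` becomes its holder (edge from cs to th);
    otherwise `th` starts waiting for it (edge from th to cs).
    """
    is_held = any(source == ("Cs", cs) for source, _ in rag)
    if is_held:
        edge = (("Th", th), ("Cs", cs))   # waiting edge
    else:
        edge = (("Cs", cs), ("Th", th))   # holding edge
    return set(rag) | {edge}


rag = rag_after_p(set(), "A", "r")   # r is free, so A holds it
rag = rag_after_p(rag, "B", "r")     # r is held, so B waits for it
```

Only the waiting edge makes `th` depend on another thread, which is why, in this reading, a re-computation of `cp`-values is needed only in the second sub-case.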
|
1139 |
||
1140 |
*} |
|
1141 |
||
1142 |
(*<*) |
|
1143 |
end |
|
1144 |
(*>*) |
|
1145 |
||
1146 |
subsection {* Event @{text "Create th prio"} *} |
|
1147 |
||
1148 |
(*<*) |
|
1149 |
context step_create_cps |
|
1150 |
begin |
|
1151 |
(*>*) |
|
1152 |
||
1153 |
text {* |
|
1154 |
The context under which event @{text "Create th prio"} happens is formalized as follows: |
|
1155 |
\begin{enumerate} |
|
1156 |
\item The formation of @{term "s"} (@{text "s_def"}): @{thm s_def}. |
|
1157 |
\item State @{term "s"} is a valid state (@{text "vt_s"}): @{thm vt_s}. This implies |
|
1158 |
event @{text "Create th prio"} is eligible to happen under state @{term "s'"} and |
|
1159 |
state @{term "s'"} is a valid state. |
|
1160 |
\end{enumerate} |
|
1161 |
The following results can be obtained under this context: |
|
1162 |
\begin{enumerate} |
|
1163 |
\item The RAG does not change (@{text "eq_dep"}): |
|
1164 |
@{thm [display] eq_dep} |
|
1165 |
\item All threads other than @{term "th"} do not need re-computation (@{text "eq_cp"}): |
|
1166 |
@{thm [display] eq_cp} |
|
1167 |
\item The @{term "cp"}-value of @{term "th"} equals its precedence |
|
1168 |
(@{text "eq_cp_th"}): |
|
1169 |
@{thm [display] eq_cp_th} |
|
1170 |
\end{enumerate} |
|
1171 |
||
1172 |
*} |
|
1173 |
||
1174 |
||
1175 |
(*<*) |
|
1176 |
end |
|
1177 |
(*>*) |
|
1178 |
||
1179 |
subsection {* Event @{text "Exit th"} *} |
|
1180 |
||
1181 |
(*<*) |
|
1182 |
context step_exit_cps |
|
1183 |
begin |
|
1184 |
(*>*) |
|
1185 |
||
1186 |
text {* |
|
1187 |
The context under which event @{text "Exit th"} happens is formalized as follows: |
|
1188 |
\begin{enumerate} |
|
1189 |
\item The formation of @{term "s"} (@{text "s_def"}): @{thm s_def}. |
|
1190 |
\item State @{term "s"} is a valid state (@{text "vt_s"}): @{thm vt_s}. This implies |
|
1191 |
event @{text "Exit th"} is eligible to happen under state @{term "s'"} and |
|
1192 |
state @{term "s'"} is a valid state. |
|
1193 |
\end{enumerate} |
|
1194 |
The following results can be obtained under this context: |
|
1195 |
\begin{enumerate} |
|
1196 |
\item The RAG does not change (@{text "eq_dep"}): |
|
1197 |
@{thm [display] eq_dep} |
|
1198 |
\item All threads other than @{term "th"} do not need re-computation (@{text "eq_cp"}): |
|
1199 |
@{thm [display] eq_cp} |
|
1200 |
\end{enumerate} |
|
1201 |
Since @{term th} does not live in state @{term "s"}, there is no need to compute |
|
1202 |
its @{term cp}-value. |
|
1203 |
*} |
|
1204 |
||
1205 |
(*<*) |
|
1206 |
end |
|
1207 |
(*>*) |
|
1208 |
||
1209 |
||
262 | 1210 |
section {* Related work \label{related} *} |
1211 |
||
1212 |
text {* |
|
1213 |
\begin{enumerate} |
|
1214 |
\item {\em Integrating Priority Inheritance Algorithms in the Real-Time Specification for Java} |
|
1215 |
\cite{WellingsBSB07} models and verifies the combination of Priority Inheritance (PI) and |
|
1216 |
Priority Ceiling Emulation (PCE) protocols in the setting of the Java virtual machine |
|
1217 |
using the extended Timed Automata (TA) formalism of the UPPAAL tool. Although a detailed |
|
1218 |
formal model of combined PI and PCE is given, the number of properties is quite |
|
1219 |
small and the focus is put on the harmonious working of PI and PCE. Most key features of PI |
|
1220 |
(as well as PCE) are not shown. Because of the limitation of the model checking technique |
|
1221 |
used there, properties are shown only for a small number of scenarios. Therefore, |
|
1222 |
the verification does not show the correctness of the formal model itself in a |
|
1223 |
convincing way. |
|
1224 |
\item {\em Formal Development of Solutions for Real-Time Operating Systems with TLA+/TLC} |
|
1225 |
\cite{Faria08}. A formal model of PI is given in TLA+. Only 3 properties are shown |
|
1226 |
for PI using model checking. The limitation of model checking is intrinsic to the work. |
|
1227 |
\item {\em Synchronous modeling and validation of priority inheritance schedulers} |
|
1228 |
\cite{conf/fase/JahierHR09}. Gives a formal model |
|
1229 |
of PI and PCE in AADL (Architecture Analysis \& Design Language) and checks |
|
1230 |
several properties using model checking. The number of properties shown there is |
|
1231 |
smaller than here, and the scale is also limited by the model checking technique. |
|
1232 |
\item {\em The Priority Ceiling Protocol: Formalization and Analysis Using PVS} |
|
1233 |
\cite{dutertre99b}. Formalizes another protocol addressing Priority Inversion in the |
|
1234 |
interactive theorem proving system PVS. |
|
1235 |
\end{enumerate} |
|
1236 |
||
1237 |
||
1238 |
There are several works on inversion avoidance: |
|
1239 |
\begin{enumerate} |
|
1240 |
\item {\em Solving the group priority inversion problem in a timed asynchronous system} |
|
1241 |
\cite{Wang:2002:SGP}. The notion of Group Priority Inversion is introduced. The main |
|
1242 |
strategy is still inversion avoidance. The method is by reordering requests |
|
1243 |
in the setting of Client-Server. |
|
1244 |
\item {\em A Formalization of Priority Inversion} \cite{journals/rts/BabaogluMS93}. |
|
1245 |
Formalizes the notion of Priority |
|
1246 |
Inversion and proposes methods to avoid it. |
|
1247 |
\end{enumerate} |
|
1248 |
||
1249 |
{\em Examples of inaccurate specification of the protocol ???}. |
|
1250 |
||
1251 |
*} |
|
1252 |
||
1253 |
section {* Conclusions \label{conclusion} *} |
|
1254 |
||
286 | 1255 |
text {* |
1256 |
The work in this paper only deals with single-CPU configurations. The |
|
1257 |
"one CPU" assumption is essential for our formalisation, because the |
|
1258 |
main lemma fails in a multi-CPU configuration. The lemma says that any |
|
1259 |
running thread must be the one with the highest priority or already held |
|
1260 |
some resource when the highest priority thread was initiated. When |
|
1261 |
there are multiple CPUs, it may well be the case that a thread did |
|
1262 |
not hold any resource when the highest priority thread was initiated, |
|
1263 |
but that thread still runs after that moment on a separate CPU. In |
|
1264 |
this way, the main lemma does not hold anymore. |
|
1265 |
||
1266 |
||
1267 |
There are some works that deal with priority inversion in multi-CPU |
|
1268 |
configurations [???], but none of them has given a formal correctness |
|
1269 |
proof. The extension of our formal proof to deal with multi-CPU |
|
1270 |
configurations is not obvious. One possibility, as suggested in paper |
|
1271 |
[???], is to change our formal model (the definition of "schs") to give |
|
1272 |
the released resource to the thread with the highest priority. In this |
|
1273 |
way, indefinite priority inversion can be avoided, but for a quite |
|
1274 |
different reason from the one formalized in this paper (because the |

1275 |
"main lemma" will be different). This means a formal correctness proof |
|
1276 |
for a multi-CPU configuration would be quite different from the one given |
|
1277 |
in this paper. The solution of the priority inversion problem in multi-CPU |
|
1278 |
configurations is a different problem which needs different solutions |
|
1279 |
and is outside the scope of this paper. |
|
1280 |
||
1281 |
*} |
|
1282 |
||
262 | 1283 |
(*<*) |
1284 |
end |
|
1285 |
(*>*) |