Paper/Paper.thy
changeset 2 a04084de4946
child 17 105715a0a807
equal deleted inserted replaced
1:c4783e4ef43f 2:a04084de4946
       
     1 (*<*)
       
     2 theory Paper
       
     3 imports "../CpsG" "../ExtGG" "~~/src/HOL/Library/LaTeXsugar"
       
     4 begin
       
     5 
       
     6 (*
       
     7 find_unused_assms CpsG 
       
     8 find_unused_assms ExtGG 
       
     9 find_unused_assms Moment 
       
    10 find_unused_assms Precedence_ord 
       
    11 find_unused_assms PrioG 
       
    12 find_unused_assms PrioGDef
       
    13 *)
       
    14 
       
    15 ML {*
       
    16   open Printer;
       
    17   show_question_marks_default := false;
       
    18   *}
       
    19 
       
    20 notation (latex output)
       
    21   Cons ("_::_" [78,77] 73) and
       
    22   vt ("valid'_state") and
       
    23   runing ("running") and
       
    24   birthtime ("last'_set") and
       
    25   If  ("(\<^raw:\textrm{>if\<^raw:}> (_)/ \<^raw:\textrm{>then\<^raw:}> (_)/ \<^raw:\textrm{>else\<^raw:}> (_))" 10) and
       
    26   Prc ("'(_, _')") and
       
    27   holding ("holds") and
       
    28   waiting ("waits") and
       
    29   Th ("T") and
       
    30   Cs ("C") and
       
    31   readys ("ready") and
       
    32   depend ("RAG") and 
       
    33   preced ("prec") and
       
    34   cpreced ("cprec") and
       
    35   dependents ("dependants") and
       
    36   cp ("cprec") and
       
    37   holdents ("resources") and
       
    38   original_priority ("priority") and
       
    39   DUMMY  ("\<^raw:\mbox{$\_\!\_$}>")
       
    40 
       
    41 (*abbreviation
       
    42  "detached s th \<equiv> cntP s th = cntV s th"
       
    43 *)
       
    44 (*>*)
       
    45 
       
    46 section {* Introduction *}
       
    47 
       
    48 text {*
       
    49   Many real-time systems need to support threads involving priorities and
       
    50   locking of resources. Locking of resources ensures mutual exclusion
       
    51   when accessing shared data or devices that cannot be
       
    52   preempted. Priorities allow scheduling of threads that need to
       
    53   finish their work within deadlines.  Unfortunately, both features
       
    54   can interact in subtle ways leading to a problem, called
       
    55   \emph{Priority Inversion}. Suppose three threads having priorities
       
    56   $H$(igh), $M$(edium) and $L$(ow). We would expect that the thread
       
    57   $H$ blocks any other thread with lower priority and the thread itself cannot
       
    58   be blocked indefinitely by threads with lower priority. Alas, in a naive
       
    59   implementation of resource locking and priorities this property can
       
    60   be violated. For this let $L$ be in the
       
    61   possession of a lock for a resource that $H$ also needs. $H$ must
       
    62   therefore wait for $L$ to exit the critical section and release this
       
    63   lock. The problem is that $L$ might in turn be blocked by any
       
    64   thread with priority $M$, and so $H$ sits there potentially waiting
       
    65   indefinitely. Since $H$ is blocked by threads with lower
       
    66   priorities, the problem is called Priority Inversion. It was first
       
    67   described in \cite{Lampson80} in the context of the
       
    68   Mesa programming language designed for concurrent programming.
       
    69 
       
    70   If the problem of Priority Inversion is ignored, real-time systems
       
    71   can become unpredictable and resulting bugs can be hard to diagnose.
       
    72   The classic example where this happened is the software that
       
    73   controlled the Mars Pathfinder mission in 1997 \cite{Reeves98}.
       
    74   Once the spacecraft landed, the software shut down at irregular
       
    75   intervals leading to loss of project time as normal operation of the
       
    76   craft could only resume the next day (the mission and data already
       
    77   collected were fortunately not lost, because of a clever system
       
    78   design).  The reason for the shutdowns was that the scheduling
       
    79   software fell victim to Priority Inversion: a low priority thread
       
    80   locking a resource prevented a high priority thread from running in
       
    81   time, leading to a system reset. Once the problem was found, it was
       
    82   rectified by enabling the \emph{Priority Inheritance Protocol} (PIP)
       
    83   \cite{Sha90}\footnote{Sha et al.~call it the \emph{Basic Priority
       
    84   Inheritance Protocol} \cite{Sha90} and others sometimes also call it
       
    85   \emph{Priority Boosting} or \emph{Priority Donation}.} in the scheduling software.
       
    86 
       
    87   The idea behind PIP is to let the thread $L$ temporarily inherit
       
    88   the high priority from $H$ until $L$ leaves the critical section
       
    89   unlocking the resource. This solves the problem of $H$ having to
       
    90   wait indefinitely, because $L$ cannot be blocked by threads having
       
    91   priority $M$. While a few other solutions exist for the Priority
       
    92   Inversion problem, PIP is one that is widely deployed and
       
    93   implemented. This includes VxWorks (a proprietary real-time OS used
       
    94   in the Mars Pathfinder mission, in Boeing's 787 Dreamliner, Honda's
       
    95   ASIMO robot, etc.), but also the POSIX 1003.1c Standard realised for
       
    96   example in libraries for FreeBSD, Solaris and Linux. 
       
    97 
       
    98   One advantage of PIP is that increasing the priority of a thread
       
    99   can be dynamically calculated by the scheduler. This is in contrast
       
   100   to, for example, \emph{Priority Ceiling} \cite{Sha90}, another
       
   101   solution to the Priority Inversion problem, which requires static
       
   102   analysis of the program in order to prevent Priority
       
   103   Inversion. However, there has also been strong criticism against
       
   104   PIP. For instance, PIP cannot prevent deadlocks when lock
       
   105   dependencies are circular, and also blocking times can be
       
   106   substantial (more than just the duration of a critical section).
       
   107   Though, most criticism against PIP centres around unreliable
       
   108   implementations and PIP being too complicated and too inefficient.
       
   109   For example, Yodaiken writes in \cite{Yodaiken02}:
       
   110 
       
   111   \begin{quote}
       
   112   \it{}``Priority inheritance is neither efficient nor reliable. Implementations
       
   113   are either incomplete (and unreliable) or surprisingly complex and intrusive.''
       
   114   \end{quote}
       
   115 
       
   116   \noindent
       
   117   He suggests avoiding PIP altogether by designing the system so that no 
       
   118   priority inversion may happen in the first place. However, such ideal designs may 
       
   119   not always be achievable in practice.
       
   120 
       
   121   In our opinion, there is clearly a need for investigating correct
       
   122   algorithms for PIP. A few specifications for PIP exist (in English)
       
   123   and also a few high-level descriptions of implementations (e.g.~in
       
   124   the textbook \cite[Section 5.6.5]{Vahalia96}), but they help little
       
   125   with actual implementations. That this is a problem in practice is
       
   126   proved by an email by Baker, who wrote on 13 July 2009 on the Linux
       
   127   Kernel mailing list:
       
   128 
       
   129   \begin{quote}
       
   130   \it{}``I observed in the kernel code (to my disgust), the Linux PIP
       
   131   implementation is a nightmare: extremely heavy weight, involving
       
   132   maintenance of a full wait-for graph, and requiring updates for a
       
   133   range of events, including priority changes and interruptions of
       
   134   wait operations.''
       
   135   \end{quote}
       
   136 
       
   137   \noindent
       
   138   The criticism by Yodaiken, Baker and others suggests another look
       
   139   at PIP from a more abstract level (but still concrete enough
       
   140   to inform an implementation), and makes PIP a good candidate for a
       
   141   formal verification. An additional reason is that the original
       
   142   presentation of PIP~\cite{Sha90}, despite being informally
       
   143   ``proved'' correct, is actually \emph{flawed}. 
       
   144 
       
   145   Yodaiken \cite{Yodaiken02} points to a subtlety that had been
       
   146   overlooked in the informal proof by Sha et al. They specify in
       
   147   \cite{Sha90} that after the thread (whose priority has been raised)
       
   148   completes its critical section and releases the lock, it ``returns
       
   149   to its original priority level.'' This leads them to believe that an
       
   150   implementation of PIP is ``rather straightforward''~\cite{Sha90}.
       
   151   Unfortunately, as Yodaiken points out, this behaviour is too
       
   152   simplistic.  Consider the case where the low priority thread $L$
       
   153   locks \emph{two} resources, and two high-priority threads $H$ and
       
   154   $H'$ each wait for one of them.  If $L$ releases one resource
       
   155   so that $H$, say, can proceed, then we still have Priority Inversion
       
   156   with $H'$ (which waits for the other resource). The correct
       
   157   behaviour for $L$ is to switch to the highest remaining priority of
       
   158   the threads that it blocks. The advantage of formalising the
       
   159   correctness of a high-level specification of PIP in a theorem prover
       
   160   is that such issues clearly show up and cannot be overlooked as in
       
   161   informal reasoning (since we have to analyse all possible behaviours
       
   162   of threads, i.e.~\emph{traces}, that could possibly happen).\medskip
       
   163 
       
   164   \noindent
       
   165   {\bf Contributions:} There have been earlier formal investigations
       
   166   into PIP \cite{Faria08,Jahier09,Wellings07}, but they employ model
       
   167   checking techniques. This paper presents a formalised and
       
   168   mechanically checked proof for the correctness of PIP (to our
       
   169   knowledge the first one).  In contrast to model checking, our
       
   170   formalisation provides insight into why PIP is correct and allows us
       
   171   to prove stronger properties that, as we will show, can help with an
       
   172   efficient implementation of PIP in the educational PINTOS operating
       
   173   system \cite{PINTOS}.  For example, we found by ``playing'' with the
       
   174   formalisation that the choice of the next thread to take over a lock
       
   175   when a resource is released is irrelevant for PIP being correct---a
       
   176   fact that has not been mentioned in the literature and not been used
       
   177   in the reference implementation of PIP in PINTOS.  This fact, however, is important
       
   178   for an efficient implementation of PIP, because we can give the lock
       
   179   to the thread with the highest priority so that it terminates more
       
   180   quickly.
       
   181 *}
       
   182 
       
   183 section {* Formal Model of the Priority Inheritance Protocol *}
       
   184 
       
   185 text {*
       
   186   The Priority Inheritance Protocol, short PIP, is a scheduling
       
   187   algorithm for a single-processor system.\footnote{We shall come back
       
   188   later to the case of PIP on multi-processor systems.} 
       
   189   Following good experience in earlier work \cite{Wang09},  
       
   190   our model of PIP is based on Paulson's inductive approach to protocol
       
   191   verification \cite{Paulson98}. In this approach a \emph{state} of a system is
       
   192   given by a list of events that happened so far (with new events prepended to the list). 
       
   193   \emph{Events} of PIP fall
       
   194   into five categories defined as the datatype:
       
   195 
       
   196   \begin{isabelle}\ \ \ \ \ %%%
       
   197   \mbox{\begin{tabular}{r@ {\hspace{2mm}}c@ {\hspace{2mm}}l@ {\hspace{7mm}}l}
       
   198   \isacommand{datatype} event 
       
   199   & @{text "="} & @{term "Create thread priority"}\\
       
   200   & @{text "|"} & @{term "Exit thread"} \\
       
   201   & @{text "|"} & @{term "Set thread priority"} & {\rm reset of the priority for} @{text thread}\\
       
   202   & @{text "|"} & @{term "P thread cs"} & {\rm request of resource} @{text "cs"} {\rm by} @{text "thread"}\\
       
   203   & @{text "|"} & @{term "V thread cs"} & {\rm release of resource} @{text "cs"} {\rm by} @{text "thread"}
       
   204   \end{tabular}}
       
   205   \end{isabelle}
       
   206 
       
   207   \noindent
       
   208   whereby threads, priorities and (critical) resources are represented
       
   209   as natural numbers. The event @{term Set} models the situation that
       
   210   a thread obtains a new priority given by the programmer or
       
   211   user (for example via the {\tt nice} utility under UNIX).  As in Paulson's work, we
       
   212   need to define functions that allow us to make some observations
       
   213   about states.  One, called @{term threads}, calculates the set of
       
   214   ``live'' threads that we have seen so far:
       
   215 
       
   216   \begin{isabelle}\ \ \ \ \ %%%
       
   217   \mbox{\begin{tabular}{lcl}
       
   218   @{thm (lhs) threads.simps(1)} & @{text "\<equiv>"} & 
       
   219     @{thm (rhs) threads.simps(1)}\\
       
   220   @{thm (lhs) threads.simps(2)[where thread="th"]} & @{text "\<equiv>"} & 
       
   221     @{thm (rhs) threads.simps(2)[where thread="th"]}\\
       
   222   @{thm (lhs) threads.simps(3)[where thread="th"]} & @{text "\<equiv>"} & 
       
   223     @{thm (rhs) threads.simps(3)[where thread="th"]}\\
       
   224   @{term "threads (DUMMY#s)"} & @{text "\<equiv>"} & @{term "threads s"}\\
       
   225   \end{tabular}}
       
   226   \end{isabelle}
       
   227 
       
   228   \noindent
       
   229   In this definition @{term "DUMMY # DUMMY"} stands for list-cons.
       
   230   Another function calculates the priority for a thread @{text "th"}, which is 
       
   231   defined as
       
   232 
       
   233   \begin{isabelle}\ \ \ \ \ %%%
       
   234   \mbox{\begin{tabular}{lcl}
       
   235   @{thm (lhs) original_priority.simps(1)[where thread="th"]} & @{text "\<equiv>"} & 
       
   236     @{thm (rhs) original_priority.simps(1)[where thread="th"]}\\
       
   237   @{thm (lhs) original_priority.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} & 
       
   238     @{thm (rhs) original_priority.simps(2)[where thread="th" and thread'="th'"]}\\
       
   239   @{thm (lhs) original_priority.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} & 
       
   240     @{thm (rhs) original_priority.simps(3)[where thread="th" and thread'="th'"]}\\
       
   241   @{term "original_priority th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "original_priority th s"}\\
       
   242   \end{tabular}}
       
   243   \end{isabelle}
       
   244 
       
   245   \noindent
       
   246   In this definition we set @{text 0} as the default priority for
       
   247   threads that have not (yet) been created. The last function we need 
       
   248   calculates the ``time'', or index, at which time a process had its 
       
   249   priority last set.
       
   250 
       
   251   \begin{isabelle}\ \ \ \ \ %%%
       
   252   \mbox{\begin{tabular}{lcl}
       
   253   @{thm (lhs) birthtime.simps(1)[where thread="th"]} & @{text "\<equiv>"} & 
       
   254     @{thm (rhs) birthtime.simps(1)[where thread="th"]}\\
       
   255   @{thm (lhs) birthtime.simps(2)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} & 
       
   256     @{thm (rhs) birthtime.simps(2)[where thread="th" and thread'="th'"]}\\
       
   257   @{thm (lhs) birthtime.simps(3)[where thread="th" and thread'="th'"]} & @{text "\<equiv>"} & 
       
   258     @{thm (rhs) birthtime.simps(3)[where thread="th" and thread'="th'"]}\\
       
   259   @{term "birthtime th (DUMMY#s)"} & @{text "\<equiv>"} & @{term "birthtime th s"}\\
       
   260   \end{tabular}}
       
   261   \end{isabelle}
       
   262 
       
   263   \noindent
       
   264   In this definition @{term "length s"} stands for the length of the list
       
   265   of events @{text s}. Again the default value in this function is @{text 0}
       
   266   for threads that have not been created yet. A \emph{precedence} of a thread @{text th} in a 
       
   267   state @{text s} is the pair of natural numbers defined as
       
   268   
       
   269   \begin{isabelle}\ \ \ \ \ %%%
       
   270   @{thm preced_def[where thread="th"]}
       
   271   \end{isabelle}
       
   272 
       
   273   \noindent
       
   274   The point of precedences is to schedule threads not according to priorities (because what should
       
   275   we do in case two threads have the same priority), but according to precedences. 
       
   276   Precedences allow us to always discriminate between two threads with equal priority by 
       
   277   taking into account the time when the priority was last set. We order precedences so 
       
   278   that threads with the same priority get a higher precedence if their priority has been 
       
   279   set earlier, since for such threads it is more urgent to finish their work. In an implementation
       
   280   this choice would translate to a quite natural FIFO-scheduling of processes with 
       
   281   the same priority.
       
   282 
       
   283   Next, we introduce the concept of \emph{waiting queues}. They are
       
   284   lists of threads associated with every resource. The first thread in
       
   285   this list (i.e.~the head, or short @{term hd}) is chosen to be the one 
       
   286   that is in possession of the
       
   287   ``lock'' of the corresponding resource. We model waiting queues as
       
   288   functions, below abbreviated as @{text wq}. They take a resource as
       
   289   argument and return a list of threads.  This allows us to define
       
   290   when a thread \emph{holds}, respectively \emph{waits} for, a
       
   291   resource @{text cs} given a waiting queue function @{text wq}.
       
   292 
       
   293   \begin{isabelle}\ \ \ \ \ %%%
       
   294   \begin{tabular}{@ {}l}
       
   295   @{thm cs_holding_def[where thread="th"]}\\
       
   296   @{thm cs_waiting_def[where thread="th"]}
       
   297   \end{tabular}
       
   298   \end{isabelle}
       
   299 
       
   300   \noindent
       
   301   In this definition we assume @{text "set"} converts a list into a set.
       
   302   At the beginning, that is in the state where no thread is created yet, 
       
   303   the waiting queue function will be the function that returns the
       
   304   empty list for every resource.
       
   305 
       
   306   \begin{isabelle}\ \ \ \ \ %%%
       
   307   @{abbrev all_unlocked}\hfill\numbered{allunlocked}
       
   308   \end{isabelle}
       
   309 
       
   310   \noindent
       
   311   Using @{term "holding"} and @{term waiting}, we can introduce \emph{Resource Allocation Graphs} 
       
   312   (RAG), which represent the dependencies between threads and resources.
       
   313   We represent RAGs as relations using pairs of the form
       
   314 
       
   315   \begin{isabelle}\ \ \ \ \ %%%
       
   316   @{term "(Th th, Cs cs)"} \hspace{5mm}{\rm and}\hspace{5mm}
       
   317   @{term "(Cs cs, Th th)"}
       
   318   \end{isabelle}
       
   319 
       
   320   \noindent
       
   321   where the first stands for a \emph{waiting edge} and the second for a 
       
   322   \emph{holding edge} (@{term Cs} and @{term Th} are constructors of a 
       
   323   datatype for vertices). Given a waiting queue function, a RAG is defined 
       
   324   as the union of the sets of waiting and holding edges, namely
       
   325 
       
   326   \begin{isabelle}\ \ \ \ \ %%%
       
   327   @{thm cs_depend_def}
       
   328   \end{isabelle}
       
   329 
       
   330   \noindent
       
   331   Given four threads and three resources, an instance of a RAG can be pictured 
       
   332   as follows:
       
   333 
       
   334   \begin{center}
       
   335   \newcommand{\fnt}{\fontsize{7}{8}\selectfont}
       
   336   \begin{tikzpicture}[scale=1]
       
   337   %%\draw[step=2mm] (-3,2) grid (1,-1);
       
   338 
       
   339   \node (A) at (0,0) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>0"}};
       
   340   \node (B) at (2,0) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>1"}};
       
   341   \node (C) at (4,0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>1"}};
       
   342   \node (D) at (4,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>2"}};
       
   343   \node (E) at (6,-0.7) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>2"}};
       
   344   \node (E1) at (6, 0.2) [draw, circle, very thick, inner sep=0.4mm] {@{text "cs\<^isub>3"}};
       
   345   \node (F) at (8,-0.7) [draw, rounded corners=1mm, rectangle, very thick] {@{text "th\<^isub>3"}};
       
   346 
       
   347   \draw [<-,line width=0.6mm] (A) to node [pos=0.54,sloped,above=-0.5mm] {\fnt{}holding}  (B);
       
   348   \draw [->,line width=0.6mm] (C) to node [pos=0.4,sloped,above=-0.5mm] {\fnt{}waiting}  (B);
       
   349   \draw [->,line width=0.6mm] (D) to node [pos=0.4,sloped,below=-0.5mm] {\fnt{}waiting}  (B);
       
   350   \draw [<-,line width=0.6mm] (D) to node [pos=0.54,sloped,below=-0.5mm] {\fnt{}holding}  (E);
       
   351   \draw [<-,line width=0.6mm] (D) to node [pos=0.54,sloped,above=-0.5mm] {\fnt{}holding}  (E1);
       
   352   \draw [->,line width=0.6mm] (F) to node [pos=0.45,sloped,below=-0.5mm] {\fnt{}waiting}  (E);
       
   353   \end{tikzpicture}
       
   354   \end{center}
       
   355 
       
   356   \noindent
       
   357   The use of relations for representing RAGs allows us to conveniently define
       
   358   the notion of the \emph{dependants} of a thread using the transitive closure
       
   359   operation for relations. This gives
       
   360 
       
   361   \begin{isabelle}\ \ \ \ \ %%%
       
   362   @{thm cs_dependents_def}
       
   363   \end{isabelle}
       
   364 
       
   365   \noindent
       
   366   This definition needs to account for all threads that wait for a thread to
       
   367   release a resource. This means we need to include threads that transitively
       
   368   wait for a resource being released (in the picture above this means the dependants
       
   369   of @{text "th\<^isub>0"} are @{text "th\<^isub>1"} and @{text "th\<^isub>2"}, which wait for resource @{text "cs\<^isub>1"}, 
       
   370   but also @{text "th\<^isub>3"}, 
       
   371   which cannot make any progress unless @{text "th\<^isub>2"} makes progress, which
       
   372   in turn needs to wait for @{text "th\<^isub>0"} to finish). If there is a circle of dependencies 
       
   373   in a RAG, then clearly
       
   374   we have a deadlock. Therefore when a thread requests a resource,
       
   375   we must ensure that the resulting RAG is not circular. In practice, the 
       
   376   programmer has to ensure this.
       
   377 
       
   378 
       
   379   Next we introduce the notion of the \emph{current precedence} of a thread @{text th} in a 
       
   380   state @{text s}. It is defined as
       
   381 
       
   382   \begin{isabelle}\ \ \ \ \ %%%
       
   383   @{thm cpreced_def2}\hfill\numbered{cpreced}
       
   384   \end{isabelle}
       
   385 
       
   386   \noindent
       
   387   where the dependants of @{text th} are given by the waiting queue function.
       
   388   While the precedence @{term prec} of a thread is determined statically 
       
   389   (for example when the thread is
       
   390   created), the point of the current precedence is to let the scheduler increase this
       
   391   precedence, if needed according to PIP. Therefore the current precedence of @{text th} is
       
   392   given as the maximum of the precedence @{text th} has in state @{text s} \emph{and} all 
       
   393   threads that are dependants of @{text th}. Since the notion @{term "dependants"} is
       
   394   defined as the transitive closure of all dependent threads, we deal correctly with the 
       
   395   problem in the informal algorithm by Sha et al.~\cite{Sha90} where a priority of a thread is
       
   396   lowered prematurely.
       
   397   
       
   398   The next function, called @{term schs}, defines the behaviour of the scheduler. It will be defined
       
   399   by recursion on the state (a list of events); this function returns a \emph{schedule state}, which 
       
   400   we represent as a record consisting of two
       
   401   functions:
       
   402 
       
   403   \begin{isabelle}\ \ \ \ \ %%%
       
   404   @{text "\<lparr>wq_fun, cprec_fun\<rparr>"}
       
   405   \end{isabelle}
       
   406 
       
   407   \noindent
       
   408   The first function is a waiting queue function (that is, it takes a
       
   409   resource @{text "cs"} and returns the corresponding list of threads
       
   410   that lock, respectively wait for, it); the second is a function that
       
   411   takes a thread and returns its current precedence (see
       
   412   the definition in \eqref{cpreced}). We assume the usual getter and setter methods for
       
   413   such records.
       
   414 
       
   415   In the initial state, the scheduler starts with all resources unlocked (the corresponding 
       
   416   function is defined in \eqref{allunlocked}) and the
       
   417   current precedence of every thread is initialised with @{term "Prc 0 0"}; that means 
       
   418   \mbox{@{abbrev initial_cprec}}. Therefore
       
   419   we have for the initial shedule state
       
   420 
       
   421   \begin{isabelle}\ \ \ \ \ %%%
       
   422   \begin{tabular}{@ {}l}
       
   423   @{thm (lhs) schs.simps(1)} @{text "\<equiv>"}\\ 
       
   424   \hspace{5mm}@{term "(|wq_fun = all_unlocked, cprec_fun = (\<lambda>_::thread. Prc 0 0)|)"}
       
   425   \end{tabular}
       
   426   \end{isabelle}
       
   427 
       
   428   \noindent
       
   429   The cases for @{term Create}, @{term Exit} and @{term Set} are also straightforward:
       
   430   we calculate the waiting queue function of the (previous) state @{text s}; 
       
   431   this waiting queue function @{text wq} is unchanged in the next schedule state---because
       
   432   none of these events lock or release any resource; 
       
   433   for calculating the next @{term "cprec_fun"}, we use @{text wq} and 
       
   434   @{term cpreced}. This gives the following three clauses for @{term schs}:
       
   435 
       
   436   \begin{isabelle}\ \ \ \ \ %%%
       
   437   \begin{tabular}{@ {}l}
       
   438   @{thm (lhs) schs.simps(2)} @{text "\<equiv>"}\\ 
       
   439   \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
       
   440   \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Create th prio # s)|)"}\smallskip\\
       
   441   @{thm (lhs) schs.simps(3)} @{text "\<equiv>"}\\
       
   442   \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
       
   443   \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Exit th # s)|)"}\smallskip\\
       
   444   @{thm (lhs) schs.simps(4)} @{text "\<equiv>"}\\ 
       
   445   \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
       
   446   \hspace{8mm}@{term "(|wq_fun = wq\<iota>, cprec_fun = cpreced wq\<iota> (Set th prio # s)|)"}
       
   447   \end{tabular}
       
   448   \end{isabelle}
       
   449 
       
   450   \noindent 
       
   451   More interesting are the cases where a resource, say @{text cs}, is locked or released. In these cases
       
   452   we need to calculate a new waiting queue function. For the event @{term "P th cs"}, we have to update
       
   453   the function so that the new thread list for @{text cs} is the old thread list plus the thread @{text th} 
       
   454   appended to the end of that list (remember the head of this list is assigned to be in the possession of this
       
   455   resource). This gives the clause
       
   456 
       
   457   \begin{isabelle}\ \ \ \ \ %%%
       
   458   \begin{tabular}{@ {}l}
       
   459   @{thm (lhs) schs.simps(5)} @{text "\<equiv>"}\\ 
       
   460   \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
       
   461   \hspace{5mm}@{text "let"} @{text "new_wq = wq(cs := (wq cs @ [th]))"} @{text "in"}\\
       
   462   \hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (P th cs # s)|)"}
       
   463   \end{tabular}
       
   464   \end{isabelle}
       
   465 
       
   466   \noindent
       
   467   The clause for event @{term "V th cs"} is similar, except that we need to update the waiting queue function
       
   468   so that the thread that possessed the lock is deleted from the corresponding thread list. For this 
       
   469   list transformation, we use
       
   470   the auxiliary function @{term release}. A simple version of @{term release} would
       
   471   just delete this thread and return the remaining threads, namely
       
   472 
       
   473   \begin{isabelle}\ \ \ \ \ %%%
       
   474   \begin{tabular}{@ {}lcl}
       
   475   @{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
       
   476   @{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "qs"}\\
       
   477   \end{tabular}
       
   478   \end{isabelle}
       
   479 
       
   480   \noindent
       
   481   In practice, however, often the thread with the highest precedence in the list will get the
       
   482   lock next. We have implemented this choice, but later found out that the choice 
       
   483   of which thread is chosen next is actually irrelevant for the correctness of PIP.
       
   484   Therefore we prove the stronger result where @{term release} is defined as
       
   485 
       
   486   \begin{isabelle}\ \ \ \ \ %%%
       
   487   \begin{tabular}{@ {}lcl}
       
   488   @{term "release []"} & @{text "\<equiv>"} & @{term "[]"}\\
       
   489   @{term "release (DUMMY # qs)"} & @{text "\<equiv>"} & @{term "SOME qs'. distinct qs' \<and> set qs' = set qs"}\\
       
   490   \end{tabular}
       
   491   \end{isabelle}
       
   492 
       
   493   \noindent
       
   494   where @{text "SOME"} stands for Hilbert's epsilon and implements an arbitrary
       
   495   choice for the next waiting list. It just has to be a list of distinctive threads and
       
   496   contain the same elements as @{text "qs"}. This gives for @{term V} the clause:
       
   497  
       
   498   \begin{isabelle}\ \ \ \ \ %%%
       
   499   \begin{tabular}{@ {}l}
       
   500   @{thm (lhs) schs.simps(6)} @{text "\<equiv>"}\\
       
   501   \hspace{5mm}@{text "let"} @{text "wq = wq_fun (schs s)"} @{text "in"}\\
       
   502   \hspace{5mm}@{text "let"} @{text "new_wq = release (wq cs)"} @{text "in"}\\
       
   503   \hspace{8mm}@{term "(|wq_fun = new_wq, cprec_fun = cpreced new_wq (V th cs # s)|)"}
       
   504   \end{tabular}
       
   505   \end{isabelle}
       
   506 
       
   507   Having the scheduler function @{term schs} at our disposal, we can ``lift'', or
       
   508   overload, the notions
       
   509   @{term waiting}, @{term holding}, @{term depend} and @{term cp} to operate on states only.
       
   510 
       
   511   \begin{isabelle}\ \ \ \ \ %%%
       
   512   \begin{tabular}{@ {}rcl}
       
   513   @{thm (lhs) s_holding_abv} & @{text "\<equiv>"} & @{thm (rhs) s_holding_abv}\\
       
   514   @{thm (lhs) s_waiting_abv} & @{text "\<equiv>"} & @{thm (rhs) s_waiting_abv}\\
       
   515   @{thm (lhs) s_depend_abv}  & @{text "\<equiv>"} & @{thm (rhs) s_depend_abv}\\
       
   516   @{thm (lhs) cp_def}        & @{text "\<equiv>"} & @{thm (rhs) cp_def}
       
   517   \end{tabular}
       
   518   \end{isabelle}
       
   519 
       
   520   \noindent
       
   521   With these abbreviations in place we can introduce 
       
   522   the notion of a thread being @{term ready} in a state (i.e.~threads
       
   523   that do not wait for any resource) and the running thread.
       
   524 
       
   525   \begin{isabelle}\ \ \ \ \ %%%
       
   526   \begin{tabular}{@ {}l}
       
   527   @{thm readys_def}\\
       
   528   @{thm runing_def}
       
   529   \end{tabular}
       
   530   \end{isabelle}
       
   531 
       
   532   \noindent
       
   533   In the second definition @{term "DUMMY ` DUMMY"} stands for the image of a set under a function.
       
   534   Note that in the initial state, that is where the list of events is empty, the set 
       
   535   @{term threads} is empty and therefore there is neither a thread ready nor running.
       
   536   If there is one or more threads ready, then there can only be \emph{one} thread
       
   537   running, namely the one whose current precedence is equal to the maximum of all ready 
       
   538   threads. We use sets to capture both possibilities.
       
   539   We can now also conveniently define the set of resources that are locked by a thread in a
       
   540   given state and also when a thread is detached that state (meaning the thread neither 
       
   541   holds nor waits for a resource):
       
   542 
       
   543   \begin{isabelle}\ \ \ \ \ %%%
       
   544   \begin{tabular}{@ {}l}
       
   545   @{thm holdents_def}\\
       
   546   @{thm detached_def}
       
   547   \end{tabular}
       
   548   \end{isabelle}
       
   549 
       
   550   %\noindent
       
   551   %The second definition states that @{text th}  in @{text s}.
       
   552   
       
   553   Finally we can define what a \emph{valid state} is in our model of PIP. For
       
   554   example we cannot expect to be able to exit a thread, if it was not
       
   555   created yet. 
       
   556   These validity constraints on states are characterised by the
       
   557   inductive predicate @{term "step"} and @{term vt}. We first give five inference rules
       
   558   for @{term step} relating a state and an event that can happen next.
       
   559 
       
   560   \begin{center}
       
   561   \begin{tabular}{c}
       
   562   @{thm[mode=Rule] thread_create[where thread=th]}\hspace{1cm}
       
   563   @{thm[mode=Rule] thread_exit[where thread=th]}
       
   564   \end{tabular}
       
   565   \end{center}
       
   566 
       
   567   \noindent
       
   568   The first rule states that a thread can only be created, if it is not alive yet.
       
   569   Similarly, the second rule states that a thread can only be terminated if it was
       
   570   running and does not lock any resources anymore (this simplifies slightly our model;
       
   571   in practice we would expect the operating system releases all locks held by a
       
   572   thread that is about to exit). The event @{text Set} can happen
       
   573   if the corresponding thread is running. 
       
   574 
       
   575   \begin{center}
       
   576   @{thm[mode=Rule] thread_set[where thread=th]}
       
   577   \end{center}
       
   578 
       
   579   \noindent
       
   580   If a thread wants to lock a resource, then the thread needs to be
       
   581   running and also we have to make sure that the resource lock does
       
   582   not lead to a cycle in the RAG. In practice, ensuring the latter
       
   583   is the responsibility of the programmer.  In our formal
       
   584   model we brush aside these problematic cases in order to be able to make
       
   585   some meaningful statements about PIP.\footnote{This situation is
       
   586   similar to the infamous \emph{occurs check} in Prolog: In order to say
       
   587   anything meaningful about unification, one needs to perform an occurs
       
   588   check. But in practice the occurs check is omitted and the
       
   589   responsibility for avoiding problems rests with the programmer.}
       
   590 
       
   591  
       
   592   \begin{center}
       
   593   @{thm[mode=Rule] thread_P[where thread=th]}
       
   594   \end{center}
       
   595  
       
   596   \noindent
       
   597   Similarly, if a thread wants to release a lock on a resource, then
       
   598   it must be running and in the possession of that lock. This is
       
   599   formally given by the last inference rule of @{term step}.
       
   600  
       
   601   \begin{center}
       
   602   @{thm[mode=Rule] thread_V[where thread=th]}
       
   603   \end{center}
       
   604 
       
   605   \noindent
       
   606   A valid state of PIP can then be conveniently be defined as follows:
       
   607 
       
   608   \begin{center}
       
   609   \begin{tabular}{c}
       
   610   @{thm[mode=Axiom] vt_nil}\hspace{1cm}
       
   611   @{thm[mode=Rule] vt_cons}
       
   612   \end{tabular}
       
   613   \end{center}
       
   614 
       
   615   \noindent
       
   616   This completes our formal model of PIP. In the next section we present
       
   617   properties that show our model of PIP is correct.
       
   618 *}
       
   619 
       
   620 section {* The Correctness Proof *}
       
   621 
       
   622 (*<*)
       
   623 context extend_highest_gen
       
   624 begin
       
   625 (*>*)
       
   626 text {* 
       
   627   Sha et al.~state their first correctness criterion for PIP in terms
       
   628   of the number of low-priority threads \cite[Theorem 3]{Sha90}: if
       
   629   there are @{text n} low-priority threads, then a blocked job with
       
   630   high priority can only be blocked a maximum of @{text n} times.
       
   631   Their second correctness criterion is given
       
   632   in terms of the number of critical resources \cite[Theorem 6]{Sha90}: if there are
       
   633   @{text m} critical resources, then a blocked job with high priority
       
   634   can only be blocked a maximum of @{text m} times. Both results on their own, strictly speaking, do
       
   635   \emph{not} prevent indefinite, or unbounded, Priority Inversion,
       
   636   because if a low-priority thread does not give up its critical
       
   637   resource (the one the high-priority thread is waiting for), then the
       
   638   high-priority thread can never run.  The argument of Sha et al.~is
       
   639   that \emph{if} threads release locked resources in a finite amount
       
   640   of time, then indefinite Priority Inversion cannot occur---the high-priority
       
   641   thread is guaranteed to run eventually. The assumption is that
       
   642   programmers must ensure that threads are programmed in this way.  However, even
       
   643   taking this assumption into account, the correctness properties of
       
   644   Sha et al.~are
       
   645   \emph{not} true for their version of PIP---despite being ``proved''. As Yodaiken
       
   646   \cite{Yodaiken02} pointed out: If a low-priority thread possesses
       
   647   locks to two resources for which two high-priority threads are
       
   648   waiting for, then lowering the priority prematurely after giving up
       
   649   only one lock, can cause indefinite Priority Inversion for one of the
       
   650   high-priority threads, invalidating their two bounds.
       
   651 
       
   652   Even when fixed, their proof idea does not seem to go through for
       
   653   us, because of the way we have set up our formal model of PIP.  One
       
   654   reason is that we allow critical sections, which start with a @{text P}-event
       
   655   and finish with a corresponding @{text V}-event, to arbitrarily overlap
       
   656   (something Sha et al.~explicitly exclude).  Therefore we have
       
   657   designed a different correctness criterion for PIP. The idea behind
       
   658   our criterion is as follows: for all states @{text s}, we know the
       
   659   corresponding thread @{text th} with the highest precedence; we show
       
   660   that in every future state (denoted by @{text "s' @ s"}) in which
       
   661   @{text th} is still alive, either @{text th} is running or it is
       
   662   blocked by a thread that was alive in the state @{text s} and was waiting 
       
   663   for or in the possession of a lock in @{text s}. Since in @{text s}, as in
       
   664   every state, the set of alive threads is finite, @{text th} can only
       
   665   be blocked a finite number of times. This is independent of how many
       
   666   threads of lower priority are created in @{text "s'"}. We will actually prove a
       
   667   stronger statement where we also provide the current precedence of
       
   668   the blocking thread. However, this correctness criterion hinges upon
       
   669   a number of assumptions about the states @{text s} and @{text "s' @
       
   670   s"}, the thread @{text th} and the events happening in @{text
       
   671   s'}. We list them next:
       
   672 
       
   673   \begin{quote}
       
   674   {\bf Assumptions on the states {\boldmath@{text s}} and 
       
   675   {\boldmath@{text "s' @ s"}:}} We need to require that @{text "s"} and 
       
   676   @{text "s' @ s"} are valid states:
       
   677   \begin{isabelle}\ \ \ \ \ %%%
       
   678   \begin{tabular}{l}
       
   679   @{term "vt s"}, @{term "vt (s' @ s)"} 
       
   680   \end{tabular}
       
   681   \end{isabelle}
       
   682   \end{quote}
       
   683 
       
   684   \begin{quote}
       
   685   {\bf Assumptions on the thread {\boldmath@{text "th"}:}} 
       
   686   The thread @{text th} must be alive in @{text s} and 
       
   687   has the highest precedence of all alive threads in @{text s}. Furthermore the
       
   688   priority of @{text th} is @{text prio} (we need this in the next assumptions).
       
   689   \begin{isabelle}\ \ \ \ \ %%%
       
   690   \begin{tabular}{l}
       
   691   @{term "th \<in> threads s"}\\
       
   692   @{term "prec th s = Max (cprec s ` threads s)"}\\
       
   693   @{term "prec th s = (prio, DUMMY)"}
       
   694   \end{tabular}
       
   695   \end{isabelle}
       
   696   \end{quote}
       
   697   
       
   698   \begin{quote}
       
   699   {\bf Assumptions on the events in {\boldmath@{text "s'"}:}} We want to prove that @{text th} cannot
       
   700   be blocked indefinitely. Of course this can happen if threads with higher priority
       
   701   than @{text th} are continuously created in @{text s'}. Therefore we have to assume that  
       
   702   events in @{text s'} can only create (respectively set) threads with equal or lower 
       
   703   priority than @{text prio} of @{text th}. We also need to assume that the
       
   704   priority of @{text "th"} does not get reset and also that @{text th} does
       
   705   not get ``exited'' in @{text "s'"}. This can be ensured by assuming the following three implications. 
       
   706   \begin{isabelle}\ \ \ \ \ %%%
       
   707   \begin{tabular}{l}
       
   708   {If}~~@{text "Create th' prio' \<in> set s'"}~~{then}~~@{text "prio' \<le> prio"}\\
       
   709   {If}~~@{text "Set th' prio' \<in> set s'"}~~{then}~~@{text "th' \<noteq> th"}~~{and}~~@{text "prio' \<le> prio"}\\
       
   710   {If}~~@{text "Exit th' \<in> set s'"}~~{then}~~@{text "th' \<noteq> th"}\\
       
   711   \end{tabular}
       
   712   \end{isabelle}
       
   713   \end{quote}
       
   714 
       
   715   \noindent
       
   716   The locale mechanism of Isabelle helps us to manage conveniently such assumptions~\cite{Haftmann08}.
       
   717   Under these assumptions we shall prove the following correctness property:
       
   718 
       
   719   \begin{theorem}\label{mainthm}
       
   720   Given the assumptions about states @{text "s"} and @{text "s' @ s"},
       
   721   the thread @{text th} and the events in @{text "s'"},
       
   722   if @{term "th' \<in> running (s' @ s)"} and @{text "th' \<noteq> th"} then
       
   723   @{text "th' \<in> threads s"}, @{text "\<not> detached s th'"} and 
       
   724   @{term "cp (s' @ s) th' = prec th s"}.
       
   725   \end{theorem}
       
   726 
       
   727   \noindent
       
   728   This theorem ensures that the thread @{text th}, which has the
       
   729   highest precedence in the state @{text s}, can only be blocked in
       
   730   the state @{text "s' @ s"} by a thread @{text th'} that already
       
   731   existed in @{text s} and requested or had a lock on at least 
       
   732   one resource---that means the thread was not \emph{detached} in @{text s}. 
       
   733   As we shall see shortly, that means there are only finitely 
       
   734   many threads that can block @{text th} in this way and then they 
       
   735   need to run with the same current precedence as @{text th}.
       
   736 
       
   737   Like in the argument by Sha et al.~our
       
   738   finite bound does not guarantee absence of indefinite Priority
       
   739   Inversion. For this we further have to assume that every thread
       
   740   gives up its resources after a finite amount of time. We found that
       
   741   this assumption is awkward to formalise in our model. Therefore we
       
   742   leave it out and let the programmer assume the responsibility to
       
   743   program threads in such a benign manner (in addition to causing no 
       
   744   circularity in the @{text RAG}). In this detail, we do not
       
   745   make any progress in comparison with the work by Sha et al.
       
   746   However, we are able to combine their two separate bounds into a
       
   747   single theorem improving their bound.
       
   748 
       
   749   In what follows we will describe properties of PIP that allow us to prove 
       
   750   Theorem~\ref{mainthm} and, when instructive, briefly describe our argument. 
       
   751   It is relatively easy to see that
       
   752 
       
   753   \begin{isabelle}\ \ \ \ \ %%%
       
   754   \begin{tabular}{@ {}l}
       
   755   @{text "running s \<subseteq> ready s \<subseteq> threads s"}\\
       
   756   @{thm[mode=IfThen]  finite_threads}
       
   757   \end{tabular}
       
   758   \end{isabelle}
       
   759 
       
   760   \noindent
       
   761   The second property is by induction of @{term vt}. The next three
       
   762   properties are 
       
   763 
       
   764   \begin{isabelle}\ \ \ \ \ %%%
       
   765   \begin{tabular}{@ {}l}
       
   766   @{thm[mode=IfThen] waiting_unique[of _ _ "cs\<^isub>1" "cs\<^isub>2"]}\\
       
   767   @{thm[mode=IfThen] held_unique[of _ "th\<^isub>1" _ "th\<^isub>2"]}\\
       
   768   @{thm[mode=IfThen] runing_unique[of _ "th\<^isub>1" "th\<^isub>2"]}
       
   769   \end{tabular}
       
   770   \end{isabelle}
       
   771 
       
   772   \noindent
       
   773   The first property states that every waiting thread can only wait for a single
       
   774   resource (because it gets suspended after requesting that resource); the second 
       
   775   that every resource can only be held by a single thread; 
       
   776   the third property establishes that in every given valid state, there is
       
   777   at most one running thread. We can also show the following properties 
       
   778   about the @{term RAG} in @{text "s"}.
       
   779 
       
   780   \begin{isabelle}\ \ \ \ \ %%%
       
   781   \begin{tabular}{@ {}l}
       
   782   @{text If}~@{thm (prem 1) acyclic_depend}~@{text "then"}:\\
       
   783   \hspace{5mm}@{thm (concl) acyclic_depend},
       
   784   @{thm (concl) finite_depend} and
       
   785   @{thm (concl) wf_dep_converse},\\
       
   786   \hspace{5mm}@{text "if"}~@{thm (prem 2) dm_depend_threads}~@{text "then"}~@{thm (concl) dm_depend_threads}
       
   787   and\\
       
   788   \hspace{5mm}@{text "if"}~@{thm (prem 2) range_in}~@{text "then"}~@{thm (concl) range_in}.
       
   789   \end{tabular}
       
   790   \end{isabelle}
       
   791 
       
   792   \noindent
       
   793   The acyclicity property follows from how we restricted the events in
       
   794   @{text step}; similarly the finiteness and well-foundedness property.
       
   795   The last two properties establish that every thread in a @{text "RAG"}
       
   796   (either holding or waiting for a resource) is a live thread.
       
   797 
       
   798   The key lemma in our proof of Theorem~\ref{mainthm} is as follows:
       
   799 
       
   800   \begin{lemma}\label{mainlem}
       
   801   Given the assumptions about states @{text "s"} and @{text "s' @ s"},
       
   802   the thread @{text th} and the events in @{text "s'"},
       
   803   if @{term "th' \<in> threads (s' @ s)"}, @{text "th' \<noteq> th"} and @{text "detached (s' @ s) th'"}\\
       
   804   then @{text "th' \<notin> running (s' @ s)"}.
       
   805   \end{lemma}
       
   806 
       
   807   \noindent
       
   808   The point of this lemma is that a thread different from @{text th} (which has the highest
       
   809   precedence in @{text s}) and not holding any resource, cannot be running 
       
   810   in the state @{text "s' @ s"}.
       
   811 
       
   812   \begin{proof}
       
   813   Since thread @{text "th'"} does not hold any resource, no thread can depend on it. 
       
   814   Therefore its current precedence @{term "cp (s' @ s) th'"} equals its own precedence
       
   815   @{term "prec th' (s' @ s)"}. Since @{text "th"} has the highest precedence in the 
       
   816   state @{text "(s' @ s)"} and precedences are distinct among threads, we have
       
   817   @{term "prec th' (s' @s ) < prec th (s' @ s)"}. From this 
       
   818   we have @{term "cp (s' @ s) th' < prec th (s' @ s)"}.
       
   819   Since @{text "prec th (s' @ s)"} is already the highest 
       
   820   @{term "cp (s' @ s) th"} can not be higher than this and can not be lower either (by 
       
   821   definition of @{term "cp"}). Consequently, we have @{term "prec th (s' @ s) = cp (s' @ s) th"}.
       
   822   Finally we have @{term "cp (s' @ s) th' < cp (s' @ s) th"}.
       
   823   By defintion of @{text "running"}, @{text "th'"} can not be running in state
       
   824   @{text "s' @ s"}, as we had to show.\qed
       
   825   \end{proof}
       
   826 
       
   827   \noindent
       
   828   Since @{text "th'"} is not able to run in state @{text "s' @ s"}, it is not able to 
       
   829   issue a @{text "P"} or @{text "V"} event. Therefore if @{text "s' @ s"} is extended
       
   830   one step further, @{text "th'"} still cannot hold any resource. The situation will 
       
   831   not change in further extensions as long as @{text "th"} holds the highest precedence.
       
   832 
       
   833   From this lemma we can deduce Theorem~\ref{mainthm}: that @{text th} can only be 
       
   834   blocked by a thread @{text th'} that
       
   835   held some resource in state @{text s} (that is not @{text "detached"}). And furthermore
       
   836   that the current precedence of @{text th'} in state @{text "(s' @ s)"} must be equal to the 
       
   837   precedence of @{text th} in @{text "s"}.
       
   838   We show this theorem by induction on @{text "s'"} using Lemma~\ref{mainlem}.
       
   839   This theorem gives a stricter bound on the threads that can block @{text th} than the
       
   840   one obtained by Sha et al.~\cite{Sha90}:
       
   841   only threads that were alive in state @{text s} and moreover held a resource.
       
   842   This means our bound is in terms of both---alive threads in state @{text s}
       
   843   and number of critical resources. Finally, the theorem establishes that the blocking threads have the
       
   844   current precedence raised to the precedence of @{text th}.
       
   845 
       
   846   We can furthermore prove that under our assumptions no deadlock exists in the state @{text "s' @ s"}
       
   847   by showing that @{text "running (s' @ s)"} is not empty.
       
   848 
       
   849   \begin{lemma}
       
   850   Given the assumptions about states @{text "s"} and @{text "s' @ s"},
       
   851   the thread @{text th} and the events in @{text "s'"},
       
   852   @{term "running (s' @ s) \<noteq> {}"}.
       
   853   \end{lemma}
       
   854 
       
   855   \begin{proof}
       
   856   If @{text th} is blocked, then by following its dependants graph, we can always 
       
   857   reach a ready thread @{text th'}, and that thread must have inherited the 
       
   858   precedence of @{text th}.\qed
       
   859   \end{proof}
       
   860 
       
   861 
       
   862   %The following lemmas show how every node in RAG can be chased to ready threads:
       
   863   %\begin{enumerate}
       
   864   %\item Every node in RAG can be chased to a ready thread (@{text "chain_building"}):
       
   865   %  @   {thm [display] chain_building[rule_format]}
       
   866   %\item The ready thread chased to is unique (@{text "dchain_unique"}):
       
   867   %  @   {thm [display] dchain_unique[of _ _ "th\<^isub>1" "th\<^isub>2"]}
       
   868   %\end{enumerate}
       
   869 
       
   870   %Some deeper results about the system:
       
   871   %\begin{enumerate}
       
   872   %\item The maximum of @{term "cp"} and @{term "preced"} are equal (@{text "max_cp_eq"}):
       
   873   %@  {thm [display] max_cp_eq}
       
   874   %\item There must be one ready thread having the max @{term "cp"}-value 
       
   875   %(@{text "max_cp_readys_threads"}):
       
   876   %@  {thm [display] max_cp_readys_threads}
       
   877   %\end{enumerate}
       
   878 
       
   879   %The relationship between the count of @{text "P"} and @{text "V"} and the number of 
       
   880   %critical resources held by a thread is given as follows:
       
   881   %\begin{enumerate}
       
   882   %\item The @{term "V"}-operation decreases the number of critical resources 
       
   883   %  one thread holds (@{text "cntCS_v_dec"})
       
   884   %   @  {thm [display]  cntCS_v_dec}
       
   885   %\item The number of @{text "V"} never exceeds the number of @{text "P"} 
       
   886   %  (@  {text "cnp_cnv_cncs"}):
       
   887   %  @  {thm [display]  cnp_cnv_cncs}
       
   888   %\item The number of @{text "V"} equals the number of @{text "P"} when 
       
   889   %  the relevant thread is not living:
       
   890   %  (@{text "cnp_cnv_eq"}):
       
   891   %  @  {thm [display]  cnp_cnv_eq}
       
   892   %\item When a thread is not living, it does not hold any critical resource 
       
   893   %  (@{text "not_thread_holdents"}):
       
   894   %  @  {thm [display] not_thread_holdents}
       
   895   %\item When the number of @{text "P"} equals the number of @{text "V"}, the relevant 
       
   896   %  thread does not hold any critical resource, therefore no thread can depend on it
       
   897   %  (@{text "count_eq_dependents"}):
       
   898   %  @  {thm [display] count_eq_dependents}
       
   899   %\end{enumerate}
       
   900 
       
   901   %The reason that only threads which already held some resoures
       
   902   %can be runing and block @{text "th"} is that if , otherwise, one thread 
       
   903   %does not hold any resource, it may never have its prioirty raised
       
   904   %and will not get a chance to run. This fact is supported by 
       
   905   %lemma @{text "moment_blocked"}:
       
   906   %@   {thm [display] moment_blocked}
       
   907   %When instantiating  @{text "i"} to @{text "0"}, the lemma means threads which did not hold any
       
   908   %resource in state @{text "s"} will not have a change to run latter. Rephrased, it means 
       
   909   %any thread which is running after @{text "th"} became the highest must have already held
       
   910   %some resource at state @{text "s"}.
       
   911 
       
   912 
       
   913   %When instantiating @{text "i"} to a number larger than @{text "0"}, the lemma means 
       
   914   %if a thread releases all its resources at some moment in @{text "t"}, after that, 
       
   915   %it may never get a change to run. If every thread releases its resource in finite duration,
       
   916   %then after a while, only thread @{text "th"} is left running. This shows how indefinite 
       
   917   %priority inversion can be avoided. 
       
   918 
       
   919   %All these assumptions are put into a predicate @{term "extend_highest_gen"}. 
       
   920   %It can be proved that @{term "extend_highest_gen"} holds 
       
   921   %for any moment @{text "i"} in it @{term "t"} (@{text "red_moment"}):
       
   922   %@   {thm [display] red_moment}
       
   923   
       
   924   %From this, an induction principle can be derived for @{text "t"}, so that 
       
   925   %properties already derived for @{term "t"} can be applied to any prefix 
       
   926   %of @{text "t"} in the proof of new properties 
       
   927   %about @{term "t"} (@{text "ind"}):
       
   928   %\begin{center}
       
   929   %@   {thm[display] ind}
       
   930   %\end{center}
       
   931 
       
   932   %The following properties can be proved about @{term "th"} in @{term "t"}:
       
   933   %\begin{enumerate}
       
   934   %\item In @{term "t"}, thread @{term "th"} is kept live and its 
       
   935   %  precedence is preserved as well
       
   936   %  (@{text "th_kept"}): 
       
   937   %  @   {thm [display] th_kept}
       
   938   %\item In @{term "t"}, thread @{term "th"}'s precedence is always the maximum among 
       
   939   %  all living threads
       
   940   %  (@{text "max_preced"}): 
       
   941   %  @   {thm [display] max_preced}
       
   942   %\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum precedence
       
   943   %  among all living threads
       
   944   %  (@{text "th_cp_max_preced"}): 
       
   945   %  @   {thm [display] th_cp_max_preced}
       
   946   %\item In @{term "t"}, thread @{term "th"}'s current precedence is always the maximum current 
       
   947   %  precedence among all living threads
       
   948   %  (@{text "th_cp_max"}): 
       
   949   %  @   {thm [display] th_cp_max}
       
   950   %\item In @{term "t"}, thread @{term "th"}'s current precedence equals its precedence at moment 
       
   951   %  @{term "s"}
       
   952   %  (@{text "th_cp_preced"}): 
       
   953   %  @   {thm [display] th_cp_preced}
       
   954   %\end{enumerate}
       
   955 
       
   956   %The main theorem of this part is to characterizing the running thread during @{term "t"} 
       
   957   %(@{text "runing_inversion_2"}):
       
   958   %@   {thm [display] runing_inversion_2}
       
   959   %According to this, if a thread is running, it is either @{term "th"} or was
       
   960   %already live and held some resource 
       
   961   %at moment @{text "s"} (expressed by: @{text "cntV s th' < cntP s th'"}).
       
   962 
       
   963   %Since there are only finite many threads live and holding some resource at any moment,
       
   964   %if every such thread can release all its resources in finite duration, then after finite
       
   965   %duration, none of them may block @{term "th"} anymore. So, no priority inversion may happen
       
   966   %then.
       
   967   *}
       
   968 (*<*)
       
   969 end
       
   970 (*>*)
       
   971 
       
   972 section {* Properties for an Implementation\label{implement} *}
       
   973 
       
   974 text {*
       
   975   While our formalised proof gives us confidence about the correctness of our model of PIP, 
       
   976   we found that the formalisation can even help us with efficiently implementing it.
       
   977 
       
   978   For example Baker complained that calculating the current precedence
       
   979   in PIP is quite ``heavy weight'' in Linux (see the Introduction).
       
   980   In our model of PIP the current precedence of a thread in a state @{text s}
       
   981   depends on all its dependants---a ``global'' transitive notion,
       
   982   which is indeed heavy weight (see Def.~shown in \eqref{cpreced}).
       
   983   We can however improve upon this. For this let us define the notion
       
   984   of @{term children} of a thread @{text th} in a state @{text s} as
       
   985 
       
   986   \begin{isabelle}\ \ \ \ \ %%%
       
   987   \begin{tabular}{@ {}l}
       
   988   @{thm children_def2}
       
   989   \end{tabular}
       
   990   \end{isabelle}
       
   991 
       
   992   \noindent
       
   993   where a child is a thread that is only one ``hop'' away from the thread
       
   994   @{text th} in the @{term RAG} (and waiting for @{text th} to release
       
   995   a resource). We can prove the following lemma.
       
   996 
       
   997   \begin{lemma}\label{childrenlem}
       
   998   @{text "If"} @{thm (prem 1) cp_rec} @{text "then"}
       
   999   \begin{center}
       
  1000   @{thm (concl) cp_rec}.
       
  1001   \end{center}
       
  1002   \end{lemma}
       
  1003   
       
  1004   \noindent
       
  1005   That means the current precedence of a thread @{text th} can be
       
  1006   computed locally by considering only the children of @{text th}. In
       
  1007   effect, it only needs to be recomputed for @{text th} when one of
       
  1008   its children changes its current precedence.  Once the current 
       
  1009   precedence is computed in this more efficient manner, the selection
       
  1010   of the thread with highest precedence from a set of ready threads is
       
  1011   a standard scheduling operation implemented in most operating
       
  1012   systems.
       
  1013 
       
  1014   Of course the main work for implementing PIP involves the
       
  1015   scheduler and coding how it should react to events.  Below we
       
  1016   outline how our formalisation guides this implementation for each
       
  1017   kind of events.\smallskip
       
  1018 *}
       
  1019 
       
  1020 (*<*)
       
  1021 context step_create_cps
       
  1022 begin
       
  1023 (*>*)
       
  1024 text {*
       
  1025   \noindent
       
  1026   \colorbox{mygrey}{@{term "Create th prio"}:} We assume that the current state @{text s'} and 
       
  1027   the next state @{term "s \<equiv> Create th prio#s'"} are both valid (meaning the event
       
  1028   is allowed to occur). In this situation we can show that
       
  1029 
       
  1030   \begin{isabelle}\ \ \ \ \ %%%
       
  1031   \begin{tabular}{@ {}l}
       
  1032   @{thm eq_dep},\\
       
  1033   @{thm eq_cp_th}, and\\
       
  1034   @{thm[mode=IfThen] eq_cp}
       
  1035   \end{tabular}
       
  1036   \end{isabelle}
       
  1037 
       
  1038   \noindent
       
  1039   This means in an implementation we do not have recalculate the @{text RAG} and also none of the
       
  1040   current precedences of the other threads. The current precedence of the created
       
  1041   thread @{text th} is just its precedence, namely the pair @{term "(prio, length (s::event list))"}.
       
  1042   \smallskip
       
  1043   *}
       
  1044 (*<*)
       
  1045 end
       
  1046 context step_exit_cps
       
  1047 begin
       
  1048 (*>*)
       
  1049 text {*
       
  1050   \noindent
       
  1051   \colorbox{mygrey}{@{term "Exit th"}:} We again assume that the current state @{text s'} and 
       
  1052   the next state @{term "s \<equiv> Exit th#s'"} are both valid. We can show that
       
  1053 
       
  1054   \begin{isabelle}\ \ \ \ \ %%%
       
  1055   \begin{tabular}{@ {}l}
       
  1056   @{thm eq_dep}, and\\
       
  1057   @{thm[mode=IfThen] eq_cp}
       
  1058   \end{tabular}
       
  1059   \end{isabelle}
       
  1060 
       
  1061   \noindent
       
  1062   This means again we do not have to recalculate the @{text RAG} and
       
  1063   also not the current precedences for the other threads. Since @{term th} is not
       
  1064   alive anymore in state @{term "s"}, there is no need to calculate its
       
  1065   current precedence.
       
  1066   \smallskip
       
  1067 *}
       
  1068 (*<*)
       
  1069 end
       
  1070 context step_set_cps
       
  1071 begin
       
  1072 (*>*)
       
  1073 text {*
       
  1074   \noindent
       
  1075   \colorbox{mygrey}{@{term "Set th prio"}:} We assume that @{text s'} and 
       
  1076   @{term "s \<equiv> Set th prio#s'"} are both valid. We can show that
       
  1077 
       
  1078   \begin{isabelle}\ \ \ \ \ %%%
       
  1079   \begin{tabular}{@ {}l}
       
  1080   @{thm[mode=IfThen] eq_dep}, and\\
       
  1081   @{thm[mode=IfThen] eq_cp_pre}
       
  1082   \end{tabular}
       
  1083   \end{isabelle}
       
  1084 
       
  1085   \noindent
       
  1086   The first property is again telling us we do not need to change the @{text RAG}. 
       
  1087   The second shows that the @{term cp}-values of all threads other than @{text th} 
       
  1088   are unchanged. The reason is that @{text th} is running; therefore it is not in 
       
  1089   the @{term dependants} relation of any other thread. This in turn means that the 
       
  1090   change of its priority cannot affect other threads.
       
  1091 
       
  1092   %The second
       
  1093   %however states that only threads that are \emph{not} dependants of @{text th} have their
       
  1094   %current precedence unchanged. For the others we have to recalculate the current
       
  1095   %precedence. To do this we can start from @{term "th"} 
       
  1096   %and follow the @{term "depend"}-edges to recompute  using Lemma~\ref{childrenlem} 
       
  1097   %the @{term "cp"} of every 
       
  1098   %thread encountered on the way. Since the @{term "depend"}
       
  1099   %is assumed to be loop free, this procedure will always stop. The following two lemmas show, however, 
       
  1100   %that this procedure can actually stop often earlier without having to consider all
       
  1101   %dependants.
       
  1102   %
       
  1103   %\begin{isabelle}\ \ \ \ \ %%%
       
  1104   %\begin{tabular}{@ {}l}
       
  1105   %@{thm[mode=IfThen] eq_up_self}\\
       
  1106   %@{text "If"} @{thm (prem 1) eq_up}, @{thm (prem 2) eq_up} and @{thm (prem 3) eq_up}\\
       
  1107   %@{text "then"} @{thm (concl) eq_up}.
       
  1108   %\end{tabular}
       
  1109   %\end{isabelle}
       
  1110   %
       
  1111   %\noindent
       
  1112   %The first lemma states that if the current precedence of @{text th} is unchanged,
       
  1113   %then the procedure can stop immediately (all dependent threads have their @{term cp}-value unchanged).
       
  1114   %The second states that if an intermediate @{term cp}-value does not change, then
       
  1115   %the procedure can also stop, because none of its dependent threads will
       
  1116   %have their current precedence changed.
       
  1117   \smallskip
       
  1118   *}
       
  1119 (*<*)
       
  1120 end
       
  1121 context step_v_cps_nt
       
  1122 begin
       
  1123 (*>*)
       
  1124 text {*
       
  1125   \noindent
       
  1126   \colorbox{mygrey}{@{term "V th cs"}:} We assume that @{text s'} and 
       
  1127   @{term "s \<equiv> V th cs#s'"} are both valid. We have to consider two
       
  1128   subcases: one where there is a thread to ``take over'' the released
       
  1129   resource @{text cs}, and one where there is not. Let us consider them
       
  1130   in turn. Suppose in state @{text s}, the thread @{text th'} takes over
       
  1131   resource @{text cs} from thread @{text th}. We can prove
       
  1132 
       
  1133 
       
  1134   \begin{isabelle}\ \ \ \ \ %%%
       
  1135   @{thm depend_s}
       
  1136   \end{isabelle}
       
  1137   
       
  1138   \noindent
       
  1139   which shows how the @{text RAG} needs to be changed. The next lemma suggests
       
  1140   how the current precedences need to be recalculated. For threads that are
       
  1141   not @{text "th"} and @{text "th'"} nothing needs to be changed, since we
       
  1142   can show
       
  1143 
       
  1144   \begin{isabelle}\ \ \ \ \ %%%
       
  1145   @{thm[mode=IfThen] cp_kept}
       
  1146   \end{isabelle}
       
  1147   
       
  1148   \noindent
       
  1149   For @{text th} and @{text th'} we need to use Lemma~\ref{childrenlem} to
       
  1150   recalculate their current precedence since their children have changed. *}(*<*)end context step_v_cps_nnt begin (*>*)text {*
       
  1151   \noindent
       
  1152   In the other case where there is no thread that takes over @{text cs}, we can show how
       
  1153   to recalculate the @{text RAG} and also show that no current precedence needs
       
  1154   to be recalculated.
       
  1155 
       
  1156   \begin{isabelle}\ \ \ \ \ %%%
       
  1157   \begin{tabular}{@ {}l}
       
  1158   @{thm depend_s}\\
       
  1159   @{thm eq_cp}
       
  1160   \end{tabular}
       
  1161   \end{isabelle}
       
  1162   *}
       
  1163 (*<*)
       
  1164 end
       
  1165 context step_P_cps_e
       
  1166 begin
       
  1167 (*>*)
       
  1168 text {*
       
  1169   \noindent
       
  1170   \colorbox{mygrey}{@{term "P th cs"}:} We assume that @{text s'} and 
       
  1171   @{term "s \<equiv> P th cs#s'"} are both valid. We again have to analyse two subcases, namely
       
  1172   the one where @{text cs} is not locked, and one where it is. We treat the former case
       
  1173   first by showing that
       
  1174   
       
  1175   \begin{isabelle}\ \ \ \ \ %%%
       
  1176   \begin{tabular}{@ {}l}
       
  1177   @{thm depend_s}\\
       
  1178   @{thm eq_cp}
       
  1179   \end{tabular}
       
  1180   \end{isabelle}
       
  1181 
       
  1182   \noindent
       
  1183   This means we need to add a holding edge to the @{text RAG} and no
       
  1184   current precedence needs to be recalculated.*}(*<*)end context step_P_cps_ne begin(*>*) text {*
       
  1185   \noindent
       
  1186   In the second case we know that resource @{text cs} is locked. We can show that
       
  1187   
       
  1188   \begin{isabelle}\ \ \ \ \ %%%
       
  1189   \begin{tabular}{@ {}l}
       
  1190   @{thm depend_s}\\
       
  1191   @{thm[mode=IfThen] eq_cp}
       
  1192   \end{tabular}
       
  1193   \end{isabelle}
       
  1194 
       
  1195   \noindent
       
  1196   That means we have to add a waiting edge to the @{text RAG}. Furthermore
       
  1197   the current precedence for all threads that are not dependants of @{text th}
       
  1198   are unchanged. For the others we need to follow the edges 
       
  1199   in the @{text RAG} and recompute the @{term "cp"}. To do this we can start from @{term "th"} 
       
  1200   and follow the @{term "depend"}-edges to recompute  using Lemma~\ref{childrenlem} 
       
  1201   the @{term "cp"} of every 
       
  1202   thread encountered on the way. Since the @{term "depend"}
       
  1203   is loop free, this procedure will always stop. The following lemma shows, however, 
       
  1204   that this procedure can actually stop often earlier without having to consider all
       
  1205   dependants.
       
  1206 
       
  1207   \begin{isabelle}\ \ \ \ \ %%%
       
  1208   \begin{tabular}{@ {}l}
       
  1209   %%@ {t hm[mode=IfThen] eq_up_self}\\
       
  1210   @{text "If"} @{thm (prem 1) eq_up}, @{thm (prem 2) eq_up} and @{thm (prem 3) eq_up}\\
       
  1211   @{text "then"} @{thm (concl) eq_up}.
       
  1212   \end{tabular}
       
  1213   \end{isabelle}
       
  1214 
       
  1215   \noindent
       
  1216   This lemma states that if an intermediate @{term cp}-value does not change, then
       
  1217   the procedure can also stop, because none of its dependent threads will
       
  1218   have their current precedence changed.
       
  1219   *}
       
  1220 (*<*)
       
  1221 end
       
  1222 (*>*)
       
  1223 text {*
       
  1224   \noindent
       
  1225   As can be seen, a pleasing byproduct of our formalisation is that the properties in
       
  1226   this section closely inform an implementation of PIP, namely whether the
       
  1227   @{text RAG} needs to be reconfigured or current precedences need to
       
  1228   be recalculated for an event. This information is provided by the lemmas we proved.
       
  1229   We confirmed that our observations translate into practice by implementing
       
  1230   our version of PIP on top of PINTOS, a small operating system written in C and used for teaching at 
       
  1231   Stanford University \cite{PINTOS}. To implement PIP, we only need to modify the kernel 
       
  1232   functions corresponding to the events in our formal model. The events translate to the following 
       
  1233   function interface in PINTOS:
       
  1234 
       
  1235   \begin{center}
       
  1236   \begin{tabular}{|l@ {\hspace{2mm}}|l@ {\hspace{2mm}}|}
       
  1237   \hline
       
  1238   {\bf Event} & {\bf PINTOS function} \\
       
  1239   \hline
       
  1240   @{text Create} & @{text "thread_create"}\\
       
  1241   @{text Exit}   & @{text "thread_exit"}\\
       
  1242   @{text Set}    & @{text "thread_set_priority"}\\
       
  1243   @{text P}      & @{text "lock_acquire"}\\
       
  1244   @{text V}      & @{text "lock_release"}\\ 
       
  1245   \hline
       
  1246   \end{tabular}
       
  1247   \end{center}
       
  1248 
       
  1249   \noindent
       
  1250   Our implicit assumption that every event is an atomic operation is ensured by the architecture of 
       
  1251   PINTOS. The case where an unlocked resource is given next to the waiting thread with the
       
  1252   highest precedence is realised in our implementation by priority queues. We implemented
       
  1253   them as \emph{Braun trees} \cite{Paulson96}, which provide efficient @{text "O(log n)"}-operations
       
  1254   for accessing and updating. Apart from having to implement relatively complex data\-structures in C
       
  1255   using pointers, our experience with the implementation has been very positive: our specification 
       
  1256   and formalisation of PIP translates smoothly to an efficent implementation in PINTOS. 
       
  1257 *}
       
  1258 
       
  1259 section {* Conclusion *}
       
  1260 
       
  1261 text {* 
       
  1262   The Priority Inheritance Protocol (PIP) is a classic textbook
       
  1263   algorithm used in many real-time operating systems in order to avoid the problem of
       
  1264   Priority Inversion.  Although classic and widely used, PIP does have
       
  1265   its faults: for example it does not prevent deadlocks in cases where threads
       
  1266   have circular lock dependencies.
       
  1267 
       
  1268   We had two goals in mind with our formalisation of PIP: One is to
       
  1269   make the notions in the correctness proof by Sha et al.~\cite{Sha90}
       
  1270   precise so that they can be processed by a theorem prover. The reason is
       
  1271   that a mechanically checked proof avoids the flaws that crept into their
       
  1272   informal reasoning. We achieved this goal: The correctness of PIP now
       
  1273   only hinges on the assumptions behind our formal model. The reasoning, which is
       
  1274   sometimes quite intricate and tedious, has been checked by Isabelle/HOL. 
       
  1275   We can also confirm that Paulson's
       
  1276   inductive method for protocol verification~\cite{Paulson98} is quite
       
  1277   suitable for our formal model and proof. The traditional application
       
  1278   area of this method is security protocols. 
       
  1279 
       
  1280   The second goal of our formalisation is to provide a specification for actually
       
  1281   implementing PIP. Textbooks, for example \cite[Section 5.6.5]{Vahalia96},
       
  1282   explain how to use various implementations of PIP and abstractly
       
  1283   discuss their properties, but surprisingly lack most details important for a
       
  1284   programmer who wants to implement PIP (similarly Sha et al.~\cite{Sha90}).  
       
  1285   That this is an issue in practice is illustrated by the
       
  1286   email from Baker we cited in the Introduction. We achieved also this
       
  1287   goal: The formalisation allowed us to efficently implement our version
       
  1288   of PIP on top of PINTOS \cite{PINTOS}, a simple instructional operating system for the x86 
       
  1289   architecture. It also gives the first author enough data to enable
       
  1290   his undergraduate students to implement PIP (as part of their OS course).
       
  1291   A byproduct of our formalisation effort is that nearly all
       
  1292   design choices for the PIP scheduler are backed up with a proved
       
  1293   lemma. We were also able to establish the property that the choice of
       
  1294   the next thread which takes over a lock is irrelevant for the correctness
       
  1295   of PIP. 
       
  1296 
       
  1297   PIP is a scheduling algorithm for single-processor systems. We are
       
  1298   now living in a multi-processor world. Priority Inversion certainly
       
  1299   occurs also there.  However, there is very little ``foundational''
       
  1300   work about PIP-algorithms on multi-processor systems.  We are not
       
  1301   aware of any correctness proofs, not even informal ones. There is an
       
  1302   implementation of a PIP-algorithm for multi-processors as part of the
       
  1303   ``real-time'' effort in Linux, including an informal description of the implemented scheduling
       
  1304   algorithm given in \cite{LINUX}.  We estimate that the formal
       
  1305   verification of this algorithm, involving more fine-grained events,
       
  1306   is a magnitude harder than the one we presented here, but still
       
  1307   within reach of current theorem proving technology. We leave this
       
  1308   for future work.
       
  1309 
       
  1310   The most closely related work to ours is the formal verification in
       
  1311   PVS of the Priority Ceiling Protocol done by Dutertre
       
  1312   \cite{dutertre99b}---another solution to the Priority Inversion
       
  1313   problem, which however needs static analysis of programs in order to
       
  1314   avoid it. There have been earlier formal investigations
       
  1315   into PIP \cite{Faria08,Jahier09,Wellings07}, but they employ model
       
  1316   checking techniques. The results obtained by them apply,
       
  1317   however, only to systems with a fixed size, such as a fixed number of 
       
  1318   events and threads. In contrast, our result applies to systems of arbitrary
       
  1319   size. Moreover, our result is a good 
       
  1320   witness for one of the major reasons to be interested in machine checked 
       
  1321   reasoning: gaining deeper understanding of the subject matter.
       
  1322 
       
  1323   Our formalisation
       
  1324   consists of around 210 lemmas and overall 6950 lines of readable Isabelle/Isar
       
  1325   code with a few apply-scripts interspersed. The formal model of PIP
       
  1326   is 385 lines long; the formal correctness proof 3800 lines. Some auxiliary
       
  1327   definitions and proofs span over 770 lines of code. The properties relevant
       
  1328   for an implementation require 2000 lines. 
       
  1329   %The code of our formalisation 
       
  1330   %can be downloaded from
       
  1331   %\url{http://www.inf.kcl.ac.uk/staff/urbanc/pip.html}.\medskip
       
  1332 
       
  1333   \noindent
       
  1334   {\bf Acknowledgements:}
       
  1335   We are grateful for the comments we received from anonymous
       
  1336   referees.
       
  1337 
       
  1338   \bibliographystyle{plain}
       
  1339   \bibliography{root}
       
  1340 *}
       
  1341 
       
  1342 
       
  1343 (*<*)
       
  1344 end
       
  1345 (*>*)