Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Modeling Semaphores with Uppaal

Semaphores in Uppaal

Frits Vaandrager
Radboud University Nijmegen
March 2006

Version 4.0 of Uppaal includes C-like syntax for defining functions. This new syntax makes it possible to define very simple and elegant models of semaphores. This webpage briefly describes the semaphore models and how they can be used to solve some nontrivial concurrency problems taken from William Stallings, Operating Systems: Internals and Design Principles, 5th ed, Prentice-Hall, 2005. I have been using ideas from Martijn Hendriks, Koen Vermeer, Justus Freijzer and Bart Kerkhof.

Modeling Semaphores in Uppaal

Since Uppaal does not include the concept of a blocked process, we take a slightly more abstract view in modeling semaphores: when a process does a Wait on a semaphore it may only proceed after subsequent receipt of a Go message. If the semaphore has a positive value it will issue such a message immediately. Otherwise, the process id will be entered in a queue, and the Go message will be sent later when the process id is removed from the queue as a result of a Signal.

Technically, a semaphore s is modeled as an automaton that interacts with its environment through three sorts of synchronization actions: semWait[s][p]?, semSignal[s][p]? and semGo[s][p]!, where p is some process id in the interval [0,N-1]. The semaphore template has three parameters of type integer: (1) id, the unique identifier of the semaphore, (2) init_val, the initial value of the semaphore, and (3) queue_size, the maximal number of processes in the waiting queue. Since Uppaal does not support dynamically growing data structures, we need to fix an upper bound on the size of the queue. In our model, the queue is implemented as an array of queue_size. The semaphore also has an integer state variable count, which records the current value of the semaphore. The model is schematically depected below.

The Uppaal files for the model are available at

semaphore.xml and semaphore.q

In this model the semaphore is used to achieve mutual exclusion for four processes. Some comments on the model:

Since currently in Uppaal it is not possible to initialize a parametrized array, we need a special transition to do this. This imperfection of Uppaal actually has a positive consequence: due to the extra transition the automaton looks like a honeybee! By making the initial location "committed" we ensure that the initialization takes place before any other activity in the system. (In Uppaal, outgoing transitions of a committed location have priority and must be carried out immediately.)
In five transitions we use a select field p int[0,N-1]. This simply means that we have instances of these transitions for each p in the interval [0,N-1], that is, for each process.
In Dijkstra's orgininal definition, count is not further decreased when it is 0 and a semWait occurs. Stalling does decrease the count in this case. It does not make any difference for either the observable behavior of the semaphore automaton or in terms of the number of reachable states. I slightly prefer Dijkstra's definition but decided to follow Stallings here, since I am using the examples from his book. An advantage of Stallings' definition is that when count is negative its value tells you how many processes are waiting. With Dijkstra's definition you don't have that information readily available.
In case a semWait occurs when count is positive, or a semSignal occurs when count is negative, the automaton perfoms a semGo transition. I have made the start location of this transition committed, to express that the semGo is generated in a single atomic transaction with the preceding semWait or semSignal. An auxiliary variable q is required to store the id of the process to which the semGo signal is sent.
The definitions of the various operations on the queue (enQueue(p), deQueue(), etc) are straightforward. My objective has been to optimize on the use of memory rather than on computation time. So, for instance, I did not introduce pointer variables to store the first empty position in the array but just compute this position with a simple while loop in the body of the enQueue function. In model checking, memory usage is typically the bottleneck and so it is important to minimize the number of state variables.
Each semWait(s) operation by a process p that is using semaphore s translates to two consecutive transitions labeled with synchronization actions semWait[s][p]! and semGo[s][p]?, respectively. Each semSignal(s) operation by p is encoded by a transition with synchronization action semSignal[s][p]!.
Correctness properties that can be verified using Uppaal include (a) mutual exclusion, (b) absence of deadlock, (b) no queue overflow.

The Uppaal model for the binary semaphore is a small and obvious variation of the general semaphore model. It is available at

binary_semaphore.xml

Tackling Concurrency Problems

Now we have models of semaphores, we can start playing with them!

An Incorrect Solution to the Producer/Consumer Problem

Here is a model of the incorrect solution to the Infinite-Buffer Producer/Consumer Problem using Binary Semaphores, discussed by Stallings on pages 221-224:

producerconsumer.xml and producerconsumer.q

The model is obtained in a straightforward manner from the code presented by Stallings in Fig 5.9. As Stallings point out, the problem with this solution is that variable n may become negative, that is, the consumer may consume an item from the buffer that does not exist. By posing the query E<> n<0 to the verifier ("there exists a path to a state in which n<0"), Uppaal produces a counterexample almost instantaneously. Essentially (modulo permutation of independent transitions), this is the same counterexample as the one presented by Stallings in Table 5.3. The ability of the Uppaal simulator to replay the counterexample greatly helps in understanding what goes wrong. Note that Uppaal is not able to explore the full (inifinite) state space of this model.

Solutions to the Bounded Producer/Consumer Problem

I also looked at the solution to the Bounded-Buffer Producer/Consumer Problem Using Semaphores presented by Stallings in Figure 5.13:

boundedproducerconsumer.xml and boundedproducerconsumer.q

Stallings claims correctness of this solution, but does not prove it. Even for large values of sizeofbuffer like 10.000, Uppaal can prove mutual exclusion and absence of deadlock automatically within a few seconds After introducing an auxiliary variable buffer that is incremented by function produce() and decremented by function consume(), Uppaal can prove the invariant property buffer >=0 and buffer <= sizeofbuffer, that is the consumer never consumes an item that does not exist, and there is no buffer overflow.

The above solution to the Bounded-Buffer Producer/Consumer Problem is also presented by Tanenbaum and Woodhul in their textbook on Operating Systems (Second Edition). The authors observe that when the order of the Wait operations in the Producer code is reversed there is a deadlock. This observation can easily be verified using Uppaal:

faultyboundedproducerconsumer.xml

Dining Philosophers

Of course, it is trivial to encode "solutions" to the Dining Philosophers problem in Uppaal, find deadlocks, etc

Here is the first solution of Figure 6.12 that contains a deadlock Dining_Philosophers.xml, Dining_Philosophers.q
Here is the second (correct) solution of Figure 6.13 Dining_Philosophers_with_Doorman.xml, Dining_Philosophers_with_Doorman.q

Conclusions

The article on semaphores in Wikipedia criticizes semaphores:

Semaphores remain in common use in programming languages that do not intrinsically support other forms of synchronization. They are the primitive synchronization mechanism in many operating systems. The trend in programming language development, though, is towards more structured forms of synchronization, such as monitors and channels. In addition to their inadequacies in dealing with deadlocks, semaphores do not protect the programmer from the easy mistakes of taking a semaphore that is already held by the same process, and forgetting to release a semaphore that has been taken. Hoare, Hansen, Andrews, Wirth, and even Dijkstra have called semaphores obsolete. Although I tend to agree with the above conclusion, I would like to add as a footnote that - as illustrated on this page - with the help of modern model checking technology it has become quite easy to verify the correctness of solutions of concurrency problems with semaphores. So the main disadvantage of semaphores, it is easy to make mistakes, has been overcome to some extend. However, we need to do many more examples to substantiate this claim. Also there is a need for tools that can automatically extract models out of software, since manual construction of models is time consuming and error prone. Results in this direction would be important since, as Wikipedia also observes, semaphores are still widely used in practice, for instance in embedded software.