Anthony C. Davies
Simon Fraser University
School of Engineering Science

ANTHONY C. DAVIES
Department of Electronic Engineering
King's College London
Strand, London WC2R 2LS
England
E-mail: tonydavies@kcl.ac.uk

Title: COMMUNICATIONS WITHIN DISTRIBUTED REAL-TIME COMPUTING SYSTEMS:
ARBITERS, METASTABILITY AND WAIT-FREEDOM

Friday, September 15, 2000 at 10:30 a.m. in Room ASB 9705


Abstract

The talk will review the basic concepts of metastability in flip-flops, etc., with examples including a multi-way arbiter design that is so bad that it almost always exhibits metastability in its behaviour and will then explain properties that a wait-free asynchronous communications mechanism has to satisfy, and illustrate this with a description of some ``four-slot'' mechanisms which meet the correctness requirements in a fully asynchronous environment, and which are capable of efficient implementation in digital hardware.

More details can be found in:
``A Comparison of some Wait-Free Communications Mechanisms'' by Ian G. Clark and Anthony C. Davies, presented at a workshop on ``Asynchronous Interfaces: Tools, Techniques and Implementations'' (AINT '2000), 19th-20th July 2000, Delft, Netherlands, and in ``Dynamic Properties of a Multiway Arbiter'' by Anthony C. Davies, presented at ISCAS 2000, 29-31st May 2000, Geneva, Switzerland.

Further information can also be found at http://www.eee.kcl.ac.uk/~comfort.

Biography

Anthony C. Davies is professor emeritus at King's College London, having taken early retirement from administration and teaching duties at King's College in October 1999. He moved to King's College from City University London in 1990, where he was for a number of years Director of the Centre for Information Engineering. He has in the past taught at Purdue University and University of British Columbia.

He was awarded a B.Sc.(Eng.) with First Class Honours by Southampton University, an M.Phil by University of London, and a PhD by City University. He is a Fellow of IEEE and IEE. See also http://www.eee.kcl.ac.uk/member/staff/a_davies.html.

His main recent teaching activities have been in digital signal processing and in software design methods, and in the last few years, his research funding has covered the areas of non-linear dynamics and chaos, the estimation of crowd behaviour, and more recently, aspects of asynchronous systems design. He is currently the IEEE Circuits and Systems Society Vice President for Region 8.

Extended abstract

As speed and complexity of integrated circuits increases, chip designs are likely to comprise many locally synchronous processors communicating asynchronously - sometimes known as GALS (globally asynchronous locally synchronous) - because the difficulties of distribution of a high-frequency clock across a complex integrated circuit may make fully synchronous designs impracticable. There might also be a trend to using fully asynchronous designs. Compared to synchronous designs, these offer the possibility of lower power consumption (since switching is needed only when computation needs to be performed) and better electromagnetic compatibility (because of the elimination of a high-power, high-frequency clock).

Reliable communication between independent processors in an asynchronous environment poses a number of problems and presents interesting opportunities. In the first place there is metastability. In transferring data between non-synchronised systems (and also in converting an external analogue signal into an internal digital format) there is an unavoidable risk that unstable equilibrium points in the non-linear dynamic behaviour of the latches and other digital storage devices used can lead to occasional transients in which the signal levels takes an abnormally long time to settle to the ``logic-one'' or logic zero' levels. Schemes for synchronising independent systems, arbiters for controlling mutually-exclusive access to shared resources (such as memory), and analogue-to-digital convertors are all subject to the risk of metastablity. Designs need to take this into account and minimise the probability of occurrence.

In the second place there is the opportunity to achieve wait-free communications. It is often supposed that all interprocess communications necessarily involve some form of synchronisation (for example, the ADA rendezvous) which implies waiting, and a risk of deadlock in the case of either faulty design or component failure. The possibility of designing mechanisms to implement wait-free communication between processes (whether on the same or different processors) is often assumed impossible, and the need for arbiters in the access to the associated shared memory is assumed inevitable. However, for a class of interprocess communications for which the write operation is destructive and the read operation is non-destructive, it is possible to devise communications mechanisms in which the writer is always free to write and the reader is always free to read. The data objects to be communicated are assumed to be either records with a number of fields or arrays with a number of elements, with the requirement that the reader must always obtain a coherent object which should be the most recently available one. Wait-freedom offers the prospect of making digital systems with a property associated with analogue systems - in the latter, failures on one part generally cannot hold up or stop the operation of the other parts, whereas some faults in conventional digital systems can lead to total system deadlock.

A mechanism for wait-free communications is required to satisfy properties of data coherence, data sequencing and data freshness, and additionally the effects of metastability must be allowed for when operating in a fully asynchronous environment. A number such mechanisms have been independently proposed, based upon widely differing concepts and analysed by radically different methods. These designs are typically rather complicated and difficult to understand, so it is by no means easy to be confident about their correctness, and some designers have overlooked important properties that such mechanisms ought to satisfy.


Last updated Wednesday September 6 22:23:23 PDT 2000.