Document [original]

A Set-Based Calculus and its Implementation

vorgelegt von Diplom-Informatiker

Wolfgang Grieskamp

aus Essen

Am Fachbereich 13, Informatik,

der Technischen Universit¨

at Berlin

zur Erlangung des akademischen Grades

Doktor der Ingenieurwissenschaften

– Dr.-Ing. –

genehmigte Dissertation

Promotionsausschuß:

Vorsitzender : Prof. Dr. G¨

unter Hommel

Berichter : Prof. Dr. Peter Pepper

Berichter : Prof. Dr. Stefan J¨

ahnichen

Tag der wissenschaftlichen Aussprache: 26.11.1999

Berlin 1999

D 83

Meinen Eltern

Zusammenfassung

Diese Arbeit untersucht die Fundierung und Implementierung eines Berechnungsmodells

f¨ur mengenbasierte Computersprachen. Es wird das



Z-Kalk¨ul eingef¨uhrt, eine mini-

male mengenbasierte Ausdruckssprache, verwandt mit dem



-Kalk¨ul, das Anwendungs-

sprachen wie die Spezifikationssprache Z oder funktional-logische Programmiersprachen

komplett und nat¨urlich einbetten kann. Syntax,Semantik und Gleichungstheorie von



werden definiert. Mehrere Berechnungsmodelle werden entwickelt. Durch gezielte An-

wendung der Gleichungstheorie ergibt sich eine Narrowingsemantik, die durch lokale

Modifikation der Redexauswahl nicht-strikt oder strikt ausf¨allt, und insbesondere interes-

sant f¨ur interpretative “on-line” Auswertung ist, da sie auf symbolischer Termersetzung

basiert. Das zentrale Berechnungsmodell aber basiert auf einer nat¨

urlichen Semantik, die

durch ihren operationalen Charakter das Design kompilierter “off-line” Auswertung vor-

bereitet, und das Berechnungsreferenzmodell von



Zfestlegt. Dieses Modell kombiniert

funktionale Reduktion und nebenl¨aufige Resolution, und definiert eine klare Trennung

zwischen der (deterministischen, funktionalen) Konstruktion von Mengen und der (nicht-

deterministischem, logischen) Dekonstruktion. Da Mengen sowohl extensional als auch

intensional dargestellt werden k¨onnen, ist das Modell von h¨oherer Ordnung.

Die Implementierung erfolgt in Form einer abstrakten Maschine, der ZAM, welche

auf der Technik nebenl¨aufiger Resolutionsagenten basiert. Die Maschine realisiert eine

effiziente Tiefensuche ¨uber die Verwaltung von Auswahlpunkten und Modifikationspro-

tokollen. Durch ein Priorit¨atenmodell in der nebenl¨aufigen Ausf¨uhrung werden nichtde-

terministische Berechnungen dynamisch verz¨ogert und deterministische propagiert. Die

ZAM ist als Stackmaschine vollst¨andig in Z spezifiert, und wird in der Implementierung

in C++ zu einer Registermaschine verfeinert, welche persistente Register verwendet, um

den Aufwand der Protokollierung von Modifikation zu minimieren.

Die Anwendung der vorgestellten Konzepte erfolgt im Rahmen des ZeTa Systems,

im BMBF Projekt ESPRESS entwickelt, das verschiedenartige Notationen und Werkzeuge

auf der semantischen Grundlage von Z integriert. Der ZaP ¨

Ubersetzer, Bestandteil des

ZeTa Systems, realisiert mit Hilfe des



Z-Kalk¨uls die Ausf¨uhrung von Z Modellen, und

wurde beispielsweise in ESPRESS f¨ur die Automatisierung von Softwaretests eingesetzt.

Die Definition von



Z, der Berechnungsmodelle, der abstrakten Maschine und der

Ubersetzung von



Zin Maschinencode ist vollst¨andig in der Spezifikationssprache Z ge-

geben (mit kleineren syntaktischen Erweiterungen), und daher syntax- und typgepr¨uft.

Insofern ist diese Arbeit auch eine Fallstudie der Anwendbarkeit von Z f¨ur anspruchsvol-

le Probleme der Metamodellierung.

Preface

In response to those who thought that Cantor’s theory of sets might be destroyed by the

well-known paradoxes discovered at the beginning of this century, Hilbert once spoke

about the “paradise created by Cantor from which nobody shall ever expel us” (quoted

in Vaught [1995]). On the level of non-constructive calculi, the beauty of the set-based

approach has been recognized in computer science in the formal notation Z [Spivey, 1992;

Toyn, 1998], developed in the eighties and semantically based on Zermelo-Fraenkli set

theory, which solves the paradoxes of classical set theory. In recent years, a growing

number of authors haveadvocated the useof sets in declarativeprogramming and program

analysis.

In this thesis, I use the set-based approach as the basis for a novel, small intermediate

calculus, which allows to naturally embed concepts from higher-order functional, logic

and concurrent constraint programming, as well as set-based specification languages such

as Z. I define the syntax and semanticsof the calculus, analyze its equational theory, inves-

tigate several computation models, provide an abstract machine that implements a model,

and define a compilation from the calculus to the machine. The power of the calculus

and its implementation allows an unorthodox approach to the integration of higher-order

functional and constraint logic programming.

The present thesis, written between January and September 1999, grew out of my ac-

tivities in the application oriented research project ESPRESS (1995-1998), which was con-

cerned with methods and tools for the specification of safety-critical embedded systems

[B¨ussow et al., 1996; B¨ussow and Grieskamp, 1997; B¨ussow et al., 1997a; Grieskamp

et al., 1998; B¨ussow et al., 1998; B¨ussowand Grieskamp, 1999]. One goal in the ESPRESS

context was to provide a tool to execute Z specifications, for the purpose of evaluating

test data and animating requirements specification of embedded systems based on Z. The

development of a compiler for Z was influenced by my earlier experiences with the im-

plementation of the algebraic-functional language Opal [Schulte and Grieskamp, 1992;

Hartel et al., 1996; Didrich et al., 1994; Frauenstein et al., 1996a,b; Didrich et al., 1998],

which strongly suggested basing such a compiler on a small intermediate language in

the style of the



-calculus. This led to a first design of the calculus presented in this

thesis, called



Zbecause of its original purpose for abstracting from the diversity of the

Z notation. This first version was used as an intermediate language in the experimental

compiler ZaP, which is part of the Z-based tool-integration environment ZeTa [B¨ussow

and Grieskamp, 1999], developed under the supervision of the author in ESPRESS.ZaP

has proved to be a feasible tool, in particular for the evaluation of test data, and it is being

used for this purpose in several projects conducted in cooperation with Daimler-Chrysler,

FT3/SM, Berlin. The work presented here consolidates and refines the technology of ZaP

on the basis of these experiences. On the one hand, it closes the gap between the exper-

imental prototype ZaP and its theoretical foundation, and on the other it extends ZaP’s

model by indeterministic computation, which proved to be desirable in the course of the

case studies.

Acknowledgments

I would like to thank Prof. Dr. Peter Pepper for his continuous support on both a personal

and professional level over the past years. Peter’s tolerance combined with brilliance has

been a major yardstick for me. I would also like to thank Prof. Dr. Stefan J¨ahnichen and

Prof. Dr. Hartmut Ehrig for their encouragement and confidence in my ESPRESS work.

Numerous colleagues at Peters group, ¨

UBB, have indirectly contributed to this thesis,

among them Klaus Didrich, Michael Cebulla and Christian Maeder, not at least by taking

over some of my daily obligations during the time of the writing. In ESPRESS, the col-

laboration with Robert B¨ussow on the ZeTa system and other topics was very fruitful and

pleasuring. The exchange of ideas on Z, among other things, with Andreas Fett, Thomas

Santen and other colleagues of the ESPRESS context helped me to gain insights to aspects

of this work. The discussions I had from time to time with Markus Lepper were highly

stimulating. My special thanks go to Erich Mikk for his invaluable feedback on early

prototypes of the ZaP compiler and to Jacob Wieland for the fruitful discussion of the



calculus and its use for program analysis in his diploma thesis [Wieland, 1999].

Contents

1 Introduction 17

2 Meta Notation 23

2.1 A Sketch of Z . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

2.1.1 Types and Sematic Universe . . . . . . . . . . . . . . . . . . . . 25

2.1.2 Schema Text . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

2.1.3 Genericity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

2.1.4 Undefinedness . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

2.2 Notational Extensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

2.2.1 Where Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

2.2.2 If-Exists Notation . . . . . . . . . . . . . . . . . . . . . . . . . . 28

2.2.3 Inference Rule Systems . . . . . . . . . . . . . . . . . . . . . . . 28

2.2.4 Recursive Partial Functions . . . . . . . . . . . . . . . . . . . . 29

2.2.5 Local Variables in Schema Boxes . . . . . . . . . . . . . . . . . 29

2.2.6 Case Distinction in Predicates . . . . . . . . . . . . . . . . . . . 30

2.2.7 Selective



............................. 30

2.2.8 Overloading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

2.2.9 Further Extensions . . . . . . . . . . . . . . . . . . . . . . . . . 31

2.3 Mathematical Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

2.3.1 Sets and Relations . . . . . . . . . . . . . . . . . . . . . . . . . 32

2.3.2 Sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

2.3.3 Booleans . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

2.3.4 Order and Lattices . . . . . . . . . . . . . . . . . . . . . . . . . 33

3 The



ZCalculus 37

3.1 Introduction to



Z.............................. 38

3.1.1 Syntax and Informal Meaning . . . . . . . . . . . . . . . . . . . 38

3.1.2 Encodings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39

3.2 Semantic Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

3.2.1 Set Representation . . . . . . . . . . . . . . . . . . . . . . . . . 43

3.2.2 Types................................ 44

3.2.3 Universe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

3.2.4 Free Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

3.2.5 Standard Types and Constructors . . . . . . . . . . . . . . . . . . 47

3.2.6 Order and Fixed-Points . . . . . . . . . . . . . . . . . . . . . . . 48

3.3 Expression Syntax and Meaning . . . . . . . . . . . . . . . . . . . . . . 50

3.3.1 Variable, Expression and Pattern Type . . . . . . . . . . . . . . . 50

3.3.2 Typing Relation . . . . . . . . . . . . . . . . . . . . . . . . . . . 50

3.3.3 Meaning Function . . . . . . . . . . . . . . . . . . . . . . . . . 51

3.3.4 Basic Forms: Typing and Meaning . . . . . . . . . . . . . . . . . 52

3.3.5 Expression Abbreviation Forms . . . . . . . . . . . . . . . . . . 56

3.4 Equational Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

3.4.1 Syntactic Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

3.4.2 Equivalence Relation . . . . . . . . . . . . . . . . . . . . . . . . 57

3.4.3 Boolean Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

3.4.4 Singleton Set Laws . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.4.5 Translation Laws . . . . . . . . . . . . . . . . . . . . . . . . . . 60

3.4.6 Schema Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

3.4.7 Fixed-Point Unrolling Law . . . . . . . . . . . . . . . . . . . . . 66

3.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

4 Computation in



Z71

4.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72

4.2 Normalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

4.2.1 Property Normalization . . . . . . . . . . . . . . . . . . . . . . . 75

4.2.2 Normalization Rules . . . . . . . . . . . . . . . . . . . . . . . . 76

4.2.3 Properties of Normalization . . . . . . . . . . . . . . . . . . . . 79

4.3 Narrowing Semantics . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.3.1 General Narrowing . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.3.2 Outermost-in Narrowing . . . . . . . . . . . . . . . . . . . . . . 84

4.3.3 Strict Narrowing . . . . . . . . . . . . . . . . . . . . . . . . . . 85

4.4 The Reference Computation Model . . . . . . . . . . . . . . . . . . . . . 86

4.4.1 Values and Constraints . . . . . . . . . . . . . . . . . . . . . . . 87

4.4.2 Computation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

4.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

5 Abstract Machine 97

5.1 Basic Model of the ZAM .......................... 98

5.1.1 Values................................ 98

5.1.2 Intensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99

5.1.3 Threads and Goals . . . . . . . . . . . . . . . . . . . . . . . . . 100

5.1.4 Configurations . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

5.1.5 Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

5.2 Specification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

5.2.1 Axiomatic Definitions . . . . . . . . . . . . . . . . . . . . . . . 105

5.2.2 Auxiliary Instructions . . . . . . . . . . . . . . . . . . . . . . . 107

5.2.3 Execution Step and Selection . . . . . . . . . . . . . . . . . . . . 109

5.2.4 Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110

5.3 Compilation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124

5.3.1 Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124

5.3.2 Compilation Functions . . . . . . . . . . . . . . . . . . . . . . . 125

5.4 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

5.4.1 Data Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

5.4.2 Instruction Set . . . . . . . . . . . . . . . . . . . . . . . . . . . 131

5.4.3 Memory Management . . . . . . . . . . . . . . . . . . . . . . . 133

5.4.4 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133

5.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

6 The ZeTa System 139

6.1 Three Dimensions of Integration . . . . . . . . . . . . . . . . . . . . . . 139

6.1.1 Semantic Integration . . . . . . . . . . . . . . . . . . . . . . . . 140

6.1.2 Document Integration . . . . . . . . . . . . . . . . . . . . . . . 141

6.1.3 Tool Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

6.1.4 Graphical User Interfaces . . . . . . . . . . . . . . . . . . . . . 143

6.2 The ZaP Compiler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

6.2.1 The Basic Specification of the Birthday Book . . . . . . . . . . . 146

6.2.2 Refining the Birthday Book for Execution . . . . . . . . . . . . . 147

6.2.3 Refining the Birthday Book for Testing . . . . . . . . . . . . . . 150

6.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154

7 Conclusion 157

7.1 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

7.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159

List of Figures

3.1 Type Language . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

3.2 Type Language: Syntactic Tools . . . . . . . . . . . . . . . . . . . . . . 45

3.3 Semantic Universe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

3.4 Free Types: Syntax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

3.5 Free Types: Meaning . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

3.6 Free Type of Booleans . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

3.7 Free Type of Natural Numbers . . . . . . . . . . . . . . . . . . . . . . . 47

3.8 The Order on Partial Sets . . . . . . . . . . . . . . . . . . . . . . . . . . 48

3.9 Expressions and Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . 50

3.10 Typing Relation: Declarations . . . . . . . . . . . . . . . . . . . . . . . 51

3.11 Meaning Function: Declarations . . . . . . . . . . . . . . . . . . . . . . 51

3.12 Value Matching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

3.13 Syntax and Meaning: Constructor Application . . . . . . . . . . . . . . . 52

3.14 Syntax and Meaning: Singleton Set Display and Selection . . . . . . . . 52

3.15 Syntax and Meaning: Set Algebra . . . . . . . . . . . . . . . . . . . . . 53

3.16 Syntax and Meaning: Set Translation . . . . . . . . . . . . . . . . . . . . 54

3.17 Syntax and Meaning: Schema, Fixed-Point, and Variable . . . . . . . . . 55

3.18 Expression Abbreviation Forms . . . . . . . . . . . . . . . . . . . . . . 56

3.19 Syntactic Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57

3.20 Equivalence Relation: Declaration . . . . . . . . . . . . . . . . . . . . . 58

3.21 Boolean Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58

3.22 Excluded Middle Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.23 Selection Elimination Law . . . . . . . . . . . . . . . . . . . . . . . . . 59

3.24 Constructor Decomposition Laws . . . . . . . . . . . . . . . . . . . . . 60

3.25 Translation Composition and Identity Laws . . . . . . . . . . . . . . . . 60

3.26 Translation Distributivity Laws . . . . . . . . . . . . . . . . . . . . . . . 61

3.27 Translation Product Law . . . . . . . . . . . . . . . . . . . . . . . . . . 62

3.28 Translation Lifting Law . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

3.29 Translation Splitting Law . . . . . . . . . . . . . . . . . . . . . . . . . . 63

3.30 Translation Intersection Elimination Law . . . . . . . . . . . . . . . . . 63

3.31 Schema Elimination and Distributivity Laws . . . . . . . . . . . . . . . . 64

3.32 Schema Complement Lifting Law . . . . . . . . . . . . . . . . . . . . . 65

3.33 Schema Translation Elimination Laws . . . . . . . . . . . . . . . . . . . 65

3.34 Schema Membership Lifting Law . . . . . . . . . . . . . . . . . . . . . 65

3.35 Schema Property Substitution Law . . . . . . . . . . . . . . . . . . . . . 66

3.36 Fixed-Point Unrolling Law . . . . . . . . . . . . . . . . . . . . . . . . . 66

4.1 Extensional Equality and Unequality . . . . . . . . . . . . . . . . . . . . 72

4.2 Matching Contexts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

4.3 Normalized Expressions . . . . . . . . . . . . . . . . . . . . . . . . . . 74

4.4 Singleton Set Normalization . . . . . . . . . . . . . . . . . . . . . . . . 76

4.5 Constructor Normalization . . . . . . . . . . . . . . . . . . . . . . . . . 76

4.6 Union Normalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77

4.7 Intersection Normalization . . . . . . . . . . . . . . . . . . . . . . . . . 77

4.8 Complement Normalization . . . . . . . . . . . . . . . . . . . . . . . . . 78

4.9 Translation Normalization . . . . . . . . . . . . . . . . . . . . . . . . . 79

4.10 Schema Normalization . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

4.11 Narrowing Rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.12 Values and Constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

4.13 Strict Context Reduction . . . . . . . . . . . . . . . . . . . . . . . . . . 89

4.14 Variable, Schema, and Fixed-Point Reduction . . . . . . . . . . . . . . . 89

4.15 Translation and Complement Reduction . . . . . . . . . . . . . . . . . . 89

4.16 Set Value Reduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

4.17 Mu-Value Reduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

4.18 Compound Constraint Resolution . . . . . . . . . . . . . . . . . . . . . . 91

4.19 Expression Reduction in Constraints . . . . . . . . . . . . . . . . . . . . 91

4.20 Membership and Equality Resolution . . . . . . . . . . . . . . . . . . . 92

4.21 Test-Emptiness Resolution . . . . . . . . . . . . . . . . . . . . . . . . . 92

5.1 ZAM Values................................. 98

5.2 ZAM Intensions............................... 99

5.3 ZAM Threads and Goals . . . . . . . . . . . . . . . . . . . . . . . . . . 101

5.4 ZAM Configurations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

5.5 ZAM Instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

5.6 Shifting Values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

5.7 Freezing Values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

5.8 Getting the Variables of a Value . . . . . . . . . . . . . . . . . . . . . . 106

5.9 Equality...................................107

5.10 Unification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108

5.11 Merging and Joining Value Sequences . . . . . . . . . . . . . . . . . . . 108

5.12 Adding Information to Choices . . . . . . . . . . . . . . . . . . . . . . . 109

5.13 Ordering Index Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

5.14 ZAM Auxiliary Instructions . . . . . . . . . . . . . . . . . . . . . . . . . 109

5.15 ZAM Execution Step . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110

5.16 ZAM Dispatching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

5.17 The LOAD Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

5.18 The WAIT Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112

5.19 The LOADENV Instruction . . . . . . . . . . . . . . . . . . . . . . . . . 112

5.20 The UNIFY Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

5.21 The STORE Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

5.22 The MEMBER Instruction . . . . . . . . . . . . . . . . . . . . . . . . . 114

5.23 The TRYNEXT Auxiliary Instruction: The Empty Case . . . . . . . . . . 114

5.24 The TRYNEXT Auxiliary Instruction: The Extensional Case . . . . . . . 115

5.25 The TRYNEXT Auxiliary Instruction: The Intensional Case . . . . . . . . 115

5.26 The MKEMPTY and MKSINGLE Instructions . . . . . . . . . . . . . . . 116

5.27 The MKTERM Instruction . . . . . . . . . . . . . . . . . . . . . . . . . 116

5.28 The MKINTEN Instruction . . . . . . . . . . . . . . . . . . . . . . . . . 116

5.29 The UNION Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

5.30 The ISECT Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . 117

5.31 The SUCCESS Instruction . . . . . . . . . . . . . . . . . . . . . . . . . 118

5.32 The MU Instruction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118

5.33 The TEST and TESTNATIVE Instructions . . . . . . . . . . . . . . . . . 119

5.34 The SEARCH Auxiliary Instruction: Dispatching . . . . . . . . . . . . . 120

5.35 The SEARCH Auxiliary Instruction: Select, More Intensions . . . . . . . 120

5.36 The SEARCH Auxiliary Instruction: Select: No More Intensions . . . . . 121

5.37 The SEARCH Auxiliary Instruction: Resume: Subgoal Success . . . . . . 122

5.38 The SEARCH Auxiliary Instruction: Resume: Subgoal Failure . . . . . . 123

5.39 The SEARCH Auxiliary Instruction: Backtracking . . . . . . . . . . . . . 123

5.40 Compilation Environment . . . . . . . . . . . . . . . . . . . . . . . . . . 124

5.41 Declaration of Compilation Functions . . . . . . . . . . . . . . . . . . . 125

5.42 Compiling General Expressions . . . . . . . . . . . . . . . . . . . . . . 125

5.43 Compiling Disjunctions and Conjunctions . . . . . . . . . . . . . . . . . 126

5.44 Compiling Literals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

5.45 Compiling Properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127

5.46 Value Types of the ZAM in C++ ......................130

5.47 Control Types of the ZAM in C++ .....................130

5.48 Register Transfer Instruction Set of the ZAM ................131

6.1 Semantic Integration in ZeTa ........................140

6.2 Document Integration in ZeTa .......................141

6.3 Chains of Content Queries . . . . . . . . . . . . . . . . . . . . . . . . . 143

6.4 The XEmacs GUI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

6.5 The Swing GUI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

Chapter 1

Introduction

Ich ging im Walde

So f¨

ur mich hin,

Und nichts zu suchen,

Das war mein Sinn.

(Goethe)

The automatic evaluation of test data for safety-critical embedded systems is an interest-

ing application that can help to put declarative language implementation techniques into

industrial practice. Some studies report that more than 50% of development costs in this

application area go into testing. A setting for test-case evaluation that can improve this

situation is as follows: given a requirements specification by an executable prototype,

some input data describing a test case, and the output data from a run of the system’s

implementation on the given input, we check by executing the specification whether the

implementation meets its requirements. At the first sight, this goal would seem to be

simple, since input and output data are fixed. However, a real-world specification of a

complex system may contain a lot of “hidden” data, which is used to describe the observ-

able behavior. Thus, the problem scales up to finding the solution(s) to (a sequence of)

partial data bindings.

In the application-oriented research project ESPRESS2, which is concerned with meth-

ods and tools for the development of embedded systems, the set-based specification lan-

guage Z [Spivey, 1992] is used for requirements specification in combination with other

notations such as Statecharts which are incorporated by a shallow encoding in Z [B¨ussow

and Grieskamp, 1999]. Performing test-case evaluation in this setting requires a compu-

tation model for Z.

In this thesis, the underlying theory and implementation of a computation model for

Z are investigated. In fact, the work is not restricted to Z, since it is based on a small

intermediate calculus, called



Z, that can embed Z as well as other declarative notations.

The major contributions are the introduction of the novel calculus



Z, the definition of

the calculus’ computation model (operational semantics) and the definition of an abstract

machine for implementing the calculus. In this initiatory Chapter, an outline of these

topics is given.

1Im Goethejahr 1999 haben wir uns vorgenommen, (fast) alle Kapitelwahlspr¨uche beim großen deut-

schen Dichter auszuleihen.

2ESPRESS is a joint project involving industrial partners (Daimler-Benz AG and Robert Bosch AG) and

academic research institutes (GMD-First, FhG-ISST, and TU-Berlin), funded by the German “Bundesmin-

isterium f¨ur Bildung, Wissenschaft, Forschung und Technologie”.

18 Introduction

The



ZCalculus

The



Zcalculus is a pure expression language in the spirit of the



-calculus. Since it can

embed the full Z language – the simple-typed



-calculus, set calculus, predicate calculus,

and schema calculus – computation in



Zcan not be complete. However, a computa-

tion model and implementation by an abstract machine can be provided that is compa-

rable with that of higher-order functional logic paradigms [Smolka, 1998; Hanus, 1999;

Chakravarty et al., 1997].

The



Zcalculus is based upon a small number of constructs, which are tentatively

summarized below (e



EXP being expressions and p



PAT patterns):



EXP



























"!#$



%'&)(



PAT

EXP

The calculus employs constructor terms,





, and the basic operations of set algebra, set

union, intersection and complement. Patterns, p



PAT, are the subset of expressions

built from constructor terms and variables. In addition, the following expression forms

are provided:

The form



eselects the element of a singleton set e.

The form e



-

describes the translation of the set e. The result consists of all

elements that match p

, such that there exists an element in ethat matches the pat-

ternp



under the same substitution. For example,

./103254678/1032:9;7<=>

-

!)?@A0B

(where numbers are special

-ary constructor symbols).

The form







describes a schema (set comprehension). The property eis a set

expression of Boolean type – truth being the set containing a dedicated singleton

element and falsity the empty set.

The form

&)(

edescribes a fixed-point of a set (or tuple of sets) e.

The calculus is able to represent all concepts of Z in a remarkably consistent way. For

example, a Cartesian product e

ED

is expressed as e

F

 

2 G!6

H

  2

G!I2

with

 2 

, the constructor symbol for pairs, used in mixfix notation, and a pattern “joker”

(or anonymous variable). Relational application, Re, is encoded as



8

J5  KA!I2

where

is the

-ary constructor for the unit in Boolean sets. Function application, e



is given as



"

L8

7

 

2 G!M=N 2

E

!MPO

As these examples indicate, the heart

of the



Zcalculus is its novel construct for set translation, which provides a means for

the constructive description of morphisms in the category of



Zsets.

In Chapter 3, we develop the exact syntax and semantics of the calculus and provide

an equational theory for it. The semantic model of



Zis based on a hierarchical typed

universe, where sets are represented as partial characteristic (Boolean) functions in a set-

theoretical sense, which can be founded by Zitself or by ZF set theory. This representation

of sets is a generalization of partial three-valued logics [Owe, 1997] to the case of set

algebra. The equational theory of



Zis powerful enough to induce a narrowing semantics

by a pure expression transformation, as will be shown in Chapter 4.

Introduction 19

Computation in



Several notions of computation are investigated in Chapter 4. We start with a normaliza-

tion of expressions to a disjunctive normal form, which, combined with simplification, is

a prerequisite for all further investigations (including compilation). From the normaliza-

tion, we can easily deduce a general narrowing semantics which is suited for symbolic

“on-line” computation. By restricting the redex selection order of narrowing, several sub-

models are derived.

The reference computation model of



Z, however, is given in the style of natural se-

mantics, constituting a trade-off between computation power and efficient implementabil-

ity, preparing a straight transition to the abstract machine given in Chapter 5 for “off-line”

computation. This model draws a clear borderline between the functional (deterministic)

construction of sets and the logic (indeterministic) deconstruction. For example, in an

expression such as



Q



=-8

R

S

7

(where



is syntactic sugar), the

expressions e



and e

are concurrently reduced in a deterministic functional style. Once

an expression eihas reached a value form, a resolution step is performed, thereby binding

the variable xi.

The reference computation model of



Zis comparable with that of higher-order func-

tional logic languages with concurrent constraint resolution, though it goes beyond this,

supporting disjunction and negation. It supports the free combination of sets (relations

and functions) by set union, intersection and complement, thereby abstracting from in-

tensional and extensional representations. The widely used set-based Z specification style

can thus be fully supported. We give some examples in Z. Given the function f, the execu-

tion of its domain restriction,



=2TOTOTOU2

WV

f, causes no problems. We can also execute

the properties x

YX6ZJ[

for x

]\_^6`

f. We may execute f

, which might become a rela-

tion, if fis not injective. If gis a further function, then f



gand f



gare executable.

Similar possibilities carry over to the schema calculus, which can be fully executed as

well.

The limitations of the computation model are found in the executability of universal

quantification and of complement. The universal quantification,

e, is encoded in



Zas

c.

d

L

 KA!M

. The problem of executing this construct is conspicuously

apparent in the equational theory of



Z: set complement cannot be distributed over hiding

translation. Thus, we cannot expect complements to be pushed down completely to the

expression leafs during the construction of the disjunctive normal form. We deal with this

problem by computing independent “subgoals” for expressions nested inside of comple-

ments. These computations do not contribute to resolution in their enclosing context, and

are bound to the context by residuation. For the universal quantifier, this entails that it can

be computed provided the quantified range is finitely enumeratable.

The Abstract Machine ZAM

An abstract machine – called ZAM – which implements the reference computation model



Zis developed in Chapter 5. In the ZAM, concurrent threads collaboratively work on

a resolution task. The threads synchronize via variables: accessing an unbound variable

suspends a thread and binding a variable resumes threads waiting for it (residuation). The

machine maintains a stack of choice points which is shared by the threads jointly acting

on a resolution. When a thread executes a property such as x



e

, it continues

20 Introduction

with x





and creates a choice point for x



. A simple but powerful optimization

is achieved by an instance of the “Andorra Principle” [Warren, 1990], dynamically low-

ering the priority of threads that try to create choice points, thereby giving deterministic

computation preference.

The compilation of



Zto ZAM instructions is based on the disjunctive normal form

defined in Chapter 4. The instruction set of the ZAM is at a higher level than, for ex-

ample, of the Warren Abstract Machine [Warren, 1983], making compilation easy and

retraceable.

In Chapter 5, we give a comprehensive specification of the ZAM in Z’s sequential

specification style. This specification is intended to be executable using the concepts

presented in this thesis. A prototype of the ZAM has been implemented in C++, and will

be also outlined.

The ZeTa System and Conclusion

In Chapter 6, we give a short overview of the design of the ZeTa system [B¨ussow et al.,

1998; B¨ussow and Grieskamp, 1999], and the role of the ZAP compiler as integrated into

ZeTa. The goal of the ZeTa system is to offer an open environment for editing, browsing

and analyzing integrated specifications assembled from heterogeneous formalisms – such

as Statecharts, Z, temporal logic, message sequence charts, and other. The formalisms are

semantically integrated by a shallow encoding in Z, using the

df<g

conventions [B¨ussow

et al., 1997a]. The ZeTa system is designed to systematically support such encodings

by three dimensions of integration: semantic integration, document integration and tool

integration. The ZAP compiler, part of the ZeTa system, can thus be used not only to

execute Z specifications, but also other formalisms that are mapped to Z and thereby to



Z.Chapter 7 concludes with a summary of the contributions of this work and of future

lines of research. A discussion of related work can be found at the end of each of the

relevant chapters.

Reading Advices

Since this thesis aims at a comprehensive formalization of the concepts it introduces, it

contains a lot of detailed formal material, most of which as been put into figures that are

interleaved with the text. For layout technical reasons, figures occasionally occur before

their contents is explained. It is recommended to read (at least) until the first reference to

a figure in the text until its content is read.

Not all parts of this thesis need to be thoroughly read to get an idea of the main con-

cepts. Chapter 2 may be skipped by readers which are firm to Z and which do not want

to know how we explain certain Z extensions that have common roots in well-known no-

tational concepts (such as inference rules). In Chapter 3, it might be sufficient to read

the introduction to the



Zcalculus, Section 3.1 (on page 38), then the equational theory,

Section 3.4 (on page 57), skipping the proofs in there, and finally the discussion. In Chap-

ter 4, the definition of the distributive normal form in the introduction of Section 4.2 (on

page 74) and the entire Section 4.4 (on page 86) is essential, and, of course, the discus-

sion. In Chapter 5, the reader might find it sufficient to read Section 5.1 (on page 98),

Introduction 21

Section 5.4 (on page 129) and the discussion. Chapter 6 may be skipped by readers not

interested in the application.

Chapter 2

Meta Notation

Seh’ ich die Werke der Meister an,

So seh’ ich das, was sie getan.

(Goethe)

The meta notation used throughout this work is strictly based on the Z language [Spivey,

1992]. A brief introduction to Z is given in Section 2.1 (on page 25). In general, the

reader is expected to be familiar with Z.

One obvious advantage of using Z is that it is standardized1. Since Z is also a formal

language, another important advantage is that it is amenable to machine support. Thus,

most of the mathematical material in this thesis is verified by a parser and type-checker2.

Exceptions to the use of type-checked Z may appear in mathematical phrases inlined in

text and where template notations such as f



=2TOTOTOh2



provide a better way of arguing

than conventional Z. This is seldom the case, and is only applied when properties of

objects are described, not for their declaration.

Using a formal notation such as Z does not necessarily imply using a formal method

in the sense of formal reasoning and proof with machine support. The degree of detail of

description required to this end would be enormous. A significant amount of description

is therefore kept loose – informal assertions are made that are clear to a human reader but

unamenable to a machine. An important example of such usage is when we informally

select the “smallest” solution to some axioms. Though Z is powerful enough to express

such constraints, it would be cumbersome to write them down, and would thereby distract

from the real concern.

The Z language, though a general specification notation, is tailored to the description

of stated-based sequential systems. A typical instance of this usage is the specification of

the abstract machine, ZAM, developed in Chapter 5. However, in other parts of this the-

sis, description techniques such as inference rules and recursive partial functions over free

term algebras (abstract syntax) are required, which are not directly supported by Z’s syn-

tax. Some extensions are therefore made to the language (Section 2.2 (on page 28)), which

are explained as syntactic sugar, which is implemented using the macro-preprocessor of

the E

f<g

type checker. The printed output of the extensions looks natural, and all are

actually type-checked and have a well-defined meaning in Z.

1An ISO standardization of Z is currently in progress. We follow the the draft proposals for the Z

standard, [Z-ISO]. A good text book on Z is written by Spivey [1992]; the innovations in the forthcoming

standard relative to this source are described by Toyn [1998].

2The E

i7j

type-checker (written by the author), which is part of the ZeTa system, is used. See also

Chapter 6.

24 Meta Notation

Z provides a powerful mathematical toolkit for defining basic notions such as num-

bers, relations, and functions. It is assumed that the reader is familiar with Z’s toolkit,

which is a fairly standard account of basic set theory (see Spivey [1992] for a specifi-

cation). Some extensions to the toolkit are nevertheless required: they are defined in

Section 2.3 (on page 32).

2.1 A Sketch of Z 25

2.1 A Sketch of Z

The basic model of Z is reviewed below, without the intention of providing a complete

picture. For a comprehensive description of Z, see [Spivey, 1992; Toyn, 1998].

2.1.1 Types and Sematic Universe

The model of Z is based on typed set theory. Given a collection of so-called given types,

Z types are freely constructed by power set construction,

A,cartesian product,A

D

O7OTOD

An, and binding set construction,



l

nmOTOTOUm



. Binding sets denote

products with named components. A semantic universe for Z is large enough to contain

meanings for these types, where given types may have assigned arbitrary (not necessarily

countable) domains. Because of the type hierarchy, Z avoids Russel’s Antinomy: sets are

well-founded, and no set contains itself.

A distinguishing feature of Z is that types are “first-order citizens” of the language. A

type is represented by an expression which denotes a set. As a set, it just has the property

of being the largest set containing elements of the same type.

Z has a static typing discipline which forbids terms which are not well-typed. A

declaration such as x



Ehas two effects in Z: the variable xranges over the set denoted

by the expression E(x



E), and x is declared to be of some type, which is the largest

superset of E. For example, the constant

represents the given type of integer numbers,

and the constant

plqro

a subset of it. Then, declaring x

;p

lets xrange over

and assigns

the type

to it for type checking.

2.1.2 Schema Text

A central notion of Z is that of schema text, which denotes a set of bindings (tuples with

named components). A schema text, D



P, is built from a set of declarations Dand a set

of properties P. The declarations spawn a binding set, which is constrained by P. The

property is built of first-order predicative formulas over variables representing the binding

components.

For instance, the expression



Aps

represents the set of bindings where both

components xand yare constrained to be natural numbers by the declaration, and where

xis constrained to be less than or equal to yby the property.

In Z, it is always possible to shift constraints induced by declarations to the property

part. For example, the above schema can be transformed into



Jou

lpwv

Schema text is used in manifold ways in Z:

Schema text is used to directly denote sets of bindings in expressions, as in





ps

, which denotes the set of bindings where component xis less or equal to

component y. (We already used this notation above).

Schema text is used in quantifiers. For example, the universal quantifier is written



Q, where D



Pspawns the range of quantified values. This notation

is equivalent to

Q. Similarly,



Q, which is equivalent to

26 Meta Notation

Schema text is used in function abstractions,





E, where D



Pdenotes the

domain of the function. The binding set D



Pis converted into a set of tuples by

removing the names of the components. For instance, the domain of the abstraction



)pzm

)p{

|}C

yis the set of pairs (xrepresenting the first and ythe

second component) with the specified property (y

|{C

). The graph of the function

described is a subset of

k"GpDwp'Dp'

Schema text is used in set comprehension,







, where the tuples of D



are the elements of the comprehended set. A further variant of comprehension,







, allows us to explicitly construct the result tuples, as e.g. in



hp

[WZ<X4SC



c0H:

, which describes the set of pairs where an even number

is related with its direct successor.

Schema text is also used for introducing global constants. Using the notation

the components of the bindings of D



Pare introduced as global constants. If



Pcontains more then one binding, the constants are loosely specified. If D



is empty, the specification is inconsistent.

2.1.3 Genericity

Z supports genericity. Global constants of a specification may be generic over a list of

“formal” types. The meaning of a generic constant is a total function from an instantiation

of the formals into a value. The instantiations are not just type “terms” but arbitrary set

values. This distinguishes Z’s polymorphism from the Hindley-Milner polymorphism

found in functional programming languages and specification formalisms [Damas and

Milner, 1982]. For example, if

is the given type of integer numbers, then a generic

constant such as

"ne

may be instantiated as

"7eo

17

<o

C<

"nLA0B254;

, and so

on.

2.1.4 Undefinedness

The semantics of Standard Z does not prescribe a particular view of undefinedness. The

treatment of undefinedness is left to conventions introduced by specifiers and/or tools.

Throughout this thesis, the following conventions are adopted, which are oriented to de-

sirable properties of a partial logic, as described, e.g., by Owe [1997]:

There is a special predicate for testing the definedness of an expression:

M

eiff e

is defined.

The undefined value can be denoted by the expression



, which is an abbreviation

for



2.1 A Sketch of Z 27

Undefinedness strictly propagates in relational application, e e



, and in relational

test, e R e



A set may have “unknowns”, that is the proposition x



Xmay be undefined for

certain x’s. For schema text, x





Px, if the proposition Pc is undefined for some

c, then c









will be as well. At the same time, for some other c



such

that Pc



is defined, c

L







is defined.

Intersection and union of sets in set algebra or the schema calculus can deal with

unknowns. Briefly, if x





is unknown and x



is false, then x



z

false. This behavior is commutative and associative. Symmetrically, if x





unknown and x



is true, then x



)

is true.

Logical connectives can be seen as a special case of set-union and set-intersection,

and treat undefinedness in a similar way. Moreover, if Pis undefined, then



is as well. The conditional expression,



T'A

A7



, is treated as an

abbreviation for



A



v





, and is therefore non-strict.

Quantifiers are treated as the usual expansion to and-chains and or-chains regarding

undefinedness.

Note that the above treatment still is loose as regards the representation of undefinedness

in a model: we have merely stated some properties. The



Zcalculus is more explicit,

adding a



element to the carriers, which can be seen as a refinement of the above.

In Section 2.2.4 (on page 29), syntactic sugar will be introduced which allows a con-

venient notation for recursive partial functions making use of these conventions – in par-

ticular of the

)M

epredicate.

28 Meta Notation

2.2 Notational Extensions

Throughout this thesis, a few purely syntactical extensions of Z will be used, which embed

well-known mathematical notation techniques in the Z framework.

2.2.1 Where Notation

In Z, axioms of the kind



Qare commonly used at a top-level position of

a specification, where no outer scope exists. For this kind of “shallow” quantification,

the

-prefix notation distracts from the actual specification goal, the predicate Q, which

contains free variables bound by D



P. To enhance readability, we allow the alternative

notation Q

Q'6H



P. The binding priorities for Qare lowered to the level of schema

properties, such that newline and “

” can be used instead of logical conjunction, “

”.

Thus the notation

7m

Fm



Q)AH



is syntactic sugar for



v

-v

-v



2.2.2 If-Exists Notation

An often encountered situation in specification is the desire to formulate a “conditional

let”: if a unique binding for some schema text exists, then an expression should be evalu-

ated under this binding, otherwise an alternative expression should be evaluated. In plain

Z, one has to write

#y





F'A





A7

to achieve this effect. We

allow the notation

/=¡



F)A

A7

as syntactic sugar for the above phrase.

2.2.3 Inference Rule Systems

In computer science, a widely used notation for axioms of the kind



Qare

so-called inference rules:P

We allow this notation as syntactic sugar for the universal quantification.

Systems of inference rules are often used in this thesis to inductivelydescribe relations

on syntax. When defining a relation with an inference rule system, in addition to the

axioms described by the rules, a principle of generation is assumed. We say that a relation

is the smallest one which satisfies the rules. Formally, let R



Tbe the relation defined

(with Tits type) and I

T

upto In



the rules:

£¢}







F



!¤mEOTOTOdm





!¥

In this thesis, a formalization of generation is not usually given. However, it will be

sufficiently treated in the explanatory text.

2.2 Notational Extensions 29

2.2.4 Recursive Partial Functions

A commonly used technique for describing recursive partial functions is to use equations

of the form f E



. Here, the value Eis in f’s domain iff E



is defined. If so, the

mapping E





is in the functions graph. For example, an interpretation function which

maps terms to semantic values is defined iff the interpretation of subterms is defined. This

technique is indispensable for readability in those situations where the description of the

definedness and mapping cannot be adequately partitioned.

Throughout this thesis, we use the notation

¦





BOTOTO¨¦ O7OTO





Q)AH



as a syntactic abbreviation for the Z form:





©wX6ZJ[

§$ª )M





zv

«8X6Z<[

§©x

¬







vuO7OTOAv



X6ZJ[

§ª M



zv

wX6ZJ[

§-x







In general, with this style of definition, the left-hand sides Eimay be not exhaustive.

Moreover, we may spread equations for one function E

over several paragraphs. We

normally choose the smallest solution for a function defined by a system of recursive

partial definitions, given in one or several paragraphs. It is constructed similarly as for

relations (see Section 2.2.3 (on the facing page)).

If patterns of definitions overlap, we treat them to be priorized by textual order. For

example, the following recursive function definition removes each second element in a

sequence:



remove

4®6"n

¯ 17

remove

4"°

±)²

¦ °

±²

remove

4J



 

 ¦

Q'6H



317

The second definition case captures any sequence, but has less priority than the first case,

such that sequences whose length is greater then one are handled by the first case.

2.2.5 Local Variables in Schema Boxes

When specifying sequential systems using schemas in vertical box notation, one often

encounters the necessity of introducing local variables. In Z, this is has to be notated as

shown in the left-hand side below:

30 Meta Notation



State

Locals

v

OTOTOAv



State

Locals

.. .. . .. .. . . .



O7OTO

One obvious problem of this notation is that it becomes necessary to separate the proper-

ties Piin the context of the existential quantifier by “

” instead of line break or “;” which

would be more readable. We therefore introduce the notation on the right-hand side as a

shortcut for the left-hand side.

2.2.6 Case Distinction in Predicates

Case distinction in predicates must be denoted in Z as on the left-hand side below. We

introduce the notation on the right-hand side as a much more instructive variant:



State

³´Cx





µ]0



³¶Cx





r0



State



³´C

T'A

J

µ]0

A7



r0

2.2.7 Selective

When specifying state transition operations in Z’s sequential specification style, one uses

the



Soperator to create pre and post state versions of the fields of the schema S. If

certain fields are not going to be changed, equations such as x





have to be added to

the operation schema. Apart of notational noise, this diminishes localizability of changes

to S.

We therefore use the notation

¸

72TO7OTOh2

Sto express that, from the schema S,only

the variables xiare changed, and all other keep their value. This is syntactic sugar for

¹

HºJ

»R

=2TOTOTOU2

1rºJ



»R





2TOTOTOh2



1G!

2.2.8 Overloading

In a large Z specification that introduces hundreds of new names, keeping all these names

disjoint decreases the readibility when similar things need to carry different names. For

example, suppose a function which delivers the set of free variables of different types

of terms: naming this function

¼H^¨\¾½P¿IÀ<ÁFÂ

¼H^¨\Ã½"ÄFÅÇÆHÈ"É;Â

and so on, does not contributes to

readibility.

We therefore allow the selective use of ad-hoc overloading. Actually, this overloading

is implemented by using different L

TEX markup names for Z symbols which expand to

the same representation.

2.2 Notational Extensions 31

2.2.9 Further Extensions

The E

f<g

type checker, developed in ESPRESS and used for processing the mathemat-

ics used in this thesis, adds some further extensions to the Z notation, which have been

proposed to the Z ISO Panel:

The order in which paragraphs appear in a specification is arbitrary. The only re-

quirement is that the definition-use relation of paragraphs is acyclic. (A paragraph

is in definition-use relation to another paragraph if it introduces a name that is re-

ferred to by the other paragraph.)



and

are introduced as expression operators.

Mutually recursive free types are allowed. This is (roughly) explained as follows.

A free type definition, T

Ë

variants, introduces two paragraphs: one containing

the definition of the given type



, and a second containing the free type axioms

derived from the variants. Given the freedom of order of paragraphs, this allows

arbitrary recursive dependencies between free type names and other definitions.

Global constants can be declared multiple times, provided all declarations are type-

compatible.

A detailed justification, including the semantic foundation, is givenin [Fett and Grieskamp,

1998].

32 Meta Notation

2.3 Mathematical Tools

Standard Z provides a rich set of “mathematical tools” for working with sets, relations,

and functions. It is assumed that the reader is familiar with Z’s mathematical toolkit (a

specification can be found in [Spivey, 1992]). Some further tools are required in this

thesis, and will be defined below.

2.3.1 Sets and Relations

The Z toolkit is fairly complete as regards basic functions on sets and relations. For our

present purposes, it lacks only one concept: the reduction of a set of values by a binary

function.

edenotes a function that takes a set and reduces its elements using the function fand

the starting element e.fmust be right-commutativew.r.t. the argument set (f



n2



1z



n2

1

) since the order in which reduction is performed is not determined:





D

¯ k

¦ 













Î¾b







Ï

3"1"

Q'6H



;k

This definition makesuse of the syntactic extension fordefining recursive partial functions

(Section 2.2.4 (on page 29)). The domain of the function

eis induced by its defining

expression, which is (for non-trivial s) only defined if fis right-commutative and defined

for all elements.

2.3.2 Sequences

The Z toolkit lacks some typical functions known from functional languages for working

with sequences, which are defined as follows. With x

Ð

s, the element xis prepended

to the sequence s, with s

NÐ

xappended. By

Ñ/ÒÓ©

n2

n

, the sequence is delivered where

the elements are pairwise combined. The phrase



JÔ

sdenotes the reduction of the

function ffrom right to left on the sequence s, with starting value e; the phrase



JÏ

denotes the reduction from left to right.



Ð Õ%



6"n

±)²

Ð Õ%

617



²°



Ñ/ÒÓY3"n

D¸"7

Í "7d



Ñ/ÒÓ"°G±n2/°¤±5 ¦ °¤±

Ç

ÐÃ

U¦ 

LÐÖÑ/ÒÓ



Q'6H



6"n



6"n

2.3 Mathematical Tools 33



Ô "

D

D8"n

 Ô "

:2/°G±5 ¦

 "

:2

Ð

U¦



2F

JÔ



Q'6H



B"7



Ï 1

D

D8"7

 Ï "

:2/°G±5 ¦

 "

:2

×Ð

)¦



2F

JÏ



Q'6H



B"7

With f

Ø



, the “context mapping” of the function fto the elements in sfrom left

to right is denoted. eis an environment or “state” parameter, which is threaded through

the mapping:



Ø 

-D

D17

dÍ

D17

 Ø 

2F

2F°G±5" ¦ 

2/°G±5

 

2F

ÐÃ

"U¦ 6

_2

 L

Ù

L6"7





¾2





:mW

 Ã2

z

Ø-

Ã2





  2

ÐÃ



Q'6H



6"7

2.3.3 Booleans

In the forthcoming Z ISO Standard, schema expressions and value expressions have been

unified. Moreover, schemas with an empty signature are allowed (for example,

¾

This opens up the possibility of defining the type of Boolean values as the set of schemas

to the empty signature, and use schema reference in a predicate to refer to a Boolean

value. The type

is defined below following this approach:

ÚÕ£k¬ ! Û

\¥Ü/ÝWÞ_

true

^6à ½IÝWÕÞ¾

false

c

;kÚ

_¨y



! âãÕ£

;käÚ

¾7b



With

s, the members of the set of booleans sare disjuncted, with

s, they are con-

juncted. Note that Booleans are truly two-valued, in contrast to truth values as discussed

in Section 2.1.4 (on page 26).

2.3.4 Order and Lattices

Ordered sets and lattices provide a fundamental theory for computer science and discrete

mathematics. Below, some basic notions that are required later on in this thesis are for-

34 Meta Notation

malized in Z. The definitions and theorems are mostly taken from [Davey and Priestly,

1990].

An ordered set is a set equipped with a (partial) order,

, which is reflexive,

antisymmetric and transitive:



å 



Q'6H



The axiom

 å =

!Ù

Eis used throughout this thesis to state that “the” partial order of

a given set Aregarding lattice properties is equal to the expression Eof type A

A. For

example, the following axiom states that the partial order of sets of some given element

type FOO is set inclusion:



FOO

!  å 5çk

FOO

!Ù{ + 

The technique used for this kind of “overloaded” definition of

, where meanings

are assigned in dependency of generic instances, is comparable to Haskell’s type classes

[Wadler and Blott, 1989; Hudak et al., 1992]3.

From the order of a set X, its dual order is derived by relational inversion:



ÕÞ å =

Given an ordered set S

X, the supremum (least upper bound) and the infimum

(greatest lower bound), if existent, are denoted by

S):



é27êë;k

¦ 6

Õì



Tb

L

¾







7b

Ù

Ã

Q'6H

;k

¦ 6

Õí



7b

Ù

Ã







7b

d

Ë

Q'6H

;k

To build the supremum (infimum) of two-element sets



S+

X, the notation x



) is used. The former is called the join, the latter the meet:

3Standard Z, [Toyn, 1998], relaxes restrictions found in the Z of [Spivey, 1992] and permits such use

of genericity. Semantically, specifying

generically for all possible instantiations does not cause a

problem: for any set A, there exists at least the empty order,

ñ ð òIó

ôAõö

2.3 Mathematical Tools 35



î 2 ï 

 î ¬

)¦ é



Q'6H



 ï ¬

)¦ ê

¾

Q'6H

d

Alattice is a set L

X, where for all pairs x

Ù

L,x



and x



are defined. The

set of all subsets Lof Xwhich are a lattice is described by

à>^

X. In a complete lattice, for

each subset L

Ù+

Lthe supremum (infimum) exists.

L) are called the top (bottom)

element of a complete lattice L:



à>^

Õí

Ak

7b









$8X6ZJ[ î v



8X6ZJ[ ï :



à>^

Ä Õì

Çà ^

7b



;k



X6ZJ[é÷v



8X6ZJ[êl



Õ%

Jà>^

 Õ%

Jà>^

A well-known structure related to a complete lattice is a cpo (complete partial order).

It has a bottom element and behaves like a complete lattice as regards the infimum. For

the dual, it is only demanded that for every directed subset of S, the supremum exists.

Adirected set is one where for any finite subset the upper bound is in the set. Cpos are

commonly used for modeling the semantics of programming languages; however, for our

set-based calculus, the more general notion of lattices is better suited.

For complete lattices (and cpos), theorems exist that can characterize computability.

These will be briefly reviewed below. Let f



Lbe a function on a complete lattice

L. The set of fixed-points of fis defined as the set









f x



. The least fixed-point

exists if the infimum of this set exists (the set is not empty) and is a fixed-point (is in the

set). It is known that if fis order-preserving (monotonic), then the least fixed-point exists

for complete lattices. If fis also continuous, then the least fixed-point can be expressed

by the image of funder the supremum of the set of finite elements in the domain of f.

Continuity in this context means that for any subset S

L, it holds

é

ú5E

.é



We denote the set of continuous functions over a base type Xas X

¯¥û



¯¥û Õì



/X6ZJ[

là>^

vs\_^6`

à>^

_b

;k¥X6ZJ[



é

úP)

Ié

1:

Afinite element of a complete lattice is one where there exists only a finite number of

36 Meta Notation

elements in a chain starting at



and reaching the element. Let F

+üX6ZJ[

fbe the non-

empty set of all finite elements in the domain of f. Then, the central fixed-point theorem

for continuous functions and complete lattices states:



¯¤û

x êl

BX6ZJ[





f x



Ié



Thus, putting the supremum of maximal finite information into fyields the (possibly,

infinite) least fixed-point. Since (by continuity) fis order preserving, this (indeed “ideal”)

maximal finite information can be incrementally computed by successive application of f

starting at



L. These results are used in Chapter 3 to justify the fixed-point law of



Chapter 3

The

ZCalculus

So gestaltend, umgestaltend –

Zum Erstaunen bin ich da.

(Goethe)

The



Zcalculus is a set-based expression language which can embed set-oriented nota-

tions such as Z as well as functional and logical programming languages. The calculus

is designed for the automatic analysis and transformation, where it is desirable working

with a small set of language constructs. However,



Zis not “as small as possible”1. The

calculus has redundancy in that it allows extensional as well as intensional description of

the same meaning. For example, the set containing the numbers

and

can be denoted



Zas

A0Bw¨4A

or by the schema





þ0'

ÿ4;

. The support of both inten-

sional and extensional description allows the definition of computation as a source-level

transformation from intensional towards extensional representation, as shown in the next

chapter.

The language constructs of



Zare essentially those of set algebra (set intersection,

union and complement), set comprehension and a novel notation for set translation.

Moreover, singleton sets can be constructed and deconstructed. The calculus entails a

notion of free constructors, which are used to model tuples, Z-style bindings and free

types. As will be seen, the



-calculus, predicate calculus and schema calculus can be

derived from this basic set of operators.

In this chapter,



Zwill be introduced and formally defined. A semantic model is

developed, and the language constructs will be described one-by-one, giving their typing

rules and semantic meanings. An equational theory for



Zis developed. The chapter is

concluded with a discussion of some aspects of the calculus and of related work.

1Higher-order logic [Gordon and Melham, 1993], for instance, has a smaller number of language con-

structs.

38 The



ZCalculus

3.1 Introduction to



We give an introduction to the



Zcalculus and discuss the encoding of Z as well as logic

and functional logic programming languages in



3.1.1 Syntax and Informal Meaning

Let x



VAR be variable symbols, and let

%

CONS be (term) constructor symbols.

The



Zcalculus provides the syntactic categories of expressions, and a subset of them,

patterns:



EXP



























"!#$



%'&)(



PAT

EXP

The form





describes a constructor application, where eis a sequence of expressions.

(We generally use the convention of denoting sequences of terms tby t.) The set of pat-

terns PAT

EXP are those expressions that are solely built from constructor application

to patterns,





, and variables, x.

The form





denotes a singleton set. The form



eis the singleton set selection that

returns the element of a singleton set, and is undefined for each other set. The usual forms

for set algebra are provided: the empty set is denoted by



, set intersection by e





, set

union by e





and set complement by



The form e





describes the translation of the set e. The result of the transla-

tion is the set of all elements that match p

such that there exists an element in ewhich

matches the pattern p



under the same substitution. Thereby, both patterns may contain

unbound variables, which may be denoted by wildcards ( ). For example. For exam-

ple,

I/10B254An$/I032P9;nJ=>

2 

(where numbers are special

-ary constructor

symbols) equals to

A0¨

. The cartesian product of two sets, e

D

, can be expressed in



Zas e

F

 

2 G!6

H

  2

¤!

(

 2 

being the constructor symbol for pairs used

in mixfix notation). Conversely, the range of a binary relation e(a set of pairs) is de-

noted as e

N 2

®

. We call a translation with

¼H^¨\Ã½

#+ ¼H^¨\¾½

an embedding and

with

¼H^¨\¾½





@¼H^¨\Ã½

ahiding. Note that there might exist translations which are nei-

ther embeddings nor hidings. In general, translations are capable of expressing all kinds

of specification morphisms induced by signature morphisms as used in algebraic spec-

ification [Ehrig and Mahr, 1985; Wirsing, 1990]2, and thus can be seen as a means for

constructive description of (some) morphisms in the category of



Zsets.

The form







describes a set comprehension, or a schema in Z terminology3. It

denotes the set of values that match the pattern psuch that the Boolean expression eis true

under the binding of variables resulting from the match. Thereby, a Boolean expression

is just a set expression over the element type of units (a type containing just one element):

let

be the 0-ary constructor for this element: then truth is the set

FK

, falsity is the

2It is easy to see that sets of tuples or bindings (tuples with named components) correlate to sets of

models/algebras.

3We prefer the notion “schema” since the structure of the generating pattern, p, is preserved, whereas in

Z’s set comprehension pis reduced to its characteristic tuple.

3.1 Introduction to



Z39

empty set, conjunction is intersection, disjunction union, and negation complement. For

example, the (sugared) schema

;

w









describes the set of triples

where each member is larger than the previous one. The Boolean expression x



yis

syntactic sugar for the form

1



Jw;

"J=  KA!

(where



is a binary relation,

a set of pairs). The conclusive translation can be read as: we are not interested in the

actual result of the intersection, we merely want to know if it is empty or not. The entire

schema literally reads as

;

«;1



J8;

:<=  KA!  1



J8;

:<= ¶K;!.ÖO

The form

&'(

edenotes the smallest member of the set









(where the equal-

ity abbreviates

I

-w

J=  K;!

). The order needed here is discussed in Section 3.2.6

(on page 48). If a smallest member does not exist, the form is undefined.

The



Zcalculus uses a shallow polymorphic type system. To this end, constructors



have associated a (generic) type, and expressions are assumed to have a principal type.

A further condition for well-formedness is that patterns in translations and schemas are

used linearly (no variable is allowed to appear twice).

For commonly used forms of expressions, syntactic sugar is used, the most important

of which is the following:

 









¶KA!









I











s





is the “universal set”, the largest set of its according type domain.



eis an abbreviation

for the Boolean expression which is true iff the set eis nonempty. The two other forms

represent membership and equality.

3.1.2 Encodings

Constructs such as function application,



-abstraction, membership-test, equality, exis-

tential and universal quantification can be naturally encoded in the



Zcalculus. We

demonstrate this by sketching the encoding of Z in



Z. A brief comparison with logic

programming languages (horn-clause systems) and with the functional logic language

Curry [Hanus, 1999] is also given, anticipating some basic ideas underlying the compu-

tation model of



Encoding Z

Lambda Calculus. The operations of the simple-typed



-calculus are encoded as fol-

lows:





;

«



 

~

varse









1

w

ÃL

 

2 G!M=N 2

z

!M

This treats functions as sets of pairs. The encoding of the application does in fact also

capture relational application as used in Z: it is sufficient for eto be a binary relation that

is right-unique and defined at the point e



40 The



ZCalculus

Predicate Calculus. We have already noted that formulas are a special case of set ex-

pressions over the element type of units, and we have defined abbreviations for the basic

predicates of membership-test and equality. To complete the picture, we give the encoding

of quantifiers:













r





¨

J

The resulting semantics of the encoding of predicate calculus is a partial three-valued

logic (see Section 3.2.1 (on page 43)).

Schema Calculus. In Z, a schema denotes a set of bindings, bindings being tuples

(records) with named components. The set is constrained by the declarations of a schema

as well as by the property. For example, in Z, the schema



Lpzm

Öor



denotes

the set of bindings



Õ



, where x

p

o

, and x



Assuming



Zconstructors for bindings,



EÕ 2TOTOTOU2

Õ



, where the ai’s are

interpreted as part of the constructor name, the above Z schema can be encoded in



Zby





Õ





p





The constraint y

o

vanishes since

is a given type. The property x



yis further trans-

lated as described in the previous section. Other possible encodings exist, for example

p¬





Õ



I!3w



Õ









The connectivities of the schema calculus directly map to



Z. Schema conjunction in

Z, S

v

, is the set of bindings with the joined signature of S



and S

where components

with the same name are identified. This is expressed in



Zas

T





=



!6

F





7



!I2

where



is the pattern for the joined signature. For example, the Z schema conjunction









!!hvr







!!

(where A,Band Care given types) maps





Õ







!¥



Õ



¬



Õ



I!





Õ



ä



!¥L





Õ



¬



Õ

Õ 2

Õ



Existential schema quantification,



, is encoded as

















G!6



=







z







dÏ





G!

where





'Ï







denotes the removal of the components from Sin S



. Similarly, the

remaining connectivities of the schema calculus are encoded.

Note that



Z’s support of the schema calculus clearly shows that



Zis capable to

express relations between entire specification units, modules, etc.: Z schemas can be

thought of as an algebra on the calculus level.

3.1 Introduction to



Z41

Encoding Logic Languages

A logic program in a language such as Prolog consists of a set of Horn clauses A



n27OTOTOU2

Bn, where Ais called the conclusion and the Biare the premises of the clause.

The Aand Biare so-called atoms of the form R





, where tis a sequence of terms over

constructor symbols and variables, and Ris a n-ary relation. We can map Horn clause

systems to



Zas follows. Let A







and Bi







; then the “body” of the clause is

represented as

;

WÇ

5-

LìOTOTOP

-



. To define the meaning of the relation

R, we collect the bodies of all clauses in which Ris the head of the premise, as follows:

í;

:E;







{O7OTOP







OTOTO



;

E;







ìOTOTOP

snm



Qnm



We proceed similarly with the clauses for Qijin the system. If Rand Qijare recursively

dependend, the fixed-point operator will be used. Answering a query R





then merely

means computing the intersection R

8;

:

Encoding Functional Logic Languages

Languages such as Curry [Hanus, 1999] or Goffin [Chakravarty et al., 1997] can, in prin-

ciple, be encoded in



Z. The exact operational semantics will not be the same, since

a strict reduction order for



Zis preferred by design, whereas both Curry and Goffin,

which are extensions of Haskell, use lazy reduction. An illustration of such an encoding

is instructive, anyway.

A function definition in Curry is mapped as follows (fbeing a function symbol, pa

pattern, ca constraint, and ean expression; the fresh variables yimay not appear in the

patterns pi):

f p



]

=mWOTO7Ohm

f pn







&)(

*®;

n2

5E

Ù

$

=-{O7OTOP8;

E







Thus, a constraint cof Curry directly maps to a corresponding constraint in



Z. Con-

junctions of constraints are expressed by intersection. Calling a function that returns a

constraint with a free variable as the argument, such as Cx, amounts to membership test,



C. In fact,



Zallows a richer constraint language than Curry (in particular, disjunc-

tion and negation), which, however, will not always be executable.

The computation model of



Z, described later on, executes constraints concurrently,

using both residuation [Ait-Kaci et al., 1987] and backtracking and enumeration to deal

with free variables. An access to a free variable is suspended in a nested context. This

is, for example, the case for arguments of the



-operator (and therefore for function ap-

plication ee



which is defined using



) or for negative constraints. Backtracking takes

place for constraints which map to the membership test of a pattern in a set. For exam-

ple, consider the definition of an “append” relation on lists (below, an abbreviation for

introducing local variables is used:







is equivalent to

;

«

LN



42 The



ZCalculus

&)(

app

*®;"°G±n2

e



-

;

Ë

LÏ

;

$

app





Ë



Applying this relation to unbound xs and ys and bound zs will compute the possible par-

titions of the list zs, as usual in logic languages.



Zis higher-order because sets can be stored in constructed values. Similar as Curry

or Goffin, the computation model does not employ higher-order unification. This restric-

tion is, though, not tied to the notion of a type of a value (whether it is a set or not) but

to the representation: sets that are extensionally represented are incorporated by a simple

equality test in unification. An extensional representation is given by applying construc-

tors, singleton sets and set union to extensional values.

3.2 Semantic Model 43

3.2 Semantic Model

The model of



Zis constructed by an interpretation function over terms of a type lan-

guage. We will first discuss the concepts underlying the set representation in the model

and then define the type interpretation.

3.2.1 Set Representation

The model of



Zis based on a hierarchical typed universe, where sets are represented

as partial characteristic (Boolean) functions in a set-theoretical meaning, which can be

founded by Z itself or by Zermelo-Fraenklin set theory. We will use Z to this end.

Let Ube the domain of some type. The representation of an element of the powerset

of this type is a partial function f



Í Ú

. The function fcharacterizes the knowledge

about the elements in the set in the following way. A value u



Uis

1. a member of the set, if u

8X6ZJ[

f u;

2. not a member of the set, if u

X6ZJ[

v

f u;

3. unknown to be a member or not, if u

~8X6ZJ[

As an example, consider the (sugared)



Zschema



0EXJÒ ¼

 0B

. For this set,

known to be member, and all natural numbers greater then

are knownnot to be members;

however, for

, no information is available, since the expression

0LXJÒ ¼C

is undefined. Thus

this set is represented as the Boolean function



;p Ï HC<

¾

{0n!¤O

The given representation of sets allows for a natural definition of the set-algebraic

operations. Let f

n2

be the representations of two sets. The intersection is defined as



-?

¥X6ZJ[

¸X6ZJ[

nd



7ùM

^6à ½IÝ¨¨úU

7ùM

^6à ½IÝ¨¨ú

¾



!¥2

and the union is defined as



©?

¥X6Z<[

¸X6Z<[

7d



7ùG

\¤Ü/ÝB¨úU

7ùM

\¥Ü/Ý¨¨ú

_



!¤O

Thereby, the conjuction in our meta-language Z,

, is defined if one of the operands is

false, and disjunction,

, if one of the operands is true, independent of the definedness

of the other operand. Accordingly, the domains of the resulting characteristic functions

are constructed: we add all those elements to the domain for which the one or the other

operand already determines the result.

As an example for the effect of the set operators, consider the expression



A0ÖXJÒ ¼



0¨



|üC<

. The representation of the left set operand was already defined above.

The right set operand is given by the function



;p

_

|£C¨!

. The intersection of both

representations according to the above definition yields the function



;p

¾;0$XJÒ ¼

{0Wv

|£C¨!¥O

44 The



ZCalculus



TCONS

TVAR

^¨\¥Ò





TCONS

¯ p

TYPE

Ë s  !M



h/Q

TCONS



317

TYPE

H^¨\¥Ò



®

 

d



k



TYPE



l 



TVAR



Figure 3.1: Type Language

Whereas the left-operand is not defined for

, the result is (with

known not to be member

of the resulting set).

An interesting aspect is the definition of set complement in our set representation. It

is straight-forward:



?

3X6Z<[



f x

Thus the complement does not alter the domain of the characteristic function. What is

unknown to be member of the set keeps to be unknown. As a consequence, the “principle

of the excluded middle” will not hold in our framework: f

£



does not hold.

However,

ÿ

fholds. The valid laws will be discussed in more detail in Section 3.4

(on page 57).

The representation of sets and the definition of the set-operations provides a general-

ization of what is called “three-valued” logic in the literature to the case of set algebra.

We have already discussed that truth-vales are represented as sets over the “unit” type

with one member, denoted by the constructor

. The empty set,



, represents falsity, and

the singleton set containing the unit,

FK

, truth. Intersection, union, and complement over

sets of this type behave as conjunction, disjunction, and negation in the three-valued logic

described by Owe [1997].

3.2.2 Types

The model of



Zis constructed by an interpretation function over terms of a type lan-

guage. Terms of the type language,



Þ

TYPE, are built from type construction

L



Ç!

(where

 

TCONS are type constructors with arity), from power-set types



, and from

type variable application,



(

%

TVAR). The definitions are given in Figure 3.1. Note

that the Cartesian product is introduced by certain type constructors, which will be dis-

cussed in Section 3.2.4 (on page 46).

We suppose standard syntactic tools on type terms (Figure 3.2 (on the facing page)).

The set of free type variables in a type is deliveredby

¼H^H\¾½



. The subset of types which are

ground (contain no type variables) is denoted with TYPE

. A type variable substitution is

a partial function



TSUBS



TVAR

TYPE, such that

X6ZJ[

W

¼H^¨\¾½¬ùM\_^6`

ú5



The application of a substitution to a type is denoted as

½=Ü

/½

(!)

.TSUBS

denotes the set

of ground substitutions. The most general unifier of two types is described by the relation



T-

+*,

. These standard concepts are not formalized.

3.2 Semantic Model 45

TYPE

w;k

TYPE TSUBS

Õ

TVAR

TYPE

TSUBS

Õ

TVAR

TYPE

¼H^¨\¾½

TYPE

¯ k

TVAR

½=Ü

/½

TSUBS

TYPE

 ;kz

TYPE

TSUBS

TYPE



Figure 3.2: Type Language: Syntactic Tools



UNIV

Ü6`/X6Ý



UNIV

½1ÝH[ 

TYPE

¯ k

UNIV

,



TYPE

Ü6`/X6Ý

8½IÝ¨[

-

G





TYPE

½IÝ¨[

.

Ï 6Ü6`/X6Ý

3$ÓH^¨\

Ò ZJ`

UNIV

Ï 6Ü6`/X6Ý



/'0



UNIV

Í Ú

UNIV

3'0



UNIV

Í 

UNIV

Í Ú¬

,



TYPE

½IÝ¨[

4

Í Ú+]X6Z<[

/'0

3'0



/'0

½IÝ¨[¶¤k

5

L)¦ 

6½IÝ¨[

.

Í Ú

/'0



Q)AH

6



TYPE

Ü6`/X6Ý



/'0



Figure 3.3: Semantic Universe

3.2.3 Universe

The semantic universe is denoted by the Z type UNIV. It contains a special element,

Ü6`/X6Ý

, representing undefinedness. The universe is partitioned by the type interpretation

½IÝ¨[

.

over ground types, modulo the element for undefinedness, which is member of each

interpretation. The definitions are given in Figure 3.3.

The following remarks on the semantic interpretation

½1ÝH[

in Figure 3.3:

For type construction, the partitioning axiom already states everything required:

a type construction

L



Ç!

delivers a subset of the universe disjoint from any other

type interpretation, except that the element for undefinedness,

Ü6`/X6Ý

, is shared.

More constraints on certain type constructors are added in Section 3.2.4 (on the

next page), letting their interpretation being generated by value constructors.

As discussed above, a powerset type,



, denotes all the partial Boolean functions

over the domain of the meaning of



½IÝ¨[

7

®Í Ú

. To map this representation into

UNIV, we define an encoding function

/'0

as a partial injection which is defined for

values which can be generated from a ground type. The inductive definition of the

domain of

/'0

ensures that an encoding exists, and contradictions such as Russel’s

Antinomy are avoided.

The same value for undefinedness is shared since by the interpretation of powerset

46 The



ZCalculus



CONS



ÓBÝl

TCONS

Í "n

TVAR

D

CONS

Í "n

TYPE





ÓBÝUR 617

TVAR

8



CONS

Í "n

TYPE





lc^¨\¥Ò



®v¾b  6X6ZJ[

¼H^¨\¾½ùM\_^6`;

8

UJ.ú5+%\_^6` ¬:

Q'6H R6X6Z<[



ÓBÝ

G«R6X6ZJ[



ÓBÝ

X6Z<[S



ÓBÝUJPO4A$ÓH^¨\

Ò>ZJ`

CONS



ÓBÝ©2

Ò`/½



CONS

¯ "n

TYPE

Figure 3.4: Free Types: Syntax

types the totally undefined Boolean function,







¯ Ú

, which is identified with

Ü6`/X6Ý

in a power-set domain, cannot be distinguished. Reusing

Ü6`/X6Ý

over different types causes

no problems, since only typed expressions are considered in the



Zcalculus.

3.2.4 Free Types

The type constructors represent by default abstract, “given” subsets of the universe. For

some type constructors, this set may be generated by a collection of value constructors, a

commonly so-called free-type construction. A value constructor is a total injective map-

ping from a (possibly empty) sequence of values into a given value. A system of value

constructors effectively describes a many-sorted term algebra embedded into the model



The function



ÓBÝ

(Figure 3.4) associates a free type definition with certain type con-

structors, which consists of a sequence of type variables matching the type constructor’s

arity, and a mapping from constructor symbols to argument types which are closed under

the type variables. The given set of constructors, CONS, is partitioned by the constructors

of the free types.

For convenience, we introduce an auxiliary function which delivers the type of a value

constructor,

:

Ó3Ý®



R L )!¬w17

TYPE

TYPE. Furthermore, we assume there is a

function

Ò `/½



, which delivers the type of a constructor with “fresh” type variables.

The semantics of a value constructor is described bythe function

½1ÝH[

ÕR817

UNIV

UNIV, which delivers for a ground type substitution and constructor a total injection

w.r.t. the meaning of the constructors argument types into the type constructor’s domain.

Given this function, the domain of a type constructor is partitioned by the range of the

meaning of all its value constructors (Figure 3.5). We have formally stated freeness (con-

structors are injective) and generation (by the partitioning). It is further demanded that

the generated type’s domain,

½IÝ¨[½=Ü

/½

!

RL



J!.1

is the smallest which satisfies the above

property. A formalization is left open.

There may be free type configurations which are inconsistent: consider a constructor

with type

°Gk®;± 

. There is no solution by the above axioms for such a constructor,

and therefore it is not allowed. In Z one can further restrict the domain of constructor

functions, such that (in the above notation) one gets e.g. a type such as

;±  

for a

constructor, where

Õ

is the set of finite subsets of



. Thus Z’s solution to the problem

is to use partial injective constructor functions regarding the basic type of the arguments

3.2 Semantic Model 47

½1ÝH[ 

TSUBS

®¯

CONS

Í 17

UNIV

_bR6X6ZJ[

R8X6Z<[S½1ÝH[

'zv

½1ÝH[

Õ] 

317

UNIV







>



8

UJzv

_b

6X6ZJ[

½1ÝH[¥½5Ü

/½



U

"""

½1ÝH[]½=Ü

/½

QL



Ç!M1"dÏ6Ü6`/X6Ý

3

GW 3X6ZJ[

\_^6`;¥½IÝ¨[

hJ"$ÓH^¨\

Ò>Z<`½IÝ¨[½=Ü

/½

!

RL



<!.1hÏ 6Ü6`/X6Ý



Q'6H R6X6Z<[



ÓBÝzm

l

TSUBS

Um 6"n

TVAR



6"n

TYPE



CONS

Í "7

TYPE



 $2



:

Ó3ÝhÖm«\_^6` +]X6ZJ[



R{ dû 

Figure 3.5: Free Types: Meaning

°G±ä

TCONS

K 

CONS

:

Ó3Ý$°¤±e{



27FK® °G±53

ÚrÕ%k#°G±6m

ÛMÛ

Õ%

6½1ÝH[°G±

\¤Ü/Ý3m

Õ%

6½1ÝH[°G±

^6à ½IÝ

Figure 3.6: Free Type of Booleans

(the basic type of



k

). Since a clean equational theory for



Zheavily depends on

the fact that value constructors are total, this approach is not feasible for



Z. Instead,

a solution to the problem is to encode constraints on constructor arguments in special

given types (thus



would be a given type), and provide primitive (partial) encoding and

decoding functions which interpret these types. This approach is, however, not detailed

here.

3.2.5 Standard Types and Constructors

Some standard types and associated constructors with sugared notation will be defined

below.

Units and Booleans. The unit type,

°G±

, contains exactly one element, denoted by the

0-ary constructor

(Figure 3.6). The type of Booleans is a powerset of the unit type, and

is abbreviated as

. The semantic values

ÛMÛ

and

are the characteristic functions mapping

the unit value to true and false, respectively.

Natural Numbers. The type of natural numbers is available. It is generated by the

countable set of

-ary constructors

CÙ270J2TOTO7O

, denoted by the total injection (Figure 3.7).

ps

TCONS

Ap

CONS

:

Ó3Ý)pÞ



25«RÇ\¾^A`; 

°G±5

Figure 3.7: Free Type of Natural Numbers

48 The



ZCalculus

+



TYPE

=2

«6½1ÝH[

-

Í Ú

/'0

«å

/'0

ª

_b

6X6ZJ[

)X6Z<[



v

_b

6X6ZJ[

)ÏeX6ZJ[





zv

_b

6X6ZJ[

ÏeX6ZJ[









^6à ½IÝ32



^6à ½IÝ¨

6





\¤Ü/Ý32



\¥Ü/Ý¨





^Aà ½1ÝH





\¤Ü/ÝH





^6à ½IÝB2



\¥Ü/ÝB



\¥Ü/Ý32



^6à ½IÝ¨





^6à ½IÝ¨





\¥Ü/ÝB

+



TYPE

½1ÝH[]Gk



Ùà ^

UNIV

Figure 3.8: The Order on Partial Sets

This forces the interpretation of

in the semantic universe to be actually isomorphic to

natural numbers. Requiring the existence of this type ensures that our semantic model is

not trivial.

Bindings and Products. A family of type constructors and value constructors for bind-

ings is available. Bindings are records with named components, called fields. A binding

type constructor is denoted as

e

D

F=mWOTO7OUm



E

. This type is generated by exactly one

constructor,



eÕ mOTOTOUm

Õ



. The formal definition is omitted here; it matches

the notion of bindings in Z.

Cartesian products and their elements, tuples, are a special case of bindings, with

“anonymous” field names, which will be denoted as 1,2, and so on. A cartesian product

type,



FSDOTOTO)D

F

n, abbreviates the binding type



G

T7mOTOTOhm



H

, and the tuple

constructor

 2TOTOTOh2 

the binding constructor



Õ m«OTOTOUm

Õ



3.2.6 Order and Fixed-Points

The fixed-point operator of



&)(

e, selects the smallest element from the set









. To this end an order on values needs to be defined, and lattice properties need to

be established (see Section 2.3.4 (on page 33)).

Our model of sets suggests to define the order such that u





v, where



3.2 Semantic Model 49

is the definition of intersection on characteristic functions:



©?

¥X6Z<[

)X6Z<[

nd



7ùM

^6à ½IÝ/¨ú)l

7ùM

^6à ½IÝ/¨ú

¾



The order confirming to this notion of intersection is defined in Figure 3.8 (on the fac-

ing page). The elements of a set-type can be refined from non-membership to unknown

membership to membership. This follows from that non-membership intersected with un-

known membership yields non-membership. For sets over the domain





the order is

illustrated in Figure 3.8 (on the preceding page). It is clear that with this order power-set

domains are complete lattices: the lattices “join” equals to set intersection and the “meet”

to union. The bottom element is the total function which delivers false for all members

of the element type’s domain,

½IÝ¨[

-

, and the top element the total function which deliv-

ers true. Note that the bottom element does not coincide with the completely undefined

characteristic function,

Ü6`/X6Ý

6

, as shown in Figure 3.8 (on the facing page).

We will not attempt to define ordering for the given values of the universe, though this

might be in principle possible for constructed values, allowing to build “infinite terms” by

fixed-points. For the fixed-point operator,

&)(

e, it is demanded by the type conditions

that the free variables in poverwhich a (simultaneous) fixed-point is constructed represent

set-values, and that pis an exhaustive pattern (like a tuple) which matches each value of

its type domain. The point-wise lifting of set-order to values matching such patterns is

obvious and a formalization is left open.

50 The



ZCalculus



VAR

EXPUT

Ë {O7OTO

PATUT

Ak

EXPUT



PATUT

VAR p

817

PATUT



E

PATUT

CONS

I KMLON

PATUT

PATEX

Ak

PATUT

PATLIN

;k

PATUT

Figure 3.9: Expressions and Patterns

3.3 Expression Syntax and Meaning

This section provides the definitive reference for the exact typing conditions and meaning



Zexpressions. For an informal introduction and motivation, we refer to Section 3.1

(on page 38).

The forms of



Zexpressions will be incrementally defined, each one together with

its typing rule and meaning function. There are also a few syntactic abbreviations for

common forms of expressions, which are introduced in Section 3.3.5 (on page 56).

3.3.1 Variable, Expression and Pattern Type

The syntax of (untyped) expressions is defined by the free type EXPUT (Figure 3.9) which

will be extended as we proceed. The set of variables, VAR, is pre-given. Patterns are the

smallest subset of expressions which satisfies the rules in Figure 3.9. A pattern is called

exhaustive, if it matches all values of its type. The property of being exhaustive can be

syntactically checked by analyzing the free types the constructors of the pattern belong

to. A pattern is called linear, if no variable in it appears twice. The set PATEX contains all

exhaustive patterns, the set PATLIN all linear patterns; a formalization is left open.

3.3.2 Typing Relation

The well-typed expressions, EXP

EXPUT, are those which are in the typing relation,

P Q

E

(Figure 3.10 (on the next page)). The context for typing,



TYPASS, is a

mapping from variables to types. Rules for the typing are supplied with each individual

expression form in Section 3.3.4 (on page 52) below. The well-typed patterns are patterns

which are well-typed expressions.

For well-typed expression it is assumed that there is a type-reconstruction function,

:

Ó3Ý

e, which delivers the type of an expression in its context, as fixed by

. The

function

:

Ó3Ý

is not strictly consistent with our formal model, since an expression does

not contain information about its usage context. However, it is conceptually possible to

model such a function formally by adding type annotations to expressions.

3.3 Expression Syntax and Meaning 51

TYPASS

Õ

VAR

TYPE

Ð ;k¬

TYPASS

EXPUT

TYPE



EXP

Õì



EXPUT

By



TYPASS





TYPE

PRQ

S

d

PAT

Õ

PATUT



EXP



ÓBÝu

EXP

TYPE

Figure 3.10: Typing Relation: Declarations

SEMCTX

Õ

TSUBS

D

UNIASS

½1ÝH[ 

SEMCTX

EXP

UNIV

X6ZJ[¥½IÝ¨[

Wí



EXP

By



TYPASS

2



TYPE

PRQ

U

¼H^¨\Ã½¬ùG\_^6`

ú5d¼H^¨\Ã½



+]X6ZJ[

O0äv

X6ZJ[

X6ZJ[

O4v

_b

6X6ZJ[

O4

w½IÝ¨[´¥½5Ü

/½

O03

1""

Q)AH

r

SEMCTX

Figure 3.11: Meaning Function: Declarations

3.3.3 Meaning Function

With each expression form, the meaning is supplied in subsequent sections, defining a

case for the recursive partial function,

½1ÝH[

, that is declared in Figure 3.11. The context,

r

SEMCTX, consists of a ground type substitution and a value assignment forvariables.

The domain of

½IÝ¨[

-T

is fixed to those expressions which fit to the context. An expres-

sion fits if its typing,

P Q



, confirms to

, which means the following: all types in

, as well as the expression’s result type



, are ground under the substitution

OË0

4, and

all values in the value assignment are in their type’s domain. In a similar way we could

state that every value delivered by the meaning function is in the according result type’s

domain, which is, however, omitted.

For the definition of the meaning function, we need the notions of value matching

and substitution. A pattern pcan be matched against a value uunder a value assignment



UNIASS



VAR

UNIV, written as p

['\

u. The inverse of value matching is

substitution, written as

½5Ü

/½

p(Figure 3.12). These fairly standard concepts are not

formalized.

4The Z phrase e

]

nselects the n-th element of a tuple.

UNIASS



VAR

UNIV

[

;k

PAT

UNIASS

UNIV



½5Ü

/½

UNIASS

PAT

UNIV

Figure 3.12: Value Matching

52 The



ZCalculus

EXPUT

Ë {O7OTOd<  =

^

CONS

D"n

EXPUT



Ò`/½

S



R

_

dmX6ZJ[



X6ZJ[



<ÇrX6ZJ[

6X6ZJ[



PRQ



J



<



,* 

PRQ



¬ÐF½5Ü

/½

(!)

TYPASS

CONS

I KMLON

EXPUT

TYPE

bdc bBe

I KMLON

TYPE

TSUBS

½IÝ¨[



=)¦¶½1ÝH[

OË0)W"½1ÝH[

Whû



Q)AH

r

SEMCTX

m 

CONS

6"n

EXP

Figure 3.13: Syntax and Meaning: Constructor Application

EXPUT

Ë {O7OTOd<I J

f

EXPUT



s





EXPUT



PRQ

S

PgQ



ÐHk

$

TYPASS

EXPUT

TYPE

PgQ

ÐHk



PRQ



S

TYPASS

EXPUT

TYPE

½IÝ¨[

.

JU¦

/'0

¤

3½1ÝH[¥½5Ü

/½

OË0$

:

Ó3Ý

1

_

r½IÝ¨[

!

 



 ¦ 6

Õ

3'0

¥½IÝ¨[

-T



WX6Z<[

c½IÝ¨[½=Ü

/½

O0$1Gk a)5



ÓBÝ

1"zv

a¸ÚlÍ

UNIV

F)AY

aU

\¤Ü/Ý«A7RÜ6`/X6Ý

Q)AH

r

SEMCTX



EXP

Figure 3.14: Syntax and Meaning: Singleton Set Display and Selection

3.3.4 Basic Forms: Typing and Meaning

The tools for defining typing conditions and semantic meaning have been introduced in

the previous sections. In the sequel, for each basic expression form its typing and meaning

will be defined.

Constructor Application

A constructor application is denoted as

ikj

ldmSnSndn(m

, with n

prq

. The definitions are

given in Figure 3.13.

In the typing rule, we use the function

sBtvuwxsi

, which delivers the type of a construc-

tor with “fresh” type variables. The meaning is derived from the constructor’s semantic

interpretation,

wzy{-|}i

, for given ground type substitution

found in the context.

Singleton Set Display and Selection

A set which contains exactly one element is denoted by the expression



, called a

singleton set display. The element of a singleton set is selected with the expression



called a singleton set selection. The definitions are given in Figure 3.14.

The type of the display is constructed from the type of the element. The operand of

the selection must denote a set. The type of the result is extracted from the type of this

set.The singleton set display denotes a characteristic function which is exactly true for the

display contents, and false for all other elements of the type. The meaning of singleton

3.3 Expression Syntax and Meaning 53

EXPUT

v

ndnSnhj 

of

EXPUT



EXPUT



j 

of

EXPUT



EXPUT



E



EXPUT





fresh

g



 W

TYPASS

=



TVAR



$(

g



$

W

TYPASS





EXPUT



TYPE

R

l2$ld

R

A5U2l

+

U



lUm



j:w^'wh|Sl

W

TYPASS









EXPUT



c b





TYPE







TSUBS

 ¢¡ £ ¤

¡ ¥ ¤¦

wzy{;§¨j

o © ª'«

j=¬



wzy{j:w^'w(§,n®'jOs¯'°Ey2

o¢o±³²:´¶µ

w¢y

 j

lk



oW© ·O¸'¹

¨º'«

j:w¢y{;§





¨º'«

j:w¢y{;§



o»±

ª'«

j=¬



j¼'½{

l¾¿¼'½{





oUÁ

²:´¶µ

w¢yÂW.j



oUÁ

²:´¶µ

w¢yÂ

±.Ã





 j

lk



oW© ·O¸'¹

¨º'«

j:w¢y{;§





¨º'«

j:w¢y{;§



o»±

ª'«

j=¬



j¼'½{

l¾¿¼'½{





oUÁ

~sÇÆÈyÂ%Éj



odÁ

~sÇÆyÂ

±-Ã





 jÈ

o © ·O¸'¹

Ëº'«

j:w¢y{;§

o±Xª'«

j=¬



¼'½»{

±-Ã

Ì

f u

ÍÏÎ

¸¶Ð¸



SEMCTX



lUm





EXP

2



TYPE

Figure 3.15: Syntax and Meaning: Set Algebra

set selection is defined only if the operand denotes a characteristic function over the entire

types domain, which is true for exactly one element. Note that for a partial characteristic

function, the



form is undefined.

Set Algebra

The empty set is denoted by the expression



, the set intersection by the expression

lÑ



, set union by e

lÑ



, and set complement by



e. The definitions are given in

Figure 3.15.

The type of the empty set is a powerset over some fresh type variable



. For set

intersection and union, both operands must denote sets of types which can be unified. For

the set complement, the operand must be a set.

The meaning of the empty set is a total characteristic function which constantly yields

false. The meaning of set intersection, set union, and set complement has been already

discussed in Section 3.2.1 (on page 43). Here, we take care for the encoding details.

Set Translation

The translation of a set, e

l}ÒÓ

BÅ

, describes the set of values from ewhich are trans-

formed by the pattern p

to p



. The definitions are given in Figure 3.16 (on the next

page).

The patterns must be typeable under a shared environment, and must be linear. The

type of emust be unifiable with the type of p

; the resulting type is the substitution of the

unifier in the type of p



54 The



ZCalculus

EXPUT

v

ndnSnhj

ÒÓ Å

of

EXPUT



PATUT



PATUT



)ÔÕ

l2SSld

}ÔÖ

Sd

lØ×

PATLIN



Ù×

PATLIN

R

(



Sl

R

lAÒÓ

ÚÅW³jwf'w(|hU





TYPASS





EXPUT







PATUT

bBc b



c b





TYPE





TSUBS

ÛÜ

}



UNIV



lUm





PAT







UNIV

ÞÕ|



TSUBS

ß¾Hà



UNIASS

lÕ×"w¢y{¨jwf'w(|Ïj:s:¯'°Dy

oÚo

Õ×@w¢y{j:w^'w(|ÏjOs¯'°Ey



oÚo

l2áDâ

lGÄ

Aá'â

d

sÇÆ



lUm





PAT



UNIV



UNIV





ÛÜ





wzy{;§¨j

lAÒÓ

ÚÅ

o%©

·O¸'¹

}¨º'«

jwzy{-§





w¢y{j:w^'wW§,n®'jOs¯'°Ey

oÚo







w¢y{j:w^'wW§,n®'jOs¯'°Ey



oÚo»±

ª'«

jÚj=¬

äã

jOsÇÆfj

lAÒÓ



oUÁ

¼'½»{

o+±9å

sÇÆÇj

ÖÒÓ

oÚo



j=¬



çæ

j:sÇÆÇj

lÒÓ



oUÁ

lBÂ

o,±²:´'µ

wzy

oÚo

ÍÏÎ

¸¶Ð¸



SEMCTX





EXP



ldm





PAT

Figure 3.16: Syntax and Meaning: Set Translation

For the specification of the meaning of translation, we define an auxiliary relation,



ÛÜ





, which relates values which are translated by two patterns. The relation is

described polymorphically, by relating all those values u

lUm



which are in an instance of

the pattern’s types under a common ground type substitution

and which can be matched

by the patterns using the same value assignment

. The auxiliary function

sÇÆ

yields the

set of values which are reached under the translation by a pattern pair.

For the meaning of translation, two cases have to be distinguished: those elements

which are defined in the characteristic function denoted by the set eand which are reached

by p

lÙÒÓ



, are in the domain of the resulting characteristic function; their membership

is derived by Boolean disjunction of their image under the reverse of pattern translation.

Those values which can not be reached by the translation are defined to be constantly not

members of the set. This definition reflects that a translation may “filter out” undefined

elements of the translated set. Consider for example a set over lists where the membership

of the empty list, denoted by the constructor nil, is unknown, whereas the membership of

all elements denoted by the constructor cons

is known. Translating such a set

by cons

ÒÓ

cons

results in a set of which nil cannot be member. This is

reflected by the second

in Figure 3.16. The example also shows that a translation by

identical patterns does not necessarily yield an identical set value.

Schema, Fixed-Point, and Variable

A schema is denoted by the expression





, where pis a pattern, and ean expression

of type

. A fixed-point is denoted by

é%ê

e. A variable application is written as x.

The definitions are given in Figure 3.17 (on the next page).

3.3 Expression Syntax and Meaning 55

EXPUT

v

ndnSnhjz~  

o^

PATUT



EXPUT



äj:é%ê ë

o^

PATUT



EXPUT





of

VAR



¼'½»{

Ô

¨ì´

Æíw



PATLIN





S(

¨îï



dè





}5





TYPASS



PATUT





EXPUT



TYPE

×9¼'½{



R





(

TYPASS



VAR

¼'½{

}Ô

ì´

Æíw



PATLIN



PATEX

ØÆ

}ÔÑð

uñ

Ô

Sl^

îò Ô

SU2l





U

R

éçê

w^'w|}Sl





TYPASS







TSUBS



PATUT



EXPUT



c b





TYPE

wzy{;§¨jx~





o4©

ª'«



wzy{jwf'w(§,n®'j:s:¯'°Dy

o¢o





UNIV

ó èA+à



UNIASS



ª'«



wzy{õj§,n®DmB§,nvö

ª'«



'u¼'y

²¾±

ÒÓ





søsxÅ

 jOéçê

o%©_·O¸¶¹





wf'wW§,n®Dj:s:¯'°Dy

Ëº'«

jwzy{Y§+~







o»±

·O¸¶¹

cands



oUÁ

~sÇÆÈyÂ

ùOú}û

cands

¸äü

cands

¸¶·ýU¸

'u¼'y

 j

o ©

§,nvö

ÍÏÎ

¸¶Ð¸



SEMCTX





PAT





EXP





VAR

Figure 3.17: Syntax and Meaning: Schema, Fixed-Point, and Variable

The pattern of a schema must be typed under a minimal type assignment, and must be

linear. The property must be typed as a Boolean under an extension of the environment

by the type assignment used for the pattern, where variables of the pattern hide variables

in the environment. The resulting type of the schema is a powerset of the pattern’s type.

The pattern of a fixed-point must be typed under a minimal type assignment, must be

linear, and must be exhaustive. Each variable in the pattern must have a set type. The

definition must be typed under an extension of the environment by the type assignment

for the pattern, where variables from the pattern hide variables in the environment, and

must unify with the pattern’s type. The resulting type is this unified type.

The meaning is defined as follows. For schemas, a characteristic function is con-

structed, the domain of which are the elements from the schema’s pattern type such that,

if they match the pattern, then the property evaluates to a defined Boolean value under the

bindings resulted by this match. The schema’s characteristic function yields true for an

element if the pattern has matched and the properties Boolean value is true.

For the fixed-point, we use the meaning of







to construct the set of can-

didates. By the typing conditions we know that the variables in prepresent sets, and the

domain of p’s type is ordered by

. Thus, if the infimum of the candidate set is a

candidate, we select it, otherwise

'u¼'y

is delivered. A sufficient condition for existence

of the fixed-point is that eis order-preserving, see Section 2.3.4 (on page 33).

56 The



ZCalculus

}

 



}

5







EXPUT



EXPUT





ÒÓ











ÍÏÎ

¸'Ð¸



EXPUT





VAR

× m

 

EXPUT



EXPUT



EXPUT





jz~

H



×F~



ÍÏÎ

¸'Ð¸



EXPUT









EXPUT



EXPUT









EXPUT

ùOú





¸¶ü



¸¶·:ýd¸

ùOú





¸¶ü

heade

¸¶·:ýd¸

head e





taile









EXPUT

ùOú





¸¶ü

¸¶·:ýd¸

ùOú





¸¶ü

heade

¸¶·:ýd¸

head e





taile

Figure 3.18: Expression Abbreviation Forms

3.3.5 Expression Abbreviation Forms

We have already mentioned several syntactic abbreviations for frequent patterns of



expressions. The abbreviations are specified in Figure 3.18 as constants and injective

functions which map into expressions. The functions work on untyped syntax; the typing

conditions are inherited from the basic forms they map to.

The following explanations about the definitions. The universal set,

, is the comple-

ment of the empty set. The undefined value,



, is the singleton selection on the empty

set. The Boolean expression



e, which tests whether the given set is non-empty, is an

abbreviation for a translation to the Boolean unit,



. The complementary form



etests

whether the given set is empty. The Boolean membership, e

, is mapped to an in-

tersection of a singleton set containing the element with the set to test membership in,

and a subsequent test for emptiness of the result. The equality, e



, is derived from

membership in a singleton set.

In order to work with sequences of unions and intersections, two further abbreviations

are introduced:



edescribes the generalized union, and



ethe generalized intersection

of the sequence of expressions e. For empty expression sequences, the according neutral

element (



resp.

) is denoted.

3.4 Equational Theory 57

ì´

Æíw



EXP





VAR

EXPASS



VAR

EXP

 

ñj

PAT



EXPASS



PAT

w^'w



EXPASS



EXP

{



Uwf'w



PAT



EXP



EXPASS

{



Uwf'w





PAT





EXP

Dì´

ÆMw

³jz~



ÒÓ

PATCTX



PAT



PAT

EXPCTX



EXP



EXP

ì´

Æí½'ÆM¼'yÆ





VAR





VAR

Figure 3.19: Syntactic Tools

3.4 Equational Theory

An equational theory for



Zwill be developed in this section. The laws will be shown to

be consistent with the model of



Zdeveloped in the previous section.

3.4.1 Syntactic Tools

For defining the equivalences, some syntactic tools for working with expressions are re-

quired (Figure 3.19). The variables which occur free in an expression (are not bound

by some schema’s pattern) are delivered with

ì´

Æíw

e. Two patterns are unified using the

notation p







determining the smallest assignment

EXPASS



VAR

EXP

which makes them equal. An expression assignment can be substituted in an expres-

sion by

w^'w

e. Substitution is partial because of typing conditions for the substituted

expressions. These fairly standard concepts are not formalized.

Given a pattern pand an expression e, we build a substitution of the variables in the

pattern by sub-components of the expression with

{



Uwf'w¾j

. Variables xare substi-

tuted by a “selection” from the expression eexpressed via a translation. Note that the

meaning of this selection may be undefined (

'u¼'y

), in case the pattern pdoes not match

the value of e.

A pattern context,

PATCTX, is a “pattern with a hole”. It is described by an in-

jective function from a pattern to a pattern. Similar, an expression context

õ×

EXPCTX

is defined. The context functions are partial because of typing conditions.

For variables a total order is assumed, such that we can convert a finite set of variables

into a sequence of variables by the function

ì´

ÆM½¶ÆM¼'yÆ

3.4.2 Equivalence Relation

The equivalences are described by a relation over well-typed expressions of the same

type. This relation has the equivalence properties and is a congruence (is closed under

contexts). The declarations are given in Figure 3.20. The equivalence relation will be

incrementally refined below:

is supposed to be the smallest relation satisfying the

laws in Figure 3.20 and the laws added later on.

58 The



ZCalculus



ñ~



EXP

s¯'°Ey



s¯'°Ey



Ô Ô

ÄF~



¼'½{

$#

ÍÏÎ

¸¶Ð¸

Ô Ô



EXP





EXPCTX

Figure 3.20: Equivalence Relation: Declaration



Éj



Ô Ô



Ô Ô







Éj



Ô Ô



@j



Ô Ô

ÍÏÎ

¸¶Ð¸

Ô Ô



EXP



Éj



Ô Ô



Ô Ô



Éj



Ô Ô



Éj



Ô Ô

ÍÏÎ

¸'Ð¸

Ô Ô



EXP

 j





>

 j





>

 

 jÈ

ÍÏÎ

¸¶Ð¸

Ô Ô



EXP

Figure 3.21: Boolean Laws

3.4.3 Boolean Laws

The set operations follow almost all of the usual laws for Boolean algebras. Commu-

tativity and distributivity of union and intersection hold, and De Morgan’s laws are sat-

isfied, such that we can build a conjunctive or disjunctive normal form of expressions

(Figure 3.21).

However, the “principle of the excluded middle” does not hold: e

>



is not

valid for arbitrary e, as well as e



. The reason for this is the partiality of sets:

if econtains unknowns, then in the complement of ethese unknowns keep unknown, and

therefore do not absorb each other. As a consequence, the implication e

e, interpreted

as the usual abbreviation for



>

e, does not equal to

Proof

.We prove one of the distributivity laws and one of the De Morgan laws.

Let f



(

be characteristic functions. We want to show (ignoring encoding de-

tails) that the pair u

ÒÓ

bis in the characteristic function h



lkVj



(

iff it is

in the function h





lk



Vj

lÑ

(

Case u

ÒÓ sÇÆÈy

:According to definition of union and intersection, u

ÒÓ sÇÆÈyÏ×

iff u

ÒÓ sÇÆy×

l+Ä>j

ÒÓ sÇÆÈy ×

,Ê

ÒÓ sÇÆy×

(

. This is equivalent to

ÒÓ sÇÆy×

lÖÄ

ÒÓ sÇÆÈy$×



Ê¨j

ÒÓ sÇÆÈy×

lHÄ

ÒÓ sÇÆy×

(

, which

in turn holds iff u

ÒÓ sÇÆÈy}×



3.4 Equational Theory 59

G>~





G>~



ÍÏÎ

¸¶Ð¸



EXP

Figure 3.22: Excluded Middle Laws

~



ÍÏÎ

¸¶Ð¸



EXP

Figure 3.23: Selection Elimination Law

Case u

ÒÓ

²:´¶µ

w¢y

:Then we have u

ÒÓ

²:´'µ

wzyÏ×

iff u

ÒÓ

²:´¶µ

w¢y ×

lÊj

ÒÓ

²:´¶µ

w¢y ×

Ä

ÒÓ

²:´'µ

wzy³×

(

. This is equivalent to

ÒÓ

²:´'µ

wzy³×

lÊ

ÒÓ

²:´¶µ

w¢yX×



Äïj

ÒÓ

²:´¶µ

w¢y×

lGÊ

ÒÓ

²:´¶µ

w¢y×

(

, which holds iff u

ÒÓ

²:´'µ

wzy×



Let f

and f



be characteristic functions. We show that the pair u

ÒÓ

bis in the

characteristic function h



 j

l 



iff it is in the function h







l >



We thereby use the following fact: let g





g, then it follows from the definition

of complement: u

ÒÓ

iff u

ÒÓ Ì

g, where

bis (Z-level) negation of

the Boolean value b. (Note the difference to the (wrong) assertion, u

ÒÓ

Ìïj

ÒÓ

Case u

ÒÓ sÇÆÈy

:We have u

ÒÓ sÇÆÈy ×

iff u

ÒÓ

²:´'µ

wzy ×j

lÑ



, iff u

ÒÓ

²:´¶µ

w¢y ×

lÕÊ

ÒÓ

²:´'µ

wzy5×



, iff u

ÒÓ sÇÆy ×Y

lÙÊ

ÒÓ sÇÆÈy ×Y



, iff u

ÒÓ sÇÆÈy ×



Case u

ÒÓ

²:´¶µ

w¢y

:We have u

ÒÓ

²:´'µ

wzy ×

iff u

ÒÓ sÇÆÈyÏ×j

l 



, iff u

ÒÓ sÇÆÈy ×

lÖÄ

ÒÓ sÇÆÈy×



, iff u

ÒÓ

²:´'µ

wzy×.

lØÄ

ÒÓ

²:´¶µ

w¢y×-



, iff u

ÒÓ

²:´'µ

wzy×



3.4.4 Singleton Set Laws

Excluded Middle

For singleton sets the “excluded middle” holds (Figure 3.22). One can easily see the

validity by considering the characteristic function constructed for a singleton set: it is

total and true exactly for the element e. The complement yields a total characteristic

function which is true for all elements except e.

Selection Elimination

Singleton set selection on a singleton set construction reduces to the singleton sets mem-

ber (Figure 3.23). The validity can be easily seen from the representation of singleton sets

and the meaning of the



-operator.

60 The



ZCalculus

~ikj



)$ 

jÚj=¬



-

±



ÒÓ ikj¢j

jxt ¼kj¢®ÕnSn



oÚo¢o

~ih

0

W



ÒÓ_i»Å

ÍÏÎ

¸¶Ð¸



CONS







EXP







VAR









òq

Figure 3.24: Constructor Decomposition Laws

lGÒÓ

ÚÅ

(

ØÒÓ

BÅ

wf'w

lAÒÓ wf'w



ÇÅ m

ùOú







(

lGÒÓ

ÚÅ

(

ØÒÓ

BÅ

 m

ùOú







(

ÒÓ

ÍÏÎ

¸¶Ð¸



EXP



ldm

Um

(



PAT





PATEX





EXPASS

Figure 3.25: Translation Composition and Identity Laws

Constructor Decomposition

A singleton set containing a constructor application with arity greater zero can be decom-

posed into an intersection of projections. A singleton set containing a zero-arity construc-

tor equals to a translation of a truth value (Figure 3.24). The first law is formulated in

“

nSnSn

” notation as follows:

~ikj

ldmSnSndnWm



l^

lGÒÓ ikj

ldmSnSndn(m

Å' nSnSnB@~



ÒÓ_ikj mSnSnSnWm

It holds since the translation



ÒÓ_ikj¢nSndnWm

mdnSnSn

yields the set ofall values

ikjznSnSnWm

mSnSndn

where the ith argument is fixed by eiand all other are arbitrary. Intersecting the sets re-

sulting from such a translation for each argument of the constructor must thus yield in

~ikj

ldmSnSndn(m



. Thereby, definedness of eiis not significant, since constructors and sin-

gleton sets are not strict.

3.4.5 Translation Laws

Composition and Identity

Consecutive translations are composable, if the intermediate patterns can be unified. If

they cannot be unified, consecutive translation reduces to the empty set. A translation by

an exhaustive pattern has no effect (Figure 3.25).

Proof

.We prove composition of consecutive translation. We want to show that the

pair u

ÒÓ

bis in the characteristic function h



l5ÒÓ

ÚÅ

(

ÒÓ

BÅ

iff there exists

an assignment

such that p







(

and u

ÒÓ

bis in the characteristic function h





wf'w



lGÒÓ w^'w



ÇÅ

Case u

ÒÓ sÇÆy

:When u

ÒÓ sÇÆy×

, according to definition of translation there exist

value assignments

à mfà

, and values u

Ô Ô

, such that p

u,p

(

á

Ïá

and p

lØá

Ô Ô

. Since p

(

and p



match the same value u

, there must exists a small-

est pattern assignment

such that p







(

, and

jwf'w



e e

as well as

jwf'w



(

e e e

. The assignment

specializes the patterns compatible to the

value u

, and hence we also have

jwf'w

e e

Ô Ô

and

j:w^'w



e e e

u, and

therefore u

ÒÓ sÇÆÈy}×



3.4 Equational Theory 61



ofÃ

lAÒÓ

BÅ

lGÒÓ

ÚÅ'

lAÒÓ

ÚÅ



ofÃ

lAÒÓ

BÅ

lGÒÓ

ÚÅ'

lAÒÓ

ÚÅ>m

ùOú

ì´

ÆMw

ì´

Æíw



j=

o^Ã

lAÒÓ

ÚÅ

 j

lGÒÓ

ÚÅ

ùOú

ì´

ÆMw

ì´

Æíw

ÖÄ

lØ×

PATEX

×

PATEX

ÍÏÎ

¸¶Ð¸



EXP



lUm





PAT

Figure 3.26: Translation Distributivity Laws

Case u

ÒÓ

²:´'µ

wzy

:Similar argumentation as above, but with negated propositions.

Distributivity over Set-Operators

Any translation distributes over set union. A translation by an embedding distributes

over set intersection. An embedding distributes over set complement if it is exhaustive

(Figure 3.26).

An analogy to predicate logic givesan intuition whyhiding translations (where

ì´

ÆMw

ì´

Æíw



) do not distribute over intersection. Hiding in



Zcan be interpreted as an existen-

tial quantifier. In predicate calculus, the following holds:

Qiff

j=Þ

jÈÞ

. However, it does not hold

Qiff

jÈÞ

ÄòjÈÞ

, since in the

later case Pand Qmust be satisfied simultaneously for the same x.

Proof

.We prove distributivity of any translation over union and of embedding transla-

tions over intersection, using the auxiliary relation vp



ÛÜ



u(Section 3.3.4 (on page 54)).

We prove that u

ÒÓ

bis in h





o^Ã

lYÒÓ

BÅ

iff it is in h





l;ÒÓ

ÚÅ¶



Case u

ÒÓ sÇÆÈy

:We have u

ÒÓ sÇÆÈy×

iff



UNIV



ÛÜ



ÒÓ_sÇÆÈy×



iff



UNIV





ÛÜ



ÒÓ sÇÆÈy.×

ÒÓ sÇÆy.×

iff

j=Þ



UNIV



ÛÜ



ÒÓ sÇÆÈy}×

Êòj=Þ



UNIV



ÛÜ



ÒÓ

sÇÆÈy×

iff u

ÒÓ sÇÆÈy×



Case u

ÒÓ

²:´¶µ

w¢y

:similar.

We prove that u

ÒÓ

bis in h





o^Ã

lYÒÓ

BÅ

iff it is in h





l;ÒÓ

ÚÅ¶

lHÒÓ

ÚÅ

, where

ì´

ÆMw

ì´

Æíw



. Thereby the following property is used:

ì´

ÆMw

ì´

Æíw



, then for a given uthere exists not more then one vin the same

type such that vp



ÛÜ



u. This follows from the fact that all variables in p

are

bound by a match of p



against u.

Case u

ÒÓ sÇÆÈy

:We have u

ÒÓ_sÇÆÈy×

iff



UNIV





ÛÜ



ÒÓ sÇÆÈy}×

ÒÓ sÇÆy×

iff

j=Þ



UNIV



ÛÜ



ÒÓ sÇÆÈy6×

Ä j=Þ



UNIV



ÛÜ



ÒÓ sÇÆy ×

iff u

ÒÓ sÇÆy ×



. Since of the unique

existential quantifier, the conjunction could have been pulled out.

62 The



ZCalculus

lAÒÓ

¢Å¶

ÒÓ



o^Ã

(

ØÒÓ

BÅ

lGÒÓ

ÚÅ

(

HÒÓ

ÇÅ¶

ÒÓ



(

HÒÓ

ÇÅ

ÍÏÎ

¸¶Ð¸



EXP



lUm

Sm



(



PAT





EXPASS













ì´

Æíw jwf'w





ì´

Æíwñj:w^'w



o 

Figure 3.27: Translation Product Law

Case u

ÒÓ

²:´¶µ

w¢y

:similar.

Distributivity of Hiding over Product

A hiding applied to a product can be distributed. A product is an intersection of translated

sets which are independent (Figure 3.27). The independence of the operands is checked

by unifying their target translation patterns, und stating that the source patterns under the

resulting substitution have disjoint variables.

A formal proof is omitted, but an analogy to predicate calculus can serve as an intu-

ition for the validity of the law. In predicate calculus the following holds:

j=Þ

Ê j=Þ

, provided that yis not free in Px and xis not free in

Qy. A similar situation is described here.

The product law, in combination with the constructor decomposition law (Figure 3.24

(on page 60)), provides a generalization of decomposition in unification. Consider an

equality such as

ikj

oH

ikj

. Removing the syntactic sugar yields in

jz~ikj

G@~ikj



ofÃ

ÒÓ

&

¶Å

Applying the constructor decomposition law yields:

jz~



ÒÓ ikj

Åä9~



ÒÓ ikj

Å'



ÒÓ ikj

Åä9~



ÒÓ ikj

o Ã

ÒÓ



¶Å

Next we apply associativity of intersection and distributivity of embeddings to get

jÚjx~

G@~



o^Ã

ÒÓ_ikj

Å¶Éjx~

G@~



o^Ã

ÒÓ ikj

ofÃ

ÒÓ

&

¶Å

Now, since the outer intersection is a product, we can apply the distributivityof the hiding

ÒÓ



. The resulting consecutive translations are immediately contracted such that we

finally get:

jz~

G9~



ofÃ

ÒÓ

&

¶ÅE7jx~

G@~



o^Ã

ÒÓ

&

¶Å

Folding the syntactic sugar for equality, we reach at a







Lifting of Hiding

A hiding as an operand of an intersection can be lifted out of the intersection (Figure 3.28

(on the next page)). The hidden part is introduced by an embedding for the right operand,

3.4 Equational Theory 63

lGÒÓ

ÚÅ'



HÒÓ

l¢Å

ofÃ

lAÒÓ

ÚÅ>m

ùOú

ì´

ÆMw



ì´

Æíw

ÍÏÎ

¸¶Ð¸



EXP



lUm





PAT

Figure 3.28: Translation Lifting Law

lGÒÓ

ÚÅ

lHÒÓ j j

j j

ÒÓ

ÚÅ

ÍÏÎ

¸¶Ð¸



EXP



ldm





PAT





?@

VAR



ì´

ÆMw

ì´

Æíw

GÌ

ì´

Æíw



ì´

ÆMw

lUØÆ

ì´

Æíw

lç

ì´

Æíw



Figure 3.29: Translation Splitting Law

, by just reverting the translation. On the intersection, the hiding is then performed

again.

We do not provide a formal proof, but just point to an analogy in predicate calculus.

jÈÞ

Qis equivalent to

Q, provided xis not free in Q. Lifting of

hiding is a generalization of this.

Splitting into Embedding and Hiding

A translation which is neither hiding nor embedding can be splitted into an embedding

and a consecutive hiding (Figure 3.29). The correctness of this law follows directly from

the law of composition of translations (Figure 3.25 (on page 60)): unifying

against

itself yield in an empty assignment, henceforth applying the composition law of transla-

tion yields in e

wf'w

lÙÒÓ wf'w

¢Å

, which equals to the left hand side expression in

Figure 3.29.

It should be noted that since embeddings can be distributed and hidings can be lifted

out of intersections, meanwhile other translations can be decomposed into embeddings

and hidings, it is possible to normalize expressions such that embeddings are pushed to

the leafs and hidings are pulled to the top of intersections.

Intersection Elimination

Two translated sets in intersection, with target patterns which cannot be unified, reduce to

the empty set (Figure 3.30).

Note that in combination with the constructor decomposition law (Figure 3.24 (on

page 60)) this law leads to the reduction of intersections

~ikj

G@~i



, with

i;÷



to the empty set.

lGÒÓ

ÚÅ'

ÒÓ



 m

ùOú









ÍÏÎ

¸¶Ð¸



EXP



lUm

Sm





PAT





EXPASS

Figure 3.30: Translation Intersection Elimination Law

64 The



ZCalculus

Ñ







ÒÓ









H9~













H9~









j=Ë~





ofÃ

ÒÓ

ÍÏÎ

¸¶Ð¸



EXP





PAT

Figure 3.31: Schema Elimination and Distributivity Laws

3.4.6 Schema Laws

Elimination and Distributivity

A schema with a false property can be reduced to the empty set, one with a true prop-

erty to a translated universal set preserving the pattern (this is necessary for the case the

pattern is not exhaustive). Schema construction distributes over set union, intersection,

and complement (Figure 3.31). The distributivity over complement requires a translation

of the resulting set with p

ÒÓ

pfor the case that pis not exhaustive. In this case, the

negated schema denotes also those values which do not fit to p. These are filtered out by

the translation.

Proof

.We prove one of the distributivity laws. In order to represent the property eof

a schema,





, functions F

UNIASS



wzy{

BADC

ó è

are used: F

yields the

characteristic Boolean-valued function of a property in dependency of a value assignment

.We want to show that the pair u

ÒÓ

bis in the characteristic function h





lk

S

iff it is in the function h







l^H9~



d

. Suppose there is not match

u. Then obviously u

ÒÓ

²:´'µ

wzy}×

iff u

ÒÓ

²:´¶µ

w¢y}×



. Suppose there is a match, and

let it be

Case u

ÒÓ sÇÆy

:We have u

ÒÓ sÇÆyX×

iff



"ÒÓ sÇÆy³×>j

lÙàG

)à

(where



repre-

sents the constructor of the unit value), iff



ÒÓ sÇÆÈy}×

lÙàÊ

E

ÒÓ sÇÆÈy×

,à

, iff

ÒÓ_sÇÆÈy×



Case u

ÒÓ

²:´'µ

wzy

:dual.

The proof for intersection is similar.

Complement Lifting

We have described how schema distributes over complement. The opposite of this law is

given in Figure 3.32 (on the facing page). The operand

 j

ÒÓ

adds those values

which are in the result set since they do not match the pattern p. For exhaustive patterns

it reduces to



, since a translation by the same exhaustive pattern can be eliminated.

3.4 Equational Theory 65

¨~







H>Ëj

ÒÓ

ÍÏÎ

¸¶Ð¸



EXP





PAT

Figure 3.32: Schema Complement Lifting Law





lGÒÓ

ÚÅ

 m

ù:ú









lGÒÓ

ÚÅ

~wf'w



w^'w

Rm

ù:ú





lGÄ

ì´

Æíw

ì´

ÆMw



ÍÏÎ

¸¶Ð¸



EXP



l^m





PAT





EXPASS

Figure 3.33: Schema Translation Elimination Laws

Translation Elimination

Any translation of a schema where the schema’s pattern and the translation’s source pat-

tern cannot be unified reduces to the empty set. An embedding translation on a schema

can be eliminated (Figure 3.33).

Membership Lifting

A schema which contains a membership-test, p

e, where pis linear and contains only

variables from the schema’s pattern, can be eliminated provided the expression edoes not

contain variables dependent on the schema’s pattern (Figure 3.34).

Note that membership lifting also captures equality:







is an abbreviation

for



×ï~

 

, which eventually reduces to



provided xis not free in e.

Property Substitution

A schema intersected with a translated singleton set is equivalent to an expression where

the singleton set’s element is “imported” by substitution into the schema’s property (Fig-

ure 3.35 (on the next page)). Since we can always add the identity translation, x

ÒÓ

the law captures also the case where no real translation of the singleton set is involved.

Since other laws have ensured that embedding translations applied to schemas can be

eliminated, and hiding translations can be lifted out of intersections, we do not need to

consider the case of the translation of the schema itself.

Proof (informal)

. Recall that

{



Uw^'w¾j

is defined as



PAT





EXP

Eì´

Æíw

Xjz~



ÒÓ





ÒÓ

ÍÏÎ

¸¶Ð¸



EXP





PAT



PATLIN



ì´

ÆMw

ÔÑð

ì´

ÆMw



ì´

Æíw



ì´

Æíw



Figure 3.34: Schema Membership Lifting Law

66 The



ZCalculus



H@~



lGÒÓ

ÚÅ

~wf'w



wf'wñjx{



Uw^'w¾j:w^'w



l^m

o¢o

GV~



lGÒÓ

ÚÅ

ÍÏÎ

¸¶Ð¸



EXP



l^m





PAT





EXPASS







K

Figure 3.35: Schema Property Substitution Law

é%ê

wf'wñjz{

L

Uw^'w¾j

mÚéçê

oÚo

ÍÏÎ

¸¶Ð¸



PAT





EXP





SEMCTX

éçê

×9¼'½{j:w¢y{Y§

o»±



wzy{¨j:w^'wh§,n®AjOs¯'°Ey

oÚo»±

jÈ¬



±É·O¸'¹



UNIASS



wzy{¨j§,n®DmB§,nvö



Figure 3.36: Fixed-Point Unrolling Law

In Figure 3.35, the selection of a component from e

will simplify if

wf'w

is a simple

variable, name it x: then we have the substitution

{

L

Uw^'w¾j

o2

ÒÓ_³jz~



ÒÓ

¶m

and, after further simplification steps, the replacement of xin the schema’s property e

reduces to e

. However, if

w^'w



is a proper pattern, then the selections via the



operator in the substituted expressions may be in principle undefined. Yet, in this case,

though the property

w^'w2jx{



Uwf'w¾jwf'w



lUm

oÚo

eof the schema becomes undefined, the

singleton set expression



lAÒÓ

¢Å

must also reduce to



, such that the undefinedness

of the property does not count.

3.4.7 Fixed-Point Unrolling Law

The fixed-point operator,

é%ê

e, can be unrolled, provided the expression erepresents

a continuous function on the domain of the pattern’s type (Figure 3.36). This law differs

from all others in that the application condition is semantic. The pragmatics is that the

unrolling law is used in the operational semantics of



Zunder the unproved assumption

of the above condition, and is thus correct only relative to such assumptions.

Proof (informal)

. The correctness of this law follows from the fixed-point theorem for

lattices discussed in Section 2.3.4 (on page 33). By the typing conditions, we know that p

is an exhaustive pattern where the variables represent set types, and thus its type’s domain

is a complete lattice.

3.5 Discussion 67

3.5 Discussion

We have presented the



Zcalculus, which provides a way of viewing the

-calculus,

predicate calculus, schema calculus, set algebra, etc., in a unified framework. A set-based

model for the calculus has been defined, syntax, typing conditions and semantics have

been provided, and a set of equational laws has been given. A few further points are

discussed below, including the relation to other work.

Partial Sets and Recursion

Features like sets with unknowns and the explicit representation of undefinedness makes



Z’s model different from the one normally used for Z. However, as discussed in Sec-

tion 2.1.4 (on page 26), the forthcoming ISO Standard does not prescribe a particular

position on undefinedness. The treatment of undefinedness is left to conventions intro-

duced by specifiers and/or tools.

Representing undefinedness in the model of



Zis essential. The need for its repre-

sentation is usually attributed to computation. For example, the reason that VDM [Plat

and Larsen, 1992; Larsen and Pawlowski, 1995] represents undefinedness, unlike Z, is

often explained by its need to deal with computation. For



Z, the more important ar-

gument is that explicit representation of undefinedness is the key to a simple equational

theory, which can be applied without testing guards for definedness (which are, in reality,

intractable). Central laws, such as the distributivity of schema construction over conjunc-

tion,



lk

S



lUÖ9~



S

, would otherwise become more complex. Other

laws affected are property substitution (related to

-reduction or unfolding), construc-

tor decomposition and many more. The observation can be compared with the cleaner

equational properties of lazy functional languages as opposed to strict ones.

It is notable that, although we have undefinedness, it is decoupled from fixed-point

construction, which is based on classical lattice theory. In fact, the least element for

fixed-point construction is the empty set, not the undefined set. Building fixed-points

would also work for models without undefinedness.

Since



Z’s approach is not constructive a priori, a fixed-point may be undefined. At

the level of the model semantics, this causes no problems. In the equational theory, how-

ever, we have to place conditions about continuity of the expression ein

éçê

eover the

variables from pto allow symbolic computation of fixed-points. The application condition

of this law is indeed intractable, but in practice – when mapping Z to



Zand converting

some recursive equations into the fixed-point form – a pragmatic solution is acceptable,

which adds the unproved assumption that the condition for the fixed expression is obeyed.

Set Algebra

The



Zcalculus is based on set algebra, and its operators are well known, except that

they are interpreted with a partial semantics. A new contribution is the combination of

set-algebraic operators with the translation construct. Translation is used to model micro-

elements, like function application and quantifiers, and at the same time it scales up to

describe morphisms between macro-elements, like entire specifications. Translations are

a way of describing morphisms in the category of sets of the



Zuniverse, and some of

68 The



ZCalculus

the equational laws seem to have a direct correspondence to concepts in category theory

[Barr and Wells, 1990]. However, the relation to category theory needs to be further

investigated.

Sets In Programming Languages

Sets for use in programming were proposed by the designers of SETL back in the eighties

[Kruchten et al., 1984]. Recently, sets in declarative programming have again attracted

interest in the community. Several set-oriented programming languages have been de-

fined. This work usually adds special primitives to functional or logic languages, provid-

ing predicates for membership or subset relation and introducing non-free constructors

for decomposing sets in pattern-based rule definitions. One approach along these lines is

the language

log



(pronounced SETLOG), which is an extension of Prolog by special

terms for set expressions, including extensional and intensional representation, and en-

capsulated search [Dovier et al., 1996; Rossi, 1997]. Another interesting approach here is

subset logic programming, described e.g. in [Jana and Jayaraman, 1999; Jayaraman and

Moon, 1999], which allows for recursive subset clauses.



Zdiffers from these approaches in that it embeds functional and logic concepts into

sets and not vice versa, thereby using a pure expression language in the spirit of the

-calculus. This makes



Zsuitable as an intermediate language for the integration of

paradigms. On the other hand, as will be seen in Chapter 4, the computation model for



presented in this thesis is not as ambitious as the one for subset logic programming [Moon,

1997], which (in a first-order setting) employs set unification and fixed-points over sub-

set constraints, quite expensive features [Arenas-Sanchez and Dovier, 1997; Stolzenburg,

1999]. These symbolic computation techniques are, in principle, entailed in



Z’s model,

but no attempt has been made to formulate equational laws which reflect them.

Higher-Order Logic



Zis related to higher-order logic [Gordon and Melham, 1993] in its ambition to re-

duce mechanical reasoning to a small calculus. However,



Zhas more redundancy, since

it contains extensional elements (singleton sets) and set algebra, whereas basic HOL is

purely intensional, and uses only abstraction and application as the kernel expression

forms. In [Santen, 1999], it has been shown that Z can be consistently embedded into

HOL, with the practical effect that tools like Isabelle [Paulson, 1994] can be used for me-

chanical reasoning. Here we propose that the redundancy of



Z, supporting extensional

description and set algebra, is in fact an advantage, because it allows the construction of

kernel machines that are specialized for the basic concept that is of interest in reasoning

– boolean algebras. This is at least indicated for the applications of partial evaluation and

(symbolic) computation, as will be investigated in the next chapter.

Concurrent Constraint, Functional Logic Languages

Most authors presenting implementation techniques for concurrent constraint or func-

tional logic languages base their work on a kernel language which is usually an extension

of a subset of ML or Haskell. Special “choice” constructs (generalizations of the

PQ

ýd¸

construct of functional languages) introduce indeterminism, and logic variables can be

3.5 Discussion 69

declared [Kuchen et al., 1990; Lock, 1992]. In languages that support concurrent con-

straints, collections of constraints can also be used in position of Boolean expressions

[Chakravarty et al., 1997; Hanus, 1999; Mehl, 1999]. These kernel languages are of-

ten tailored to the application in hand. In contrast, the



Zcalculus uses a more general

approach, since it is based on standard notions of set algebra.

An approach similar to that of



Zare the calculi for the Oz language [Smolka, 1998],

described in [Smolka, 1994a,b]. The

-Calculus, Calculus Aand Calculus Buse a lan-

guage overconjunctions and disjunctions of constraints. Basic constraints are declarations

of predicates and functions, as well as equalities and applications of predicates, and com-

mitted choice. The calculi are pure expression languages, similar to



Z. An equational

theory is given which serves to define the operational semantics modulo expression con-

gruences. As will be discussed in Chapter 4, the computation model has some similarities

to that of



Zdiffers from the Oz calculi in that it uses set algebra instead of predicate logic. It

incorporates extensional representation of disjunctive information, which the Oz calculi

do not support. On the other hand,



Zcurrently has no equivalent of committed choice,

which is important for representing explicit concurrency. Committed choice cannot be

represented as syntactic sugar in



Z. By a way of an attempt, let g

lGÓ

ÖÓ



be a

choice in



Z, where g

and g



are Boolean guards and the meaning is defined as



lÑ



lU³ ~



>

lk

dX



h



lk

S

The problem is the case where both g

and g



are true: in a committed choice, we expect

that either e

or e



is chosen nondeterministically. This kind of non-determinism is “don’t

care”, which is not supported by



Z: the calculus is, in fact, purely declarative, unfolding

“don’t know” indeterminism into sets. Thus, the property x

lÑ



encounters all

possibilities in the union of e

and e



. How choice can be incorporated into



Zrequires

further research. A more general approach than only supporting choice is aimed at, based

on temporal interval logic [B¨ussow and Grieskamp, 1997]. This will be discussed under

future work in the conclusion in Chapter 7.

Chapter 4

Computation in

Daß man nicht weiß,

Was man denkt,

Wenn man denkt;

alles ist als wie geschenkt.

(Goethe)

Models for computation in



Zare investigated in this chapter. The principal goal of

computation is seen to be the transformation of a



Zexpression into a “more” extensional

but equivalent form. The process, thereby, is incremental: given a set expression e, one

computes, for example, the equivalent expression

l^H



. Computation may then be

continued with e

(in case it is not already completely reduced) or with e



In general, the pragmatics of computation differs if it is performed “on-line” or “off-

line”1. On-line computation works by direct symbolic interpretation of expressions,

whereas off-line computation involves compilation steps, targeting at a tailored encoding

of expressions and values at execution time. In this chapter, we will have this distinction

not explicitly: all the computation models we investigate are symbolic, in the sense that

they work on expressions (or slight extensions of them). However, some models are better

suited to off-line computation and compilation, as we shall see.

Underlying every model of computation we investigate is a normalization transforma-

tion which establishes a disjunctive normal form, including simplifications of expressions

(Section 4.2 (on page 74)). The normalization already exploits most of the equational

laws given for



Zin Chapter 3. It is confluent and terminating, which is important to

make it applicable to the partial evaluation and optimization of



Zprograms at compile

time. Indeed, since it terminates, it has to leave out laws such as the fixed-point unrolling

law.We obtain a narrowing semantics of



Zby adding to normalization some further re-

ductions, which are based on the property substitution law, membership lifting law, and

fixed-point unrolling law (Section 4.3 (on page 82)). By constraining the reduction order

of narrowing to outermost-in and strict narrowing, we obtain some interesting submodels.

The narrowing semantics is well suited to on-line computation, however, not to off-

line computation (as will be discussed). Therefore, a further model is developed, using

the style of natural semantics, and constituting



Z’s reference computation model,RCM

(Section 4.4 (on page 86)). This model defines computation by making a distinction

between (strict, functional) deterministic reduction and (logic, concurrent) indeterministic

resolution, and prepares a straight transition to the abstract machine defined in the next

chapter. The chapter concludes with a discussion of the results obtained and of related

work.

1The notions “on-line” and “off-line” are taken from the realm of partial evaluation (e.g., Consel and

Danvy [1993]).

72 Computation in







EXP



¼'½{





ikj

Fikj





CONS







UWVYX

EXP



¼'½{



¼'½»{









¼'½{





¼'½»{







jÚjz~ 





jÚjz~ 









UZVYX

EXP

÷



EXP

iäl}÷



i'ÖÊïjÈÞ



¼'½»{

÷



iälUj

÷Yi'j









CONS







UZVYX

EXP

jÈÞ



¼'½{



¼'½»{



÷



Êïj=Þ



¼'½»{





¼'½»{



÷



jÚjz~ 

÷



jÚjz~ 









UWV[X

EXP

Figure 4.1: Extensional Equality and Unequality

4.1 Preliminaries

Some preliminary definitions are given in this section: extensional equality, extensional

confluence and matching contexts.

Extensional Equality

We introduce the notion of extensional equality of expressions. This (partial) equality

identifies expressions that are built from constructor terms and union of singleton sets; it

cannot identify schemas and translations. Thus, the relation



is an approximation of

expression equivalence, such that e

lk

Öô



holds. Since it is approximative, the

negation

lk

ô Ì



does not hold. We therefore define a second relation,

extensional inequality, written as e

l÷



, which obeys the property e

l2÷

Öô Ì



Both relations are defined by the rule system in Figure 4.1.

The use of



requires that unions be bracketed to the right (cf. the definition of



Section 3.3.5 (on page 56)), and that the empty set be removed in a non-empty union.

Since equality will only be applied to normalized expressions that puts unions into such a

form, this constraint is feasible.

No attempt is made, by the given notion of equality, to identify further expression

forms. It is generally assumed that intersection is resolved before equality is tested. Sets

described by translation, complement and, in particular, by schemas cannot be identified.

Extensional Confluence

On the basis of the equality relation



, the notion of an expression relation Rbeing

extensionally confluent is defined. For such a relation, all terminating derivations that

lead to an expression in the domain of



must yield equal results under



. Thus, if

4.1 Preliminaries 73

]_^\`ba



2j

?@





UNCTX





]_^\`ba

EXP





EXP





INCTX





]_^\`ba

EXP





EXP





Figure 4.2: Matching Contexts

e R

nSnSn

R e

and e R

nSnSn

R e



, where e

l^m



×¼'½{

R, are terminating derivations, then

l×¼'½»{$j 

³×Ë¼'½»{j 

lk



. This notion of confluence reflects the

fact that as long as “symbolic” computation is done, confluence is not asserted; only if a

derivation leads to an extensional form it counts.

Matching Contexts

Asequence matching context serves for the convenient formulation of rewriting rules.

With a sequence matching context, we can match one or more elements out of a sequence

and replace them by new elements or removethem. For example, if

cdA

=öm

feC

matches some

sequence containing

and

at arbitrary positions, then

cdA?g

+öm

geC

matches the sequence

where one occurence of

is replaced by

,ö

and one occurence of

is replaced by

ge

chAig

,ö

C

is the sequence where the occurence of

is replace by

+ö

, and the occurence

is removed. Sequence matching context

]_^\`ba



(



the element type of the

sequence) are declared in Figure 4.2.

For the common case of sequence matching in combination with generalized union

and intersection, in Figure 4.2 we also define union matching contexts



UNCTX and

intersection matching contexts

j

INCTX.



eabbreviates



for some sequence

matching context

(similar for intersection).

74 Computation in



EXPB

 

ikj





x expressions

EXPD

 



c disjunctions

EXPC

 



l conjunctions

EXPL

 





a literals

EXPA

 

g

6~



¾ 

lAÒÓ

ÚÅË

atoms



éçê

6~



"×

EXPP

 



 









c properties

Figure 4.3: Normalized Expressions

4.2 Normalization

Underlying every model of computation we investigateis a transformation that establishes

a disjunctive normal form, including simplifications of expressions. Normalization is

described by the relation

« 

EXP

Applying this relation as a context rewriting establishes a disjunctive normal form as

described by the grammar in Figure 4.3. The DNF is justified by the de-Morgan laws of

the calculus. In addition to the DNF form, some further assertions hold about normalized

expressions:

Any translation that is neither hiding nor embedding has been decomposed into

an embedding followed by a hiding. Hidings are pulled, embeddings pushed and

consecutive translations contracted. Complement is pushed as far as possible –

justified by the lifting and distributivity laws of translation

Note that hiding translations cannot be completely pulled out of expressions, since

complement is not distributive w.r.t. hiding; thus, for example, the following ex-

pression is in normal form: c

 jÚj

lk



o^Ã

lGÒÓ

ÚÅ

Potential for simplification is exploited to a maximum: the only laws not consid-

ered for this purpose are membership lifting, property substitution and fixed-point

unrolling.

Schema properties are decomposed into one of the forms: pattern equality, p





pattern membership, p

c, or test for emptiness and nonemptiness,



cor



The variables appearing in the patterns p,p

, and p



must be directly bound by

the enclosing schema, and the patterns must be linear. In fact, after removing the

syntactic sugar, the forms overlap. However, for the sake of simplicity, it will be

assumed in the sequel that the property forms can be distinguished.

4.2 Normalization 75

4.2.1 Property Normalization

We take a closer look at the normalization of schema properties. The benefit of having a

schema property in the form





is that the membership lifting law (Figure 3.34

(on page 65)) can be applied as soon as cdoes not depend on variables from p, yielding

the expression c

ÒÓ

Normalization ensures that any so-called flexible appearances of variables in a prop-

erty are converted into a membership property, p

c, together with a sequence of simple

equations of flexible variables, x



. By a flexible appearance, we mean a variable x

used as in



m

 

, where Cis a positive conjunctive context built from intersec-

tions and translations, and

a context of constructor applications.

The normalization of flexible variables works as follows. Let





D



be a

schema with a positive conjunctive property as an intermediate result of normalization,

where constructor decomposition has been performed, hidings have been pulled and ab-

sorbed by the overall hiding of



, and embeddings have been pushed to the leaves.

Henceforth,





must match (modulo associativity and commutativity)



jz~

lU

lAÒÓ

lzÅ' nSnSnB@~



ÒÓ

Å¶

where the xiare flexible and where cdoes not contain variables in a flexible position.

Next, we unify the patterns qiagainst each other (if this fails, then the property reduces



already at normalization time). Let

be the resulting assignment, and let



lÒÓ wf'w



ldmSnSndn(m

ÒÓ wf'w



be an assignment for the flexible variables xi,

and q



w^'w



the unified translation target. Let x

 ì´

Æíw





ì´

Æíw

the variables in qthat are bound to the flexible variables (qcan contain further variables

introduced by embeddings). Then, we construct the schema

~wf'w



j

×ïjwf'w



ofÃ

ÒÓ j

Åx n

However, this schema might not be well-formed, since

wf'w



pmay result in a non-

linear pattern. But in this case, a simple renaming of double occurrences of variables can

be performed, adding aliases like x



for each renamed variable.

By way of an example, consider the schema

Õj



. This results in the

normalized schema



×òjx~



ÒÓ j

Å'

ofÃ

j m

ÒÓ

Åz n

As a further example, demonstrating aliasing, consider the schema

~äj

Gj

o5



. For the property one literally gets



jx~



ÒÓ j m

Å¶9~



ÒÓ j

Å'@~



ÒÓ j m

Å'9~



ÒÓ j

ÅD

Unification of

j m

with

) and

j m

yields

. The resulting schema is

~äj

äj

×jx~



ÒÓ j

Å¶





 n

In a subsequent normalization step, this is reduced to a conjunction of schemas, one hold-

ing the pattern membership, the other the alias.

2Membership elimination is not part of normalization – butthe computation models based on normalized

expressions use it as a central simplification step.

76 Computation in



6~

;Ó



EXP

Figure 4.4: Singleton Set Normalization



~ihFÓ

0

W

ÒÓ i»Å



CONS



VAR

(







ïq

~ikj

FÓ



j¢j=¬



-

ò±



ÒÓ_ikjÚj

t ¼kj¢®ÕnSn



o¢o



CONS



UZVYX

EXP



UZVYX

VAR

Figure 4.5: Constructor Normalization

4.2.2 Normalization Rules

The rewriting rules for simplification and normalization are defined in the sequel, using

the notation style of inference rules. Though this kind of notation is particularly suitable

for inductive definitions over the structure of expressions (which does not apply to the

rewriting rules below because they are not recursive in their premises), it is used here be-

cause the application conditions of rules are sometimes complex, and are better readable

when denoted in rule style.

Singleton-Set Elimination

Rule

+®

in Figure 4.4 resembles the reduction of singleton-set selection.

Constructor Decomposition

Rules

Øö

and

cbe

in Figure 4.5 handle the decomposition of singleton sets containing con-

structor terms (accordingto the lawsin Figure 3.24 (onpage 60)). Even0-ary constructors

are transformed into translations, since incompatible constructions (e.g.

~i Ö9~i



with

i"÷



) are detected by the intersection of translations (Rule

Ù®

Union

Rules

c3p

c\q

in Figure 4.6 (on the facing page) model the simplification of set union

based on Boolean laws (Figure 3.21 (on page 58)). Unions are rebracketed such that the

generalized union,



, can apply. Tautologies are reduced and duplicate elements are

removed. The “excluded middle” for singleton sets is tackled by rule

cbq

Intersection

Rules

cbr

Ù®

s

in Figure 4.7 (on the facing page) model the simplification and nor-

malization of set intersection based on boolean laws (Figure 3.21 (on page 58)). As for

union, intersections are rebracketed, tautologies are reduced and duplicate elements are

removed. The “excluded middle” for singleton sets is tackled by rule

+®

e

. Rule

+®

distributes intersection over union.

4.2 Normalization 77

lk





(

³Ó

lÑVj

h

(







EXP



m

ÏÓ





EXP





UNCTX



Ó



EXP





UNCTX

x

lk





l^m



Ó









EXP





UNCTX

{

lk





l^ mf~

S

C

Ó







EXP





UNCTX

Figure 4.6: Union Normalization

|

l





(

XÓ

lkVj



(







EXP

jA

Èm





EXP

j



INCTX

løl

j;A

Ó

j;A



EXP

j



INCTX

lÈ

lk





lUm











EXP





INCTX

lÑ





l^ mÇ¨~

d

C

Ó









EXP





INCTX

2

l2÷



;A

ldm



C

Ó









EXP

j



INCTX

A

lk



Ó

;A

Ñ

;A









EXP





INCTX

ÌïjÈÞ



EXPASS



K



A

lAÒÓ

Å¢m



HÒÓ



Ó









EXP











PAT

j



INCTX

Ì j=Þ



EXPASS







A

lAÒÓ

Åzmd~

)

d

C









EXP









PAT





INCTX

{

ÌïjÈÞ



EXPASS



K



j A

l,

lU mU~



S

C

Ó









EXP







PAT





INCTX

ì´

Æíw

ì´

ÆMw

HA

ÒÓ

6Ó

Vj

ADC

o^Ã

ÒÓ

o^Ã

ÒÓ

j



INCTX



EXP



PAT

Figure 4.7: Intersection Normalization

Rules

Ù®

+®

q

handle the intersection of translations and/or schemas which cannot

fit, because the corresponding patterns cannot be unified.

Rule

Ù®

models the lifting of hiding translations out of an intersection context (cf.

Figure 3.28 (on page 63)).

Complement

Rule

Øöq

in Figure 4.8 (on the next page) absorbs consecutive complement. Rules

Øöä®

and

ÖöDö

distribute set complement over union and intersection. Rule

Öö

distributes

complement over embedding translations that are exhaustive. Rule

Öö

distributes com-

plement over schema, adding those values that cannot match the schema’s pattern.

78 Computation in





}

Ëj=



EXP

zl

 j

lk





l 









EXP

ø

 j

lk





l 









EXP



(

ì´

Æíw

ì´

ÆMw



PATEX



PATEX

Ëj

ÒÓ

j=

ofÃ

ÒÓ



EXP



PAT



~



FÓ



H>Ëj

ÒÓ



PAT



EXP

Figure 4.8: Complement Normalization

Translation

Rules

Öö

s

and

Öö

in Figure 4.9 (on the facing page) distribute translation over set union

and intersection. For intersection, only embedding translations can be distributed; hidings

are in fact lifted as described earlier (Rule

Ù®

). Rule

Öö



reduces the translation of the

empty set. Rule

Öö

eliminates a translation by the same exhaustive pattern. (An instance

where this rule applies is a translation by the same variable, x

ÒÓ

x.)

Rule

Öö

handles consecutive translations that are contradictory, because the target

pattern of the first translation cannot be unified with the source pattern of the second.

Rule

c\e

tackles consecutive translation that can be contracted. It is only applied

if the resulting translation is a hiding or an embedding, in order to prevent a circular

application w.r.t. Rule

c\e

»®

, that splits a translation which is not a hiding or embedding

into a consecutive embedding and hiding.

Rule

c\e

'ö

reduces a translation of a schema whose pattern does not unify with the

translation’s source pattern. Rule

cbee

eliminates an embedding translation of a schema

by substituting in the schema’s property the assignment resulting from unification of the

schema’s pattern with the translation’s source pattern.

Schema

Rules

cbevp

and

c\es

in Figure 4.10 (on the next page) handle distribution of schema over

set union and intersection. Rules

c\eo

and

cbe

eliminate schemas with tautological prop-

erties.

Rule

cbeq

describes the conversionof a schema property,



D

, into apattern membership-

test, p

c(as discussed above on page 75). This rule is sketched only informally. We

assume that



D

is normalized as far as possible, and that constructor decomposition is

not applied after this rule (otherwise we run into a cycle, since a candidate for constructor

decomposition is introduced by the rule). Moreover, the fact that a renaming of double

variables in

wf'w



phas to be performed, adding corresponding alias equations, is not

formalized.

4.2 Normalization 79



lk



o^Ã

ÒÓ

Å$Ó

ÒÓ

Å'



ÒÓ







EXP



EXP



ì´

Æíw

ì´

ÆMw

lk



o^Ã

ÒÓ

Å$Ó

ÒÓ

Å'



ÒÓ







EXP



EXP





lGÒÓ

BÅÓ









PAT



{

PATEX

ÒÓ

ÅÓ



EXP



PAT



Ìïj=Þ



EXPASS







(

lGÒÓ

ÚÅ

(

ØÒÓ

BÅ$Ó





EXP









PAT

(}







(

ì´

ÆMwñj:w^'w



ì´

Æíw jwf'w



ì´

Æíw jwf'w



ì´

ÆMwñj:w^'w



lGÒÓ

ÚÅ

(

ÖÒÓ

ÚÅÓ

wf'w

lAÒÓ wf'w



BÅ



EXP









PAT





EXPASS

(

ì´

Æíw

ì´

ÆMw

SHÌ

ì´

Æíw



ì´

Æíw



j j

Öì´

ÆM½¶ÆM¼'yÆj

ì´

Æíw

l%

ì´

Æíw



o o

lAÒÓ

ÚÅ$Ó

lGÒÓ

ÒÓ

ÚÅ



EXP







PAT

(

ø

Ìïj=Þ



EXPASS







l)



ÖÒÓ

(

¢ÅÓ





EXP







PAT

((





S

ì´

Æíw



ì´

Æíw

(

l,



ÖÒÓ

(

BÅÓ

~w^'w



(

wf'w







EXP







PAT





EXPASS

Figure 4.9: Translation Normalization

(D2



lk

SYÓ



lUG@~



S



PAT







EXP

(



l

dYÓ



l^H@~



S



PAT







EXP

(

Ñ;Ó





PAT

(



;Ó

ÒÓ



PAT

(

{

ì´

ÆMw









lGÒÓ_wf'w



ldmSnSnSnhm

ÒÓ_w^'w



¨ì´

Æí½'Æí¼'yÆj

ì´

ÆMwñjwf'w







ì´

Æíw



o¢o





Jj;A

lU

lAÒÓ

lzÅzmSndnSnWmd~



ÒÓ

;Ó

~w^'w



äj

×ïjwf'w



Jj;ADC

o¢o^Ã

w^'w



lGÒÓ j

Åx



PAT





INCTX





EXPASS



UZVYX

VAR

Figure 4.10: Schema Normalization

4.2.3 Properties of Normalization

The relation

has some properties that will be investigated below. Let



the lifting of

to a context rewriting, such that e





iff there exists a context

and subexpressions e



such that e





, and e







80 Computation in



PROPOSITION 4.1 The relation



is sound w.r.t. the meaning of





Proof (informal)

. The rules are directly derived from the equational laws.

PROPOSITION 4.2 The relation



is terminating.



Proof (informal)

. The rules of

can be characterized in terms of three classes:

elimination rules which decrease the size of their operands; distributivityrules which push



Zoperator down into an expression, thus possibly duplicating some operands; and

lifting rules which do the opposite of distribution. The elimination rules are obviously

safe as regards termination. Distributivity and lifting rules are safe if they exclude each

other: what has been distributed must not be lifted to its original form in a later rewriting

step. There are two problematic cases here:

Rule

c\e

composes and rule

cbe

»®

splits translations. But the application conditions

have been chosen such that composition is only performed if splitting cannot be

applied to the result.

Rule

cbe

decomposes patterns and rule

c\eq

synthesizes them. Preventing a cycle

in the application of these rules is not been formalized (because Rule

cbevq

is left

informal) but it is easily possible: since the effect of

c\eq

is localized to a schema’s

property, normalization can be sequentialized. First the rule system is comprehen-

sively applied without using Rule

c\eq

, then in a second pass

c\eq

is comprehen-

sively applied.

PROPOSITION 4.3



is confluent w.r.t. extensional equality.



Proof (informal)

. Since we can assume soundness, the problem reduces to showing that

for each derivation that leads to an extensional value (one which is in the domain of



each alternative derivation also leads to an extensional value. The sources for increasing

extensionality in a derivationare elimination rules. The question is whether the possibility

of applying some of these rules is obstructed if we prefer applying another rule before a

possible elimination step:

The Boolean elimination rules cause no problems here. A union or intersection con-

text is either reduced to a tautology, which is the “best” result that can be achieved,

or a duplicate operand is removed, still allowing elimination in its context with the

remaining operand.

The application of distributivity and lifting rules actually increases the potential for

simplification, so there is no problem if one prefers distributing before a possible

elimination step.

On the basis of Proposition 4.2 and Proposition 4.3, we can assert the existence of a

total function

u'ÆÈ{

which normalizes and simplifies an expression. For nonextensional

4.2 Normalization 81

results of normalization, where confluence cannot be asserted, some arbitrary but fixed

reduction order is assumed:

u'ÆÈ{



EXP



EXP

PROPOSITION 4.4 The function

u'Æ{

puts an expression into disjunctive normal form, as

specified by the grammar in Figure 4.3.



Proof (informal)

. In general, establishing a disjunctive normal form is a standard pro-

cess. Our rule system differs from this only in that translations need to be treated, and

properties of schemas must be normalized. The decomposition of any nonhiding and

nonembedding into an embedding followed by a hiding, and the strict distribution of

embeddings and lifting of hidings, ensures that translations are normalized as required.

Establishing the normal form for properties was justified on Page 75.

PROPOSITION 4.5 There exists an innermost-out reduction strategy, an inductive algo-

rithm over the expression structure, which implements

u'ÆÈ{



Proof (informal)

. The algorithm is the usual one for establishing a disjunctive normal

form. For example, for distributivity of intersection over union, we can give an inductive

rule in the style of:

jA

fC)



j;A



yC)











A

lk

zC) 

In general, distributivity and lifting rules drive the algorithm. Elimination rules are ap-

plied at each expression depth (here depicted by e





before the final result

expression is returned.

82 Computation in





j A



¾ md~



C





u'Æ{Yj

#

j A

w^'w jx{



Uwf'w¾j

o¢o

¾ md~



C

oÚo





EXPCTX







INCTX



EXPB







EXPP



PAT









S

JjHA



W md~



lÒÓ

ÚÅ





u'ÆÈ{;j

#

JjA

z~wf'w



wf'w jz{

L

dwf'w¾jwf'w



lUm

oÚo

¾ m



lÒÓ

BÅ

o¢o





EXPCTX







INCTX



EXPB



i



EXPP







PAT





EXPASS



(

ì´

ÆMw



ì´

ÆMw



9jx~









u'ÆÈ{;j

#

ÒÓ

oÚo





EXPCTX



PAT



EXPC



assert continuity of d

9j:é%ê





u'Æ{Yj

#

¿jwf'w2jz{

L

Uw^'w¾j

mÚéçê

o¢o





EXPCTX



PAT





EXPD

Figure 4.11: Narrowing Rules

4.3 Narrowing Semantics

In normalized expressions nearly all the equational laws of



Zare exploited. The laws

not used are property substitution,membership lifting and fixed-point unrolling. Consid-

ering the normal form of expressions, these are the laws driving reduction: with property

substitution, variables are bound; with membership lifting, schemas are eliminated; and

with fixed-point unrolling, a circular expression is lifted one level upwards in its con-

text. Applying each of these steps may provide further simplification and normalization

potential.

We will thus define the narrowing semantics of



Zas consecutively applying sub-

stitution, membership lifting and unrolling in an expression context, and afterwards re-

normalizing the entire context. The strategy for selecting the next redex can thereby be

altered. In the general case, where redex selection is arbitrary, we conjecture a complete-

ness result w.r.t.



Z’s equational theory. A possible restriction is an outermost-in strategy.

A feasible restriction is “strict narrowing”, which is not complete, but more adequate for



Zthan outermost-in, as will be discussed.

4.3.1 General Narrowing

The relation





represents a narrowing step:







EXPB

The rules for





, using an arbitrary expression context

, are given in Figure 4.11.

Rule



implements property substitution for the case of a plain intersected singleton set;

rule



does so for a translated singleton set. These rules are direct applications of the law

found in Figure 3.35 (on page 66). Rule

He

realizes membership lifting (cf. Figure 3.34

(on page 65)). Rule

Hp

implements fixed-point unrolling (cf. Section 3.4.7 (on page 66)),

where continuity of the expression dneeds to be asserted.

4.3 Narrowing Semantics 83

We investigate some relevant properties of





: soundness, extensional conflu-

ence, and extensional completeness.

PROPOSITION 4.6 The relation





is sound w.r.t. the meaning of



Z, modulo the

assertion that in each appearance of the fixed-point operator, the fixed expression is con-

tinuous.



Proof (informal)

. Normalization is sound. The additional rules of





are directly

derived from the equational laws of the



Zcalculus.

PROPOSITION 4.7 The relation





is confluent w.r.t. extensional equality.



Proof (informal)

. We can assume extensional confluence and termination of normaliza-

tion as well as soundness. As with the argument for confluence in Proposition 4.3 (on

page 80), the question is whether indeterminism in the reduction order obstructs the po-

tential for obtaining an extensional value in a derivation. This is excluded because each

individual rule in fact increases potential for simplification, regardless of the order in

which they are applied: rules



and



instantiate variables, and Rules

He

and

Hp

lift

expressions out of their isolated context of a schema property into an intersection context,

where further interactions are enabled.

CONJECTURE 4.8 The relation





is extensionally complete w.r.t. the equational

theory of



Z: whenever e

and e

is extensional, then there exists an e

Ô Ô



, such that

u'Æ{





Ô Ô



A formal proof of this conjecture is omitted. In fact, the proof has been indirectly provided

by construction of normalization and narrowing, providing a strategy for the comprehen-

sive, directed application of



Z’s equational laws:

The normalization of translations – hidings are pulled, embeddings pushed – en-

ables maximal elimination of them.

All possibilities for applying property substitution are detected.

The normalization to DNF, together with constructor decomposition, enables all

simplifications in intersections.

The normalization of schema properties recovers all possibilities for applying the

membership lifting law.

Complements, which hinder the application of property substitution and other laws,

are pushed as far as possible. The “excluded middle” for singletons is not obstructed

by this. Consider

H¨~



× ~

» n

Using membershiplifting before pushing complement yields

G>~



, enabling

the application of excluded middle. On the other hand, pushing the complement

over the schema yields in

H9~

j

×F~



 n

84 Computation in



However, now property substitution can be applied, resulting in

H9~

E j

×F~



 m

which likewise reduces to



after some steps.

General narrowing is so powerful because the reduction order is undetermined. Since

all orders of reduction are possible , the “right” one (which terminates if termination is

possible at all) is also contained in





. However, this also means that there are

many derivations leading to a non-terminating “dead-end”, and thus general narrowing is

not very feasible for implementing computation.

4.3.2 Outermost-in Narrowing

The narrowing semantics defined in the previous section can be modified by supplying

an outermost-in strategy for selecting the next redex. In order to model “parallel-or” and

“parallel-and”, indeterminism still needs to be present in the reduction order, but it can be

significantly pruned.

The selection of an outermost-in redex is described by a context, declared as



OUTEREDEX



EXP



EXP and specified by the following “grammar with a hole”:





 



AY



C



j



AY



C

 Ëj

[



6j

[



ofÃ

lAÒÓ

BÅ $Xj

[



³

The redex selection is indeterministic in the choice of the member of an intersection or

a union, which models the “concurrency” of reduction by interleaving. Note that we do

not select subexpressions in constructor terms, schema, and singleton set: this reflects the

non-strictness of this order.

By replacing the expression context

in the narrowing rules



p

by a context



, we get an outermost-in narrowingstrategy. We conjecture that this strategy is “shallow

complete”: for forced expressions it obtains the same results as general narrowing. A

forced expression is one captured by the context



during a derivation sequence. For

example, if we have initially

~äj



H@~



ÒÓ j m

then cwill be protected by the singleton set, but after substitution we get

~äj



H@~



ÒÓ j m

which is then transformed by membership lifting to

ÒÓ j

Å¶9~



ÒÓ j m

such that cis now captured by



. In an actual implementation of this strategy, one would

use sharing techniques such that the reduction of the bubbled cin the translation would

also rewrite the original cin the singleton set – yielding a “lazy” reduction strategy.

The problem with this strategy is that, by contrast to functional logic languages which

base on lazy narrowing [Hanus, 1994], in



Zequality and unequality is required for sim-

plification rules concerned with sets of sets – a further possible source apart from the

4.3 Narrowing Semantics 85

redex selection for forcing reduction. For example,

~'~

l^ H9~'~

d 

reduces (by normal-

ization Rule

+®

) to



, if e

and e



reduce to an extensionally unequal value. Should we

force the reduction of elements of singleton sets in an intersection, such that extensional

equality or unequality might be applicable? The problem is thus to transparently define

when values are forced. For this and other reasons, we prefer another strategy, called

“strict narrowing”.

4.3.3 Strict Narrowing

The basic decision in strict narrowing is to extend the selection of a redex to the elements

of singleton sets, such that we get the following refined redex definition compared with

outermost-in narrowing:

V

v



A[c

@

C



j

A[c

@

C

 Ëj

V

Xj

9

o^Ã

lAÒÓ

ÚÅË5³j

V

6~

@ 

By replacing the general expression context

with

in the narrowing rules, and by fur-

ther demanding that the property substitution rules,



and



>ö

, are only performed if the

element of the singleton set cannot be further reduced by





, we obtain a strict nar-

rowing strategy. This strategy is still indeterministic because of the undetermined choice

of which operand in union and intersection is to be selected next. It is not completely

identical to what is usually described as “innermost-out” narrowing [Hanus, 1994], since

we still do not look inside of schemas. Rather, it is related to applicative order reduction

and weak head-normal form in the

-calculus [Barendregt, 1984].

Since, by constructor decomposition, expressions of the kind

~ikj



are normalized



ÒÓ ikj

, it is useful for reasons of transparency to also make constructor appli-

cation strict. We therefore add to the above definition of

contexts that select operands

of constructor application (we do not formalize this).

Strict narrowing will be our operational semantics of choice, and the computation

model developed in the next section will be based on this strategy. Its restriction is that

diverging reduction of elements of singleton sets which would be not “forced” in lazy nar-

rowing propagates to the context. However, if in this context other reductions determine

the result, as in

diverges

G

reducesToEmpty, then the nontermination will be caught by

the concurrent reduction model.

86 Computation in



4.4 The Reference Computation Model

In this section, the reference computation model of



Z, called RCM, is described, using

the style of natural semantics [Kahn, 1987]. The intention is to obtain a model that is ab-

stract enough to still support a conceptual idea of computation in



Z, but also near enough

to an implementation to serve as a starting point for the design of an abstract machine for

“off-line” computation. The strict narrowing semantics given in the previous section is not

well suited to this purpose, since it is based on the symbolic manipulation of terms and

requires renormalization of complete expression contexts after each narrowing step. In

the model given here, substitution is to be replaced by environments and renormalization

by techniques which can be mapped to backtracking and search.

In the previous section, we have already discussed why for conceptual reasons we

prefer a strict semantics for computation. The model designed here has a few further

restrictions, driven by the goals of efficient implementability and transparent traceabil-

ity of execution. The last point is particularly important for the application to test-case

evaluation, where the reason for execution failure needs to be traceable by humans. The

following restrictions will be imposed in comparison with the strict narrowing semantics:

In principle, the strict narrowing semantics provides “symbolic computation”. An

expression is irreducible if it cannot be further reduced by





, but such an

irreducible expression may still be processed, for example, by substituting it in a

property.

In the RCM of



Zdefined here, expressions need to be reducible to a value. If

this value form is not reached, and the expression is needed to continue computa-

tion, then computation will stop. The restriction to value forms allows an efficient

representation in an abstract machine, and makes the sources of failure traceable.

The strict narrowing semantics given in the previous section allows arbitrary “paral-

lel or” and “parallel and” reduction. In an implementation, this would mean that any

subexpression of union and intersection has to be computed by concurrent threads,

which causes significant efficiency problems.

The RCM presented here makes a distinction between deterministic, “functional”

reduction and indeterministic, “logic” resolution. The deterministic reduction is de-

signed using the residuation technique [Ait-Kaci et al., 1987], which lets a compu-

tation suspend as soon as it encounters a free “logic” variable. The logic resolution

uses parallel execution techniques.

Reduction takes place for evaluating set expressions and plain expressions. Reso-

lution is done when a reduced set value vis tested for membership, p

v. “Parallel

or” is not provided – disjunctions are executed from left to right, thus enabling

backtracking techniques for their implementation.

For defining the RCM, we first specify values, a subset of expressions. Though values

are still represented symbolically, we will later see in Chapter 5 that they correspond to

an efficient representation in an implementation. We then define a set of inference rules

which describe computation.

4.4 The Reference Computation Model 87

EXPV

 

ikj

G~



3



l

.





5×

CTR

 



k





lkÄ

G

S-



lkÊ

G

S

|7×

VALASS

}

VAR

EXPV



å



CTR



CTR





?@

VALASS



@

EXPP





CTR

ì´

Æíw



CTR





VAR





VALASS



CTR



~

lkÊ



S

oÕ

j|





l

ÊVj|





S



~

lkÄ



S

oÕ

j|





l

ÄVj|





S



j|



k

o 

j|}Ï|

oHÃ



k

ÍÏÎ

¸'Ð¸

|ñmB|



VALASS



5

lUm

z

S



CTR





EXPP

wzw



CTR

VALASS

wzwWj



lÊ

G

S

oÖ ´

wzw

¡

l%

w¢w

¡

S

wzwWj



lÄ

G

S

oÖ ´

wzw

¡

l%

w¢w

¡

S

wzwÕ|



k



ÍÏÎ

¸'Ð¸



VALASS



5

lUm

z

S



CTR





EXPP

UVCTX

IVCTX



2j

@

EXPV



EXP

 m ÷



ñj

EXP



VALASS



EXP

 m ÷



ñj

EXP



VALASS



VALASS



EXP

Figure 4.12: Values and Constraints

4.4.1 Values and Constraints

Values are a subset of normalized expressions, enriched by a tailored representation of a

schema-like construct, called an intension. An intension consists of a pattern and a con-

straint. A basic constraint,



k

, is a value assignment paired with a property expression.

Constraints may be composed by conjunction and by disjunction, though we expect only

conjunctions in intensions (disjunctions are needed later on). The definitions, together

with some auxiliary functions on constraints and values, are given in Figure 4.12.

An intension



|



¢k¡

can be interpreted as the schema

¶w^'w)|

£k

¾

. However,

there are a few differences. First, it is possible that the above expansion results in an

infinite term, since

may contain an assignment referring to the intension itself. This

circularity of assignments is used to express fixed-points in the computation model. Sec-

ond, the property

may contain free variables which are bound neither by pnor by

These variables may be considered “locally existential” for the intension, and are used to

represent hiding translations by intensions. When an intension is instantiated in a context,

then these variables need to be renamed away from the variables of the context.

Some auxiliary functions for working with constraints are introduced. We write





for a conjunction of constraints, and



for a disjunction. For a sequence of basic con-

straints,





is written. The set of free variables used in a constraint can be retrieved by

ì´

Æíw

>





describes the propagation of an assignment into a constraint, and distributesover

88 Computation in



conjunction and disjunction. For a basic constraint, we have



j|



¡k

o

j:|}Ï|

oÕÃ



Here, propagation is only defined if

|}Ï|

yields a proper function. This is only the case

if, in their common domain, both assignments map to the same value (or the common

domain is empty).

Given a constraint, its overall assignment is selected with the function

w¢w

+

. The

assignment

w¢w

¡

is only defined if the union of the assignments of the subconstraints

yields a proper function.

Some auxiliary functions for working with values are required. Value union and inter-

section contexts,



UVCTX and



IVCTX, are a special case of general union and

intersection contexts, UNCTX and INCTX, but expect the sequence elements to be values.

In Section 4.1 (on page 72) we defined a (partial) extensional equality and extensional

unequality. These definitions carry over to values. As schemas can not be decided as

equal or unequal, intensions cannot. On the basis of value equality and unequality, a uni-

fication for “extended patterns”, built from constructor application, variables and values,

and written as p

l 





, is assumed. This unification uses



for comparing values

found at the bottom of constructor terms. The form p

l÷





decides whether unification

fails (using

÷

). The notation p

l)





“refines” the assignment

to the assignment

by the result of unifying

wf'w)|

with

wf'w)|



: similarly p

l5÷





for the failure

case.

In principle, more powerful notions of unification can be embedded in our computa-

tion model (e.g. set unification). This would require some technical modifications to the

rules given in the next section to deal with sets of unifiers returned by unification instead

of a single unifier.

4.4.2 Computation

Computation is defined by two relations, which are mutually dependent. The first relation,



Ó

¤

, describes a strict reduction step on expressions under the value assignment

. It

cannot be applied to unbound variables, which means, in the context of its usage, that the

corresponding computation “suspends”. The relation e



Ó

¤

has indeterminism, but it

is “don’t care”: an implementation can choose some arbitrary fixed order to implement

reduction.

The second relation,

¥§¦¨

, describes a resolution step by mapping a constraint

onto a constraint.



is expected to be in disjunctive normal form, and



will be, too.

The indeterminism in this relation (provided it does not result from recursive application

of e



hÓ

©¤

) is “don’t know”. Thus, an implementation has to provide a concurrent

(interleaving) application strategy.

ÑÓ

¤



2j

EXP



VALASS



EXP

§¦



CTR

We expect that, before computation starts, expressions will be normalized to disjunc-

tive normal form, as specified by the function

u'ÆÈ{

ein Section 4.2. However, during

computation, no renormalization is necessary. Thus normalization can be done at “com-

pile time”. In contrast to the narrowing semantics, where simplification is an intrinsic

part of normalization, the strict computation model does not actually depend on initial

4.4 The Reference Computation Model 89

l



Ó

ª¤



Ó

ª¤!c



EXP





VALASS



SREDEX

Figure 4.13: Strict Context Reduction

S

×9¼'½»{"|



hÓ

|



VAR





VALASS

¤(

ì´

Æíw

ì´

ÆMw

¿¼'½»{9|







Ó

¤«

|







PAT





EXP





VALASS

¤2









hÓ

¤



Ó

ª¤

nSndn



Ó

¤

é%ê



hÓ

©¤



PAT





EXP





VALASS





EXPV

Figure 4.14: Variable, Schema, and Fixed-Point Reduction

simplifications. The essential conditions are that properties of schemas are normalized,

as described in Section 4.2 (on page 74), and complement is pushed.

Expression Reduction Rules

Rule

G®

in Figure 4.13 describes an inductive reduction step in a strict expression context,

SREDEX

EXP



EXP. A strict context

is specified as with the strict narrow-

ing semantics. As explained above, the freedom of choice in selecting a strict redex in

expression reduction amounts to “don’t care” indeterminism instead of “don’t know”.

The rules in Figure 4.14 model the step where an expression is converted into a value.

Rule

ñö

describes the substitution of a variable, which will only be possible if the variable

is bound by

. This is one source of “suspension”. Rule

¬e

reduces a schema to an

intension. The rule is only applicable if free variables of the property are either bound

by the pattern or by the substitution – i.e. variables from outer scopes are not allowed

to be free. This is the other source of “suspension”. Rule

¬p

describes the construction

of a fixed-point. The rule is cyclic in its premise, forestalling the resulting value vin an

assignment

which is used to calculate this value (the usual technique used in natural

semantics for constructing fixed-points). A finite number of reduction steps is required to

map the expression eto a value v: the expression emay contain several schemas, and the

value form is not reached until all these schemas have been converted to intensions.

The rules in Figure 4.15 describe the reduction of translation and complement. Rule

¬s

converts a translation into an intension. It exploits the possibility of using existential

variables in an intensions constraint: p

may contain variablesnot bounded by p



. Rule

¬o

describes the translation of a complement into an intension.

The rules in Figure 4.16 (on the next page) describe the evaluation of set operators

lGÒÓ

BÅ



ÑÓ

¤«





lÕ×





EXPV









PAT





VALASS

fresh





Ó

¤«









EXPV





VAR





VALASS

Figure 4.15: Translation and Complement Reduction

90 Computation in





È



ÑÓ

¤®j



ADC





VALASS





UVCTX

{

lk





lUm



C



hÓ

©¤®j









EXPV





VALASS





UVCTX

®A

È

¯C



hÓ

¤

)





VALASS

j



UVCTX

l

l÷



®A

l^m





Ó

¤

}







EXPV





VALASS





UVCTX

løl

lÑ



®A

ldm



C



hÓ

¤ j

¨A







EXPV





VALASS





UVCTX

lÈ

lA





exvs



ì´

Æíw

/

Sñæ

ì´

ÆMw

S

exvs





fresh

j®A?

l)

l

l

?

@

}



S

fC



Ó

¤¨j°A?

=wf'w)|

l)|



~

lkÄÉj





S

o¢o

C







PAT







CTR





VALASS

exvs





VAR





VALASS





IVCTX

l

l÷





®Ai

l,

v

l



m







S

fC



Ó

ª¤

}







PAT







CTR





VALASS

j



IVCTX

l

2

j®A

lk





Ó

¤°A

k

®A



C







EXPV





VALASS

j



IVCTX

Figure 4.16: Set Value Reduction

l

fresh





Ó

¤

)





EXPV



VAR





VALASS

l

§§¦d



z



Ó

¤



z



VAR





VALASS



CTR

l



q



¼'½»{ |

×9¼'½»{$j |





¼'½»{ |

 |





o¢o



Ó

|®



VAR





VALASS





UWVYX

VALASS

Figure 4.17: Mu-Value Reduction

on set values. Rules

¬

¬q

describe the elimination of tautologies in a union of set

values; Rules

¬r

G®D®

for the intersection of set values. Rules

G®ö

and

H®

e

model the

combination of intensional values by intersection: if the patterns can be unified, then we

simply concatenate the constraints of the intensions, making local existential variables

disjoint by a renaming

into fresh variables. If the patterns do not unify, the intersection

reduces to



. Finally, Rule

G®

p

describes the distribution of intersection over unionvalues.

It should be noted that, in an implementation, intersection can be realized more efficiently

as suggested by the rules: given a total order on values and set values in a normalized

form



Ö









, the intersection of two such values can be realized by processing

the extensional part, v, in linear time, and then the combinations with the intensional part.

The remaining rules in Figure 4.17 describe the handling of the



-operator. Once

the argument of



has been reduced to a value v, the elements of vare enumerated by

resolving a constraint x

v. The state of this subresolution is stored in an intermediate

expression construct,



z

. When the subresolution is in solved form,

j |



, a

4.4 The Reference Computation Model 91

£§¦



j |



=³

A~vC



§¦

j |



=³





CTR



UZVYX

CTR





UZVYX

VALASS

d

A

|

ä



§¦





VALASS



UWVYX

CTR

¦(



[chA

|

ä

§¦

ä





VALASS



´fµ¶·

CTR

¦y2



[chA

|



§¦



[cdA~C





VALASS



´fµ¶·

CTR

©§¦













YcdA~vC

§¦

wzw(j



Ö®





[cEA 

Ö®

C

mSnSnSn(m

wzwWj







[chA 



CTR



UWV[X

CTR



´fµ¶·

CTR



Figure 4.18: Compound Constraint Resolution



Ó

¤



§¦







VALASS



EXP



PAT



Ó

¤





§¦









VALASS



EXP

 





Figure 4.19: Expression Reduction in Constraints

unique assignment for xis extracted, if it exists.

Constraint Resolution Rules

Compound constraints are resolved by the rules in Figure 4.18. Rule

ñ®

describes the

left-to-right resolution of disjunctive constraints, Rule

¾ö

the elimination of unsolvable

constraints. Rules

¹ne

and

¹ºp

describe the elimination of unsolvableand solved constraints

in a conjunction. Rule

¹ns

describes a resolution step of a basic constraint in a conjunction,

where S

A~C

is a sequence context. This rule models the concurrent execution of constraints

by “interleaving”. An arbitrary basic constraint is selected (remember that constraints are

in DNF) and resolved one step, possibly yielding a disjunction. Maintaining the DNF by

pushing the conjunctive context

over each member of the disjunction corresponds to

backtracking in an implementation. The assignments of the conjuncts,



i, are propagated

over

, such that a steady refinement of the assignments is guaranteed.

The rules in Figure 4.19 describe the reduction of expressions in basic constraints.

These rules are only able to fire if all free variables in eare bounded by

– otherwise



Ó

¤

is “suspended”.

The rules in Figure 4.20 (on the next page) describe the resolution of pattern mem-

bership in readily reduced set values, and resolution of equality constraints. Rule

¹ºq

decomposes the membership test on a union of values into a disjunctive constraint. This

is the only way basic constraints can rewrite into disjunctions, thereby introducing the

need for backtracking. Rules

¹nr

and

ñ®q

describe a basic resolution step, reducing a

membership test on a singleton set to an assignment or a failure. Rules

ñ®D®

and

2®ö

describe the resolution of a membership test on an intension. Variables used in the inten-

sion’s pattern and constraint are renamed by

to fresh ones. Rules

2®

and

ñ®

p

model

the resolution of a pattern equality.

The remaining rules in Figure 4.21 (on the following page) describe the resolution of

92 Computation in



{



×òj

lk



§¦

Fj:|



Ê j|









VALASS



PAT







EXPV







×F~



»§¦







VALASS



PAT



EXPV

÷





× ~



§¦

ä





VALASS



PAT



EXPV

Sløl

ì´

Æíw

>

Õ

ì´

Æíw



fresh







w^'w









¼§¦







VALASS



PAT



CTR





VALASS

SlÈ

ì´

Æíw



fresh



÷



w^'w







§¦

ä





VALASS



PAT



CTR





VALASS

lG











½§¦







VALASS







PAT

2

l÷











§¦

ä





VALASS







PAT

Figure 4.20: Membership and Equality Resolution

fresh





§¦











VALASS



EXPV



VAR

 





§§¦d





z

¦£§¦





z





VALASS



VAR



CTR

 





l





A

|





oÚo

¦£§¦







VALASS



VAR



UZVYX

CTR

 





{





mÚ|

ä

¦¾§¦



compls





VALASS



VAR



UWV[X

CTR

 





Figure 4.21: Test-Emptiness Resolution

set tests,



sv, where the tested set has been reduced to a value. Similar to the treatment



in expression reduction, a subresolution is started which enumerates the elements

of the set vby resolving the constraint x

v. The state of this resolution is stored in an

intermediate property expression,



z

, where sis either



, depending on the pole

of the test. The test succeeds (or fails, depending on s) if at least one constraint can be

solved; it fails (or succeeds) if no constraint can be solved (which requires enumerating

the entire set).

4.5 Discussion 93

4.5 Discussion

We have developed several models for computation in



Z. Further points and related

work are discussed below.

Normalization and Simplification

Normalization has been defined as a prerequisite for any kind of computation, and will

be also required for compilation in the next chapter. Applying normalization ensures that

negative information is distributed as far as possible to the leaves of expressions, enabling

resolution techniques in the lifted positive contexts. Local resolution can be used in the

remaining (minimized) negative contexts. Universal quantification, which is represented



Zas the combination of complement and hiding, is, for example, solved by local

resolution.

A well-known problem of normalization toward a disjunctive normal form is the ex-

plosion of the expression size. This is one reason why we interleaved normalization

with simplification: pruning this explosion early. As a rule of thumb, “sensitive”



expressions do not explode if simplification is interleaved with DNF production. How-

ever, pathological cases might still cause serious problems. Temporary variables may be

used to avoid the explosion of the DNF. For example, the expression e

Vj

lÑ



can be

normalized to the form:

j»~äj







V~

lk

S

ÒÓ j m

ofÃ

ÒÓ

With this technique, expression sharing can also be preserved, and new sharing poten-

tials can be exploited using common subexpression elimination. However, the technique

also has its disadvantages: the number of constraints increases and the introduction of

new temporary variables implies administrative overhead for binding and accessing these

variables.

An alternative approach for avoiding explosion of expression size on normalization

is to use directed acylcic graphs (DAGs) with optimal sharing for representing expres-

sions. Structurally equal expressions do not appear twice in such a DAG. It is known

e.g. from symbolic model checking [McMillan, 1993] that, for boolean algebras, quite

efficient algorithms for constructing and maintaining this representation exist. However,

further experiments are required, particularly regarding the treatment of the non-Boolean

operators of



Z, like translation,



-selection, etc.

Narrowing

Narrowing can be elegantly expressed in



Z’s framework by pure expression transforma-

tion, combining normalization and simplification with substitution, membership lifting

and unrolling. This is a result of the “higher-orderness” of the



Zcalculus, which allows

us to represent “meta” information, like the set of (partial) solutions, by expressions of

the calculus itself.

In rule based functional logic languages, solutions are represented as substitutions

on the meta level. Here, narrowing is defined as follows (see Hanus [1994] for a sur-

vey): given a (nonground) goal expression, a subexpression is selected which unifies

94 Computation in



with the left-hand side of some rule under a minimal substitution, and is replaced by its

right-hand side instantiated with the substitution. The substitution is then applied to the

entire expression. At a first sight, this looks different from what we call narrowing in



Z. However, the same elements are, in fact, present as a special case in



Z. The “rules”

belonging to a function or relation symbol are simply represented as a union of schemas,



l}

lUG nSndnÇ9~





. A “goal” is then merely the intersection of Rwith some

(partial) data binding, represented by a (translated) singleton set: R

@~



lAÒÓ

BÅ

. Rule

“selection” is represented by distributing the singleton set over the union Rand perform-

ing property substitution. Each member of the resulting union is part of the explicitly

represented search space, which, in narrowing for rule-based languages, is coded in the

nondeterminism of the narrowing relation.

As e.g. demonstrated by Hanus [1994], modifying the selection strategy for the next

redex in narrowing is a powerful tool for investigating variations of computation. This

technique has been applied to our version of narrowing in



Zas well, defining general

narrowing, outermost-in narrowing and strict narrowing. Strict narrowing is the oper-

ational semantics of choice for



Zbecause outermost-in narrowing lacks a transparent

definition of forcing in



We have not formalized a strategy for needed narrowing [Antony et al., 1994], which

can be considered “state-of-the-art” for narrowing in functional logic languages, since it

yields certain optimality results regarding the length of derivations. Needed narrowing is

expressible in our framework. Consider a goal like

jx~

l6

lUG nSnSnB@~





9~



needed narrowing would mean that before distribution we force eas far as required by

every pi–i.e. to the shallow structure of the most special pattern, where each piis a spe-

cialization of. In the worst case, this pattern is just a variable, so that no forcing happens.

Actually, we have “definitional trees” [Antony,1992], as used for defining needed narrow-

ing, directly in the calculus, because we have set union and schemas. However, it seemed

more important to support extensional equality, which obstructs a transparent notion of

forcing for needed narrowing. We should note that extensional equality and unequality is

only required to deal with sets of sets – for example for reducing

~'~

lU G@~'~

S 



if e

and e



reduce to an extensionally unequal value. If



Z’s computation model would

be restricted to not use extensional equality and unequality, as it is the default in other

frameworks, lazy or needed narrowing could be easily adapted.

Reference Computation Model

The reference computation model given in Section 4.4 (on page 86), using the technique

of natural semantics, is a trade-off between computation power and execution efficiency.

It prepares a straight-forward transition to an implementation by the abstract machine

ZAM, as shown in the next chapter. The model is even more strict than in strict narrowing,

using a combination of functional reduction by residuation [Ait-Kaci et al., 1987] and

logic resolution. The justification for this decision is, on the one hand, easier traceability

of execution by humans, which is important for the application to test data evaluation,

and, on the other hand, a strongly assumed better overall performance. The last point

draws on experience with functional programming and language implementation: the

theoretical result of optimality of lazy or needed reduction does not coincide with the

practical experience, because the overhead of laziness has to be paid for significantly,

4.5 Discussion 95

which often results in that programmers use strictness annotations [Hartel et al., 1996].

The RCM is related to the one described by Hanus [1997] for Curry. Set values in

our semantics,













, are comparable to the “definitional trees” used here

and in work about narrowing. However, since set values are first-order inhabitants of the

operational domain, which definitional trees are not, they can be combined by intersection

and union at computation time, and, moreover, augmented by extensional data. This is

important in the realm of set-based programming, where sets, relations and functions are

often used for representing data structures, which are incrementally constructed.

Unlike the model of Hanus [1997], subresolutions are supported, which are invoked

from expression reduction for the



-operator and for dealing with set tests. This tech-

nique corresponds to “deep guards” and “encapsulated search”, as found, for example,

in the computation model of Oz [Smolka, 1994a] or

log



[Dovier et al., 1996]. Our

computation model has some similarities to the one given for Oz by Smolka [1994a].

The so-called “computation spaces” in this work, which represent a constraint store, are

given in our model as a conjunction of properties paired with assignments. Writing to

the constraint store is equivalent in our model to a resolution step, which updates the as-

signments. A subspace is opened when a subresolution is started for the



-operator or

emptiness-tests. However, as discussed in the previous chapter, there is no equivalent to

committed choice in



Zand its computation model.

Computing More

The computation models we have presented are not the end of the story. There are three

major potential ways of increasing computation power which are not considered here:

higher-order unification (or narrowing), set unification and subset logic programming.

Higher-order unification [Snyder and Gallier, 1989] is used in theorem provers such

as Isabelle [Paulson, 1994] for the highly generic formulation of inference rules. Higher-

order narrowing is proposed as an extension for functional logic languages [Prehofer,

1994; Hanus and Prehofer, 1999] or logic languages such as

Prolog [Nadathur and

Miller, 1988]. The underlying idea of both applications is to extend unification to the

synthesis of

-terms. Substitution is then augmented by

-reduction. Carrying this po-

tential over to



Zwould require to synthesize schemas during unification, which needs a

generalization of higher-order unification that deals only with functions.

Unification over (finite) sets extends unification on free term algebras to the nonfree

algebra of sets, and amounts, technically, to unification modulo an associative, commu-

tative and idempotent equational theory [Siekmann, 1989]. Algorithms and complex-

ity investigations are found, e.g., in [Spratt, 1996; Arenas-Sanchez and Dovier, 1997;

Stolzenburg, 1999]. Adapting these techniques to



Zwould allow unification on the ex-

tensional part of sets. For example, let S



~äj









. If we are going to bind

the variable zto an extensional set, then, with set unification, we expect that xand yare

assigned all possible partitions of z. Because of idempotency of set union, these partitions

are not necessarily disjoint, which makes up the computational complexity of set unifi-

cation. Leaving aside efficiency considerations, for the framework of



Zan integrated

unification method is ultimately required, which combines higher-order unification with

set-unification, yielding an algorithm that can deal with extensional as well as intensional

set representations. More research needs to be done in this direction.

96 Computation in



The major computation potential subset logic programming adds to the mentioned

techniques is extending fixed-point construction from equationsto subset relations [Moon,

1997; Jayaraman and Moon, 1999]. This technique allows us to compute the smallest

set satisfying cyclic subset constraints. The computation of such fixed-points is quite

complex, in the general case. Moon [1997] uses memoization techniques for relations

to make it feasible. His model is first-order, and further investigation of its possible

integration in the higher-order framework of



Zis required.

Chapter 5

Abstract Machine

Mit anderen kann man sich belehren,

Begeistert wird man nur allein.

(Goethe)

This chapter describes an abstract machine, called ZAM, a compilation of



Zto the ma-

chine’s instruction set and an implementation of the machine in C++. The ZAM realizes

the reference computation model (RCM) defined in Section 4.4 (on page 86) in the previ-

ous chapter. Its specification in Z is on a level of abstraction that addresses the issues of

efficient implementation but is still sufficiently concise to reason about the machine.

The ZAM uses concurrently executing threads for constraint resolution (modeling the

relation

§¦

of the RCM). Each thread has a local value stack for expression reduction

(modeling

Ó

ª¤

). In the current implementation ofthe ZAMin C++, the stack is in fact

mapped to registers (which is possible because its size is bound), but for the machine’s

specification the stack model is better suited since it is more abstract.

The ZAM’s threads synchronize via variables: accessing an unbound variable sus-

pends a thread and binding a variable resumes threads waiting for it (called residuation,

[Ait-Kaci et al., 1987]). Backtracking is implemented by maintaining a choice point stack

which is shared by the threads working on a resolution goal. Choice points are created if

a thread executes a constraint p

l



: in this case, the thread continues its execution

with p

and creates a choice point for the alternative p



. A simple but powerful

optimization of the ZAM is to lower the scheduling priority of threads that try to create

choice points, giving preference to threads that can continue deterministically under the

current variable binding (called the “Andorra Principle”, [Warren, 1990]).

The compilation of



Zto ZAM instructions is based on the normalization of expres-

sions introduced in Chapter 4. Together with the stack-based machine model, this makes

it easy to formulate and retrace.

This chapter is organized as follows. First, we introduce the model of the ZAM,

defining its basic data types and instruction set. It follows a comprehensive specification

of the meaning of the instructions in Z’s sequential specification style. Next, we give

the compilation schema. Finally, we discuss the prototypical implementation of the ZAM

in C++, including optimizations such as implementing the reduction stack by registers.

Some first benchmarks of the implementation are given. The chapter concludes with a

discussion of the results and of related work.

98 Abstract Machine

VarInx



Value

 

Var



VarInx





Term



CONS





Value





Set



_?@

Value



@

Intension



Figure 5.1: ZAM Values

5.1 Basic Model of the ZAM

The basic model of the ZAM is described in this section. After defining the represen-

tation of values,goals and threads, the instructions are discussed informally. A formal

specification of the machine’s instructions is given in the next section.

5.1.1 Values

Figure 5.1 shows the basic types used by the ZAM to represent values. There are three

variants of values: variables (represented by an index in a variable table, as described

later on), terms and sets1.

A set value, Set

extens

intens

, consists of two sequences, the first containing the

extensional part of a set and the second the intensionalpart (the type Intension is described

in the next section – it corresponds to an intension of the RCM). Let vjbe the values in

extens and ikbe the intensions in intens; then a set value of the ZAM represents the set

value of the RCM

l^H nSndnÇ9~

H

lk nSndnÇ

The ZAM’s representation is well suited for implementing the union and intersection of

sets:

For set union, we simply unite the extensional and intensional parts:

Set

extens

lUm

intens



Set

extens

Sm

intens



o 

Set

extens

¿

extens

Sm

intens

9¿

intens



Here,

denotes a suitable union function which conforms to extensional equality

in the RCM (cf. Section 4.1 (on page 72)). In an implementation, we would in fact

use ordered sequences for representing extens and intens, and their union could thus

be implemented with linear time complexity.

Set intersection is realized as follows (using



Z’s notation for intensions):

Set

extens

lUm

intens



Set

extens

Um

intens



o 

Set

extens

À

extens

m

A?



Set

extens

lUm

0ADC

Set

fA~C

intens





m





Set

extens

Sm

0ADC

Set

fA~C

intens



m





Set

ADC

intens

Set

fADC

intens



yC

1In fact, there is also a fourth variant for representing native values such as numbers; this, however, is

not relevant to the conceptual model of the ZAM.

5.1 Basic Model of the ZAM 99

Code



Á@

Instruction

Intension

varcnt



-

pat



Value

env



?@

GoalInx



VarInx

params





Value

constrs





Code

Figure 5.2: ZAM Intensions

Thus we can construct the disjunctive normal form of intersected set values in a

constant number of steps. If any of the extensior intensiis empty, the above con-

struction can be further optimized, since in this case the corresponding constructed

intension is known to denote the empty set and can be dropped. In particular, if

both sets are completely extensional, we need to compute only the intersection

which – in an implementation by ordered sequences – can be done in linear time.

5.1.2 Intensions

An intension (Figure 5.2) is described by the count of variables it allocates for its exe-

cution, a value (representing the pattern in an



Zintension







), a sequence of index

references representing the environment the intension was constructed in, a sequence of

values describing parameters and a sequence of constraints. A constraint is given by a

sequence of instructions.

Instantiation. When an intension is instantiated in a goal context (goals are discussed

in the next section), the number of variables it requires are allocated in a block of the

variable table of the goal context (variable tables are just a mapping from variable indices

to information about the state of the variable). Then, for each constraint of the intension,

a thread is spawned, parameterized by the offset of the allocated variable block. From the

executing thread, variables are accessed by adding a static offset found in the code to this

dynamic offset.

Consider, for example, the



Zintension



=ikj



5

l

Å}Ä



S

W

. Here, zis a

“local existential” variable of the intension. This intension is represented in the ZAM as

varcnt



»m

pat



Term

j:i m

A

Var

q»m

Var

C

env



nSnSn(m

params

}

nSnSnWm

constrs



A

l

q»mÇöÅÈm

f

ömS®dÅ

ÃCfÄ

where the offsets used for addressing the variables are

for x,

for y, and

for z. The

reentrant instantiation of an intension is realized as follows: let varshift be the base offset

of the intension’s variable block in the instantiation context; then, we simply

“shift” the pattern pat, adding to each variable the offset varshift (resulting in

Term

j:i m

A

Var

varshift

Var

varshift

Ë®

)

100 Abstract Machine

store varshift in the threads that execute the constraints



l

and



S

, such that, for each

access to a variable from a thread, varshift is added to the static offset

Environment. The environment of an intension, env, is a sequence of indices which

refer to pairs of goal indices (as discussed in the next section) and a variable block offset.

The environment allows access to the variables of the lexical enclosing goal contexts this

intension was created in.

For example, in the



Zexpression

é%ê

ë~äj





Å:

, the variable fis bound

by the enclosing fixed-point operator. fis accessed from the intension created for the

inner schema via the first element of env.

For schemas, the RCM demands that before an according intension can be con-

structed, all context variables are bound (cf. expression reduction rule

¬e

). However,

the situation is different for fixed-points (cf. Rule

¬p

): during the creation of an intension

for the fixed-point, the environment may contain “holes” which are bind immediately

after the intension has been created, constructing a circular dependency between the in-

tension and its environment. For this reason, it is not possible to directly represent the

environment by the bindings of the free variables. An extra level of indirection has to be

introduced using goal indices.

Parameters. The parameters of an intension, params, are used to pass values from the

creation context of an intension. In fact, parameters serve a purpose similar to that of

the environment; however, they can be used to pass temporary values which do not have

a variable counterpart in the context. On the other hand, parameters cannot have cyclic

dependencies to the intension they belong to.

5.1.3 Threads and Goals

Athread executes the code of a constraint. A goal is a collection of threads which to-

gether work on a set of variables and which share a stack of choice points (Figure 5.3 (on

the facing page)). A goal is created to execute an intension. During execution, further

intension may be dynamically instantiated in the context of a goal (corresponding to the

membership test of an element in an intension; cf. resolution rule

2®D®

of the RCM.)

For threads and goals, the index types ThreadInx andGoalInx are defined, which allow

references of them via thread and goal tables defined in a configuration of the ZAM (see

next section)2.

A thread refers to two goals: the parent where it belongs to and a possible child,

describing a subresolution. In turn, a subgoal refers via its parent field to the thread that

executes it. Subgoal resolution corresponds to the execution of the



-operator (Rules

G®

s

H®

0

of the RCM) and the test for emptiness (Rules

2®

s

2®

A thread may be in the status running, indicating that it has not yet finished its ex-

ecution, or it may be in one of the states success,failure or error, indicating that it has

stopped executing. The error status results from the execution of the



-operator on a

2Instead of using indices here, we could have well used a notion of memory and references. This would

complicate the declarative description of the ZAM in Z, however.

5.1 Basic Model of the ZAM 101

ThreadInx



H-

GoalInx



Priority



~q»mS®

Status

 

running



failure



success



error

Thread

parent

child



GoalInx

prio



Priority



status



Status

env



?@

GoalInx



VarInx



params



?@

Value

stack



?@

Value

varshift



VarInx



code



Code

Goal

parentThread



ThreadInx

pat



Value

threads





ThreadInx

choices





Choice

vtab



VarInx

Binding

 

Free



+

ThreadInx





Bound



Value



Choice

bound





VarInx

dump



ThreadInx

?@

Value



Code

spawned





ThreadInx

Figure 5.3: ZAM Threads and Goals

nonsingleton set, or from the undecidable equality test of intensions in unification. failure

corresponds to failure in unification.

A thread inherits the environment of the intension whose constraint it executes as well

as the parameters (fields env and params).

A goal holds the set of threads spawned in its realm. The resolution represented by the

goal has succeeded if all its threads are in status success; it has failed if one of its threads

is in status failure.

A goal’s field vtab represents the variable table as a mapping from variable indices to

a value of type Binding. If a variable is free, a set of threads may be attached which are

waiting for the variable to become bound (variant Free of type Binding).

The field choices of a goal represents the choice stack. During execution of a thread,

whenever the thread encounters a constraint such as p

lk



, it continues execution

with p

and pushes a new choice to the stack, holding the information for the alter-

native p



and accumulating changes made to the goal which need to be undone on

backtracking:

bound indicates which variables were bound while this choice was active (the active

choice is the topmost one on the choice stack). Whenever a variable is bound during

unification, it is added to this set. On backtrack, these bindings are undone.

dump saves the (earliest) state of threads which have advanced during this choice

was active. Whenever a thread performs an execution step and has not yet dumped

102 Abstract Machine

Config

ttab



ThreadInx

Thread

gtab



GoalInx

Goal

root



GoalInx

active





ThreadInx

Figure 5.4: ZAM Configurations

its state to the active choice, it now does so. The dump of a thread is described by

its stack and by its current constraint (instruction stream). On backtracking, these

values are restored3.

Note that the mapping dump also contains the information to resume the thread

that initiated this goal with the next alternative, p



. For more details, see the

specification (Section 5.2 (on page 105)).

spawned describes the threads that have been spawned while this choice was active.

On backtracking, these threads are killed.

The ZAM uses a simple but efficient strategy for thread dumping: whenever a thread

executes its next instruction but has not yet dumped its state to the topmost choice, it now

does so. This is made possible by choice point scheduling. Only if no other threads can

continue (because all other threads are waiting for variables to become bound) is a thread

creating a choice point scheduled for execution. Thus, computations that are independent

of a particular choice are performed automatically before the corresponding choice point

is created.

5.1.4 Configurations

Aconfiguration describes the overall state of the ZAM (Figure 5.4). It contains tables of

threads and goals, the root goal and a sequence of thread indices describing the active

threads in the order they are to be scheduled (a formal specification of scheduling is given

in Section 5.2 (on page 105)).

In the specification of the ZAM, we assume that an unlimited number of threads and

goals can be allocated from the tables in a configuration. Similarly, we allocate from the

variable table of goals without trying to retrieve “unused” variables. In the implemen-

tation in C++, garbage-collection techniques are used to manage this requirement (cf.

Section 5.4 (on page 129)).

5.1.5 Instructions

The instructions of the ZAM are summarized in Figure 5.5 (on the facing page). Their

meaning is informally described below.

3Dumping the entire stack is indeed expansive. In the implementation in C++, instead of stacks for

storing temporaries, we use a “single assignment” register model, which makes dumping very efficient.

This is discussed in Section 5.4 (on page 129).

5.1 Basic Model of the ZAM 103

Instruction

 

LOAD



VarInx





WAIT



VarInx





LOADENV





VarInx





LOADPAR







STORE



VarInx





UNIFY



MEMBER



MKTERM



CONS



E-

¾



MKEMPTY



MKSINGLE



MKVAR



VarInx





MKINTEN



É

E-

7

¼?@

Code





UNION



ISECT



TEST



TESTNATIVE



SUCCESS

Figure 5.5: ZAM Instructions

Waiting for Variables. TheWAIT

inx

instruction lets the executingthread suspend un-

til the addressed variable (and all the variables it indirectly refers to by a possible binding)

are bound.

Loading Values. The LOAD

inx

instruction loads the variable with the given index

from the current goal’s variable table to the executing thread’s stack. If the variable is

bound (maps to the variant Boundval in the variable table), the bound value is pushed,

otherwise the term Varinx.

The LOADENV

dist

inx

instruction loads the value of the environment variable inx

to the stack, from the goal described by the lexical nesting distance dist. The environment

variable is accessed as follows: let thr be the executing thread and

goalInx

varshift

oH

thr

envdist the environment entry; then, the variable

config

goalsgoalInx

vtab

varshift

inx

is selected. The addressed variable is expected to be bound, and its binding must not

refer to other free variables, which has to be guaranteed by the compilation scheme (thus

corresponding to the RCM, which ensures that intensions can only be created if context

variables are bound or become immediately bound after fixed-point construction).

TheLOADPAR

inx

instruction loads the parameter value indexedby inx (thr

paramsinx).

Storing and Unifying Values. The STORE

inx

instruction pops a value from the stack

and unifies it with the variable addressed by inx. The UNIFY instruction pops two values

from the stack and unifies them. If unification fails, the thread stops in status failure;

if unification is not decidable (for instance, because it requires the comparison of inten-

sions), it stops in status error.

104 Abstract Machine

Checking for Membership. The MEMBER instruction checks whether the value found

beneath the top of the stack is a member of the set on top of the stack. This instruction is

the only one that creates choice points.

Before execution of this instruction continues, the priority of the thread is lowered and

the thread suspends (if the priority is not already low). Then a choice point is created and

the thread proceeds as follows. Let Set

elems

intens

be the set on top of the stack, and u

the element beneath the top, for which membership is tested:



elems





intens



, the executing thread stops in status failure, after remov-

ing its choice point from the stack.

Otherwise, if elems







rest, then the top of the stack is replaced by v, and the

alternative Set

rest

intens

is recorded in the choice. Then. the thread behaves as

with the UNIFY instruction (the stack consists of v









nSnSn

at this time).

Otherwise, let



elems



and intens



inten





rest. The alternative Set

fADC

rest

recorded in the choice and the intension inten is instantiated:

1. A block of inten

varcnt variables is allocated in the variable table of the goal.

Let the start index of this variable block be called varshift in the sequel.

2. The pattern inten

pat is “shifted” by adding to each variable the offset varshift

and pushed to the stack.

3. For each constraint in inten

constrs. a thread is spawned and initialized with

varshift. If the choice stack is not empty, then the new thread is added to the

set choice

spawned.

4. Thethread behaves as with theUNIFY instruction (the stack consists ofshift

inten

pat

varshift





at this time).

Constructing Values. The MKTERM

ji m

arity

instruction pops arity number of argu-

ments from the stack and pushes the value Term

ji m

args

to the stack. The MKVAR

inx

instruction pushes the value Varinx to the stack.

TheMKEMPTY instruction pushes the empty set, Set

ADC

A~C

, to the stack. The MKSINGLE

instruction pops the value vfrom the stack and pushes a singleton set, Set

jz~

¶m

ADC

The MKINTEN

parcnt

varcnt

constrs

instruction creates a new intension inten and

pushes the set Set

fADC

UmU~

inten



to the stack. The instruction expects parcnt parameters

on the stack, which are stored in the params field. Above the parameters, on top of the

stack, the pattern of the intension is expected, which is stored in the pat field. The created

intension inherits its environment from the executing thread: let thr be this thread, then

inten

env



thr

parent

thr

varshift





thr

env. The constraints of the intension are given

by the sequence constrs.

Unionand Intersection. The UNION instruction pops two set values from the stack and

pushes their union, constructed as discussed in Section 5.1.1 (on page 98). The ISECT

instructions pops two set values and pushes their intersection.

5.2 Specification 105

Unique Selection. The MU instruction pops a value u



Set

elems

intens

from the

stack and behaves as follows:



elems





intens



, or if



elems

>®

, then the thread stops in status error.

If elems





and



intens



, then the thread pushes the value v.

Otherwise, a reference value is calculated: if elems





, the reference value

is v, otherwise it is the result of the first successful subresolution of one of the

intensions. Then, for all backtracks over all subresolutions of all intensions in the

set, it is checked whether they yield the same value as the reference value. If this

is the case, the reference value is pushed to the stack; otherwise, the thread stops in

status error.

For more details, the reader is referred to the specification of the ZAM’s instructions in

the next section.

Test for Emptiness/Nonemptiness. The TEST instruction pops a set value from the

stack and checks if it is nonempty. If so, it continues; otherwise, it stops in status failure.

The TESTNATIVE instruction fails if the value on the stack is empty.

The test of nonemptiness or emptiness immediately succeeds or fails if the set on the

stack has a nonempty extensional part. Otherwise, intensions in the set are tried to resolve

in a subresolution.

Succeeding. The SUCCESS instruction lets the executing thread stop in status Succeed.

5.2 Specification

In this section, the instructions of the ZAM are comprehensively specified in Z’s sequen-

tial specification style (cf. e.g. Woodcock and Davies [1996]). The specification is in-

tended to be executable by the ZAM itself.

5.2.1 Axiomatic Definitions

A set of axiomatic definitions for working with the data types of the ZAM is defined in

this section.

Working with Values

The auxiliary function shift offsv yields a value where offs is added to each variable’s in-

dex (Figure 5.6 (on the following page)). Since, by construction, variables cannot appear

in sets, the definition needs only to recurse over term values.

The auxiliary function freezevtabv returns a value where all variables in vwhich are

bound in vtab are replaced by their actual binding (Figure 5.7 (on the next page)).

The auxiliary function freev returns the set of indices of variables found in the value

v(Figure 5.8 (on the following page)). Variables are only expected to occur in terms, not

in sets.

106 Abstract Machine

shift



VarInx



Value



Value

shiftoffs

Vari

o ©

Var

offs

 j

Term

ji m

oÚo%©

Term

ji m

shiftoffs



ÍÏÎ

¸'Ð¸

offs



VarInx





Value





@

Value

i



CONS





VarInx

Figure 5.6: Shifting Values

freeze



VarInx

Binding



Value



Value

freezevt

Vari

o ©

ùOú

0Ç





Value



×9¼'½{

vti



Boundv

¸¶ü

freezevtv

¸¶·:ýd¸

Vari

 j

Term

ji m

oÚoW©

Term

j:i m

freezevt



ÍÏÎ

¸'Ð¸



VarInx

Binding





Value







Value



CONS





VarInx

Figure 5.7: Freezing Values

Equality and Unification

The (partial) auxiliaryfunction equal

lUm



determines equalityof two values (Figure 5.9

(on the next page)). Confirming to extensional equality in



Z’s RCM, the function is

undefined if the values cannot be compared, which happens if sets are tried to compare

which have a nonempty intensional part.

The (partial) auxiliary function unify

tries to unify the given values uand v,

under the variable table vt (Figure 5.10 (on page 108)). On success, the function returns a

new variable table and the set of indices of variables which have been bound. The defini-

tion is based on a function uni whose type is appropriate for sequence catamorphisms, and

a function uniVar which unifies a variable with a value. The function may be undefined

for similar reasons as the equal function (since it uses equal to determinate equality of

sets). The function uniVar makes an occurrence test before it binds a variable.

Merging and Joining Value Sequences

The auxiliary function vs

us constructs the union of two value sequences, removing any

duplicates according to equal. It is undefined if equality is undefined (Figure 5.11). In

vars



Value





VarInx

vars Vari





Term

j:i m

o¾©





varsv





ÍÏÎ

¸'Ð¸



VarInx





Value





v@

Value

i



CONS

Figure 5.8: Getting the Variables of a Value

5.2 Specification 107

equal



Value



Value

ó è

equal

Var i

Vari

o ©

sÇÆÈy

 j

Term

ji m

Term

j:i m



o¢o%© Ã









d



¼'½{

equal



 j

Set

lUm

A~C

Set

Um

ADC

o¢o © Ã









d





equal

 j

Set

lUm

Set

dm



o¢o ©



 j

o © ²:´¶µ

w¢y

ÍÏÎ

¸'Ð¸



VarInx

Gi



CONS



ldm





?@

Value

ldm







Value



lUm





@

Intension





Value

Figure 5.9: Equality

order to remove undeterminism in the ordering of the result, we assume a (unspecified)

predicate someorder on value sequences. Similarly, the function vs

us constructs the

intersection of two value sequences.

Adding Information to Choices

The auxiliary function addBound adds information to the bound field of a choice, the

function addDump to the dump field and the function addSpawned to the spawned field

(Figure 5.12).

Ordering Index Sets

Sometimes, we require to order a finite set of indices (for variables, threads and goals).

The function orderinx implements some (fixed but arbitrary) order on index sets (Fig-

ure 5.13 (on page 109)), yielding a sequence containing the order.

5.2.2 Auxiliary Instructions

For representing internal states of threads auxiliary instructions are used. These can be

considered as a kind of “micro programs” implementing other instructions. For example,

let MEMBER





rest be the instruction stream of a thread. For executing the MEMBER

instruction, a choice point needs to be created, and the thread initiating the choice needs

to dump a state thatcontinues with an alternativeto the choice. We realize this bydumping

TRYNEXT

pat

extens

intens





rest

as the instruction state of the thread, where TRYNEXT is an auxiliary instruction executing

the next alternative (pat

Set

extens

intens

oÚo

. A similar technique is used for specifying

subresolutions of the MU,TEST, and TESTNATIVE instructions.

The auxiliary instructions are declared in Figure 5.14 (on page 109), and will be ex-

plained as they are used.

108 Abstract Machine

UniRes

v



VarInx

Binding

oA

%

VarInx





Fail

unify



VarInx

Binding

o

Value



Value

UniRes

uni



Value



Value

oA

UniRes

uniVar



VarInx



Value



UniRes

unify





VarInx

Binding





Value

uni

j¢j

$oÚo

uni

j¢j

Fail

o ©

Fail

Dj¢j

Vari

res

o ©

res

Dj¢j

Vari

res

o ©

uniVar

res

Dj¢j

Vari

res

o ©

uniVar

res

Dj¢j

Term

ji m

Term

j:i m

o¢o

res

o¾©

ùOú

!





¸äü

uni

res

¯È4É

t °Öj

¸¶·ýU¸

Fail

Dj¢j

o¢o ©

ùOú

equal

freezevtu

freezevtv

¸äü

¸¶·ýU¸

Fail

ÍÏÎ

¸'Ð¸



Value







Value





VarInx

i



CONS

res



UniRes





VarInx

Binding







VarInx

uniVar

oÚo(©

ù:ú

Ç





Value



vti



Bound u

¸¶ü

uni

jÚj

oÚo

¸ä·:ýd¸

ùOú

vars

freezevtv

¸äü

ÒÓ

Boundv

¶m

V~



¸¶·ýU¸

Fail

ÍÏÎ

¸'Ð¸



VarInx





Value





VarInx

Binding







VarInx

Figure 5.10: Unification



@

Value



¼

Value

?@

Value

someorder



2j



Value

oW©

j





Value







Value

Þ



@Æ

equal

¶

Ì4Þ



¼'½»{





equal

wsi

wsj



someorderws

ÍÏÎ

¸'Ð¸



?@

Value

oW©

j





Value







Value

Þ







equal

¶

Ì4Þ



¼'½»{





equal

wsi

wsj



someorderws

ÍÏÎ

¸'Ð¸



?@

Value

Figure 5.11: Merging and Joining Value Sequences

5.2 Specification 109

addBound





VarInx



Choice



Choice

addDump



ThreadInx





Value



Code



Choice



Choice

addSpawned





ThreadInx



Choice



Choice

addBound

o¾©



LÊ

bound

Choice





Choice





bound



bound





Choice

ÍÏÎ

¸'Ð¸





VarInx





Choice

addDump

stack

code

o(©



LÊ

dump

Choice





Choice





dump



dump

ÒÓ j

stack

code





Choice

ÍÏÎ

¸'Ð¸



ThreadInx



stack



@

Value



code



Code





Choice

addSpawned

o¾©



LÊ

spawned

Choice





Choice





spawned



spawned





Choice

ÍÏÎ

¸'Ð¸





ThreadInx





Choice

Figure 5.12: Adding Information to Choices

orderinx



-



?@Ë-

uäj

orderinxs

oñ

ÍÏÎ

¸'Ð¸



I-

Figure 5.13: Ordering Index Sets

5.2.3 Execution Step and Selection

The state overwhich instructions are specified, and a single executionstep of the ZAM, are

defined in Figure 5.15 (on the next page). The state, called a selection, is an enrichment of

the ZAM’s configuration, Config, by the inclusion of those thread’s and goal’s data fields

which are selected for execution. The enrichment allows a direct access to these fields in

the specification, supporting Z’s sequential specification style.

The execution step of the ZAM chooses a thread to execute based on the order given

by the active sequence of a configuration and of thread priorities. It initializes the data

fields from the thread and goal tables, dispatches the instruction (dispatching is defined

later on), and stores the data fields contained in the selection back to the configuration.

In Figure 5.15 (on the following page), the signature of the operation Schedule is

Instruction

 

ndnSn



TRYNEXT



Value



¼?@

Value



@

Intension







SearchState



SearchMode



?@

Intension



SearchState

v

Select



Resume



Backtrack

SearchMode

v

MuFirst



MuNext



Value





Test



TestN

Figure 5.14: ZAM Auxiliary Instructions

110 Abstract Machine

Selection

Config



Goal



Thread



this



ThreadInx



instr



Instruction

Step



Schedule

Dispatch

Restore

Schedule

active

Config



Selection



-

... . .. . . . . . .



active

òq



min



¼'½»{

active





¼'½»{

active



ttab

activek

o¢o

prio

ttab

activej

o¢o

prio



this



activei

active





¼'½{

active







Î

active



Thread



ttabthis



Goal



gtabparent

Restore

ttab

gtab

Selection

ttab



ttab

this

ÒÓ j

~

Thread



gtab



gtab

parent

ÒÓ j

~

Goal



Figure 5.15: ZAM Execution Step

Config



Selection

, the signature of the operation Dispatch is

Selection

, and the

signature of the operation Restore is

Config



Selection

. Thus the signature of the

overall Step is

Config

: one configuration is mapped into the next one, with the help of

the intermediate data in the selection.

Dispatching is defined in Figure 5.16 (on the next page). The basic dispatch operation

is enclosed by two operations, PreDispatch and PostDispatch, which perform actions

common to the execution of every instruction:

PreDispatch dumps the executing thread’s state (stack and instruction stream) to

the active choice, if one exists and if a dump to this choice has not already been

performed (cf. discussion of dumping in Section 5.1.3 (on page 100)). It then loads

the next instruction into the instr field of the operation, and advances the instruction

stream, code.

PostDispatch checks whether the threads belonging to the current goal have fin-

ished. If this is the case, and the goal has a parent thread, this parent is resumed.

5.2.4 Instructions

The meaning of the instructions of the ZAM is specified in the sequel. The reader is

referred to Section 5.1.5 (on page 102) for an informal description of the instructions.

5.2 Specification 111

Dispatch



PreDispatch

BasicDispatch

PostDispatch

PreDispatch

choices

code

Selection

ùOú



choices

òqÄ

this

×9¼'½»{$j

head choices

dump

¸äü

choices



addDump

this

stack

code

headchoices





tailchoices

¸¶·ýU¸

choices



choices

instr

code

oñ

head code

tailcode

BasicDispatch

}

XWait

XLoad

XLoadEnv

XStore

XUnify

XMember

XTryNext

XMkInten

XMkTerm

XMkEmpty

XMkSingle

XUnion

XIsect

XMu

XTest

XTestN

XSearch

XSuccess

PostDispatch

active

Selection

ùOú

status



running



¼'½»{

threads





this

ttabt

status



running

gtabparent

parentThread



¸äü

active



active



j

gtabparent

parentThread

¸¶·ýU¸

active



active

Figure 5.16: ZAM Dispatching

XLoad

active

stack

Selection

inx



VarInx



val



Value

... . .. . . . . . .

instr



LOAD

inx

val



ù:ú

Ç





Value



vtab

varshift

inx

oñ

Bound v

¸¶ü

¸ä·:ýd¸

Var

varshift

inx

stack



val





stack

active



active





this

Figure 5.17: The LOAD Instruction

The LOAD Instruction

The LOAD

inx

(Figure 5.17) instruction loads the variable, addressed by varshift

inx,

or its binding, to the stack (recall that varshift is the offset added for the variable instances

of the executing thread).

112 Abstract Machine

XWait

active

vtab

code

Selection

inx

cand



VarInx



cands





VarInx

... . .. . . . . . .

instr



WAIT

inx

cands



ùOú





Value



vtab

varshift

inx

o 

Boundv

¸¶ü

vars

freezevtabv

¸¶·:ýd¸

varshift

inx



ùOú

cands



¸äü

cand

cands

vtab



vtab

cand

ÒÓ

Free

j¢j

Free

vtabcand

V~

this



code



WAIT

inx





code

active



active

¸¶·ýU¸

active



active





this

Figure 5.18: The WAIT Instruction

XLoadEnv

active

stack

Selection

dist



-

ld

ginx



GoalInx



inx

vshift



VarInx



val



Value

... . .. . . . . . .

instr



LOADENV

dist

inx

ginx

vshift

oç

envdist

stack



freeze

gtabginx

vtab

j¢j

Bound

jÚj

gtabginx

vtab

vshift

inx

o¢oÚo





stack

active



active





this

Figure 5.19: The LOADENV Instruction

The WAIT Instruction

The WAIT

inx

instruction (Figure 5.18) calculates the set of free variables, the index

varhshift

inx refers to directly or indirectly. If this set is nonempty, it takes an arbitrary

one of it, adds the running thread to the waitingset of this variable, and prepares to execute

WAIT again when the thread is resumed. Otherwise, the thread just continues.

The LOADENV Instruction

The LOADENV

dist

inx

instruction loads the variable inx from the goal of the lexical

distance dist to the stack. The environment variable is expected to be bound, as well as

all variables its binding refers to. The loaded value is frozen using the variable table of

the goal context. Freezing is essential since the variable indices appearing in the binding

of the environment variable are local to the environment’s goal, and have no meaning in

the current context.

5.2 Specification 113

XUnify

stack

status

active

choices

Selection





VarInx

... . .. . . . . . .

instr



UNIFY



stack



tail

tailstack

ùOú

Ìïj

vtab

stack

öäm

stack

×9¼'½»{

unify

¸äü

status



error



active



active

¸¶·ýU¸

ùOú

unify

vtab

stack

öm

stack

o 

Fail

¸¶ü

status



failure



active



active

¸¶·:ýd¸

vtab

o 

unify

vtab

stack

öm

stack

status



status

active



active

orderinx





Free

vtabi



oÑ



this

ùOú

K

choices

ïq

¸¶ü

choices



addBound

headchoices





tailchoices

¸¶·:ýd¸

choices



choices

Figure 5.20: The UNIFY Instruction

XStore

code

active

Selection

inx



VarInx

... . .. . . . . . .

instr



STORE

inx

code



LOAD

inx





UNIFY





code

active



active





this

Figure 5.21: The STORE Instruction

The UNIFY Instruction

The UNIFY instruction (Figure 5.20) pops two values from the stack and unifies them. If

unification fails, the executing thread fails; if unification is undefined, the thread switches

to the error status. Otherwise, the variable table is updated, threads which are waiting

for the bound variables are resumed and – if the choice stack is not empty – the variables

bound by the unification are recorded in the active choice.

The STORE Instruction

TheSTORE

inx

instruction (Figure 5.21) isexplained by thecode substitution LOAD

inx



UNIFY.

The MEMBER Instruction

The MEMBER instruction (Figure 5.22 (on the following page)) is implemented by the

auxiliary instruction TRYNEXT. On execution of MEMBER, first the priority of the run-

ning thread is lowered, if it is not already low. Otherwise, a choice point is created, and

the instruction TRYNEXT is scheduled for execution.

114 Abstract Machine

XMember

prio

code

stack

choices

active

Selection



@

Value





?@

Intension

... . .. . . . . . .

instr



MEMBER

ùOú

prio

òq

¸äü

prio



code



MEMBER





code

stack



stack



choices



choices

active



active





this

¸¶·ýU¸

Set

oç

head stack

choices



bound



dump



spawned



(



choices

code



TRYNEXT

head

tailstack





code

stack



tail

tailstack

prio



active



active





this

Figure 5.22: The MEMBER Instruction

XTryNext



XTryNextEmpty

XTryNextExten

XTryNextInten

XTryNextEmpty

choices

status

Selection



Value



... . .. . . . . . .

instr



TRYNEXT

ADC

choices



tailchoices

status



failure

Figure 5.23: The TRYNEXT Auxiliary Instruction: The Empty Case

The instruction TRYNEXT

is specified by three cases, depending on the value

of es and is. If both es and is are empty, the executing thread fails after removing the

corresponding choice from the choice stack (Figure 5.23).

If the extensional part of TRYNEXT

is not empty, one value is extracted, a

continuation with the remaining values is dumped to the choice, and pis unified with the

extracted value (Figure 5.24 (on the facing page)).

If the extensional part of TRYNEXT

is empty, but the intensional part is not

empty, one intension iis extracted and instantiated in the context of the current goal

(Figure 5.25 (on the next page)). The variable table is extended by i

varcnt free variables.

For each constraint in i

constrs, a thread is spawn and initialized with the offset of the

allocated variables and the intension’s stored environment and parameters. The spawned

threads are added to theactivechoice, and the alternativefor this choice is dumped, similar

as in the extensional case. The thread continues unifying pwith the shifted intension’s

pattern, shiftvshift inten

pat.

5.2 Specification 115

XTryNextExten

code

stack

choices

active

Selection



Value





@

Value





?@

Intension

... . .. . . . . . .

instr



TRYNEXT

choices



addDump

this

stack

TRYNEXT

tailes





code

headchoices





tailchoices

stack



head es









stack

code



UNIFY





code

active



active





this

Figure 5.24: The TRYNEXT Auxiliary Instruction: The Extensional Case

XTryNextInten

code

stack

choices

active

vtab

ttab

threads

Selection



Value





Intension





v@

Intension



spawned





ThreadInx

... . .. . . . . . .

instr



TRYNEXT

ADC





vtab



vtab

V~





vtab

Ë®ÕnSn



vtab

varcnt

ÒÓ

Free



ttab



ttab

@~





ttab

Ë®ØnSn



ttab



constrs

ÒÓ

parent



parent

child



q»m

prio



®Dm

status



running

env



env

params



params

stack

}

ADC

varshift

}



vtab

Ë®Dm

code



constrs



ttab

^

spawned



¼'½»{

ttab

æÕ¼'½»{

ttab

choices



addDump

this

stack

TRYNEXT

ADC





code

addSpawned

spawned

headchoices

oÚo





tailchoices

threads



threads



spawned

stack



shift



vtab

¨®

pat









stack

code



UNIFY





code

active



active

orderinxspawned





this

Figure 5.25: The TRYNEXT Auxiliary Instruction: The Intensional Case

The Make Instructions

The MKEMPTY instruction pushes an empty set to the stack. The MKSINGLE instruc-

tions pops a value from the stack and pushes a singleton set containing this value (Fig-

ure 5.26 (on the following page)).

The MKVAR

inx

instruction (Figure 5.27 (on the next page)) pushes the value Varinx

to the stack. The MKTERM

ji m

instruction pops narguments from the stack and pushes

the value Term

j:i m

args

The MKINTEN

parcnt

varcnt

constrs

instruction (Figure 5.28) creates a new inten-

sion and pushes it to the stack; the environment of the intension is created from the exe-

cuting threads context.

116 Abstract Machine

XMkEmpty

stack

active

Selection

instr



MKEMPTY

stack



Set

ADC

ADC





stack

active



active





this

XMkSingle

stack

active

Selection

instr



MKSINGLE

stack



Set

headstack

ADC





tailstack

active



active





this

Figure 5.26: The MKEMPTY and MKSINGLE Instructions

XMkVar

stack

active

Selection

inx



VarInx

... . .. . . . . . .

instr



MKVAR

inx

stack



Var

inx





stack

active



active





this

XMkTerm

stack

active

Selection



CONS





-

... . .. . . . . . .

instr



MKTERM

ji m

stack



Term

ji mjz®ÕnSn

stack





j¢j

Ë®ÕnSn



stack



active



active





this

Figure 5.27: The MKTERM Instruction

The UNION and ISECT Instruction

The UNION instruction pops two values from the stack and builds the union of them

(Figure 5.29 (on the facing page)). If the union is undefined (because it requires equality

on values which cannot be computed) the thread stops in status error.

The ISECT instruction pops two values from the stack and builds the intersection of

XMkInten

stack

active

Selection

parcnt

varcnt



-



constrs





Code

... . .. . . . . . .

instr



MKINTEN

parcnt

varcnt

constrs

stack



Set

ADC

A

varcnt



varcnt

pat



headstack

params



j¢®ÕnSn

parcnt

tailstack

env

}

parent

varshift





env

constrs



constrs

ÄC





parcnt

Ë®ÕnSn



stack

tailstack

active



active





this

Figure 5.28: The MKINTEN Instruction

5.2 Specification 117

XUnion

active

stack

status

Selection

ldm





@

Value



lUm





?@

Intension

... . .. . . . . . .

instr



UNION

stack



Set

lUm



stack



Set

Sm



ùOú

ldm



×@¼'½»{$j

¸äü

stack



Set

¿

Sm







tail

tailstack

active



active





this



status



status

¸¶·ýU¸

status



error



active



active



stack



tail

tailstack

Figure 5.29: The UNION Instruction

XIsect

active

stack

status

Selection

ldm





@

Value



lUm





?@

Intension

... . .. . . . . . .

instr



ISECT



stack



Set

l^m



stack



Set

Sm



ùOú

ldm



×@¼'½»{$j

¸äü

stack



Set

À

Sm

join

Set

l^m

ADC

Set

ADC



oÚo

join

Set

dm

ADC

Set

ADC

oÚo

join

Set

fADC

Set

fA~C

o¢o





tail

tailstack

active



active





this



status



status

¸¶·ýU¸

status



error



active



active



stack



tail

tailstack

join



Set



Set



?@

Intension

join

Set

lUm

Set

Sm



o¢o¾©

ùOú

!

nÅ



q}Ä





Å





q

¸¶ü

varcnt



®Dm

pat



Var

q»m

env



ADC

params



Set

l^m

Set

dm



constrs



AfA

LOAD

jq

LOADPAR

j¢®

MEMBER

SUCCESS

LOAD

jq

LOADPAR

j=ö

MEMBER

SUCCESS

CfCfÄC

¸¶·:ýd¸

ADC

ÍÏÎ

¸'Ð¸

ldm







Value



lUm





@

Intension

Figure 5.30: The ISECT Instruction

them (Figure 5.30). If the intersection of the extensional parts undefined (because it re-

quires equality on values which cannot be computed) the thread stops in status error. The

specification uses an auxiliary function join

®Dm

that creates an intension, implement-

ing the



Zform





lkÄ



f

(cf. discussion of intersection, Section 5.1.1 (on

page 98)). For realizing the constraints of this intension, we pass the set values v

and v



as intension parameters.

118 Abstract Machine

XSuccess

status

Selection

instr



SUCCESS

status



success

Figure 5.31: The SUCCESS Instruction

XMu

active

stack

status

code

Selection



@

Value





?@

Intension

... . .. . . . . . .

instr



Set





stack



stack

ùOú





¸äü

code



Select

MuFirst





code

status



status



active



active





this

¸¶·ýU¸

ùOú





¸¶ü

code



Select

MuNext

head es





code

status



status



active



active





this

¸¶·:ýd¸

status



error



active



active



code



code

Figure 5.32: The MU Instruction

The SUCCESS Instruction

The SUCCESS instruction stops the running thread and puts into status success (Fig-

ure 5.31).

The MU Instruction

TheMU instruction (Figure 5.32) delegates itsworkto the auxiliary instruction SEARCH

state

mode

that implements subresolution for a sequence of intensions, is.SEARCH is started in

state Select and one of two modes: MuFirst indicates that no candidate value is present

for unique selection, whereas MuNext cand defines a candidate (the details are discussed

later on). The candidate is initialized from a singleton extensional part of the set on top

of the stack. If the extensional part contains more then one element, the MU instruction

immediately switches to status error.

The TEST and TESTNATIVE Instructions

As the MU instruction, the TEST and TESTNATIVE instructions (Figure 5.33 (on the next

page)) are based on the auxiliary instruction SEARCH

state

mode

(defined in the next

section). Both instructions are decided immediately, if the extensional part of the set on

top of the stack is nonempty: the TEST instruction which checks for nonemptiness then

succeeds, whereas the TESTNATIVE instruction fails.

5.2 Specification 119

XTest

active

stack

code

Selection



@

Value





?@

Intension

... . .. . . . . . .

instr



TEST

Set





stack



stack

ùOú





¸äü

code



Select

Test





code

active



active





this

¸¶·ýU¸

code



code



active



active





this

XTestN

active

stack

status

code

Selection



@

Value





?@

Intension

... . .. . . . . . .

instr



TESTNATIVE

Set





stack



stack

ùOú





¸äü

code



Select

TestN





code

status



status



active



active





this

¸¶·ýU¸

status



failure



code



code



active



active

Figure 5.33: The TEST and TESTNATIVE Instructions

The SEARCH Auxiliary Instruction

Theauxiliary instruction SEARCH

state

mode

implements “encapsulated search” and

is used to model the MU and TEST

TESTNATIVE instructions. The state is one of the

following:

Select: select the next intension from is and start a subgoal for its resolution. After

selection, the executing thread suspends until the subgoal’s execution has finished.

Resume: the state in which the current thread continues when it is resumed since

the subgoal’s execution has finished. Actions are performed in dependency on the

mode of the search: the search may be finished, backtracking may be invoked or

the next intension may be tried.

Backtrack: indicates that the current subgoal shall be backtracked

The mode describes which functionality is implemented by the search:

MuFirst: search for a



-value. No candidate is yet known.

MuNext cand: search for a



-value. A candidate is known, which must equal to

each other possible candidate.

Test: test for nonemptiness. The search stops and the initiating thread continues

normally as soon as the first resolution is successful.

120 Abstract Machine

XSearch



XSearchSelectMore

XSearchSelectEndMuFirst

XSearchSelectEndMuNext

XSearchSelectEndTest

XSearchSelectEndTestN

XSearchResumeSuccessMu

XSearchResumeSuccessTest

XSearchResumeSuccessTestN

XSearchResumeFailure

XSearchResumeError

XSearchBacktrack

Figure 5.34: The SEARCH Auxiliary Instruction: Dispatching

XSearchSelectMore

child

ttab

gtab

active

code

Selection

mode



SearchMode





Intension







Intension

... . .. . . . . . .

instr



Select

mode





child





gtab

Ë®

ttab



ttab

@~





ttab

Ë®ØnSn



ttab



constrs

ÒÓ

parent



child



q»m

prio



®Dm

status



running

env



env

params



params

stack

}

ADC

varshift



®Dm

code



constrs



ttab

^

gtab



gtab

É~

child

ÒÓ

parentThread



this

pat



pat

threads



¼'½»{

ttab

æØ¼'½»{

ttab

choices



ADC

vtab

}



®ÕnSn

varcnt

ÒÓ

Free



lÄ

^

code



Resume

mode





code

active



active

orderinx

j¼'½»{

ttab

æØ¼'½»{

ttab

Figure 5.35: The SEARCH Auxiliary Instruction: Select, More Intensions

TestN: test for emptiness. The search stops in status failure as soon as the first

resolution is successful.

The SEARCH

state

mode

instruction is defined by several schemas that discriminate

over state,mode and the sequence of intensions is. The dispatcher is defined in Fig-

ure 5.34.

Selection State. Figure 5.35 defines the selection state for the case that the intension

sequence is nonempty, which is independent of the mode of the search. For the next

intension i, a new subgoal is created and stored in the child field of the executing thread.

The spawned threads are activated and the executing thread suspends, after preparing to

continue in search state Resume.

Figure 5.36 (on the facing page) defines the selection state for the case that the inten-

sion sequence is empty. The action performed depends on the mode: in mode MuFirst,

the initiating MU instruction produces an error (the set MU is executed on is empty). In

mode MuNext cand,cand is the unique result of executing MU. In mode Test, the tested

5.2 Specification 121

XSearchSelectEndMuFirst

status

Selection

instr



Select

MuFirst

ADC

status



error

XSearchSelectEndMuNext

active

stack

Selection

cand



Value

.. .. . .. .. . . .

instr



Select

MuNext cand

ADC

stack



cand





stack

active



active





this

XSearchSelectEndTest

status

Selection

instr



Select

Test

0ADC

status



failure

XSearchSelectEndTestN

active

Selection

instr



Select

TestN

ADC

active



active





this

Figure 5.36: The SEARCH Auxiliary Instruction: Select: No More Intensions

set is empty which lets the executing thread fail; in mode TestN, the test for emptiness has

succeeded and the executing thread just continues.

Resume State. Figure 5.37 (on the next page) defines schemas for the Resume state

in case the subgoal has been successfully resolved. We end here since a thread is auto-

matically resumed by the general dispatch operation PostDispatch (Figure 5.16) when the

threads of its subgoal have finished. The subgoal is successful if all its threads are in status

success. The action depends on the mode of search: in case of Test or TestN, the search

is finished, and the executing thread either continues or stops in status failure. In case of

MuFirst or MuNextcand, the result of the subresolution is frozen and compared with a

possible previous candidate for unique selection. The frozen value must not contain free

variables; if it does so, or if a previous candidate does not equal to the frozen result, or if

equality is undefined (since the compared values contain intensions), the executing thread

stops in status error. Otherwise, the search switches either to status Backtrack if there are

pending choices, or to status Select if there are no more choices, trying the next possible

intension.

Figure 5.38 (on page 123) defines schemas for the Resume state in case the subgoal

has failed or is in status error. In case of a failure, the next alternative is tried, either by

backtracking or by selecting the next intension. An error status is just propagated to the

executing thread.

Backtrack State. Figure 5.39 (on page 123) defines the Backtrack status of a search. It

can be asserted that the choice stack is not empty. In the subgoal, changes as described

by the topmost choice entry are undone: dynamically spawned threads are removed, the

choice itself is pruned and the variables are reset to be free. The threads which have

dumped to the choice are restored. The restored threads are reactivated (they may run or

not on backtrack) and the executing thread suspends after preparing to continue in state

Resume when the backtracked subgoal has finished.

122 Abstract Machine

XSearchResumeSuccessTest

active

Selection





Intension



sgoal



Goal

... . .. . . . . . .

instr



Resume

Test

sgoal



gtabchild





sgoal

threads

ttabt

status



success

active



active





this



XSearchResumeSuccessTestN

status

Selection





Intension



sgoal



Goal

... . .. . . . . . .

instr



Resume

TestN

sgoal



gtabchild





sgoal

threads

ttabt

status



success

status



failure

XSearchResumeSuccessMu

active

code

status

Selection

mode



SearchMode





?@

Intension



sgoal



Goal



cand

res



Value

... . .. . . . . . .

instr



Resume

mode



MuFirst

mode



MuNext cand

sgoal



gtabchild





sgoal

threads

ttabt

status



success

res



freezesgoal

vtabsgoal

pat

ùOú

varsres



mode



MuNext cand

ô j

cand

res

×9¼'½{

equal

cand

res

o¢o

¸äü

code



ùOú

K

sgoal

choices

q

¸¶ü

Backtrack

¸¶·:ýd¸

Select

MuNextres





code

active



active





this



status



status

¸¶·ýU¸

code



code



active



active



status



error

Figure 5.37: The SEARCH Auxiliary Instruction: Resume: Subgoal Success

5.2 Specification 123

XSearchResumeFailure

active

code

Selection

mode



SearchMode





?@

Intension



sgoal



Goal

... . .. . . . . . .

instr



Resume

mode

sgoal



gtabchild

Þ



sgoal

threads

ttabt

status



failure

code



ù:ú

K

sgoal

choices

òq

¸¶ü

Backtrack

¸ä·:ýd¸

Select

mode





code

active



active





this

XSearchResumeError

status

Selection

mode



SearchMode





?@

Intension



sgoal



Goal

... . .. . . . . . .

instr



Resume

mode



sgoal



gtabchild



sgoal

threads

ttabt

status



failure



sgoal

threads

ttabt

status



error

status



error

Figure 5.38: The SEARCH Auxiliary Instruction: Resume: Subgoal Failure

XSearchBacktrack

gtab

ttab

code

active

Selection

mode



SearchMode





?@

Intension



sgoal



Goal



choice



Choice

... . .. . . . . . .

instr



Backtrack

mode



sgoal



gtabchild



sgoal

choices

q»

choice



head sgoal

choices

gtab



gtab

child

ÒÓ

parentThread

}

sgoal

parentThread

pat

}

sgoal

pat

threads



sgoal

threads

choice

spawned

choices



bound



dump



:

spawned



:

(



tailsgoal

choices

vtab



vtab



choice

bound

ÒÓ

Free



lÄ

^

ttab



ttab



¼'½»{

choice

dump

ÒÓ j:

Ê

stack

code

status

Thread



l

Thread



ttabt

stack

code

o2

choice

dumpt

status



running



Thread



code



Resume

mode





code

active





active



sgoal

threads



ËÎ

active

orderinx

j¼'½»{

choice

dump

Figure 5.39: The SEARCH Auxiliary Instruction: Backtracking

124 Abstract Machine

ENV



VAR

VarInx

oG

VAR

çoG

E

VAR

enter



ENV



%

VAR



ENV

index





ENV





VAR

çn®

nest





ENV





VAR

%nvö

mkflex

}



ENV



vars





VAR

[

%n®Dm



%n öm



e

Õ

vars

flex





ENV

e

enter

vars

o¾©

·O¸'¹

order

}ì´

Æí½'ÆM¼'yÆ

vars

[

%n®



¼'½»{

order

order i

ÒÓ

ò®E¶m



VAR

ùOú

vars

¸¶ü

¸ä·:ýd¸

nest

[

Ë®Dm

flex

vars

ÍÏÎ

¸'Ð¸



ENV



vars





VAR

Figure 5.40: Compilation Environment

5.3 Compilation

In this section, the compilation of



Zto ZAM instructions is described. Compilation

works on expressions normalized as described in Section 4.2 (on page 74) in Chapter 4.

A top-level expression, constituting a



Z“program”, may, for example, be given as a

property with free variables, such as x

e, where xdescribes the output of the program.

Compiling x

ewith an initial environment declaring xyields a sequence of code instruc-

tions which can be executed as the constraint of an intension of the ZAM. Backtracking

over the goal for this intension yields the possible bindings for x.

5.3.1 Environment

The compilation is based on an environment holding information about variable names in

the contextof a compiled expression. An environment,

9×

ENV, is atriple

indices

nests

flex

where indices is a mapping from variables to their associated indices, nests a mapping

from variables to their scoping nest and flex a set of variables which are considered as

“flexible”. For flexible variables, in contrast to nonflexible, it is not require to emit a

WAIT instruction before they can be accessed. The specification of environments is given

in Figure 5.40.

The enter

vars

function extends the environment for entering a scope which de-

clares the variables vars. Variables from outer scopes are shadowed; for those which are

still visible the scoping nest is incremented by one. The function index

[

retrieves the

index of the variable xin

, the function nest

the scoping nest of x. The function

mkflex

[

vars

adds the given variables to the flex set, the function flex

retrieves the flex

set.In the definition of enter, we use the function

ì´

ÆM½'Æí¼'yÆ

that makes a sequence of some

fixed but arbitrary order from a finite set of variables. Note that variable indices in the

ZAM start with

(whereas Z’s sequences are indexed starting from

). From the flex set,

we remove the newly entered variables, since any variables with the same name in this set

are shadowed, and the flexible information becomes invalid for them.

5.3 Compilation 125

BAS



ENV



EXPB



Code

DIS



ENV



EXPD



Code

CON



ENV



EXPC



Code

LIT



ENV



EXPL



Code

PRO



ENV



EXPA



Code

Figure 5.41: Declaration of Compilation Functions

BAS

¿j:ikj

o^o%©

j

Y¹

BAS

oÑ



MKTERM

ji m





DIS



CON



LIT

ÍÏÎ

¸¶Ð¸



ENV





@

EXPB

i



CONS





EXPD





EXPC





EXPL

Figure 5.42: Compiling General Expressions

5.3.2 Compilation Functions

For each level of the disjunctive normal form (described in Figure 4.3 (on page 74)), a

compilation function is provided: general expressions, disjunctions, conjunctions, literals

and properties. These functions constitute a mutual recursive system, which will be spec-

ified in the subsections below. The declaration of the compilation functions is given in

Figure 5.41.

Compiling General Expressions

The function

BAS

btakes any expression in normal form and compiles it to a sequence

of instructions (Figure 5.42). For constructor application,

ikj

, code is generated which

pushes the arguments, and the MKTERM instruction is appended to this code. For other

expressions, the appropriate specialized compilation function is called.

Compiling Disjunctions and Conjunctions

The compilation of disjunction (union) and conjunction (intersection) is defined in Fig-

ure 5.43 (on the next page). For unions,



c,UNION instructions are folded into the

instructions generated from the operands c. A simple optimization can prevent the cre-

ation of the empty set as a neutral element, which we have omitted. For the compilation

of intersection,



l,ISECT instructions are folded into the instructions generated from

the literal operands. The sequence of conjuncted literals can be assumed to be nonempty

by the normal form of expressions.

Compiling Literals

The compilation of literals (and atoms, which we subsume here) is defined in Figure 5.44

(on the following page). The following comments:

126 Abstract Machine

DIS

¿j



o%©

j=¬



EXPC



code



Code

code

CON





UNION

MKEMPTY

ÍÏÎ

¸¶Ð¸



ENV







EXPC

CON

¿j

~

o¾©

jÈ¬



EXPL



code



Code

code

LIT





ISECT

LIT

head l

o¢o

taill

ÍÏÎ

¸¶Ð¸



ENV







EXPL





òq

Figure 5.43: Compiling Disjunctions and Conjunctions

LIT

¿j

o ©

ùOú

nest

[

o 

¸¶ü

ù:ú

flex

¸¶ü

WAIT

index

çm

oÚo

LOAD

index

oÚo

¸ä·:ýd¸

LOAD

index

[

o¢o

¸¶·:ýd¸

LOADENV

nest

[

index

oÚo

 jz~



o ©

BAS





MKSINGLE

 j

o ©

DIS





 j=

o ©

LIT





MKVAR

jÈq

oÑ



MKINTEN

j¢®Dmd®Dm

AfA

WAIT

jÈq

LOAD

jÈq

MKSINGLE

LOADPAR

jz®

ISECT

TESTNATIVE

SUCCESS

CC

 j

lAÒÓ

ÚÅ

o%©_·O¸'¹

vars

}ì´

Æíw

lç

ì´

Æíw



·O¸'¹



mkflex

enter

[

vars

o»±

CON

BAS







MKINTEN

j¢®Dm



vars

AY¹

BAS

LOADPAR

j¢®

MEMBER

SUCCESS

CfC

 jz~



¾

o ©_·O¸'¹



Dì´

ÆMw

ºk

5æj

ì´

ÆMw



flex



nest

[

o 

qä

·O¸'¹



enter

[

ì´

ÆMw

o»±

j=¬



WAIT

index

oÚo¢o

Øì´

ÆM½'Æí¼'yÆ

BAS

mkflex

ì´

Æíw

oÚo





MKINTEN

jÈq»m



ì´

Æíw

AY¹

PRO

k¢C

 j:é%ê

o ©_·O¸'¹



Dì´

ÆMw

æ}j

ì´

Æíw



flex



nest

çm

oñ

q

·O¸'¹



mkflex

enter

[

ì´

ÆMw

ì´

ÆMw

o»±

j=¬



WAIT

index

oÚo¢o

Øì´

ÆM½'Æí¼'yÆ

BAS





MKINTEN

jÈq»m



ì´

Æíw

AY¹

PRO



oÑ



ÍÏÎ

¸¶Ð¸



ENV





VAR





EXPB





EXPD





EXPA



EXPC





EXPP



lUm





PAT

Figure 5.44: Compiling Literals

For a variable,x, we distinguish whether it is local (its lexical nesting is

) or from

the context. For local variables, we generate a LOAD instruction, prepending a

WAIT instruction if the variable is nonflexible. For a context variable, we generate

aLOADENV instruction.

5.3 Compilation 127

PRO

¿j





o%©

BAS

mkflex

çm

ì´

Æíw

oÚo

BAS

mkflex

çm

ì´

Æíw



oÚo







UNIFY





SUCCESS

 j



o ©

BAS





STORE

index

[

oÚoÑ



SUCCESS

 j



o ©

BAS

mkflex

çm

ì´

Æíw

oÚo

BAS





UNIFY





SUCCESS

 j

o ©

BAS

mkflex

çm

ì´

Æíw

oÚo

DIS





MEMBER





SUCCESS

 j



o ©

CON





TEST





SUCCESS

 j

0

o ©

CON





TESTNATIVE





SUCCESS

ÍÏÎ

¸¶Ð¸



ENV



l^m





PAT





VAR





EXPD





EXPC





EXPB

Figure 5.45: Compiling Properties

For a singleton-set construction,



, code is generated to strictly create the element

on the stack, and the MKSINGLE instruction is appended.

For a singleton-set selection,



d, code is generated which pushes the value of dto

the stack, and the MU instruction is appended.

For a set complement,



a, code is generated that realizes the



Zintension





0

jz~

ñ



, where vrepresents the evaluated value of a, which is passed as an

intension parameter.

For a translation, c

lÕÒÓ

ÚÅ

, code is generated that realizes the



Zintension





lÕ×



, where vrepresents the evaluated value of c, which is passed as an intension

parameter.

For a schema,



¾

, code for creating an intension is generated. Before the

creation of the intension, code is inserted which waits for all free variables of the

schema which are in the current scope and which are not flexible. This ensures that

the newly created intension does not refer to free variables from the environment.

It is sufficient to wait for the variables of the current scope (from the viewpoint of

the code creating the intension), since, inductively, the variables of the outer scopes

are bound. Flexible variables are excluded from this treatment.

A fixed-point,

éçê

b, is compiled as the form

~







, with the difference

that during compilation of p



bwe add the variables from pto the set of flexible

variables. This allows for the construction of cyclic intensions. Note that we need

the temporarily constructed intension that is immediately deconstructed by the MU

operator in order to get an environment where inner intensions created for bcan

depend on.

Compiling Properties

The compilation of properties is defined in Figure 5.45. The following comments:

A pattern equality, p





, is mapped to a UNIFY instruction where the patterns

are compiled such that all variables in them are flexible. Note that, by normal-

ization, the variables appearing in the patterns must be directly bounded by the

enclosing schema.

128 Abstract Machine

The next three cases are variants of pattern membership, p

c. For the case of a

direct assignment to a variable, x



b(a shortcut for x

×>~



) we use the STORE

instruction as an optimization. For the case of an assignment to a pattern, p



b, the

UNIFY instruction is used. In compiling the pattern p, the variables of the pattern

are made flexible. Finally, the general case, p

cis compiled to the MEMBER

instruction.

The remaining two cases handle the test for emptiness or nonemptiness of the set c,

and are mapped to the TEST resp. TESTNATIVE instructions.

5.4 Implementation 129

5.4 Implementation

A prototypical implementation of the ZAM in C++ has been realized, which is discussed

in this section. The implementation differs from the specified model in the following

respects:

In order to make dumping of thread states efficient, we use a persistent single as-

signment register machine instead of a stack machine. Since a single assignment

thread backtracking to an execution point always finds a valid temporary state. The

usage of a register machine has some influences on the instruction set, as will be

seen below.

Once having the concept of persistent single assignment registers, we may use dif-

ferent techniques for passing parameters to the MKINTEN instruction and other

instructions. Instead of copying the parameters, we just pass a reference to a persis-

tent register area (a pointer to a C++ array of values)4.

Since we have no problems with creating cyclic data structures in C++, the con-

struction of environments for intensions as in the ZAM’s Z specification can be

completely avoided, using intension parameters instead. Since the intension param-

eters are passed by a reference to a register area, it is particularly easy to back-patch

fixed-points.

Below, we sketch the data model of the ZAM in C++ and the instruction set of the register

version. Some first benchmarks for the prototypical implementation are also given.

5.4.1 Data Model

Figure 5.46 (on the following page) shows the basic types used in C++ to represent ZAM

values. Here, we also sketch the realization of native values, used for implementing

numbers and other builtin types. Native values are characterized by a virtual method

compare which implements a total order on them (in the C++ implementation, ordering

instead of extensional equality is used).

A set value, SetData(extens,intens), consists of two sorted sequences, rep-

resented as STL vectors. C++’s standard template library provides us with algorithms for

set union and intersection on sorted sequences.

As discussed, an intension in the C++ implementation does not hold an environment

but only an array of parameters. For each free variable of the intension’s constraints, the

params field must contain an entry where a copy of this variable’s value is stored. The

resulting representation of environment variables is very similar to closures, as yielded by

implementing nested functions by

-lifting in the implementation of functional languages.

A constraint is represented by an abstract class, which provides a method to spawn

athread executing this constraint. This abstraction allows the easy integration of spe-

cialized constraint implementations (for example, the intensions dynamically created for

4Pointers to registers seem to be an abuse of notions. We justify this by that the instruction set of the

C++ ZAM actually corresponds to a register transfer language. Moreover, storing registers in reentrant

memory is also a characteristic feature of RISC like processor architectures.

130 Abstract Machine

typedef ValueData const * Value; typedef Inten-

Data const * Inten;

typedef ConstrData const * Constr;

enum Kind { TERM, VAR, SET, NATIVE };

struct ValueData { Kind kind; };

struct TermData : ValueData { Constr cons; Value args[]; };

struct VarData : ValueData { int index; };

struct SetData : ValueData { vector<Value> extens;

vector<Inten> intens; };

struct NativeData : ValueData { virtual int com-

pare(...) = 0; ... };

struct IntenData { int varcount; Value pat; Value params[];

Constr constrs[]; };

struct ConstrData { virtual Thread spawn(..., int varshift) = 0; }

Figure 5.46: Value Types of the ZAM in C++

typedef ThreadData * Thread; typedef GoalData * Goal;

struct ThreadData { enum Status { RUNNING, SUCCESS, FAIL-

URE, ER-

ROR, MWAIT, TWAIT } status;

Instr * code; int varshift; int dumpDepth;

enum Priority { LO, HI } prio;

Value params[]; Value registers[];

union { <<data for MWAIT and TWAIT>> };};

struct VarInfo { Value binding; vector<Thread> wait-

ing; };

struct ChoiceInfo { Thread initiator; <<data for alterna-

tive>>; vector<int> bound; vec-

tor<ThreaDump> dumps;vector<Thread> spawned; };

struct ThreadDump { Thread dumper; Instr* code; };

struct GoalData { Thread parent; vector<Thread> threads;

Value pat;

vector<VarInfo> vtab;

vector<ChoiceInfo> choices; };

Figure 5.47: Control Types of the ZAM in C++

realizing set intersection (cf. Figure 5.30 (on page 117)). The spawn method is passed the

variable shift of the intension’s instance.

The representation of threads and goals in the C++ implementationis givenin Figure 5.47.

It is very similar to the specification in Z. There are the following differences:

5.4 Implementation 131

MKEMPTY

MKSINGLE r

MKINTEN s r

Ò Ó Ð

FIXINTEN r

MKTERM c

Ò Ó Ð

UNION r

ISECT r

WAIT v

MOVE r

UNIFY r

MEMBER r

MU r

TEST r

TESTNATIVE r

SUCCESS

Figure 5.48: Register Transfer Instruction Set of the ZAM

For reasons of efficiency, auxiliary instructions such as TRYNEXT

which

carry a data state can not be used. We thus have additional data in choices, rep-

resenting the state contained in TRYNEXT. Furthermore, threads have additional

states, MWAIT and TWAIT, and corresponding data components for execution the

MU and TEST/TESTNATIVE instructions; this data is only valid if a thread is in

one of these states.

The need of a thread to dump its state to the active choice is encoded in a more

efficient way than in the specification. The field ThreadData::dumpDepth

contains the maximal choice depth a thread has been dumped to, and monotonically

increases during the thread’s progress. The test for the need to dump is coded as

goal->choiceTable.size() > thread->dumpDepth.

As discussed, it is sufficient to just restore the program counter to backtrack threads

since the ZAM uses a single assignment register model for storing temporaries (field

ThreadData::registers):. Thus a thread can be restored to an arbitrary execu-

tion point, always finding a valid temporary state at this point5.

5.4.2 Instruction Set

The ZAM’s C++ implementation uses a register transfer instruction language. The in-

structions are summarized in Figure 5.48. The operands of instructions are the followings:

r: the index of a register. This addresses the field ThreadData::registers

of the executing thread.

v: the index of a variable. This addresses the GoalData::vtab field of the goal,

the executing thread belongs to. If the variable is bound, its binding is denoted,

otherwise a VarData term holding the variable’s index. The executing thread adds

the ThreadData::varshift value to the index, which determines the base of

the variable frame of on associated instantiation. If a variable is the destination of

an instruction, then the value of the instruction’s result is unified with the addressed

variable’s value.

5By construction, a thread can never dump in state MWAIT or TWAIT – since in these states it does not

progress. Thus the special temporary control data stored in ThreadData for these states needs not to be

dumped.

132 Abstract Machine

p: the index of a parameter. This addresses the field ThreadData::params of

the executing thread.

p: alternatively, a register, variable or parameter index. Similar as the Java

Virtual Machine, the ZAM provides in fact a set of specialized instructions for the

different addressing modes, which avoid dynamic encoding at runtime, but from

which is abstracted here. Around a hundred concrete instructions are derived from

the combinations of addressing modes in the abstract instructions in Figure 5.48 (on

the preceding page).

Ò Ó

: a reference to a register region. Provided is the starting index of a consecutive

region of registers.

s: the index of an intension. This points to a global table of intensions belonging to

the executed unit (the “constant pool” in notions of the JVM).

c: the index of a constructor. This points to a global table of constructors belonging

to the executed unit.

The following remarks on the instructions:

The MKEMPTY instruction stores the empty set in the destination operand, the

MKSINGLE instruction a singleton set containing the value in the source operand.

The value in the source operand must not contain free variables, otherwise the be-

havior is undefined (the compiled code ensures this with the WAIT instruction).

The MKINTEN instruction creates a new intension. The parameters of the in-

tension are found in the register region starting at r

Ò Ó

. In the intension’s field

IntenData::params,just apointer to the registerregion isstored. The FIXINTEN

instruction commits the parameters of the addressed intension to be completely ini-

tialized. This instruction may make a copy of IntenData::params; however,

by the assertion of “single assignments” to registers, the FIXINTEN instruction may

as well do just nothing. (Making a copy avoids space leaks, since the live-time of

the created intension may exceed the live-time of the creating thread; in this case, if

no copy is made, the entire register array of the thread cannot be reclaimed by the

garbage collector unless the intension is.)

The MKTERM instruction creates a new constructor term. The arguments of the

constructor are passed in the register region r

ÒØÓ

. Again, a copy of this region is not

required, though omitting it can create space leaks. (To copy or not, or to let this be

determined by compiler optimizations, is an open question.)

The UNION and ISECT instruction implement set union and intersection, as de-

scribed in the specification.

The WAIT instruction lets the executing thread suspend until the addressed variable

(and all the variables it indirectly refers to by a possible binding) are bound.

5.4 Implementation 133

The MOVE instruction transfers the source to the destination. If the destination is a

variable and is bound, this is similar to unification. The UNIFY instruction performs

a unification of its operands. If unification fails, the executing thread fails.

The MEMBER instruction tests whether the first operand is member of the second

set operand, as described in the specification.

The MU,TEST and TESTNATIVE instructions create subgoals for resolving their

operand, as described in the specification.

The SUCCESS instruction terminates the executing thread. If all threads of a goal

have been successfully terminated, then an associated parent thread of the goal is

resumed (which is present if the goal is a subgoal).

5.4.3 Memory Management

A Boehm-style conservative garbage collector [Boehm, 1993] is employed for memory

retrieval. The collector needs to handle inferior pointers (references “into” a memory

cell), because pointers to register regions may be stored in values, but also since the

standard template library of C++ is involved, which may use local allocation strategies

for containers. Nevertheless, the implementation of the garbage collector is compact (300

lines of code) and portable: this becomes possible since garbage collection can be invoked

synchronously inbetween the execution steps of threads, where no hardware stack exists,

and the only root to consider is the ZAM’s configuration which points to the topmost goal.

5.4.4 Performance

Since the compiler from

Zto ZAM’s register transfer code is not yet ready, performance

measurements can be only very rudimentary. As a first guess, an “assembler” implemen-

tation of the list concatenation function app has been used:

Ú9Û

app

ÜÞÝÖßDàÔ

×7Ñ

â\ã

ÝÖ

äåä

×çæ

ÑÖ

×5è

app

äåä

All solutions for app

, where cis a constant list of length

êëvì

, have been queried,

in a loop executed thousand times (such that garbage collection needs to be invoked).

The current implementation requires

êí¯îðï

seconds on a Pentium II/400, hence

ê0í¯îñï

ms for

one evaluation of app, backtracking over all possible 128 partitions. Around

ëîðò

million

instruction steps are executed, which amounts to

êóvóvôôô

steps per second.

As a further test, a recursive definition of the fac function has been coded:

Ú9Û

fac

ÜÞÝÖ

×=Ñ

áô5é

áõêvâ\ã

ÝÖ

×=Ñ

öHêé

÷\ÙFÖ?Ö

fac

éÝ

øêvâçÒ

ùÐ Ö

Ô ×~Ó×yÒúÖ Ô

×ùÐ

Ó×fâ

For numbers and arithmetics, a native implementation is used (the ZAM has also some

instructions for natives that have not been specified). Executing fac

thousand times is

done in

êîí

seconds, requiring

ûôvûôôô

steps: thus one recursive function application as

134 Abstract Machine

implemented with the

-operator requires around

ô¯îë

ms. In contrast, a highly efficient

implementation of a functional language such as that of Opal [Schulte and Grieskamp,

1992] requires around

ô¯îðôô¯êï

ms per recursive call –

ê@ïë

times faster. This is not surpris-

ing, since the code the Opal compiler generates for fac is identical regarding efficiency

to that of an according recursive C function – whereas function application via the

operator deals with relations. Nevertheless, more effort has to be invested to get function

application in the ZAM more efficient for special cases.

Taking into account that the ZAM is a virtual byte-code machine, these results are en-

couraging, though real benchmarking is required to get a clearer picture, and comparisons

with the performance of other implementations of functional logic languages have to be

made.

5.5 Discussion 135

5.5 Discussion

We defined the abstract machine ZAM, supplied a comprehensive specification of its op-

eration (which also serves as a case study for the Z specifications we wish to execute by

our approach), provided a compilation of normalized

Zinto the machine’s instruction

set and outlined the implementation of the machine in C++. Related work is discussed

below.

There are a wide variety of abstract machines for functional logic languages. Even

five years ago, Hanus [1994] mentioned fourteen different designs, which he considered

as the “most important” ones. In general, these machines can be divided into those that

have evolved from machines for logic languages, those from functional languages and

those that use techniques for concurrent constraint resolution.

Extending Logic Implementation Techniques

Machines based on implementation techniques for logic languages are often extensions

of Warren’s Abstract Machine [Warren, 1983; Ait-Kaci, 1991]. The WAM provides an

efficient implementation of Prolog, which breaks down unification and backtracking to a

level of well-designed, specialized low-level instructions. The ZAM does not attempt to

do this: its instructions are complex; unification, for example, is contained as “atomic”

functionality.

Some extensions of the WAM integrate narrowing (outermost or innermost) into the

machine, for example by adding the possibility of term replacement on heap values, as in

the A-WAM [Hanus, 1990]. Other extensions such as the K-WAM [Bosco et al., 1989]

are based on translating functions into relations, with the result that deterministic program

parts are executed using the full indeterministic machinery. This is, in fact, close to the

ZAM, where functions are already mapped to relations at the calculus level of

Z(though

the ZAM has the advantage of choice point scheduling, which Prolog implementations

can only hardly achieve because of the sequential execution model of Prolog.) Concep-

tually, both approaches are inadequate for implementing

Z, since concurrent constraint

resolution is essential for our application. Furthermore, these techniques do not treat

higher-orderness in the sense that the ZAM does, allowing arbitrary set extensions and

intensions to be combined at runtime to form new sets, relations and functions.

Extending Functional Implementation Techniques

Machines based on implementation techniques for functional languages use stack-based

reduction (as first described for the SECD machine, [Landin, 1964]), continuation-passing

style reduction (CPS, Appel and Jim [1989]), and graph-based reduction, the most promi-

nent instance of which is the STG (Spineless Tagless G-machine, [Jones and Salkild,

1988; Jones, 1992]). The stack-based implementations of functional languages have the

advantage that they can be mapped to off-the-shelf hardware, making effective use of the

target architecture’s hardware stack and registers, as is achieved, e.g., in the implemen-

tation of the functional language OPAL by compilation of (recursive) OPAL functions to

(recursive) C procedures [Schulte and Grieskamp, 1992; Schulte, 1992]. CPS techniques

make the reduction stack implicit by passing a continuation to each function application

136 Abstract Machine

to be called with the result of evaluation, implying that “stack frames” are actually allo-

cated on the heap. Given a good garbage-collection scheme, this technique can be rather

efficient [Appel, 1987], since it allows a natural optimization of tail calls, efficient real-

ization of exceptions (important for the implementation of ML [Milner et al., 1990]), and

a stackless implementation of concurrent threads, as used, e.g., for CML [Reppy, 1991].

The STG-based implementations use the graph-reduction paradigm, resulting in com-

piled code which looks – interestingly enough – rather similar to the final code generated

for higher-order functions in stack-based or CPS approaches. A “suspension” in the STG

is represented as a closure with an associated evaluation method. The first time the sus-

pension is forced, the evaluation result is memorized in the closure and the evaluation

method is updated. The next time the suspension is consulted, the updated method sim-

ply retrieves the result from the previous evaluation. In the stack-based implementation

of the strict language OPAL, exactly the same effect is achieved by memorization of the

evaluation result of

-ary function abstractions,

ÏnÖ×

e, in closures, with the only sig-

nificant difference to the STG that strict stack-based evaluation is the default and lazy

closure-based evaluation the special case.

One of the first attempts to extend functional reduction by logic computation is the

BAM [Kuchen et al., 1990]and its lazy version, the LBAM [Moreno-Navarro et al., 1990],

which are used to implement the functional logic language BABEL [Moreno-Navarro and

Rodriguez-Artalejo, 1992]. These machines extend graph reduction by nodes for logic

variables and use choice points and trailing as in the WAM (and the ZAM) to deal with

backtracking. An optimization by deterministic rule detection at runtime is described by

Loogen and Winkler [1991]. If the application of a rule does not bind any logic variable,

then – by orthogonality of the functional rule system – choice points for other cases can

be discarded. The ZAM, by contrast, cannot rely on orthogonality, since whether a set is a

function is not represented in

Z. Hpwever, we have shown how choice point scheduling

results in similar effects as the “dynamic cut” of Loogen and Winkler [1991].

Based on the STG model and the above-described closure technique, the JUMP ma-

chine [Chakravarty and Lock, 1991; Lock, 1992; Chakravarty and Lock, 1997] provides

an interesting approach for integrating logic computation into a functional machine. The

notion of a “closure” is extended to an “active object”, which is used to represent several

kinds of runtime objects – among them suspensions and logical variables. “Jumping” in

this machine amounts to calling the evaluation method of an active object, which behaves

differently in dependency of kind and state of the object. The JUMP machine uses tech-

niques similar to that of the WAM and the ZAM for dealing with backtracking: choice

points are used to hold information for restoring the state of the machine. The machine

can avoid the creation of choice points if functions are applied to ground terms, since it

can assume orthogonality of the function’s rules. One technical problem with the archi-

tecture of the JUMP machine is that it does not match well with modern, cache-based

processor architectures, since it makes abundant use of indirect code jumps between very

small code chunks – on these architectures, the older technique of “tag-based branching”

for compiled code might be in fact more appropriate, and a byte code interpreter is not

much slower.

The machines mentioned are examples of extending functional implementation tech-

niques by logic computation. Again, these techniques, like the discussed extensions of

the WAM, are inadequate for implementing

Z. The main problem is the lack of support

5.5 Discussion 137

for concurrent constraint resolution and the possibility to freely combine extensions and

intensions to build sets, relations and functions at runtime.

Concurrent Constraint Resolution

An influential design of a concurrent constraint-resolution machine is the implementation

of the Andorra Kernel Language, AKL [Janson and Haridi, 1991; Franzen et al., 1992;

Janson, 1994]. Among others, the “Andorra Principle” [Warren, 1990], of which the

ZAM’s technique of choice scheduling is an instance, is implemented by this machine.

One of the more recent designs of a constraint-resolution machine is the Oz machine,

part of the Mozart system [Mehl et al., 1995; Scheidhauer, 1998; Mehl, 1999]. The Oz

machine uses a so-called store to represent Oz’s computation spaces, containing bindings,

constraints and values. A so-called worker executes a thread, which is represented by a

stack of so-called tasks that are executed until the stack is empty. Tasks are tuples of envi-

ronments and instruction pointers. On execution of a task, new tasks may be pushed to the

stack of a worker, thus realizing dynamically calculated control flow. Since tasks encap-

sulate their environment, they can be moved between thread workers: hence suspensions

of computations that access unbound variables can be handled by moving the related task

to a newly created, suspended thread worker and continuing with the next task on the

stack of the current worker. One advantage of this model in comparsion with the ZAM

is that, initially, only one thread is used to execute a set of constraints, whereas the ZAM

spawns a thread for each constraint from the start. On the other hand, the representation

of tasks and of context switching to environments attached to each task has to be paid

for in the Oz machine, whereas thread creation in the ZAM costs only a few instructions.

Moreover, minimizing the number of active threads can also be performed by optimiza-

tion at compile time: based on a mode analysis that evaluates static data dependencies, a

set of

Zconstraints can be compiled to execute in a single ZAM thread.

The Oz machine does not use trailing and backtracking for implementing choices, but

instead copies stores. This allows several search strategies (breadth-first, depth-first) to be

easily realized. Copying is closely related to implementing search in functional languages

using a “continuation” function that holds the alternative of a choice, the current set of

context bindings, stored in a closure. The general problem with copying is that it is

speculative: it is not known whether the copy will actually be used during subsequent

computations.

A quite recent machinerealizing concurrent constraint-resolution is describedby Hanus

and Sadre [1999] for the implementation of Curry. This machine uses “and parallelism” as

usual in constraint resolution, but also “or parallelism” for the execution of disjunctions:

independent threads are spawned for the different choices in a disjunction, effectively

implementing breadth-first search, as required by Curry’s semantics. Choice scheduling,

resembling the Andorra principle [Warren, 1990] as in the ZAM, is applied to prune the

overhead of choices. To avoid copying, this machine associates with each thread a unique

key, and each variable contains a hash table of bindings indexed by thread keys. Though

speculative copying is avoided, the overhead of this approach is that each variable access

requires selection via a hash table.

The ZAM has a simpler and much more compact design than the other machines

mentioned, which it inherits from the minimalistic

Zcalculus. But the ZAM is also less

138 Abstract Machine

complex because it is restricted to built-in depth-first search, implemented by the classical

trailing and backtracking technique. Forproblems that can be solved by depth-first search,

this leads to more efficient execution results than copying of stores (as in the Oz machine)

or variable hashing (as in the Curry machine), but problems which require breadth-first

search cannot be handled adequately by the ZAM.

Chapter 6

The ZeTa System

“Sie meinen also tats¨achlich,

daß ein Kunstwerk kollektiv

hergestellt werden k¨

onnte?”

- : “Ab¨

arrj´

a!”

(Arno Schmidt: Die Gelehrtenrepublik)

The concepts presented in this thesis are applied in the broader context of the ZeTa sys-

tem, which was developed by the author together with Robert B¨ussow in the course of the

ESPRESS project. We conclude the thesis with a review of ZeTa and the application of

the

Zimplementation, called ZaP, in the context of the ZeTa system.

6.1 Three Dimensions of Integration

The goal of the ZeTa system is to provide an open environment for editing, browsing and

analyzing integrated specifications, assembled from heterogeneous formalisms – includ-

ing Statecharts, Z, temporal logic, message-sequence charts and others. These formalisms

are processed by different techniques, such as deduction-based analysis, model checking,

systematic testing and simulation. Established CASE tools (such as Statemate [Harel

et al., 1990]) shall be used in combination with the existing powerful tools for formal

methods (such as the deduction system Isabelle [Paulson, 1994] or the model checker

FDR [Formal Systems , Europe]), and specialized tools newly developed in the context of

ESPRESS – such as the ZAP compiler.

ZeTa’s design as an open environment for notations and tools was derived from the

experiences in the application oriented research project ESPRESS, in which the industrial

partners determined the choice of formalisms and tools. In basic research one can stick to

a particular notation or tool over a period of years, carefully tuning the approach. On the

other hand, in industrial research, comprehensible results are asked for soon and require-

ments and fashions change fairly quickly. From the point of view of basic research, it is

therefore desirable to have a flexible approach with regards to the “front-end” technology,

one which can easily cope with new requirements while at the same time allowing scope

for steady enhancement of the underlying “back-end” technology.

The ZeTa system sets out to achieve this goal by defining three dimensions of in-

tegration at the front-end level: semantic integration,document integration and tool in-

tegration. This logical integration levels are accompanied by graphical user interfaces.

The ultimate goal is to make basic tools, e.g. for computation, once they have been in-

tegrated on the semantic and the tool level, reusable for different document formats and

input languages.

140 The ZeTa System

.....

External Notations

and Tools

Statemate

Standard Z (data) µSZ core

(behavior)

µSZ with plugins

....

Temporal Logic

HOL-Z prover SMV Model Checker ZAP Compiler

Generic Tools

ü>ý

ü¢ý

þ¢ÿ





Figure 6.1: Semantic Integration in ZeTa

6.1.1 Semantic Integration

Semantic integrationin ZeTais realized by the



notation [B¨ussowet al., 1996, 1997a;

B¨ussow and Grieskamp, 1999].



is based on a core, which is a collection of con-

ventions for specifiying structure and behavior in Standard Z. External notations are inte-

grated by so-called languageplugins, which are defined by a shallow syntactic translation

into the



core. This is illustrated in Fig. 6.1.

The



core provides a model for concurrent components. Shared and private data

can be described, thus constituting the data space. A set of traces of bindings of the data

space defines the dynamic behavior. The dynamic behavior can be specified in several

ways: for example, by giving a data-state-transition relation, which can be executed by

ZaP, or by giving axiomatic constraints on the set of traces (which are not as easy to

execute – but see, however, future work in Chapter 7). Prior to trace construction, the

data space is extended by information to model concurrent access to its variables. To this

end, each variable of the data space is equipped with a lock, which can be acquired by

concurrently executing activities. The locks allow us to resolve racing, the situation in

which two parallel activities write to the same variable during the same execution step.

Racing is a prototypical situation of concurrent resource allocation and can be used to

model higher-level communication facilities, such as channels and service-access points

(i.e. remote procedure calls). For further details, see [B¨ussow and Grieskamp, 1997,

1999].

To integrate a new formalism, it is necessary to define a translation of this formalism

into the



core. For tool integration, a so-called model source adaptor that does the

translation has to be implemented. The adaptor concept is outlined below in Section 6.1.3

(on the next page). Whether integration of a formalism is possible depends on whether a

translation can be found. To what extent generic tools such as ZAP can be reused depends

on the



core definitions resulting from this translation.

6.1 Three Dimensions of Integration 141

.......

Statemate Event Diagram Editor

Die Kontrolle des Fahrgschwindigkeitskonstanter

wird durch diue folgende Klasse beschrieben:

\begin{class}{Control}

\end{class}

\includeMSL{STM:cntrl.sch}

\includeMSL{STM:cntrl.ach}

\includeMSL{EVD:cntrl.diag}

MSL = Model Source Locator

MSZ:control.tex

Editing Browsing Analyzing

.....

External Tools

User Views

Compound Document

Figure 6.2: Document Integration in ZeTa

6.1.2 Document Integration

The “cement” of integration in the ZeTa environment is an integrated specification docu-

ment (ISD). In most cases, this is a



specification, but in general an ISD needs not to

contain formal parts.

An ISD is given in a markup language and contains references to fragments of spec-

ification contributed by external tools. The references are described by model source

locators (MSL), a concept comparable to URLs. An MSL identifies the tool that is to

provide the (sub-)model and a tool specific identifier for the desired portion. This is illus-

trated in Figure 6.2. Besides the external tool Statemate, a tool for editing event diagrams

(providing a graphical frontend for temporal logics) is shown as an example.

The major difference between the concept of URLs and that of MSLs is that the latter

are interpreted in different ways, depending on the operation to be performed on an ISD.

For example, if an ISD is browsed, its MSLs are resolved to encapsulated postscript. If

it is analyzed, the MSLs may be resolved to an



core abstract syntax term which is

inserted in the enclosing component (depending on the kind of analysis and MSL).

The ISD itself is addressed by anMSL as well: if the ISD is givenin theL

TEX-markup,

then such an MSL looks like this: LTX:ICC#CruiseControl.tex. Here, LTX de-

notes the tool providing this MSL (the L

TEX-System), ICC denotes a



component

and CruiseControl.tex a file where this component is found.

6.1.3 Tool Integration

The basic operation provided by the ZeTa integration environment is querying the con-

tent of an MSL. This query is parameterized by the type of the desired content. Typical

types are “gif”, “encapsulated postscript”, “parsed abstract syntax”, “type-checked ab-

142 The ZeTa System

stract syntax”, “plugin abstract syntax”, and so on. Types may become more complex

containing further selective information, e.g. “the precondition of schema S”.

The ZeTa environment behaves as a broker between queries and tools that have reg-

istered in the environment the ability to compute content of specific types. Once a tool is

contacted to compute a particular content, it may recursively query the environment for

further content. For example, when a type checker is contacted to compute the “checked

abstract syntax” of an



class, it first queries the “parsed abstract syntax”. If this syntax

contains MSLs referring to external notations, it next queries the “plugin abstract syntax”,

then merges the syntaxes, and finally performs its type-check operation. This way, tool

chains are realized.

The basic model of ZeTa is comparable to a traditional Unix make, but it is more

powerful and reliable, since it is embedded in the strongly-typed Java programming envi-

ronment:

The content exchanged by tools via ZeTa are standardized data formats defined by

Java APIs. For example, for the



language (and the Z language embedded in



), the abstract syntax is prescribed by ZeTa in a Java representation.

A tool is represented in a ZeTa session as a Java object whose instances implement

the interface of so-called tool adaptors, and which can have state and behavior. The

adaptor object may implement its functionality completely in Java, or may wrap

another Unix process, or may be an RMI (remote method invocation) proxy to a

remotely running JVM.

Tool adaptors may import external models into the ZeTa session from other running

tools, not just the file system. For instance, the Statemate adaptor imports initial

content for its MSLs, STM:<chart>, by connecting to the “dataport” of a running

Statemate session.

ZeTa caches content resulting from computations, thus providing an outdating

mechanism. While make uses time stamps of files to decide whether an initial

content is outdated, in ZeTa adaptors that provide initial content may use arbitrary

techniques for this purpose.

ZeTa realizes a session protocol for computations, which keeps a record of the

diagnostics produced by tools, among other things.

The plugins of the



notation directly correspond to certains kind of tool adaptors

plugged into the ZeTa environment (in fact, historically, the ZeTa plugins preceded the



plugins). These adaptors, which import an external notation into ZeTa by providing

initial content for MSLs, are called model source adaptors (MSA).

In the simplest case, an MSA directly performs the translation of the external notation

to the



core. Usually, however, there are intermediate steps in the way from an

external notation to



, and the intermediate content may be used by tools specially

tailored for its type.

This is illustrated in Figure 6.3 (on the next page). The Statemate-MSA initially

creates an abstract syntax for the Statechart, denoted by the MSL STM:ICCCONTROL,

which, in a second step, is converted to the



core syntax. In turn, the L

TEX-MSA

6.1 Three Dimensions of Integration 143

STM:ICCCONTROL / statechart absy LTX:ICC / µSZ lexis

LTX:ICC / µSZ parsed absySTM:ICCCONTROL / µSZ plugin absy

LTX:ICC / STM Simulation Model

LTX:ICC / µSZ checked absy

LTX:ICC / HOLZ model

Figure 6.3: Chains of Content Queries

initially creates a



lexis for the MSL LTX:ICC (the file name in the MSL has been

omitted), which is converted first to a parsed abstract syntax, then to a type-checked ab-

stract syntax. Assuming that the component denoted by LTX:ICC refers to the MSL

STM:ICCCONTROL, the type checker queries the abstract syntax of the



plugin as-

sociated with this MSL and includes it in the resulting type-checked abstract syntax (thus

checking the type consistency of the plugin within its component context).

Now the generation of a model for the HOLZ proveradaptor [Santen, 1999] is entirely

based on the checked abstract syntax, with the plugins expanded. However, in order to

generate a model for the Statemate simulator, it is necessary to query the checked



syntax as well as the original representation of the Statechart. Generating a simulation

model amounts to compiling code for the Z schemas referred to from transitions and

bind them to the Statemate simulator. To generate the binding code, the structure of the

Statechart has to be known.

This example shows that in the real world the integration provided by



plugins

is not always the only underlying representation for tools. The translation of an external

notation to the



core is an abstraction step in which information (in a syntactical

sense) is lost.

6.1.4 Graphical User Interfaces

ZeTa realizes a strict partition of model, view and controller. The model is effectively

implemented by the individual tool adaptors and the ZeTa content-query broker. The

controller is based on a command language that can be also used in batch mode. Two

“views” (GUIs) are provided on top of the controller.

XEmacs GUI. This GUI is based on the extendable texteditor XEmacs (see Figure 6.4).

It is primarily intended for users working under UNIX, who are used to the Emacs editing

environment. It maintains a session log, which utilizes for each operation the produced

output and supports browsing diagnostics contained in the output. It further provides

VTEX, a “what-you-see-is-what-you-mean” editing mode for L

TEX/Z documents. VTEX

is a customizable XEmacs mode that incrementally parses and visualizes the L

TEX struc-

tures and math symbols entered by the user.

144 The ZeTa System

Figure 6.4: The XEmacs GUI

Swing GUI. This GUI is based on Java’s Swing system (see Figure 6.5). It allows the

activationof ZeTaoperations by graphical dialogs and maintains a session log that utilizes

for each operation the produced output. Browsing diagnostics is supported by contacting

an external editor, which may be Emacs, but other editors (in particular. MS-Windows-

specific editors) may be used as well.

6.2 The ZaP Compiler

ZaP (version

) is an experimental compiler for the Z language, which supports the full

higher-order functional sublanguage of Z. It is a predecessor of the forthcoming ZaP-

6.2 The ZaP Compiler 145

Figure 6.5: The Swing GUI

, whose underlying concepts are described in this thesis and which will support the

functional logic sublanguage embedded in Z. Apart from its more powerful computation

model, the forthcoming ZaP-

will not differ from ZaP-

, so we can present the user’s

view of ZaP based on the old system.

ZaP follows the approach of compiling every Z construct. Nonexecutable constructs

of Z will be rejected once their execution has been attempted at runtime. This approach

reflects that whether a Z construct is executable depends on how it is used at runtime. For

example, a function might be defined in such a way that there is no constructive mapping

from input to output and that the application of the function to an input is not executable.

However, it might be still possible to test whether a given pair of input and output values

is in the function’s graph1.

1For future versions of ZaP, it is planned to have an analysis which gives hints for non-executable

146 The ZeTa System

Below, the well-known example of the birthday book specification is used to illustrate

the basic operation of ZaP. Please note that this example cannot serve as a systematic

outline of ZaP’s computation power, but it at least illustrates the ability to execute the

schema calculus, to lift schemas to functions and to define recursive higher-order func-

tions and relations – in an application context for test data evaluation. Starting with the

usual abstract specification of the birthday book, we refine it for the purpose of executabil-

ity (mainly fixing given types in the specification), and add a framework for evaluating

test data described by input/output behavior.

6.2.1 The Basic Specification of the Birthday Book

The specification is partitioned into several Standard Z sections. The first one, BBSpec,

contains the basic, or “abstract” specification of the birthday book example, as usually

found in the literature. The section is opened with the following declaration (all Z para-

graphs followingthis declaration until the next section declaration will belong to BBSpec):



BBSpec

We first define the given types NAME and DATE, and the data state of the birthday book:

NAME

DATE

BirthdayBook

known



NAME

birthday

NAME



DATE

known



birthday

An initial state of the birthday book is given by the following schema:

InitBirthdayBook

áLá°Ò

BirthdayBook

birthday



!Ó

The value of this schema can be evaluated. In order to perform the evaluation the user

types in the name InitBirthdayBook in the Execute panel of the Swing GUI or in the

command line of the XEmacs GUI, resulting in the following (conceptual) output:

InitBirthdayBook

{<birthday == {},known == {}>}

Note that the field known of InitBirthdayBook is implicitely defined by the invariant of

the schema BirthdayBook,known

"

birthday.

Up to now, we have specified the state of the birthday book and a possible initial value,

but no state transitions. These are defined by the following Z “operation” schemas:

constructs at compilation time.

6.2 The ZaP Compiler 147

AddBirthday

BirthdayBook

name

ä

NAME

date

ä

DATE

name

$'&

known

birthday

(

á

birthday

ã%Ý

name

ùÐ

date

vâ

FindBirthday

)

BirthdayBook

name

ä

NAME

date

¡ä

DATE

name

è

known

date

á

birthday

name

l×

Remind

)

BirthdayBook

today

ä

DATE

cards

¡ä



NAME

cards

áõÝ

known

birthday

×á

today

vâ

Remove

BirthdayBook

name

ä

NAME

birthday

(

áõÝ

name

vâ

birthday

The schema AddBirthday adds a mapping name

GùÐ

date

to the birthday book’s state,

the schema FindBirthday lookups the birthday of a given name

, and the schema Remind

yields the set of names whose birthday is today

. The schema Remove deletes the entry

for name

from the birthday book.

We are not yet able to execute any state transitions, since the types NAME and DATE

are given (uninterpreted). In the next section, we will refine these types.

6.2.2 Refining the Birthday Book for Execution

A refinement of the section BBSpec for the purpose of executionis provided by the section

BBExec:



BBExec

-/.0

12

BBSpec

The purpose of the refinement is to fix the given types NAME and DATE. The Z of ZeTa

allows to refine given types by free types:

NAME

äåäá

Werner

Barbara

Martin

DATE

äWäá

date

Date

148 The ZeTa System

Date

day

ä¯ê=î@î0û¯ê

month

ä¡ê7î@îêë

year

576

month

áë

day

Bëvò

%

month

è£Ýï_Ôzí¯ÔóÔzò¯Ôêêlâ

day

Áûô

We can now evaluate expressions by applying the date constructor to bindings:

date

day

áá¨êÔ

month

áá êÔ

year

áá êòòò

=

date <day == 1,month == 1,year == 1999>

The next application of the date constructor is undefined, since the passed binding is not

in its domain (the month February has no

ûô

th day):

date

day

ááHûô¯Ô

month

ááHëÔ

year

ááõê0òòò

=

ERROR[LTX:bbook.tex(186.3-186.56)]:

execution failed

reason:

application undefined:

relation: date

argument: <day == 30,month == 2,year == 1999>

backtrace:

at testing membership:

value: (30,29)

set: _\leq_

at testing membership:

value: <day == 30,month == 2,year == 1999>

set: LTX:bbook.tex(171.22-171.32)

at applying relation:

relation: date

argument: <day == 30,month == 2,year == 1999>

at evaluating command input

ZaPprints a backtrace describing the reason of the undefinedness. The ZeTa commander

– either via the Java GUI or the XEmacs GUI – allows to browse locators as contained in

the above output by clicking on them.

Next we introduce some convenience functions for describing operations on birthday

books. These functions make use of the new features of the Z ISO Standard as regards to

the unification of schema expressions and plain expressions. They yield a set of bindings

(a schema) which can be referred in schema expressions:

add

ááHÏ

name

NAME

date

DATE

name

áá

name

date

áá

date

AddBirthday

find

ááHÏ

name

NAME

name

Káá

name

FindBirthday

remind

ááHÏ

today

DATE

today

Káá

today

Remind

remove

áLáHÏ

name

NAME

name

áá

name

Remove

6.2 The ZaP Compiler 149

Note that the existential quantifiers in these definitions are schema quantifiers.

We can now execute transitions of the birthday book’s state, starting with the initial

state:

InitBirthdayBook

add

Werner

date

day

ááóÔ

month

áLá

Ô

year

áá¨ê0òëlò

=

×

{<birthday == {},

birthday’ ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

known == {},known’ == {Werner}>}

InitBirthdayBook

add

Werner

date

day

ááóÔ

month

áLá

Ô

year

áá¨ê0òëlò

=

×

add

Barbara

date

day

áLá ê0ì¯Ô

month

áá êêvÔ

year

áLá ê0òû

BC=

z×

add

Martin

date

day

ááõê0ì¯Ô

month

áá¨êêÔ

year

ááõê0òû

BC=

z×

find

Werner

remind

date

day

áá¨ê0ì¯Ô

month

áá êvêÔ

year

áLáõê0òû

B=

×

remove

Martin

{<birthday == {},

birthday’ ==

{(Barbara,date <day == 18,month == 11,year == 1935>),

(Werner,date <day == 7,month == 5,year == 1929>)},

cards! == {Barbara,Martin},

date! == date <day == 7,month == 5,year == 1929>,

known == {}, known’ == {Barbara,Werner}>}

The last evaluation deserves some notes on the schema calculus operator,

.Op

binds primed names x

(

from the signature of Op

to unprimed counterparts xin the

signature of Op

. Variables not bound this way (such as cards

and date

) or kept as is. In

the evaluation result, these variables represent the result of applying the find and remind

operations, respectively.

The application of a unique schema InitBirthdaybook (representing by a singleton

binding set) to deterministic operations yields a singleton binding set. But ZaP is also

capable of handling non-unique initial states and indeterministic operations. Consider the

following definition:

InitBirthdayBookUndet

áá

InitBirthdayBook

BirthdayBook

birthday

Werner

ùÐ

date

day

áá;óÔ

month

áLá

Ô

year

áá ê0òëlò

=

yâ0Ó

Evaluation yields in:

InitBirthdayBookUndet

150 The ZeTa System

{<birthday == {},known == {}>,

<birthday ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Werner}>}

InitBirthdayBookUndet

add

Barbara

date

day

áá¨ê0ì¯Ô

month

áLá êêÔ

year

áá¨ê0òû

CBC=

×

add

Martin

date

day

áá¨ê0ì¯Ô

month

áá¨êvêÔ

year

áá¨ê0òû

B=

××

{<birthday == {},

birthday’ ==

{(Barbara,date <day == 18,month == 11,year == 1935>)},

known == {},known’ == {Barbara}>,

<birthday == {},

birthday’ ==

{(Martin,date <day == 18,month == 11,year == 1935>)},

known == {},known’ == {Martin}>,

<birthday ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

birthday’ ==

{(Barbara,date <day == 18,month == 11,year == 1935>),

(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Werner},

known’ == {Barbara,Werner}>,

<birthday ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

birthday’ ==

{(Martin,date <day == 18,month == 11,year == 1935>),

(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Werner},

known’ == {Martin,Werner}>}

Of course, such evaluations might easily lead to an explosion of possible states. ZaP-

will run into efficiency problems if they are hundreds of them – ZaP-

, using the com-

putation model described in this thesis, shall easily cope with several ten-thousands of

states.

6.2.3 Refining the Birthday Book for Testing

A refinement of the section BBExec for the purpose of evaluation of test data with ZAP is

given by the Z section BBTest:



BBTest

-/.0

12

BBExec

The test-data will be specified by sequences of input and output behavior of the birthday

book. However, the original specification does not provide unique input and output in-

terfaces. But we can lift the original specification without modifying it to such unique

6.2 The ZaP Compiler 151

interfaces. First, we define free types for describing the input and output behavior:

INPUT

äåäá

addI

NAME

DATE

©Ñ

findI

NAME

§Ñ

remindI

DATE

©Ñ

removeI

NAME

OUTPUT

äWäá

okayO

dateO

DATE

§Ñ

namesO

3

NAME

Next, we lift the operations of the original specification to ones with unique in-

put/output interfaces:

UniqueAdd

áá

AddBirthday

INPUT

out

OUTPUT

name

Ô

date

l×ùÐ

addI

out

okayO

)Ö

name

Ô

date

l×

UniqueFind

áLá

FindBirthday

INPUT

out

OUTPUT

name

KùÐ

findI

date

ùÐ

out

dateO

G

)Ö

name

Ô

date

UniqueRemind

áLá

Remind

INPUT

out

OUTPUT

today

KùÐ

remindI

cards

ùÐ

out

namesO

9Ö

today

Ô

cards

UniqueRemove

áá

Remove

INPUT

out

OUTPUT

name

KùÐ

removeI

out

okayO

name

l×

ZaP has no problems with the execution of such morphed operations. Instead of illustrat-

ing this now, we directly continue by defining a framework which tests if sequences of

INPUTS and OUTPUTS do match the specification.

The possible state transitions of the birthday book are defined as the disjunction of all

the operations with unique I/O interface:

UniqueOps

áá

UniqueAdd

UniqueFind

UniqueRemind

UniqueRemove

For test data evaluation, we define a function which, given a pair of input and out-

put states, yields a relation between admissible pre- and post-states of the birthday book

w.r.t. the I/O behavior:

step

áá

INPUT

out

OUTPUT

BirthdayBook

UniqueOps

BirthdayBook

ùÐ

BirthdayBook

(

Finally, a recursive “test”-function is defined which computes the post-states resulting

152 The ZeTa System

from a sequence of input and output traces. The function takes as a parameter a set of

initial states – the condition to let it be executable is that this set is finitely enumeratable:

test

1L2MNMPO,Q

çÖ

?

BirthdayBook

SUTWV

INPUT

SNTV

OUTPUT

R 

BirthdayBook

test

áÏ

init



BirthdayBook

SNTV

INPUT

SUTWV

OUTPUT

[

áHô

\]1

init

1^_W

step

lastis

lastos

test

init

×Ö

frontis

frontos

The

L2McMdO,Q

constructor around the type of the function test tells ZaPnot to try to check

whether the set test is actually a total function of the specified type. This check would not

be executable.

To test “test”, some I/O traces representing test data are defined:

date

êáá

date

day

áá;óÔ

month

áá

Ô

year

ááõê0òëvò

=

date

ëáá

date

day

áá¨ê0ì¯Ô

month

ááõêêÔ

year

áá¨ê0òû

CBC=

êáá°ß

addI

Werner

date

êl×ià

out

êáLá¨ß

okayO

ëáá°ß

addI

Werner

date

êl×zÔ

findI

Werner

×ià

out

ëáLá¨ß

okayO

dateO

date

ê×ià

out

áá®ß

okayO

dateO

date

ë×ià

ûLáá°ß

addI

Werner

date

êl×zÔ

addI

Barbara

date

ë×zÔ

addI

Martin

date

ë×zÔ

findI

Werner

×zÔ

remindI

date

ë×ià

áá°ß

addI

Werner

date

êl×zÔ

addI

Werner

date

ë×zÔ

addI

Barbara

date

ë×zÔ

findI

Werner

×zÔ

remindI

date

ë×ià

out

ûLáLá¨ß

okayO

dateO

date

êl×zÔ

namesO

ÖiÝ

Barbara

Martin

â×ià

out

áá®ß

okayO

dateO

date

êl×zÔ

namesO

ÖÝ

Barbara

â×ià

We can now run some evaluations:

test

InitBirthdayBook

×zÖ

êvÔ

out

êl×

{<birthday ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Werner}>}

test

InitBirthdayBook

×zÖ

ëÔ

out

ë×

{<birthday ==

{(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Werner}>}

test

InitBirthdayBook

×zÖ

ëÔ

out

{}

6.2 The ZaP Compiler 153

test

InitBirthdayBook

×zÖ

ûÔ

out

û×

{<birthday ==

{(Barbara,date <day == 18,month == 11,year == 1935>),

(Martin,date <day == 18,month == 11,year == 1935>),

(Werner,date <day == 7,month == 5,year == 1929>)},

known == {Barbara,Martin,Werner}>}

test

InitBirthdayBook

×zÖ

out

û×

{}

test

InitBirthdayBook

×zÖ

ûÔ

out

{}

Failing tests are indicated in that they yield an empty set as the post-state of an execution.

A refined version of the function test is given as a partial function, which always yields

nonempty sets:

ptest

L2McMdO,Q

¢Ö

?

BirthdayBook

SUTWV

INPUT

SNTV

OUTPUT

 

BirthdayBook

ptest

áÏ

init



BirthdayBook

SNTV

INPUT

SUTWV

OUTPUT

[

áHô

\]1

init

1^_W

ÖÙ

post

áá

step

lastis

lastos

ptest

init

×zÖ

frontis

frontos

KÑ

post



I×

The

-form lets the executing function be undefined if the post states become empty.

Evaluating a failing test now results in a detailed backtrace which tells us the reason for

the failure (the actual backtrace has been ommited):

ptest

InitBirthdayBook

×zÖ

ëÔ

out

ERROR[LTX:bbook.tex(530.2-530.36)]:

execution failed

reason:

mu-value undefined:

set: {}

...

154 The ZeTa System

6.3 Discussion

We briefly discuss integration frameworks related to ZeTa and the execution of Z specifi-

cations.

Integration Frameworks

Few projects tackle the problem of integrating tools for formal methods and existing

commercial CASE tools like the ZeTa system. The approach of the UniForM Work-

bench [Krieg-Br¨uckner et al., 1999] comes closest to doing so. However, whereas ZeTa’s

integration language Java is object-oriented, UniForM employs a functional language

(Haskell) for this task. Also, the architecture of UniForM is more ambitious (and thus

more difficult to implement and maintain) as regards to version control. On the other

hand, UniForM is less specialized than ZeTa and does not provide conventions for inte-

grating notations and documents.

Other approaches like the Concurrency Factory [Cleaveland et al., 1994] or AutoFo-

cus [Sch¨atz and Huber, 1999] present more homogeneous environments, with most tools

written from scratch and less emphasis on integrating existing tools.

Encoding of Z

Some basic principles of a mapping from Z to

Zhave been outlined in Chapter 3, but

we have not discussed the intrinsic details. In the ZaP implementation, the mapping of

Z to

Zhas to convert axioms to definitions where possible, e.g., eliminate quantifiers

in axioms of the kind

×4á

e, map recursion to

Z’s fixed-point operator,

and so on. The mapping must also weaken the specification by turning some axioms into

assumptions: if a user, for example, declares a function to be “total” by using Z’s total

function arrow, f

B, this assertion cannot be computed if f’s definition is non-finite,

and need therefore be converted into an assumption (which can be done explicitely by the

user or implicitely by ZaP, depending on switches). All this has been implemented in

the ZaP tool and will be ported to the new design of functional logic computation in the

forthcoming ZaP-

Other Approaches Executing Z

Animation of the “imperative”part of Z is provided by the ZANS tool [Jia, 1996], impera-

tive meaning Z’s specification style for sequential systems using state transition schemas.

This approach is highly restricted, as it cannot even handle the functional sublanguage

of Z. An elaborated functional approach for executing Z has been described by Valentine

[1992, 1995], though no implementationexists today. A translation to Haskell is described

in [Goodman, 1995], where monads are used for dealing with the sequential specifica-

tion style, but no logic resolution is employed. Mappings to Prolog are described, e.g.,

in [West and Eaglestone, 1992]. Mappings of Z to Mercury are described in [Winikoff

et al., 1998]: however, in this approach the effective power of execution is restricted com-

pared with ours, because the data flow has to be determined statically. The approach

presented in this thesis goes beyond all the others, since it allows the combination of the

functional and logic aspects of Z in a higher-order setting, treating sequential behaviors

6.3 Discussion 155

as first-order citizins, comparable as to the integration of imperative computation into

functional languages using monads.

Chapter 7

Conclusion

Dreiundsechzighundert Hektometer

uberm Spiegel seiner Wohnung steht er;

sieht die Gasschiff-Flotte der Korona

und erblickt das Mondschaf in Persona.

(Christian Morgenstern:

Palmstr¨om: Angewandte Wissenschaft)

Motivated by the desire of executing a subset of the set-based specification language Z,

we have introduced the set-based calculus

Z, defined a computation model for

Zas

well as an abstract machine implementing this model, a compilation of

Zto the ma-

chine’s instruction set and an efficient implementation strategy for the machine in C++.

The application of this work in the ZeTa system has also been outlined. A discussion

of the results obtained and of related work has been given in the relevant chapters. In

this concluding chapter, we summarize the contributions of the work and discuss some

possible directions for the future.

7.1 Contributions

ZCalculus

The

Zcalculusprovidesa newwayof viewingthe

-calculus, predicatecalculus, schema/module

calculus, set algebra, etc. within a unified framework. The calculus is considered an “in-

vention of the obvious”: to the author’s knowledge, there is no comparable language

though none of the ingredients of the calculus are actually new. The most unusual individ-

ual construct of

Zis its operator of set translation, providing a means for the constructive

description of morphisms in the category of

Zsets and, in combination with the other

operators, supplying

Z’s expressive power.

Z’s semantic model of partial set algebras, a generalization of three-valued logics,

is – at least as a model for Z related languages – a new approach. The author believes

that it is actually a more suitable model for Z itself than the one currently proposed by

Z’s Draft ISO Standard. The standard leaves the treatment of undefinedness and partiality

quite vague, in an unfortunate formal method’s tradition. In this thesis, we have at least

shown that it is possible to define a model of undefinedness for a set-based formalism

that is based on simple notions of lattice theory and does not require deep insight into

mathematical set theory (which cannot be expected of computer scientists.)

Computation Model

The reference computation model of

Z, defined in the style of natural semantics, pro-

vides a new view of computation in the realm of languages that can be encoded in

158 Conclusion

leading to a realistic efficient implementation. New is the representation of set values

by unions of singleton-set values and of intensions, the latter describing “computation

receipts” for sets. Intensions are a generalization of “closures”, as used in computation

models for functional languages.

Given the minimality of the

Zcalculus, its computation model could have been kept

minimal as well, and the different embedded models – functionalreduction and concurrent

constraint resolution – could have been clearly worked out. The interfaces of these two

worlds of computation are clearly visible – via the

-operator that invokes constraint

resolution from value reduction and via the p

econstraint that calls value reduction

for efrom constraint resolution. The presence of nondeterministic computation and the

need to introduce choice points in an implementation is isolated in computing a constraint

+ã

, where we can choose between p

and p

. The higher-order nature of

the model shows up in the instantiation of an intension for the computation of p

iml

npo

qsr

outv

, where we “import” the first-order citizen, the constraint

qsr

o7t

, into a resolution

context.

Abstract Machine

The abstract machine, ZAM, and its implementation emerged as a consequence of

Z’s

computation model. It was possible to keep the machine small, being easy to understand

(relative to the inherent complexity of the computation problem in

Z). The overall ar-

chitecture of the machine is innovative in this respect. Most of the organizational details

of the machine can be regarded as “folklore”: for example, implementation of residua-

tion using wait sets at variables, or employing the “Andorra Principle” for choice point

scheduling.

The implementation of the ZAM in C++ provides an innovative architecture by its

persistent single-assignment registers. On the one hand, persistent registers allow an effi-

cient realization of thread dumping, because only the program counter needs to be trailed,

and on the other hand, they enable callees to export references to register regions out of

their lexical scope. In combination with conservative garbage collection, this leads to an

integrated stack/heap model of memory allocation, which can be utilized by high-level

optimizations that employ a region analysis.

Formalization in Z

We have given a comprehensive formal specification in strictly type-checked Z of

Z’s

syntax and semantics, equational theory, computation model and abstract machine. Thus,

this work represents a case study for the use of Z as a meta language. For this purpose, it

was essential to extend Z. The first category of extensions we made – such as inference-

rule systems and recursive partial functions – could be explained as pure syntactic sugar

and are actually implemented this way by the E

wx

macro preprocessor. The second cate-

gory – such as independence of the textual order of paragraphs – needed real extensions

in comparison to Standard Z, which are implemented by the E

wx

type checker. The ex-

periment shows that – with the extensions used – Z is not only suitable for specifying

sequential systems in the style found in the textbooks, but also for the complex axiomatic

specifications as found in this thesis.

7.2 Future Work 159

7.2 Future Work

Apart from minor enhancements of the basic model, its didactic presentation, a full-

fledged implementation and case studies applying the implementation, the basic work

presented in this thesis provides a starting point for further lines of research. The most

important of these are outlined below.

Adding Solvers

For specialized constraint problems (e.g. subset constraints), tailored solving techniques

and tools are available. The trend in constraint-solving systemsis therefore to use modular

frameworks, where components for specialized solvers can be easily integrated, on the

semantical as well as the tool level. The aim of future work on

Zand its implementation

will be to support such modularity, and integrate further resolution techniques.

On the implementation level, the thread-based architecture of the ZAM and its im-

plementation in C++ is in principel prepared for modularity. An extension would need

to convert intensions and threads into abstract classes, providing abstract methods for

spawning a thread, executing a thread’s next step, trailing, backtracking and choice point

creation. This may allow third-party constraint solvers to be integrated, thereby inheriting

the generic capabilities of the ZAM for representing constraints as first-class values, for

choice point management and for variable allocation.

Static Analyses

The

Zcalculus provides an optimal setting for investigating and realizing static analyses

using the set-based approach. In his diploma thesis, Wieland [1999] has developed initial

steps toward a subset constraint solver on

Zterms, which is applied to extended type

checking of Z. Another application of this framework might be determinism and mode

analysis for the optimized compilation of

Zitself. (As Brandes [1997] has shown in his

diploma thesis, set-based analysis in principle can be used for this purpose.)

Both the extended type checking and the determinism problem are formulated as ab-

stractions from

Zterms into (specialized) set expressions, which can be directly ex-

pressed in

Zitself. For example, for the type analysis, a constant expression

is ab-

stracted into the expression

z1y

vâ

, denoting a singleton type. Future research might, then,

focus on a reflective approach: an analysis of a

Zexpression (and hence the high-level

form it encodes) is described by a translation into a more abstract

Zexpression, which

is then interpreted by compilation and execution. To obtain the full power of set-based

analysis, a solver component for subset constraints must, of course, added to the ZAM.

Computing Interval Logics

In [B¨ussow and Grieskamp, 1997; B¨ussow et al., 1997b] we described a modest exten-

sion of Z that adds temporal interval logic to the language. The addition of traces to a

set-based model was shown to work well: a temporal formula merely represents a set of

traces. A promising line of research would be to add a solver to

Z, implementing a few

primitive operators of discrete temporal interval logics. One interesting application area

is test data evaluation for reactive and embedded systems; a further is the execution of

160 Conclusion

watchdogs monitoring the allowed modal actions of safety- or security-critical systems.

Since interval logics can be used as a natural model for the constructive description of re-

active behavior (B¨ussow and Grieskamp [1997], for example, provide a shallow encoding

of Statecharts in interval logics), the animation and prototyping of reactive systems might

also be possible using this approach.

Bibliography

H. Ait-Kaci. Warren’s Abstract Machine – A Tutorial Reconstruction. MIT Press, 1991.

reprinted version from the author’s WWW page.

H. Ait-Kaci, P. Lincoln, and R. Nasr. Le Fun: Logic, equations, and functions. In Proc.

4th IEEE Internat. Symposium on Logic Programming, San Francisco, 1987.

S. Antony. Definitional trees. In Proc. 3rd International Conference on Algebraic and

Logic Programming, volume 632 of Lecture Notes in Computer Science. Springer-

Verlag, 1992.

S. Antony, R. Echahed, and M. Hanus. A needed narrowing strategy. In Proc. 21st ACM

Symposium on Principles of Programming Languages, 1994.

A. W. Appel. Garbage collection can be faster then stack allocation. Information Pro-

cessing Letters, 25:275–279, 1987.

A. W. Appel and T. Jim. Continuation-passing, closure-passing style. In Proceedings of

POPL ’89. ACM, 1989.

P. Arenas-Sanchez and A. Dovier. A minimality study for set unification. Journal of

Functional and Logic Programming, 1997(7), 1997.

H. Barendregt, editor. The Lambda Calculus: Its Syntax and Semantics. North-Holland,

Amsterdam, 1984. (revised edition).

M. Barr and C. Wells. Category Theory for Computing Science. Prentice Hall, 1990.

H. Boehm. Space efficient conservative garbage collection. In Proceedings of the ACM

SIGPLAN ’93 Conference on Programming Language Design and Implementation,

volume 28 of SIGPLAN Notices, pages 197–206, June 1993.

P. G. Bosco, E. C. Cecchi, and C. Moiso. An extension of the WAM for K-LEAF: a wam-

based compilation of conditional narrowing. In Proceedings of the Sixth International

Conference on Logic Programming (Lisboa). MIT Press, 1989.

O. Brandes. Automatische Verfahren zur Ausf¨uhrbarkeitsanalyse und ¨

Ubersetzung der

mengenbasierten Spezifikationssprache Z. Diplomarbeit, TU Berlin, Mai 1997.

R. B¨ussow, H. D¨orr, R. Geisler, W. Grieskamp, and M. Klar.

SZ – ein Ansatz zur system-

atischen Verbindung von Z und Statecharts. Technical Report TR 96-32, Technische

Universit¨at Berlin, Jan. 1996.

162 BIBLIOGRAPHY

R. B¨ussow, R. Geisler, W. Grieskamp, and M. Klar. The

gwx

Notation Version 1.0.

Technical Report 97–26, Technische Universit¨at Berlin, Fachbereich Informatik, Dec.

1997a.

R. B¨ussow and W. Grieskamp. Combinig Z and Temporal Interval Logics for the For-

malization of Properties and Behaviors of Embedded Systems. In R. K. Shyamasundar

and K. Ueda, editors, Advances in Computing Science – Asian ’97, LNCS 1345, pages

46–56. Springer-Verlag, 1997.

R. B¨ussow and W. Grieskamp. A Modular Framework for the Integrationof Heterogenous

Notations and Tools. In K. Araki, A. Galloway, and K. Taguchi, editors, Proc. of the

1st Intl.Conference on IntegratedFormal Methods – IFM’99.Springer-Verlag, London,

June 1999.

R. B¨ussow, W. Grieskamp, W. Heicking, and S. Herrmann. An Open Environment for the

Integration of Heterogeneous Modelling Techniques and Tools. In Proc. of Intl. Work-

shop on Current Trends in Applied Formal Methods, volume 1641 of Lecture Notes in

Computer Science. Springer-Verlag, Berlin, 1998.

R. B¨ussow, W. Grieskamp, F. Lattemann, and E. Lehmann. The Defini-

tion of Dynamic Z. ESPRESS interner Projektbericht. URL: http://uebb.cs.tu-

berlin.de/˜wg/papers/dynz.ps, May 1997b.

M. M. Chakravarty and H. C. R. Lock. Towards the uniform implementation ofdeclarative

languages. Computer Languages, 23(2-4), 1997.

M. M. T. Chakravarty, Y. Guo, M. K¨ohler, and H. C. R. Lock. GOFFIN: Higher-order

functions meet concurrent constraints. Science of Computer Programming, 30(1-2),

1997.

M. M. T. Chakravarty and H. C. R. Lock. The implementation of lazy narrowing. In Pro-

gramming Language Implementation and Logic Programming, volume 528 of Lecture

Notes in Computer Science. Springer Verlag, 1991.

R. Cleaveland, J. Gada, P. Lewis, S. Smolka, O. Sokolsky, and S. Zhang. The concurrency

factory — practical tools for specification, simulation, verification, and implementation

of concurrent systems. In Proceedings of the DIMACS Workshop on Specification of

Parallel Algorithms, Princeton, NJ, May 1994.

C. Consel and O. Danvy. Tutorial notes on partial evaluation. In ACM Symposium on

Principles of Programming Language, 1993.

L. Damas and R. Milner. Principal type-schemes for functional programs. In Proceedings

of POPL ’82, pages 207–212. ACM, 1982.

B. A. Davey and H. A. Priestly. Introduction to Lattices and Order. Cambridge Mathe-

matical Text Books. Cambridge University Press, 1990.

BIBLIOGRAPHY 163

K. Didrich, A. Fett, C. Gerke, W. Grieskamp, and P. Pepper. OPAL: Design and Im-

plementation of an Algebraic Programming Language. In J. Gutknecht, editor, Pro-

gramming Languages and System Architectures, International Conference, Zurich,

Switzerland, March 1994, volume 782 of Lecture Notes in Computer Science, pages

228–244. Springer, 1994. URL http://uebb.cs.tu-berlin.de/papers/

published/DesignImplOpal.ps.gz.

K. Didrich, W. Grieskamp, C. Maeder, and P. Pepper. Programming in the large: the

algebraic-functional language opal 2

{

. In Proceedings of the 9th International Work-

shop on Implementation of Functional Languages, St Andrews, Scotland, September

1997 (IFL’97), Selected Papers, volume 1467 of Lecture Notes in Computer Science,

pages 323 – 338. Springer, 1998.

A. Dovier, E. G. Omodeo, E. Pontelli, and G. Rossi.

log

: A language for programming

in logic with finite sets. Journal of Logic Programming, 28(1), 1996.

H. Ehrig and B. Mahr. Fundamentals of Algebraic Specifications 1: Equations and Ini-

tial Semantics, volume 6 of EATCS Monographs on Theoretical Computer Science.

Springer Verlag, Berlin, Heidelberg, New York, Berlin, 1985.

A. Fett and W. Grieskamp. Berlin proposals for the Z standard. http://uebb.cs.

tu-berlin.de/˜wg/zstan/, September 1998.

Formal Systems (Europe) Ltd. Failures Divergence Refinement, FDR2 User Manual.

Formal Systems (Europe) Ltd., 1997.

T. Franzen, S. Haridi, and S. Janson. An overview of the andorra kernel language. In Pro-

ceedings of the 2nd Workshop on Extensions to Logic Programming. Springer Verlag,

1992.

T. Frauenstein, W. Grieskamp, P. Pepper, and M. S¨udholt. Communicating functional

agents and their application to graphical user interfaces. In Proceedings of the 2nd

International Conference on Perspectives of System Informatics, Novosibirsk, LNCS.

Springer Verlag, Jun 1996a. URL http://uebb.cs.tu-berlin.de/papers/

published/PSI96-EA.ps.gz.

T. Frauenstein, W. Grieskamp, and M. S¨udholt. Temporal Semantics of a Concur-

rency Monad with Choice and Services. In Proceedings of the 2nd FUJI Interna-

tional Workshop on Functional and Logic Programming, Nov 1996b. URL http:

//uebb.cs.tu-berlin.de/papers/published/FUJI96.ps.gz.

H. S. Goodman. The Z-into-Haskell tool-kit: An illustrativecase study. In J. P. Bowen and

M. G. Hinchey, editors, ZUM’95: The Z Formal Specification Notation, volume 967 of

Lecture Notes in Computer Science, pages 374–388. Springer-Verlag, 1995. ISBN

3-540-60271-2. URL http://www.comlab.ox.ac.uk/archive/z/zum95.

html.

M. Gordon and T. Melham, editors. Introduction to HOL: A theorem proving environment

for higher-order logics. Cambridge University Press, 1993.

164 BIBLIOGRAPHY

W. Grieskamp, M. Heisel, and H. D¨orr. Specifying Embedded Systems with Statecharts

and Z: An Agenda for Cyclic Software Components. In E. Astesiano, editor, Proc. of

the 1st Intl. Conf. on Fundemantal Approaches to Software Engineering – FASE’98,

volume 1382 of Lecture Notes in Computer Science, pages 88–106. Springer-Verlang,

1998.

M. Hanus. Compiling logic programs with equality. In Proceedings of the 2nd Interna-

tional Workshop on Programming Language Implementation and Logic Programming,

volume 456 of Lecture Notes in Computer Sciene. Springer-Verlag, 1990.

M. Hanus. The integration of functions into logic programming: From theory to practice.

Journal of Logic Programming, 19(20), 1994.

M. Hanus. A unified computation model for declarative programming. In Proc. of the

1997 Joint Conference on Declarative Programming, Grado (Italy), 1997.

M. Hanus. Curry – an integrated functional logic language. Technical report, Internet,

1999. Language report version 0.5.

M. Hanus and C. Prehofer. Higher-order narrowing with definitional trees. Journal of

Functional Programming, 9(1), 1999.

M. Hanus and R. Sadre. An abstract machine for Curry and its concurrent implementation

in Java. Journal of Functional and Logic Programming, 1999(6), 1999.

D. Harel, H. Lachover, A. Naamad, A. Pnueli, M. Politi, R. Sherman, A. Shtull-Trauring,

and M. Trakhtenbrot. Statemate: A working environment for the development of com-

plex reactive systems. IEEE Transactions on Software Engineering, 16 No. 4, Apr.

1990.

P. H. Hartel, M. Feeley, M. Alt, L. Augustsson, P. Baumann, M. Beemster, E. Chail-

loux, C. H. Flood, W. Grieskamp, J. H. G. van Groningen, K. Hammond, B. Hausman,

M. Y. Ivory, R. E. Jones, J. Kamperman, P. Lee, X. Leroy, R. D. Lins, S. Loosemore,

N. R¨ojemo, M. Serrano, J.-P. Talpin, J. Thackray, S. Thomas, P. Walters, P. Weis, and

P. Wentworth. Benchmarking implementations of functional languages with “pseudo-

knot”, a Float-Intensive benchmark. J. of Functional Programming, 6(4), 1996.

P. Hudak, S. P. Jones, P. Wadler, et al. Report on the programming language Haskell.

ACM SIGPLAN Notices, 27(5), March 1992.

D. Jana and B. Jayaraman. Set constructors, finite sets, and logical semantics. Journal of

Functional Programming, 38(1), 1999.

S. Janson. AKL – A Multiparadigm Programming Language. PhD thesis, Computer

Science Departement, Uppsala University, Sweden, 1994.

S. Janson and S. Haridi. Programming paradigms of the andorra kernel language. In

Saraswat and Ueda, editors, Logic Programming: Proceedings of the 1991 Interna-

tional Symposium. The MIT Press, 1991.

BIBLIOGRAPHY 165

B. Jayaraman and K. Moon. Subset logic programs and their implementation. Journal of

Logic Programming, 19(20), 1999.

X. Jia. An approach to animating Z specifications. Internet: http://saturn.cs.

depaul.edu/˜fm/zans.html,1996.

S. L. P. Jones. Implementing lazy functional languages on stock hardware: the Spineless

Tagless G-machine. Journal of Functional Programming, 2(2), 1992.

S. L. P. Jones and J. Salkild. The spineless tageless G-machine. In Workshop on Imple-

mentation of Lazy Functional Languages, Aspensas, Schweden, 1988.

G. Kahn. Natural semantics. In Symposium on Theoretical Computer Science

(STACS’97), volume 247 of Lecture Notes in Computer Science, 1987.

B. Krieg-Br¨uckner, J. Peleska, E. Olderog, and A. Baer. The UniForM workbench, a

universal development environment for formal methods. In J. Wing, J. Woodcock,

and J. Davies, editors, FM’99 – Formal Methods, volume 1708 of Lecture Notes in

Computer Science. Springer, 1999.

P. Kruchten, E. Schonberg, and J. Schwartz. Software prototyping using the SETL pro-

gramming language. IEEE Software, 1(4), 1984.

H. Kuchen, R. Loogen, J. J. Moreno-Navarro, and M. Redriguez-Artalejo. Graph-based

implementation of a functional logic language. In Proceedings of European Sympo-

sium on Programming (ESOP ’90), volume 432 of Lecture Notes in Computer Science.

Springer Verlag, 1990.

P. J. Landin. The mechanical evaluation of expressions. Computer Journal, 6, 1964.

P. Larsen and W. Pawlowski. The formal semantics of ISO VDM-SL. Computer Stan-

dards and Interfaces, 17(5-6), 1995.

H. C. R. Lock. The Implementation of Functional Logic Languages. PhD thesis, Tech-

nische Universit¨at Berlin, 1992.

R. Loogen and S. Winkler. Dynamic detection of determinism in functional logic lan-

guages. In Proceedings of the 3rd International Symposium on ProgrammingLanguage

Implementation and Logic Programming, volume 528 of Lecture Notes in Computer

Science. Springer Verlag, 1991.

K. L. McMillan. Symbolic Model Checking. Kluwer Academic Publishers, 1993.

M. Mehl. The Oz Virtual Machine - Records, Transients, and Deep Guards. PhD thesis,

Technische Fakult¨at der Universit¨at des Saarlandes, 1999. submitted.

M. Mehl, R. Scheidhauer, and C. Schulte. An Abstract Machine for Oz. Research Re-

port RR-95-08, Deutsches Forschungszentrum f¨ur K¨unstliche Intelligenz, Stuhlsatzen-

hausweg 3, D66123 Saarbr¨ucken, Germany, June 1995. Also in: Proceedings of

PLILP’95, Springer-Verlag, LNCS, Utrecht, The Netherlands.

166 BIBLIOGRAPHY

R. Milner, M. Tofke, and R. Harper. The Definition of Standard ML. Mass. Institute of

Technology Press, Cambridge, Mass., 1990.

K. Moon. Implementation of Subset Logic Languages. PhD thesis, State University of

New York at New Buffalo, 1997.

J. J. Moreno-Navarro, H. Kuchen, R. Loogen, , and M. Rodriguez-Artalejo. Lazy nar-

rowing in a graph-machine. In Proceedings of the Conference on Algebraic and Logic

Programming, volume 463 of Lecture Notes in Computer Science. Springer Verlag,

1990.

J. J. Moreno-Navarro and M. Rodriguez-Artalejo. Logic programming with functions and

predicates: The language BABEL. Journal of Logic Programming, 12, 1992.

G. Nadathur and D. Miller. An overview of

Prolog. In Proc. 5th Conference on Logic

Programming & 5th Symposium on Logic Programming (Seattle). MIT Press, 1988.

O. Owe. Partial logic reconsidered: A conservative approach. Formal Aspects of Com-

puting, 5:208–223, 1997.

L. C. Paulson. Isabelle: A Generic Theorem Prover. Springer LNCS 828, 1994.

N. Plat and P. Larsen. An overview of the ISO/VDM-SL standard. SIGPLAN Notices, 27

(8), 1992.

C. Prehofer. Higher-order narrowing. In Proc. 9th Annual IEEE Symposium on Logic in

Computer Science. IEEE Press, 1994.

J. Reppy. CML: A higher-order concurrent language. In SIGPLAN Conf. on Programming

Language Design and Implementation, pages 293–305. ACM, Jun. 1991.

G. Rossi. The SETLOG programming language. Internet, 1997.

T. Santen. A Mechanized Logical Model of Z and Object-Oriented Specification. PhD

thesis, Fachbereich Informatik, Technische Universit¨at Berlin, Ger many, 1999.

B. Sch¨atz and F. Huber. Integrating formal description techniques. In J. Wing, J. Wood-

cock, and J. Davies, editors, FM’99 – Formal Methods, volume 1708 of Lecture Notes

in Computer Science. Springer, 1999.

R. Scheidhauer. Design, Implementierung und Evaluierung einer virtuellen Maschine

fr Oz. PhD thesis, Universit¨at des Saarlandes, Fachbereich Informatik, Saarbr¨ucken,

Germany, Dec. 1998.

W. Schulte. Effiziente und korrekte ¨

Ubersetzung strikter applikativer Programmier-

sprachen. PhD thesis, Technische Universit¨at Berlin, 1992.

W. Schulte and W. Grieskamp. Generating Efficient Portable Code for a Strict Applicative

Language. In Phoenix Seminar and Workshop on Declarative Programming. Springer

Verlag, Berlin, Heidelberg, New York, 1992.

BIBLIOGRAPHY 167

J. Siekmann. Unification theory. Journal of Symbolic Computation, 7(1), 1989.

G. Smolka. A calculus for higher-order concurrent constraint programming with deep

guards. Research Report RR-94-03, Deutsches Forschungszentrum f¨ur K¨unstliche In-

telligenz, Stuhlsatzenhausweg 3, D-66123 Saarbr¨ucken, Germany, Feb. 1994a.

G. Smolka. A foundation for higher-order concurrent constraint programming. In J.-P.

Jouannaud, editor, 1st International Conference on Constraints in Computational Log-

ics, Lecture Notes in Computer Science, vol. 845, pages 50–72, M¨unchen, Germany,

7–9 Sept. 1994b. Springer-Verlag.

G. Smolka. Concurrent constraint programming based on functional programming. In

C. Hankin, editor, Programming Languages and Systems, Lecture Notes in Computer

Science, vol. 1381, pages 1–11, Lisbon, Portugal, 1998. Springer-Verlag.

W. Snyder and J. Gallier. Higher-order unification revisited: Complete sets of transfor-

mations. Journal of Symbolic Computation, 8, 1989.

J. M. Spivey. The Z Notation: A Reference Manual. Prentice Hall International Series

in Computer Science, 2nd edition, 1992. ISBN 013-978529-9. URL http://www.

blackwell.co.uk/cgi-bin/bb_item?0139785299.

L. Spratt. Seeing the Logic of Programming with Sets. PhD thesis, University of Kansas,

1996.

F. Stolzenburg. An algorithm for general set unification and its complexity. Journal of

Automated Reasoning, 22(1), 1999.

I. Toyn. Innovations in standard Z notation. In J. P. Bowen, A. Fett, and M. G. Hinchey,

editors, ZUM’98: The Z Formal Specification Notation, volume 1493 of Lecture Notes

in Computer Science, pages 193–213. Springer-Verlag, 1998. URL http://www.

fmse.cs.reading.ac.uk/zum98/.

S. Valentine. The programming language Z

|H|

.Information and Software Technology, 37

(5–6):293–301, May–June 1995.

S. H. Valentine. Z

|H|

, an executable subset of Z. Workshops in Computing, pages 157–

187. Springer-Verlag, 1992. ISBN 3-540-19780-X. URL http://www.dcs.gla.

ac.uk/springer-verlag/21.html.

R. L. Vaught. Set Theory – An Introduction. Birkh¨auser, 1995.

P. Wadler and S. Blott. How to make ad–hoc polymorphism less ad hoc. In 16th ACM

Symposium on Principles of Programming Languages, pages 60–76, 1989.

D. H. D. Warren. An abstract Prolog instruction set. Technical Report 309, SRI Interna-

tional, 1983.

D. H. D. Warren. The extended andorra model with implicit control. In ICLP’90 Parallel

Logic Programming Workshop, 1990.

168 BIBLIOGRAPHY

M. M. West and B. M. Eaglestone. Software development: Two approaches to animation

of Z specifications using Prolog. IEE/BCS Software Engineering Journal, 7(4):264–

276, July 1992.

J. Wieland. A resolution algorithm for general subtyping constraints. Master’s thesis,

Technische Universit¨at Berlin, 1999.

M. Winikoff, P. Dart, and E. Kazmierczak. Rapid prototyping using formal specifications.

In Proceedings of the Australasian Computer Science Conference, 1998.

M. Wirsing. Handbook of Theoretical Computer Science, chapter Algebraic Specification

(13), pages 675–788. North-Holland, 1990. edited by J. van Leeuven.

J. C. P. Woodcock and J. Davies. Using Z: Specification, Proof and Refinement. Prentice

Hall International Series in Computer Science, 1996. ISBN 0-13-948472-8. URL

http://www.comlab.ox.ac.uk/usingz.html.

Z-ISO. Drafts for the Z ISO standard. Ian Toyn (editor). Available via the URL

http://www.cs.york.ac.uk/˜ian/zstan, 1999.

Concept Index

-calculus, 39

Zabbreviation forms, 39, 56

Zforms, 38

AKL, 140

algebra, 41

Andorra Principle, 140

backtracking, 87, 93, 99, 140

benchmark, 136

binding, 48

boolean

algebra, 58

laws, 58

values, 47

breadth-first search, 141

C++, 132

characteristic function, 43

choice point, 99, 103, 111

closure, 102

closure passing style, 138

committed choice, 70

common subexpression elimination, 95

compilation, 127

complete lattice, 35

configuration, 104

conjunctive normal form, 58

constraint, 41, 89, 101

constructor

application form, 52

decomposition laws, 60

continuous function, 36

cpo, 35

Curry, 41, 140

cycles, 102

deep guards, 97

definitional tree, 96

De Morgan’s laws, 58, 75

depth-first search, 141

deterministic computation, 87

disjunctive normal form, 58, 75, 95

dispatching, 112

encapsulated search, 97

excluded middle, 44, 58

excluded middle laws, 59

execution step, 112

expression

assignment, 57

context, 57

substitution, 57

type, 50

expression sharing, 95

extensional

completeness, 84

confluence, 74, 81, 84

equality, 73, 109

unequality, 73

fixed-point

construction, 68

form, 55

theorem, 36

unrolling law, 66

flexible, 127

flexible variable, 76

freely generated, 46

free type, 46

free variables, 57

functional logic languages, 41

garbage collection, 136

goal, 102

Goffin, 41

graph reduction, 95

higher-order, 42

unification, 97

170 CONCEPT INDEX

HOL, 69

Horn clause, 41

indeterministic resolution, 87

infimum, 34

instantiation, 101, 106

instruction, 104, 113, 134

intension, 88, 101

interval logic, 70, 164

JUMP machine, 139

lattice, 35, 48

lazy narrowing, 85

logic languages, 41

matching context, 74

meaning function, 51

module, 41

Mozart System, 140

narrowing, 83, 95, 138

natural numbers, 47

needed narrowing, 96

normal form, 75

normalform, 127, 128

normalization, 75, 95

normalization function, 82

Opal, 138

outermost-in narrowing, 85

Oz, 70, 140

parallel and, 87

parallel or, 87

partial

logic, 44

order, 34, 48

pattern

context, 57

type, 50

persistent, 132

power set, 45

predicate calculus, 40

Prolog, 41

redex selection strategy, 96

reduction step, 89

reflective analysis, 163

residuation, 41, 87, 99

resolution step, 89

schema

calculus, 40

complement lifting law, 65

distributivity laws, 64

elimination law, 64

form, 55

membership lifting law, 65, 76

property substitution law, 66

translation elimination law, 65

SECD machine, 138

selection, 112

set intersection, 100, 109

operator forms, 53

operators, 43

ordering, 48

representation, 43

selection elimination law, 59

translation form, 53

unification, 97

union, 100, 109

value, 100

set-based analysis, 163

SETL, 69

SETLOG, 69

single-assignment, 132

singleton

set display form, 52

set selection form, 52

soundness, 80, 84

spineless tagless G-Machine, 138

strict

narrowing, 86

reduction step, 89

semantics, 87

strictness, 97

subset logic programming, 69, 97

supremum, 34

symbolic model checking, 95

temporal logic, 164

termination, 80

CONCEPT INDEX 171

thread, 102

three-valued logic, 44

top-level program, 127

translation

composition law, 60

distributivity laws, 61

elimination law, 63

identity law, 60

lifting law, 63

product law, 62

splitting law, 63

tuple, 48

typeclasses, 34

relation, 50

substitution, 45

terms, 44

unification, 45

undefinedness, 38, 39, 68

unification, 89, 109

universe, 45

value, 100

constructor, 46

form, 88

variable form, 55

variable table, 101

WAM, 138

Z extensions, 28

ad-hoc overloading, 31

Delta and Xi, 31

free type recursion, 31

if-exists notation, 28

IF in predicates, 30

inference rules, 28

locals in schema boxes, 29

paragraph order, 31

recursive partial functions, 29

redeclarations, 31

selective delta, 30

where notation, 28

Z language, 25

constants, 26

function abstraction, 26

genericity, 26

polymorphism, 26

quantifiers, 26

schemas, 25

schema text, 25

set comprehension, 26

types, 25

undefinedness, 26

universe, 25

Z toolkit, 32

Booleans, 33

order and lattices, 34

sequences, 32

set reduction, 32

Symbol Index

LOADENV,105,114, 115,129, 130

MKEMPTY,105,107,118,129,134, 135

FIXINTEN,134, 135

MKINTEN,105,107,119,130,132,134, 135

ISECT,105,107,119, 120,128–130,134, 135

LOAD,105,113–116,120,129, 130

MEMBER,105, 106,111,116, 117,120,130,

131,134,136

MKVAR,105, 106,118, 119,130

MOVE,134,136

MU,105,107,112,121–123,129, 130,134,

136

} ~

,50–55

EXPUT,50–56

EXPA,75,128,130

EXPB,75,83,128,130, 131

EXPC,75,83,128–131

EXPD,75,83,128,130, 131

EXPL,75,128, 129

EXPP,75,83,88,130

EXPV,88,90–93

PATEX,50,55,60, 61,79, 80

PATLIN,50,54, 55,65

PATUT,50, 51,54, 55

TSUBS



,44, 45,47,51,54

TYPE



,44, 45,49

 

,48

 

,18,38,50,52,60,62,64,73,75,77,86,

88,128

,56,60,77

,88–93,162

,88, 89,92

] 

,18,38, 39,41, 42,48, 49,55,66,68,

75,83,90,102,129, 130,136, 137

l o v

,88,90, 91,93,97,100, 101,120,129,

162



,54,61, 62



,51,54, 55,61,64,66



,57,60–62,64–66,78,80, 81,83

z o 

,18, 19,37–43,48,55,64–66,68,70,

75–81,83–85,88,90,95, 96,98,102,

129, 130,136, 137

z 

,18,37–39,41,52,56, 57,59, 60,62–66,

71,73,75–78,81,83–86,88,91,93,

95–97,100,129, 130,137

r 

,44–47

2

,39, 40,56,75, 76,80, 81,131



,56,75,91,129,131



,93, 94



,61

r  

,18, 19,38–40,42,53, 54,56, 57,

60–66,75–81,83,85, 86,91,95, 96,

129, 130,137





,54,61, 62

 

,47, 48,64

?

,47, 48,64



,45,52–55

 

,48,63,80, 81





,88, 89,91





,88, 89,93



,88, 89,91



,88–90,93



,51–55,66

LOADPAR,105, 106,120,130



r 

,30,111,113–122,124–126

MKSINGLE,105,107,118,129, 130,134,

135

STORE,105, 106,115, 116,130, 131

SUCCESS,105,107,120, 121,130, 131,134,

136

MKTERM,105, 106,118, 119,128,134, 135

TEST,105,107,112,121, 122,131,134,136

TESTNATIVE,105,107,112,121, 122,130,

131,134,136

UNIFY,105, 106,115–118,130, 131,134,

136

SYMBOL INDEX 173

UNION,105,107,119, 120,128, 129,134,

135

WAIT,105,113,115,127,129, 130,134–136



,47, 48,55



,27,109



,56



,18, 19,39,44,47, 48,56,60,62–64,77



,127–131



,33,43,45–49,52,55,64,109

¡¢ £c¤

,33,43,48, 49,53, 54,59,61, 62,64,109

,47, 48

¦¨§_©

,33,43,48, 49,52, 53,55,59,61, 62,64,

109

¦¨§

,54

¦ª¦

,47, 48,55

©«2¬

¤d

,45–48,52,55–57



,18,58–66,68,73,84

,34, 35,48, 49,56

,34, 35

}

,50–55



,39,42,48,55, 56,62, 63,65,75–77,93,

95,129–131,136

,19,39–42,56,65,70,74–77,80, 81,83–

85,87,91–94,99, 100,103, 104,112,

120,127,129–131,136,162

,18,38, 39,53,56,58–60,64–66,75, 76,78,

80, 81,85, 86,91–94,96,129,131

,39, 40,56,58, 59,64, 65,75, 76,78–81,

92–94,131

,47, 48

,33

,32, 33,110

,57, 58,80,83,85, 86

,57

·¸º¹p»

,74,92

,50–55

,32, 33,129

,18, 19,38–44,48,53,56,58–64,66,68,

70,75–81,84–86,95, 96,100,129,

136, 137

~À¿

,32, 33,106, 107,111,114–122,124–126

,18,37, 38,41–43,53,56,58, 59,61,64, 65,

70, 71,78–82,91,93,95, 96,98–100,

103,136, 137,162

ÂÃ

,45,52–55



,57,60–62,64–66,76,78,80, 81,83,91,93

,51,54, 55,61,64,66

,88,91,100,120

,88,93

,85, 86

,74,92

,86,90

È7ÉªÈ

,27,29

BAS,128,130, 131

CON,128–131

DIS,128–131

LIT,128–130

PRO,128,130, 131



§¡Ë ¦¡Ì

,44,46

2£N£

,88, 89,92

,46, 47



¦¡ÌÎ

,46–48

¢Ï

¦cÐ

,35, 36,49

¢Ï

,35

ÑÓÒ

©Ô

,57,66,83

«§

,82–84,90

£c¤

,46, 47,52

£c¤

,51–55,66

£c¤

,45,47–49,51–55,64,66

©Ô

,57,60–63,65, 66,76,80, 81,83,88, 89,

91,93

©Ô

,44, 45,47,51–55,66

©Ô

,51

¦ÕËÖ«

,46,52

¦¡ÌÎ

,46

¦¡ÌÎ

,50–55,58,66



§ÙØ1§Ù¬

,57,80, 81,127,130



,88, 89,91,93



,38,55,57,61–63,65, 66,76,78–81,83,

90, 91,93,130, 131



,44–46,51

ËÖÎ

,32, 33,110





,73,78,89,91



,73, 74,78,81,84,89,91, 92

,18,38,42,52, 53,56, 57,59, 60,66,75,77,

85, 86,92,94, 95,97,103,129, 130,

137,162

gÛ

,92

,45

,44, 45,47–49,52–55

,88, 89,91–93

174 SYMBOL INDEX



,18, 19,38–40,44,53,56,58, 59,61,64, 65,

70,75,78, 79,84–86,91,129, 130

,35

àÃ

,45, 46,49,52–55

¿á~

,32, 33,114–122,124–126,128–131

,34–36,55

,34–36

,89,91

,74,78,81–83,85, 86

,89,91

,74,78,85, 86

,56,60,74–77,80,91,128, 129

,56,73–75,77,88,91,97,128, 129

,88, 89,92, 93

,88, 89,92



,35, 36

,32

,35

,33, 34,54

,33, 34

Binding,103,108,110

Bound,103,105,108,110,114, 115

Choice,103,111,126

Code,101,103,105,111,119,128, 129

CONS,38,46–48,50,52,60,73,77,100,

105,108–110,119,128

CTR,88,90–93

ENV,127–131

error,103,106, 107,115, 116,119–121,124–

126

EXP,18,38,50–66,73–75,77–83,85,88,

90–93,128–131

EXPASS,57,60,62,64–66,78,80, 81,83

EXPCTX,57, 58,83

Fail,110,116

failure,103,106, 107,116, 117,122–126

Free,103,115, 116,118,124,126

Goal,103,105,113,125, 126

GoalInx,101–103,105,115

INCTX,74,78,81,83,89

Instruction,101,105,112, 113

Intension,100, 101,109,112,117, 118,120–

122,124–126

IVCTX,88, 89,91

Ok,110,116

OUTEREDEX,85

PAT,18,38,50, 51,54, 55,57,60–66,78–81,

83,90, 91,93,130, 131

PATCTX,57

Priority,103

running,103,114,118,124,126

SEMCTX,51–55,66

Set,100,106, 107,109,112,117–122

SREDEX,90

Status,103

success,103,121,123,125

TCONS,44,46,48

Term,100–102,106,108–110,118, 119

Thread,103,105,113,126

ThreadInx,102, 103,105,111,113,118

TSUBS,44, 45,47,51–55

TVAR,44–47,53

TYPASS,50–55

TYPE,44–47,49,51–55

UNCTX,74,78,89

UNIASS,51,54, 55,64,66

UniRes,110

UNIV,45–47,49,51, 52,54, 55,61, 62

UVCTX,88, 89,91

VALASS,88,90–93

Value,100, 101,103,108–112,114, 115,117,

118,120–122,124, 125

VAR,38,50, 51,55–57,60,63,77,81,88,

90–93,127,130, 131

Var,100–102,105, 106,108–110,114,118–

120

VarInx,100, 101,103,105,108–111,114–

116,119,127