Document [original]

Semantic Mediation

between Loosely Coupled Information Models

in Service-Oriented Architectures

vorgelegt von

Diplom-Informatiker

Nils Barnickel

aus Berlin

Von der Fakultät IV – Elektrotechnik und Informatik –

der Technischen Universität Berlin

zur Erlangung des akademischen Grades

Doktor der Ingenieurwissenschaften

– Dr.-Ing. –

genehmigte Dissertation

Promotionsausschuss:

Vorsitzender: Prof. Dr. Thomas Magedanz

Berichter: Prof. Dr. Dr. h.c. Radu Popescu-Zeletin

Berichter: Prof. Dr. Bernd Mahr

Berichter: Prof. Dr. Mathias Weske

Tag der wissenschaftlichen Aussprache: 11. Mai 2011

Berlin 2011

D 83

Abstract

The last two decades have shown a major shift from stand-alone to networked information

technology (IT) systems. Consequently, the effective and efficient achievement of

interoperability is a key factor to enable seamless business process chains and networks across

intra- and inter-organizational boundaries. Thereby, interoperability can be understood along

three dimensions: technical, semantic and organizational interoperability.

While the concept of service-oriented architectures (SOA) and widely accepted Web service

standards have benefited technical interoperability in recent years substantially, managing and

integrating semantic differences in heterogeneous distributed environments remains critical and

cost intensive. In order to preserve the precise meaning as data is moved from one IT system to

another, explicit formal information models in terms of ontologies have evolved as the concept

of choice from academia to first industry adoption. However, it has been recognized that the

dominant approach of developing a common, globally shared ontology as an information model

standard has turned out to be limited in real world cross-domain environments. Organizational

boundaries with regard to consensus degree and the complexity deriving from inherent domain-

specific differences in requirements force a coexistence of independently managed but however

semantic interoperable information models.

In order to address this challenge, the guiding idea of this work is to transfer the principle of

loose coupling to the semantic level. In particular, the goal of this thesis is to contribute to the

reduction of complexity in semantic system integration by developing an effective and efficient

approach for semantic interoperability in large-scale SOA landscapes based on semantic

mediation between loosely coupled information models. Moreover, this work shows how

emerging semantic technologies can contribute to the instantiation of this concept exploiting

their capabilities to explicitly express semantics. The main contributions of this work are:

 A conceptual framework for semantic interoperability in SOA, which is mapped to an

overview and evaluation of existing academic and industry-driven approaches pointing out

shortcomings and fields for further advancements.

 A concept of semantic mediation between loosely coupled information models in SOA,

which describes an information architecture design pattern that provides an optimized

balance within the identified inherent trade-off between effectiveness and efficiency in

achieving semantic interoperability in SOA. It includes a specification of loosely coupled

information models in terms of key characteristics derived from the principle of loose

coupling such as autonomy, flexible binding and encapsulation.

 An instantiating semantic mediation mechanism by means of description logic rule-based

semantic bridges and self-contained domain ontologies exploiting capabilities such as

polymorphism, facet analysis classification and declarative entity manipulation.

 A semantic mediation methodology and prototypical toolkit, which maps the developed

concept and mechanism to the SOA life-cycle ranging from business process modeling,

over service composition to runtime process execution, in order to provide a proof of

concept.

The developed approach is evaluated based on a case study of an exemplary distributed

organization. It is shown how the approach of semantic mediation between loosely coupled

information models can be applied in practice and which benefits can be generated with regard

to achieving effective and efficient semantic interoperability in large-scale SOA landscapes.

Zusammenfassung

Die Informationstechnologie (IT) der letzten zwei Jahrzehnte war durch eine zunehmende

Entwicklung weg von eigenständigen hin zu vernetzten IT-Systemen geprägt. Vor diesem

Hintergrund ergibt sich die Herausforderung, Interoperabilität möglichst effektiv und effizient

zu erreichen, um nahtlose Geschäftsprozesse innerhalb und über Organisationsgrenzen hinweg

zu ermöglichen. Interoperabilität kann dabei entlang von drei Dimensionen verstanden werden:

technische, semantische und organisatorische Interoperabilität.

Während das Konzept der Service-orientierten Architekturen (SOA) und weit etablierte Web

Service-Standards in den letzten Jahren wesentlich zum Erreichen von technischer

Interoperabilität beigetragen haben, ist die semantische Integration in heterogenen verteilten

Umgebungen weiterhin schwierig und kostenintensiv. Für den bedeutungskonsistenten

Datenaustausch zwischen IT-Systemen haben sich explizite formale Informationsmodelle in

Form von Ontologien als erfolgversprechendes Konzept in akademischen und ersten

industriellen Bereichen herausgestellt. Allerdings hat sich gezeigt, dass der dominierende

Ansatz basierend auf einer umfassenden gemeinsam zu nutzenden Ontologie als standardisiertes

Informationsmodell in organisationsübergreifenden Szenarien nur begrenzt praktikabel ist.

Organisatorische Grenzen mit Hinsicht auf Konsensfähigkeit und die Komplexität, die aus

unterschiedlichen domänenspezifischen Anforderungen hervorgeht, erfordern eine Koexistenz

von unabhängig zu verwaltenden jedoch semantisch interoperablen Informationsmodellen.

Um dieser Herausforderung zu begegnen, ist der Leitgedanke der vorliegenden Arbeit, das

Prinzip der losen Kopplung auf die semantische Ebene zu übertragen. Dabei verfolgt die Arbeit

das Ziel, einen Beitrag zur Verringerung der Komplexität bei der semantischen System-

integration zu leisten. Im Zentrum steht die Entwicklung eines effektiven und effizienten

Ansatzes für die semantische Interoperabilität in groß angelegten SOA-Landschaften mittels

semantischer Mediation zwischen lose gekoppelten Informationsmodellen. Darüber hinaus zeigt

die Arbeit, wie neuartige semantische Technologien verwendet werden können, um das

entworfene Konzept zu instanziieren. Die wichtigsten Beiträge dieser Arbeit sind:

 Ein konzeptioneller Rahmen der semantischen Interoperabilität in SOA, der abgebildet wird

auf einen Überblick existierender akademischer und industrieller Ansätze, mit dem Ziel

Handlungsfelder und Entwicklungsbedarfe aufzuzeigen.

 Ein Konzept der semantischen Mediation zwischen lose gekoppelten Informationsmodellen

in SOA als Entwurfsmuster für Informationsarchitekturen. Es beinhaltet eine Spezifikation

auf Basis von wesentlichen Merkmalen des Prinzips der losen Kopplung wie Autonomie,

flexible Bindung und Kapselung.

 Ein semantischer Mediationsmechanismus basierend auf regelbasierten semantischen

Brücken und unabhängiger Ontologien unter Nutzung von Eigenschaften wie Poly-

morphismus, Facetten-basierte Klassifizierung und deklarativer Entitätenmanipulation.

 Ein Machbarkeitsnachweis auf Basis einer Methodik und prototypischer Werkzeuge zur

semantischen Mediation, welche das entwickelte Konzept auf den SOA-Lebenszyklus

abbilden und instanziieren mit dem Fokus auf der Geschäftsprozessmodellierung, der

Servicekomposition und der laufzeitorientierten Prozessausführung.

Der entwickelte Ansatz wird anhand einer Fallstudie einer beispielhaften verteilten Organisation

evaluiert. Es wird gezeigt, wie der Ansatz in der Praxis angewendet werden kann und welche

Vorteile sich daraus für die effektive und effiziente Erreichung der semantischen

Interoperabilität in groß angelegten SOA-Landschaften ergeben.

iii

Preface

After finishing my studies, I made an internship at the United Nations Headquarters, where I

attended a conference called Web for development. A marketing vice president from a large IT

company gave a presentation on how service-oriented architectures (SOA) can accelerate

development. After the talk a question came from the audience asking to further elaborate on

how SOA can foster development in Africa. This misunderstanding has shown me that semantic

interoperability - or the absence of it – is not only an abstract concept but can be found all

around us even though often not visible and identified as such. Another example was the

organization-wide knowledge management system, which could not be adopted in the

eGovernance department I was working for, because the general terms and categories did not

match the required differentiation and perspective for this practice area.

These practical experiences and the unexploited potentials for organizational synergies through

seamless IT integration have motivated me to undertake my research on semantic mediation,

when I started to work at the eGovernment competence center of the Fraunhofer Institute for

Open Communication Systems (FOKUS).

Doing a PhD is an endeavor with many challenges. Especially in a dynamic environment driven

by client-orientation, it is sometimes hard to find the time besides all the daily project work.

However, it is just this combination at FOKUS covering theoretical research and real-world

client projects, which provides a unique opportunity to understand the multiple dimensions and

challenges of IT integration in cross-organizational contexts, for which I am very grateful.

In particular, I want to thank my two PhD supervisors: Prof. Dr. Radu Popescu-Zeletin for the

discussions, his encouragement and practical advice at key points of my PhD project and Prof.

Dr. Bernd Mahr for his conceptual advice and motivating feedback. I also express my gratitude

to Prof. Dr. Mathias Weske for serving as the external reviewer of my dissertation and for his

helpful comments. Furthermore, I want to thank my department head Gerd Schürmann for

giving me the freedom of a home office Friday in the second phase of the PhD. A big thanks

goes to my colleague and office mate Dr. Matthias Flügge for sharing his experiences and for

the discussions and feedbacks especially during paper publication. I also want to thank Prof. Dr.

Adrian Paschke for the joint work we have done for the iSemantics and European Semantic

Web Conference in 2010. Additionally, I thank my students, especially Ralf Weinand, Elena

Antonenko and Johannes Böttcher for supporting the development of the semantic mediation

toolkit.

To close the cycle in this personal preface I am referring back to my internship with the United

Nations. The second part of it brought me to Dakar in Senegal, West Africa to support the

launch of a Web community platform for eGovernance practitioners in the region. During that

time I met my wonderful wife Anta. The internship was over and I was working on semantic

mediation of IT systems. But the same time I found myself heavily involved in semantic

mediation with her, her family and relatives trying to bridge continents, languages and different

cultures. Finally, I am happy and proud that I can write about this twofold semantic mediation

success story and herewith dedicate this work to my wife Anta and our five month old son

Junus.

Berlin, May 2011

Table of Contents

Chapter 1 Introduction ............................................................................................................... 1

1.1 Background and Motivation ............................................................................................... 1

1.2 Overall Goals and Scope ..................................................................................................... 2

1.3 Methodology ....................................................................................................................... 3

1.3.1 Scientific Hypothesis and its Confirmation ................................................................... 3

1.3.2 Research Questions and Technical Challenges ............................................................. 5

1.4 Outline of the Thesis ........................................................................................................... 8

Chapter 2 Understanding the Challenge of Semantic Interoperability in SOA .................. 11

2.1 Overview ........................................................................................................................... 11

2.2 Interoperability Dimensions .............................................................................................. 11

2.2.1 The Context of Semantic Interoperability ................................................................... 12

2.3 Semantic Interoperability .................................................................................................. 13

2.3.1 Terms as Representation of Meaning .......................................................................... 14

2.3.2 Abstraction Levels for Representation of Meaning ..................................................... 14

2.3.3 Semantic Interoperability Gap..................................................................................... 15

2.4 Service-Oriented Architecture .......................................................................................... 17

2.5 Framework of Semantic Interoperability in SOA ............................................................. 21

2.6 Summary and Reflection ................................................................................................... 23

Chapter 3 State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap ........ 25

3.1 Overview ........................................................................................................................... 25

3.2 Web Services .................................................................................................................... 25

3.2.1 Definition and Concepts .............................................................................................. 26

3.2.2 Technologies and Standards ........................................................................................ 30

3.2.3 Evaluation.................................................................................................................... 32

3.3 Semantic Web ................................................................................................................... 37

3.3.1 Definition and Concepts .............................................................................................. 37

3.3.2 Technologies and Standards ........................................................................................ 39

3.3.3 Evaluation.................................................................................................................... 42

3.4 Semantic Web Services ..................................................................................................... 43

3.4.1 Definition and Concepts .............................................................................................. 44

3.4.2 Technologies and Standards ........................................................................................ 48

3.4.3 Evaluation.................................................................................................................... 53

3.5 Semantic Information Integration in Related Areas.......................................................... 56

3.5.1 Semantic Information Integration in Database Systems ............................................. 57

3.5.2 Semantic Information Integration in RM-ODP ........................................................... 58

3.6 Semantic Information Integration with Ontologies........................................................... 60

3.6.1 Single Ontology Approach .......................................................................................... 61

3.6.2 Multiple Ontology Approach with Ontology Mapping ............................................... 61

3.6.3 Hybrid Ontology Approach ......................................................................................... 66

3.7 Summary and Reflection ................................................................................................... 66

Chapter 4 Semantic Mediation between Loosely Coupled Information Models in SOA ... 69

4.1 Overview ........................................................................................................................... 69

4.2 Conceptual Goals and Requirements ................................................................................ 69

4.3 General Idea ...................................................................................................................... 70

Table of Contents

4.4 Limitations of Standardization for Semantic Interoperability in SOA ............................. 73

4.4.1 Standardization vs. Mediation ..................................................................................... 74

4.4.2 From Technical Standards to Semantic Standards ...................................................... 75

4.4.3 Semantic Standardization and Monolithic Information Models ................................. 77

4.4.4 Consensus Degree and Adequate Scope of Semantic Standards ................................. 78

4.5 Context Dependency of Information Models .................................................................... 80

4.5.1 Heterogeneity of Information Models ......................................................................... 80

4.5.2 Model of Conception and Information Models ........................................................... 80

4.5.3 Constructive Model Relations and Information Models ............................................. 82

4.5.4 Conclusions and Implications for Information Models ............................................... 84

4.6 Loose Coupling on the Semantic Level ............................................................................ 85

4.6.1 The Principle of Loose Coupling in Computer Science .............................................. 86

4.6.2 Transferrable Characteristics of Loose Coupling ........................................................ 88

4.6.3 Loosely Coupled Information Models ........................................................................ 89

4.6.4 Limitations in the Transfer of Loose Coupling and Open Issues ................................ 91

4.7 Trade-off between Effectiveness and Efficiency .............................................................. 92

4.7.1 Point-to-Point Mediation ............................................................................................. 92

4.7.2 Pivot Ontology based Standardization ........................................................................ 93

4.7.3 Semantic Mediation on Domain Level ........................................................................ 94

4.7.4 Alleviation of Trade-Off between Effectiveness and Efficiency ................................ 96

4.8 Semantic Bridges for Loose Coupling of Domain Ontologies ......................................... 97

4.8.1 Generalization and Polymorphism .............................................................................. 98

4.8.2 Facet Analysis Classification ...................................................................................... 99

4.8.3 Declarative Rule-based Entity Manipulation .............................................................. 99

4.8.4 Operation of Semantic Bridges ................................................................................. 100

4.8.5 Benefits of Developed Approach for Semantic Bridges ........................................... 101

4.9 Summary and Reflection ................................................................................................. 103

Chapter 5 Methodology and Functional Architecture for Semantic Mediation in SOA .. 107

5.1 Overview ......................................................................................................................... 107

5.2 Methodology Requirements and Domain-specific Considerations ................................. 107

5.3 Semantic Mediation Aligned to SOA Life-Cycle ........................................................... 110

5.4 Domain Ontology Development ..................................................................................... 113

5.4.1 Goals and Tasks ........................................................................................................ 113

5.4.2 Existing Work ........................................................................................................... 115

5.5 Mediated Business Process Modeling ............................................................................. 115

5.5.1 Goals and Tasks ........................................................................................................ 115

5.5.2 Functional Architecture ............................................................................................. 118

5.5.3 Related Work............................................................................................................. 120

5.6 Semantic Bridge Definition ............................................................................................ 121

5.6.1 Goals and Tasks ........................................................................................................ 121

5.6.2 Existing Work ........................................................................................................... 123

5.7 Semantic Bridge Testing ................................................................................................. 125

5.7.1 Goals and Tasks ........................................................................................................ 125

5.7.2 Functional Architecture ............................................................................................. 127

5.7.3 Related Work............................................................................................................. 129

5.8 Semantic Service Enrichment ......................................................................................... 129

5.8.1 Goals and Tasks ........................................................................................................ 130

5.8.2 Existing Work ........................................................................................................... 131

5.9 Mediated Service Composition ....................................................................................... 132

5.9.1 Goals and Tasks ........................................................................................................ 132

Table of Contents

vii

5.9.2 Functional Architecture ............................................................................................. 134

5.9.3 Related Work............................................................................................................. 136

5.10 Meditated Process Execution .......................................................................................... 138

5.10.1 Goals and Tasks ........................................................................................................ 138

5.10.2 Functional Architecture ............................................................................................. 139

5.10.3 Related Work............................................................................................................. 141

5.11 Summary and Reflection ................................................................................................. 141

Chapter 6 Realization of Semantic Mediation Toolkit ........................................................ 143

6.1 Overview ......................................................................................................................... 143

6.2 Mediated Business Process Modeling Tool .................................................................... 143

6.2.1 System Requirements ................................................................................................ 144

6.2.2 Design and Realization .............................................................................................. 144

6.2.3 Scenario, Validation and Verification ....................................................................... 149

6.3 Semantic Bridge Testing Tool ........................................................................................ 152

6.3.1 System Requirements ................................................................................................ 152

6.3.2 Design and Realization .............................................................................................. 153

6.3.3 Scenario, Validation and Verification ....................................................................... 156

6.4 Mediated Service Composition Tool .............................................................................. 159

6.4.1 System Requirements ................................................................................................ 159

6.4.2 Design and Realization .............................................................................................. 160

6.4.3 Scenario, Validation and Verification ....................................................................... 164

6.5 Meditated Process Execution Tool ................................................................................. 166

6.5.1 System Requirements and Challenges ...................................................................... 166

6.5.2 Design and Realization .............................................................................................. 167

6.5.3 Scenario, Validation and Verification ....................................................................... 172

6.6 Usage and Extension of the Semantic Mediation Toolkit ............................................... 175

6.7 Summary and Reflection ................................................................................................. 175

Chapter 7 Evaluation and Case Study of an Exemplary Distributed Organization ......... 179

7.1 Evaluation Methodology ................................................................................................. 179

7.2 The German Chambers of Commerce and its eGovernment Context ............................. 180

7.2.1 The Chambers Service Bus and Service Hub ............................................................ 181

7.2.2 The Data Conference Working Group ...................................................................... 183

7.2.3 Achievements and Ongoing Challenges .................................................................... 185

7.2.4 Potential of the Semantic Mediation Approach......................................................... 187

7.2.5 Network Effect .......................................................................................................... 189

7.3 Coverage of Goals and Confirmation of Research Hypothesis ...................................... 190

7.3.1 Coverage of Conceptual Goals .................................................................................. 190

7.3.2 Confirmation of Research Hypothesis ...................................................................... 192

7.4 Summary and Reflection ................................................................................................. 193

Chapter 8 Conclusion and Outlook ....................................................................................... 195

8.1 Summary and Main Contributions .................................................................................. 195

8.2 Evolution and Outlook .................................................................................................... 199

Bibliography ........................................................................................................................... 203

Appendix ........................................................................................................................... 217

Domain Ontology Sample “RosettaNetOntology” .............................................................. 217

Domain Ontology Sample “MoonOntology” ...................................................................... 218

Semantic Bridge Sample “RosettaNetOntology2MoonOntology”...................................... 219

Semantic Web Service Sample “MoonCRMService” ......................................................... 220

viii

List of Figures

Figure 1-1 Thesis Structure ........................................................................................................... 8

Figure 2-1 Semantic Interoperability Gap ................................................................................... 16

Figure 2-2 Service Interaction Model ......................................................................................... 19

Figure 2-3 Enterprise SOA Layers [32] ...................................................................................... 19

Figure 2-4 SOA Layer Model ..................................................................................................... 21

Figure 2-5 Service Model ............................................................................................................ 22

Figure 2-6 Framework of Semantic Interoperability in SOA ...................................................... 23

Figure 3-1 Cross-Organizational Communication using HTTP and XML [37] ......................... 28

Figure 3-2 Web Service Interaction Model ................................................................................. 29

Figure 3-3 Flow-based Web Service Composition [43] .............................................................. 30

Figure 3-4 Web Service Stack ..................................................................................................... 30

Figure 3-5 Development Environment for Process Design ......................................................... 32

Figure 3-6 WSDL-based Web Service Model ............................................................................ 33

Figure 3-7 Human Interaction in Web service technology based SOA-Life-Cycle .................... 34

Figure 3-8 Placement of XML in the Semantic Interoperability Gap ......................................... 36

Figure 3-9 Typical Knowledge Representation System based on Description Logics [70] ........ 39

Figure 3-10 Semantic Web Stack ................................................................................................ 40

Figure 3-11 XML Serialization of RDF ...................................................................................... 41

Figure 3-12 Classification of Semantic Web Service Concept [99] ............................................ 44

Figure 3-13 Exemplary Web Service Ontology [100] ................................................................ 45

Figure 3-14 Generic Semantic Web Service Grounding ............................................................. 46

Figure 3-15 Machine-based Interpretation of Web Services ....................................................... 47

Figure 3-16 Semantic Integration with Semantic Web Services ................................................. 48

Figure 3-17 Top Level of OWL-S Service Ontology [107] ........................................................ 49

Figure 3-18 WSMO Top Level Notions [103] ............................................................................ 51

Figure 3-19 SAWSDL Overview [118] ...................................................................................... 53

Figure 3-20 Shift of Abstraction Level using Semantic Web Services ....................................... 55

Figure 3-21 Global-as-View [130] .............................................................................................. 58

Figure 3-22 Local-as-View [130] ................................................................................................ 58

Figure 3-23 RM-ODP Inter-Domain Communication Architecture [134] .................................. 59

Figure 3-24 Three Ontology-based Semantic Integration Strategies [139] ................................. 61

Figure 3-25 Example Ontologies with Mappings [140] .............................................................. 62

Figure 3-26 Basic Steps in Ontology Mapping ........................................................................... 62

Figure 3-27 Step 1 of Ontology Mapping: Mapping Discovery ................................................. 63

Figure 3-28 Step 2 of Ontology Mapping: Mapping Representation .......................................... 63

Figure 3-29 Step 3 of Ontology Mapping: Mapping Deployment .............................................. 64

Figure 3-30 Step 4 of Ontology Mapping: Mapping Application ............................................... 65

Figure 4-1 From Monolithic to Loosely Coupled Information Models on Domain Level ......... 71

Figure 4-2 Shift of Semantic Integration with Loosely Coupled Ontologies .............................. 72

Figure 4-3 Semantic Standardization vs. Semantic Mediation ................................................... 74

Figure 4-4 Integration of Multiple Interface Technologies vs. Web Service Standards ............. 76

Figure 4-5 Consensus Degree and Appropriate Scope of Standards [167] ................................. 79

Figure 4-6 Model of Conception [170] ....................................................................................... 81

Figure 4-7 Model of Conception Applied to Information Models .............................................. 81

Figure 4-8 Constructive Model Relations ................................................................................... 82

List of Figures

Figure 4-9 Constructive Model Relations and Information Models ........................................... 83

Figure 4-10 Transfer of Loose Coupling to the Semantic Level ................................................. 85

Figure 4-11 Dimensions of Coupling [175] ................................................................................ 87

Figure 4-12 Degree of Coupling and Functional Distance [176] ................................................ 87

Figure 4-13 Definition of Loosely Coupled Information Models ............................................... 91

Figure 4-14 Point-to-Point Mediation ......................................................................................... 93

Figure 4-15 Pivot Ontology based Standardization .................................................................... 94

Figure 4-16 Semantic Mediation on Domain Level .................................................................... 95

Figure 4-17 Effectiveness and Efficiency Gain .......................................................................... 96

Figure 4-18 Heterogeneous Domain Information Models ........................................................ 100

Figure 4-19 Semantic Bridge Operation (Step 1) ...................................................................... 101

Figure 4-20 Semantic Bridge Operation (Step 2) ...................................................................... 101

Figure 5-1 Domain Actor Model for Semantic Mediation Methodology ................................. 109

Figure 5-2 Semantic Mediation Methodology .......................................................................... 111

Figure 5-3 Scoping of Domain Ontologies Aligned to Organizational Structures.................... 114

Figure 5-4 Protégé Ontology Editor [191] ................................................................................ 115

Figure 5-5 Semantic Mediated Business Process Modeling ..................................................... 117

Figure 5-6 Functional Architecture Mediated Business Process Modeling .............................. 118

Figure 5-7 Semantic Extension of Business Process Modeling Notation ................................. 119

Figure 5-8 Mediation between Business and IT Perspective [199] ........................................... 121

Figure 5-9 Big Picture Semantic Bridge Definition [201] ........................................................ 122

Figure 5-10 Required Entity Manipulation between Different Semantic Sub-Graphs ............. 123

Figure 5-11 Graphical representation of a mapping rule in Snoggle [209] ............................... 124

Figure 5-12 Basic Idea of Semantic Bridge Testing ................................................................. 127

Figure 5-13 Functional Architecture of Semantic Bridge Testing Tool.................................... 128

Figure 5-14 Basic Idea of Semantic Web Service Enrichment ................................................. 130

Figure 5-15 ASSAM WSDL to OWL-S Annotator GUI [222] ................................................ 131

Figure 5-16 Basic Idea of Mediated Service Composition ....................................................... 133

Figure 5-17 Functional Architecture of Mediated Service Composition .................................. 135

Figure 5-18 SATINE Composition Phases and Tools [230] ..................................................... 137

Figure 5-19 Basic Idea of Mediated Process Execution based on BPEL .................................. 139

Figure 5-20 Functional Architecture of Mediated Process Execution ...................................... 140

Figure 6-1 System Architecture of Mediated Business Process Modeling Tool ....................... 145

Figure 6-2 GUI of Mediated Business Process Modeling Tool ................................................ 147

Figure 6-4 Polymorph Information Entities embedded in BPMN ............................................ 148

Figure 6-3 Realization of Semantic Mediation Mechanism ...................................................... 148

Figure 6-5 Realization of Semantic Pool .................................................................................. 149

Figure 6-6 Purchase Order Mediation Scenario Overview [246] .............................................. 150

Figure 6-7 Scenario Performed with Mediated Business Process Modeling Prototype ............ 151

Figure 6-8 System Architecture of Semantic Bridge Testing Tool ........................................... 153

Figure 6-9 Test Project Ontology .............................................................................................. 153

Figure 6-10 GUI of Semantic Bridge Testing Tool .................................................................. 155

Figure 6-11 Heterogeneous Domain Ontologies "Blue" and "Moon" ...................................... 156

Figure 6-12 Example Mapping Rules Created with Snoggle Mapping Tool ............................ 157

Figure 6-13 Semantic Bridge and Polymorph Classification Example ..................................... 157

Figure 6-14 System Architecture of Mediated Service Composition Tool ............................... 160

Figure 6-15 GUI of Mediated Service Composition Tool ........................................................ 163

Figure 6-16 eGovernment Scenario for Mediated Service Composition .................................. 165

Figure 6-17 System Architecture of Mediated Process Execution Engine ............................... 168

Figure 6-18 Typed Container in BPEL Variable....................................................................... 169

List of Figures

Figure 6-19 Mediated Process Execution in Semantically Enhanced Process Engine.............. 171

Figure 6-20 Purchase Order Mediation Scenario and Semantic Extensions ............................. 172

Figure 7-1 Task-oriented Isolated IT Applications ................................................................... 181

Figure 7-2 Overview of Chambers Service-Oriented Architecture ........................................... 182

Figure 7-3 General Approach of Data Conference Working Group ......................................... 183

Figure 7-4 Data Conference Methodology ................................................................................ 184

Figure 7-5 Planned Mediation Services of Chambers Service Hub .......................................... 186

Chapter 1

Introduction

1.1 Background and Motivation

The last two decades have shown a major shift from stand-alone to networked information

technology (IT) systems. Today, networked IT systems based on the infrastructure of the World

Wide Web provide the technological backbone of enterprise ecosystems enabling various

business process chains and networks within and across organizational borders. Consequently,

the effectiveness and efficiency of integration of independent and distributed IT systems is of

great practical importance, which can already be seen by the estimation that up to 40% of

companies‟ IT budgets are spent on integration issues [1]. Considering historically grown and

heterogeneous IT landscapes, the ability of organizations and their IT systems to work together

– namely by ensuring interoperability – is the key factor for achieving seamless business

processes. Consequently, cross-organizational interoperation of IT systems becomes a critical

business success factor [2].

Interoperability can be understood along three dimensions: technical, semantic and

organizational interoperability [3]. Although the concept of service-oriented architectures

(SOA) [4] and widely accepted Web service standards [19] have benefited technical

interoperability in recent years substantially, managing and integrating semantic differences in

heterogeneous distributed environments remains critical and cost intensive [5]. In fact, case

studies have shown that 60-80% of the resources of integration projects are spent on reconciling

semantic heterogeneities [6].

To provide an example, a distributed organization such as the German Chamber of Industry and

Commerce with 80 decentralized sites can be considered. The sites are operated by four

different IT service providers resulting in a heterogeneous IT landscape. In order to establish

organization-wide business processes, in particular existing historically grown information

models of different providers need to be integrated. The semantic integration challenge further

increases taking into account the various external business partners to be integrated in cross-

organizational business processes.

In order to preserve the precise meaning as data is moved from one IT system to another,

ontologies have evolved as the concept of choice from academia to first industry adoption.

Ontologies provide the means for generating explicit formal information models of a domain

that can be shared between applications. The description logic-based expressiveness of

ontologies not only enables humans to develop, discuss and agree on shared conceptualizations

but also enables machines to interpret these information models in a meaningful manner across

different IT systems.

However, the dominant approach of developing one common ontology-based standard for

information exchange, which has to be globally shared by all actors in a distributed IT

Chapter 1

ecosystem, has turned out to be limited in real world cross-organizational contexts. In practice,

organizational boundaries and the complexity deriving from different requirements on

information models hinder the overall commitment to one common conceptualization. Thus,

ontology-based standards could only alleviate the problem of semantic heterogeneity and a

mapping between ontologies originating from different contexts is needed [7]. Consequently,

diverse SOA landscapes covering multiple independent organizations require a more flexible

information architecture to achieve semantic consistency while allowing for and accepting

different conceptualizations.

1.2 Overall Goals and Scope

The core concept of SOA is the decomposition of complex business processes into a

composition of loosely coupled independently managed services providing distinct business

functionalities. The guiding idea of this work is that the same principle of loosely coupled units

can be applied to information models, in order to capture the complexity of semantics in

distributed IT ecosystems.

According to this principle, the overall goal of this thesis is to contribute to the reduction of

complexity in semantic system integration by analyzing, designing, instantiating and evaluating

an information architecture pattern for large-scale SOA landscapes based on semantic mediation

between loosely coupled information models.

The concept should take into account the realistic perspective of different conceptualizations

and resulting information representations that need to evolve independently from each other to

serve best for their domain. Therefore, the concept should allow for autonomous management of

self-contained information models of independent business domains. Furthermore, the concept

should target semantic interoperability on the level of domain models rather than addressing it

recursively during process integration on the application level. In order to facilitate semantic

consistency in cross-organizational SOA scenarios, these heterogeneous domain-specific

information models should be interlinked in a loosely coupled manner by means of an effective

and efficient semantic mediation mechanism. The mechanism has to provide a high level of

expressiveness that enables to reconcile complex semantic heterogeneities between information

representations from different domain models. And at the same time the mechanism should be

easy to handle. Thus, declarative approaches should be favored in contrast to procedural ones in

order to assure efficient maintainability.

Moreover, a technology instantiating the concept of semantic mediation should be developed.

To reap the benefits of explicit semantic formalizations, it should be based on emerging

Semantic Web technologies. In particular domain ontologies and description logic rules should

be exploited to describe ontology mappings in terms of so called semantic bridges [8]. Semantic

bridges provide declarative reasoning-based means which can be integrated in SOA scenarios

for aligning heterogeneous information models and thus ensure semantic interoperability by

remaining organizational independence. However, taking into account technological path

dependency in SOA, already existing traditional XML-based Web service technology should be

respected and therefore the approach should be realized as an additional semantic layer on top

of existing technology.

Given the horizontal nature of semantic interoperability, implications of the approach of

semantic mediation to the SOA life-cycle should be derived with a focus on cross-

organizational aspects. Consequently, the approach should be applied to key steps of the SOA

Introduction

life-cycle from conceptual business process modeling, over service composition to runtime

process execution. The therefore required technologies should be bundled in a semantic

mediation toolkit, which finally should be evaluated in terms of a case study of an exemplarily

distributed organization.

To summarize, the objectives of this work are to:

 provide problem awareness in terms of a framework of semantic interoperability in

SOA used to analyze the state-of-the-art and outline open challenges;

 develop a concept for semantic mediation between loosely coupled information

models in SOA;

 design a semantic mediation methodology that applies the approach to the SOA life-

cycle;

 instantiate key steps of the methodology in terms of a semantic mediation toolkit;

 and evaluate the semantic mediation approach in terms of a case study of an

exemplary distributed organization.

The identified overall goals and objectives are further refined in the following section covering

the methodology of this work and its research hypothesis.

1.3 Methodology

1.3.1 Scientific Hypothesis and its Confirmation

Based on the above presented overall goals and objectives the scientific hypothesis of this work

can be formulated as follows:

In order to effectively and efficiently achieve semantic interoperability in large-

scale cross-organizational service-oriented architectures, the principle of loose

coupling can be applied to information models based on a flexible semantic

mediation mechanism using Semantic Web technology for autonomous

management and integration of domain-specific information models in terms of

self-contained ontologies.

To confirm the hypothesis a systematic approach is followed. The research methodology is

aligned to the approach of design research in information systems. Design research has its origin

in engineering and sciences of the artificial [9]. The approach is motivated by improving the

state-of-the-art in terms of solving practical problems, whereby the utility of the solutions is

focused. In the context of the design paradigm, understanding and knowledge of the problem

domain and its solution are achieved by construction and application of designed artifacts.

Information systems artifacts are defined as constructs (vocabularies and symbols), models

(abstractions and representations), methods (sequence of activities) and instantiations

(implemented and prototype systems) [10]. The results of design research in information

systems are useful artifacts built to address an organizational problem.

Corresponding to the basic steps in design research [11], the confirmation of the hypothesis is

structured in five consecutive parts. For each general step in design research the concrete

artifact produced in this work is further specified:

Chapter 1

1. Awareness of a problem – Framework of Semantic Interoperability in SOA and State-

of-the-Art

This work addresses the problem of achieving semantic interoperability in large-scale cross-

organizational service-oriented architectures. Therefore, based on literature review,

definitions and models of semantic interoperability are analyzed including their context to

other dimensions of interoperability such as technical and organizational interoperability.

An aggregation of conceptual models for semantic interoperability is further specified to the

focused domain of SOA. Consequently, a conceptual framework of semantic

interoperability in SOA has to be derived to deepen the understanding and providing a

consistent conceptualization of the problem area. The framework then is utilized as a

reference point for comparison in the following chapters of this work. Furthermore, an

analysis is given discussing advantages and limitations of state-of-the-art approaches and

technologies for achieving semantic interoperability in SOA, whereas it is referred to the

previously developed framework.

2. Suggestion – Concept of Semantic Mediation between Loosely Coupled Information

Models

The guiding idea of this work is to transfer the concept of loose coupling to the semantic

level. In contrast to limitations of state-of-the-art approaches based on one common

information model to be globally-shared as a lingua franca, this work develops a concept

based on multiple coexisting information models. It aims to provide a flexible information

architecture pattern that allows for autonomous management of distinct information models,

whereas a semantic mediation mechanism provides loose coupling between them to ensure

semantic interoperability. In particular, the claimed effectiveness and efficiency of the

developed approach is addressed by a comparative analysis. The concept is further

concretized by relating it to formal ontologies representing information models of specific

domains. Furthermore, the concept introduces a description logic rules-based ontology

mapping approach, in order to realize the semantic mediation between the heterogeneous

domain ontologies.

3. Development - Semantic Mediation Methodology for SOA Life-Cycle and Semantic

Mediation Toolkit

By means of a connecting step between theory and experiment, the theoretical concept is

mapped to the concrete application domain of SOA. A specific semantic mediation

methodology is developed that determines the basic steps relevant for the application of the

concept of semantic mediation to the SOA life-cycle. In order to instantiate key steps of the

methodology and to provide an experimental confirmation, a prototypical toolkit based on

Semantic Web technologies is designed and developed. The toolkit integrates existing

components and services and is extended with key tools required for semantic mediation in

the SOA life-cycle. In particular, the semantic mediation toolkit addresses the design of

semantic bridges in terms of ontology mapping rules, their systematic testing, their

integration into business process modeling, as well as into service composition and finally

into runtime execution infrastructures.

4. Evaluation - Case Study of an Exemplary Distributed Organization

The evaluation of the developed approach of semantic mediation is addressed from a

practical perspective investigating its effectiveness and efficiency in comparison to state-of-

the-art approaches. For this purpose the developed methodology and the toolkit is mapped

to an exemplary distributed organization in terms of a case study. Thus, the potential of the

Introduction

semantic mediation concept is demonstrated. Finally, the originally set goals and the

derived research hypothesis are recalled and discussed, in order to access how and to which

extent they could be covered and whether the claims of the research hypothesis could be

confirmed.

5. Conclusion

The conclusion summarizes the before described steps and points out the main conceptual

conclusions and scientific contributions in a condensed manner. Furthermore, remaining

open issues are discussed and potential extensions and future work is outlined.

1.3.2 Research Questions and Technical Challenges

In the above outlined multi-step process for the confirmation of the hypothesis various

challenges have to be overcome. In the following, the central research questions and technical

challenges are outlined.

Challenges in Step 1: Framework of Semantic Interoperability in SOA and State-of-the-

Art

 Semantic interoperability is an abstract concept, which frameworks about interoperability

often do not clearly distinguish from related aspects such a syntactical or structural

interoperability originating from a more technical perspective or with pragmatic

interoperability leading to a more organizational perspective. The framework to be

developed should differentiate between these aspects and define the scope of semantic

interoperability as it is addressed in this work.

 The framework of semantic interoperability should be expressive enough to compare

various approaches possibly following opposing concepts. The range should cover industry-

based state-of-the-art approaches to academic-driven ontology-based ones on the one hand

and as well approaches based on shared homogeneous information models following a

semantic standardization approach to approaches accepting and focusing on heterogeneous

conceptualizations on the other hand.

Challenges in Step 2: Concept of Semantic Mediation between Loosely Coupled

Information Models

 It should be investigated why on the one hand, the success of widely accepted Web service

standards for SOA has benefited technical interoperability substantially in recent years, but

on the other hand, standardization on the semantic level has turned out to be limited in the

cross-organizational context. Therefore, analogies from other fields of standardization

should be derived, in order to examine the relation between consensus degree and adequate

scope of standards and its implication for the semantic level.

 Furthermore, it should be investigated how context dependency of information models

influences heterogeneous conceptualizations and how this relates to limiting factors for their

monolithic alignment. Therefore, a model theoretic approach should be mapped to

information models.

 The transfer of the concept of loose coupling to information models implies that the central

principles of loose coupling are addressed. Therefore, the question should be addressed how

principles such as autonomy, encapsulation and flexible binding can be applied to the

Chapter 1

semantic level and how these characteristics can be interpreted to provide a specification of

loosely coupled information models and a corresponding semantic mediation mechanism.

 Having identified the practical limitations of semantic standardization across organizational

boundaries, a trade-off between effectiveness and efficiency for achieving semantic

interoperability becomes apparent. On the one hand, actors have to develop an agreement

in terms of a community process about one common standardized information model.

However, with regard to cross-organizational and heterogeneous IT landscapes with a large

number of actors with possibly divergent business requirements, a high coordination

complexity appears hindering an effective solution. On the other hand, aiming at an

approach which is based on direct mediation between each two independent information

models just requires coordination efforts for two actors, which results in lower complexity.

However, this effort could be potentially become necessary times to map between

each two information models. Thus, both general approaches do not provide a sufficient

solution regarding effectiveness on the one side and efficiency on the other side. Therefore,

an adequate solution within this trade-off needs to be addressed by the developed concept of

semantic mediation.

 As the developed concept for semantic mediation is designed to be based on Semantic Web

concepts and technologies, it needs to be pointed out which specific features of Semantic

Web languages and meta-models are the beneficial and enabling factors for the semantic

mediation approach compared to other technologies.

Challenges in Step 3: Semantic Mediation Methodology for the SOA Life-Cycle and

Semantic Mediation Toolkit

In order to develop the methodology and toolkit, the relevant phases of the SOA life-cycle

where mediation between heterogeneous information models is required need to be identified

and the afore-developed conceptual solution needs to be applied.

 The SOA life-cycle starts from the business perspective on how processes can be supported

by IT systems. Therefore, with regard to semantic mediation the modeling of cross-

organizational business processes should be covered, whereas the modeling of information

flow across heterogeneous conceptualizations is of particular concern. In order to ease the

modeling of business processes and reduce technical complexity, the heterogeneity between

different information models should be transparent for the user and its resolution should be

handled automatically based on underlying semantic bridges. This implies that required

information models and semantic bridges are already in place. Furthermore, coming from

the perspective of agile development and continuous maintenance, information models need

to evolve over time and correspondingly semantic bridges between them. According to

process-orientation, the requirements for the evolution should be derived from business

processes. Consequently, specific features for requirement engineering of information

models and semantic bridges should be supported during cross-organizational business

process modeling.

 The identified requirements provide a foundation for the development and testing of

semantic bridges. As first prototypical tools for the development of semantic mappings are

already available, they can be exploited in an adequate manner, in order to define semantic

bridges according to the requirements of the developed semantic mediation mechanism.

Furthermore, taking into account that semantic bridge developers and users such as process

experts or Web service composers are divided into different roles and may originate from

different organizational contexts, the consideration of trust in the quality of the underlying

semantic mappings is essential. Therefore, an approach and tool for testing of semantic

Introduction

bridges should be provided. The focus should be put on how to apply concepts from

software testing to testing of ontology mappings.

 Having all required assets such as business process models, information models and quality

assessed semantic bridges at hand; the consequent next step of the SOA life-cycle is the

composition of services to instantiate the business process. Therein, the explicit semantic

description of information models and formalized semantic bridges between the involved

heterogeneous information models should be exploited for seamless information flow

design between the services to be composed. One particular challenge lies in the

consideration of technological path dependency. On the one hand, the dominant

instantiation of SOA is based on Web service technology, which relies on the XML and

XML schema meta-data model. On the other hand, the meta-data model applied for the

semantic mediation approach is based on ontology concepts and description logic based

rules. Thus, a challenge is to integrate as well an appropriate mapping mechanism between

these two meta-data models and realize the solution as an additional layer on top of existing

technology.

 After design time, the runtime execution of Web service compositions takes the focus in the

SOA life-cycle. Again, well established industry standards should be considered. On this

regard especially the industry standard BPEL [12] should be addressed, which relies on the

XML meta-data model, too. Therefore, components providing Semantic Web technology

have to be incorporated into BPEL-based process integration middleware and the different

meta-data models need to be reflected on the runtime level. Another challenge thereby lies

in ensuring a reasonable performance during the rule-based inferencing process, which still

often remains a bottleneck of Semantic Web technology.

Challenges in Step 4: Case Study of an Exemplary Distributed Organization

 The evaluation needs to address how the potential of the developed methodology and toolkit

for loosely coupled domain-specific ontologies can be qualitatively analyzed and

demonstrated. Therefore, a case study is carried out in context of a research transfer project

with the German Chambers of Industry and Commerce. The Fraunhofer Institute for Open

Communication Systems (FOKUS) supports the introduction of an SOA-based IT

integration infrastructure to the German Chambers of Industry and Commerce consisting of

80 decentralized sites, which are operated by four different IT service providers. In

particular, the activities of the data conference working group targeting the development

and alignment of organization-wide information models and semantic integration with

external business process partners in the larger eGovernment context are subject to the

evaluation. In this process, shortcomings of applied state-of-the-art practices and

technologies need to be pointed out and compared to the potential provided by the

developed semantic mediation approach.

Chapter 1

1.4 Outline of the Thesis

After having discussed the objectives and the methodology of the work, this section outlines the

structure of the thesis. The thesis is organized in 8 chapters, which are derived straightforward

from the applied methodology of design research as illustrated in the following figure:

Figure 1-1 Thesis Structure

Chapter 1 gives the motivation and background of this work, its goals and scope and the

research hypothesis and methodology structuring this thesis.

Chapter 2 provides an understanding of the challenge to achieve semantic interoperability in

cross-organizational service-oriented architectures. Finally, a conceptual framework of semantic

interoperability in SOA is elaborated, in order to provide a foundation for comparison in the

following chapters.

Chapter 3 then performs a systematic state-of-the-art analysis of existing approaches.

Conceptual ideas, technologies and standards for achieving semantic interoperability in SOA

originating from different backgrounds ranging from industry to academia are presented and are

evaluated against the before developed framework.

Chapter 4 presents the core part of this work namely the concept of semantic mediation between

loosely coupled information models in SOA. The transfer of the principle of loose coupling to

the semantic level is discussed based on a conceptual argumentation ranging from the

limitations of semantic standardization to context dependency of information models and its

consequences for semantic interoperability in cross-organizational SOA. Finally, a specification

Introduction

of loosely coupled information models in SOA is provided including a developed conceptual

approach for a corresponding semantic mediation mechanism.

Chapter 5 provides an intermediate step between the developed theory and its application and

presents a derived semantic mediation methodology. It maps the concept to the SOA life-cycle

ranging from business process modeling, over service composition to runtime process

execution. It describes how the semantic mediation approach can be integrated within these

phases, in order to improve effectiveness and efficiency in achieving semantic interoperability.

Chapter 6 presents the developed semantic mediation toolkit, which instantiates key steps of the

before developed methodology. It includes prototypical tools for mediated business process

modeling, semantic bridge testing, mediated service composition and mediated process

execution. Its realization by means of a combination of state-of-the-art Web service

technologies and emerging description logic based Semantic Web technologies is described

with regard to design and development aspects.

Chapter 7 evaluates the developed approach for semantic mediation. Based on a case study of an

exemplarily distributed organization, namely the German Chambers of Industry and Commerce,

the semantic mediation methodology and the potential of the developed toolkit are assessed. On

the basis of this analysis, the coverage of the originally set conceptual goals and the

confirmation of the research hypothesis are discussed.

The final Chapter 8 provides a conclusion of the thesis and recalls the fundamental concepts and

ideas of the proposed approach for semantic mediation between loosely coupled information

models in SOA. Moreover, remaining open issues and potential advancements are discussed.

Finally, an outlook on future developments and priorities in this area is outlined.

Chapter 2

Understanding the Challenge of Semantic

Interoperability in SOA

2.1 Overview

The introduction has already outlined that interoperability is the enabling factor to achieve IT-

supported business processes across intra- and inter-organizational boundaries. The mentioned

estimation that enterprises and organizations today spend up to 40% of their IT budget on

integration projects [1] further points out the relevance and shows that interoperability has

become a crucial competitive factor. Taking into account this background, it becomes

comprehensible that the promise to advance interoperability has been a central success factor for

the adoption of SOA. However, focusing on a particular dimension of interoperability, namely

semantic interoperability, still substantive limitations have to be overcome as managing and

integrating semantic differences in heterogeneous distributed environments remains critical and

cost intensive [5]. As this work aims at advancing the way semantic interoperability in cross-

organizational SOA is achieved, firstly the problem area of this particular interoperability

dimension is analyzed.

This chapter begins by setting the scope of the addressed problem and putting semantic

interoperability into the context of related dimensions of interoperability. Then, existing

conceptual models dedicated to semantic interoperability are reviewed and interpreted from the

perspective of SOA. Furthermore, in order to develop a framework for comparison of existing

and emerging approaches, an aggregation of the analyzed conceptual models is derived. The

derived framework should deepen the understanding and provide a consistent conceptualization

of the problem area to be utilized as a reference point in the following chapters. Thereby, the

framework is limited to a descriptive scope with a clear distinction to the description of a

solution as targeted in the conceptual part of this work.

2.2 Interoperability Dimensions

In the context of the European Union's Information Society activities, interoperability is defined

as the ability of information and communication technology systems and of the business

processes they support to exchange data and to enable the sharing of information and

knowledge [3]. In order to understand the nature of interoperability, it is important to note that

interoperability is not a static property, which is provided or not, but rather a continuous degree

which can be achieved to a lower or higher extent. According to this conception, a further

definition states that interoperability is the ongoing process of ensuring that the systems,

Chapter 2

procedures and cultures of an organization are managed in such a way as to maximize

opportunities for exchange and re-use of information, whether internally or externally [13].

Given such a wide scope within the suggested definitions, it becomes useful to further subdivide

the notion of interoperability. The European interoperability framework distinguishes between

three main interoperability dimensions [3]:

 technical interoperability, which is concerned with the technical issues of linking up

computer systems, the definition of open interfaces and telecommunication protocols

 semantic interoperability, which is concerned with ensuring that the precise meaning of

exchanged information is understandable by any other application not initially

developed for this purpose

 organizational interoperability, which is concerned with modeling business processes,

aligning information architectures with organizational goals and helping business

processes to co-operate

Other frameworks about interoperability introduce further dimensions to be considered, such as

the political context including cultural aspects or legal interoperability dealing with the

alignment of heterogeneous legal environments that my hinder integration [14].

However, as this work addresses the dimension of semantic interoperability, deeper analysis

should be given with a finer granularity on its context within the three interoperability

dimensions presented above.

2.2.1 The Context of Semantic Interoperability

In [15] a changing focus on interoperability of information systems is discussed: from system,

over syntax and structure to semantics. Thereby, several types of heterogeneity are identified

with according types of interoperability:

 System - differences between hardware, operating systems, protocols etc.;

 Syntactic - incompatibilities in encodings and formats;

 Structural - differences in representation and schemata;

 Semantic - inconsistencies in terminology and meanings.

In this sense, system interoperability corresponds to the common understanding of technical

interoperability as outlined above in context of the European interoperability framework.

However, the above presented perspective identifies further aspects between technical and

semantic interoperability. On the one hand, syntactic interoperability can be referred to the

ability of different systems to interpret the syntax of data the same way, i.e. to share common

rules how parts of data can be arranged together. In particular, this deals with technical aspects

such as the alignment of common APIs, interchange formats and messaging standards. And on

the other hand, structural interoperability can be identified, which refers to the ability to align

different data representations based on differently structured schemata. Taking into account that

schemata relate to specific domains or applications, it points out that the structure of data in

terms of schemata captures partly – in terms of a limited view – the aspect of meaning. Thus, it

can be stated that structural interoperability is closer related to semantic interoperability than

syntactical interoperability, as syntax is generally more independent and generic from the

specific domain or application context of IT systems.

Understanding the Challenge of Semantic Interoperability in SOA

However, semantic interoperability covers further aspects than discussed in context of structural

interoperability. Data representation in terms of schemata cannot capture the entire meaning as

they lack the description of context of data and explicit description of relations between data,

which constitute fundamental aspects of meaning [16]. From the research area of linguistics a

further aspect comes into the play and the before discussed field is broken down into the three

branches [17]:

 Syntactics - relation of signs to each other in formal structures;

 Semantics - relation between signs and the things they refer to;

 Pragmatics - relation of signs to their impacts on those who use them.

With regard to interoperability, the aspect of pragmatics is reflected and referred to the

pragmatic interoperability problem, which arises when the sender‟s intended effect of a message

differs from the actual effect of the message performed by the receiver [18]. In SOA, this is the

case when there is insufficient insight in the interworking of services and their interdependent

behavior [283]. This problem can be overcome by means of languages that define so called

service choreographies [284]. In [285] choreographies are defined as complex interactions with

behavioral dependencies between the contained interactions. Consequently, this aspect on how

information is processed depending on its dynamic or behavioral context, points out the relation

to the dimension of organizational interoperability with its particular focus on business process

alignment between different actors as identified in the European interoperability framework.

This section has analyzed the different dimensions of interoperability and positioned semantic

interoperability within these dimensions. With regard to technical interoperability, the aspect of

structural interoperability in terms of mismatching schemata has been identified as partly

overlapping with semantic interoperability. On the other hand, the aspect of pragmatic

interoperability has been identified as bridging the gap between semantic and organizational

interoperability.

While the requirement for interoperability in all three dimensions seems obvious, it is a fact that

IT systems today are not interoperable in the way that seamless process integration can be

realized to its full potential. Only with the ubiquity of internet technologies based on open

standards and specifications namely TCP/IP, HTTP and SMTP etc., it has been possible to

achieve a high degree of technical interoperability. In this context, the recent developments of

Web service standards [19] along with the advent of the SOA paradigm, which are discussed in

more detail in the next chapter, have to be highlighted as well. However, in order to enable IT

systems to exchange and combine information and accordingly process it in a meaningful

manner, it requires agreement on more complex issues, such as the relation to the context within

information is created and used and consensus on how to represent meaning of data in principle.

As this work focuses on semantic interoperability in SOA, the following section focuses

particularly on the semantic dimension of interoperability. The scope for semantic

interoperability as targeted in this work is pointed out taking into account the overlapping

aspects identified above. Furthermore, the dimension of semantic interoperability is broken

down to the targeted domain of SOA.

2.3 Semantic Interoperability

The quest for meaning in language has a history that is almost as old as language itself [20].

Accordingly, the challenge to achieve semantic interoperability of IT systems is an ongoing

Chapter 2

effort since the advent of distributed IT environments. In this process, is has turned out that

semantic interoperability requires much more than a simple agreement concerning the isolated

meaning of a term but rather depends on the individual context [20].

2.3.1 Terms as Representation of Meaning

To further understand semantic interoperability, an analysis of the words meaning and term is

required. In linguistics, meaning is considered as a human artifact [21]. In this sense, terms as

well as things to which terms refer have no meaning per se. The meaning is assigned to them by

human beings. In order to exchange the meaning, which is subject of semantic interoperability,

it has to be encoded by utilizing terms1, whereas inherently never all facets of meaning can be

represented but restrictions have to be made according to an individual context. Thus, a first

distinction can be made between the meaning as a human artifact or conceptual idea on the one

hand and the term which represents the meaning on the other hand.

To further analyze the semantic interoperability problem, which occurs if the exchanged terms

do not refer to the intended meaning, the characteristics of terms as a representation of meaning

in context of IT systems should be elaborated. According to system design in informatics, terms

that represent meaning refer to information models which can be distinguished along different

abstraction levels. This distinction between different abstraction levels for representation of

meaning provides the starting point and foundation of the envisaged conceptual framework for

semantic interoperability in SOA, which is presented in the following section.

2.3.2 Abstraction Levels for Representation of Meaning

Following the paradigm of separation of concerns, each different abstraction level for the

representation of meaning is used for a specific purpose. The starting point in IT system design

are highly abstract modeling languages, which should be closely related to the actual meaning

or conceptual idea in the mind of human beings. Such highly abstract modeling languages are

e.g. the Unified Modeling Language (UML) [22] or the Entity-Relationship Model (ER) [23]. In

order to be used in a concrete application context, these information models need to be broken

down to a lower application specific level. Considering the common database design

methodology of different abstraction levels, it can be distinguished between:

(1) the Conceptual (2) the Logical and (3) the Physical Data Model.

The conceptual data model is used for the abstract modeling of an information space as already

mentioned. The logical data model provides a more concrete view on the information space to

be used in application development. In the context of database systems, this means to map an

ER-model to tables, columns and rows, the relational model. The physical model is private to

the actual system processing and storing the data.

In order to get a consistent picture and integrate the above described analysis regarding terms as

representation of meaning, a further abstraction level can be added on top of the presented

levels. Accordingly, in the following the conceptual idea or the meaning in the human mind is

considered as the initial model of a thing or information. Consequently, it can be distinguished

between the following abstraction levels:

(0) the Conceptual Idea (1) the Conceptual Data Model (2) the Logical Data Model and

(3) the Physical Data Model

1 derived from terminus, lat.: terminus = border, border stone, identifier, denotation

Understanding the Challenge of Semantic Interoperability in SOA

A further reference to these basic abstraction levels for information models can be found in the

context of model driven architecture (MDA) [24] for modeling software systems from the

Object Management Group (OMG) [25]. MDA focuses on functionality and dynamic behavior,

however the static data-oriented part can be related to the identified abstraction levels. The

MDA viewpoints distinguish between:

(1) Computation Independent Model (CIM) - The computation independent model focuses

on the environment of the system and the requirements for the system. A CIM does not

show details of the structure of systems. It is sometimes called a domain model,

whereas a vocabulary that is familiar to the practitioners of the domain in question is

used in its specification. Accordingly, the information model in CIM corresponds to (1)

the conceptual data model described above.

(2) Platform Independent Model (PIM) - The platform independent model focuses on the

operation of a system while hiding the details necessary for a particular platform. A

platform independent view shows that part of the complete specification that does not

change from one platform to another. Accordingly, the information model in PIM

corresponds to (2) the logical data model described above.

(3) Platform Specific Model (PSM) - The platform specific model combines the platform

independent viewpoint with an additional focus on the detail of the use of a specific

platform by a system and the underlying implementation. Accordingly, the information

model in PSM corresponds to (3) the physical data model described above.

The identified abstraction levels have further differentiated the characteristics of terms as a

representation of meaning in context of IT systems and can be taken as a reference framework

to further analyze semantic interoperability problems. In the following, the terminology is

aligned to the identified abstraction levels in context of database system design including the

initial level of the conceptual idea. However, instead of data model or information model just

the term model is used. The conception of a message as data or as information depends on the

user‟s or receiver‟s ability to interpret the data according to a certain context. In a first step, this

differentiation should be out of scope while the focus is put on the identification of the different

abstraction levels for representing meaning by utilizing terms. In a following second step

(cf. Section 2.3.3), the usage of these abstraction levels for the exchange of meaning is analyzed

and then the differentiation between data and information becomes relevant.

To summarize, it can be distinguished between the following abstraction levels for

representation of meaning, which provide the starting point to the framework to further analyze

semantic interoperability:

(0) Conceptual Idea (1) Conceptual Model (2) Logical Model (3) Physical Model

2.3.3 Semantic Interoperability Gap

The goal of semantic interoperability is to ensure that the meaning of exchanged information is

preserved in different application context in a distributed IT system. However, as the conceptual

idea cannot be directly and fully formalized in an IT system and therefore not exchanged

directly, the conceptual idea has to be represented by means of terms. As described above, this

representation can be distinguished into the four different abstraction levels. Thereby, it is

important to note that the meaning of a term is inherently dependent on the context. The

expressiveness of context description thereby differs between the different abstraction levels.

The conceptual idea of a thing captures the full domain context, as it represents the initial model

and constitutes a kind of master model or reference model for capturing the meaning per

Chapter 2

definition. Following the path down to the lesser abstract levels, certain context information gets

reduced while the same time application specific information gets concretized. The conceptual

model reduces the potentially holistic conceptual graph from the conceptual idea to the focused

application domain and refines the conceptual structure such as generalization and composition

of classes and its attributes. In a further step, the logical model reduces explicit context

description between concepts, in order to map the representation to an application specific level.

Thus, logical operations can be well defined on a sufficiently concrete level to enable machine

interpretation and processing. On the physical level, context is only encoded implicitly on a

technical level specific to the IT system performing the application.

Regarding information exchange, this explicit context description or lack of it becomes

important for semantic interoperability. The following Figure 2-1 illustrates a typical

information exchange scenario between two IT systems based on service-oriented exchange of

messages. The fundamentals for their interoperation are overlapping conceptual ideas about a

domain model which describes a certain information space. The domain context may be

different between the two IT systems. However, the designers share or refer to the same

understanding of specific concepts, which can be located in the overlapping part of the two

conceptual ideas.

Figure 2-1 Semantic Interoperability Gap

Due to the different domain context the corresponding conceptual models are different. They

may also exhibit overlapping parts which are modeled consistently but they may also differ

completely and thus only with the reference to the conceptual idea the corresponding parts can

be identified. On the abstraction level of the logical model, additional differences in information

representation can be identified as the logical model maps the representation to an application

specific level, which further differs between IT System A and B. Thus, it can be stated that the

semantic interoperability gap grows with each lower abstraction level as the differences between

the representations and the conceptual idea increase. Consequently, the largest semantic

interoperability gap is given if information is exchanged on the physical level as the concepts

including their context relation are only encoded implicitly, which may differ completely

between the two IT systems.

In order to overcome the semantic interoperability gap, the information representation needs to

be interpreted – in an automated, machine-supported or manual manner – according to the next

Understanding the Challenge of Semantic Interoperability in SOA

higher abstraction level. Thus, the representation gets linked to the shared understanding in the

two shared conceptual ideas and semantic interoperability is ensured. Finally, in order finish the

round-trip across the semantic interoperability gap and to ensure that the information gets

processed by the receiving system, the information needs to be represented according to the

system‟s technical representation on the logical or physical level.

Moreover, it is important to note that even if in a concrete scenario a direct transformation

between two different physical models or logical models is applied, the above described steps

have to be performed logically and are included within the transformation as virtual steps.

However, this logical analysis points out the complexity which these transformations contain.

Another, conclusion can be drawn from the above developed model for semantic

interoperability: If two systems are equally designed with regard to their utilized information

models on the different abstraction levels, the logical steps to bridge the semantic

interoperability gap have to be performed as well. However, the actual round-trip can be

shortened, if two models are equal on the same abstraction level. The above described round-

trip just has to be performed up to the respective abstraction level where the models are equal.

Accordingly, the semantic interoperability gap on this level is not present and therefore no

reference to the upper abstraction level is required. This explains why semantic interoperability

can be achieved by standardizing the representation models of information – whenever this is

possible. This topic is detailed later on in Chapter 4.

Referring again to the aspects identified in the context of semantic interoperability as described

in Section 2.2.1, they can be mapped to the above introduced conceptual framework: Pragmatic

heterogeneity can be located at the highest level, as this level deals with the idea of a concept in

the human mind including the context about the intension of sending or receiving such

information. The identified aspect referred to as structural heterogeneity can be located at the

logical level, as it deals with heterogeneous schemata. Syntactic heterogeneity, which addresses

incompatibilities in encodings and formats, can be mapped to the physical level. Finally, what is

referred to semantic heterogeneity in general in Section 2.1.1 as addressing the inconsistencies

in terminology and meanings can be mapped to the level of conceptual ideas with regard to

meaning and to the conceptual, logical and physical model with regard to terminology each

representing the meaning on a different abstraction level. Thus, it can be stated that the

developed model describing the problem of semantic interoperability captures consistently the

different semantic interoperability aspects identified in current state of research (cf. Section

2.2.1).

As the aim of this chapter is to develop a framework for semantic interoperability in SOA to

provide a systematic understanding of the problem addressed in this work, the above developed

model describing the semantic interoperability gap needs to be mapped to the domain of SOA.

Therefore, as an intermediate step, the concepts and approaches of SOA are introduced in the

following section.

2.4 Service-Oriented Architecture

In [26] several definitions of SOA are introduced and compared, whereas the following unified

definition is provided:

„A service-oriented architecture is a framework for integrating business processes and

supporting IT infrastructure as secure, standardized services that can be reused and combined

to address changing business priorities.“

Chapter 2

This definition outlines two major goals of SOA, namely flexibility and reusability of IT

systems. These two characteristics become particularly relevant as more and more business

processes are spanning multiple organizational and administrative domains. Changing external

business partners need to be flexibly integrated into IT-supported processes while reuse of IT

infrastructure needs to ensure that this flexibility can be realized with regard to optimized

resource spending within economic constraints.

To address these challenges, the core concept of SOA is the decomposition of complex business

processes into a composition of loosely coupled independently managed services providing

distinct business functionalities. IT systems supporting business processes are split of into a set

of loosely coupled reusable services, whereas each service realizes one modular unit of business

logic.

The architectural model of SOA is based on fundamental principles such as modularization,

encapsulation and platform-independence. These principles are incorporated from prior

approaches, mainly from component-based middleware platforms. Thus, SOA is no

revolutionary new development but rather it is based on various known concepts and methods.

However, the component-based approach implemented in platforms such as CORBA [27] or

EJB [28] require the business partners to adopt a specific object model that might not be

suitable for all collaborating parties [29]. Considering that the lifecycle of a component has to

be managed by its consumer, the coupling between provision and usage of functionality must be

still regarded as tight.

Addressing this shortcoming and extending the ability of loose coupling, the central novelty of

the architectural model of SOA relies on the strict focus on the service concept. The service

concept represents a further step up in abstraction for distributed IT system design [30].

Consequently, not the component capturing the business functionality takes the center stage but

just the service which the component provides replaces the focal point. Thereby, the conceptual

model of a service can be defined as follows [31]:

i. A service establishes an agreed relationship between partners in two distinguished roles:

a service provider and a service user.

ii. A service is meant to produce benefit of a definite type to the service user and to meet

the user‟s needs.

iii. A service is the result generated by processes at the interface between the provider and

the user and by processes internal to the provider and internal to the user.

The service model can be further refined focusing on the interaction patterns between the above

identified roles of the service provider and service user. Accordingly, the service interaction

model often is equated with the “find-bind-execute” paradigm. As illustrated in Figure 2-2,

firstly a service provider registers a service at a registry which includes a description of the

service and the business context relevant for the usage. Then, a service user interacts with the

registry for finding service descriptions which fulfill certain criteria, e.g. regarding service class

or non-functional properties. This refers to the user‟s needs and the aimed produced benefit of a

definite type (cf. ii. above). Then, the service user utilizes the service description to bind to a

service provider and to invoke the provided service. Thus, the relation between service user and

service provider gets established.

Understanding the Challenge of Semantic Interoperability in SOA

Figure 2-2 Service Interaction Model

In order to enable this so called find-bind-execute paradigm, the SOA concept needs to be

instantiated with an appropriate technology. In recent years, fostered by broad standardization

initiatives and wide industry adoption, Web services have taken the lead role as the dominant

realization approach to implement SOA. The main technologies behind Web services such as

XML, WSDL, SOAP, UDDI and BPEL are analyzed in Chapter 3 with specific focus on how

they address the problem of semantic interoperability in SOA.

In order to outline how the principles of the SOA model are reflected within a typical enterprise

IT architecture, the following Figure 2-3 illustrates the decomposition of complex business

processes into a composition of loosely coupled Web services along the so called SOA layers:

Figure 2-3 Enterprise SOA Layers [32]

 The bottom layer (layer 1) contains existing business applications, which may originate

from different organizational domains including e.g. customer relationship management

(CRM) and enterprise resource planning (ERP) systems, legacy applications or specifically

designed object-oriented systems as well as business-intelligence applications.

 The component layer (layer 2) uses typical container-based technologies and component

implementation models. It enables distribution of functional components within the

enterprise.

Chapter 2

 Layer 3 provides the mechanism to make enterprise-scale components, business unit-

specific components and in some cases project-specific components available as services.

The interfaces are exported by means of standard service descriptions. Moreover, this layer

comprises the service infrastructure (e.g. service registries).

 Layer 4 combines services and other composite services to orchestrations or choreographies

which implement enterprise-wide or even cross-enterprise business processes. Visual

process modeling and process execution engines are used for this purpose.

 The presentation layer (layer 5) is usually out of the scope of the actual SOA. It is important

to note that generally in SOA the user interfaces are decoupled from the services. However,

it is part of the figure because recent standards such as Web Services for remote portlets

(WSRP) may indeed carry service functionalities directly to the application interface or

presentation level.

 Layer 6 (orthogonal) enables the integration of services through the introduction of reliable

and intelligent routing, protocol mediation and other transformation mechanisms, often

described as the enterprise service bus.

 Layer 7 (orthogonal) ensures quality of service through sense- and respond mechanisms and

tools that monitor the state of SOA applications.

Besides the runtime perspective focused in the above SOA layers, as well the design time of

SOA needs to be taken into account. Such a holistic perspective is provided in terms of a so

called service life-cycle or SOA life-cycle focusing less on any particular service but rather on

the entire set of service from design over implementation to usage and monitoring. Even there

exists no well-established definition of the term service life-cycle or SOA life-cycle, there is a

common understanding about the main phases to be covered in it. Nevertheless, the phases are

clustered on different granularity levels and different aspects are more or less highlighted in the

various available definitions. In [33] an overview of popular definitions of the service life-cycle

model is provided. In order to stick to a consistent perspective within this work, the following

refers to a definition provided by IBM [34]. Accordingly, the service or SOA life-cycle can be

distinguished into the following phases:

 Model – This phase is about capturing the business requirements and translating them

into business process models refined by service identification and service specification.

 Assemble – This phase is about developing reusable services and composing them into

service orchestration plans which instantiate the modeled business process.

 Deploy – In this phase the developed services and service compositions are tested and

deployed to a runtime infrastructure.

 Manage – The last phase is about maintenance, measurement and optimization of

service operations from a technical and as well from a business perspective.

In order to reflect the discussed SOA concepts within the framework of semantic

interoperability in SOA described in the next section, the following Figure 2-4 presents a

condensed SOA layer model focusing on the conceptually most relevant parts. Furthermore, to

stress that the business processes may consist of services and underlying components from

different organizational domains the corresponding layers are split as well into different

domains.

Understanding the Challenge of Semantic Interoperability in SOA

Figure 2-4 SOA Layer Model

 The business process layer describes the cross-organizational business process as a

composition of services (from an abstract perspective in terms of business process

models and from a concrete perspective in terms of a service orchestration plan).

 The service layer describes the (heterogeneous) services which provide the distinct

business functionalities.

 The business components or objects layer describes the underlying components which

realize the services implementations for the middle layer.

In the following this condensed SOA layer model is incorporated as an integral part into the

framework of semantic interoperability in SOA, which is presented in the next section.

2.5 Framework of Semantic Interoperability in SOA

Before relating the concept of the semantic interoperability gap introduced in Section 2.3.3 to

the above described SOA layer model, a connecting step in terms of further analysis of the

service artifact presented in the middle layer is elaborated. As the concept of the semantic

interoperability gap has focused on information models represented on different abstraction

levels, these representations need to be related to the service descriptions.

According to the Web Ontology Language for Services [35] which aims at providing a

specification of a service in terms of a formal ontology, the following conceptual characteristics

of a service can be distinguished:

 Inputs

 Outputs

 Preconditions

 Postconditions or Effects

In this sense, the specification of a service can be related to the mathematical concept of a

function as an abstract entity that associates an input to a corresponding output according to

some specific rule [36].

Chapter 2

In the service context the input describes information which needs to be provided by the service

user necessary to invoke the service and perform the provider‟s internal processes in order to

deliver the service. The output describes information which is generated as a result of the

provider‟s internal processes and delivered to the service user. The preconditions specify the

state of the information space of the service before its execution. Moreover, preconditions can

be represented as expressions that are required to be true before an operation can be successfully

invoked. Vice versa postconditions describe the state of the information space of the service

after the execution of the service. Postconditions can be represented as expressions that must be

true after the service has been invoked and its operation completes its execution. In the

following Figure 2-5 the service model as described above is illustrated:

Figure 2-5 Service Model

Considering instantiations of this service model in concrete IT systems, these four service

characteristics have an information representation in terms of the different abstraction levels for

information models as introduced in Section 2.3.2:

Firstly, a conceptual idea about inputs, outputs, preconditions and postconditions is existent in

the mind of an IT system designer. Later in the design process, domain specific representations

of these service characteristics can be derived and captured in a conceptual model. With regard

to preconditions and postconditions most state-of-the-art approaches limit the representation to

textual description addressing the human reader. This is due to the fact that a fully-formal

specified conceptual model that represents the preconditions and postconditions is difficult to

define on a sufficient level that enables machine interpretation. Thus, further specification and

concretization of the information model on lower abstraction levels such as the logical or

physical model is limited, too. However, some approaches in the research field of Semantic

Web services also target this aspect as further analyzed in Chapter 3.

With regard to input and output parameters, which are subject to information flow in a service

composition scenario instantiating the business process model, they can be represented in a

conceptual model addressing the domain context by relating the input and output parameters to

other domain concepts relevant in the business process. The corresponding information model

can be further specified on the logical level representing the abstraction level that is utilized

when information flow is specified in a concrete application context. In order to realize the

business functionality which is provided by the services, the business components or objects get

involved. These components process the input and output parameters and thus need to represent

the information according to their specific technical environment. Accordingly, the physical

level of the information model can be located on the business components or objects layer.

Taking into account the analysis above, the service model including inputs, outputs,

preconditions and postconditions provide a further detailed description of services in the SOA

layer model. Moreover, the service characteristics can be represented on the different

abstraction levels for describing their information models. As the SOA layer model has

distinguished between heterogeneous services originating from different organizational

domains, these heterogeneous services can be directly related to the semantic interoperability

Understanding the Challenge of Semantic Interoperability in SOA

gap describing the heterogeneous information model representations on the different abstraction

levels. Consequently, a unified model illustrated in Figure 2-6 can be derived combining the

introduced SOA layer model together with the refined service model and the model describing

the semantic interoperability gap.

Figure 2-6 Framework of Semantic Interoperability in SOA

Hence, the derived perspective on the three models constitutes the framework of semantic

interoperability in SOA as a reference point and problem description. In the following, it

provides a common ground for comparison. Consequently, approaches and technologies

presented in the state-of-the-art analysis in Chapter 3 as well as the concept for semantic

mediation between loosely coupled information models in SOA developed in Chapter 4 refer

back to this framework.

2.6 Summary and Reflection

This work addresses the problem of achieving semantic interoperability in cross-organizational

service-oriented architectures. Therefore, definitions and conceptual models covering the

different aspects of semantic interoperability have been analyzed in a first step. In order to

define the problem scope, semantic interoperability has been put into context with related

interoperability dimensions, namely technical interoperability and organizational

interoperability and overlapping issues have been identified.

An aggregation of conceptual models for semantic interoperability has been developed based on

the finding that different abstraction levels for representing information models are fundamental

for the understanding of semantic interoperability. Consequently, a model describing the

semantic interoperability gap has been derived, that demonstrates how heterogeneous IT

systems differ in their information models along different abstraction levels. Starting from the

Chapter 2

conceptual idea of information in the human mind, to the conceptual model formalizing the

domain context, over the logical model to the physical model representing the concrete

information model on the technical level, the semantic interoperability gap between

heterogeneous IT systems continuously increases. Furthermore, different fundamental

approaches for achieving semantic interoperability such as alignment of terminology or

transformation between heterogeneous representations could be mapped to the derived model of

the semantic interoperability gap.

In order to address the targeted domain of semantic interoperability in SOA, the architectural

model and basic concepts of SOA have been presented. A layer model has been elaborated

capturing the central approach of SOA that lies in the decomposition of complex business

processes into a composition of loosely coupled independently managed services providing

distinct business functionalities. The service concept has been further analyzed to link the

information model used in a service interface description to the different abstraction levels

analyzed in the model of the semantic interoperability gap.

Finally, a unified model has been derived from the analysis above describing the semantic

interoperability problem in the context of SOA. Consequently, the unified model forms the

reference framework that provides a common ground for comparison between approaches and

technologies for achieving semantic interoperability in SOA. In the following, it is referred to

this framework including Chapter 3 analyzing the state-of-the-art and Chapter 4 presenting the

concept for semantic mediation between loosely coupled information models in SOA.

Chapter 3

State-of-the-Art in SOA for Bridging the

Semantic Interoperability Gap

3.1 Overview

Having set the scope for the addressed problem of achieving semantic interoperability in

heterogeneous IT systems with particular focus on SOA in the previous chapter, the next step is

to examine state-of-the-art approaches and technologies that aim at tackling the identified

challenges. Even the selected approaches and technologies are often embedded into a broader

context; the analysis tries to focus on the aspects particularly relevant for the topic of semantic

interoperability in SOA and to limit the general aspects to the basic essentials.

In a first step, Section 3.2 reviews the idea and concepts of traditional Web services along with

its existing technology stack. Furthermore, an evaluation is performed describing the

capabilities and limitations of traditional Web services in context of the previously developed

framework of semantic interoperability in SOA (cf. Section 2.5).

After outlining the need for formally defined semantics of Web service descriptions, an

intermediate step introducing the core concepts and technologies of the Semantic Web initiative

are described in Section 3.3. The following Section 3.4 then describes how these concepts can

be applied to Web services in terms of so called Semantic Web services. Again, the previously

developed reference framework describing the semantic interoperability gap is utilized, in order

to provide an evaluation outlining the advantages, limitations and open issues of this

technology.

Additionally, a survey is carried out on relevant related areas such as semantic information

integration in distributed database systems and distributed object-oriented systems.

Finally, these traditional approaches are related to a detailed analysis of ontology-based

strategies for semantic integration including approaches where multiple ontologies are involved.

In this context, as well a number of ontology mapping approaches and exemplary tools are

investigated.

3.2 Web Services

Whenever the realization of an SOA with state-of-the-art technology is discussed, the term Web

service takes an important role as Web services represent the dominant technology for the

instantiation of an SOA. In the last decade, Web services have gained considerable popularity.

Many software vendors have adopted Web service initiatives and correspondingly provide

Chapter 3

extensive product portfolios. A large consulting market for advisory services regarding Web

service technology has emerged. Furthermore, there are many organizations which are involved

in the refinement of Web service standards. However, driven by marketing campaigns and the

related ongoing SOA and Web service hype, often the problem remains that few people seem to

actually agree on what a Web Service is [37]. The introduction in Chapter 1 has already briefly

outlined the idea behind Web services and their capabilities for ensuring semantic

interoperability. This section aims to provide a further detailed analysis and starts with

clarifying what Web services are and how they are used to build an SOA. Furthermore, this

section explains the core Web service concepts and related technologies. Finally, an analysis

about the shortcomings with regard to semantic interoperability is provided.

3.2.1 Definition and Concepts

The World Wide Web consortium defines Web services as programmatic interfaces for

application to application communication over the World Wide Web [38]. This definition states

one major aspect that Web service interaction is usually machine to machine. In the same sense

another definition states that the easiest way to describe a Web service is to say that it is done on

the Internet, using Web protocols, and it does not involve a live user operating a Web browser

[39].

The World Wide Web consortium furthermore highlights the importance of XML by defining a

Web service as a software application identified by a URI [40], whose interfaces and bindings

are capable of being defined, described, and discovered as XML artifacts. Moreover, a Web

service supports direct interactions with other software agents using XML based messages

exchanged via Internet protocols [41].

Even though there exists no uniform, consistent, standardized and official terminology for Web

services, it can be stated that a common understanding about fundamental Web service

characteristics is shared among the various actors. In the following the fundamental

characteristics that are featured by Web services are listed and briefly described [42]:

 Programmable - Web services are accessible by programmable interfaces. Web services

are used for application communication and not for human information processing. Web

services do not have a user interface.

 Self-descriptive - Web services include meta-data which can be processed during

runtime, e.g. name, description, version, quality of service etc.

 Encapsulated - Web services encapsulate independent and discrete functionalities that

perform a particular task.

 Loosely coupled - Web services communicate over messages, implementation details

are hidden to Web service providers and Web service consumers.

 Location transparent - Web services are location independent and can be accessed from

anywhere at any time only dependent on access rights of applications that consume the

Web services.

 Protocol transparent – Web services are based on the Internet protocol stack. Operations

and messages can support multiple, such as Hypertext Transfer Protocol (HTTP) or

Simple Mail Transfer Protocol (SMTP)

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

 Reusable and composable - Web services can be divided into further finer grained Web

services or multiple reusable basic Web services can be composed to a new Web

service.

The here presented characteristics are not unique to Web services but rather reflect principles

that have been adopted from previous middleware approaches. Therefore, in the following the

factors and conditions that have shaped the development of Web services as well as

fundamental Web service concepts are presented and discussed.

Evolution of Integration Middleware

In a dynamically changing and more and more global economy companies and organizations are

continuously seeking for new means to cope with competitive pressure. The need to shorten

production and development cycles, to reduce costs and time-to-market, to increase customer

satisfaction, and to rapidly adapt to market changes has historically led companies to collaborate

and to distribute their business processes.

In order to automate business processes spanning multiple administrative and organizational

domains the existing stand-alone applications had to be opened and interoperability mechanisms

had to be established. Distributed object-oriented technologies and middleware platforms, such

as the Distributed Component Object Model (DCOM), Java 2 Platform Enterprise Edition

(J2EE) or the Common Object Request Broker Architecture (CORBA), are powerful means for

the integration of applications within companies or organizations.

However, as argued already briefly in section 2.4 about SOA, for integrating systems across

organizational domains these technologies are only suitable up to a limited extent. This is due to

the highly heterogeneous environments in which prescription or shared agreement of common

object models and corresponding programming languages is not appropriate and feasible. Even

the communication between organizations that are using compatible middleware technologies is

not necessarily practicable since the underlying data transport may be blocked by security

facilities such as firewalls. Moreover, not just the transport of data but also its shared

interpretation has to be taken into account.

Taking into account the conceptual similarity to component-based approaches, which become

obvious by comparing the main characteristics of Web services, it can be stated that the basic

idea behind Web Services is not new. However, reflecting the analysis above it becomes

comprehensible that with the emergence of XML as the dominant standard exchange format as

well Web services based on XML message exchange formats and XML based interface

descriptions have taken the lead role in building applications from reusable building blocks.

Web Service Scenario

The following figure illustrates the idea and a communication scenario of Web services across

organizational domains:

Chapter 3

Figure 3-1 Cross-Organizational Communication using HTTP and XML [37]

The success of the World Wide Web (WWW) as the ubiquitous infrastructure for information

exchange has brought the idea of using the WWW also as a medium for communication

between applications based on standard Web protocols, such as the Hypertext Transfer Protocol

(HTTP) or the Simple Mail Transfer Protocol (SMTP). As most organizations are using a Web

or mail server the data transport can be handled on existing infrastructure. Moreover, in many

cases this is the only communication channel which is permitted by security policies such as

firewall configurations. On top of these transport protocols messages are defined in terms of the

Extensible Mark-up Language (XML). In order to be able to process the message content,

application A and application B share common message schemas that are e.g. based on the

XML Schema Definition Language (XSD). Based on XSD the definition of customized mark-

up language for a specific business context is possible providing a representation of structured

data in a human- and as well machine-readable manner. The transformation of XML messages

to specific programming languages and object instantiations and vice versa has to be performed

by the processing applications corresponding to their underlying platform.

Web Service Interaction Model

However, Web services are not monolithic and have to be regarded in context of a distributed

architecture. Based on the general service interaction model presented in Section 2.4 further

refinements with regard to the concrete Web service technology can be made. The following

roles for the interaction of Web services can be identified [42], whereas the concrete XML

standards used in the role descriptions are further described in the following Section 3.2.2:

 User: The user consumes the Web services based on service descriptions defined in the

Web Service Description Language (WSDL).

 Provider: The provider provides services and ensures that the services are accessible

over programmatic interfaces described in declarative Web service descriptions

(WSDL).

 Registry: The registry contains declarative Web service descriptions of various Web

service providers and their access points. It provides registry services based on

standards such as UDDI or ebXML.

It is important to note, that the roles user and provider are exchangeable. A user can act as a

retailer combining several Web services according to a business process using the Business

process Execution Language (BPEL), which then is provided as an upper level Web service in

the provider role. The following figure illustrates the Web service interaction model:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-2 Web Service Interaction Model

In an exemplary simple interaction pattern the life-cycle of a Web service can be described as

follows:

1. The service provider provides a Web service and publishes its declarative description

(WSDL) into the Web service registry using standardized registry services described in

e.g. UDDI or ebXML.

2. The potential user of a Web service sends a search query to the Web service registry

utilizing further standardized registry services described in e.g. UDDI or ebXML.

3. The Web service registry contains a categorized collection of registered, trustful Web

services which are each described in declarative Web service descriptions (WSDL).

4. After discovering the desired Web service in the Web service registry further details

about message formats and transport protocols can be gathered.

5. Based on the service description a binding to message formats and transport protocols

can be performed by the user. Then the user can communicate with the provider over

XML-based message exchange format and protocol (SOAP) and consume the desired

Web service.

Web Service Composition

As already mentioned above Web services are mostly applied as an instantiation of an SOA.

Recalling the basic idea of SOA in the context of Web services, IT systems supporting business

processes have to be split of into a set of loosely coupled reusable Web services, where each

Web service realizes one modular unit of business logic. Consequently, mechanisms are

required for the consistent and meaningful integration of Web services. This integration is called

Web service composition. In order to obtain meaningful composition results, Web services need

to be invoked in a well-defined order and they have to exchange data. The following Figure 3-3

illustrates control and data flows in the composition of services.

Chapter 3

Figure 3-3 Flow-based Web Service Composition [43]

Two different composition models can be distinguished: orchestration and choreography. There

is no well agreed common sense regarding these two definitions. Nevertheless, mostly it is

considered that in an orchestration all interactions that are part of a business process (including

the sequence of activities in particular Web service calls, conditional events such as loops etc.)

must be described like in a traditional workflow system. This description is then executed by an

adequate engine which has control of the overall Web service composition. On the other hand,

choreography is more collaborative and less centralized in nature. Only the public message

exchanges are considered relevant and moreover, each Web service only knows about its own

interactions and behavior. In contrast to orchestration, there is not an entity that has a global

view or control of the process.

3.2.2 Technologies and Standards

Accompanied by the success story of SOA in recent years a couple of standards for Web

services have emerged, which either have become an official standard or at least have the status

of a widely used de-facto standard. Aggregating the technologies into an overall picture the

following Web service stack can be derived:

Figure 3-4 Web Service Stack

As indicated by the free spaces there are still more elements that are part of the Web service

stack, such as technologies related to quality of service (Web service transactions support [44]

or security and reliability by means of encryption [45]) or service management. In the following

the most important standards are described in more detail [37]:

 Simple Object Access Protocol (SOAP) - With the Simple Object Access Protocol

(SOAP) a lightweight format and protocol for the exchange of XML messages in a

request/response-manner has been developed. SOAP holds the status of a W3C

recommendation [46]. SOAP defines a convention that can be used to represent remote

procedure calls (RPC). In the case of using HTTP as the protocol binding, an RPC call

maps naturally to an HTTP request and an RPC response maps to an HTTP response.

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Although, SOAP was intended to provide networked applications with RPC services in

XML, the interaction with a Web service is not necessarily RPC-centric but may also be

document-centric. In the former case a service is seen as a set of methods to be invoked

remotely and the messages are serializations of business objects. With document-centric

communication, however, the documents themselves are the main purpose of the

distributed computation and the services are considered as components that read, store

and produce documents.

 Web Service Description Language (WSDL) - Communication mechanisms and

message representations are not sufficient to create services. One of the most important

characteristics of a service is that it exposes a well-defined interface that describes its

functionality. This includes the description of a set of messages that the service receives

and sends, a set of named operations and, if the service is deployed, a binding to a

documented network address. The binding mechanism defines services as collections of

network endpoints or ports. A port is defined by associating a network address with a

binding. Finally, a collection of ports define a service. For describing the interface of a

Web service a specific XML language, the Web Service Description Language

(WSDL), has been developed and holds the status of a W3C recommendation [47]. A

Web service description contains definitions (data types and messages), operations and

service bindings, thus providing all necessary information for a client to interact with a

Web service.

 Universal Description, Discovery and Integration (UDDI) and Electronic Business

XML (ebXML) - There are several, mostly industrial driven, registry initiatives for

Web services, among them Electronic Business XML (ebXML) and Universal

Description, Discovery and Integration (UDDI). UDDI and ebXML provide a

mechanism for clients to find Web services. A UDDI registry contains categorized

information about services, about the businesses that offer services and about the

interfaces and communication standards that are used for conducting transactions.

Requestors can search a UDDI registry, find services based on certain matchmaking

criteria and retrieve service details, such as links to the service description (WSDL) and

the invocation address. It is important to note that UDDI does not define a specific

registry implementation but the interfaces and data structures that are used for storing

and finding services and businesses. Similarly to UDDI an ebXML registry allows

businesses to find partners, to define trading-agreements, and to exchange messages in

support of business operations.

 Business Process Execution Language (BPEL) – BPEL provides a mechanism for the

composition of Web services. The design process of such service compositions is also

called programming in the large. In order to keep the composition independent from the

underlying IT infrastructure, the exact data flow and control flow is provided in a

composition language, which can be interpreted by workflow execution engines.

Different approaches for such languages have arisen, e.g. WSFL [48] or XLANG [49].

However, BPEL, which is based on the before mentioned Web service specifications,

has been the most successful and holds the status of an OASIS standard [50]. BPEL

defines a business process as an XML-serialized description of data flow and control

flow between participating Web services and allows to run the process in a long-

running asynchronous manner. Data flow and manipulation can be expressed in XML-

related languages such as XPath [51] and XSLT [52]. In order to ease the design of

service compositions in BPEL, vendors offer a range of graphical integrated

development environments, e.g. the Oracle BPEL Process Manager as illustrated in

Figure 3-5Figure 3-5 Development Environment for Process Design:

Chapter 3

Figure 3-5 Development Environment for Process Design

However, the above technologies still exhibit fundamental limitations when it comes to

automation in the Web service life-cycle and further tool support especially with regard to Web

service composition. The composition design is still complex and time-consuming, which is due

to the shortcomings regarding semantic interoperability of conventional Web service

technology. The following section deeper elaborates on these shortcomings based on the

analysis of Web service technology above and the reference framework of semantic

interoperability in SOA developed in Chapter 2.

3.2.3 Evaluation

The fundamental characteristics of Web services have been analyzed in section 3.2.1. However,

it can be stated that the current Web service technology stack has only partially kept the promise

of enabling Web services which are truly self-descriptive, encapsulated and loosely coupled.

Limited Web Service Characteristics

Self-description of Web services is limited. Having a look at the meta-data provided by Web

services, they just allow for processing during runtime to some extent. Web service descriptions

defined in terms of the XML-based Web Service Description Language (WSDL) describe the

operations, parameters and Internet address of a Web service rather in syntactical and structured

manner. However, they are lacking any context information required for advanced automated

processing. For machines or software applications which act behind Web services the

information and description of a Web service is barely interpretable because the underlying

mark-up language XML lacks an expressive semantic background and WSDL does not define

any further semantics. Thus, the limitations of XML encoded information just allows Web

services to parse each other input and output messages and verify whether it adheres to the

expected formats, and eventually locate each piece of information within the message

parameters. But unfortunately, the cooperating Web services do not have any means to decode

the meaning of the messages on the conceptual level, in order to extract the information they

contain. Referring back to the perspective of the service model presented in the above

developed Framework of Semantic Interoperability in SOA (cf. Section 2.5), further

shortcomings adhered to the WSDL approach can be identified. Furthermore, additionally to

the limitations regarding self-description of input and output parameters WSDL-based Web

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

service descriptions lack any information about the preconditions or postconditions of the Web

service.

Figure 3-6 WSDL-based Web Service Model

Thus, cooperating Web services understand the structure of each other messages but do not

understand the content of such messages [53]. Taking this into account, the semantics of Web

service operations and data structures in corresponding messages can only be interpreted by

humans. As a consequence human interaction is necessary in order to understand what a service

does and how it can be invoked.

Therefore, encapsulation is limited, too. As the semantics cannot be exposed to the externally

available meta-data further internal information about Web service semantics is required. Thus,

the core principle of encapsulation, the hiding of internal information, cannot be assured to the

full extent as further insight into the Web service is necessary for the user in order to combine

them and create reasonable Web service compositions.

With regard to loose coupling of Web services it can also be stated that this goal has only be

achieved to a limited extent. As Web services are just enabled to parse each other‟s messages

and process their structure cooperating Web services have to rely on strictly agreed message

schemas in order to ensure sound exchange of message content. Thus, it can be stated that on

the semantic level a strong coupling is still necessary. In order to overcome this situation,

scenario specific adapters and transformations have to be integrated in the Web service

interaction by means of human intervention.

Human Intervention in Web Service based SOA-Life-Cycle

As a consequence of the analysis above it can be stated that for many tasks of the Web service

based SOA life-cycle manual efforts in terms of human interaction and collaboration is

necessary, which is time consuming, costly and error-prone. The following figure illustrates the

fields of human interaction according to the SOA layer model presented in Section 2.4:

Chapter 3

Figure 3-7 Human Interaction in Web service technology based SOA-Life-Cycle

During business process analysis and modeling process experts need to understand the business

context in which the IT-supported business process takes place. This includes a detailed

requirement analysis for the business process and specification on an abstract level of which IT

services are needed in order to fulfill the requirements. Also the information flow between the

building blocks of a business process has to be specified and modeled according to the domain

context. Based on this input created by process experts shared operation patterns and message

structures of Web services can be derived, which are fundamental requirements for their

interoperation. Due to the limited expressiveness of Web service meta-data the presented Web

service technology does not provide the means to handle the heterogeneity of Web service

properties automatically. Human intervention in terms of meetings, documents, etc. is needed to

define and agree on a common understanding of Web service properties which then can be

reflected on the technical level by cooperating service providers and service users.

On the technical level Web services need to be discovered and composed in order to realize the

desired business process. Caused by the limitations of self-description, discovery approaches

applied in UDDI and ebXML categorize Web services using external flat service classifications

in terms of so called tModels that represent taxonomies, identifier systems, etc. However, Web

service discovery is only keywords-based. As a result, this leads to low quality of the retrieved

results as keywords are often not unique and contextual information is not considered. An

analysis of different Web service discovery approaches and their limitations in quality have

been discussed in [54]. As a consequence, in practice as well for Web service discovery human

intervention is needed. Automatic discovery mechanisms just provide a first step in the process.

Additionally, human experts are involved to select or eventually further discover provided Web

services based on a shared understanding of Web service properties required by the user as well

as Web service properties of the provider.

Regarding Web service composition, BPEL as the de-facto standard allows for the design of

abstract processes. However, the activities within a process are still bound to fixed XML-based

interfaces which consequently include fixed operation patterns and message structures. In a

highly heterogeneous business environment this approach still lacks flexibility, since all

collaborating actors and corresponding systems need to adhere strictly to a previously defined

common message schema in order to ensure semantic interoperability. Composition specific

adapters and transformations have to be integrated in the Web service interaction. Data flow and

data manipulation is expressed in XML-based languages, such as XPath and XSLT. According

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

to the hierarchically tree-structured data model of XML the approach behind traditional WSDL

and BPEL-based Web service composition is mainly syntactical. Consequently, the implicit

semantics of services can only be understood by a human composer and the whole range of

composition tasks, including the selection of matching services, the control flow and the data

flow design, is a manual and recurring effort. Thus, the composition design still remains

complex, time-consuming and error-prone. The lack of explicit semantics in Web service

descriptions is an obstacle in increasing automation and further tool support in the process of

composition design [55].

Looking at Web service execution it can be stated that compared to the previous SOA life-cycle

phases the degree of automation is much higher. This can be related to the fact that during the

phases of business process modeling and Web service composition the context dependencies

and heterogeneities of the distributed business process have been already anticipated and broken

down to a concrete technical level. Hence, the execution is limited to a purely technical task

processing the instructions defined in the control flow, data flow and transformation design.

However, this also implicates that in case of even small changes in the business process the

existing execution plan becomes obsolete. As no mechanisms are incorporated on the execution

level these changes accompanied by additional heterogeneity cannot be handled on the fly and

in a preferable transparent way. Consequently, the top down approach starting from business

process modeling phase followed by Web service composition and discovery has to be iterated

again according to the SOA life-cycle.

Limitations of Underlying XML Data-Model

The above described problems leading to shortcomings regarding the automation potential in

the Web service technology based SOA life-cycle originates from the limited expressiveness of

the underlying XML languages. In Section 3.2.2 it has been pointed out that the whole Web

service technology stack is based on the markup language XML. XML provides a meta-

language to syntactically describe the structure of documents. In fact, the XML syntax is

designed for representing an encoded serialization of documents. Thus, XML has a very limited

range of expression for modeling complex information entity semantics with context awareness

in terms of constrained relationships and properties. Consequently, it can be stated that XML is

a poor language for data modeling if the goal is to represent information entities in the problem

domain such that they correspond explicitly to the user‟s or process expert‟s conceptual model

of information entities in this domain [56]. The principal constructs available in XML for

expressing relationships are limited to "containment" (hierarchy), "adjacency" (A 'followed by'

B), "co-occurrence" (if A then [also/not] B), "attribute", and "opaque reference". These

constructs are indeed useful for serialization, but are not optimal for modeling information

entities of a problem domain with sufficient expressiveness. All conceptual and relational

semantics must be mapped into syntactic structures leading to limitations in processing and

consequently the XML processor is not able to recognize their significance.

XML in the Semantic Interoperability Gap

As discussed above, XML schemas are a means to provide integrity constraints to information

sources, either documents or semi-structured data. XML schemas provide a basic vocabulary

and mechanisms for structuring information hierarchically. The tree structure of tags as

represented in e.g. the W3C Document Object Model (DOM) [57] reflects the view used for

application development. Taking the perspective of different abstraction levels for information

representation as analyzed in the model of the semantic interoperability gap in Section 2.3.3, the

abstraction level of the logical model provides a concrete view on the information space to be

used in application development. It refers to the approach of structuring information e.g. in

Chapter 3

terms of tables, columns and rows according to the relational model known from database

design. Thus, taking into account the goal of XML to structure information hierarchically and

the platform independent usage of XML in concrete application scenarios, it can be stated that

the abstraction level for the information model of XML can be located at the layer of the logical

model. The following Figure 3-8 illustrates the placement of XML and its schema definition

language in the model describing the semantic interoperability gap:

Figure 3-8 Placement of XML in the Semantic Interoperability Gap

Having identified where XML can be located in the abstraction level stack the above described

limitations of traditional Web service technology regarding semantic interoperability become

evident. The identified necessity of human intervention to handle the heterogeneities of Web

service properties can be directly linked to the required round-trip bridging the semantic

interoperability gap described in Section 2.3.3. As the underlying semantic differences are only

contained implicitly with the XML schemata describing the WSDL-based Web services,

business experts need to manually interpret the message formats to the conceptual level. Linked

to their shared understanding of conceptual ideas achieved in meetings, documents, etc.

semantic interoperability is ensured. Furthermore, in order finish the round-trip across the

semantic interoperability gap the information needs to get represented according to the XML

schemata of the receiving system, i.e. the Web service user. Finally, this round-trip needs to be

expressed in a direct transformation, e.g. in terms of XSL transformation or XPath-based BPEL

data flow manipulations, in order to be automatically executable during runtime.

With regard to transformation development and recurring manual efforts the above described

situation results in low efficiency of the SOA life-cycle and thus significantly jeopardizes the

fundamental goal of SOA to quickly and easily respond to business process changes. In order to

overcome this unsatisfying situation the conceptual expressiveness of Web service descriptions

has to be brought from an implicit to an explicit level. This means that technologies are needed

that allow for the formal definition and for the standardized exchange of conceptual descriptions

of Web services. Moreover, frameworks and tools are needed that are capable of reasoning

about the formally defined semantics.

The Semantic Web initiative defines descriptive languages for representing machine-

interpretable metadata and provides technologies with the aforementioned capabilities. As an

intermediate step, before describing how these concepts can be applied to Web services in terms

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

of so called Semantic Web services, firstly the core concepts and technologies of the Semantic

Web are described in the next section.

3.3 Semantic Web

The shortcomings regarding automatic information processing caused by the limited

expressiveness of meta-data does not only target Web services. Moreover, it is a general

problem of the World Wide Web. The World Wide Web consists of billions of Web pages and

is often described as a huge distributed knowledge base. To a large extend the information is

stored in the form of static HTML pages or in the form of HTML pages dynamically generated

from contents of databases upon request of a client. HTML, if rendered and displayed in a Web

browser, is very suitable in case the information is consumed by a human being. However, for a

computer the meaning of the data is not processable due to the lack of explicit semantics [58]. In

order to address these shortcomings, the Semantic Web initiative has evolved as a collaborative

effort led by the W3C with participation from a large number of academic institutions and

industrial research partners. The goal of the Semantic Web is to make the content of Web pages

machine-understandable and processable, in order to enable computers to perform tasks, which

require interpretation of the meaning of Web resources [59]. Thus, computers are the primary

user addressed. However, finally, by easing Web-based application development focusing on

the integration of heterogeneous information resources, the human user shall be provided with

more sophisticated means for the usage of the World Wide Web.

3.3.1 Definition and Concepts

Tim Berners-Lee, one of the main initiators of the Semantic Web vision, defines the Semantic

Web as an extension of the current Web, in which information is given well-defined meaning,

better enabling computers and people to work in cooperation [60]. This definition points out that

the Semantic Web should not be regarded as a contrary development that is detached from the

current Web but rather as complementary. Furthermore, the definition in [60] states that the

Semantic Web will bring structure to the meaningful content of Web pages, creating an

environment where software agents roaming from page to page can readily carry out

sophisticated tasks for users.

Aligned to the above definition the W3C provides a generalized definition by stating that the

Semantic Web provides a common framework that allows data to be shared and reused across

application, enterprise, and community boundaries [61]. However, the definition further

includes, that the Semantic Web is based on the Resource Description Framework (RDF) [62]

and thus refers to a concrete standardized language for expressing semantics on the World Wide

Web. That points out the strong commitment to a standards-based approach to enable the vision

of the Semantic Web.

Before further technologies and standards for the Semantic Web are presented, firstly the core

concepts are introduced in more detail in the following section.

Ontologies

As already mentioned the primary goal of the Semantic Web is to provide an extension to the

current World Wide Web by enriching its content with machine processable meaning or

semantics. Providing the means for processing the semantics of Web content to a certain extend

could enable tasks that are currently difficult to do, such as locating content, collating and cross-

Chapter 3

relating content or drawing conclusions from information found in two or more separate sources

[63].

In order to enable machines to process Web content with regard to its meaning, the content

needs to be expressed in machine understandable ontologies. In this sense, ontologies can be

defined as formal and explicit specification of a shared conceptualization of a domain [64]. This

means that an ontology defines a conceptual model of a domain that ideally represents an agreed

consensus among involved actors. On the one hand, the formal and explicit manner ensures that

the so modeled meaning can be processed by machines. On the other hand, the shared aspect

ensures a commonly accepted understanding based on consensual terminologies, so that the

modeled meaning is processed the same way anywhere. Thus, ontologies interweave human

understanding of symbols with their machine processability [65] and consequently enable to

bridge the gap between the real world and IT systems [66]. In more detail ontologies consists of

the following elements:

 individuals or instances - are the base components of ontologies and represent concrete

or abstract objects (also referred to as A-Box containing assertional knowledge)

 classes or concepts - represent sets of individuals and can be considered as types (also

referred to as T-Box containing terminological knowledge)

 properties - represent characteristics of individuals and concepts

 relations - individuals, concepts, and properties can be related to each other expressed

by properties

 rules - formulate statements about individuals, concepts, properties, and relations

dependent on other statements

Some traditional approaches regard rules separate from ontologies. But since the mathematical

formalization of ontologies in terms of description logics, this distinction becomes obsolete and

rules, as well based on description logics, become an essential part of domain

conceptualizations. The here referred description logics are a subset of predicate logic. I.e. the

ontology elements discussed before are represented as predicates and logic operators within

formulas, e.g. unary predicates for atomic concepts and binary predicates for atomic relations

[67]. Description logics are aimed at being tractable on the one hand but keeping a high degree

of semantic expressiveness on the other hand. Therefore, description logics are designed to be

decidable in contrast to their superset predicate logic, which is undecidable [68].

Thus, knowledge modeling gets a solid mathematical foundation. Subsequently, this formalism

enables machines to interpret or reason over knowledge representations. However, there is a

trade-off between semantic expressiveness and computational complexity of reasoning and thus

many different variants of description logics have emerged [69].

From Knowledge Based Systems to Semantic Web

Having modeled content in that manner, knowledge based systems using inference engines and

reasoners, as illustrated in Figure 3-9, can query and process the content as a knowledge base.

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-9 Typical Knowledge Representation System based on Description Logics [70]

The vision of the Semantic Web is about applying these concepts to the World Wide Web and

using it as a huge knowledge base enabling powerful knowledge-based applications.

Consequently, reasoning has to be realized on a partial and incomplete knowledge base. This

background has yield to the concept of open-world semantics in contrast to closed-world

semantics used in the context of knowledge-based systems, which perform reasoning on a

complete and closed knowledge base. The concept of open-world semantics assumes that the

absence of information about a fact does not indicate that this fact is false. Hence, it is possible

to reason over a dynamic knowledge base without generating contradictions.

This idea of the Semantic Web has received high interest in academia and industry, which has

led to the formation of a steadily growing, international research community. Consequently, a

wealth of work has been produced that mainly covers [71]:

 formal ontology languages (e.g. [72]) and efficient reasoning techniques (e.g. [73], [74])

 ontology management technologies (e.g. [75]), which cover methodologies and tools for

ontology engineering (e.g. [76]), scalable ontology repositories (e.g. [77]), and techniques

for ontology versioning and evolution support (e.g. [78]);

 ontology-based data integration (e.g. [79], [80]);

 several applications for Semantic Web technologies (e.g. [81])

The latter two are further discussed as they target the specific context of this work. Furthermore,

Semantic Web services as an application of Semantic Web technologies are described in Section

3.4 and ontology-based data integration is presented in Section 3.6. Common to all areas of

Semantic Web research and related industry activities is their commitment to a strong standards-

based approach led by the W3C. The corresponding technologies and standards are described in

the following section.

3.3.2 Technologies and Standards

The W3C has released several standards to realize the Semantic Web vision as illustrated in

Figure 3-10. In the following the Semantic Web stack which is also referred to as the Semantic

Web layer cake including the most important standards and basic technologies is described in

more detail:

Chapter 3

Figure 3-10 Semantic Web Stack

 Unicode and URI - The foundation of the Semantic Web stack is built by means of a

standardized encoding of data (Unicode), which joins different character sets to one

international character set together with the Unified Resource Identifier (URI) standard,

which allows the identification of any resource in the Semantic Web. A Uniform

Resource Locator (URL) is a specific type of a URI and identifies a resource that is

retrievable over a network. However, it is important to note that URIs are not only used

for identifying resources that are retrievable on the Web, but they can also reference any

other resource whether it is a concrete object such as Web page or any abstract concept.

 XML, Namespaces and XML Schema - XML is used as a universal format for

message exchanges (cf. Section 3.2.2). XML enables the structuring of data through

opening and closing tags, which eases the structured processing by parsers. Tag names

can be specified in different namespaces (NS) to avoid name collisions. The underlying

structural model of XML is hierarchical and thus an XML instance can be regarded as a

tree. XML schema allows to specify grammars to define how the different tags can be

structured.

 RDF - The Resource Description Framework (RDF) provides a mechanism to make

statements about data. These statements - also called triples - consist of subject,

predicate, and object. The subject is the resource described, the predicate is a property

of the subject, and the object is the value of the property. A set of statements spans a

unidirectional graph, which is also referred to as the RDF-Graph. Conceptually, RDF is

based on the semantic relational data model [82]. For example, it can be stated that

“Nils is the author of this thesis”. As RDF, among other notations, can be serialized in

XML, the statement above could then be represented as the following:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-11 XML Serialization of RDF

 RDF Schema - All important resources in the Web should be identified by an URI, so

that statements can refer to it. Objects can be resources as well as empty nodes or

literals. Empty nodes represent anonymous resources that do not have a URI and are

just used to connect other parts of the graph, e.g. sub-properties of an aggregated

concept. In order to define vocabularies for these statements, it is possible to define

classes of resources and properties that the class members share. Furthermore, these

predicates defined in vocabularies can be referred to by namespaces. These vocabularies

are specified in RDF Schema, which is similar to an ontology definition language but

less expressive and less formalized. RDF Schema builds on top of RDF, i.e. it is based

on the RDF data model and a set of standard properties and resources to create simple

domain-specific vocabularies. For example, RDF Schema allows to define classes in

terms of using inheritance and properties specified with their domain and range. The

detailed specification of RDF Schema can be found in [83]. Basically, the RDF Schema

meta-data model is very similar to the meta-data model of object oriented programming

languages. Accordingly, it is possible to define classes, i.e. sets of individuals that have

shared characteristics. However, the meta-models are distinct with regard to a

significant difference. In object orientation a class is defined listing the properties that

the instances of this class share. In contrast, in RDF Schema the properties take the

central role. Properties are independent concepts as well defined outside of class

definitions, which are described in terms of the classes which they apply to.

 OWL - The ontology layer is based on top of the RDF layer. While RDF Schema

provides efficient reasoning complexity, its semantic expressiveness is relatively

limited. In order to model complex ontologies which go beyond simple classifications

or typed hierarchies, more expressive language features such as additional relations

between classes (e.g. equality or disjointness), enumerated classes and cardinality

constraints are needed. With the Web Ontology Language (OWL) the W3C has

specified a widely accepted language to define more expressive ontologies that keep the

trade-off between rich expressiveness and computational efficiency. OWL is based on

former ontology projects namely the DARPA Agent Markup Language [84] and the

Ontology Inference Layer [85] and uses the RDF syntax as well as most of the RDF

Schema constructs. However, in contrast to RDF Schema, OWL allows for defining

cardinality constraints on properties. Moreover, boolean combinations of class

expressions, such as intersections, unions and universal quantifiers as well as existential

quantifiers can be used to define restrictions on how properties are used by instances of

a class. In order to address the trade-off between expressiveness and reasoning

efficiency, OWL contains three decreasingly-expressive sublanguages: OWL Full,

OWL DL and OWL Light. With regard to the concrete application context of the

semantic mediation toolkit developed in this work (cf. Section 6.2.2) the usage and

Chapter 3

adequacy of the three sub-languages is further discussed. Together with RDF and RDF

Schema OWL belongs to the key standards of the current Semantic Web.

The layers discussed so far represent the current state of the Semantic Web research that has

reached a clear conceptualization, whether the upper layers represent the future requirements for

the Semantic Web and are much more under construction.

 Logic – The logic layer covers technologies and methods for inferring facts that are not

explicitly stated. Currently, the main debate is focused on the logic layer and how to

integrate rules to make OWL more expressive. Several candidates for rule extension of

OWL have been submitted to the W3C, whereas the Semantic Web Rule Language

(SWRL) [86] and the Rule Interchange Format (RIF) [87] has received much attention

recently [88]. In that context a debate has arisen, whether this extension by rules should

be realized by means of splitting the Semantic Web stack with rules and OWL

ontologies sitting side by side on the same level on top of an extra intermediate layer.

The purpose is to allow closed-world semantics as an alternative to open-world

semantics exposed by OWL. With respect to compatibility to the existing languages

namely RDF and OWL, [89] presents an approach that allows for forms of closed-world

assumption by remaining the stack architecture.

 Proof and Trust - Besides the logic layer, the far reaching vision is to enable heuristic

engines which can proof whether a statement is correct or wrong based on ontologies

and rules queried from the Semantic Web. The proof layer should enable tracing and

explaining the logical reasoning steps, i.e. explaining why a particular conclusion has

been reached. Furthermore, it is aspired to create a so called Semantic Web of Trust by

utilizing authentication mechanisms including signing assertions based on digital

signatures. Thus, a given statement can be referred to a specific person or author.

Moreover, trust relations including transitive relations over multiple actors can be

reasoned, in order to transfer trust characteristics and achieve network effects.

3.3.3 Evaluation

The Semantic Web is still under construction and an ongoing process. As discussed above the

Semantic Web stack still has to be fully realized. Furthermore, limitations of the core Semantic

Web languages RDF and OWL have been investigated theoretically and through study of

ontology applications as they are being created and used in practice. A wide variety of issues

have been identified ranging from required enhancements to expressiveness including e.g.

extended data types and qualified number restrictions to major additions such as the

incorporation of temporal concepts [90], semantics of geospatial data [91] or the reflection of

uncertainty in terms of fuzzy logic related aspects [92]. Some of the most common limitations

have already been addressed in implemented systems with language extensions where these

extensions are well understood, and efforts have already begun with a view to standardization of

so called low-hanging fruits [93]. In this context as well a new version of OWL namely OWL2

has been developed [94].

One particular relevant conceptual problem with regard to this work concerns the Semantic Web

inherent notion of open world semantics. Open world semantics are reflected in standardized

Semantic Web languages such as OWL and implemented in corresponding inference engines.

The underlying concept of the open-world assumption is designed for reasoning on a partial and

incomplete knowledge base. This notion is well suited for the general idea of the Semantic Web.

However, when applying it to the context of SOA, as targeted in this work, the open world

assumption might not be the optimal choice. Even, the addressed information space in SOA

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

landscapes is heterogeneous and distributed still in most scenarios the set of involved services is

self-contained and hence the inferencing process is performed on a stable and complete

knowledge base. Thus, with regard to SOA closed world semantics would be a more suitable

approach. This fundamental difference points out that the applicability of the current Semantic

Web standards as a general technology for information integration might lead to some

drawbacks in a particular context such as SOA. As presented above (cf. Section 3.3.2) the

support of closed world semantics besides open world semantics is subject to the ongoing

discussing regarding the further realization of the Semantic Web stack.

Besides these outlined conceptual shortcomings practical limitations can be identified with

regard to performance of inference engines and a general immaturity of the required technical

infrastructure. RDF and OWL stores are still slower than optimized relational databases, in

particularly with growing difficulties when the technology has to scale up. However, the

performance is improved steadily [95]. Thus, it can be stated that gaps in standards and

implementations still exist and adoption is limited by typical problems with early technologies.

This includes the requirement for a critical mass of practitioners and corresponding running

applications. Semantically annotated information sources applying RDF or OWL are still rare in

the WWW. Moreover, different domain fields have developed and are using ontologies of their

own. Ontologies are typically developed by domain specific expert groups without much

systematic collaboration with other fields. When using such isolated ontologies in cross-domain

environments still semantic interoperability problems arise. As a result, there is the danger that

the global Semantic Web will not emerge but rather a set of isolated mutually incompatible

Semantic Web islands may arise [96]. This issue targets the problem of ontology integration and

in particular the fields of ontology matching, ontology alignment and ontology mapping, which

are further discussed in Section 3.6. However, anticipating such ontology integration

opportunities points out the potential that lies in the evolving Semantic Web islands and hence

when linked together they can contribute to a step by step realization of the Semantic Web

vision.

Having outlined the basic concepts and standards of the Semantic Web initiative the following

section will describe how these concepts are applied to Web services in terms of so called

Semantic Web services.

3.4 Semantic Web Services

In consideration of the shortcomings of traditional Web services discussed in Section 3.2.3, the

idea of bringing implicit service semantics to an explicit level has arisen. By providing machine

understandable Web service descriptions with formally defined semantics powerful inference

engines and matchmaking mechanisms can be enabled, in order to automate the whole

composition process including discovery, composition, execution and interoperation of Web

services. After providing a definition of what Semantic Web services are in terms putting them

into the context of Web services and Semantic Web technologies, the basic ideas and concepts

of Semantic Web services are introduced. Furthermore, the most dominant Semantic Web

service technologies and standards are outlined. Moreover, the placement of these technologies

and standards in the framework of semantic interoperability in SOA (cf. Section 2.5) focusing

particularly on the semantic interoperability gap will be analyzed with regard to advantages and

shortcomings.

Chapter 3

3.4.1 Definition and Concepts

There exists no universally established definition for Semantic Web services. However, there

seems to be a general shared understanding of the basic concepts and characteristics of Semantic

Web services. For example in the context of the W3C Semantic Web service interest group it is

simply stated that the integration of Semantic Web technologies to Web services constitutes

what Semantic Web services are [97]. Other definitions further include the purpose of Semantic

Web services and state that they are extended Web service descriptions with rich semantic

annotations and upon this, provide inference-based techniques for automating the usage of Web

services [98]. Hence, it can be summarized that the basic idea of Semantic Web services is

about applying Semantic Web technology to Web services in order to combine flexibility,

reusability, and universal access of Web services with the power of semantic markup and

reasoning [99]. Considering Web services as the dynamic part of the World Wide Web and the

Semantic Web as an extension of the current static Web, consequently Semantic Web services

can be defined as the combined evolution of these two dimensions. The corresponding

classification of Semantic Web service concepts is illustrated in the following Figure 3-12.

Figure 3-12 Classification of Semantic Web Service Concept [99]

The long term vision behind Semantic Web services is to enable dynamic goal-based service

composition and to use powerful inference engines and matchmaking mechanisms, in order to

automate the whole Web service life-cycle including discovery, composition, execution and

interoperation of Web services. On the one hand, the research background comes from the

Semantic Web community (cf. Section 3.3). And on the other hand, there is also an influence

from the research field of dynamic planning in artificial intelligence research, in particular with

regard to goal-based discovery and composition with is further discussed in Section 3.4.3.

Generic Web Service Ontologies and Domain Ontologies

The basic idea behind Semantic Web services is to use ontology languages such as OWL in

order to create machine-understandable Web service descriptions. This can be achieved by

enriching Web service descriptions with so called upper ontologies in terms of specific

Semantic Web service ontologies. Such ontologies define generic concepts for the description of

Web services as illustrated in Figure 3-13:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-13 Exemplary Web Service Ontology [100]

Such generic Web service concepts then can be further refined in terms of domain specific

ontologies in order to describe a concrete Web service in a particular application domain.

Building Blocks of Semantic Web Services

In this sense, the following aspects regarding the formalization of Web service descriptions can

be distinguished [101]:

 Semantic Model - A semantic model is a set of machine-interpretable representations

used to model an area of knowledge or some part of the world. As outlined above the

semantic model consists of a generic or upper semantic model on the one hand and of

an application domain specific semantic model on the other hand. Such semantic

models are represented in terms of ontologies that embody some community agreement

regarding terminology and conceptualization. They are combined with a formal

representation based on description logics that consequently allow for advanced

information processing.

 Concept – A concept is an element of a semantic model. Accordingly, the concepts are

the building-blocks for the Semantic Web service description. They cover functional as

well as non-functional service aspects including the description of message parameters

as well as references to underlying technological bindings of abstract Semantic Web

service descriptions or functional descriptions in terms of specifications of pre- and

postconditions.

 Semantic Annotation - A semantic annotation can be contained in a Web service

description or provided in an additional document that relates to or defines the concepts

of a semantic model that are used to describe the Web service. Accordingly, a semantic

annotation can be regarded as the framework for the above mentioned aspects. In

contrast to traditional Web service descriptions which rely on XML, ontologies are used

as the meta-data model for describing Semantic Web services. Furthermore, semantic

annotations provide references to the underlying technological binding, i.e. how the

Web service is realized and how to access it during runtime. Often this can be a

traditional Web service that exhibits XML entities for describing the Web service.

Chapter 3

Additional Semantic Layer on top of Traditional Web Services

Semantic Web services cannot be regarded as a technology without any path dependency. The

success story of SOA with its dominant instantiation in terms of XML-based Web services

requires the concept of Semantic Web services to reflect this layer of existing technology.

Therefore, the explicit semantic annotation of Web services has to be provided in terms of an

additional semantic layer on top of traditional XML-based Web service technology.

Accordingly, the so called grounding of Semantic Web services by traditional Web services

ensures the reuse of existing technology instead of requiring a re-implementation. The following

figure illustrates the generic grounding idea. A more detailed and technical analysis with regard

to different realizations is given in Section 3.4.2.

Figure 3-14 Generic Semantic Web Service Grounding

On the one hand, the service descriptions are lifted on the ontology level, which provides the

foundation for machine interpretation and service life-cycle automation. On the other hand, the

semantic annotations provide a bidirectional mapping that includes also the lowering between

the OWL meta-data model and the XML Schema meta-data model exhibited by the underlying

existing pure XML-based Web services. This description of the mappings between the meta-

data models allows a Semantic Web service consumer to derive the corresponding technical

representation for the Web service message parameters during runtime. Such a bidirectional

mapping is not a trivial task. As outlined above this mapping has to bridge the gap between two

different abstraction levels regarding meta-data models. Taking also into account that the

different abstraction levels furthermore feature different expressiveness, for example with

regard to polymorphism, it requires to ensure that the advantages introduced by the semantic

layer are retained. This aspect is further discussed in context of the realization of the approach

presented in this work that applies Semantic Web service technology on top of existing XML-

based technologies to address the semantic interoperability challenge in SOA (cf. Section 6.4).

Based on the conceptual approaches presented above the goal of Semantic Web services, i.e. the

provision of machine-understandable Web service descriptions can be achieved. Thus, it can be

stated that Semantic Web services fulfill the fundamental SOA promise of providing self-

descriptive services from the machine perspective. The following Figure 3-15 illustrates the

machine-based interpretation of Web services.

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-15 Machine-based Interpretation of Web Services

On the one hand, the advanced expressiveness of Web service descriptions eases the SOA life-

cycle tasks for process experts. The formal character of conceptualizations facilitates a

consistent interpretation of service specifications and thus promotes a direct transfer of

formalized requirements to application development and Web service implementation. On the

other hand, the mathematical foundation of description logic-based Web service not only

enables machines to read service description as known from traditional Web services but

moreover machines are provided with the means to interpret the content according to the

formalized conceptualization. This facilitates the automation of the SOA life-cycle and further

tool support for Web service usage especially with regard to formal business process modeling,

Web service composition, service enactment and semantic integration of Web services.

Semantic Integration with Semantic Web Services

As already discussed data or more precisely information integration is not only a structural but

mainly a semantic problem (cf. Section 2.3). Defining explicitly and coherently the semantics of

information entities is crucial for meaningful integration results. This applies either to humans

with regard to agreement processes as well as for machine based processing. Accordingly, the

central idea of utilizing the Semantic Web service approach for semantic integration in SOA lies

in the mapping of different heterogeneous Web service descriptions to one coherent shared

formal conceptualization in terms of an ontology. The following figure demonstrates how

Semantic Web services are applied in a typical semantic integration scenario based on two

different and heterogeneous traditional XML-based Web services.

Chapter 3

Figure 3-16 Semantic Integration with Semantic Web Services

Firstly, domain experts of the two organizations develop a shared formal conceptualization that

captures the requirements for information models of both organizations. The shared

conceptualization is then formalized in an ontology language such as OWL. Existing WSDL-

based Web services, which describe their parameters and message parts in terms of XML

schemas, are mapped to the shared ontology by bidirectional lifting and lowering

transformations between the meta-data models. Moreover, the shared ontology can be used to

derive message schemas for new Web services.

Before the above presented approach is put in context to the model describing the semantic

interoperability gap, it is further analyzed how different technologies and standards instantiate

the concept of Semantic Web services. Then, from a combined perspective including technical

and conceptual issues, the identified advantages and shortcomings for bridging the semantic

interoperability gap in SOA are discussed.

3.4.2 Technologies and Standards

Many approaches for semantic enrichment of service descriptions have arisen with sometimes

overlapping or opposing concepts. In the following, existing frameworks that instantiate the

outlined approach above and define comprehensive specifications for semantically describing

Web services are examined. The aim is to give an overview of the technologies and standards

that most of the research in the field of Semantic Web services is based upon. The most relevant

approaches have been submitted to standardization bodies. In the following five different

specifications that have been published by the W3C in recent years are described:

 Web Ontology Language for Services (OWL-S) [102]

 Web Service Modeling Ontology (WSMO) [103]

 Semantic Web Services Framework (SWSF) [104]

 Web Service Semantics (WSDL-S) [105]

 Semantic Annotations for WSDL and XML Schema (SAWSDL) [106]

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Web Ontology Language for Services (OWL-S)

OWL-S is an OWL-based Web service ontology, which supplies a core set of markup- language

constructs for describing the properties and capabilities of Web services in a machine-

interpretable form. OWL-S markup of Web services facilitates a higher degree of automation of

Web service tasks, such as Web service composition, execution and interoperation [107]. It can

be stated that OWL-S is the most mature and most widely deployed comprehensive Semantic

Web service technology [108], which is reflected by the availability of a large number and

variety of tools. As OWL-S is applied in the prototypical toolkit of this work, consequently

OWL-S is described in more detail.

In particular, OWL-S is an upper ontology for services. It is structured into three

complementary parts, which are further illustrated in the following figure:

Figure 3-17 Top Level of OWL-S Service Ontology [107]

OWL-S specifies that a service can have multiple service profiles that are used for advertising

and discovering services. Furthermore, a service can be described by at most one service model

that provides information about service operation and their dependencies. Finally, a grounding

has to be specified which describes how to access and invoke the service and its binding to a

traditional Web service.

More detailed the service profile describes what the service does to be used by service

requesters for discovering or directories to categorize advertised services. The service profile

consists of three pieces of information. The first is about the name of the Web service, its

provider including contact information and a natural language description. Furthermore and

most important, the service profile includes the functional description of a service. It consists of

a description of input and output parameters by means of relating them to OWL concepts from

domain ontologies. Additionally, it describes preconditions required by the service and its

expected effects according to the Web service model presented in Section 2.5. The conditions

are represented by logical formulas specified in terms of OWL-based concepts that model

expressions. These specific concepts are defined in the OWL-S ontology and further include

constructs of a rule language, namely the Semantic Web Rule Language (SWRL) [109], as

OWL itself does not completely provide the necessary language constructs to model logical

expressions. Finally, it is possible to describe various non-functional properties, e.g. quality-of-

service ratings or response time information in terms of OWL concepts.

Once a service has been discovered and selected the service profile is not used anymore.

Subsequently, the service model is processed. It specifies how to interact with the service by

presenting the possible service interactions and their dependencies to be conceived as a process.

Chapter 3

The service model can either consist of an atomic process or a composite process. An atomic

process expects one message and produces one message, whereas a composite process builds

upon several atomic processes that can expect different messages over time depending on before

received messages. Thus, by describing the service model in terms of a composite process a

stateful service is described. The different dependencies can be expressed by various control

constructs which specify the message flow. In order to make it possible for the service client to

interact properly with the service, the service model also presents input, output, precondition,

and effect descriptions (IOPE) for each atomic process as specified in the profile.

Finally, the grounding of a service specifies how to access the service in terms of protocol,

addressing and message formats. Furthermore, the service grounding needs to deal with the

mapping of abstract input and output parameters of atomic processes to concrete messages

processed by a concrete Web service realization. The default mapping is the WSDL grounding

mechanism. However, different mappings are possible. An OWL-S service can be bound to a

concrete WSDL-based Web service by means of mapping OWL-S atomic processes to WSDL

operations and OWL-S input and output parameters to WSDL messages. However, as message

parts in WSDL are specified using XML Schema by default and parameters in OWL-S are

expressed in terms of OWL classes, this mapping task becomes complex because XML Schema

cannot express the description logic based semantics of OWL classes (cf. Section 2.3.2).

Therefore, an OWL-S service grounding provides an XSLT-based mapping mechanism. The

grounding transforms OWL instances serialized in RDF/XML into corresponding XML

instances that are structured according to given XML Schema types. This transformation has to

be performed for service inputs and vice versa respectively for service outputs. But similar to

XML Schema, XSLT is based on XPath and therefore conceptualized on a completely different

abstraction level. Accordingly, it cannot capture the semantics of OWL and has to handle the

OWL individuals on a syntactical level. Thus, the successful mapping demands for complicated

XSLT scripts specific for each RDF/XML serialization as different types exist. This can be seen

as a shortcoming of the grounding mechanism or a lack of appropriate transformation languages

which are able to capture both the meta-data model of tree-based syntactical XML entities and

the meta-data model of OWL ontologies. However, to sum up, by providing the three ontology

parts for specifying a service profile, service model, and service grounding OWL-S enables

explicit semantic enrichment of traditional existing WSDL-based Web services without any

impact or necessary changes to the underlying implementation.

Web Service Modeling Ontology (WSMO)

WSMO shares the vision with OWL-S, but it differs much in the approach for achieving it.

WSMO is an alternative approach, which is not built on the W3C standard OWL. Furthermore,

in contrast to OWL-S it does not define explicit service ontologies, but it provides a conceptual

framework where ontologies can be specified in. One possible instantiation for the WSMO

framework is given by a corresponding specification language called WSML [110]. Moreover,

under the WSMO umbrella a Web Service Execution Environment named WSMX [111] has

been developed as a reference architecture and implementation. In general the WSMO

framework defines four top-level notions as illustrated in the following Figure 3-18:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-18 WSMO Top Level Notions [103]

 Ontologies define formally specified domain knowledge and terminology, whereas

other WSMO elements are making use of these domain specific ontologies.

 Goals describe objectives that clients want to achieve by using Web services.

 Web services are defined in terms of semantic descriptions about functional capabilities

and the usage in terms of an interface.

 Mediators provide means for resolving potentially occurring heterogeneities.

In contrast to OWL-S, the WSMO approach does not purely focus on semantic annotations of

Web services in terms of providing ontology concepts for functional and non-functional service

properties but it propagates a goal-based approach for the usage of Semantic Web services. The

idea is that clients formulate requests in terms of goals, which formally describe the objective to

be achieved while abstracting from technical details. The system then automatically detects,

eventually composes and executes the suitable Web services in order to solve the goal [112].

This goal-based approach has its research background in the field of artificial intelligence and

planning algorithms in particular. The problem domain is represented as states, whereas states

can be expressed in terms of logical axioms. Then Web services can be regarded as state

transition operators. Based on conditions, effects and goal planning algorithms a path can be

derived from the initial state to the goal state based on backtracking or forward chaining

expression provers known from declarative programming [113]. The applicability of this goal-

based planning approach in SOA is discussed in the next Section 3.4.3 about the evaluation of

Semantic Web services approaches.

Additionally, WSMO includes a mediator concept to deal with the interoperation problems

between Semantic Web services. WSMO defines specific mediator services which perform

translations between ontologies that describe input and output parameters as well as goals.

Accordingly, the idea of mediators is to overcome heterogeneous resource descriptions by

resolving incompatibilities on the data level on the one hand and on the process level on the

other hand. Mediation on the data level mainly covers the integration of different terminologies.

On the process level mediation is performed by aligning heterogeneous interaction patterns

between different Web services, e.g. by splitting or grouping messages or by changing their

order. WSMO foresees four kinds of mediators [114]:

 OO Mediators for ontology mediation,

 GG Mediators for linking goals,

 WG Mediators for linking Web services to goals and

 WW Mediators for enabling interoperability between two heterogeneous Web services.

The idea of the mediator approach is as follows: Mediator services transform instances of

service parameters or (sub) goals from a source to a target ontology. Then, the mediator services

Chapter 3

are integrated as common services into the planning process, whereas the planning algorithm

automatically ensures that mediator services are selected between the interoperation of

heterogeneous Web services including mediation between (sub) goals. However, the main

purpose of mediator services is to reconcile the differences between goals of Web services and

their usage is inherently dependent on the integration into a planning process. Thus, it can be

stated that it may be difficult to map this approach to non-planning oriented problems that

purely focus on Web service interoperation, e.g. in process modeling, service composition and

invocation [115].

Semantic Web Services Framework (SWSF)

SWSF is a further alternative approach likewise not build on OWL. The major contribution of

SWSF is a rich behavioral process model based on the Process Specification Language (PSL)

[116]. It aims at being a comprehensive framework that spans the full range of Semantic Web

service related issues including orchestration and mediation. However, the design of the

orchestration concept focuses on automated planning as well as the mediation concept, which

therefore readopts the goal-based approach similar to the mediator concept in WSMO.

Web Service Semantics (WSDL-S)

In contrast, WSDL-S is a light-weight approach for Semantic Web service description. Instead

of defining a comprehensive framework for Semantic Web services, WSDL-S defines inline

extensions to WSDL in order to semantically annotate XML data types as well as messages and

operations in WSDL descriptions. It externalizes the ontology language representation in terms

of specific tags that refer to the semantic annotations and thus allows the binding to OWL. In

particular, WSDL-S defines three types of annotations [117]:

 WSDL types, i.e. WSDL entities specified in terms of XSD types, are referenced to

concepts of a domain ontology, including a mapping description between XSD types

and the corresponding semantic model concepts.

 WSDL operations can be described by preconditions and effects in terms of referencing

to respective expressions.

 Categorization information about Web services can be defined on the basis of an

ontology.

Semantic Annotations for WSDL and XML Schema (SAWSDL)

SAWSDL follows the pragmatic approach of WSDL-S and provides simple semantic

annotations within traditional WSDL-based Web service descriptions. While the above

presented approaches have been published as W3C member submissions, SAWSDL is the only

official W3C technology recommendation for Semantic Web services. The following figure

illustrates the annotation of WSDL documents with additional tags that reference to a domain

ontology:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-19 SAWSDL Overview [118]

According to the general Semantic Web Service grounding approach as well SAWSDL consists

of two parts:

 Schema mappings between XSD typed XML instances and domain ontology

individuals.

 Model references that point to a concept in a semantic model covering interface

categorization, operation functionality, fault meaning and data type or element

correspondence in an ontology.

Thereby, it is important to note that SAWSDL limits the annotation by references to only

ontology concepts, so that the definition of preconditions and effects in terms of logical

expression is not supported. This underlines the light-weight approach of SAWSDL in contrast

to goal-based approaches such as WSMO or SWSF where specification of logical expressions

for preconditions and effects is essential. Furthermore, this limitation demonstrates that the

standardization at the W3C follows a non-planning approach to Semantic Web services

focusing on pragmatic approaches.

3.4.3 Evaluation

A comparison of the presented Semantic Web service technologies and frameworks reveals the

following commonalities and differences: The first approach, OWL-S defines a description

model for Web services that includes a formal description of interfaces regarding input and

output parameters as well as formalization of preconditions and effects. It uses OWL as the

specification language and hence it is compliant with the W3C standards for the Semantic Web.

In contrast WSMO and SWSF are not built upon OWL and apply specific ontology languages

that aim at more sophisticated service interaction descriptions. In particular, WSMO and SWSF

propagate a goal-based approach along with artificial intelligence-based planning algorithms,

which goes beyond the basic idea of annotating Web services. WSDL-S and its successor

SAWSDL take a step back and focus again just on semantic annotation of Web services and

thus can be regarded as light-weight approaches. Furthermore, WSDL-S and SAWSDL

strongly rely on existing Web service standards, namely WSDL, and rather focus on its

Chapter 3

extension than providing an alternative conceptual framework. Moreover, SAWSDL is the only

approach standardized as a recommendation by the W3C for the semantic annotation of Web

services.

Suitability of Goal-based Planning in Service-Oriented Architectures

Regarding the concept of goal-based planning applied to the Web service life-cycle in particular

in the context of discovery, selection and composition its practical limitations have already

briefly been discussed above. In general, the user of a Web service might have an overall goal

he wants to achieve by using a Web service or a composition of Web services. In order to

formalize the overall goal as required for the goal-based planning algorithms as well the

problem domain needs to be formalized. This does not only comprise an ontology

conceptualizing the domain but as well a formalization of the current state of the problem

domain and all possible states it may take. Then a decomposition of the overall goal into a

combination of formalized sub-goals referring to the corresponding states of the problem

domain is required. Based on this formalization of the goal a reasoner then can be enabled to

derive the corresponding selection and composition of Web services that can be invoked in

order to achieve the desired overall goal. However, if the user has to perform this goal

decomposition on a relatively fine-granular level to apply goal-based inferencing techniques for

planning, it remains unclear whether this approach provides any advantages compared to

manual plan creation performed directly by the user. Furthermore, the additional efforts for the

identification and formalization of possible states of the problem domain have to be considered

as well in this comparison. Moreover, classical planning problems assume complete knowledge

about the problem domain including narrow and deep formalization of possible states. In

contrast, distributed and heterogeneous SOA landscapes cover a broad problem space and do

not allow for deep formalization of all possible states including non-deterministic side-effects

[119]. Thus, it can be stated that although Web service discovery, selection and composition

problems resemble planning problems, it does not seem suitable to apply goal-based inferencing

techniques known from artificial intelligence research to them [120]. Accordingly, M. Stollberg

concludes that the employment of the goal-based Semantic Web service approach requires a

comprehensive re-design of an SOA system [121]. Moreover, J. Hendler states that even the

best works in this area assume non-realistic simplifications that generally twist Web services

into a planning framework [122].

Semantic Web Services for Bridging the Semantic Interoperability Gap

Having outlined the limitations of goal-based planning in SOA also the proposed concept of

WSMO mediators has to be analyzed in this context. Mediator services as proposed in the

WSMO framework directly address the heterogeneity of metadata formalization and thus the

problem of semantic interoperability as targeted in this work. As described above they are

integrated as common services into the planning process, whereas the planning algorithm

automatically ensures that mediator services are placed between the interoperation of

heterogeneous Web services. However, taken such mediator services out of the context of goal-

based planning they just provide transformation services between parameters described by a

source ontology to parameters described by a target ontology. Accordingly, in the absence of

automated planning they might only be usable as basic data transformation services during

runtime of Web service compositions, which furthermore have to be manually integrated and do

not provide adequate support or added value at design time. However, as described in Section

2.4 the fundamental first phases in the SOA life-cycle begin with business process modeling and

the design of service compositions. In particular, these phases are relevant for the alignment or

mediation between heterogeneous metadata formalizations as they deal with the conceptual

abstraction level of data or more precisely of information. Consequently, it can be stated that the

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

discussed mediator concept, if taken out of the goal-based planning context, does not provide a

sufficient solution to achieve semantic interoperability in SOA.

Leaving the goal-based planning approach including the notion of integrated mediators and

taking a wider perspective on how Semantic Web services can contribute on bridging the

semantic interoperability gap in SOA, nevertheless a significant added value compared to

traditional Web service technology can be identified. Formalization and representation of Web

services parameters are expressed in context of domain models in terms of ontology concepts

based on description logics. Recalling again the different abstraction levels for the

representation of meaning introduced in Section 2.3.2 ontologies as used in Semantic Web

services provide the following characteristics: On the one hand, the triple structure of the

ontology meta-data model can be regarded as well as a logical data model analogical to XML,

as it provides the view applied in application development. However, on the other hand it can

additionally be considered as a conceptual model, as concepts, properties and relations in an

ontology allow for expressive modeling similar to languages such as the Unified Modeling

Language (UML) or the Entity Relationship model (ER). Thus, it can be concluded that the

ontology meta-data model is located on a higher abstraction level compared to the XML meta-

data model used in traditional Web service technology. In the following Figure 3-20 this

comparison is illustrated in context of the model describing the semantic interoperability in

SOA:

Figure 3-20 Shift of Abstraction Level using Semantic Web Services

Semantic Web service parameters are annotated by utilizing a shared ontology. Supported by a

concrete Semantic Web service framework this conceptual model is then mapped to an XML

Schema-based logical model by means of an integrated grounding mechanism on top of

traditional XML-based Web services. Furthermore, this figure demonstrates that the general

notion of Semantic Web services, even if they are provided by different IT systems, considers

that a shared ontology is used for the annotation of Web service parameters. Consequently, even

if the conceptual ideas of the different human designers have been diverse, on the conceptual

level there are no more two different and heterogeneous conceptions but one aligned shared

conceptual model. This fact provides a major advantage of Semantic Web services for bridging

the semantic interoperability gap. As the lifting and lowering mechanism included in the

grounding of Semantic Web services provides already the mapping between the different

abstraction levels the round-trip across the semantic interoperability gap is already closed due to

Chapter 3

the shared conceptual model. Consequently, no more additional technical transformations

between representation formats are necessary, which provides a significant more efficient

solution compared to the traditional alternative (cf. Section 3.2.3) applying XML-based Web

services including extensive technical transformation code.

However, the underlying assumption of this general Semantic Web service approach is that a

shared domain ontology exists or can be developed collaboratively between the actors providing

and using Semantic Web services. Unfortunately, in real world cross-domain context as given in

the targeted large-scale SOA landscapes this approach of developing a globally shared

ontology-based standard for information models as a kind of lingua franca has turned out

limited. In particularly, organizational boundaries in community processes for the development

of shared conceptualizations covering multiple domains can be identified. While once so called

Enterprise Data Models have been a well-established approach, they have not yielded the results

expected or required including a large base of failed projects. Empirical case studies have

shown that data or information modeling cannot create new organizations or new businesses in

its own image. Conceptualizations must reflect the business [123]. Therefore, due to the

complexity deriving from inherent domain-specific differences in requirements, collective

agreement on a shared conceptualization is often only feasible under significant limitations.

These circumstances lead to the requirement for coexistence of multiple independent but

however conceptually overlapping information models. Accordingly, moving back to Semantic

Web services, it might be necessary to refer to multiple ontologies as well. Hence, it can be

stated that in the context of large-scale SOA landscapes the dominant semantic interoperability

approach of Semantic Web services is limited. In fact, it can be concluded that the ontology-

based approach for Web service description only alleviates the problem in terms of lifting the

abstraction level of semantic heterogeneity. However a mapping between ontologies originating

from different contexts is still needed [124].

To sum up the evaluation, it can be stated that the approach of Semantic Web services

contributes in achieving semantic interoperability by lifting the abstraction level of information

models and thus narrows the distance for bridging the semantic interoperability gap (cf. Figure

3-20). However, the general approach assumes the usage of a globally shared ontology with the

discussed limitations. Further analysis of the identified organizational boundaries and the

fundamental requirement for coexistence of independently managed conceptualizations deeper

elaborated in Section 4.5 which discusses context dependency of information models leading to

the proposed model of semantic mediation between loosely coupled information models in

SOA.

3.5 Semantic Information Integration in Related Areas

Having discussed how traditional Web service technology and Semantic Web service

technology can contribute to semantic interoperability and having identified the requirement for

mappings between heterogeneous conceptualizations in terms of ontologies, the next step is to

further analyze the state-of-the-art in this field. As ontology mapping is a relatively young

discipline and has been strongly influenced and based on previous work on semantic

information integration such related areas are firstly analyzed as an intermediate step before

coming back to ontology mapping.

Generally, semantic information integration, often also referred to as enterprise information

integration, is required in distributed IT systems when information from disparate sources with

different conceptualizations needs to be processed uniformly. Two prominent areas where this

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

problem is targeted in particularly are distributed database systems and distributed object-

oriented or component-based systems. As mentioned above before referring back to ontologies

this section highlights traditional approaches in these fields: Firstly the focus is put on data

integration in database systems and secondly on how semantic information integration is

targeted in the open distributed processing reference model (RM-ODP) [125].

3.5.1 Semantic Information Integration in Database Systems

In database systems the semantic interoperability thematic occurs in the context of data

integration between distributed databases. Two different approaches for data integration can be

distinguished. Firstly, data integration can be realized by so called materialized integration,

where data from different sources gets extracted, transformed and loaded (ETL) into one single

data store for uniformed processing [126]. This approach is also called data warehousing and is

used for data analysis in the context of enabling processes across different departments or

supporting business decision making tasks. The weakness of this approach is the lack of data

coherence when the original sources are updated but the single data store still contains the old

data. Consequently in the case of update, ETL processing needs to be done again. Alternatively,

data can just be integrated virtually by loosely coupling the different sources. This avoids the

repeated ETL process but increases complexity. Instead of integrating the data physically a

mediator with an integrated query interface is provided which transforms the queries to the

virtual integrated database into specific queries to each original source. Considering that data is

represented differently in the underlying database schemas of the original sources the different

source schemas need to be mapped to a so called global schema of the virtual or materialized

database. This is where semantic information integration takes place.

To define an appropriate global schema is a challenging task. The global schema needs to

express the overlapping concepts from different source schemas in a uniform manner. This task

is mainly done manually. However, various approaches have been developed for (semi-)

automatic schema matching [127]. Such a matching can be used to define the global schema by

means of matching two sources to extract the overlapping part. The matching does only cover

the design time task of semantic information integration. During runtime the global queries need

to be translated into queries for the local sources. For this process a mapping between the

schemas needs to be defined. Ideally, a mapping is the output of an automatic schema matching.

Global-as-View vs. Local-as-View

Such mappings can be expressed by making use of so called views. Views are read only virtual

tables of a data base schema composed of the result set of a query[128]. The main approaches

for schema mapping are the following [129]:

 Global-as-View (GAV) requires that the global schema is expressed in terms of local

data sources. The local data is stored physically according to different local schemas,

whereby mappings to the global schema are provided. More precisely, to every element

of the global schema, a view over the data sources is associated, so that its meaning is

specified in terms of the data residing at the sources. The following figure illustrates the

global-as-view approach:

Chapter 3

Figure 3-21 Global-as-View [130]

 Local-as-View (LAV) requires the global schema to be specified independently from

the sources. In turn, the sources are defined as views over the global schema. The

relationships between the global schema and the sources are thus established by

specifying the data of every source in terms of a view over the global schema. The

following figure illustrates the local-as-view approach:

Figure 3-22 Local-as-View [130]

In the GAV approach the views need to be updated whenever a source changes or a new one is

added, which is inflexible in a dynamic environment. In this regard LAV is more appropriate as

the global schema remains unchanged even when sources are changed or added. However, in

GAV the query reformulation task for the mediator can be performed straight forward as queries

for the sources are already defined in the views. In contrast query reformulation in LAV is more

complicated. Queries need to be constructed in terms of analyzing the views over the global

schema, whereas the relation between entities in the global schema and entities in the local

schema is only given inverse. Section 3.6 refers back to these presented approaches in database

schema mapping and outlines the relation to integration approaches with ontologies.

3.5.2 Semantic Information Integration in RM-ODP

The reference model for open distributed processing (RM-ODP) [125] is a joint standard of the

International Standards Organization (ISO) and the International Telecommunications Union

(ITU). RM-ODP offers a conceptual framework and an architecture that integrates aspects

related to distribution, interoperation and portability of software systems, in such way that

hardware heterogeneity, operating systems, networks, programming languages, databases and

management systems are transparent to the user. In this sense, RM-ODP manages complexity

through a separation of concerns, addressing specific problems from different points of view

[131]. It targets a comprehensive approach and aims at being a coordinating framework for any

current and future standards in the field of open distributed systems. As stated in Section 2.4 as

well various fundamental concepts of SOA are already mentioned in RM-ODP.

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

However, one of ODP's fundamental concepts is the use of a common object model, thus

following the object-oriented paradigm [132]. Software components are modeled as objects that

interact via interfaces with other objects. These objects can be remote objects and run each on

different machines. Therefore, interactions are realized in terms of remote procedure calls. This

context might explain why RM-ODP has received much attention in the context of object-

oriented distributed systems and especially the various standards related to the Common Object

Request Broker Architecture (CORBA). In contrast to the context of recent Web service

developments, in which RM-ODP is rarely mentioned, although most fundamental conceptual

approaches of Web services have already been identified in RM-ODP [133].

Information Viewpoint and Trading Function

Another fundamental concept of RM-ODP is the specification of a distributed system in terms

of viewpoints. Besides the enterprise, the computational, the engineering and the technology

viewpoint, RM-ODP defines the information viewpoint, which focuses on the semantics of

information and their processing. It describes the information managed by the system and the

structure and content type of the supporting data [134]. One of the common functions on which

RM-ODP gives outline definitions is the trading function, which targets in particular the

information viewpoint. In general, the trading function provides a centralized service for

discovery, binding and interaction between different objects by making use of attribute-based

descriptions, e.g. security policies or service advertisements. The foreseen usage of the trading

function in the ODP framework is to support inter-object communication via interrogations and

announcements. The communication methods require, besides the trading function, also the

support of a type repository, channel creation (binding) and service invocation functions. The

type repository function stores information about abstract types and their concrete

representation forms, in order to support translations between the service representations. The

binding function sets up a channel between the objects. It requires information from the trading

function and the type repository. When the binding has been established, the invocation

controller can activate a service request on behalf of the client. The communication scheme is

illustrated in the following figure:

Figure 3-23 RM-ODP Inter-Domain Communication Architecture [134]

Hence, during discovery and interaction, the trading function transparently integrates the

information from type repositories for the mediation between concrete representation forms in

different domains. Besides others the type repository therefore offers operations for [135]:

Chapter 3

 Publishing realizations of abstract types,

 Checking whether two type realizations are conformant and interchangeable,

 Retrieving subtypes or supertypes of a type realization and

 Translating one type realization to another.

The RM-ODP does not provide a concrete specification on how the actual translation of type

realizations should be performed. However, it defines the architecture how the trading function

and the type repository interact for mediation. Similar to the mediator approach in data

integration (cf. Section 3.5.1) a specialized service between source and requestor, here the

trading function together with the type repositories, performs the mediation. RM-ODP also

describes architectures where each domain uses a distinct trader also referred to as multiple

trading domains. In such architectures designed for heterogeneous platform integration, traders

interact with one another by instantiating proxies to services of another trader. The proxy needs

to be supervised by a so called interceptor [136]. The interceptor interprets the routed data and

performs the translation of representation formats by interaction with type repositories as

discussed before.

In contrast to data integration in context of database systems there is no distinction between a

global and local perspective of metadata. Rather type repositories which store metadata are

considered as equal each dedicated to its corresponding domain. However, it is possible to

regard one specific type repository as global compared to other local type repositories. Hence,

the integration from many local to one global perspective can be modeled within RM-ODP.

Furthermore, it can be stated that the concept of semantic mapping between heterogeneous type

systems is already identified within RM-ODP. However, the approach of the trading function

focuses on the runtime level and does not explicitly include mediation support during design

time such as required for modeling service compositions based on heterogeneous

conceptualizations.

3.6 Semantic Information Integration with Ontologies

In recent years the use of ontologies for semantic information integration has received much

attention. Prominent Semantic Web scientists such as Uschold and Grüninger regard semantic

interoperability as a key application of ontologies [137]. In this sense, ontologies have evolved

as a promising approach to preserve the precise meaning as data is moved from one IT system

to another. However, as discussed in Section 3.4.3 about the limitations of Semantic Web

services the dominant approach of developing globally shared ontology-based standards for

information models has turned out limited in real world cross-domain contexts. Accordingly,

the discipline has evolved from studying single ontology approaches to multiple ontology

approaches including mappings between them. Taking this into account, ontology mapping has

emerged as a central research field and is often regarded as the Achilles Heel of the Semantic

Web [138].

Basically, the application of ontologies for semantic information integration can be

distinguished into three different strategies as illustrated in the following Figure 3-24:

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

Figure 3-24 Three Ontology-based Semantic Integration Strategies [139]

In the following sections the different strategies are presented and further discussed.

3.6.1 Single Ontology Approach

The most traditional approach is using an ontology representing a global view on various

different data sources. This approach follows the core idea of an ontology as a shared

specification of a domain conceptualization. Information entities in each source need to be

related to one concept in the global ontology. This relation can be preprocessed and the

information can be stored centrally in an ontology representation. Another possibility is to

express the relation in terms of query reformulation similar to the global-as-view approach in

data integration outlined in Section 3.5.1, so that queries to the global ontology are delegated to

the different sources during run time.

However, it is not always sufficient to target semantic information integration by mapping

different information representations to a single global ontology. As discussed before, from a

realistic point of view there is no single ontology describing one domain, rather there exist

many different. Different purposes materialized in different applications which follow their own

conceptualizations force e.g. different granularity requirements on ontologies. Thus, different

domain ontologies may overlap and partly model the same information space, however from

different perspectives.

3.6.2 Multiple Ontology Approach with Ontology Mapping

These constraints have led to the multiple ontology approach. Hence, ontologies can be

developed independently and provide the accurate level of granularity required for their specific

purpose. Semantic information integration is then realized by mappings between different

Chapter 3

ontologies. These mappings define relations between concepts in different ontologies. In [140]

ontology mapping is defined the following way:

Given two ontologies O1 and O2, mapping one ontology onto another means that for each

entity (concept, relation or instance) in ontology O1, we try to find a corresponding entity,

which has the same intended meaning, in ontology O2.

The example shown in Figure 3-25 illustrates a mapping between two ontologies. The mapping

between the two ontologies is marked with dashed lines:

Figure 3-25 Example Ontologies with Mappings [140]

The process of mapping two different ontologies is generally divided into four basic tasks [140]

as illustrated in the following figure:

Figure 3-26 Basic Steps in Ontology Mapping

Mapping discovery or also named ontology matching targets the question on how to identify the

correspondences between semantically related entities within two given ontologies. Matchings

can be found between two ontology objects; then it is called a one-to-one matching. If one

object represented in ontology A is represented as two or more objects in another ontology B,

then it is called a one-to-many matching. Consequently, there are also many-to-many

matchings, if there is any correspondence of aggregated objects in different ontologies. The

process of mapping discovery is a complicated task and has been an active research field in

recent years. The discipline is closely related to schema matching as known from database

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

integration. In this context various approaches have been analyzed in [127]. Such work has

provided substantial foundations for current research in ontology matching. More recently, in

[141] various ontology matching techniques have been surveyed, classified and contextualized

to application scenarios. Basically, such ontology matching techniques are based on particular

characteristics of ontologies. For example this includes structural measures based on graph-

theory or lexical as well as statistical similarities which are evaluated to find correspondences

between concepts. Moreover, machine-learning methods can be applied that learn how to sort

alignments through the presentation of many correct alignments in a learning phase in terms of

positive examples [141].

Figure 3-27 Step 1 of Ontology Mapping: Mapping Discovery

Although encouraging results are obtained, this problem is by no means solved and

automatically obtained results are not yet good enough in terms of recall and precision [142].

Hence, the analysis turns out that most approaches for automatic ontology matching still require

human intervention to generate sufficient results and thus can be considered as semi-automatic

matching and heuristic based. A semi-automated mapping process means that a tool proposes a

possible mapping to a user and the user has to validate and complete it [143]. Indeed, for finding

the correspondences between concepts, it is necessary to understand their meaning. Besides the

represented meaning described by model-theoretic semantics, the ultimate meaning of concepts

is only in the head of the people who developed those concepts and accordingly the final

matching decision can only be performed by them [144]. Thus, it can be stated that the need for

user involvement seems to be a natural limitation to automatic ontology matching approaches.

Figure 3-28 Step 2 of Ontology Mapping: Mapping Representation

After having found matching concepts and properties the matching results, i.e. the identified

mappings need to be formally defined. On the one hand, the alignment can be utilized to merge

the data or conceptual models into a newly created so called merged ontology. This approach

basically refers back to the single ontology approach taking into account update of distributed

information sources and ontology evolution. On the other hand keeping the approach of

multiple ontologies the translation of instances according to the identified matching results need

to be considered. In order to express the translation, generally three approaches can be

distinguished:

 Views and Queries

 Mapping Ontology

 Bridging Axioms

Using views and queries for describing mappings is similar to approaches discussed in the

context of data integration outlined in subsection 3.5.1. D. Calvanese et al. demonstrate in [145]

how view-based query answering known from data integration can be applied to ontology

Chapter 3

mapping. A global schema or ontology is used and queries to the global conceptualization are

rewritten in terms of queries to local ontologies. The results are aggregated according to the

view mechanism. This approach can also be applied in the context of two or more ontologies

that are used on the same level in a peer-to-peer manner.

Another approach for describing mappings is to define a specific mapping ontology which

expresses the different relationships between corresponding concepts and properties. A.

Maedche et. al. present in [146] a mapping framework for distributed ontologies (MAFRA), for

which they have developed a dedicated so called semantic bridge ontology. In order to process

such a mapping, a specific inference engine needs to be provided which performs the mapping

and enables further reasoning with mapping results.

The third way is to describe the mapping by a set of bridging axioms which refer to concepts or

properties of a source ontology and specify how to express them in a target ontology. Thus,

bridging axioms can be realized as description logic-based rules which describe the

transformation. The advantage of this rule-based approach is that reasoning over the described

mappings can be applied straight forward as the mapping rules can be integrated into the regular

ontology inference process, such as classifying etc. For example Deijing Dou et. al. have

developed an ontology mapping tool OntoMerge [147] based on bridging axioms. It mixes a

source and target ontology with bridging axioms to form a so called merged ontology.

Individuals expressed in the source ontology are then inferred by bridging axioms to individuals

which are additionally expressed in the target ontology. Finally, all source ontology constructs

are removed, which is called projection and individuals purely expressed in the target ontology

are the result of the transformation. The ontology mapping approach instantiating the concept of

loosely coupled information models developed in this work also applies rules to transform

concepts from one ontology to a representation in another. Thus, Chapter 4 refers back to

OntoMerge and outlines similarities and differences with the presented approach.

Figure 3-29 Step 3 of Ontology Mapping: Mapping Deployment

Mapping discovery and mapping representation mainly concerns activities during design time of

the information systems life-cycle. Moreover, these activities are addressed from a domain

perspective by focusing not only on a particular application but rather on a more comprehensive

scope targeting an entire application landscape. Consequently, domain experts model their

respective domain in terms of ontologies and collaborate, in order to define the mappings

between them according to the multiple ontology approach. However, this perspective does not

necessarily match the perspective of a concrete application that may only cover partly any

particular domain but however addresses multiple ones in terms of cross-organizational

processes. In fact, different stakeholders are involved and have different responsibilities. On the

one hand, domain experts define ontologies and respective mappings between them as

producers and on the other side, application or process experts utilize such ontologies and

ontology mappings as users in concrete application scenarios.

Consequently, the deployment of ontology mappings needs to support multi-stakeholder

processes taking into account mechanisms for publishing facilities, access and feedback

channels as identified in [148]. Despite the increasing interest shown from various communities

in which ontologies have been used, there is still a lack of such tools to facilitate deployment

and maintenance of ontologies [149]. Some approaches have been identified in [150] and

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

aspects and requirements of a collaborative, community-oriented ontology server have been

described. In particular, trust in the developed ontology mappings needs to be ensured when

mappings are passed from domain experts to stakeholders focusing on usage in concrete

application scenarios. Therefore, measures for validation and correctness as addressed in testing

of ontology mappings have to be considered in the deployment process.

Figure 3-30 Step 4 of Ontology Mapping: Mapping Application

The ultimate goal of ontology mapping is to use the identified and formally defined mappings in

multiple ontology-based application contexts. The resulting mappings are used for various

integration tasks: data transformation, query answering, or Web service composition, to name a

few [124]. For instance, it should be enabled to ask queries using the vocabulary of one

ontology and receive answers that do not only consist of instances of this ontology but also of

ontologies connected through ontology mappings [151]. Accordingly, in order to process the

ontologies the reasoning on them has not only to cover the ontologies themselves but as well the

mappings between the ontologies. That means that besides the relations between concepts and

properties within an ontology additionally inter-ontology relations between concepts and

properties defined in the mappings need to be integrated into the inferencing mechanism.

Thereby, the inferencing mechanism strongly depends on the way the mappings are formally

expressed as outlined above.

Regarding ontology mappings expressed in terms of database-like views the reasoning is

performed in a lazy-evaluation style. I.e. the reasoner does not process the whole knowledge

base including source and target ontology as well as ontology mappings but only gets active

when queries need to be processed. Then the reasoner interprets the defined views in order to

derive rewritten queries according to the underlying information source. The result is then

presented homogeneously in terms of the reference ontology. In case of a dedicated mapping

ontology that represents the mapping, a specific inference engine has to be provided in order to

interpret the declarative mapping. Such a mapping interpreter translates instances that conform

to a source ontology to instances conforming to a target ontology. Further reasoning such as

subsumption, classification or constraint checking can then be performed on the resulting

unified instances. Regarding bridging axioms-based ontology mapping the reasoning process is

quite similar. However, taking into account that bridging axioms can be expressed in terms of

description logic-based rules no specific mapping interpreter is necessary. Reasoning over the

described mappings can be applied straight forward as the mapping rules can be integrated into

the regular reasoning process. I.e. besides reasoning according to the underlying rules defining

the ontology language semantics as well the rules that define the ontology mapping are handled

by the same inference engine.

According to these four basic steps ontology mapping enables an information architecture for

semantic integration that is not dependent on one globally shared ontology. The multiple

ontologies are linked together by means of ontology mappings and thus enabling information

exchange across heterogeneous sources without any semantic centralization.

Chapter 3

3.6.3 Hybrid Ontology Approach

Besides the single ontology and multiple ontology approach H. Wache et al. [139] also describe

a hybrid approach which combines the before mentioned. Basic common concepts which can be

equally used in different local information sources without any restrictions are merged and

expressed in a shared vocabulary. Then concepts of local ontologies can be derived from the

shared vocabulary, which can be as well considered as a top level ontology. However, the

globally shared vocabulary is limited strictly to basic concepts that are not affected by specific

local requirements. Context-related concepts capturing different perspectives, coverage or

different granularity of local information sources are defined independently in local ontologies.

Consequently, such local ontologies are then treated according to the multiple ontology

approach with mutual mappings between them.

Having identified ontologies as a suitable means for semantic information integration, three

different ontology-based integration strategies can be distinguished. The single ontology

approach features a straight forward implementation effort but does not support heterogeneous

viewpoints on different information sources. This is the major benefit of the multiple ontology

approach that enables different perspectives on heterogeneous information sources combined

with independent management of separate ontologies. The hybrid ontology-based integration

strategy compared to the two other approaches combines the advantage of supporting

heterogeneous perspectives with providing a common base of generalized basic concepts that

are shared among the different ontologies. This enables a separation of concerns where common

aspects can be shared and at the same it ensures the independency of conceptualizations where

different perspectives are required [139].

3.7 Summary and Reflection

This chapter has analyzed the conceptual and technological state-of-the-art in achieving

semantic interoperability of heterogeneous systems with particular focus on SOA.

The integration of information is at the heart of semantic interoperability. Hence, an analysis of

different approaches in this area has been carried out covering the main strategies in context of

traditional purely XML-based Web services, Semantic Web services and related areas such as

information integration in distributed database systems and distributed object-oriented systems.

It has turned out that integration on the conceptual level is more efficient and effective than

integrating information models which are represented on the more technical logical data model.

The formal definition of concepts and their relationships allows for increasing the efficiency of

the integration process by lifting it to a higher abstraction level and thus reducing manual

semantic interpretation and technical transformations between differences in information

representation.

Consequently, a thorough survey of ontology-based Semantic Web services technologies and

concepts and corresponding semantic integration approaches has been performed and their

advantages have been outlined in contrast to the analyzed capabilities of traditional purely

XML-based Web services.

In order to further substantiate the analysis, the shift of abstraction level has been pointed out

within the framework of semantic interoperability in SOA developed in Chapter 2.

Consequently, a reduction of the semantic interoperability gap (cf. Section 3.4.3) could be

State-of-the-Art in SOA for Bridging the Semantic Interoperability Gap

shown for semantic integration on the conceptual level compared to semantic integration on the

on the lower abstraction levels for information representation.

Furthermore, it has been identified that due to the nature of large-scale SOA landscapes a single

globally accepted conceptualization covering all requirements on information models of

involved actors in cross-organizational processes has limited practicability. Rather, there exist

several conceptualizations which fully or partly cover a given domain. It has been outlined that

even in case of relatively distinct identified domains; organizational reasons lead to the

development of competing ontologies which thus have to be considered in realistic application

scenarios. This is where the multiple ontology approach for semantic integration based on

ontology mapping comes into play. Hence, a number of ontology mapping approaches and

exemplary tools have been investigated.

In the following Chapter 4, this analysis of heterogeneous conceptualizations is refined and the

central artifact of this work, the concept for semantic mediation between loosely coupled

information models in SOA is presented.

Chapter 4

Semantic Mediation between Loosely

Coupled Information Models in SOA

4.1 Overview

This chapter presents the core conceptual approach of this work. Based on the problem

identification and the analysis of the state-of-the-art for achieving semantic interoperability in

SOA, a concept for semantic mediation between loosely coupled information models is

presented.

The chapter starts by concretizing the requirements already briefly introduced in Chapter 1.

Subsequently, the general idea of the developed concept is outlined to provide an overview of

the central aspects. These include mainly the shift from monolithic to loosely coupled

information models combined with the approach to address semantic integration on a higher

abstraction level. The following sections then refine the general idea and provide detailed

descriptions and argumentations of the respective conceptual parts.

Firstly, the limitations of standardization with regard to semantic interoperability are pointed

out. Then, the underlying reason for context-dependency of information models is deeper

analyzed by referring to a model theoretic approach. Based on these findings, the transfer of the

principle of loose coupling to the semantic level is discussed and specified.

As the main goal of this work is to contribute to a solution for the semantic interoperability

problem in SOA with particular regard to effectiveness and efficiency, the subsequent section

reflects the proposed conceptual solution on this regard including a proposed balance between

these as competing identified sub-goals.

Furthermore, in order to fulfill the requirements for loose coupling of information models in

SOA and enable the proposed concept, a semantic mediation mechanism based on Semantic

Web concepts in terms of ontologies and description logic rules is described and specified.

Finally, a conclusion and reflection of the chapter is provided summarizing the main outcomes

and relating them to the overall thesis goals.

4.2 Conceptual Goals and Requirements

Starting from the demand to achieve interoperability in IT-supported business processes across

intra- and inter-organizational boundaries, the semantic dimension has been identified as

critical. The research hypothesis already outlines the direction of the proposed solution. In order

Chapter 4

to refine what is meant by an effective and efficient approach for achieving semantic

interoperability in SOA, the following conceptual requirements or goals are listed:

 The complexity in semantic integration should be decreased by separating technical

issues from business issues. This should result in a reduced demand for overlapping

skills of involved stakeholders. These skills include technical expertise, expertise in

different business domains that are affected by cross-organizational processes and as

well the respective business process expertise itself.

 Heterogeneity and differences in business requirements and organizational boundaries

should be respected. In particular, it should be anticipated that conceptualization and

information modeling is strongly bound to organizational structures including the scope

of influence and the feasibility of community processes to ensure agreement and

standardization of information models. Therefore, organizational independence should

be reflected in an information architecture for large-scale SOA landscapes.

 In a concrete semantic integration scenario, i.e. the realization of a cross-organizational

business process, the status quo of complicated and highly technical transformation

coding for bridging heterogeneous information representations should be overcome.

Furthermore, recurring manual efforts for integrating these transformations in the

specific application or process context should be consolidated and automation for these

tasks should be improved.

 As differences in information model representations can be complex, the mediation

mechanism should be able to cover that complexity in terms of completeness and should

as well remain easy to maintain by domain experts.

 The developed conceptual approach should remain consistent to best practice SOA

methodologies. According to this, it should be based on process orientation starting

with business analysis leading to process models, followed by deconstruction into

building blocks in terms of services, which can be independently realized and

maintained based on standardized interfaces and service integration infrastructures. In

particular, this implies that no opposing approaches are followed such as artificial

intelligence motivated attempts that try to substitute the human business analyst and

process modeler in terms of automating the design of processes with planning

algorithms (cf. Section 3.4.3).

 The approach should rely on existing concepts and standards of the World Wide Web,

as it constitutes the dominant IT infrastructure for cross-organizational interaction.

Accordingly, technological path dependency should be considered and thus it has to be

ensured that any proposed solution can be based upon existing technology.

As described in the introduction, the methodology of this work is aligned to the approach of

design research (cf. Section 1.3), which is motivated by improving the state-of-the-art in terms

of solving practical problems. Accordingly, the above determined requirements share the focus

on practical relevance and business suitability.

4.3 General Idea

Considering typical cross-organizational processes in large-scale SOA landscapes, not only two

actors as in a classical integration scenario need to be integrated but rather multiple actors have

Semantic Mediation between Loosely Coupled Information Models in SOA

to be connected within process chains and process networks. To tackle the complexity of such

integration scenarios the principle of loose coupling has been evolved as the concept of choice

originating from component-based architectures and it has been well reflected in SOA as a

fundamental paradigm. However, when it comes to the semantic level as presented in the

analysis of the state-of-the-art, the dominant approach aims at developing commonly shared

conceptualizations in terms of standards for information models.

From an architectural point of view focusing on the semantic level this constitutes a monolithic

approach that stands out contrary to the general strategy tackling complexity in terms of loose

coupling. The advent of semantic technologies and particularly ontologies as means for

generating explicit formal vocabularies to preserve the precise meaning in information exchange

can be seen as the same attempt considering the shared nature of ontologies. But taking into

account organizational boundaries to achieve shared agreement among stakeholders, the

feasibility of this approach has turned out limited in real-world cross-domain context.

Based on this analysis, the first part of the general idea is to transfer the concept of loose

coupling to the semantic level and move from monolithic to loosely coupled information models

on domain level as illustrated in the following figure:

Figure 4-1 From Monolithic to Loosely Coupled Information Models on Domain Level

The approach preserves the basic idea of an ontology as a shared conceptualization but limits

the scope of its application to an organizational feasible extent. Accordingly, common

information models are established on the so called domain level, where community processes

aiming to reach shared agreement about conceptualizations are bound to stakeholders belonging

to the same domain. In this context, domain refers to a group of organizations or organizational

units represented as organigrams in Figure 4-1 that share common requirements and exhibit the

capability for effective collaborative decision making.

Chapter 4

In order to achieve semantic interoperability across domain borders as required in cross-

organizational processes, the independent and heterogeneous information models need to be

loosely coupled in terms of so called semantic bridges. Such semantic bridges enable the

mediation between domain-specific differences and ensure a coexistence of independently

managed even possibly conceptually overlapping information models.

The scope of loosely coupled information models on domain level takes into account the aim for

an efficient solution, as not every single organization develops and maintains their own

information model but separates this task to specific domain experts. Consequently, also the

mediation between different information models, i.e. different representations of overlapping

conceptualizations, is only done once on domain level instead of performing it repeatedly on the

concrete application level for each single cross-organizational process.

Moreover, efficiency is addressed by shifting the semantic integration task onto a higher

abstraction level and thus reducing technical efforts for the mediation between heterogeneous

information models. The following Figure 4-2 illustrates the shift of semantic integration with

loosely coupled ontologies based on the framework of semantic interoperability developed in

Section 2.5:

Figure 4-2 Shift of Semantic Integration with Loosely Coupled Ontologies

As described in the state-of-the-art analysis, traditional Web service descriptions are based on

XML and XML Schema to specify information models including meta-data of services

(cf. Section 3.2). Inherently, due to the limited semantic expressiveness of XML and XML

Schema, technically complex transformation procedures are necessary to perform the round-trip

across the semantic interoperability gap.

Aiming at overcoming this drawback, the second part of the general idea is to shift the

abstraction level of semantic integration and thus reducing the semantic interoperability gap.

Thereby, the gap is not bridged directly by using the same conceptual model in terms of a

shared ontology as applied in classical Semantic Web service approaches. However, the gap is

narrowed as heterogeneities can be overcome less technically based on the conceptual

abstraction level of information models. The round-trip across the semantic interoperability gap

then can be performed by utilizing rules-based declarative description logic, which enables a

Semantic Mediation between Loosely Coupled Information Models in SOA

mechanism for loose coupling between the heterogeneous and independent domain ontologies

describing the Web services.

To summarize the general idea, it can be stated that the particular aim for effectiveness and

efficiency regarding a solution of the semantic interoperability problem in SOA is targeted the

following way:

 Effectiveness is targeted by anticipating that reaching agreement about shared

conceptualizations between multiple heterogeneous stakeholders involved in cross-

organizational processes is often beyond de facto organizational capabilities. In order to

address the complexity resulting from heterogeneous requirements and perspectives, the

central idea is to transfer the concept of loose coupling to the semantic level and

consequently move from monolithic to loosely coupled information models.

 Efficiency is targeted by addressing semantic integration of different information

models resulting from the loosely coupled approach on domain level rather than

repeatedly on the concrete application level for each single cross-organizational process.

And secondly, efficiency is targeted through the shift of abstraction level for semantic

integration. Instead of overcoming the heterogeneities of different information models

on a rather technical level, efforts and complexity are reduced by performing the

mediation between heterogeneities on the more business-oriented conceptual level.

The following sections further refine the general idea and provide detailed analysis and

argumentations of the respective conceptual parts outlined above. The requirement for loose

coupling of information models in SOA landscapes is examined in Section 4.4, which further

discusses the limitations of standardization for semantic interoperability in SOA and Section

4.5, which relates this analysis to the context dependency of information models. In Section 4.6,

the core concept of loose coupling on the semantic level is deeper analyzed and proposed as a

design principle for the architecture of information models in SOA landscapes. Then, Section

4.7 reflects on an identified trade-off between efficiency and effectiveness and explains why

positioning loose coupling on the domain level provides an appropriate balance between these

as competing identified sub-goals. Finally, Section 4.8 presents and specifies a concrete

mediation mechanism based on semantic bridges for loose coupling of domain ontologies that

builds upon declarative rules-based description logic.

4.4 Limitations of Standardization for Semantic

Interoperability in SOA

Basis of all IT systems that support business processes are information models which describe a

certain domain and try to capture the relevant information that needs to be processed in a certain

organizational context. Information models are the central artifact when it comes to semantic

interoperability. Different systems and actors need to understand and interpret information

models in the same manner to ensure that the information is processed consistently between the

communication partners. The dominant top-down approach is to address this issue with

semantic standardization so that conceptualizations are consistently shared among all

participating stakeholders. This section discusses the possibilities and limitations which come

with this approach and tries to derive the adequate scope and extent of standardization regarding

information models.

Chapter 4

4.4.1 Standardization vs. Mediation

As already outlined in Section 2.3.3 which has introduced the framework describing the

semantic interoperability gap, there are generally two options for achieving semantic

interoperability between distributed IT systems in SOA landscapes:

1. Standardization of information models, i.e. solving semantic heterogeneity by forcing

homogeneity

2. Translation or transformation between different information models, i.e. solving the

semantic heterogeneity by mediation

The following figure illustrates the two general strategies:

Figure 4-3 Semantic Standardization vs. Semantic Mediation

Standardization of information models (as illustrated on the left hand side in Figure 4-3) can be

carried out either on the conceptual level (1a) or on the logical level (1b). On the conceptual

level (1a) standardization can be established e.g. in terms of UML described information models

or in terms of ontologies as applied in the context of so called Semantic Web services based on

a shared domain ontology (cf. Section 3.4). For example the international standard “United

Nations Electronic Data Interchange For Administration, Commerce and Transport”, builds

upon the UML-based UN/CEFACT's Modeling Methodology (UMM) for capturing

collaboration patterns and messages between enterprises [152]. Alternatively or in addition,

standardization can be addressed as well on the logical level (1b) for instance in terms of XML-

Schema-based standards. For example the RosettaNet industry standards [153] or the OASIS

standardized Universal Business Language (UBL) [154] instantiation of the ebXML Core

Components Technical Specification [155] are XML-based. Both cases (1a) and (1b) bridge the

semantic interoperability gap by virtually narrowing it to zero as the representation of

information models used by the different communication partners is the same. This can be either

achieved directly in terms of harmonizing internal information models of involved IT systems

or indirectly by means of a so called lingua franca used as a single reference standard for

exchanged service messages.

The other strategy for achieving semantic interoperability abstains from a shared standardized

information model and employs mediation between the heterogeneously modeled IT systems as

illustrated on the right hand side in Figure 4-3. Consequently, a kind of intermediate translation

has to be performed and it needs to be assured that the meaning is preserved adequately.

Accordingly, on the logical level of information models (2 b) this mediation can be addressed in

terms of transformation procedures between different schemas for instance by using the industry

Semantic Mediation between Loosely Coupled Information Models in SOA

standard XSL transformations as part of XML-based Web service compositions (cf. Section

3.2.3). Furthermore, symmetrically to the before discussed cases, it is possible to bridge the

semantic interoperability gap already on the conceptual level (2 a). This approach can be

achieved by applying ontology mappings (so called semantic bridges) which are discussed in

more detail later in Section 4.8.

Moreover, a combination of the two general strategies is possible. Such a hybrid approach

combines semantic standardization with semantic mediation (cf. Section 3.6.3). On the one

hand, basic common concepts which can be equally used in different contexts without any

restrictions are merged and expressed in a shared standardized information model. However, on

the other hand, the shared and standardized information model is strictly limited to basic

concepts that are not affected by specific local business requirements. Context-related concepts

capturing different perspectives, coverage or different granularity of local information models

are defined independently and consequently semantic mediation between them is required.

This hybrid strategy is also targeted in this work. Nevertheless, the focus is put on the mediation

part as the non-context dependent part of information models can be observed as rather low as

analyzed later on in Section 4.5 about context dependency of information models. Moreover,

later on this chapter further describes how to perform such semantic mediations between

different heterogeneous conceptual information models exploiting ontology mappings as

introduced in Section 3.6.2. However, before presenting deeper the concrete developed

approach, firstly the argumentation line is provided that leads from the limitations of semantic

standardization-based strategies to the pillars of the aimed solution.

4.4.2 From Technical Standards to Semantic Standards

The advent of SOA and widely accepted Web service standards has benefited technical

interoperability in recent years substantially. Taking this into account standardization has also

been regarded as the key towards semantic interoperability within SOA. However,

standardization brings both advantages and disadvantages. In the following this section analysis

some factors in this context in order to discuss whether or to which extent the success regarding

technical standardization in SOA can be transferred to the semantic level.

On the one hand, Web services are still an emerging technology and are subject of ongoing

innovation resulting in redevelopment and technological adjustment. In such a context

standardization is often seen as hampering advancement or trailing behind the market. On the

other hand, standardization remains the absolute condition for progress in virtually all fields of

technology and especially in information technology [156]. Thereby, a multitude of dimensions

can be considered. These include economics of standards, standardization policies and

intellectual property rights, an aligned overall development process including actors and their

rights, roles and responsibilities and a political dimension about what is socially desirable and

where consensus might be reached. The last two aspects also needs to address the limiting factor

of standardization in terms of optimizing that the output of standardization is promptly and

manages to address the actual market‟s needs [156].

With regard to SOA and Web services such factors of standardization has led to platform and

vendor independent interface and interaction specifications for distributed IT systems.

Economies of scales in terms of increased productivity by rationalization could be realized on

vendor side regarding investment protection as well as on user side taking into account that by

adopting Web service standards just one type of technical interface specification needs to be

supported instead of multiple ones following different paradigms. On the micro-level for the

end user this results in lower integration cost and faster integration cycles by minimized risk.

Chapter 4

The following figure illustrates the reduced complexity resulting from broad adoption of Web

services standards:

Figure 4-4 Integration of Multiple Interface Technologies vs. Web Service Standards

On the one hand, integration complexity of actors potentially increases by as all

combinations between different interface technologies have to be taken into account and a

classical point-to-point integration is necessary (as illustrated on the left hand side of Figure

4-4). On the other hand, if all actors consistently adopt Web services standards, integration

complexity remains linear as each actor just has to consider one interface technology for all

cooperation partners (as illustrated on the right hand side of Figure 4-4).

Regarding the overall standardization process of Web services it can be noticed that as

characteristically in information technology traditional government-driven standardization

organizations did not play a central role. In contrast, private standardization organizations

around industry consortia such as the W3C or OASIS have been mainly responsible as the

standards development organization for Web services. This can be seen as a reaction to alleviate

the limiting factor of standardization often regarded as a heavy vehicle [157]. Due to the

dynamics of innovation with non-linear and much shorter life-cycles in information technology

such standardization processes driven by industry consortia provide much more agility and

ensure market relevance, however to the cost of broad consensus [158]. In the field of the Web

service standardization this has been the case as only corresponding major industry players have

been involved. However, the standards are provided on a royalty free basis and in the last

decade Web services have gained considerable popularity and broad market uptake.

One major success factor in the standardization of Web services and its market adoption can be

seen in the political or business dimension regarding the consensus level. In the context of

standardization around distributed object-oriented systems the industry uptake has been limited

as not all major industry players adopted the developed standards in their product portfolio. No

industry-wide consensus could be reached regarding a shared agreement of common object

models and corresponding programming language models (cf. Section 3.2). Whereas, Web

services built on more light-weight concepts focusing on message-based and document-based

interaction, they consequently implied less dependency to existing technology and thus could

reach the consensus level for broad market uptake. Thus, a good trade-off between the enabling

and limiting factors of standardization could be found and made standardization of interface

descriptions and interaction patterns by means of Web services the dominant state-of-art method

of enabling technical interoperability in SOA.

This success story has led to the conception that developing industry-wide accepted standards

for common information models could be also the key towards semantic interoperability in

SOA. However, this has often lead to an oversimplification of the problem not reflecting the

business driven fundamental need for flexibility in the specification of service granularity and

their information models reflecting dynamic business requirements. Consequently, it can be

Semantic Mediation between Loosely Coupled Information Models in SOA

noticed that a unified adoption of business-oriented semantic standards such as the before

mentioned UN/CEFACT or RosettaNet (cf. Section 4.4.1) is limited. Rather, there are various

standardization initiatives for so called B2B standards and selecting an appropriate B2B

standard is depending on the specific business situation [159]. Some prominent B2B standards

besides RosettaNet or UN/EDIFACT further include but are not limited to ebXML [160],

SPEC2000 [161], CIDX [162], cXML [163], xCBL [164], etc. Moreover, the concrete standards

are additionally customized and thus deviate from the original standardized specification to

meet the needs of the concrete business processes context.

Accordingly, it can be stated that regarding semantic standards the required positive trade-off

between the advantages and disadvantages of standardization could not be found. The

successful shift in the context of technical standards from slower government standardization

bodies to more agile industry-based consortia could apparently not compensate and meet the

high requirements for business adaptability taking also into account business diversity.

Moreover, a light-weight consensus limiting the dependencies as reached in terms of Web

services on the technical level could not be established in terms of standardization on the

semantic level, which can be explained with the fact that information models are heavily

depending on the application context which may differ from business to business. Thus, any

standardization measures lead to the consequence that internal information models are more or

less tightly-coupled to a fixed conceptualization and therefore imply limited adoption of such

standards.

4.4.3 Semantic Standardization and Monolithic Information Models

On the technical level Web service standards have enabled modular architectures, as they allow

to decouple functional components and service users from service providers in context of Web

service development, Web service operation and Web service management. In contrast,

applying standards on the semantic level leads to opposite effects regarding architectural

principles. In fact, it can be argued that they establish in analogy to common architecture

patterns so called monolithic structures even not on the functional level but on the semantic

level.

Standards for information models can be built in modular units. Accordingly, the concept of

building blocks is well adopted in prominent B2B standards. As mentioned above, e.g. the

ebXML Core Components approach [154] catalogues business-related information components

that are of horizontal nature, which then can be reused in different business-contexts to build

more specific information components. However, the more specific information components

need to be derived from the core components. This enables semantic interoperability for the

basic parts with overall relevance but has the consequence that business-context specific

information components are tightly-coupled to the general core components as they cannot exist

independently. Therefore, this tight-coupling establishes a monolithic architecture on the

semantic level because any part relevant for information interchange is realized according to

one single conceptualization.

Such a monolithic architecture on the semantic level caused by rigorous semantic

standardization brings analog to the field of software architecture the problem of poor

maintainability. This disadvantage is serious because with growth of coverage of the semantic

standard maintainability efforts rise disproportional. The various stakeholders in the

standardization process which represent the diversity of business requirements need to

coordinate each other in order to achieve consensus. But due to the monolithic structure

including tight dependencies no efficient and effective separation of concerns can be

Chapter 4

established. Thus, the coordination of actors and business requirements results in

complexity [165] as they need to be mutually synchronized.

Consequently, based on the analysis above it can be concluded that a rigorous standardization of

information models attempting to reach maximal coverage does not provide an optimal solution

to the problem of semantic interoperability especially in large-scale SOA landscapes involving

multiple organizations. However, in order to overcome a similar situation on the semantic level

of necessary point-to-point integration as described in Figure 4-4, the approach of

standardization cannot be completely dismissed. Rather, an appropriate degree and coverage of

semantic standardization with reflection of the consensus dimension needs to be found. The

following subsection continues in particular with these aspects.

4.4.4 Consensus Degree and Adequate Scope of Semantic Standards

As already noticed in the context of Web services and in general in standardization of

information technology the trade-off between the limiting factors of standardization have led to

a shift from government-driven standardization organizations to industry consortia. This can be

seen as a reaction to the wide believe that especially in the Internet era, the consensual

standardization process is too slow to produce anything relevant for business especially with

regard to maintenance of standards and adoption of new requirements [158]. Typically,

government-driven standards organizations have well defined and agreed procedures for

development and approval of standards, dispute resolution and fair representation for affected

parties. In contrast, industry consortia often appear for specific purposes on a temporary basis

perhaps set in motion by one or more companies with common interest. Such standardization

initiatives may have less complex procedures with the advantage of higher speed. However,

they cannot insure participation from all entities with a material interest [166]. Due to the high

degree of innovation with requirements such as rapid change and flexibility of standardization

procedures the consensus degree is limited and leads to a smaller scope of standardization.

Another type of standardization that addresses this trade-off are public specifications. They are

published by a standards organization but are controlled by the sponsor or a group of sponsors,

which respectively does not reflect the consensus of all possible stakeholders but allows prompt

publication and usage. And finally there are so called in-house standards, which are usually

non-public and just used within an enterprise or an enterprise network. The following Figure 4-5

illustrates the different types of standardization along the trade-off between consensus-degree

and the adequate scope of standardization.

Semantic Mediation between Loosely Coupled Information Models in SOA

Figure 4-5 Consensus Degree and Appropriate Scope of Standards [167]

Figure 4-5 demonstrates that products or processes with a high innovation rate requiring for

frequent change and flexibility in standardization processes are related to a lower consensus

degree of stakeholders and thus result in smaller groups and therefore smaller scope of

standardization.

Reflecting this in the context of standardization of information models it can stated that the

subject of standardization undergoes rapid innovation. Defining final standards for business

semantics, a standard that is suitable for any current and future business processes, is an ideal

that cannot be reached [168]. Business processes and scenarios are highly dynamic and object of

continuous adaption and advancements. Furthermore, there are conflicting goals of

stakeholders. On the one hand, business processes targeted by semantic standardization are

internal or within closely related organizational units and their requirements need to be

reflected. On the other hand, they are in competition with the requirements for cross-

organizational processes and both need to be aligned within each organization participating in

the standardization process.

Consequently, the following conclusion regarding the limitations of standardization on the

semantic level can be made: Even the conceptual question where to position semantic

standardization within the discussed Figure 4-5 cannot be finally answered, it can be clearly

derived that the scope is not on the level of universal semantic standards. Information models,

which are subject to frequent business changes, can be classified as highly innovative with a

lower consensus degree and thus their standardization should be located further in the direction

of the lower left-hand side corner of Figure 4-5.

This analysis explains why attempts for universal semantic business standards such as the

aforementioned UN/CEFACT or RosettaNet (cf. Section 4.4.2) have limited market success.

Consequently, the more adequate scope of semantic standardization can be seen on covering

smaller groups of interest allowing also multiple coexisting standards for information models

covering in principle similar issues.

This section has provided an argumentation of the inadequacy of common standards for

information models in highly distributed and dynamic environments targeted in cross-

organizational SOA landscapes and the need for multiple coexisting information models in this

context. The main argumentation line therein has been the need for continuous adaption and

advancements of information models which leads to limited consensus degree. The following

Chapter 4

section aims at further strengthening the argumentation for multiple coexisting standards of

information models. However, it does not focus on the dynamics of business requirements that

causes the limited consensus degree but focuses on the nature of information models and its

inherent context dependency.

4.5 Context Dependency of Information Models

In general, a model can be used to describe, explain or predict certain aspects of the reality.

Models are abstract artifacts in terms of an abstraction on parts of the real world driven by a

specific purpose. Information models, which are the central artifact when it comes to semantic

interoperability, are specific models. Information itself is an abstract artifact and information

models add another abstraction layer to model relevant information in a certain context or

domain.

4.5.1 Heterogeneity of Information Models

As already stated in the analysis above (cf. Section 4.4), information models in the business

context tend to be diverse and changing over time due to the frequent changes in business

processes and different context-specific business requirements in general. In particular,

information models therefore are characterized by the following differences such as [124]:

 Different terminology for information entities

 Different modeling conventions

 Different (possibly overlapping) coverage

 Different granularity

Whereas the first two points may constitute differences that could be overcome in terms of

adequate organizational measures; however, the last two points represent more fundamental

differences. The different coverage of information models which are possibly overlapping or the

different granularity or deepness of the modeled information entities can be linked to the

different purposes and contexts, which information models need to reflect.

In order to further understand the nature of the differences in information models and their

relation to multiple and heterogeneous representations, the following section applies a general

model theory to information models. A deductive approach is followed to derive specific

characteristics of information models from a general model theoretic perspective. Firstly, the

central concepts of a selected model theory are outlined and then related to aspects of

information models in terms of analogies. Finally, conclusions are derived in order to provide

further argumentation for the requirement of loosely coupled information models in large-scale

SOA landscapes.

4.5.2 Model of Conception and Information Models

The model of conception [169] provides a model theory which puts special emphasis on the

context dependency of models and its conception by an inherently subjective actor. The main

ideas of the model of conception point out that the constituting parts of a model are entities of

the real world conceived as a model by a subjective actor. That means that such entities only

exist as a model, in so far as they are being conceived as a model by a subject. Furthermore,

Semantic Mediation between Loosely Coupled Information Models in SOA

entities are related to other entities. The relation between one entity and the others is also part of

the conception of a subject and constitutes the context of the entity [170]. The following figure

illustrates the basic concepts of the model of conception:

Figure 4-6 Model of Conception [170]

The subject may be a human being, but also a group of people or any other actor which may be

expected to make a judgment. Furthermore, the content of the conception is determined by the

contextual relations of the entity [169].

This general model of conception can be applied to information models. With regard to

information models, the basic concepts of the model of conception such as subject, entity and

context can be interpreted as illustrated in the following figure:

Figure 4-7 Model of Conception Applied to Information Models

The subjective actor can be interpreted as a group of domain experts, who develop a specific

information model of a certain as relevant conceived domain. Moreover, the subjective actor

can be an application developer, who reuses the developed information model for a certain IT

application or a user in terms of a human being which reads and writes information within this

IT application. However, the user perspective is discussed later on in the next Section 4.5.3.

Furthermore, the entity can be viewed as an information entity e.g. in terms of a customer

address expressed in a schema representation or UML diagram on a sheet of paper. However,

the actual information model of the customer address only exists in the mind of the domain-

experts and multiple views on the in principle same information entity are possible. This can be

linked with the observed differences in information models as outlined above. Differences in

terminology for information entities and different modeling conventions can be the result, as

different domain-experts or groups of them have different conceptions. Moreover, the as

relevant conceived domain differs between different actors and consequently results in different

Chapter 4

possibly overlapping coverage of information models. Finally, the different granularity of

information models can be related to the model of conception. The extent to which a larger

information entity is subdivided into further information entities depends on what is conceived

as relevant to be expressed on the minimal granularity level, which consequently can differ

again between different domain experts.

However, so far mainly domain-experts as creators of information models have been discussed

and information models have been described as a conception of a domain. But information

models are not just created; they are also used or applied to a certain situation and therefore

have a specific purpose.

4.5.3 Constructive Model Relations and Information Models

This notion or differentiation of a model as being a model of something and at the same time

being a model for something is discussed in the context of the model of conception and has led

to the identification of so called constructive model relations [170] as illustrated in the

following Figure 4-8. Again, the idea is to briefly outline the general model theoretic concepts

and then interpret them in the context of information models.

Figure 4-8 Constructive Model Relations

As already stated in context of the model of conception, it is distinguished between the model

object or entity M and the purely mental model μ. Furthermore, in [170] it is distinguished

between the creation of a model and the application of it. These two actions constitute

constructive model relationships as they are established by a constructive act, which is either

thought or actually performed. Thus, on the one hand, the model creation is a constructive

model relationship starting from an initial object A which may be something observed or a set

of requirements and resulting in the actual model object M via some kind of abstraction or

selection. On the other hand, as well the model application is a constructive model relationship,

starting from the created model object M which may be a prototype or a reference object and

realizing the application in terms of an object B via a kind of production, role assignment, or

mapping.

Consequently, it can be stated that a model is related not only to something (A) of which it is a

model but also to something (B) for which it is a model [170].

Moreover, in order to be useful for the application to a specific situation or problem, the model

object must ensure to exhibit certain qualities which it transfers from something observed or a

set of requirements A to the application B and which is problem relevant. What is transferred in

this way via a model object is called the cargo of the model [170] as illustrated with the blue

Semantic Mediation between Loosely Coupled Information Models in SOA

arrows in Figure 4-9. The object M, which is conceived of as a model, cannot get its role as a

model and its function as the carrier of a cargo for a resulting object B unplanned or in an

accidental manner. The model object M has to be well chosen, produced, or evaluated as a

model in the conception of the judging subject, in such a way that it actually carries the

described cargo to serve as a model. Accordingly, the practice of modeling has to aim at

„working the cargo into the model object‟ during the model creating process in a way that the

cargo may also be transferred to the resulting object B in the course of model application [170].

Analog to the model of conception, which has been applied to information models, also the

model theoretic concepts of constructive model relations can be applied to the context of

information models. The following Figure 4-9 shows this conceptual instantiation. It further

integrates the distinction between different abstraction levels of information models (cf. Section

2.3.3) from the conceptual idea in the mind to the physical model as a representation of the

actual information.

Figure 4-9 Constructive Model Relations and Information Models

The two constructive model relationships constitute the information model design phase one the

hand and the information model application phase on the other hand. The analogy that a model

M is a model for something B explains why an information model is always for some specific

application or application domain according to a subjective conception.

Furthermore, the differentiation between the actual information model µ in the mind and its

representation through the model object M can explain, that in a semantic integration scenario

of two applications B and B‟ even two different mental information models may match, but the

corresponding representations e.g. in terms of different message schemas M and M‟ may not

enable seamless information exchange. This can be also linked back to the distinction between

different abstraction levels of information models (cf. Section 2.3.3) from the conceptual idea in

the mind, over the conceptual model, to the logical model and to the physical model as a

representation of the actual information. With each lower abstraction level further technical and

application specific conditions and constraints need to be reflected, which leads to the

heterogeneous occurrences of information models on the concrete representation level.

Coming back to the constructive model relationships, this explanation can be further

substantiated by the concept of a cargo which is transferred through the model object or here the

information representation format. In the context of information models that means, that an

Chapter 4

information model representation M might carry the relevant characteristics X of required

information for the purpose of application B, but when it should be applied to the purpose or

application of B´ this representation and corresponding cargo X might not be sufficient. In this

scenario the concrete information representation M carries X to be adequate for application B

but it does not carry X´ required for application B´.

4.5.4 Conclusions and Implications for Information Models

In summary, the application of the model of conception and the transfer of the constructive

model relationships to the realm of information models can be concluded as follows:

 The general argumentation that context dependency of models is an inherent

characteristic of models and a determinant factor to cope with complexity [171] is

particularly valid for information models as any statement taken out of context cannot

be understood. Only by putting it into a context provides it with meaning.

 Furthermore, the model theoretic perspective shows that context cannot be regarded as

an objective issue; rather it is determined by the subjective conception of an actor.

Applied to the realm of information models, this leads to the understanding that

information models of a certain domain are naturally not unique and uniform but

different depending on the different actors developing and using them.

 The last point is further stressed by the central statement, that a model is related not

only to something (A) of which it is a model but also to something (B) for which it is a

model [170]. Transferred to information models, this shows that information models are

not only dependent on the actual business domain on which they provide an abstraction,

but moreover this abstraction cannot be undirected but must be related to a specific

purpose or respectively to concrete IT applications. Consequently, due to their very

nature, information models cannot be general, unique and uniform but rather they are

diverse, heterogeneous and consequently are likely to occur as multiple possibly

conceptually overlapping variations.

This section has analyzed the context dependency of information models. It has argued that in a

multi-stakeholder environment such as given in the targeted large-scale SOA context, the

subjective conception provides the determent factor of information models. Therefore, from a

theoretical initial point, no general objective or in mathematical terms no canonical form of an

information model of a business domain exists per se but rather multiple different coexist,

according to the different conceptions of the involved stakeholders. This understanding

identifies the challenge of integrating the different conceptions to achieve semantic

interoperability.

As already outlined in Section 4.4, there are generally two strategies for the integration of

different conceptions or for bridging the semantic interoperability gap. On the one hand,

semantic integration can be targeted by aiming for a common and shared information model.

This requires the alignment of the different subjective conceptions into a uniform one which

suits all involved stakeholders. On the other hand, the divergent conceptions and resulting

information models can be accepted in their heterogeneity by keeping their independence and

integrating them in a flexible manner in terms of mediation between them. Such an approach

can be regarded as following the basic principle of loose coupling, which is known as an

effective instrument to cope with complexity in distributed systems. The following section

presents this approach that applies the concept of loose coupling to the semantic level. The

subsequent section then provides further analysis with regard to the research hypothesis of this

work and provides a comparison regarding effectiveness and efficiency between the here

Semantic Mediation between Loosely Coupled Information Models in SOA

presented approach of semantic mediation between loosely coupled information models and the

traditional alternative approach of semantic standardization.

4.6 Loose Coupling on the Semantic Level

The principle of loose coupling can be regarded as one of the fundamental concepts in the

evolution of computer system engineering. Notably, loose coupling is not a feature that is

necessary for a system to operate but rather it is essential to manage the complexity in a system

under change. However, it was originally introduced into organization science by Karl Weick.

In [172] loose coupling is described as a situation in which a series of stable subassemblies are

responsive but retain evidence of separateness, independency and identity. Furthermore, loose

coupling is evident when elements affect each other suddenly rather than constantly, negligibly

rather than significantly, indirectly rather than directly and eventually rather than immediately.

In organizational science the notion of loose coupling suggests that some classical principles of

administration like centralized control and rational planning may not be useful as often believed.

It emphasizes the limitations to administrator‟s abilities to shape the instructional process.

Taking into account the organizational aspects and challenges identified in the context of

semantic harmonization as analyzed in the previous two sections about limitations of

standardization for semantic interoperability in SOA (cf. Section 4.4) and about context

dependency of information models (cf. Section 4.5), the notion of loose coupling promises to be

an adequate mechanism to manage the complexity with regard to change of business semantics.

Especially in distributed systems and particularly in the field of SOA, the principle of loose

coupling represents a core mechanism to cope with complexity. The following figure illustrates

the basic idea of transferring the principle of loose coupling to the semantic level, namely from

loosely coupled services to loosely coupled information models:

Figure 4-10 Transfer of Loose Coupling to the Semantic Level

In SOA, loose coupling is applied to tackle the complexity of business IT alignment. Services

are loosely coupled to chains in order to realize business processes. Moreover, service users and

service providers are loosely coupled based on the abstract concept of a service. And finally,

services are decoupled from the actual IT components that provide the services to provide

further flexibility. In this sense loose coupling is mainly focused on control and maintenance

issues ensuring independent management of services under different ownership.

Chapter 4

However, when looking at information models used for message exchange between services a

rather tight degree of coupling becomes evident. For the sake of semantic interoperability

service users and providers usually are committed to a shared information model in order to

ensure that the messages exchanged can be understood and processed as indented. Aggregation,

association or inheritance relationships between information entities are dependent on an overall

conceptualization. Practically this means that information model representations of service

providers are imposed on service requesters, which can be regarded as tight coupling on the

semantic level. However, in distributed environments with limited centralized (i.e. top-down)

management authority the commitment of independent stakeholders to a shared information

model leads to the challenges discussed before.

In order to transfer the principle of loose coupling to the semantic level this section analysis the

principle of loose coupling in computer science in general and more detailed with a focus on

SOA and identifies the core characteristics that constitute this kind of architecture style. Then it

is discussed how and to which extent loose coupling can be applied to information models and

which conclusions can be derived from this.

4.6.1 The Principle of Loose Coupling in Computer Science

Firstly, selected definitions of loose coupling from the perspective of computer science are

discussed. Then the main motivations for loose coupling are outlined before the central

characteristics are extracted, in order to provide a basis for the transfer of the concept to

information models.

Definitions

Although the term loose coupling is widely used in discussions of computing and software

architectures the term has a relatively diffuse definition. However, there seems to be a general

convergence towards the following shared understanding [173]:

 In computer science, coupling or dependency is the degree to which each program

module relies on each one of the other modules.

 Loosely coupled systems are considered useful when either the source or the destination

module is subject to frequent changes.

A more analytical definition is provided in [174], where the degree of loose coupling is defined

along the three dimensions: Knowledge, Availability and Trust:

Semantic Mediation between Loosely Coupled Information Models in SOA

Figure 4-11 Dimensions of Coupling [175]

A module A is related or has a dependency to component or module B. Then module A is

considered loosely coupled with module B if [174]:

(1) Knowledge - A has minimal knowledge of B.

(2) Dependency on availability - A can work even when B (or the communication link to

B) is temporarily not available.

(3) Trust - B does not need to trust that A satisfies pre-conditions and A does not need to

trust that B satisfies post-conditions.

The extent or degree to which the modules in a system are coupled is regarded as a relative or

qualitative notion. Furthermore, these dimensions can be related to the functional distance

between modules, which can be used to define the adequate degree of coupling in a certain

situation. The following Figure 4-12 illustrates the relation between the degree of coupling and

the functional distance between different modules:

Figure 4-12 Degree of Coupling and Functional Distance [176]

The notion of functional distance can be also found in the concept of cohesion between or

within modules or components. In the context of structured analysis and more particularly in

context of structured design [177], cohesion is strongly related to loose coupling as usually

coupling is contrasted with cohesion. In structured design the concept of cohesion is defined as

the degree to which the internal contents of a module are related. And coupling is defined as the

degree to which a module depends upon other modules. Furthermore, cohesion is concerned

with the grouping of functionality into a set or a particular module. Thereby, functional closely

Chapter 4

related elements should be grouped into the same module whereas the boundary to other

modules should be clearly defined with a precise scope. In conclusion modules should be

designed according to strong cohesion regarding internal contents and loosely coupled with

regard to dependencies between different modules.

Motivation

As indicated by the definitions above, loose coupling is an attribute of IT systems with the aim

of reducing the interdependencies across modules or components. In particular, loose coupling

is motivated by reducing the risk that changes within one module will create unanticipated

changes within other modules which requires change in their implementation. This approach

specifically aims at increased flexibility while deploying, modifying, or replacing modules

without necessarily affecting other modules that communicate or share information with it.

Furthermore, loose coupling eases testing and maintenance because problems are easy to isolate

and unlikely to propagate. Moreover, the isolation and clear distinction between modules

simplifies to understand the semantic of modules. This fosters reuse as modules can be handled

separately and related or dependent modules do not need to be integrated in parallel.

Particularly, in SOA loose coupling provides the foundation that enables IT supported business

processes across technological heterogeneous IT landscapes. Loosely coupled services provided

on different platforms, which may be based on incompatible system technologies, can be joined

together in order to create composite services and thus realize business processes.

However, loose coupling is not universally positive. It is necessary to balance the trade-off

between utility and costs. On the one hand, loose coupling promotes more flexibility but on the

other hand these benefits cause drawbacks. For example if systems are de-coupled in time

using e.g. a message-oriented middleware as usual in SOA, it is difficult to also provide

transactional integrity [173]. Moreover, loose coupling increases costs in terms of additional

efforts for the integration. At least this applies initially due to the distributed approach and

finally regarding performance as additional communication efforts are required. To conclude, it

can be stated that it remains the art of the architect to find the adequate degree of loose coupling

in a concrete integration scenario [174].

4.6.2 Transferrable Characteristics of Loose Coupling

In the previous section the principle of loose coupling in computer science has been discussed.

The main focus has been put on decoupling IT functionality in order to cope with the

complexity of large and distributed IT systems. With regard to the goal of transferring the

concept of loose coupling from a well-known functional dimension to the semantic dimension

this section pinpoints the central aspects of loose coupling which need to be covered as well

when discussed in the context of loose coupling of information models. Based on the analysis

above, the following three central characteristics of loose coupling can be distinguished and

should be utilized as reference points for the transfer of the concept:

 Autonomy – Loosely coupled systems are based on decomposition into multiple

components or modules that are as autonomous as possible. The modules are self-

contained and discrete. The aimed independence between the modules thereby focuses

mainly on management, i.e. leaving the control of module‟s internal aspects to the

respective module owner.

 Flexibly binding – Loosely coupled modules are related to each other by means of

flexible binding mechanisms that allow to integrate the modules. In contrast to tightly

Semantic Mediation between Loosely Coupled Information Models in SOA

coupled systems, where different modules or components are statically bound, loosely

coupled systems exhibit dynamic or late binding, in order to increase independence and

the above described autonomy between different modules.

 Encapsulation – Loosely coupled modules are clearly defined and exhibit a precise

scope. Hence, loose coupling follows the principles of separation of concerns and

information hiding, i.e. keeping design and maintenance decisions at the module‟s

owner. Accordingly, module interaction is based on stable interfaces which ensure that

functionality can be used without being concerned about the other module's internal

implementation. Thus, the implemented application logic behind the interface can be

changed without affecting other modules that communicate or share data with it. In this

sense interfaces can be regarded as intermediary points between interacting loosely

coupled modules.

Taking furthermore into account the aim of developing a methodology on how to apply loosely

coupled information models in concrete SOA scenarios (cf. Chapter 5) here also the practical

perspective on how to apply the general concept of loose coupling should be given. Hence, the

general method of applying loose coupling focusing on the functional dimension can be

summarized as follows:

1) Decompose a single unit into multiple independent components or modules (autonomy).

2) Place an intermediary point by means of an interface between two interacting end points

(encapsulation).

3) Define concrete relations or interactions between different modules as late as possible in

terms of dynamic mechanisms (flexible binding).

Consequently, the following section continues with discussing how the concept of loose

coupling can be applied to information models.

4.6.3 Loosely Coupled Information Models

The transfer of the concept of loose coupling to the semantic level implies that the core

principles of loose coupling can be applied to information models. Therefore, this section

targets the question how the above identified central characteristics of loose coupling, namely

autonomy, flexible binding and encapsulation can be applied to the semantic level. In the

following a specification covering these three characteristics is provided.

Autonomy of Loosely Coupled Information Models

With regard to autonomy, it can be stated that loose coupling on the semantic level can be

identified when the potential information space is decomposed into multiple coexisting, self-

contained and independent information models. Analogical to loosely coupled functional

components the feature of independence is mainly related to the management of information

models which is under the control of different ownership. I.e. issues related to the life-cycle,

maintenance or evolution of information models are kept under the authority of the respective

information model owner. In particular, this implies that no cross-references exist between

loosely coupled information models such as generalization, specialization or aggregation of

entities of different origin. That would cause that the existence of one information model is

dependent on another which consequently contradicts the aim for autonomy.

Chapter 4

The approach of multiple independent information models takes into account the realistic point

of view that the existence of an overall generalized so called canonical information model

cannot be assumed in heterogeneous scenarios including autonomous actors and organizations.

Organizational boundaries in community processes for the development and maintenance of

information models and the complexity deriving from inherent domain-specific differences in

requirements force a coexistence of independently managed information models.

Consequently, it can be stated that autonomy can be reflected on the semantic level and

moreover forms an integral characteristic of loosely coupled information models.

Flexibly Binding of Loosely Coupled Information Models

Certainly, mutually independent and autonomous information models need to be interlinked and

related to each other in order to achieve semantic interoperability in a concrete integration

scenario. However, such integration similar to loosely coupled functional components does not

need to be permanent or static. But rather information models can be interlinked to each other in

a dynamic manner by means of mediation between them. Hence, semantic interoperability can

be achieved as differences in e.g. information representation or heterogeneous granularity levels

in cross-organizational scenarios are aligned in the moment when information needs to be

exchanged.

In order to achieve a high degree of flexibility the mediation mechanism has to cover the whole

spectrum of potential differences in information modeling. Furthermore, the mediation should

be kept as transparent as possible to the user by hiding the complexity and providing lightweight

maintainability. The original information models can remain unchanged and independent of

each other and thus ensure the autonomy between the different information models.

Accordingly, it can be concluded that the aspect of flexible binding can be transferred to the

semantic level and that a mediation mechanism is the constituting part that enables the flexible

binding between loosely coupled information models.

Encapsulation of Loosely Coupled Information Models

Analogically to loosely coupled modules in a functional architecture loosely coupled

information models as well should be clearly defined and exhibit a precise scope. The principles

of separation of concerns and information hiding can be reflected by loosely coupled

information models taking into account the different abstraction levels of information

representation (cf. Section 2.3.2). Consequently, loosely coupled information models should be

encapsulated on the most abstract (the conceptual abstraction level) to hide the concrete internal

representation on lower abstraction levels such as the logical or the physical abstraction level.

Accordingly, the conceptual abstraction level acts similar to an interface encapsulating the

internals of information models.

Decoupling in terms of mapping between the conceptual representation and the internal

realization ensures that internal changes do not affect the external representation of the

information model. Thus, the mediation mechanism can be based on the conceptual abstraction

level and internal changes of the information model do not affect other information models and

consequently independence is fostered. In this sense the conceptual abstraction together with the

mediation mechanism acts as the intermediary between interacting loosely coupled information

models.

To conclude it can be stated that as well encapsulation can be reflected on the semantic level.

The main idea is that the conceptual abstraction level of information models provides an

encapsulation of information model internals and thus enables their loose coupling.

Semantic Mediation between Loosely Coupled Information Models in SOA

Definition of Loosely Coupled Information Models

Based on the above discussion it can be concluded that the major characteristics of loose

coupling can be transferred to the semantic level. Reflecting the above derived analogies the

following definition of loosely coupled information models can be given:

Loosely coupled information models:

a) are mutually independent and self-contained information models that exist and are

managed autonomously.

b) exhibit flexible binding as they can be dynamically interlinked to each other to ensure

semantic interoperability in a concrete integration situation based on a mediation

mechanism.

c) are encapsulated by an external representation on the conceptual abstraction level that

hides the internal representation on the logical or physical abstraction level to prevent

unwanted change propagation of internals.

The following Figure 4-13 illustrates the definition and highlights the main conceptual parts of

loosely coupled information models:

Figure 4-13 Definition of Loosely Coupled Information Models

4.6.4 Limitations in the Transfer of Loose Coupling and Open Issues

As examined above the concept of loose coupling as used in functional architectures of IT

systems can be transferred to the semantic level in terms of loosely coupled information models.

However, besides the analogies regarding the major aspects of loose coupling as well

limitations of the transferability have to be noticed.

Usually, in loosely coupled functional architectures the functionality of the IT system is

distributed among the independent functional components or modules in a disjunctive manner.

Each module is defined by a clear scope in order to avoid overlapping responsibilities for

certain functionality. However, regarding loosely coupled information models such a distinct

distribution of scope is not reflected. In contrast, loosely coupled information models tend to be

conceptually overlapping on purpose in order to allow the modeling of the same domain or parts

of related domains from different perspectives (cf. Section 4.5). This coexistence ensures that

Chapter 4

information models can be independently designed according to the information model owner

specific needs. Then, the overlapping parts of the loosely coupled information models need to

be addressed in a concrete semantic integration situation, which should be considered by an

appropriate mediation mechanism.

Furthermore, it has to be noticed that in the above discussion and definition of loosely coupled

information models the mediation mechanism enabling the loose coupling plays a central role,

although it has not yet been particularly specified. In order to specify the mediation mechanism

its core characteristics need to be defined. Taking into account the research hypothesis of this

work (cf. Section 1.3) the main targeted characteristics of the semantic mediation mechanism

are effectiveness and efficiency. Consequently, the next section analyzes the role of

effectiveness and efficiency and the relation between them, in order to derive a more concrete

idea on how the semantic mediation mechanism should be designed.

4.7 Trade-off between Effectiveness and Efficiency

Effectiveness and efficiency in achieving semantic interoperability in large-scale SOA

landscapes are two fundamental factors that need to be distinguished. On the one hand,

effectiveness plays a central role in the way semantic interoperability is achieved. Each

proposed solution is subject to certain explicit or implicit assumptions such as a high or low

consensus degree enabling or limiting standardized canonical information models across

organizational boundaries. Taking into account the fact that achieving semantic interoperability

not only involves technical aspects but as well exhibits a challenging organizational dimension,

the question regarding feasibility of a proposed solution in practice needs to be addressed. On

the other hand, the efficiency of the mechanism for achieving semantic interoperability strongly

influences the degree to which semantic interoperability is achieved. The question is how the

complexity of semantic heterogeneity is tackled and how much manual integration efforts need

to be invested in order to bridge divergent information representations.

Often effectiveness and efficiency are competing goals. However, a sufficient solution for a

mediation mechanism resolving the semantic interoperability problem needs to provide both

qualities. In the following, three different mechanisms for semantic mediation are presented.

They are examined with regard to the question on how they address effectiveness and efficiency

and how they relate to the trade-off.

4.7.1 Point-to-Point Mediation

The first approach is based on a direct application of the concept of independent information

models and mutual mediation between them (cf. (2) in Section 4.4.1). Each organization

manages their own internal information model, which may be implemented by state-of-the-art

technologies such as XML Schema or SQL database schemata to be located on the logical or

physical level. This internal information model is encapsulated by a conceptual model, which

can be realized in terms of an ontology (cf. Section 3.6). Accordingly, the main semantic

integration takes place in the mappings between the different external information models based

on ontology mappings. Consequently, a point-to-point mediation between the heterogeneous

information models is required. The following Figure 4-14 illustrates this approach:

Semantic Mediation between Loosely Coupled Information Models in SOA

Figure 4-14 Point-to-Point Mediation

The approach bears the advantage of loosely coupled information models as semantic

integration is performed on the conceptual level instead of the technical level. In the context of

SOA this can be enabled by Semantic Web service technologies (cf. Section 3.4). Addressing

the semantic integration problem on a higher abstraction level responds to the high complexity

of integration by separating semantic integration from technical issues. Thus, the integration

efforts can be reduced.

Furthermore, it can be stated that the presented approach provides an effective solution.

Meaningful information exchange between the different systems and applications using the

heterogeneous information models can be ensured in terms of mappings between the ontologies

and thus the goal of semantic interoperability can be achieved.

However, point-to-point mediation features an essential drawback regarding efficiency.

Although a single semantic integration between two interacting organizations can be addressed

with reasonable efforts as described above the overall semantic integration costs for the whole

ecosystem are much higher. The point-to-point mediation implies potentially mappings

to ensure semantic interoperability between all organizations. This means that the approach of

point-to-point mediation has a quadratic complexity. Therefore, the approach does not scale as

integration cost are increasing disproportionally when further organizations need to be

integrated into the ecosystem. However, the flexible integration of new business partners into

cross-organizational business processes constitutes a key promise of SOA.

The point-to-point approach represents an extreme variant with regard to the granularity of

mediated partners. The following approach presents the opposite variant in order to set the range

of possible options in this trade-off between effectiveness and efficiency.

4.7.2 Pivot Ontology based Standardization

The second approach follows the idea of a standardized canonical information model (cf. (1) in

Section 4.4.1). One global information model is shared. This can be specified on the conceptual

level e.g. in terms of an ontology, in order to be independent of the different heterogeneous

internal information models of the involved organizations. Consequently, the main semantic

integration takes place between the internal e.g. schema-based information models and the

global canonical information models in terms of a so called pivot ontology. As the mediation

needs to cover both directions: from the internal lower abstract information model to the

external higher level and vice versa, the mapping can be also referred to as lifting and lowering.

Thus, a message sent from organization A is firstly lifted to the shared pivot ontology and then

lowered to the appropriate internal information model used in organization B. This approach

represents the dominant semantic integration approach in conjunction with ontologies and can

be found in the majority of research projects applying Semantic Web technologies for

information integration (cf. Section 3.6 and Section 5.9.3). The following Figure 4-15 illustrates

the semantic standardization approach based on a pivot ontology.

Chapter 4

Figure 4-15 Pivot Ontology based Standardization

In contrast to the previous approach semantic standardization based on a pivot ontology just

requires mappings (or liftings/lowerings). Each organization‟s internal information model has

to be mapped only once bidirectional to the pivot ontology to ensure semantic interoperability

during information exchange. Thus, from the perspective of required mappings it can be stated

that the approach offers an efficient solution.

However, with regard to effectiveness the already discussed limited feasibility of standardized

information models in cross-organizational SOA landscapes has to be taken into account (cf.

Section 4.4). As new services are added to the architecture, extensions to the common message

model are needed to meet their specific data requirements. As more services are added the

model grows, complexity rises and it becomes difficult to understand. Moreover, conflicting

requirements lead to inconsistencies and required compensation mechanisms. From an

analytical point of view consensus for such a canonical information model in terms of a pivot

ontology, requires actors to align their different requirements for information exchange with

external partners. The mutual coordination of actors potentially implies coordination

tasks, which also lead to quadratic complexity. Due to the disproportionate coordination

complexity some actors may leave the consensus finding process. Thus, the goal of achieving

semantic interoperability within the ecosystem covering all actors is missed. This drawback is

also described in [178], which states that it is increasingly unlikely that a single ontology will

both adequately capture the domain in question and also be consensual among all interested

parties. Consequently, this non-scaling coordination complexity hinders an effective solution.

Hence, the approach based on a pivot ontology provides a sufficient solution with regard to

efficiency, however it has shortcomings regarding effectiveness. Basically, it provides a shift

from integration efforts for e.g. mapping development to organizational coordination efforts

required for alignment of the different perspectives of the involved actors.

4.7.3 Semantic Mediation on Domain Level

Having presented the two opposing approaches for achieving semantic interoperability within

the trade-off between effectiveness and efficiency, the following approach represents a

combination of the two approaches, which tries to exploit the respective advantages and to

minimize the discussed drawbacks.

The limiting factor of the first approach has been inefficiency with regard to high integration

cost caused by too many required mappings. The second approach has exhibited shortcomings

in effectiveness with regard to high organizational complexity as too many actors need to be

coordinated. Taken this into account, the approach of semantic mediation on domain level tries

to minimize the required mappings as well as the involved actors to be coordinated. The main

idea of this twofold reduction lies in the aggregation of distributed information models to a set

Semantic Mediation between Loosely Coupled Information Models in SOA

of multiple domain information models. Consequently, the semantic mapping can be targeted at

the domain level. The following figure illustrates this approach:

Figure 4-16 Semantic Mediation on Domain Level

The domain models do not necessarily have to be disjoint as in practice domains inherently

overlap and community- and application-specific requirements have produced and will always

produce different conceptualizations for one and the same problem (cf. Section 4.5). However,

in order to achieve a smaller number of actors the scope of a pivot ontology is limited to the

domain level. Instead of aiming for one unique shared pivot ontology, the coexistence of

multiple domain-specific pivot ontologies is intended. Consequently, the reduction of actors can

limit the complexity in the consensus finding process to a feasible extent.

Thereby, the notion of a domain can be understood relatively flexible e.g. in terms of a selected

area of activity. The concrete specification of a domain depends on the actual application

context. Thus, on the general level required for this approach the concept of a domain is used in

context of domain information models that cover a limited scope but however contain or span

across multiple individual organizational information models.

Accordingly, semantic interoperability within a domain is achieved by means of a domain

ontology. The domain ontology acts as a pivot information model to which the heterogeneous

internal information models of the different organizations are mapped in terms of liftings and

lowerings. Aggregated over the ecosystem of organizations this results in mappings analog to

the efficient but however not effective approach of overall semantic standardization based on

one single pivot ontology.

In order to achieve semantic interoperability not only within each domain but across the

ecosystem of organizations a further step is necessary. The required semantic mediation

between the domain information models can be enabled by means of ontology mappings as

represented by the dashed arrows in Figure 4-16. The semantic integration efforts for these

mappings also have quadratic complexity. However, the problem size has been reduced

substantially. Considering domains, where is much smaller than the counting the

number of organizations, the semantic mediation between the domain information models

requires mappings. The aggregation of total mapping efforts for this approach results in

a complexity of .

Consequently, the approach of semantic mediation on domain level reduces the complexity as

follows:

 from mappings (point-to-point mediation)

 or from mappings combined with coordination efforts (single pivot-ontology

based standardization)

 to mappings combined with

coordination efforts

Chapter 4

Thereby represents the number of domains which is much smaller than the number of

organizations or actors . Furthermore, it has to be taken into account that the coordination

efforts for semantic mediation on domain level can be performed in parallel in each domain. To

summarize it can be stated that the approach combines the advantages of the before discussed

approaches and minimizes their drawbacks. On the one hand, the amount of required semantic

mappings is minimized and on the other hand, the number of involved actors to be coordinated

is reduced.

4.7.4 Alleviation of Trade-Off between Effectiveness and Efficiency

However, the approach of semantic mediation on domain level just provides a shift within the

trade-off between effectiveness and efficiency and cannot resolve it in principle. Nevertheless,

the adequate application of Semantic Web technologies can enable an alleviation within the

trade-off. The following qualitative analysis illustrates this argumentation:

Figure 4-17 Effectiveness and Efficiency Gain

The two opposing approaches of point-to-point mediation and pivot ontology-based

standardization lack either sufficient effectiveness or efficiency. Considering the two options in

a two dimensional projection they span a line of possible combinations within the trade-off

between the two competing goals. Starting from the pivot-ontology based approach the number

of actors which need to be coordinated can be reduced. This leads to higher effectiveness as the

feasibility of consensus is increased but to the cost of less efficiency as additional efforts for

mappings between multiple pivot ontologies are required. Hence, a shift on the line of possible

combinations from the lower-right corner up to the center is performed. On the other hand

starting from the point-to-point mediation approach the conceptual information models of

multiple actors can be united. Thus, the number of point-to-point mappings between the aligned

conceptual information models becomes obsolete and thus the total number gets reduced;

however to the cost of coordination complexity which is now required for the consensus finding

process. Accordingly, this can be represented as a shift on the line of possible combinations

from the higher-left corner down to the center.

Besides providing such a combination on the line between the two opposing approaches,

semantic mediation on domain level provides an additional alleviation within the trade-off.

Exploiting Semantic Web technologies in terms of ontologies for representing the external

Semantic Mediation between Loosely Coupled Information Models in SOA

conceptual information models and ontology mapping to enable the loosely coupled semantic

mediation mechanism brings the following advantages:

 Firstly, semantic mediation is targeted on the conceptual level. Consequently, the higher

abstraction level reduces mapping efforts substantially as technical issues can be

handled with enhanced transparency.

 Furthermore, efficiency is improved as the mediation between different heterogeneous

representations in overlapping conceptualizations is only done once on the domain level

instead of performing it repeatedly on the technical level in concrete application or

process scenarios. Hence, ontology mappings once developed can be reused for various

integration scenarios between the involved domains. Additionally, the process expert

can focus on process specific concerns and can leave the task of semantic mediation to a

domain experts thus taking further advantage of separation of concerns.

 Regarding effectiveness, the formal nature of conceptual models instantiated by the

exploitation of ontologies eases the coordination and consensus finding process.

Semantics can be discussed on an unambiguous and explicitly defined level

irrespective of technical issues. For this purpose ontologies provide an effective

instrument to domain experts to specify clear conceptualizations and communicate them

in the development process of domain information models.

In this sense the focus on the domain level for addressing the semantic mediation challenge can

be understood from two perspectives. Firstly, the domain level refers to the scope covered by

the loosely coupled information models. And secondly the domain level further refers to the

conceptual nature of a domain model supporting the shift of abstraction level for semantic

mediation as described above.

Having discussed the general advantages of Semantic Web technologies for the approach of

semantic mediation on domain level the following section discusses how particular features of

this technology can be exploited to enable the mediation mechanism between domain

ontologies.

4.8 Semantic Bridges for Loose Coupling of Domain

Ontologies

The semantic mediation mechanism can be based on the general idea of semantic bridges as

introduced in [146]. Semantic bridges provide a framework for ontology mappings as described

in Section 3.6.2. They are utilized to describe the relations between concepts defined in different

ontologies which are not shared but which intuitionally have an equal or similar meaning.

Furthermore, semantic bridges define mappings, i.e. a translation between these concepts to be

used in concrete information flows across heterogeneous ontologies. In [127] a matching

function is formally defined. The matching function takes two different schemas and provides a

mapping between them. A semantic bridge represents the output of such a matching function

and thus describes the mapping.

The ontology mappings of semantic bridges can be either manually developed or (semi)

automatically by utilizing ontology mapping tools such as presented in [179], [140]. Such

mapping tools reuse the approaches developed in the context of schema mapping, on which

[127] provides an overview. However, the extent of automation is limited as the identification

Chapter 4

of similar meaning of separately defined concepts is often context-based and human

intervention in the matching process is necessary (cf. Section 3.6.2).

Considering these context dependencies and different granularity levels in representations,

defining these mappings to create semantic bridges is a complex task. In particular, semantic

bridges should cover general semantic correspondences between domain ontologies rather than

covering specific application logics. This means that concepts from different domain ontologies

might be regarded as matching in a specific application context; however this relation does not

hold in other cases taken into account the overall domain perspective. Therefore, only those

concepts should be mapped that semantics generally match, i.e. in all application contexts where

those concepts are involved.

After having found matching concepts and properties the matching results i.e. the identified

alignments need to be formally defined (cf. Section 3.6.2.). In the context of database schemas

these mappings can be formulated e.g. as queries or views (cf. Section 3.5.1). In the context of

XML-based approaches these mappings can be implemented in terms of structure-based

transformation languages such as XSLT (cf. Section 3.2.2).

However, in the context of ontologies based on formal description logic it is beneficial to

express these mappings as well with description logics. In order to serve the given

circumstances of heterogeneous and independent domain ontologies the added value resulting

from the application of Semantic Web technology lies in exploiting specific features of this

technology. Three such features are the following which are described in more detail below:

 Generalization and Polymorphism

 Facet Analysis Classification

 Declarative Rule-based Entity Manipulation

4.8.1 Generalization and Polymorphism

Ontology languages strongly support the concept of generalization and inheritance. A concept

and in particular a class can be defined as a subconcept of another and thus inherits the whole

specification of its superconcept and can further specialize it. Consequently, ontology languages

support the concept of polymorphism as instances of a subclass are as well instances of their

superclass (cf. Section 3.3.2). This is in contrast to purely XML based approaches, where

language concepts such as real inheritance support is missing (cf. Section 3.2.3). As it has been

argued in [181], the static type bindings do not allow for effective polymorphism in XML. A

given XML element is marked by only one specific tag, hence, in fact it has to remain

monomorph. I.e. instances cannot be processed in a polymorph manner, because the processing

of XPath or XSLT, etc. can only match a unique tree path of tags or respective attributes. For

example an XPath expression /root/vehicle cannot match an instance tagged by <car>, even if

it provides all features of a vehicle. This fact can be ascribed to the structured-based approach of

XML processing where a consequent polymorph type system is missing.

However, polymorphism provides an essential advantage taken into account the requirements

posed by semantic mediation between overlapping domain ontologies. During the mediation

process inheritance relations can be utilized to describe the relations between corresponding

concepts defined in different ontologies. Furthermore, as information flow has to be realized

across domain borders the mediation approach has to ensure that an information entity

expressed as an instance of a concept from one domain ontology can be as well processed as an

instance of the corresponding concept from the other overlapping ontology. Thereby,

polymorphism can be exploited during the mediation process as instances of corresponding

Semantic Mediation between Loosely Coupled Information Models in SOA

concepts from different ontologies can be handled as the same entity providing a polymorphic

representation that conforms to both concept definitions. Hence, the different perspectives on

the information space reflected in the different domain ontologies can be technically

materialized by the exploitation of a polymorph type system for information entities.

4.8.2 Facet Analysis Classification

A further essential feature of Semantic Web technology, which can be exploited for the

mediation mechanism, is the ability of facet analysis classification. Facet analysis classification

can be regarded as one of the core concepts of ontology design. A concept or class is defined in

terms of so called facets each member of the set of instances it describes needs to provide. Here,

facets stand for clearly defined, mutually exclusive, and collectively exhaustive aspects,

properties, or characteristics of a class or specific subject [182]. Any individual featuring such a

specific set of facets is then classified as an instance of the class. This flexible classification

approach based on the theory of facet analysis can be dated back to the work of the Indian

librarian Ranganathan, who developed a faceted classification scheme in the 1930's [183]. This

property-focused classification can be found in OWL in terms of class descriptions through

property restrictions [184]. Such classes describe a set of instances or individuals that satisfy the

restrictions. Thus, an OWL reasoner is able to analyze individuals and if they provide the

properties declared in the restriction, they are classified as a member of the class. Consequently,

applying facet analysis classification to information entities in a semantic integration scenario

can support polymorph entity handling as described above. Whenever an instance of a concept

from one domain ontology satisfies as well the property restrictions of the corresponding

concept from the other overlapping ontology, this instance can be classified additionally to the

type of the corresponding concept and thus becomes polymorph.

4.8.3 Declarative Rule-based Entity Manipulation

Another feature of Semantic Web technology, which is exploited in the developed mediation

mechanism of this work, are description logic-based rules to express semantic bridges.

Basically, semantic bridge rules are specified in such a way that they infer in terms of

declarative entity manipulation a polymorph representation. The polymorph representation then

provides the basis to allow the seamless information processing across different domains as

described above. This ability of declarative rule-based entity manipulation fits well to the

targeted approach of exploiting polymorphism for the mediation mechanism. Consequently, in

the context of semantic mediation on domain level rule-based bridging axioms provide the

option of choice besides other approaches discussed in Section 3.6.2 such as views and queries

or dedicated mapping ontologies. Moreover, the advantage of the bridging axiom approach is

that the use of a rule language does not only allow for describing the semantic relationships:

Given a suitable inference engine also it allows for performing the transformations between

related concepts. Then the reasoning of the described mappings can be applied straight forward

as the transformation rules can be integrated into the regular ontology inference process such as

classifying etc. Thus, no additional specific inference engine is needed for the mapping process

such as required in the approach based on a particularly designed mapping ontology.

To summarize it can be stated that the semantic mediation mechanism can be based on the

ability of Semantic Web technologies to express generalization and polymorphism and to infer

this polymorphism by a combination of facet classification and declarative rule-based entity

manipulation. In the following section these three features are combined to describe the concrete

operation of a rule-based semantic bridge, which materializes the mediation mechanism

between loosely coupled information models.

Chapter 4

100

4.8.4 Operation of Semantic Bridges

The operation of semantic bridges is illustrated based on a simple example of two concept

definitions: Address and PostalAddress. It is assumed that the two concept definitions although

representing the same conceptual idea have been defined independently and by different domain

experts in separate domain ontologies. The following Figure 4-18 shows an outline of these

heterogeneous information models, which can be formalized in the OWL ontology language

using defined classes (cf. Section 3.3.2). I.e. the concept definitions follow the principle of facet

analysis classification and are formulated as a set of required properties that characterize this

particular concept. Thereby, the important aspect is that the concept definitions differ in their

semantic sub-graph as illustrated in the following Figure 4-18:

Figure 4-18 Heterogeneous Domain Information Models

Obviously, information entities instantiating either the Address or PostalAddress concept

presented above cannot be exchanged between communicating partners by default, although

they represent the same conceptual idea. For instance, in domain information model A the

concept Address among others is defined in terms of four properties, whereas the property

hasName refers to a further concept Name which is defined in terms of two properties. In

contrast in domain information model B the semantically equal concept PostalAddress just

features three properties containing the same information, however defined at a lower level of

granularity. For example only one property hasStreetAddress is representing the street name and

street number together and hasRecipient is representing the full name. Moreover, the two

representations do not only differ in granularity, they also show a structural difference in the

semantic sub-graph as the first ontology encapsulates Name as an extra concept, whereas the

second ontology does not.

Consequently, the task of the semantic bridge is to provide a mechanism that mediates between

these different concepts, so that instances of these concepts e.g. input parameters or output

parameters of Web services can be seamlessly processed despite of their different

representations. As mentioned before semantic bridges can be expressed as rules which are

based on description logic implications so called forward-chaining rules. The following Figure

4-19 illustrates the first step of the semantic bridge operation where such rules are applied to

infer additional properties.

Semantic Mediation between Loosely Coupled Information Models in SOA

101

Figure 4-19 Semantic Bridge Operation (Step 1)

By applying the semantic bridge rules an instance of type Address is furnished with additional

properties e.g. with hasRecipient combining the values of the hasGivenName and the

hasSurname properties from the entity Name as illustrated in Figure 4-19. Analogically

hasStreetAddress is inferred, which combines the values of the Address properties hasStreet and

hasStreetNumber.

In a second step having the class definitions on hand, a reasoner is now able to classify the

instance as a member of the defined class PostalAddress, since all required properties

hasRecipient and hasStreetAddress are present. The following Figure 4-20 illustrates the second

step of the semantic bridge operation:

Figure 4-20 Semantic Bridge Operation (Step 2)

Thus, within the scope of a business process any service, independently to which domain it

belongs, can now make use of this transformed and reclassified instance as it is polymorph of

type Address and PostalAddress, i.e. semantic interoperability has been established.

4.8.5 Benefits of Developed Approach for Semantic Bridges

Firstly, the declarative approach enables the targeted separation of concerns. Accordingly, the

absence of technical transformation code increases maintainability of the semantic bridges and

thus improves efficiency.

Regarding effectiveness it can be noticed that using rules for describing semantic bridges

enables expressive mappings including as description logic-based rules are computationally

complete. Hence, any kind of mapping relation (one-to-one, one-to-many, many-to-many) can

Chapter 4

102

be described. A many-to-one mapping describes the case where two or more concepts from the

first ontology need to be taken to represent one concept in the second ontology.

Correspondingly, a many-to-many mapping can be understood. For example the presented

mapping between the concept Address and the concept PostalAddress is actually a many-to-one

mapping because not only the concept Address but also the concept Name, which it refers to are

taken both to be mapped to the concept PostalAddress.

Moreover, the developed mediation mechanism features following advantage over other

approaches applying bridging axioms for ontology mapping such as presented in [147]. By

combining description logic based rules with facet analysis classification semantic bridges can

be directly applied to instances of e.g. Web service outputs and do not need to be transformed

into new instances of a so called merged ontology. Hence, during the mediation, the object

identity can be kept and further processing of the original parameter is enabled. Accordingly,

different actors of different domains, e.g. in terms of interacting services in a cross-organization

business process, can have different views on the same service parameter realized through

polymorphism.

However, in this context it should be noticed that the additional inferred properties create

redundancy in the description of service parameters. Thus, in case of interacting services just

read-only operations on the service parameters can be performed without the risk of

inconsistency between the redundant descriptions. Nevertheless, parameter manipulation is not

hindered by the approach. Before a service changes property values, it should be ensured that

only the properties defined in the service‟s domain ontology are present. Then in case of further

service interactions again the application of semantic bridges should start from the beginning.

This matches well with the characteristic of flexible binding in the definition of loosely coupled

information models (cf. Section 4.6.3), which states that semantic mediation should be

dynamically performed just right in the moment when information needs to be exchanged.

Furthermore, due to the nature of rule-based inference engines an additional benefit of the

presented approach for realizing semantic bridges lies in its ability to easily derive transitive

mappings. That means, that e.g. if domain ontology A is mapped to domain ontology B and

domain ontology B is mapped to domain ontology C, then also A is automatically mapped to C.

This benefit can further improve the efficiency of the semantic mediation mechanism on domain

level as the number of required semantic bridges is further reduced.

The presented approach for realizing semantic bridges enables to express and perform mappings

between concepts from different domain ontologies that cannot be expressed by common

ontology language constructs. However, this does not imply that semantic interoperability is

only ensured by the integration of semantic bridges. Common ontology language constructs

such as equality or inheritance relations between concepts of e.g. top level ontologies and

domain ontologies can also contribute to semantic interoperability. Such concepts from top level

ontologies are equally used by different organization even originating from different domains.

Thus, the standardization of these concepts which are not business context dependent ensures

semantic interoperability for commonly used information entities. This is for example the case

when using top level ontologies describing general aspects of Semantic Web services (cf.

Section 3.4.2). This combination refers back to the hybrid ontology approach discussed in

Section 3.6 about semantic information integration with ontologies as illustrated Figure 3-24.

From this perspective semantic bridges provide the horizontal connections between the different

local ontologies, whereas the vertical connections to the shared ontologies can be expressed

with common ontology language constructs.

Finally, it should be noted that the here developed approach for semantic bridges as an

instantiation of the semantic mediation concept has been presented and published at the

Semantic Mediation between Loosely Coupled Information Models in SOA

103

European Semantic Web Conference 2008 [185] describing as well the opportunities for its

application within the SOA lifecycle as discussed in the following chapter.

4.9 Summary and Reflection

This chapter has presented the developed concept of semantic mediation between loosely

coupled information models in SOA. The concept has been derived from several approaches

analyzed in the state-of-the-art in Chapter 3 focusing on information integration and Semantic

Web approaches in combination with concepts from service-oriented architectures. The

principle idea has been to transfer the concept of loose coupling to the semantic level.

According to the hypothesis of this work, the approach aims at providing not only an efficient

but moreover an effective solution to the problem of semantic interoperability in large-scale

SOA, which goes beyond dominant current Semantic Web service-based approaches.

In order to set the requirements for the solution targeted by the concept of semantic mediation

between loosely coupled information models in SOA, the chapter has firstly listed required

qualities which need to be covered by the developed concept. The main requirements which

have been identified are: separation of concerns regarding business process expertise and

between semantic and technical issues, anticipation of organizational boundaries to agree on and

share information models across multiple domains, to address semantic integration on the higher

abstract conceptual level, to remain consistent with best practice SOA methodologies, i.e.

starting with business analysis leading to process models and keeping the process planning to a

human process expert, and finally anticipating technological path dependency and thus to build

upon existing technologies and standards of the World Wide Web.

Subsequently, the general idea of the developed concept has been outlined in order to provide an

overview of the central aspects. These include mainly the shift from monolithic to loosely

coupled information models combined with the approach to address semantic integration on a

higher abstraction level. Thereby, the notion of a shift to a higher abstraction level has been

elaborated along two dimensions: On the one hand, semantic integration is shifted from the

schema or structure-based logical abstraction level to the conceptual abstraction level. And on

the other hand, semantic mediation is performed on domain level instead of recurrently on

application or process level. The following sections then have refined the general idea and

provided detailed descriptions and argumentations of the respective conceptual parts.

Firstly, the limitations of standardization with regard to semantic interoperability have been

discussed to point out the demand for loosely coupled information models. In order to clarify

the understanding of semantic standardization, this general approach has been put in contrast to

the targeted approach of semantic mediation for bridging the semantic interoperability gap.

Furthermore, the differences between the advantages of technical standards in contrast to

implications of semantic standards have been shown. On the one hand, technical standards

principally increase productivity by rationalization as integration and adaptation costs for

different interfaces, protocols and coding formats can be reduced. On the other hand, standards

on the semantic level targeting a universal scope lead to inflexible monolithic information

models, which cannot adequately respond to changing business requirements. Taking this into

account, analogies between general standardization processes and semantic standardization

could be derived. It has turned out that similar to technologies which face high innovation and

frequent changes the consensus degree for agreeing on semantic standards is limited.

Consequently, it could be concluded that the adequate scope of semantic standards has to be

Chapter 4

104

restricted to an organizational feasible extend and should not target the level of universal

adoption.

Having derived a first rational for multiple coexisting information models, the subsequent

section has further substantiated the argumentation by analyzing the underlying reason for

context-dependency of information models. Therefore, a general model theoretic perspective

has been presented that applies a model of conception to information models. The main

conclusions have been that context dependency of models is an inherent characteristic of models

and a determinant factor to cope with complexity, which is particularly valid for information

models. Furthermore, the model theoretic approach has shown that the context cannot be

regarded as an objective issue but is determined by a subjective conception. Consequently,

information models of a certain domain are not unique and uniform but may differ dependent on

the different actors developing and using it. Finally, the analysis has shown that information

models are not only related to their domain but also to a specific purpose or respectively to

concrete IT applications. Thus, due to their very nature, information models cannot be general,

unique and uniform but rather they are diverse, heterogeneous and occur as multiple possibly

overlapping variations.

Taken into account this requirement for the coexistence of multiple overlapping information

models, the need for a mechanism to overcome the heterogeneous conceptualization in a

concrete integration scenario has been pointed out. Based on these findings, the transfer of the

principle of loose coupling to the semantic level has been discussed. Three fundamental and

transferrable characteristics of loose coupling have been extracted from an analysis of different

definitions of the functional dimension of loose coupling, namely: autonomy, flexible binding

and encapsulation. Transferred to the semantic level, autonomy of loosely coupled information

models can be understood as the independence of their management which is under the control

of different ownership. Furthermore, flexible binding of information models describes the non-

permanent but dynamic nature of the coupling mechanism between two information models.

Finally, as well encapsulation is reflected on the semantic level. The main idea for this point is

that the conceptual abstraction level of information models provides an encapsulation of

information model internals and thus enables their loose coupling independently from internal

changes or representation formats. Based on this discussion, a definition of loosely coupled

information models has been provided that reflects these three central aspects.

As the main goal of this work is to contribute to a solution for the semantic interoperability

problem in SOA with particular focus on effectiveness and efficiency, the subsequent section

has reflected the proposed conceptual solution on this regard. A trade-off between effectiveness

and efficiency has been pointed out. On the one hand, an extreme application of loose coupling

given by means of a point-to-point mediation of organization‟s information models provides an

effective solution but does not scale efficiently as in principle integration costs increase with

quadratic complexity. On the other hand, the opposite approach of semantic standardization via

a single pivot ontology provides an efficient but no effective solution. Efficiency is ensured as

only linear costs for mapping each organization‟s information model to the pivot ontology are

required. However, the required coordination efforts for consensus finding between multiple

actors lead as well to quadratic complexity. Thus, it has been shown that the pivot ontology

approach just provides a shift from operational integration efforts for seamless information flow

to organizational coordination efforts required for the alignment of the different perspectives of

the involved actors. In order to overcome the drawbacks of the analyzed approaches as well as

exploiting their respective advantages, a combination of the two semantic integration strategies

has been presented. Basic idea of this derived balanced approach is to target semantic mediation

on the domain level. Thus, firstly the required number of semantic mappings is reduced as only

mediation between different domains instead of between each single organization has to be

Semantic Mediation between Loosely Coupled Information Models in SOA

105

covered. And secondly, the involved actors are reduced, as with the limited scope of the

standardized domain information models as well the number of involved actors can be limited

to a feasible extent.

However, even if providing a more balanced approach, it has been argued that the approach of

semantic mediation on domain level just provides a shift within the trade-off between

effectiveness and efficiency and that the approach cannot resolve it in principle. Nevertheless, it

has been shown that the dedicated application of Semantic Web technologies as exploited in the

concept of loosely coupled information models enables an alleviation of the trade-off. As

semantic mediation is targeted on the conceptual level, consequently mapping efforts are

reduced substantially as technical issues can be handled with enhanced transparency. Moreover,

efficiency is improved, as the mediation between different heterogeneous representations in

overlapping conceptualizations is only done once on the domain level instead of performing it

repeatedly in concrete application or process scenarios. Regarding effectiveness, it has been

argued that the formal nature of conceptual models instantiated by the utilization of ontologies

eases the coordination and consensus finding process.

Finally, the chapter has presented the developed semantic mediation mechanism which enables

the proposed concept of loose coupling of information models in SOA. The semantic mediation

mechanism instantiates the general idea of semantic bridges based on selected Semantic Web

technologies, in particular ontologies and description logic rules. The requirements for the

semantic bridges to target loose coupling of domain ontologies have been addressed by

exploiting features such as generalization and polymorphism, facet analysis classification and

declarative rule-based entity manipulation. In order to show how these features can be

combined together to serve the desired goals, the operation of semantic bridges has been

explained and the benefits of the developed approach could be pointed out.

By means of a connecting step between theory and experiment, the following chapter maps the

here presented theoretical concept to the concrete application domain of SOA. Therefore, a

dedicated semantic mediation methodology is developed that determines the basic steps relevant

for the application of the concept of semantic mediation to the SOA life-cycle.

107

Chapter 5

Methodology and Functional Architecture

for Semantic Mediation in SOA

5.1 Overview

This chapter presents the developed semantic mediation methodology which maps the concept

of loosely coupled information models to the basic phases of the SOA life-cycle. The relevant

phases of the SOA life-cycle where mediation between heterogeneous information models is

required are identified and it is discussed how the before developed conceptual solution can be

applied.

The chapter starts by refining the general conceptual requirements identified in Section 4.2 with

regard to concrete conditions and implications for the semantic mediation methodology.

Furthermore, domain specific considerations are derived in terms of an actors model describing

the involved stakeholders relevant for semantic mediation in SOA. Subsequently, an overview

of the individual steps of the semantic mediation methodology is given, which is aligned to the

basic phases of the SOA life-cycle. The following sections then discuss the particular

methodology steps in more detail with regard to performed tasks and required functionalities. In

order to prepare an experimental confirmation of key steps of the methodology in terms of a

prototypical semantic mediation toolkit, as well a high-level view on the functional architecture

for each methodological step is derived. Therein, it is distinguished between methodological

steps which can be addressed sufficiently with existing work including respective tools and

steps which cannot be mapped to available functionality. These missing functionalities are then

addressed in the subsequent Chapter 6 presenting the developed prototypical instantiation of the

semantic mediation toolkit based on the in this chapter derived requirements and functional

architectures. Finally, a conclusion and reflection of the chapter is provided.

5.2 Methodology Requirements and Domain-specific

Considerations

The general conceptual requirements for semantic mediation between loosely coupled

information models have been already discussed in Chapter 4 (cf. Section 4.2). They need to be

refined, in order to derive concrete implications for their practical application to the SOA life-

cycle (cf. Section 2.4).

The first conceptual requirement has addressed the reduction of complexity in semantic

integration in terms of separation of concerns. Consequently, the demand for overlapping skills

of involved stakeholders should be reduced. Thus, the semantic mediation methodology should

Chapter 5

108

be based on a dedicated actor model that reflects the different expertise requirements of the

assigned roles. In particular, the responsibilities of the defined roles should be highly disjoint, in

order to minimize the demand for overlapping skills that are a major factor of complexity in

semantic integration.

The second conceptual requirement has dealt with organizational boundaries and limitations in

the scalability of community processes to ensure agreement and standardization of cross-

organizational information models. As the concept of loosely coupled information models is

applied in the methodology, this requirement is already reflected per se and no further

implications for the semantic mediation methodology can be derived. However, the

methodology should address this cross-organizational environment by reflecting that the

involved stakeholders identified in the actor model may origin and operate in different

organizational contexts.

A further conceptual requirement has been the shift of abstraction level in semantic integration.

On the one hand, this requirement is well reflected by the concept of loosely coupled

information models. The semantic mediation mechanism exploits ontology-based conceptual

information models, which are independent from internal more technical representations of

information models. On the other hand, the shift of abstraction level has also been described as

the aim for addressing semantic integration just once on domain level instead of recursively on

application or process level. This is also reflected by the first methodology requirement taken

into account the targeted actor model which should address the different responsibilities

between the roles of domain experts and business process experts. Therefore, besides the

consequent reflection of the shift of abstraction level no further explicit implications for the

semantic mediation methodology can be derived from this conceptual requirement.

Another central conceptual requirement has been the consistency with best practice SOA

methodologies. In particular, this means that the concrete service composition is derived from a

before undertaken business process analysis and business process modeling phase. Accordingly,

the semantic mediation methodology has to be strictly aligned to the SOA life-cycle phases,

whereas these phases where semantic heterogeneity is a challenge need to be covered.

The final conceptual requirement has been the constraint to respect technological path

dependency. This means that the approach should be based on the well-established concepts and

standards of the World Wide Web as it constitutes the dominant IT infrastructure for cross-

organizational interaction. Furthermore, as the semantic mediation concept should be applied to

the realm of SOA, this means more specifically that the approach should rely on the existing

XML-based Web service technologies as standardized by the W3C or OASIS (cf. Section

3.2.2). The on top of this applied Semantic Web technologies therefore should be as well based

on the W3C recommendations (cf. Section 3.4.2).

Domain-specific Considerations

Besides the refined methodology requirements further domain specific considerations from the

realm of SOA can be derived. Therefore, the following domain actor model is defined, in order

to specify the involved stakeholders relevant for the semantic mediation methodology with their

roles, responsibilities and general skill profiles. Often a domain actor model is also referred to

by the term domain model [186]. However, with the focus on domain information models the

term domain model might be misleading and thus in the following just the term domain actor

model is used. Figure 5-1 below illustrates the five identified actors and their interrelations: the

domain information model expert, the business process expert, the service developer, the service

composer and finally the service or process consumer:

Methodology and Functional Architecture for Semantic Mediation in SOA

109

Figure 5-1 Domain Actor Model for Semantic Mediation Methodology

A domain information model expert is responsible for developing an information model

covering a particular domain. Therefore, he needs to have comprehensive general business

knowledge of the information requirements for the interactions of organizations in this domain.

He works on the conceptual level and specifies the domain information model by using

ontology languages. Moreover, domain information model experts from different domains need

to collaborate, in order to define semantic bridges between different domains. The defined

domain information models and corresponding semantic bridges are provided to the business

process experts.

A business process expert is in charge of analyzing and modeling concrete business processes of

an organization. From the here focused perspective of information models, he is particularly

responsible for modeling the information flow within business processes. Accordingly, he needs

to have comprehensive knowledge about the concrete information requirements for the

particular business activity of the organization. Furthermore, the business process expert

contributes to the evolution of domain information models by deriving requirements for missing

concepts necessary for the description of business process information flow. Moreover, in order

to enable cross-organizational business processes, business process experts from different

organizations need to collaborate together. To define the cross-organizational information flow

they make use of the provided semantic bridges to overcome semantic heterogeneities. Finally,

the business process expert also sets the general requirements for the service development by

specifying the individual activities within a business process model.

Chapter 5

110

Accordingly, a service developer is responsible to develop the functionality required for a

particular business process activity and to provide it in terms of a Web service. Therefore, he

needs to have in-depth technical capabilities for Web service implementation and provision.

Furthermore, in order to enable the shift of abstraction level in semantic integration the service

developer enriches the services parameters with concepts defined in the domain information

model and thus builds the required Semantic Web services.

A service composer then selects from the developed and provided Semantic Web services these

candidates which are required to instantiate the business process in terms of a Semantic Web

service composition. On the one hand, the service composer needs to have solid understanding

of the business process to be implemented by the services composition. On the other hand, his

skill profile requires general technical competencies to develop the concrete Semantic Web

service composition with specific tool support. Such tool support particularly exploits the

semantic layer given in terms of semantically enriched service descriptions and semantic

bridges to ensure highly transparent semantic integration for the service composer.

Finally, the actor model contains a service or process consumer. His skill profile also requires

general technical competencies as the concrete business process instantiation needs to be called

in terms of a composite Semantic Web service request. Therefore, the concepts of the domain

information model are used to specify the service parameters.

5.3 Semantic Mediation Aligned to SOA Life-Cycle

Based on the domain actor model the semantic mediation methodology can be defined. The

concept of semantic mediation is mapped to relevant phases of the SOA life-cycle. These

phases of the SOA life-cycle where mediation between heterogeneous information models is

required are identified as steps for the semantic mediation methodology and it is discussed how

the before developed conceptual solution can be applied to provide the aimed additional value.

The following Figure 5-2 presents an overview on the semantic mediation methodology. The

seven steps of the methodology are briefly outlined, whereas the subsequent sections cover the

individual steps in detail.

Methodology and Functional Architecture for Semantic Mediation in SOA

111

Figure 5-2 Semantic Mediation Methodology

The first step of the semantic mediation methodology starts with enabling the shift onto a higher

abstraction level by creating the conceptual information models, which are the central artifact

throughout the whole methodology. Therefore, domain information model experts from each

domain develop autonomously so called domain ontologies. Thereby, the focus is on the

requirements and usage within the respective domain, yet leaving aside any cross-domain

aspects.

In the following the methodology steps are described in a sequential order for the purpose of

providing a better understanding. However, this order should not be regarded too strictly.

Rather, the methodology contains steps which can be also performed in parallel or which feature

mutual feedback relations to enable an iterative approach. Especially, the first four steps can be

performed in an iterative manner. Accordingly, this is highlighted with double arrows and the

possible iterations are mentioned in the descriptions of the individual steps.

The developed domain ontologies provide an input to the next step of the methodology, namely

the mediated business process modeling. Aligned to the central SOA phase of business process

analyses and business process modeling, this step targets the required high-level modeling of

information flow within business processes. Therefore, the business process expert defines a

business-oriented information flow on the conceptual level using the concepts from the before

developed domain ontologies. Taking into account cross-organizational business processes,

heterogeneity between different information models has to be addressed. Therefore, semantic

bridges focused in the subsequent methodology step are required to identify and describe

Chapter 5

112

information flow across heterogeneous information representations. Moreover, coming from the

perspective of agile development and continuous maintenance, information models and

correspondingly semantic bridges between them need to evolve over time. According to

process-orientation this evolution should be driven by requirements derived from business

processes. Consequently, mediated business process modeling not only includes the exploitation

of domain ontologies and semantic bridges but also provides specific features for their

requirement engineering during business process modeling.

Having already identified certain requirements for semantic bridges during the modeling phase

of cross-organizational business processes the dedicated methodology step referred to as

semantic bridge definition covers their comprehensive identification and specification. For this

step domain information model experts from different domains need to collaboratively develop

semantic bridges based on (tool-supported) ontology mappings (cf. 3.6.2).

The roles of semantic bridge developers in terms of domain information model experts on the

one hand and semantic bridge users in terms of business process experts and service composers

on the other hand are carried out by different actors. They might be even mutually unknown in

cross-organizational SOA landscapes; which makes the consideration of trust in the quality of

the developed semantic bridges an essential factor. Therefore, the third step of the semantic

mediation methodology focuses on the evaluation and particularly the testing of semantic

bridges. As it can be shown that in principle no formal verification of the correctness of

semantic bridges can be provided, the focus is put on how to apply concepts from software

testing to the systematic testing of semantic bridges and the underlying ontology mappings.

After having provided the foundation for addressing semantic interoperability on a higher

abstraction level by means of process-oriented domain ontologies and quality assessed semantic

bridges between them, the next methodology step is to lift the service descriptions as well on the

higher conceptual abstraction level. Therefore, the service developer, responsible for developing

Web services that provide the corresponding functionality required by the individual business

process activities, also has to provide the semantic service enrichment. This means that the

service developer enriches the service description with the concepts defined in the respective

domain information model, in order to lift the traditional XML-based Web service description

to a so called Semantic Web service description (cf. Section 3.4).

In the following methodology step the semantic service composer combines the before provided

Semantic Web services to a mediated service composition. The composition instantiates the

previously developed cross-organizational business process model by defining a concrete so

called Semantic Web service orchestration plan that can be executed in a process engine. In

order to achieve a high level of automation, the semantically enriched service descriptions can

be exploited for service selection based on semantic matchmaking. Furthermore, the additional

semantic layer is utilized to define the concrete information flow between the involved service

input and output parameters. Taking into account that the services are described with concepts

from heterogeneous information models, the before provided semantic bridges are used to

enable a seamless design of this concrete information flow, whereas semantic integration is

handled transparently.

The final methodology step deals with the runtime execution of the Semantic Web service

orchestration. Thereby, well established standards for workflow execution are considered. On

this regard especially the de-facto industry standard BPEL is addressed, which as according to

traditional Web service technology relies on the XML meta-data model. Hence, the challenge to

work consistently on the different meta-data models needs to be reflected during runtime and

consequently Semantic Web technology has to be incorporated into BPEL-based process

runtime middleware. Another challenge thereby lies in assuring a reasonable performance

Methodology and Functional Architecture for Semantic Mediation in SOA

113

during the rule-based inferencing process, which still often remains a bottleneck of Semantic

Web technology.

Usually, after process execution a subsequent phase dealing with process monitoring and

process optimization follows, in order to iteratively close the SOA life-cycle and derive new

requirements for process evolution. However, this phase has not been covered with a particular

semantic mediation methodology step. In fact, the analysis in terms of monitoring and

optimization of process data flow combined with heterogeneous information from other data

sources within or across organizations requires semantic interoperability, too. However, in order

to operationalize such analytical processes as well a service-oriented approach would be

followed and consequently a re-instantiation of the seven described methodology steps is

required.

Moreover, other possible phases of the SOA life-cycle have not been addressed specifically in

the methodology. This includes e.g. dynamic service discovery or dynamic service negotiation,

ranking or contracting. Even though such approaches have shown first results in academia (cf.

Section 3.4.3) they do not match adequately with current SOA best-practices. Thus, they have

not been focused in this work according to the requirements set for the general approach as well

as for the semantic mediation methodology.

5.4 Domain Ontology Development

This section describes the semantic mediation methodology step of domain ontology

development and the existing work in terms of methods and tools which can be applied to

perform this methodology step.

5.4.1 Goals and Tasks

Ontology development is an active field of research. It ranges from the usage of ontologies to

ensure a common understanding among human actors to information processing, whereas

ontologies provide the means upon which semantic reasoning techniques can be applied for

higher degree of automation (cf. Section 3.3.1). Accordingly, the major merit of ontologies is

that they can bridge the gap between the real world and IT systems [187].

An important question regarding the concept of loosely coupled information models and

semantic mediation on domain level targets the adequate scoping of domains. The question may

arise from two perspectives. On the one hand, stakeholders who intend to cooperate in cross-

organizational business processes may face defragmented and non-integrated information

models. Therefore, they need to define which parts can be aligned to canonical information

models based on an ontology or where mediation between independent domain ontologies is the

more feasible approach. On the other hand, stakeholders who are already engaged in cross-

organizational business processes and share an aligned complex information model may face the

lack of ability for efficient and effective maintenance and evolution. In this situation they need

to define how the overall information model can be broken down into loosely coupled domain

ontologies to ensure more mutual independence and adequate reaction to local requirements.

From both perspectives the way to determine the adequate scope is inherently dependent on the

specific scenario. Thus, this question cannot be covered exclusively within the scope of this

work, which targets a general approach for achieving semantic interoperability in SOA

independent from any particular application scenario. However, it can be stated that the scope of

Chapter 5

114

the domains should correlate with existing organizational structures as the consensus finding

process requires a certain degree of organizational binding (cf. Section 4.4.4). The following

Figure 5-3 Scoping of Domain Ontologies Aligned to Organizational Structures illustrates this

correlation:

Figure 5-3 Scoping of Domain Ontologies Aligned to Organizational Structures

In one case different organizations may exhibit organizational ties in terms of an umbrella

organization or a well-integrated business association. Such an environment can be an enabler

for a consensus finding process leading to a shared canonical information model. In this case

one domain may cover several organizations that develop collaboratively their domain

ontology. In another case one domain may just comprise a single organization. Even one large

organization may be divided into different domains to address the possibly divergent

requirements from different departments or business units on information models.

At the end, in all cases the domain information model experts define a conceptual model of the

identified domain that represents an agreed consensus among the involved parties within this

domain. However, the domain ontologies should not be developed in terms of a greenfield

approach. Usually, at least parts of domain information models are already available in terms of

conceptual e.g. UML or ER models or as well in terms of database schemas or XML-schema

representations. Therefore, the main focus of domain ontology development is not to reinvent

existing information models but to lift existing parts to an ontology level and ensure a consistent

and high quality conceptualization backed by formal logics. Moreover, taking into account

iterative development and continuous maintenance, domain information models and their

representation in terms of domain ontologies need to evolve over time. According to process-

orientation this evolution should be driven by requirements derived from business processes.

Hence, domain information models can be developed in a so called top-down and bottom-up

mixed approach, which has been presented in [188]. This leads already to the next methodology

step about mediated business process modeling, which is presented in detail in the following

section covering as well the process-oriented requirement (re-)engineering of domain

information models.

Methodology and Functional Architecture for Semantic Mediation in SOA

115

5.4.2 Existing Work

As having stated above, ontology development is an active research field and consequently an

impressive amount of work is available ranging from concepts and methodologies on how to

structure the development process to concrete tools and ontology editors. A general introduction

to ontology development is given in [189]. A specific approach and methodology addressing the

cross-organizational nature in terms of collaborative ontology development is presented in

[190]. Concrete tools are available in terms of the mainly academic-driven Protégé ontology

editor, which provides comprehensive functionality and visualization as illustrated in Figure

5-4. Within the last years as well commercial tools such as SemanticWorks [192] have become

available, which demonstrates the maturity and first industry adoption within the field of

ontology development and usage. Taking into account this stage of existing work and available

tools the here presented step of the semantic mediation methodology about domain ontology

development can be instantiated with existing approaches and tools. Therefore, no further

specification of required functionality for this step needs to be provided.

Figure 5-4 Protégé Ontology Editor [191]

5.5 Mediated Business Process Modeling

This section describes the semantic mediation methodology step of mediated business process

modeling and the required functional architecture of a tool to perform this methodology step.

The concrete realization of this functionality is then further described in context of the

developed semantic mediation toolkit in Section 6.3.

5.5.1 Goals and Tasks

As discussed in Section 5.3, the SOA life-cycle starts from the business perspective on how

processes can be supported by IT systems. In this context, taking into account cross-

organizational business processes, the challenge of heterogeneous information models also

affects the design phase of business process modeling and in particular the definition of

information flow. Usually, business process experts use business-oriented high-level

descriptions of information entities which are non-formal and non-technical to define the

Chapter 5

116

general information flow in business processes. However, in cross-organizational business

processes e.g. the usage of mismatching terms for semantically equal information entities can

hinder the sound design of information flow across organizational borders. Moreover, the non-

formal nature increases the so called business-IT gap as the used terms are not explicitly linked

to already existing information or data models of the organizations. This causes additional

efforts and iterations for aligning the top-down requirements-driven business perspective with

the bottom-up IT perspective focusing on reuse of existing assets.

Whereas the previous methodology step has provided the foundation for semantic mediation on

a higher abstraction level, the goal of this step is to exploit the ontology-based information

models for the sound design of information flow in business process models. The idea is that

already during business process modeling the before developed ontology concepts are used to

define the information flow on a non-technical conceptual level suitable for business process

experts. However, due to the formal nature of ontology-based information models a consistent

link between the business or conceptual level and the underlying technical information or data

models can be derived.

Furthermore, having the formal domain information models at hand facilitates the mediation

between heterogeneous conceptualizations by different organizations by integrating the

developed mechanism of semantic bridges. Thus, the business process expert is enabled to

seamlessly design the cross-organizational information flow, whereas semantic heterogeneities

can be handled transparently based on semantic technology-based tool support.

Moreover, taking into account as well the perspective of agile development and continuous

maintenance, domain information models and correspondingly semantic bridges between them

need to evolve over time. According to process-orientation this evolution should be driven by

requirements derived from business processes. Consequently, mediated business process

modeling not only includes the exploitation of domain ontologies and semantic bridges as

described above but also should provide specific features for their requirement engineering

during business process modeling. This demand-driven evolution includes for example the

possibility for the process expert to specify information entities which are not already reflected

within the ontology-based domain information model as well to identify missing semantic

mapping rules between information entities of different domain information models not

reflected in the available set of semantic bridges.

The following Figure 5-5 illustrates the three major tasks for the business process expert during

mediated business process modeling:

Methodology and Functional Architecture for Semantic Mediation in SOA

117

Figure 5-5 Semantic Mediated Business Process Modeling

The first task deals with the semantic annotation of information flow in business processes

within one organizational domain. The business process expert defines a business-oriented

information flow on the conceptual level using concepts from the before developed domain

ontology to annotate the non-formalized information entities and thus shifts them onto the

higher semantically explicit level. This task requires a generic extension of the business process

modeling notation enabling to visualize the higher expression level in terms of semantic sub-

graph of information entities in contrast to flat representations provided in current modeling

notations. However, it is important to note that during this task each organization just has to use

their specific domain ontology without any dependencies to other domains.

Then, in the second task semantic bridges are applied to the ontology-based information entities,

in order to obtain polymorph information entities and thus overcome semantic heterogeneities

between the different domain ontologies. In particular, semantic mediation based on semantic

bridges can be exploited to suggest matching information entities in process parts of different

organizations. This enables seamless information flow design keeping information

representation differences transparent for the process expert.

The third and last task focuses on the process-oriented evolution of information models and

semantic bridges. Missing information entities and semantic bridges required for the

information flow in the concrete business process but not already reflected in the existing

domain information models can be specified by the process expert. Then, in a further external

step they can be defined by domain information model experts in terms of iterative and demand-

driven development. Consequently, these evolutionary developed ontologies and semantic

bridges can be further utilized for semantic annotation and mediation of the information flow

and thus close the so to say micro-life-cycle of mediated business process modeling.

Even though several approaches for integrating semantic technologies into business process

modeling exist (cf. Section 5.5.3), the presented approach for mediated business process

Chapter 5

118

modeling requires a dedicated solution that supports semantic mediation based on semantic

bridges. Taking into account that the field of business process modeling is well covered by

mature industry tools and products, the required tool-based realization of the methodology step

of mediated business process modeling should reflect such existing work. Therefore, the

following section discusses a possible functional architecture of a tool as an extension of a state-

of-the-art business process modeling tool with integrated features for semantic mediation during

information flow design. The concrete realization of this functionality is then further described

in context of the developed semantic mediation toolkit in Section 6.3.

5.5.2 Functional Architecture

In this section the functional architecture for the mediated business process modeling tool is

presented and explained. As the tool should be realized as an extension of a state-of-the-art

business process modeling tool, the architecture has to incorporate an abstraction of it, in order

to remain independent of any concrete tool or product. Accordingly, based on the goals and

tasks described in the previous section a systematic use case analysis has been performed, which

is described in detail in [193]. Based on this requirements engineering the following functional

architecture has been developed which provides an overview of the main functional components

and how they interact with each other:

Figure 5-6 Functional Architecture Mediated Business Process Modeling

The resulting architecture is based on state-of-the-art functionality for business process

modeling. On top of this bottom layer concerned with business process modeling two more

layers are added, which are enabled by means of Semantic Web technologies provided by the

vertically shown Semantic Web framework. The first additional layer provides the semantic

annotation of information entities within the business process information flow. The second

additional layer then contains the functionality for semantic mediation based on polymorph

information entities to facilitate sound design of cross-organizational information flow. A

further vertical layer named semantic pool provides complementary functionality to the before

presented layers in terms of management functionality for utilized ontologies and semantic

bridges. In the following these five basic components are described in more detail:

Business Process Modeling: Provides state-of-the-art business process modeling functionality

based on a visual editor for modeling domains, actors, events, activities and information flow,

Methodology and Functional Architecture for Semantic Mediation in SOA

119

whereas the functionality should be abstract from any concrete modeling notation such as

BPMN [194] or EPC [195] as multiple notation languages should be supported.

Semantic Annotation: Provides the functionality to annotate basic information entities in

cross-organizational process models with concepts of domain ontologies. Further functionality

is provided to advance the semantic annotation in terms of process-oriented requirement

engineering for extensions of domain ontologies. In particular, missing information entities or

properties of them as well as semantic bridges or missing individual mappings within them

required for the process information flow can be specified by the process expert. Finally, the

higher semantically explicit level of information entities should be visualized in the business

process modeling notation. Therefore, a generic extension is provided that highlights the

semantic sub-graph of information entities in contrast to flat representations provided in current

modeling notations. For example a data object Partner modeled in BPMN can be annotated

with a domain ontology concept describing a Partner containing three properties. Accordingly,

the three properties can be visualized in the extended notation. The following Figure 5-7

illustrates this generic semantic extension of business process modeling notations:

Figure 5-7 Semantic Extension of Business Process Modeling Notation

Semantic Mediation: This layer features the functionality to apply semantic bridges in order to

get polymorph information entities that enable seamless information flow across differently

conceptualized domains. Furthermore, by exploiting the before described components a

matching functionality is provided that matches corresponding information entities that are

related to the same meaning but are expressed in different representations. Moreover,

visualization of semantic bridges is provided in order to integrate the mediated information flow

in the business process modeling notation. Finally, according to the above described approach

of process-oriented evolution of domain information models and semantic bridges (cf. Section

5.5.1), functionality is provided to specify requirements for the advancement of semantic

bridges such as a further mapping between as corresponding identified concepts or properties

within different organizational domains.

Semantic Pool: The Semantic pool lies vertically to the above described layers and provides

support functionality for semantic annotation and mediation. On the one hand, it provides a

repository to handle and manage domain ontologies to be used during annotation of information

flow. On the other hand, the analog functionality is provided to manage the used semantic

bridges including import, export, create and versioning operations.

Semantic Web Framework: Finally, the vertically located Semantic Web framework provides

the required generic semantic technologies. These includes firstly an ontology reasoner to

Chapter 5

120

process the domain ontologies and secondly an integrated rule-engine to perform the semantic

bridges, which are based on ontology mapping rules.

5.5.3 Related Work

Several approaches for integrating semantic technologies into business process modeling exist.

The basic idea is to utilize formal semantic languages to alleviate the so called Business-IT-gap

originating from the different perspectives of business requirements on the one hand side and

existing IT systems and resources on the other hand side. In [196] a comprehensive introduction

and framework for semantic business process modeling is provided. However, due to the

relative novelty of the research field, the available approaches are mostly academic-driven.

Nevertheless, some early industry adoptions exist such as SemTalk [197], which is based on the

Microsoft Visio [198] modeling and visualization tool. However, this approach does not address

the problem of semantic interoperability and works in a single or homogeneous ontology

environment.

The most prominent academic example is the European integrated research project SUPER

(Semantic Utilized for Process Management within and between Enterprises) [199]. The

SUPER project defines a semantic business process management cycle [200] including semantic

business process (SBP) analysis, SBP modeling, SBP implementation and SBP execution.

Thereby, each phase is based on a so called ontological foundation, i.e. artifacts (e.g. events,

activities or data objects) which are created or processed during business process management

are consequently enriched with ontology concepts.

However, in contrast to the presented approach of semantic mediation in this work, the SUPER

project relies on a single central ontology called business process modeling ontology (BPMO).

Thereby, the approach and framework of the Web Service modeling Ontology (WSMO) is

integrated (cf. Section 3.4.2), whereas e.g. business process activities are specified via goals,

which later can be used to infer matching Web services that provide the designated

functionality. Although these approaches are related to this work their focus is on ontology-

based annotation of process steps, in order to improve process management and search in

process repositories in a homogenous ontology environment. In contrast the research presented

in this methodology step aims at mediating between different information models in cross-

organizational design of BPM information flow and thus focuses on heterogeneous ontology

environments. The SUPER project follows a more general direction towards semantic

technology driven alignment between business and IT perspectives focusing on a mediation

between these two spheres, whereas the in this work addressed issue of semantic

interoperability in cross-organizational scenarios is not focused. The scope of the SUPER

project is illustrated in the following Figure 5-8:

Methodology and Functional Architecture for Semantic Mediation in SOA

121

Figure 5-8 Mediation between Business and IT Perspective [199]

5.6 Semantic Bridge Definition

This section describes the semantic mediation methodology step of semantic bridge definition

and the existing work in terms of methods and tools which can be applied to perform this

methodology step.

5.6.1 Goals and Tasks

Having already identified process-oriented requirements for semantic bridges during mediated

business process modeling, this methodology step covers their comprehensive identification and

specification. Semantic bridges are the basis for semantic mediation between the different

representations of semantically corresponding concepts in different domains. A detailed

elaboration of the basic idea and concepts for ontology mapping as the underlying technology

for semantic bridges has already been provided in Section 3.6.2. Taken into account the analysis

there and the dedicated actor model for this methodology, domain information model experts

from need to collaborate to ensure the required cross-domain knowledge and target the two

identified tasks (cf. Section 3.6.2):

 Mapping Discovery – Identify the individual mappings between corresponding

concepts, which are the continuing parts of a semantic bridge.

 Mapping Representation – Express the semantic bridge as a set of description logic

based rules as defined in Section 4.8.

Similar to the development of domain ontologies, a hybrid so called top-down/bottom-up

approach is followed for identifying the individual mappings. Therefore, in order to discover

potential mappings one way is to start from what is already there in terms of analyzing the two

ontologies to be semantically mapped and to detect correspondences between similar concepts.

On the other hand a starting point can be the top-down identified requirements for semantic

bridges derived during mediated business process modeling as described in the previous Section

5.5. The following Figure 5-9 illustrates these two complementary ways:

Chapter 5

122

Figure 5-9 Big Picture Semantic Bridge Definition [201]

The (semi-)automatic mapping discovery (cf. 1a) in Figure 5-9) can be based on multiple e.g.

lexical or structural matching algorithms calculating a similarity measure between each two

concepts from different ontologies. Such a similarity measure-based mapping discovery is

exemplarily described in [201]. As manual mapping discovery can be a very complex and a

time-consuming task, the application of tool support for automatic or semi-automatic mapping

discovery is very useful. As already discussed in Section 3.6.2 encouraging results are obtained,

however this problem is by no means solved and automatically obtained results are not yet good

enough in terms of recall and precision [202]. It has turned out that most approaches still require

human intervention to generate sufficient results and thus need to be considered as semi-

automatic. Indeed, for finding the correspondences between concepts, it is necessary to

understand their meaning. Besides the represented meaning described by model-theoretic

semantics, the ultimate meaning of concepts is only in the head of the people who developed

those concepts and accordingly the final matching decision can only be performed by them

[203].

On the other hand corresponding concepts from different domain ontologies are identified

during cross-organizational business process modeling in a requirements-driven manner (cf. 1b)

in Figure 5-9). This approach ensures as well that semantic bridges do not need to be

exhaustively engineered but just to the extent of actual needs derived from mediated business

process modeling as described in Section 5.5.

Finally, the identified mappings need to be expressed according to the developed approach for

realizing semantic bridges by means of description logic rules to infer polymorph

representations of concept instances (cf. 2) in Figure 5-9). In particular, the transformation of

the different formalized semantic sub-graphs need to be defined as semi-automatic (1a) or

requirements-driven mapping discovery approaches just determine which concepts are

corresponding but not how they can be transformed into each other by means of declarative

Methodology and Functional Architecture for Semantic Mediation in SOA

123

rule-based entity manipulation. The following demonstrates this task to be fulfilled by rule-

based semantic bridge definition:

Figure 5-10 Required Entity Manipulation between Different Semantic Sub-Graphs

According to the chosen example already presented in Section 4.8.4 for example an object

property Name of a concept Address is represented in terms of two sub-properties Given Name

and Surname. In contrast in a semantically equal concept PostalAddress these two values are

aggregated in the property Recipient. As the developed semantic bridge approach utilizes

forward-chaining rules to express the transformation of the different semantic sub-graphs (cf.

Section 4.8) semantic bridges from the first ontology to the second and vice versa have to be

defined in order to obtain a bidirectional mapping as illustrated at 2) in Figure 5-9. Since the

definition of such declarative mapping rules is a complex and time-consuming process, the

application of graphical tool support to assists the domain ontology experts with this task is

reasonable as presented in the next section.

5.6.2 Existing Work

As having outlined above and discussed in Section 3.6.2 ontology mapping discovery is an

active research field and several heuristic-based and machine-learning approaches for (semi-)

automatic mapping discovery have been proposed. To some extent they are similar to

approaches for mapping of database schemas and XML structures, since they also use lexical

and structural measures to determine correspondences [124]. But since ontology languages

allow for more expressiveness, these approaches furthermore exploit features in ontology

definitions in order to determine ontology mappings [124]:

 concept names and additional natural-language descriptions

 class hierarchy (subclass–superclass relationships)

 class definitions (equality relationships, defined classes, unions)

 property definitions (domains, ranges, restrictions)

 instances of classes

An introduction and overview of existing approaches is given in [204]. Furthermore, a

comprehensive survey of available tools for semi-automatic mapping discovery from a user

perspective is provided in [205].

As described above the second task in semantic bridge definition is the expression of ontology

mappings. Several general approaches namely utilizing views and queries, mapping ontologies

and bridging axioms have been discussed in Section 3.6.2. As the requirements set by the

Chapter 5

124

concept of semantic mediation between domain ontologies can be most effectively addressed by

rule-based bridging axioms (cf. Section 4.8.5), the instantiation of this mechanism requires a

concrete declarative rule language. In particular, the developed semantic bridge approach can be

instantiated by means of forward-chaining rules expressed in the Semantic Web Rule Language

(SWRL).

SWRL can be used not only to reason about OWL individuals and to infer new knowledge (cf.

Section 3.3.2), but also to define entity manipulations describing mappings between ontologies.

If an SWRL rule contains entities of a source ontology in its antecedent (so called body of the

rule) and entities of a target ontology in its consequent (so called head of the rule), then a rule

can be interpreted as a mapping rule. Consequently, upon its execution instances of a source

ontology will be transformed into polymorph instances, which belong to two classes and have

properties of both source and target ontologies (cf. Section 4.8.4).

A major benefit of applying SWRL rules is the opportunity to directly integrate the rule

execution into the ontology inference process. As the goal of SWRL is to be a rule language for

the Semantic Web [206], it is well integrated with the major ontology language standard OWL

(cf. Section 3.3.2). Hence, if ontology mappings are expressed with rules, then the execution of

mappings, i.e. transformation of instances from source ontology into target ontology can be

performed with inference engines already in use for OWL-based ontology reasoning.

Furthermore, SWRL constructs contain a set of predefined so called built-in functions, which

enable highly expressive transformations and mappings e.g. performing mathematical

operations, operations with strings, lists, etc. [207].

An additional benefit of using SWRL rules lies in the fact that it is relatively easy and intuitive

for users to create rules for example by using a graphical editor. While rule users can express

declaratively and visually what the result should be, it is left to the rule inferencing engine to

process the desired result. One example to model SWRL rule-based ontology mappings is the

graphical editor Snoggle [208]. Snoggle provides means to graphically create rule-based

mappings in terms of a flexible set of functionality that allows to support various kinds of rule-

based mapping approaches. As well the in this work developed approach for semantic bridges -

exploiting in particularly the ontology feature of polymorphism - can be addressed with the

Snoggle editor. Snoggle provides a canvas that displays entities from a source ontology and

their relating entities from the target ontology. Correspondences between the entities of the two

ontologies are visualised as arrows from the left region of the canvas to the right one as

illustrated in following Figure 5-11:

Figure 5-11 Graphical representation of a mapping rule in Snoggle [209]

Methodology and Functional Architecture for Semantic Mediation in SOA

125

Consequently, it has to be taken into account that a remarkable amount of existing work is

available in terms of methods and tools for both fields: namely similarity-based mapping

discovery and rule-based expression of semantic bridges. Therefore, it can be stated the no

further functionality besides the already described requirements-driven mapping discovery (cf.

Section 5.5) is required to enable the step of semantic bridge definition of the semantic

mediation methodology.

5.7 Semantic Bridge Testing

This section describes the semantic mediation methodology step of semantic bridge testing and

the required functional architecture of a tool to perform this methodology step. The concrete

realization of this functionality is then further described in context of the developed semantic

mediation toolkit in Section 6.3.

5.7.1 Goals and Tasks

As discussed in the previous section the development of semantic bridges is a complex and

error-prone process. Moreover, the roles of semantic bridge developers in terms of domain

information model experts on the one hand and semantic bridge users in terms of business

process experts and service composers on the other hand are carried out by different actors.

They even might be mutually unknown in cross-organizational SOA landscapes. Therefore, the

consideration of trust in the quality of the developed semantic bridges is an essential factor. It

can be shown that in principle no formal verification of the correctness of semantic bridges can

be provided. Finally, semantic mappings between information entities representing the same

meaning can only be determined by a human interpreter (cf. Section 4.5.2).

Therefore, the third step of the semantic mediation methodology focuses on the evaluation and

particularly on the testing of semantic bridges. Focus should be put on how to adapt well

established concepts from software testing as well as concepts for testing of ontologies, rule

bases and XSL transformations to the systematic testing of semantic bridges and the underlying

ontology mappings.

In this sense the evaluation of correctness can be understood in the way it is used in software

testing: as the degree to which the system performs its intended function [210]. Applied to

semantic bridges, evaluation of correctness can be interpreted as the degree to which the

underlying ontology mapping performs transformations of ontology instances into polymorph

representations as intended. Consequently, in order to quantify the evaluation, a measure for this

degree should be provided as well as a measure for the coverage of a particular test with regard

to the overall scope of a semantic bridge. The latter point is particularly relevant as in principle

it is impossible to test a program exhaustively. This is due to the fact that testing cannot take

into account all potential and partly unknown test inputs, execution conditions and their

combinations [211]. That is why during testing consequently only a part of all possible test

cases can be taken into account. Furthermore, in order to not only support black-box testing2 but

2 In the black box approach, a tester treats software-under-test as a black box, which means that he has no

information about the structure of the program and no program code [210].

Chapter 5

126

also certain aspects of so called white-box testing3, the concrete identification and localization

of defects in semantic bridges should be supported.

One fundamental concept, which can be derived from related research fields for testing as

discussed above, is the method of testing with a predefined set of test cases. The basic idea is

that during test execution a functional component is executed under certain conditions with a

particular input. In order to determine if the test was successful or failed, a tester has to define

the expected output of the test and compare it with the actual output produced during test

execution. All this information is included in so called test cases. In particular a test case

consists of the following parts [210]:

 Test inputs are data, which is passed functional components under test as input and

which comes from external sources.

 Execution conditions are conditions, under which tested functional components should

be executed.

 Expected outputs are the specified outputs that are supposed to be returned by the

functional component under test.

Transferred to testing of semantic bridges this requires the following tasks: On the one hand for

domain information model experts, who develop semantic bridges and on the other hand for

semantic bridge users, i.e. business process experts and service composers, who have to trust in

the quality and correctness of ontology mappings they use.

 Define semantic bridge test cases – Define exemplary instances of concepts from the

source ontology as test inputs and expected polymorph instances as expected test

outputs expressed as well according to concepts from the target ontology.

 Perform semantic bridge test cases – Provide the execution conditions in terms of

source ontology, target ontology and an execution framework for the developed

semantic bridge between them and run the test cases.

 Evaluate semantic bridge test cases – Provide an evaluation of the correctness regarding

the performed test case based on a gap analysis between intended results defined in the

test cases and actual results received after semantic bridge execution.

Accordingly, the basic idea of semantic bridge testing can be illustrated in the following

diagram:

3 In the white box testing approach, the inner structure of software-under-test is considered during the

development of test cases. Hence, program code should be available to a tester [210].

Methodology and Functional Architecture for Semantic Mediation in SOA

127

Figure 5-12 Basic Idea of Semantic Bridge Testing

The main inputs for semantic bridge testing are source and target ontologies and the semantic

bridge between them. They are referenced in a test project which further consists of several test

cases, in order to achieve a high coverage of the ontology mapping rules within the semantic

bridge. After test execution the test report evaluates the overall test coverage and provides the

gap analysis between the expected test output instances from the test cases with the actual

individual mapping results.

Even though first tools e.g. for modeling rule-based ontology mappings such as Snoggle (cf.

Section 5.6) are available, the research field is relatively new and thus existing tool support is

very limited. Several approaches for testing in related areas have been investigated including

ontology testing, testing of rule bases, testing and debugging of XSLT stylesheets and finally

testing of ontology mappings (cf. Section 5.7.3). However, until now no dedicated testing

methods and tools suitable for rule-based ontology mappings as exploited in the developed

semantic bridge approach have been developed. Accordingly, the following section discusses a

possible functional architecture of a tool for testing semantic bridges, whereas the concrete

realization of the functionality is then further described in context of the developed semantic

mediation toolkit in Section 6.3.

5.7.2 Functional Architecture

The main architecture style required for providing the functionality discussed above should

follow the Model-View-Controller (MVC) [212] architecture style, in order to support the

interactive process of semantic bridge testing. According to the MVC architecture style three

types of functional components can be mapped to the requirements of semantic bridge testing:

 Models, which maintain the core information assets including internal representations

of source and target ontology, semantic bridges, test projects and contained test cases

as well as the test report to be generated.

 Views, providing the GUI enabling the domain information model expert and

modeller of business processes or service compositions to define the test cases and

investigate the rule-based ontology mappings under test. Thereby, different

perspectives can be provided containing an interactive GUI for creating use cases,

Chapter 5

128

testing and debugging semantic bridges and a static report based on the maintained

models.

 Controller, which controls the interactions of a user and provides the business logic,

i.e. the specific functionality for semantic bridge testing based on the maintained

models.

Accordingly, the following Figure 5-13 illustrates the basic logical components of the functional

architecture for semantic bridge testing including the functionalities required for the developed

semantic bridge approach based on ontology languages and exploitation of declarative rules:

Figure 5-13 Functional Architecture of Semantic Bridge Testing Tool

The main functional components deal with the lifecycle of test cases and particularly their

execution and evaluation in terms of a test report. Upon execution of semantic bridge test cases,

ontology mapping rules are executed, as it has been described in Section 4.8.4 about the

operation of semantic bridges. After semantic bridge test cases have been executed and mapping

results have been created, they have to be compared with the expected test output instances.

Thereby, the comparison should be performed according to the following qualities:

 If all statements4 describing expected test output instances are also contained in the

ontology mapping result or can be inferred in the ontology mapping result, the test case

should be considered as succeeded.

 If some of the statements describing expected test output instances are not contained in

the ontology mapping result and cannot be inferred as well, the test case should be

considered as failed.

Furthermore, consistency checks should be performed. The semantic bridges are tested not only

with respect to the expected test outputs, but also with respect to the involved ontology

definitions as ontology mapping rules can also produce instances that can be inconsistent with

the definition of source and target ontologies. Hence, it is reasonable to use consistency checks

which are based e.g. on cardinality constraints, disjointness constraints, etc. in order to evaluate

the overall correctness of semantic bridge execution or to detect defects in their underlying

4 triple relations between subject, property and object in instance descriptions (cf. Section 3.3.1)

Methodology and Functional Architecture for Semantic Mediation in SOA

129

ontology mappings. However, consistency check results should not affect whether a particular

test case succeeds or fails, since success or failure is determined only on the basis of comparison

of expected test output instances with actual ontology mapping results. Rather, if a test case has

succeeded, but the mapping result is not consistent with the source or target ontology

definitions, then the test report should contain a warning about inconsistency. Accordingly, in

the test report the evaluation can be quantified in terms of listing how many test cases have been

succeeded or failed as well as informing about the number of consistency issues.

Moreover, in order to quantify the overall quality or expressiveness of the test project, the

coverage of all test cases regarding the complete rule set of a semantic bridge should be

calculated. This approach can be transferred from the context of testing rule bases as described

in [213]. Consequently, the overall test case coverage of a semantic bridge can be calculated as

follows:

Hence a test report can be generated that contains the above discussed information. If any test

case has failed, then it can serve as an indication that the semantic bridge is not completely

correct and most probably contains defects. However, it is important to note that one the other

hand if all the test cases in test project succeed, it neither proves that the semantic bridge is

completely correct nor that is contains no defects. Nevertheless, a set of test cases, which have

been well chosen and defined and which have succeeded during test case execution including a

high degree of overall test case coverage, can demonstrate that a semantic bridge performs

ontology instance transformations the way it is expected and hence it can increase trust in the

tested semantic bridges.

5.7.3 Related Work

Even though first tools e.g. for modeling rule-based ontology mappings such as Snoggle (cf.

Section 5.6) are available the research field is relatively new. Several approaches for testing in

related areas have been investigated including testing ontology testing (e.g. [214], [215], [216],

[217]), testing of rule bases (e.g. [218], [215]), testing and debugging of XSL transformations

(e.g. [219], [220]) and finally testing of ontology mappings (e.g. [221]). Key concepts such as

the test case-driven approach, white-box and black box testing as well as test coverage

quantification could be transferred to semantic bridge testing. However, the analysis has turned

out that until now no dedicated testing methods and tools for rule-based ontology mappings as

required in the developed semantic bridge approach are available. Accordingly, the above

section has discussed a possible functional architecture, in order to enable the methodology step

of semantic bridge testing. The concrete realization of this functionality is then further described

in context of the developed semantic mediation toolkit in Section 6.3.

5.8 Semantic Service Enrichment

This section describes the semantic mediation methodology step of semantic service enrichment

and the existing work in terms of methods and tools which can be applied to perform this

methodology step.

Chapter 5

130

5.8.1 Goals and Tasks

After having provided the foundation for addressing semantic interoperability on a higher

abstraction level by means of ontology-based domain ontologies and quality assessed semantic

bridges between them the next step is to lift the service descriptions as well on the higher

conceptual abstraction level. In order to leverage the developed semantic bridges, the existing

Web services of an SOA landscape need to be enriched with the additional semantic layer. As

already discussed in Section 3.4, the basic idea of Semantic Web services is to wrap existing

WSDL-based Web services and XML Schema based message formats with explicit semantics in

terms of concepts from domain ontologies. Combined with specific upper ontologies for Web

services (cf. Section 3.4.2), which facilitate the machine interpretation of generic service

capabilities, so called Semantic Web services can be realized. The following Figure 5-14

illustrates this basic idea of Semantic Web service enrichment:

Figure 5-14 Basic Idea of Semantic Web Service Enrichment

Accordingly, the Semantic Web Services are an extension of existing Web Services

technologies and standards. Therefore, the service developer, responsible for developing Web

services that provide the functionality required by the individual business process activities, also

provides the semantic service enrichment. In particular, this includes the following tasks:

 Annotate service with domain concepts – Select concepts from the domain ontology to

describe service input and output message parameters.

 Define lowering – Provide a translation for message parameter instances expressed

according to domain ontology concepts to input parameter instances expressed according to

XML Schema used by the underlying traditional Web service.

 Define lifting – Provide the reverse translation, i.e. from XML Schema-based Web service

outputs to instances expressed according to domain ontology concepts used to describe the

output of the Semantic Web service.

 Aggregate artifacts to formal Semantic Web service description – Provide annotations,

lifting and lowering definitions in a formal service ontology language defined to express

Semantic Web service, e.g. OWL-S (cf. Section 3.4.2)

 Ensure Semantic Web service interpretation and execution – Ensure that appropriate

service execution engines are available in the targeted SOA landscape that enable the

interpretation and execution of the developed Semantic Web services

Methodology and Functional Architecture for Semantic Mediation in SOA

131

It can be noticed that in the above outlined description of necessary tasks for the methodology

step of Semantic service enrichment no point is dedicated to the specification of formal

descriptions of service goals, preconditions and postconditions as discussed in context of

Semantic Web service descriptions in Section 3.4.2. However, as already stated in the

evaluation of different Semantic Web service approaches (cf. Section 3.4.3) the approach

followed in this work does not cover goal-based plan creation. The approach developed in this

work intentionally leaves the planning task (i.e. which services to include at which part into the

composition) to the business domain process expert, in order to remain compliant to the well-

established SOA life cycle. A further reason is to not overload and mix the approach with

concepts such as goal-based planning which do not purely focus on the challenge of semantic

interoperability in SOA addressed in this work but rather aim for higher levels of automation

and dynamics particularly in service composition leaving aside major real world constraints

regarding the heterogeneity of information models (cf. Section 3.4.3).

5.8.2 Existing Work

As having outlined above and discussed in Section 3.4.2 semantic service enrichment is an

active research field and several concepts, W3C standards and corresponding tools already exist.

This existing work covers both fields discussed above, tool-support for semantic annotation of

Web service descriptions during design time and corresponding Semantic Web service engines

which are able to interpret the semantic description and execute the underlying services during

runtime. This includes for example the tool Assam WSDL Annotator [222]. The basic idea of

the tool is to generate an OWL-S service description from an existing WSDL file. The

following Figure 5-15 shows the graphical user interface of the Assam WSDL Annotator:

Figure 5-15 ASSAM WSDL to OWL-S Annotator GUI [222]

Another tool called WSDL2OWLS converts WSDL-based Web service descriptions to OWL-S

descriptions, whereas analog to the Assam tool the concrete lifting and lowering XSL

transformations have to be provided externally. The tool is integrated into a Semantic Web

service framework developed in the MINDSWAP project at the University of Maryland called

OWL-S API [223]. The framework also supports the interpretation and execution of OWL-S

described Semantic Web services. A good overview of existing approaches and tools has been

provided in context the semantic interoperability work package of the European research project

QualiPSo in [224].

Taking into account the amount of existing work, it can be stated that no further general

functionality is required to enable the methodology step of semantic service enrichment.

However, as turned out during the usage of these tools, one technical enhancement for Semantic

Web service execution is required that addresses the challenge to deal with the different meta-

data models. Therefore, an extension of the applied OWL-S API for OWL-S execution

Chapter 5

132

including lifting and grounding has to be made, which is discussed in context of the realization

of the semantic mediation toolkit in particular with regard to mediated process execution in

Section 5.10.

5.9 Mediated Service Composition

This section describes the semantic mediation methodology step of mediated service

composition and the required functional architecture of a tool to perform this methodology step.

The concrete realization of this functionality is then further described in context of the

developed semantic mediation toolkit in Section 6.3.

5.9.1 Goals and Tasks

The composition of services is an integral part of the SOA life-cycle. From the perspective of

semantic mediation the main focus in this methodology step lies in the information flow design

by a semantic service composer who combines the before provided Semantic Web services to a

mediated service composition. The goal of the composition task is to instantiate the previously

developed cross-organizational business process model by defining a concrete so called

Semantic Web service orchestration plan that can be executed in a process engine. In order to

achieve a high level of automation in this design process, the semantically enriched service

descriptions can be exploited for service selection based on semantic match making between

service input and output parameters and seamless information flow design across organizational

borders. This can be achieved by performing a reasoning over semantically described

relationships (such as inheritance or equality between concepts), thus enabling a composition

tool to support the design of interaction patterns by issuing recommendations for suitable

assignments between output and input parameters of different Web services.

Taking into account that the input and output parameters of different services are described with

ontology concepts from heterogeneous information models the before provided semantic

bridges are used to mediate between the different domain conceptualizations. Thereby, the

semantic integration can be handled transparently for the service composer and combined with

the match making functionality less manual integration efforts are required to achieve semantic

interoperability. Consequently, the main benefit of this approach is to overcome technical data

flow and transformation coding and thus integration efforts can be reduced significantly. The

following Figure 5-16 illustrates the main conceptual artifacts of information flow design in

mediated service composition and how they interrelate:

Methodology and Functional Architecture for Semantic Mediation in SOA

133

Figure 5-16 Basic Idea of Mediated Service Composition

In order to design the information flow in terms of a mediated service composition, the service

composer has to accomplish the following tasks supported by dedicated tool support:

 Firstly, joint semantically enriched input parameters of the whole composition have to

be declared. This can be done by means of domain ontology concepts describing them.

Furthermore, their assignment to corresponding Web service inputs within the

composition needs to be defined. Thus, the whole service composition itself can be

regarded as a Semantic Web service with a composite service input.

 In order to design the information flow between the involved Semantic Web services of

the composition, the individual service output parameters or certain parts of them need

to be assigned to input parameters of services that are succeeding within the

composition. Therefore, an automatic matching mechanism can be performed that

retrieves a semantically sound information flow based on a so called pull-mode. This

means that for each service input (or input part) the potential sources of the information

flow targeting this input are analyzed with regard to their semantic suitability.

Transparently, semantic bridges are integrated into this matching process, in order to

identify matching service parameters across domain borders. The matching candidates

are presented to the service composer as an information flow recommendation.

 Based on the semantically sound information flow recommendations the service

composer selects appropriate parameter assignments that comply with the desired

business process logic. According to the selection the actual information flow needs to

be expressed in order to be integrated in the targeted executable orchestration plan.

Therefore, the description logic based meta-model for information description has to be

reflected which in turn implies that the information flow should be as well expressed in

terms of description logic.

 After the information flow within the service composition is defined, analogically to the

composite input joint output parameters of the composition have to be declared.

Furthermore, their origins from corresponding Semantic Web service outputs within the

composition need to be defined.

 Finally, the defined composition needs to be exported to an executable Web service

orchestration plan. This part is covered in more detail in the previous section about

mediated process execution.

Chapter 5

134

Thereby, the main challenge of dealing with the description logic based meta-model for

information description in contrast to the XML meta-data model used in traditional Web service

composition can be tackled by consistently remaining on the additional semantic layer during

service composition. This includes the usage of Semantic Web services provided from the

semantic service enrichment methodology step. Also the semantic bridges exploited for the

matching mechanism and as well the information flow definitions remain completely on the

level of description logic based meta-models. Thus, no interference with the underlying

structural-based XML meta-data model occurs and a highly automated information flow design

can be ensured.

Even though various approaches for integrating semantic technologies into service composition

exists (cf. Section 5.9.3) the presented approach for mediated service composition features new

aspects such as remaining consistently on the added semantic layer and thus requires a dedicated

solution as a proof-of-concept. Therefore, the following section discusses a possible functional

architecture of a tool for mediated service composition as outlined above. A systematic

requirement analysis in terms of use case diagrams describing the above discussed analysis in

more detail has been performed in [225]. The concrete realization of this functionality is then

further described in context of the developed semantic mediation toolkit.

5.9.2 Functional Architecture

In this section a functional architecture enabling the methodology step of mediated service

composition is presented and explained. The central component is the Semantic Web service

composition tool, which supports a service composer in modeling the control- and information

flow by exploiting the mediation mechanism as described above. To highlight the feature that

the composition is consistently performed on the added semantic layer, additionally the in this

context involved components are integrated into the functional architecture. The following

Figure 5-17 illustrates the main conceptual components of the functional architecture and how

they relate to each other:

Methodology and Functional Architecture for Semantic Mediation in SOA

135

Figure 5-17 Functional Architecture of Mediated Service Composition

The Semantic Web services are based upon traditional WSDL-based Web services, which use

XML Schema (XSD)-based domain standards to describe service input and output parameters

as explained in the methodology step of semantic service enrichment (cf. Section 5.8). On top of

these traditional Web services a Semantic Web service ontology combined with selected

concepts from corresponding domain ontologies is utilized to semantically describe the input

and output parameters of thus established Semantic Web service descriptions. Furthermore,

semantic bridges between overlapping domain ontologies need to be provided, in order to

enable semantic mediation between them. Finally, the Semantic Web service descriptions

together with the semantic bridges build the input for the Semantic Web service composer.

The central component of the Semantic Web service composer is a visual modeling tool that

enables the service composer to orchestrate services. It is based on a Semantic Web framework

for handling the Semantic Web service descriptions and domain ontologies. Additionally, an

ontology reasoner and a rule engine have to be integrated to support the following components:

Firstly, the mediation between the domain ontologies has to be performed by a semantic

bridging engine that applies the semantic bridges to the Semantic Web service parameters.

Thus, the foundation for seamless design of information flow across heterogeneous Web

services from different domains is established. In order to increase automation in this process,

the matching engine reasons over available service parameters and recommends matching

parameter parts that could be assigned to a certain Web service. According to the actual

business process logic the service composer selects the corresponding assignments and

additionally defines joint input and output parameters of the composed process by using the

visual modeling tool. Based on the service composer‟s input, the deployment engine then

generates an execution plan of the composed process and deploys it into an execution engine.

Finally, the created process is then accessible as a new Semantic Web service. The process

execution is further discussed in the next step of the semantic mediation methodology.

Moreover, the above presented functional architecture is concretized in Section 6.3 describing

Chapter 5

136

the developed semantic mediation toolkit by relating the conceptual components to concrete

technologies and their implementation.

5.9.3 Related Work

In recent years the field of semantics-based service composition has been an active research

field as discussed in the state-of-the-art analysis about Semantic Web services (cf. Section 3.4).

However, the focus is put on services composed in homogenous ontology environments or on

the integration of dynamic behavior during the execution of service compositions (cf. Section

3.4.3). In this context semantic mediation is only addressed during runtime, whereas during

design time a homogeneous ontology environment is assumed. In the following an exemplary

European research project is outlined, which shows the relation to this work in the field of

Semantic Web technologies applied to SOA.

The SATINE project [226] has been located within European research activities for networked

businesses and governments. It has develop a semantics-based interoperability infrastructure for

the tourism industry, which provides tools and mechanisms for publishing, discovering,

composing and invoking Web services through their semantics in peer-to-peer networks [227].

A detailed description of the SATINE infrastructure is given in [228]. To focus on the relevant

part for comparison with this work, on the one hand, the mediation approach of the SATINE

project between semantically heterogeneous service requests during runtime is outlined and on

the other hand, the approach to Semantic Web service composition and the execution

framework is presented.

Within SATINE ontologies describing the travel domain are applied for Web service

annotation. With respect to an open environment SATINE supports mapping between different

ontologies. It provides support for the basic steps involved in ontology mapping (cf. Section

3.6.2) by integrating the OWL mapping tool (OWLmt) [229] into the peer-to-peer infrastructure

used for Web service discovery and interaction. OWLmt follows the approach of describing the

mapping between two ontologies in terms of a dedicated ontology, which is called mapping

schema. OWLmt includes a graphical mapping tool which generates instances of the mapping

schema to express the mapping between a source and a target ontology. The tool does not

provide any automatic mechanism for discovering similarities between corresponding constructs

in the source and target ontology. The user has to identify corresponding concepts or properties

and relate them graphically supported with specific mapping schema constructs. These

constructs include similarity and subclass relations as well as transformation relations to

overcome structural differences including one-to-many relations. Transformation relations are

defined by specifying instances of concepts or properties in source queries and a plan for target

instance construction. In order to enable transformation of values, XPath built-in functions are

applied besides an integrated mechanism of user defined JavaScript transformation code to

realize more expressive transformations. Consequently, the resulting mapping can then be

executed by a specific engine to transform instances defined in the source ontology into new

instances defined in the target ontology. Furthermore, mediator nodes within the Web service

peer-to-peer infrastructure, similar to interceptors in RM-ODP (cf. Section 3.5.2), detect

mismatching service requests and invocation messages and transform them into the appropriate

ontology representation needed in a specific context. That enables service discovery and

invocation across peers using different ontologies for describing Web services. In comparison

with the approach applied in this work, which targets semantic mediation on the conceptual

level, it can be stated that the utilized approach of OWLmt in the SATINE project can be

located on the logical level as the core of the mapping mechanism are XPath built-in functions

and JavaScript transformation codes.

Methodology and Functional Architecture for Semantic Mediation in SOA

137

With regard to Semantic Web service composition and execution SATINE supports the

orchestration of Web services published in the peer-to-peer network to provide so called added-

value services. The following Figure 5-18 SATINE Composition Phases and Tools illustrates

the SATINE composition framework, its phases and corresponding tools:

Figure 5-18 SATINE Composition Phases and Tools [230]

The composition framework is based on the concept of activity components [227] which can be

described as abstract functional building blocks representing a set of Web services that provide

the same functionality. Choosing appropriate concepts from a domain ontology, an OWL-S

profile can be defined (step 1). In particular, input and output parameters of the Web service are

thus specified. The OWL-S profile is then used to generate an activity component (step 2),

which also includes a WSDL view on the service. The WSDL description can be exploited for

integration of the Web service into a process description in terms of the Business Process

Execution Language (BPEL). Generated activity components are stored in a repository (step 3)

to make them available for the composition modeling tool. The modeling tool supports the

orchestration of various activity components by assisting in data flow design by means of

assignment recommendations between service outputs and inputs of following services. The

support for data flow design is based on semantic matching of service parameters. In contrast to

the mediation approach of this work, which takes into account heterogeneous domain

ontologies, SATINE requires that conceptualizations are homogeneous and shared on the design

level. However, during the invocation of services the usage of heterogeneous ontologies is

supported as mentioned before by utilizing the OWLmt mapping mechanism. Thus, during

design time the single ontology approach is followed, whereas during runtime the multiple

ontology approach is applied. In particular, the composition tool supports the matching of

concepts that are related by inheritance. However, each ontology class is strictly mapped to one

XML Schema type to enable the data flow expression in BPEL, which is based on the data

model of XML Schema. Consequently, due to the shortcomings of XML Schema regarding the

expression of inheritance, the matching mechanism cannot directly exploit such relations but a

technical work-around need to be provided operating on the structural-based XML level. In

exchange the composed process can be directly stored as an XML-based process template

specified in BPEL in the process template repository (step 5). Furthermore, a configuration tool

is used to specify non-functional parameters of activity components, which influence the

runtime-selection of concrete services from the SATINE peer-to-peer network.

Chapter 5

138

5.10 Meditated Process Execution

This section describes the goals and tasks of the semantic mediation methodology step of

mediated process execution. Furthermore, a functional architecture for a dedicated semantic-

based process execution engine including the presented mediation mechanism is presented and

discussed in context of related work. The concrete realization of this functionality is then further

described in Section 6.3 presenting the developed semantic mediation toolkit.

5.10.1 Goals and Tasks

It has been recognized that the success of Semantic Web technologies relies on the reuse and

integration of existing Web standards. The most widely-used standard for the composition of

Web services is BPEL (cf. Section 3.2.2). A considerable number of mature BPEL compliant

process execution engines testify the broad industrial support for this standard, which provides a

rich set of control and data flow constructs for defining and aligning the interactions between

Web services in a business process. Consequently, the goal of the mediated process execution

approach is to integrate the before composed Semantic Web service composition including the

semantic mediation mechanism into an execution plan formalism in terms of the BPEL

standard. Thus, the execution in available process engines can be ensured.

The main challenge on this regard is to find a suitable mapping between different abstraction

levels: While at design time ontologies and description logic based rules are used for

information representation, information flow and semantic mediation, BPEL execution engines

make use of hierarchically structured XML Schema types, XPath and XSLT transformations

(cf. Section 3.2.3). In order to face this challenge, the starting point is to exploit the RDF/XML

serialization of ontologies (cf. Section 3.3.2) for data representation on the BPEL level.

Furthermore, BPEL enhancements have to be developed to integrate semantic bridges and to

support information flow specifications in terms of description logic based rules. These

enhancements should be incorporated as external functions that can be plugged into BPEL

engines using standardized extension mechanisms. Finally, as well the invocation of Semantic

Web services within the mediated business process has to be covered. Thereby, the mapping of

the additional semantic layer (as discussed in context of the methodology step of semantic

service enrichment in Section 5.8) to the underlying structural-oriented XML layer, which is

natively supported by BPEL, has to be considered. This requires on the one hand, to ensure a

BPEL-conform Web service invocation mechanism while at the same time preserving the

semantic soundness of the service interaction. The following Figure 5-19 illustrates the above

discussed basic idea of mediated business process execution based on BPEL:

Methodology and Functional Architecture for Semantic Mediation in SOA

139

Figure 5-19 Basic Idea of Mediated Process Execution based on BPEL

In order to achieve the above presented approach, firstly a BPEL-conform execution plan

including the extensions for rule-based information flow, semantic mediation and Semantic

Web service invocation has to be generated. This generation has to take place in the design

phase of mediated service composition (cf. Section 5.9). Secondly, components providing the

Semantic Web technology-based functionalities need to be incorporated into a BPEL-based

process integration middleware (BPEL process engine). This incorporation has requires to

reflect the different meta-data models for representing the processed information models on the

runtime level.

A systematic requirement analysis in terms of use case diagrams describing the above discussed

basic idea in more detail has been performed in [225]. The following section discusses a

possible functional architecture of such an extension of a BPEL process engine, whereas the

concrete realization of this functionality is then again further described in context of the

developed semantic mediation toolkit in Section 6.3.

5.10.2 Functional Architecture

This section presents and explains the functional architecture for an engine enabling the

methodology step of mediated process execution. A BPEL-conform process execution engine is

responsible for interpreting the BPEL execution plan. This functionality is bundled in the

engines core component. Furthermore, the process engine is enhanced with additional

functionality to perform the semantic extensions. The relation between the core BPEL

functionality and the extensions for semantic mediation and corresponding components are

illustrated in the following Figure 5-20:

Chapter 5

140

Figure 5-20 Functional Architecture of Mediated Process Execution

The extension evaluator is responsible for evaluating the invocation of additional functionality

embedded in the BPEL execution plan. As the composed process is published as a Semantic

Web service (SWS) the first extension handles the overall process input and outputs and the

translation between the different data meta-models. This includes the lifting and lowering

between XML instances of XML Schema types according to the WSDL interface of the BPEL

process on the one hand and ontology concept instances as used for semantic mediation on the

other hand.

Having lifted the process input data on the semantic level the information flow engine can

interpret the description logic based rules that define the information flow. This includes the

information flow between process input and service input parameters within the composition as

well as between service output and other service input parameters or parts of them. Therefore,

again the functionality of a Semantic Web framework has to be utilized, which provides

ontology reasoning and rule inferencing as basic support services.

Having all inputs on hand, a Semantic Web service can be called by means of the SWS

invocation engine. According to the lifting and grounding mechanism (cf. Section 5.8) the

parameters are transformed from ontology-based individuals (ontology concept instances) to

instances of XML Schema types and vice versa for the reply after the grounded WSDL-based

Web service has been called.

To perform the semantic mediation expressed in the embedded semantic bridges between

heterogeneous services, the semantic bridge engine performs the rule-based ontology mappings.

As described in Section 4.8.4, the heterogeneous representations are mediated from ontology

concepts describing the outputs resulting from the called Semantic Web service to the required

input representations for the next Semantic Web service to be called. Again a Semantic Web

framework is required to support the handling of domain ontologies and the rule-based

mappings between them.

After the last Semantic Web service has been called and the information flow to the joint

outputs has been forwarded, the execution engine passes these outputs as results to the caller of

the Semantic Web service and the execution of the composition is completed.

Methodology and Functional Architecture for Semantic Mediation in SOA

141

5.10.3 Related Work

Integrating semantic extensions into service composition and its execution by means of process

engines has been a vital research field in the last years. Accordingly, several approaches exist

which target to integrate semantics-based enhancements into BPEL. Related work in this field

has been already touched in the related work section of the previous methodology steps of

meditated business process modeling (cf. Section 5.5.3) and mediated service composition (cf.

Section 5.9.3).

In the related work section of the mediated process modeling step the SUPER research project

has been exemplarily highlighted. Besides the modeling aspects including the development of

upper ontologies for modeling semantic business processes (SBPMN), the SUPER project has

defined semantic extensions of the BPEL specification (sBPEL and BPEL4SWS) and has

developed a prototypical semantic BPEL Execution Engine (SBPELEE). The focus of these

activities is put on the dynamic discovery and dynamic composition of services, which are

based on the goal-oriented approach discussed and evaluated in Section 3.4.3. As already stated

there, this approach is difficult to map to non-planning based composition and execution

approaches as applied in the work of this thesis.

In context of the service composition step the SATINE research project has been exemplarily

presented as related work. Besides the composition framework the SATINE toolkit also

includes the export of an execution plan to a BPEL process engine. The designed orchestration

plan of Semantic Web services including control and information flow is expressed in the

XML-based BPEL language, in order to ensure compatibility to state-of-the-art BPEL execution

engines. However the underlying assumption of the SATINE composition and execution

approach has a different focus (cf. Section 5.9.3). While in SATINE a homogenous ontology

environment during the composition and its BPEL-process representation is assumed, the

anticipation of heterogeneous conceptualization of service descriptions in a heterogeneous SOA

landscape is at the core of this work.

The concrete realization of this functionality enabling the execution of enhanced BPEL

processes including the semantic mediation mechanism between heterogeneous services is then

described in context of the developed semantic mediation toolkit in Section 6.3.

5.11 Summary and Reflection

This chapter has presented the developed methodology and functional architecture for semantic

mediation in SOA. The principle idea has been to provide a connecting step between the

developed theoretical concepts and its experimental confirmation by mapping the concept of

semantic mediation to the concrete SOA life-cycle. Accordingly, the semantic mediation

methodology has been aligned to the basic steps of the SOA life-cycle where mediation between

heterogeneous information models is required.

The chapter has started by discussing relevant requirements for the semantic mediation

methodology based on a refinement of the general conceptual requirements identified in Section

4.2. This has included the reflection of organizational boundaries in domain conceptualization

or as well the shift of abstraction level for semantic integration. Moreover, the requirement

regarding the reduction of complexity in semantic integration in terms of separation of concerns

has been addressed in a dedicated domain actor model describing roles and responsibilities. A

further refined requirement has been consistency with best practice SOA methodologies, which

Chapter 5

142

implies dedicated methodology steps for business process modeling and service composition

and respective roles and responsibilities in the actor model. Furthermore, technological path

dependency in SOA has been reflected by outlining the restrictions to certain technologies and

standards of the W3C and OASIS standards body.

Further domain specific considerations with regard to the application field of SOA have been

derived and integrated into the already mentioned domain actor model that specifies

stakeholders relevant for the semantic mediation methodology with their roles, responsibilities

and general skill profiles. The following actors could be identified: domain information model

expert, business process expert, service developer, service composer and service or process

consumer.

Subsequently, an overview of the individual steps of the semantic mediation methodology has

been given to provide a better understanding in terms of a big picture. The semantic mediation

methodology consists of the following seven steps:

 Domain Ontology Development

 Mediated Business Process Modeling

 Semantic Bridge Definition

 Semantic Bridge Testing

 Semantic Service Enrichment

 Mediated Service Composition

 Mediated Business Process Execution

The following sections then have presented the particular methodological steps in more detail

with regard to their goals, the performed tasks within each step and the required functionalities.

In order to prepare an experimental confirmation of key steps of the methodology by means of a

prototypical semantic mediation toolkit, additionally a high-level view on the functional

architecture for each methodology step has been derived. Therein, it has been distinguished

between methodology steps which can be addressed sufficiently with existing work and

respective tools and steps which cannot be mapped to available functionality. Accordingly, for

the steps: domain ontology development, semantic bridge definition and semantic service

enrichment existing work has been presented, whereas for the methodology steps: mediated

business process modeling, semantic bridge testing, mediated service composition and mediated

business process execution a dedicated functional architecture has been elaborated.

Based on the in this chapter developed methodology for semantic mediation and the respective

functional architectures, the following Chapter 6 now presents the developed prototypical

instantiation in terms of the semantic mediation toolkit.

143

Chapter 6

Realization of Semantic Mediation Toolkit

6.1 Overview

This chapter presents the realization of the developed semantic mediation toolkit for the purpose

of providing an experimental proof-of-concept. The toolkit comprises instantiations of the

functionality specified in the previous Chapter 5, which has presented the semantic mediation

methodology and functional architectures of missing steps for its application. For the steps:

domain ontology development, semantic bridge definition and semantic service enrichment

existing work and corresponding tools can be directly reused. The here presented toolkit covers

the newly developed prototypical tools for the methodology steps: mediated business process

modeling, semantic bridge testing, mediated service composition and mediated business process

execution.

Each tool is focused in a separate section following a consistent structure for the description of

its prototypical realization: Firstly, the respective system requirements are briefly refined based

on the goals of the particular methodology step and its enabling functional architecture. The

following section covers the design and actual realization of the respective tool. The functional

architectures presented in the previous chapter are refined to system architectures and the

technical realization of its components and their interrelation are discussed. Then, a further

section is devoted to the validation and verification of each developed prototype based on a

cross-organizational business scenario, which is highlighted from different perspectives

according to the respective focus of each prototype of the toolkit. The validation part analyses

how the developed prototype meets the objectives as refined in the section about system

requirements and the verification part deals with the question, whether the prototype performs

the functionality correctly. An evaluation of the semantic mediation approach itself is given in

an extra Chapter 7.

Finally, a brief overview of references to information about the usage and extension of the

toolkit is provided, which has been published and made available as four separate open source

projects. The chapter finishes with a conclusion and reflection on the realization of the semantic

mediation toolkit.

6.2 Mediated Business Process Modeling Tool

This section discusses the realization of the mediated business process modeling tool as part of

the semantic mediation toolkit. The section is structured according to the outline presented

above (cf. Section 6.1) starting with a summary of the system requirements, over the description

of the design and realization of the system architecture and its components to its application in

terms of a scenario for the tool‟s validation and verification.

Chapter 6

144

6.2.1 System Requirements

The main goals of the methodology step of mediated business process modeling together with

its functional architecture discussed in Section 5.5 express the system requirements of the

envisioned tool. In order to recall them and provide a consistent understanding for the following

sections they are summarized as follows:

 Provide functionality for the design of information flow in business process models on a

non-technical conceptual level in terms of ontology-based information models suitable

for business process experts.

 Ensure a consistent link between the business or conceptual level (ontology-based

information models) and the underlying technical information or data models and thus

provide improved business IT alignment.

 Integrate the developed semantic mediation mechanism to enable seamless design of

cross-organizational information flow, whereas semantic heterogeneities are handled

transparently for business process experts.

 Provide functionality for business process-driven evolution and extension of existing

information models and semantic bridges with corresponding collaboration support

between business process and domain information model experts.

In the following the functional architecture consisting of the three horizontal layers: business

process modeling, semantic annotation and semantic mediation combined with the vertical

support layers: semantic pool and Semantic Web framework (cf. Section 5.5.2) is refined to the

tool‟s system architecture. Additionally, each component is described in terms of its concrete

technical instantiation.

6.2.2 Design and Realization

The prototypical implementation of the tool for mediated business process modeling has been

performed according to the Agile Unified Process (AUP) [231], which is a methodology based

on the Rational Unified Process (RUP) [232] framework. I.e. the development process is

composed of several iterations, where each one has few distinguished phases. After each

iteration, a set of artifacts is collected to systematically present the progress of the development.

Critical (in terms of most difficult to implement and core components) have been designed first

and then prototypically realized before further components have been addressed. The general

architecture style is client-server based as this suits best the goal for distributed collaboration

between multiple business process experts and domain information model experts in the

targeted cross-organizational context. The following Figure 6-1 presents the designed system

architecture:

Realization of Semantic Mediation Toolkit

145

Figure 6-1 System Architecture of Mediated Business Process Modeling Tool

Business Process Modeling: The underlying layer is instantiated by the open-source based

Oryx editor and repository for the basic business process modeling functionality. The Oryx

editor has been chosen among a set of candidates in a systematic criteria-based evaluation

carried out in [233]. The main criteria have been extensibility, support of standardized business

process modeling notations such as BPMN (cf. Section 5.5.2), usability and an active developer

community with support provision. The Oryx editor is based on an Ajax-JavaScript Web

frontend combined with a Java-Servlet-based backend including a Hibernate persistence layer.

The open approach combined with its clearly structured and defined plug-in mechanism

provides a solid foundation for the realization of the upper layers. A comprehensive description

about the Oryx editor can be found in [234].

According to the client-server architecture and the extension mechanism of the Oryx editor and

repository the upper layers have been realized as plug-ins. Its functional components contain

each a client-side Web frontend for the GUI including as well lightweight application logic and

a server-side backend for sophisticated processing (cf. Figure 6-1). The backend includes in

particular the functionality for the additional semantic layers using the API of the Protégé

Semantic Web framework and the persistence handling of extended business process models

with semantic artifacts such as domain ontologies for information entity annotation and

semantic bridges for semantic mediation.

Semantic Annotation: The semantic annotation plug-in extends the Oryx BPMN modeling

functionality by means of semantically enriched expression of information entities and

information flow. The Oryx editor supports multiple notations for business process models.

However, BPMN was chosen due to its standardization and wide industry adoption. The

developed extension allows to link BPMN information entities to concrete concepts of a domain

ontology described in OWL-DL. Among the three available OWL languages levels (cf. Section

3.3.2) OWL-DL has been chosen for the following reasons: OWL-DL imposes some restrictions

on the underlying RDF graphs in comparison to OWL-Full, which does not. Therefore, based on

limitations in expressiveness OWL-DL is decidable and reasonably computable compared to

OWL-Full. Moreover, OWL-DL enables arbitrary values for cardinality restrictions of

properties, whereas OWL-Lite only allows to distinguish between 0 and 1 for minCardinality,

etc. In this context it has to be kept in mind that in the following tools of the semantic mediation

Chapter 6

146

toolkit (e.g. in the mediated business process execution tool) OWL concepts need to be mapped

to XML Schema types due the aimed anticipation of path dependency of Web service

technology. The OWL-DL features for cardinality restrictions allow to cover the XML Schema

feature of defining how often an element may occur within a complex type definition by

minOccurs and maxOccurs, which are not restricted to 0 or 1. Therefore, the language level

OWL-DL has been regarded as most useful for the developed prototypes including the fact that

most OWL reasoners focus as well on OWL-DL.

By selecting a concept from a tree-based representation of the OWL-DL domain ontology graph

the adequate semantic annotation for the business process information entity can be identified.

This representation also incorporates the domain ontology context and the properties describing

the particular concept. In particular, the concept-tree representation focuses on the following

OWL-DL ontology features:

 the concept and its sub-concepts

 the concept and its properties

 the properties and their domain and range (cf. Section 3.3.2)

Further more sophisticated ontology features (e.g. the qualification of a property as functional

etc.) are not represented, in order to focus on a high-level non-technical visualization for

business experts without too much complexity. Therefore, the OWL-DL ontology is processed

on the server-side using the Protégé-API. In order to display the ontology structure, an XML

object representation is utilized to serialize ontology representations to the client-side. The

information flow is represented in BPMN by means of linking annotated information entities to

multiple activities between information entities are exchanged. Besides this visualization of the

annotation which focuses on the whole domain information model, additionally a directly into

BPMN integrated semantic extension of the information entity representation is provided.

According to the approach presented in Section 5.5.1 each annotated information entity is

visualized in the business process model with the corresponding concept properties as cascading

sub-shapes using SVG [235]. Figure 6-2 below illustrates the two perspectives of the semantic

annotation in a screenshot of the Web-based GUI of the extended business process modeling

tool. The BPMN extension of the annotated information entity is shown on the left hand side

and the domain ontology browser including the concept tree on the right hand side. In the center

a semantic bridge between the two semantically corresponding information entities

PurchaseOrder and Order in different domains is highlighted, which will be further explained

in the next paragraph.

Realization of Semantic Mediation Toolkit

147

Figure 6-2 GUI of Mediated Business Process Modeling Tool

The functionality to advance the domain ontology or particular concepts is integrated in the tree-

based representation of the OWL-DL domain ontology graph. Basic extension features such as

adding, changing or deleting a concept or a property can be directly performed in the provided

ontology browser. This can be done through specific buttons and context menus highlighted as

the toolbar in Figure 6-2. More sophisticated advancements of the ontology can be specified in

terms of textual comments by process experts and then have to be externally incorporated into

the domain ontology by domain information model experts. They can utilize a separate tool

dedicated to ontology editing as presented in context of existing work for the methodology step

of domain ontology development (cf. Section 5.4).

Semantic Mediation: The semantic mediation plug-in applies preloaded semantic bridges, in

order to match semantically corresponding information entities in the cross-organizational

business process model. As the application of semantic bridges represents the core part of the

actual semantic mediation mechanism (cf. Section 4.8), its realization is discussed in more

detail in this section and is then just referenced when applied in further tools of the toolkit. The

semantic bridges are realized in terms of SWRL forward-chaining rules combined with the facet

classification semantics of OWL (cf. Section 3.3.2). Therefore, the OWL feature of class

definitions through property restrictions [236] is exploited. This way classes are defined in

terms of at least one necessary and sufficient condition describing exactly the properties which

an instance of this class should exhibit. Consequently, instances which fulfill the conditions can

be classified by an OWL reasoner as members of such a defined class. Different third-party

reasoners and rule engines have been examined in order to interpret and execute the SWRL

rules and perform the required classification.

At first, KAON2 [237] was investigated, since it supports reasoning over OWL and SWRL.

However, KAON2 implements a pure backward-chaining algorithm, which is designed for

query answering; i.e. only facts necessary for answering one specific question are generated. It

does not support the calculation of all facts based on a given knowledge base. That means it is

not possible to trigger a forward-chaining reasoning, determining all facts that can be inferred

from the given knowledge base [237]. However, this is necessary to generate polymorph

instances containing all properties of each concept definition from the domain ontologies

between the semantic bridge is applied. Furthermore, the SWRL support of the Pellet reasoner

[238] has been evaluated, which however at time of investigation did not include required built-

Chapter 6

148

ins [239] such as the makeOWLThing-built-in [240]. However,

this built-in is necessary e.g. if a semantic bridge defines a

mapping between concepts where the target concept is defined

on a finer granularity level then the source concept. In this case

new OWL individuals have to be created to accommodate the

additional level of structure. Due to the lack of support of such

specific built-ins, the Pellet SWRL support is not sufficient.

A third and finally adopted approach is the use of the Protégé-

OWL Framework [241] in combination with Pellet and the Jess

Rule Engine [242]. Protégé is an open source tool for managing

and manipulating OWL. It provides a direct connection to Pellet

for performing OWL-DL reasoning applied for facet

classification. Since Pellet does not support all the SWRL-built-

in-libraries as discussed above, the Jess rule engine is used for

this purpose. The right hand Figure 6-3 illustrates this

realization of the semantic mediation mechanism. Protégé is

utilized as the top-layer framework that coordinates the

communication between the other frameworks. It is responsible

for reading, importing and managing all ontological facts. While Pellet is directly integrated into

the Protégé framework, Jess is an independent component by itself. Therefore, all the facts that

are necessary for executing the SWRL-rules have to be transferred to the rule engine via the

SWRL-JessBridge [243]. The available methods and the syntax for handling the rules are

explained in [244]. Since rules operate on individuals exclusively, proxy OWL individuals have

to be created for all ontology concepts involved in the semantic bridge. The proxy individuals

simulate the actual instances of information entities that will be provided during process

execution. After the semantic bride is executed, the now polymorph proxy individuals can be

visualized in the concept tree of the ontology browser and as well directly in the BPMN model

as illustrated in Figure 6-4 below:

Figure 6-4 Polymorph Information Entities embedded in BPMN

Moreover, based on the polymorph proxy individuals the matching of semantically

corresponding information entities across business domains can be performed. Iterating over the

involved information entities in the process model by taking into account recursively sub-

elements, the concept types can be directly compared. Matching information entities are

highlighted and presented to the process expert. Furthermore, in an analog manner to the

evolution of information models the tool provides advancement functionalities for the evolution

of rule-based semantic bridges. Requirements for missing mapping rules between two concepts

or as well for semantic bridges missing at all can be specified and are stored as textual

Figure 6-3 Realization of

Semantic Mediation Mechanism

Realization of Semantic Mediation Toolkit

149

comments to be addressed by domain experts. The technical realization of the visualization and

advancement functionalities for semantic bridges are similarly realized as for the semantic

annotations as described above.

Semantic Pool: On the one hand, the semantic pool plug-in provides a repository to handle and

manage domain ontologies to be used during annotation of information flow. On the other hand,

the analog functionality is provided to manage the used semantic bridges including dialogs for

import, export, create and versioning operations required for the advancement functionality

discussed above. After importing a domain ontology its URL is parsed on server-side and the

tree-based representation for the client-side ontology browser is generated and stored. Thereby,

the realization takes into account that the used ontologies and semantic bridges are persistently

integrated into the data set of the business process model to restore them consistently when the

business process is reloaded. The following Figure 6-5 shows the visualization of the semantic

pool plug-in.

Figure 6-5 Realization of Semantic Pool

Semantic Web Framework: The Semantic Web framework has been chosen and utilized as

discussed above in context of the realization of the semantic mediation mechanism. It includes

the Protégé API framework combined with the Pellet OWL-DL reasoner and the Jess SWRL

rule engine.

This section has focused on the most important aspects of the tool‟s implementation. A more

detailed description of the tools technical realization including class diagrams, sample

ontologies, test cases, lessons learned are provided in [193] and are discussed in the paper [245].

6.2.3 Scenario, Validation and Verification

This section covers the validation and verification of the developed prototype for mediated

business process modeling. Based on a briefly outlined scenario a subsection about validation

analyses how the developed prototype meets the objectives defined in the system requirements

(cf. Section 6.2.1). A further subsection about verification deals with the question whether the

prototype performs the mediated business modeling tasks correctly. An evaluation of the

semantic mediation approach in general is given in Chapter 7.

The performed scenario is based on the “Purchase Order Mediation Scenario”, which has been

issued by the international Semantic Web Service Challenge (SWSC) [246]. It is highlighted

from different perspectives according to the respective focus of each prototype in the toolkit. In

this section the focus is put on the modeling of a cross-organizational business process

including the design of information flow across heterogeneously defined information

representations. The basic idea of the scenario is that a customer “Blue“ wants to purchase

goods from the manufacturer “Moon“. However, the systems responsible for issuing a purchase

order on the Blue side and for processing the order on the Moon side differ in terms of

information representation and in terms of interaction patterns. I.e. the granularity and

Chapter 6

150

denotation of the data elements used on both sides varies, as does the order and granularity of

operations, necessary to complete the processing of an order. The following Figure 6-6 Purchase

Order Mediation Scenario Overview illustrates the scenario:

Figure 6-6 Purchase Order Mediation Scenario Overview [246]

The purchase order sent by the Blue system is based on an information model specified in the

RosettaNet XML Schema standard, while the Moon system defines its own information model

with a proprietary XML Schema format. Consequently, the challenge is to implement a

mediator that should bridge the heterogeneities regarding the different information models and

interaction patterns of the two systems. The business process between the two companies has

been modeled with the developed prototype. Furthermore, the conceptual information flow has

been designed based on developed domain ontologies “Blue” and “Moon”, which capture the

different conceptualizations of the information models on an ontology level. Moreover,

additionally developed semantic bridges between the Blue and the Moon domain ontology have

been applied, in order to provide a transparent semantic mediation between the heterogeneous

information models to the business process expert. The initial development of the domain

ontologies corresponding to the provided XML-Schemas and the semantic bridges is further

described in the context of the next prototype covering the systematic testing of semantic

bridges. Furthermore, the advancement functionality for information models and semantic

bridges of the mediated business process modeling tool has been used to complete the Blue and

Moon domain ontologies and corresponding semantic bridges. Figure 6-7 below shows how the

purchase order mediation scenario has been mapped to a business process model developed with

the realized prototype:

Realization of Semantic Mediation Toolkit

151

Figure 6-7 Scenario Performed with Mediated Business Process Modeling Prototype

Validation

The required functionality for the design of information flow on a non-technical conceptual

level could be successfully provided: On the one hand by means of the ontology browser

presenting the information model as a concept tree and on the other hand by extending the plain

BPMN representation of information entities with concept annotations from a domain ontology.

Furthermore, based on the underlying OWL-DL domain ontologies utilized for the expression

of domain information models the link to their processing in the further SOA life-cycle could be

ensured. I.e. the OWL-DL domain ontologies can be reused, in order to build the additional

semantic layer for Web service enrichment and Web service composition, which can be mapped

to existing XML-based infrastructures and hence contribute to improved business IT alignment.

The semantic mediation mechanism has been realized by means of SWRL-based forward-

chaining rules and the facet analysis classification of an OWL-DL reasoner. Once loaded to the

semantic pool and accordingly applied with the matching functionality, the semantic bridges

enable a seamless design of cross-organizational information flow, whereas semantic

heterogeneities are kept transparently in the background. Finally, the functionality for business

process-driven evolution of existing information models has been realized within the ontology

browser and dialogs for the extension or completion of semantic bridges have been integrated.

Therefore, it can be stated that the developed prototype for mediated business process modeling

meets the objectives as defined in its system requirements (cf. Section 6.2.1).

Verification

Guided by the scenario several tests were run to ensure that the prototype mainly performs

correctly and stable in the expected behavior. In particular, a set of defined use cases covering

the main functionalities have been addressed. The tested use cases include: manage ontologies

and semantic bridges in the semantic pool, create new ontology, define requirements for a new

semantic bridge, annotate information entity with a concept, display annotated information

entity, edit annotated information entity, link corresponding information entities with a semantic

bridge, suggest semantically matching information entities, display semantic bridge and finally

edit semantic bridge. Additionally, during the development of the prototype several unit tests

Chapter 6

152

were run to check the correct implementation of the various methods. Further details of the

performed scenario are provided in [193].

To summarize, it can be stated that the prototype generally performs correctly. And taken into

account as well the validation discussion, it can be concluded that the implemented prototype

demonstrates successfully how the mediated business process modeling step of the semantic

mediation methodology can be instantiated as a proof-of-concept.

6.3 Semantic Bridge Testing Tool

The testing of semantic bridges constitutes the second step of the developed semantic mediation

methodology. While tools for the creation of SWRL-based ontology mappings exist, the

systematic testing of such mappings has not yet been addressed. Hence, a prototypical

instantiation within the semantic mediation toolkit has been developed that focuses on the visual

design and execution of tests for rule-based semantic bridges between domain ontologies. This

section is structured analog to the previous one starting with a summary of the system

requirements, over the description of the design and realization of the system architecture and

its components to its application in terms of a scenario for the tool‟s validation and verification.

6.3.1 System Requirements

The goals for the methodology step of semantic bridge testing together with its functional

architecture have been already discussed in Section 5.6. They can be referred to as the system

requirements of the envisioned prototypical instantiation. They are recalled and summarized as

follows:

 Provide functionality to define test cases for semantic bridges, which include

exemplarily instances of concepts from the source ontology as test inputs and expected

polymorph instances as expected test outputs (expressed according to source and target

ontology).

 Integrate the functionality to execute the defined semantic bridge test cases under the

required execution conditions (in terms of availability of source and target ontology).

 Provide test reporting functionality which evaluates the semantic bridge test cases based

on a gap analysis between intended results defined in the test case and actual results

received after semantic bridge execution. Furthermore, a calculation should be

performed that quantifies the correctness degree of a semantic bridge. On the one hand,

this evaluation should include a comparison of how many test cases have succeeded vs.

how many have failed. And on the other hand, the coverage of the whole test with

regard to the overall scope of a semantic bridge should be measured.

The functional architecture as discussed in Section 5.7.2 is based on the Model-View-Controller

(MVC) approach including: model components representing the ontologies, semantic bridges

and test projects, the GUI for domain information model experts and modelers of business

processes or service compositions and finally the controller components handling the

interactions as well as providing the actual business logic for the semantic bridge test execution.

Based on this functional perspective the following section refines this functional architecture to

the tool‟s system architecture, whereas each component is described in terms of its concrete

technical instantiation.

Realization of Semantic Mediation Toolkit

153

6.3.2 Design and Realization

Analog to the other prototypes of the semantic mediation toolkit the semantic bridge testing tool

has been iteratively implemented according to the customized Agile Unified Process (AUP).

This process has ensured high quality and enough flexibility to choose between different

realization options. The MVC architecture style is reflected in structuring the tool into

subsystems organized in the three main packages: model, controller and GUI. The following

Figure 6-8 presents the designed system architecture:

Figure 6-8 System Architecture of Semantic Bridge Testing Tool

Model Subsystem: The model subsystem manages the assets on which the testing tool operates.

It is realized in Java following an object-oriented approach. Firstly, it contains a component that

maintains the source and target domain ontologies of the semantic bridge under test. As the

testing tool is part of the semantic mediation toolkit, the supported language for specifying

domain ontologies is OWL-DL according to the other prototypes.

A further component encapsulates a representation of the

semantic bridge to be tested, which is based on SWRL forward-

chaining rules. Moreover, the internal representation of the test

project and its containing test cases are realized as OWL

individuals, whereas a dedicated ontology describing the

required information has been developed. The right hand Figure

6-9 shows the required properties of a test project and test cases.

The test project representation by means of an ontology has been

chosen for two reasons: Firstly, the required technology for

handling and storing the OWL files in a persistent form is

already in use within the prototype and thus convenient to reuse.

And secondly and more important, it provides a suitable form for

exchange of test projects among domain information model

creators and users.

Figure 6-9 Test Project

Ontology

Chapter 6

154

Analogically, the test report is represented as an OWL individual. The test report contains

information whether a respective test case has succeeded or failed. If a test case has failed, the

test report also lists statements in the expected test output which could not be found or inferred

in the mapping result and hence caused a test case to fail. Furthermore, the report for each test

case contains information regarding rules that have been fired. Finally, information about test

case coverage of a particular test case and the overall test coverage is provided (cf. Section

5.7.1).

Semantic Web Framework: The Semantic Web framework utilized in the prototype for test

case execution is the same as it has been exploited for the semantic mediation mechanism as

described in context of the prototype of mediated business process modeling (cf. Section 6.2.2).

This includes the object-oriented representations of ontologies and semantic bridges as utilized

in the model subsystem.

Controller Subsystem: The controller subsystem contains the actual business logic of the

testing tool, which is as well implemented in Java. Firstly, it provides the interaction logic in

terms of dialogs for creating test projects and included test cases. The core component is the test

case execution engine, which reads the test project description and performs the individual test

cases based on the already discussed semantic mediation mechanism applying the semantic

bridge‟s mapping rules (cf. Section 6.2.2). During test case execution the rule execution monitor

determines which mapping rules have been fired. Therefore, specific axioms are temporarily

added to each rule head and thus can be identified after rule execution. This information is

required to prepare the test report. To compare a mapping result with expected test output

instances, an inferred OWL model of the mapping result is derived with the objective to

consider not only asserted but also inferred knowledge during comparison. Then, the test case

execution engine checks whether each statement contained in the expected test output instances

is also contained in the inferred mapping result model. If all the statements are present, the test

case can be considered as succeeded.

The purpose of the rule debugger is to debug the mapping rules by means of executing only a

selected set of rules. In this process, a concept individual of the source ontology, which has to

be provided by the user, is utilized as the input. Similar to the test case execution engine the rule

debugger uses the implemented semantic mediation mechanism to execute a set of mapping

rules and the rule execution monitor to determine which rules have been fired. Finally, a

component manages the test project storage and loading based on serialization of the OWL

representation of test projects and included test cases according to the description in the context

of the model subsystem.

GUI Subsystem: The GUI subsystem realizes the visual presentation relevant for editing and

execution of test cases and rule debugging. The entry point for the functionality is organized in

the testing tool menu. In the center, the ontologies and rules panels are located, which provide

an overview of the rules constituting the semantic bridge and its source and target ontology. The

following Figure 6-10 shows a screenshot of the GUI of the developed testing tool for semantic

bridges, which is based on Java Swing:

Realization of Semantic Mediation Toolkit

155

Figure 6-10 GUI of Semantic Bridge Testing Tool

The lower left hand side shows the source ontology view, the lower center the mapping rules

and the lower right hand side the target ontology. The message panel at the bottom informs

about the tool‟s actions performed during test case creation, execution and rules debugging.

Furthermore, the functionalities for the visual design of test cases and their combination to test

projects have been implemented in a test case editor tab. The input instances are either defined

by the domain expert or generated automatically in terms of dummy instances by the testing

tool. The upper side of Figure 6-10 shows the test case view after test execution as a

comparison between input instances, expected output instances and actual mapping results.

Furthermore, the test case coverage and consistency check results are presented. Similarly, the

debugging view is realized, whereas the domain expert can select the semantic mapping rules to

be executed while observing the mapping results.

Finally, the tool presents a generated test report that summarizes the test results, consistency

checks and the test case coverage. The latter is an indicator used to quantify the expressiveness

of the performed test case as it has been mentioned above (cf. Section 5.7.1). The test report

serves as a basis for conclusions regarding correctness of the semantic bridge or identification of

defects within its underlying mapping rules.

More details on the design, system architecture and the implementation of this proof-of-concept

including class diagrams, sample ontologies, test cases and lessons learned are provided in

[225].

Chapter 6

156

6.3.3 Scenario, Validation and Verification

In order to demonstrate the prototype of the mapping testing tool and provide a means for its

validation and verification, it has been applied to the “Purchase Order Scenario” already

presented in context of the prototype for mediated process modeling. This time the scenario is

highlighted from the perspective of the two heterogeneous domain ontologies and the testing of

the semantic bridge between them. Again, first a validation part recalls the originally defined

system requirements (cf. Section 6.3.1) and applies them as evaluation criteria. Secondly, a

verification part deals with the question, whether the prototype performs the tasks for semantic

bridge testing correctly.

As already mentioned the two domain ontologies “Blue” and “Moon” have been developed, in

order to shift the information models provided as XML Schemas onto the higher semantically

expressive conceptual level. For the purpose of providing a better understanding for the

performed testing of the semantic bridge, the following Figure 6-11 illustrates an excerpt of the

two heterogeneous ontologies representing the different information models of the Blue system

(RosettaNet) and the Moon system (legacy):

Figure 6-11 Heterogeneous Domain Ontologies "Blue" and "Moon"

The information models differ in their semantic sub-graph. As the concept Partner in the

RosettaNet ontology is defined in terms of three object properties, a semantically corresponding

concept Customer in the Moon ontology just features two object properties. The two concepts

contain the same information, however defined at a different level of granularity. The domain

ontologies have been designed according to the methodology step of domain information model

development based on the tools outlined in the existing work part (cf. Section 5.4.2). In

particular, the Protégé ontology editor has been applied. Furthermore, also the functionality for

information model evolution provided by the developed mediated business process modeling

prototype (cf. Section 6.2) has been taken into account to complete the information model

aligned to the business process requirements.

Obviously, the Partner and Customer concepts presented above cannot be exchanged between

communicating partners by default, although they represent the same conceptual idea. In order

to mediate between these differently represented concepts, a semantic bridge between the two

heterogeneous ontologies has been developed according to the methodology step of semantic

bridge definition (cf. Section 5.6). The following screenshot Figure 6-12 shows an excerpt of

mapping rules created with the Snoggle mapping tool presented in Section 5.6.2.

Realization of Semantic Mediation Toolkit

157

Figure 6-12 Example Mapping Rules Created with Snoggle Mapping Tool

By applying the semantic bridge rules, an instance of type Partner is furnished with additional

properties e.g. with hasCustomerInfo combining the values of the BusinessDescription and the

ContactInformation properties hasBusinessName and hasEmailaddress. Having the class

definitions on hand, a reasoner is now able to classify the instance as a member of the defined

class Customer, since all required properties (including hasCustomerInfo) are present. The

following Figure 6-13 illustrates the underlying logic of the mapping rules in a so called human-

readable syntax realizing this part of the semantic bridge:

Figure 6-13 Semantic Bridge and Polymorph Classification Example

In order to test this semantic bridge, a test project with the following three test cases has been

defined and executed:

 Test Case 1 is used for testing the mapping rules between the concepts Partner,

ContactInformation, PhysicalAddress and BusinessDescription of the Blue ontology

and their corresponding concepts in the Moon ontology.

 Test Case 2 is used for testing the mapping between the same concepts; however

provided input instances of the concept ContactInformation are not featured with the

property hasEmailAddress.

Chapter 6

158

 Test Case 3 also tests the mapping between the above mentioned concepts, however an

instance of concept PhysicalAddress is not featured with the property hasHouseNr.

During test case execution Test Case 1 and Test Case 3 are supposed to succeed and Test Case 2

is supposed to fail, due to the missing but required property hasEmailAddress in the provided

test input instance. However, the failing of Test Case 2 does not point to a defect in the defined

semantic bridge but rather has been purposeful designed, in order to provoke a non-succeeding

test case in an otherwise completely correct set of mapping rules. The already presented Figure

6-10 above shows a screenshot after test case execution including the three test cases presented

above. More details on the performed scenario including complete ontology descriptions,

semantic bridges, test case definitions and test reports are provided in [247].

Validation

The overall goal has been to increase trust in the quality of semantic bridges by supporting the

evaluation of correctness of their underlying ontology mapping. Accordingly, three system

requirements have been derived, which coverage is discussed in the following: The developed

prototype provides the required functionality to define test cases for semantic bridges. The

exemplary test input instances and expected output instances can be either provided by the user

or are automatically generated in terms of so called dummy instances. Furthermore, the

developed semantic mediation mechanism could be successfully reused, in order to execute the

defined test cases. Finally, the envisioned test reporting functionality could be provided as a

prototypical instantiation. The quantification of the quality of a semantic bridge in terms of

comparison of succeeding test cases vs. overall test cases is calculated. Furthermore, test

coverage with regard to the overall scope of a semantic bridge is derived by the tool and

successfully tested in the presented scenario. It is important to recall that the absolute

determination of correctness of a semantic bridge cannot be provided by the tool and has also

not been targeted due to the limitations of testing in general as discussed in Section 5.7.1.

However, a successful test case execution in combination with a high indicator for test case

coverage contributes to increase trust in the correctness of the created mapping. Moreover, the

debugging mode of the tool helps to eliminate errors in the semantic bridge development phase.

Consequently, it can be stated that the developed prototype for semantic bridge testing meets the

objectives as defined in its system requirements (cf. Section 6.3.1).

Verification

According to the described scenario several test were run to ensure that the prototype mainly

performs correctly and stable in the expected behavior. In particular, the prototype was

continuously applied during the definition of the semantic bridge between the “Blue” and

“Moon” domain ontologies. Additionally, analog to the previous prototype, several unit tests

were run during the development iterations to check the correct implementation of the various

methods.

Thus, it can be stated that the prototype for testing of semantic bridges generally performs

correctly. Taken as well into account the validation results, it can be concluded that the

implemented prototype demonstrates successfully how the semantic bridge testing step of the

semantic mediation methodology can be instantiated as a proof-of-concept.

Realization of Semantic Mediation Toolkit

159

6.4 Mediated Service Composition Tool

This section is dedicated to the prototypical instantiation of the methodology step of mediated

service composition (cf. Section 5.9). It has been argued that although several approaches

targeting the integration of Semantic Web technologies into Web service composition exist,

their focus is put mainly on the automation of the composition process, however leaving aside

the challenge of heterogeneous information models describing the involved Web services (cf.

Section 3.4.3 and Section 5.9.3). The concept of mediated service composition has been

explained in detail along with its required semantic components and how they should be used.

In order to proof this methodology step, a prototypical composition tool has been developed,

which is described in this section. The section is structured according to the ones before, starting

with a summary of the system requirements, over the description of the design and realization of

the system architecture and its components to its application in terms of a scenario for the tool‟s

validation and verification.

6.4.1 System Requirements

The main goals of the methodology step of service composition together with its functional

architecture discussed in Section 5.7 can be taken as the system requirements of the envisioned

tool. In order to recall them and provide a consistent understanding for the following sections

they are summarized as follows:

 Provide functionality for the visual design of compositions of semantically enriched

Web services, whereas the focus should be put on the design of information flow, i.e.

the assignment of input and output parameters between the services.

 During the design process the tool should exploit the semantically enriched Web service

descriptions for service selection based on semantic match making. I.e. the visual tool

should recommend services, which input and output parameters can be assigned

semantically sound to each other.

 Furthermore, provided semantic bridges should be utilized to mediate between the

different domain conceptualizations describing the Semantic Web services. The

semantic mediation should be handled transparently for the service composer to

overcome technical transformation coding and reduce integration efforts.

 Finally, the tool should serialize the designed composition to a Semantic Web service

orchestration plan based on the industry standard BPEL. The generated orchestration

plan should be executable in available BPEL process engines, while preserving the

capabilities of the exploited ontology-based technology such as polymorphism etc. (cf.

Section 4.8).

The functional architecture as discussed in Section 5.9.2 is based on the following components:

semantic bridging engine, matching engine, visual modeling tool and deployment engine and

supported by a Semantic Web framework. They process the artifacts including domain

ontologies, Semantic Web service description and semantic bridges, in order to produce an

executable orchestration plan. In the following this functional architecture is refined to the tool‟s

system architecture, whereas each component is described in terms of its concrete technical

instantiation.

Chapter 6

160

6.4.2 Design and Realization

Again the customized Agile Unified Process (AUP) has been applied for the development

phase. Furthermore, the architecture reflects as well the MCV style. However, for this prototype

this approach is not highlighted furthermore but still remains as the underlying functional

architecture style. More central for this prototype are the various artifacts processed by the tool,

which are therefore focused in the system architecture following the information viewpoint (cf.

Section 3.5.2). Accordingly, the system architecture for the mediated composition tool is

illustrated in the following Figure 6-14. It provides an overview on the applied technologies for

the different components of the prototype.

Figure 6-14 System Architecture of Mediated Service Composition Tool

The subsequent paragraphs describe the components in detail and explain how they are related

to each other:

Semantic Web Services: The tool requires the input of Semantic Web services, in order to

compose them to an orchestration. Therefore, a set of exemplary Semantic Web service has

been developed, which are further described in context of the performed scenario (cf. Section

6.4.3). The realization of the Semantic Web services has been achieved by defining OWL-S

service descriptions and ground them to underlying traditional WSDL-based Web services as

described in the existing work section of the methodology step of semantic service enrichment

(cf. Section 5.8.2). The development of therein referenced domain ontologies and corresponding

Realization of Semantic Mediation Toolkit

161

domain XML Schemas standards, which specify the input and output parameters in the WSDL

descriptions, is explained as well in context of the scenario (cf. Section 6.4.3). Especially,

XSLT transformations need to be developed, in order to realize the lifting and lowering between

the different meta-data models of OWL-S and WSDL. For the development and provision of the

underlying WSDL-based Web services, the Apache AXIS framework was used to map the Web

service interfaces to the actual implementation of the service in terms of a Java components.

As outlined in Section 3.4.2, there exist several approaches for realizing Semantic Web services.

In this work OWL-S Version 1.1 has been chosen, because OWL-S provides several advantages

for the purpose of the presented approach. OWL-S has been the first of the submissions to the

W3C in the context of Semantic Web services and many researchers regard OWL-S as the most

mature Semantic Web service technology [108]. Compared to other approaches such as WSMO

or SWSF, OWL-S is a light-weight approach. WSMO and SWSF concentrate on goal-based

dynamic plan creation known from artificial intelligence research, which has turned out to be

less suitable for the approach targeted in this work (cf. Section 3.4.3). Also it has been

considered to move from OWL-S-based Semantic Web services to the newer light-weight

approach of SAWSDL (cf. Section 3.4.2), which recently was adopted as a W3C

recommendation. There is growing support in terms of available tools to create and parse

WSDL files annotated with the extension attributes defined by SAWSDL. Moreover, it is

already used in several research projects. However, at the time of this prototypical realization no

available framework or API exists, that supports the complete process from parsing of an

SAWSDL file to the execution of the Web services including the lowering and lifting of the

inputs and outputs of the service. Therefore, SAWSDL could not be considered to be used in

this work.

Semantic Bridges: The semantic bridges have been realized by means of SWRL forward-

chaining rules as already elaborated in context of the previous prototypes (cf. Section 6.2.2).

The serialization of the SWRL-based semantic bridges are OWL files, whereas a dedicated

OWL ontology has been defined in the SWRL standardization process, which describes the

content of the rules including body and head in terms of ontology concepts (cf. Section 3.3.2).

Semantic Web Framework: The Semantic Web framework utilized in the prototype is the

same as it has been exploited for the semantic mediation mechanism as described in context of

the prototype for mediated business process modeling (cf. Section 6.2.2). In the mediated

service composition tool the framework has been extended with the Java-based OWL-S API

[223]. It enables to parse and generate OWL-S-compliant Semantic Web service descriptions.

Moreover, the OWL-S API also supports the execution of Semantic Web services including

lifting and grounding, which is exploited for the mediated business process tool (cf. Section

6.5). Together, the framework is utilized for the handling of the Semantic Web services, the

semantic bridges, the application of the semantic mediation mechanism and the matching

mechanism discussed below.

Semantic Mediation Engine: The semantic mediation engine performs the SWRL rule-based

mappings described in the semantic bridges. The engine is realized by reusing and integrating

the already described instantiation of the semantic mediation mechanism (cf. Section 6.2.2).

Applied to the composition task, this means that output parameters of Semantic Web services

expressed in the source ontology are additionally expressed in the target ontology as polymorph

representations. Thus, the service composer can assign them to inputs of subsequent Semantic

Web services within the composition regardless of heterogeneous representations of their

service parameters. Again, it has been taken into account that rules operate on OWL concept

instances called individuals by binding variables to them. However, the service parameters are

described by OWL concepts, which hinders that the semantic bridges can be directly work on

Chapter 6

162

them. Therefore, analog to the annotated information entities in the mediated process modeling

tool (cf. Section 6.2.2), firstly proxy individuals are generated for the Semantic Web service

outputs based on the concept definitions describing them.

Matching Engine: The matching engine performs reasoning over Semantic Web service

parameters, which enables the composition tool to provide recommendations for suitable

information flow assignments between the involved services. It exploits the semantically

described relationships between concepts described in the utilized domain ontologies, such as

inheritance or equality. As well the matching engine considers the polymorphism based on facet

analysis classification and the transparently applied semantic bridges. In particular, the

matching mechanism checks, if any concept COUTPUT describing the part of an output parameter

of a service A fits to the concept CINPUT describing the part of an input parameter of a service B.

If they fit, then this matching represents an assignment candidate for semantically sound

information flow between service A and B. The question whether two concepts fit, depends on

their inferred relation to each other. If concept COUTPUT is a subconcept or defined as equal to the

required concept CINPUT, then these concepts fit. Furthermore, if a semantic bridge has been

defined between two concepts they also fit, because due to inferred rules and facet classification

they share the same type (cf. Section 4.8.4). The subclass relation ensures that all properties are

inherited from a superclass. The equal relation ensures that the same or equal properties are

available, so that the required properties can be inferred from the equal properties. And for

mediated concepts the semantic bridges infer the required properties through semantic bridge

rules. Thus, it is ensured, that the output described by concept COUTPUT provides all the

properties required by concept CINPUT. This can be considered as the core requirement for a

proper assignment as the guarantied properties ensure that the forwarded data can be processed

correctly.

Visual Modeling Tool: The visual modeling tool provides a Java Swing-based graphical user

interface to model the information flow in Semantic Web service sequences. Due to the nature

of a prototype, it has been focused on the information flow design and more sophisticated

control flow functionalities going beyond simple service sequences have not been realized. The

top panel of the visual modeling tool presents an overview of the current state of the designed

service composition and serves as a navigation bar. Selecting a service, its input and output

parameters are displayed in a parameter split pane in the middle. The following Figure 6-15

shows a screenshot of the prototypical tool captured during the matching process:

Realization of Semantic Mediation Toolkit

163

Figure 6-15 GUI of Mediated Service Composition Tool

Based on the automatically provided assignment recommendations, which are highlighted in

green color, the service composer defines the appropriate information flow between the services

according to the overall process logic. The same view is used to define the overall inputs and

outputs of the designed composite service. Based on the assignments an information flow

description can be formulated, which is integrated into the execution plan of the service

composition. In order to remain consistently on the conceptual level followed in the Semantic

Web service descriptions and the rule-based semantic bridges, the information flow is as well

expressed by applying description logics. This means that SWRL-based forward-chaining rules

specify how output instances of a service A are asserted to the input of a service B.

Accordingly, on the one hand, semantic bridges and as well the information flows within a

composition are implemented as forward-chaining rules. This way, information flow can be

expressed as a rule, where the body of the rule describes the involved parameters and the head,

which is inferred, describes the parameters after the information flow has been assigned.

Finally, the bottom of the visual composition tool provides a message panel, which informs the

user about the performed tasks.

Deployment Engine: Apart from the composition support, the tool is capable of exporting the

defined service composition as an BPEL orchestration plan, which can be executed as a

composite service in a process engine. The deployment engine is responsible for serializing the

composition including the mapping rules of the semantic bridges, the rule-based information

flow definitions and the Semantic Web service calls. However, due to the higher abstraction

level during the composition design (based on the ontology meta-data model), the composed

process cannot be directly expressed in a BPEL orchestration plan (based on the XML Schema

meta-data model). Especially, the semantic bridges and the information flow descriptions

expressed in description logic rules cannot be directly mapped to BPEL constructs. Therefore,

an extension mechanism of BPEL has been exploited to incorporate these semantics-based

features. The details of this extension mechanism and its reflection in an appropriate BPEL

Chapter 6

164

engine are discussed in the following prototype targeting the mediated business process

execution (cf. Section 6.5). Furthermore, the tool generates additional necessary artifacts to

construct a BPEL deployment bundle including an engine-specific deployment descriptor and a

WSDL descriptor. This WSDL file describes the structure of the input and output messages of

the designed process, in order to access it with conventional Web service technologies as

required for a BPEL process. Furthermore, a Semantic Web service description is generated,

which is grounded to the previously constructed WSDL file. Thus, the composition is finalized

and available in an SOA landscape as a newly created composite Semantic Web service.

More details on the design, system architecture and the implementation of this proof-of-concept

including class diagrams, sample Semantic Web services, test cases and lessons learned are

provided in [225] and are discussed in the paper [8].

6.4.3 Scenario, Validation and Verification

Analog to the previous prototypes a scenario has been performed, in order to assess whether the

aimed system requirements of the mediated service composition tool could be met and if they

have been implemented correctly. This time a different scenario has been performed. Due to the

nature of the tool as a proof-of-concept, it supports only the most relevant functionality for

mediated service composition. Therefore, the focus is put on the design of information flow and

only basic control flow specifications, namely sequences are supported. In order to cover the

previously addressed “Purchase Order Scenario” more control flow functionality would be

required such as the definition of loops or the evaluation of conditions. Nevertheless, a more

suitable scenario for the assessment of the developed tool has been designed demonstrating a

sequential cross-organizational process from the eGovernment domain.

In this eGovernment scenario the process of applying online for a birth certificate has to be

implemented in terms of a Web service composition. In the application process various

authorities from different domains are involved. The process includes a service for handling the

payment of the birth certificate fee, a resident registry service for checking the citizen input for

consistence, a vital records office responsible for issuing the birth certificate and a statistical

office, to which the vital records office reports its activities.

These services are supposed to provide standard XML-based Web service interfaces including

well-defined message sets. In fact, in various countries national interoperability frameworks

impose XML Schema and Web service interfaces for exchanging data between administrations

to increase interoperability. An example for such a national effort is the Danish eGovernment

initiative, which focuses on the definition and promotion of XML domain data structures [248].

A key achievement of the initiative is the “InfoStructureBase” [249], a shared repository for

XML-based schemas of eGovernment services. In other countries such as Germany, its federal

structure causes a less centralized approach. Nevertheless, there are domain and state-specific

initiatives, which work under a shared organizational framework named OSCI-XÖV [250].

However, interoperability is just ensured within domain boundaries, e.g. through OSCI/XMeld

(XML Schema exchange standard between registration offices) or OSCI/XJustiz (XML Schema

exchange standard for legal authorities). In cross-organizational eGovernment processes (such

as addressed in the scenario below) services of various public agencies from different domains

are involved. In such scenarios the lack of semantic interoperability across-domains results in

enormous integration efforts.

Realization of Semantic Mediation Toolkit

165

Accordingly, in the developed

scenario different XML Schemas

have been applied to describe the

WSDL-based Web services. The

domain information models have

been expressed in terms of ontologies

to benefit from the developed

semantic mediation mechanism.

Therefore, the Web services have

been semantically annotated with

concepts from the OWL-S upper

ontology and the designed domain

ontologies. Furthermore, SWRL-

based semantic bridges have been

developed according to the previous

methodology steps. The left hand

Figure 6-16 illustrates the applied

semantic bridges and the designed

information flow of the eGovernment

scenario. For instance the domain

standard employed by the resident

registry uses a different information

representation for names and

addresses than that used by the vital

records office. While in the one

domain an address might be a

complex type consisting of different

attributes for given name, surname,

street, street number, etc.; in the other

domain standard the address concept

might be modeled as a complex type

that contains just one single attribute

for street and street number all

together. According to this process

description a composite eGovernment service has been composed as presented in the screenshot

Figure 6-15 above. Finally, a BPEL-based orchestration plan has been generated and published

in an extended open source-based process execution engine, which is further discussed in the

following Section 6.5. More details on the performed scenario including ontology descriptions

of the eGovernment domain standards, semantic bridges and test case definitions are provided

in [247].

Validation

The overall goal has been to ease the information flow design in Web service compositions

spanning heterogeneous conceptualized domains. In particular, five system requirements have

been derived, which extent of coverage is discussed in the following.

The basic functionality for the visual design of information flow in Web service compositions

has met the expectations, whereas this does not include the provision of complex control flow

support. In particular, the expression of information flow, i.e. the assignment of input and output

parameters, could be achieved by utilizing description logic based rules. Thus, the whole

Figure 6-16 eGovernment Scenario for Mediated Service

Composition

Chapter 6

166

composition design could remain purely on the conceptual level. Furthermore, the developed

semantic mediation mechanism could be successfully integration in the service composition

process, while keeping its application transparent to the service composer as described in the

scenario. Based on the semantic mediation as well the required matching mechanism for service

selection and composition support has been realized. The realization of the final requirement of

exporting the designed composite service to a semantically extended BPEL engine has been

fulfilled. However, the successful execution of the generated execution plan and its validation

remains to be shown in the next section discussing the realization of the mediated process

execution tool.

Verification

Analog to the previous verification part several tests have been run to ensure that the prototype

mainly performs correctly and stable in the expected behavior. In particular, the WSDL-based

Web services representing the eGovernment services have been tested by specifically developed

test clients. Similar, Semantic Web service test clients have been developed that call each

eGovernment Web service using its OWL-S description and the OWL-S API for direct

invocation. Moreover, during the development of the composition tool and execution engine

several unit tests have been run to check the correct implementation of the various methods. In

order to ensure that the prototype does not only work with the specifically for the scenario

designed Semantic Web services, it has also been tested with test-oriented Semantic Web

services provided in the Mindswap project [223].

Thus, it can be stated that the prototype for mediated service composition generally performs

correctly. Taken as well into account the validation results, it can be concluded that the

implemented prototype demonstrates successfully how the mediated service composition step of

the semantic mediation methodology can be instantiated as a proof-of-concept.

6.5 Meditated Process Execution Tool

This section is dedicated to the final tool of the semantic mediation toolkit, which instantiates

the methodology step of mediated process execution (cf. Section 5.10). In order to close the

cycle of the presented methodology, it remains to demonstrate how modeled business processes,

which then have been instantiated by a composition of semantically enriched Web services, can

be finally executed. The section is structured analog to the previous, starting with a summary of

the system requirements, over the design and realization description of the system architecture

and its components to its application in terms of a scenario for the tool‟s validation and

verification.

6.5.1 System Requirements and Challenges

Analog to the previous prototype descriptions, the goals of the methodology step of mediated

business process execution together with its functional architecture (cf. Section 5.10) are

recalled, in order to list the system requirements of the aimed prototype. The general goal has

been to incorporate components providing the Semantic Web technology-based functionalities

into BPEL process integration middleware (BPEL process engine). Thereby, in particular the

reflection of the different meta-data models (ontology vs. XML) on the runtime level is

required. The Semantic Web technology-based components to be incorporated into BPEL

include the following four aspects:

Realization of Semantic Mediation Toolkit

167

 Semantics-based information flow – Enable the execution of information flow

definitions, which are specified in terms of description logic based rules that work on

serializations of OWL ontology concept instances.

 Semantic mediation mechanism – Integrate the developed semantic mediation

mechanism to enable the execution of information flow across services with

heterogeneous ontological conceptualizations of service parameters.

 Semantic Web service invocation – Remain on the higher conceptual abstraction level

to the latest possible point, i.e. breaking down the abstraction level just before service

invocation in terms of mapping Semantic Web service calls to traditional WSDL-based

Web service calls.

 Semantic Web service interface to the semantically-extended BPEL orchestration –

Handle instances of ontology concepts as process inputs and outputs in terms of a

Semantic Web service interface on top of the WSDL interface of the BPEL process.

In order to provide a practice-oriented and usable solution, also time consumption for the

reasoning during execution of semantic bridges and information flow rules has to be considered,

which is often a weakness of Semantic Web technology.

In order to achieve these goals, the functional architecture is based on an extension mechanism

for BPEL process engines to incorporate the above summarized semantics-based functionalities.

In the following the functional architecture is refined to the system architecture of the mediated

process execution engine, whereas each component is described in terms of its concrete

technical instantiation.

6.5.2 Design and Realization

Analog to the previous prototype realizations again the customized Agile Unified Process

(AUP) has been applied during development. The principle idea of the realization has been to

exploit the XML-based RDF-serialization of OWL concept instances (cf. Section 3.3.2). As the

information flow needs to be embedded in the BPEL process description, such serializations are

stored in BPEL variables, which are designed to hold XML instances. Whenever one of the

above outlined semantic enhancements is required to provide processing on the higher

abstraction level, a call for external functionality is delegated by the extension evaluator to

dedicated external engines performing this task. Such an external engine parses the dedicated

BPEL variable and loads the stored information into an OWL model, in order to process the

information on the higher semantically expressive abstraction level. Thus, the system

architecture follows a straight-forward approach by providing for each semantic extension one

dedicated external engine, which are incorporated though a common BPEL extension

mechanism based on XPath function calls. Accordingly, the system architecture for the

mediated process execution engine is illustrated in the following Figure 6-17:

Chapter 6

168

Figure 6-17 System Architecture of Mediated Process Execution Engine

The subsequent paragraphs describe the individual components and explain how they are related

to each other:

BPEL Process Engine: The realization of the prototype is based on the existing ActiveBPEL

[251] process execution engine. The BPEL-engine requires a Web container to run in. The

recommended container for ActiveBPEL is the Apache Tomcat Web Server. In order to enable

the aimed mediated process execution, mainly the handling of ontology concept instances in

BPEL and a solution for an extension mechanism for the semantic enhancements have to be

targeted. The ActiveBPEL engine has been chosen since it is open source and supports defined

extension points by implementing so called “custom functions”. However, the concept and

developments can be applied to any BPEL engine supporting such defined extension points. As

an alternative, the first investigated approach for the extension mechanism has been the

delegation to external components via Web services using the standard <invoke> activity of the

BPEL language. The advantage of this approach is that no modifications or extensions have to

be made to a BPEL engine and that all BPEL compliant execution engines could execute the

resulting process. But a drawback is the mixing of “functional Web services“ that contribute to

the business logic with „technical Web services“ that would perform supporting tasks such as

mediation and information flow. Hence, the integration of semantic bridges, semantic

information flow etc. via standard Web service calls has not been chosen as the preferred

solution. The more suitable approach is to use the extendibility of XPath to integrate semantic

components into BPEL. XPath is supported in BPEL engines to support XML processing,

whereas XPath custom functions are designed to externalize certain complex processing tasks to

dedicated components. The advantage of this approach is that no modifications to the BPEL

engine have to be made since the most popular BPEL engines such as ActiveBPEL, Apache

ODE [252] and Oracle BPEL Process Manager [253] support the integration of custom XPath

functions ([254], [255], [256]). Additionally, portability issues are minimized, since custom

XPath functions can be installed into all above mentioned BPEL engines using standardized

installation procedures.

Realization of Semantic Mediation Toolkit

169

As it has been already outlined above, the handling of ontology concept instances in BPEL is

based on their serialization format. Accordingly, the most straight-forward solution is to type all

BPEL variables as xsd:anyType and to assign OWL individuals in terms of RDF/XML-

serializations to them. Moreover, in order to be able to define

SWRL information flow rules, that move parameters between

service inputs and outputs, it is necessary to prepare the

variables. For this purpose so called typed containers define a

graph-structure of statements as an anchor to which the actual

parameters can be attached. This provides defined access points

on which the SWRL rules can be performed. The right hand

Figure 6-18 illustrates an empty typed container for a service

input.

In future it can be assumed, that a semantically enhanced version of BPEL might natively

support ontology concepts to be used for typing of variables. Nevertheless, the implementation

of a respective semantic BPEL engine is out of scope of this thesis. Rather the aim is to use

standard extensions of the BPEL specification to integrate the additional semantic functionality.

In fact, storing OWL individuals in terms of RDF/XML-serializations in BPEL variables goes

well along with the decision to use custom XPath functions to incorporate semantic concepts

into BPEL. Thus, BPEL variables can be used in BPEL expressions, which are used to call the

custom XPath functions. Hence, each semantic component can gain easy access to the required

OWL individuals and each semantic component can be realized as a separate Java component

integrated via the universal Java XPath engine [257].

Input and Output Handler: A deployed BPEL process is as a whole handled like a traditional

Web service, whereas its interface is provided by an XML Schema-based WSDL definition

created during deployment. Thus, the XSD instances of the process input have to be lifted onto

the ontology level. As the lifting and lowering mechanism based on XSL transformations is well

addressed in the OWL-S API providing the grounding (lifting and lowering) for Semantic Web

service calls, the same functionality is reused for the process input lifting and process output

lowering. The process inputs and analog the process outputs are passed to an external Java

component based on the OWL-S API via the XPath custom function mechanism.

Information Flow Engine: Having lifted the abstraction level of the service parameter

representation, the information flow rules can be applied, which assign the process input parts to

the respective service inputs and between service outputs to following service inputs within the

orchestration. It is important to note that the information flow rules have already been integrated

into the generated BPEL process description during mediated service composition based on the

information flow assignments. Consequently, the information flow rules are embedded within

the BPEL process storing them in terms of their RDF/XML serialization in variables. The

execution of a SWRL rule-based information flow is again performed as an XPath custom

function call in a BPEL assignment expression. The engine‟s internal expression evaluator

determines the responsible external Java component and delegates the function call together

with the variables holding the SWRL rules, the source parameters and the target parameter.

Within the external component the RDF/XML-serialized service parameters including the

anchor structure (cf. Figure 6-18) are loaded into a new Protégé-OWL model allowing further

semantic processing. Internally, all service inputs and outputs are handled in different OWL

models but are merged temporarily to apply the SWRL rules and thus perform the information

flow between services. The application of SWRL rules for the information flow is realized with

a similar mechanism as already explained for the semantic mediation engine (cf. Section 6.2.2).

Finally, the resulting OWL model is reduced to contain only the statements belonging to the

Figure 6-18 Typed Container

in BPEL Variable

Chapter 6

170

respective service input. Finally, the OWL model can be serialized to its RDF/XML

representation before writing it back to the BPEL process into a variable holding the inputs for

the targeted service.

Semantic Mediation Engine: In order to execute the mediation between the heterogeneous

conceptualizations used for the description of service parameters, the application of semantic

bridges is delegated to the external semantic mediation engine. Therefore, the SWRL rules for

the semantic mediation are embedded into the BPEL process in an analog manner as described

for the information flow rules above. Again, the XPath custom functions are exploited to

delegate to an external Java component implementing the semantic mediation engine. It is

realized by means of a combination of selected Semantic Web frameworks based on the same

semantic mediation mechanism and its implementation as described in the previous prototypes

(cf. Section 6.2.2).

In this context of process execution, it should be noted that the current OWL-DL semantics

bring one particular shortcoming for the application to Semantic Web services. The OWL-DL

semantics are based on the open-world assumption (cf. Section 3.3.3) as the language is

designed for the World Wide Web. However, in the context of Semantic Web service

composition all information required for reasoning are available during processing time.

Accordingly, closed-world semantics would be more suitable. For example if a class C is

defined as the set of individuals having exactly one certain property p in terms of

minCardinality = 1 and maxCardinality = 1, then due to open-world semantics an individual

featuring this certain property p cannot be classified to class C . The open-world semantics-

based reasoner does not know if there maybe exists a second occurrence of p for this individual,

so that the restriction would become unsatisfied. This behavior was not expected but has been

detected during the implementation phase of the prototypes. However, it is important to express

exact cardinality constraints on service parameter properties to correspond to the constraints of

type definitions in XML Schema domain standards describing WSDL-based Web services. In

order to overcome this weakness, the feature of functional properties within OWL-DL is

exploited. By declaring the property p as functional the reasoner knows that p exists exactly

once. Thus, even with open-world semantics the individual (service parameter) can be classified

to class C. Hence, basic cardinality restrictions are supported and the facet analysis

classification applied in the semantic mediation mechanism reflects the requirements which

originate from the targeted mapping to underlying traditional XML Schema-based Web

services.

Semantic Web Service Invocation Engine: After the representations of service parameters

have been mediated by semantic bridges and all required input parameters have been assigned

according to the defined information flow, a Semantic Web service can be invoked. Analog to

the previous components, the Semantic Web service call is delegated via the XPath extension

mechanism to an external Java component which instantiates the Semantic Web service

invocation engine. It applies the OWL-S API to get programmatic access to read and execute

OWL-S service descriptions. Internally, service specific XSL transformations have to be

performed, in order to ground the Semantic Web service to an XML Schema-based Web service

(cf. Section 5.8.2). In this process, it has to be taken into account that XSLT does not work on

the same meta-data model as OWL and thus can only operate on the OWL serializations.

However, the natively provided RDF/XML-ABBREV serialization by the OWL-S API

implementation does not allow to exploit the full potential of polymorphism as necessary for

further processing of service parameter with semantic bridges. When a polymorph individual is

serialized using the RDF/XML-ABBREV format, one of the types it holds is non-

deterministically selected and the last fragment of the types URI is taken for the opening tag for

this individual's XML serialization. The other types are expressed by separate <rdf:type.../> sub

Realization of Semantic Mediation Toolkit

171

elements. This varying structure complicates the development of the service specific XSLT

code dramatically. To overcome this weakness, the OWL-S API, which is provided as on open

source project, was adjusted. Now internally the basic RDF/XML serialization is applied. This

means that all types are represented equally as sub-elements, which allows to define straighter

XSL transformations. Hence, the mapping from polymorph OWL serializations to single typed

XML Schema instances can be achieved in terms of XSLT rules that match exactly the type

which has been defined in the OWL-S input description. The processing of the Web service

results is less complicated, as the received message parts correspond to the data model XSLT

was designed for. XSLT rules can easily match the XML Schema instances and fill predefined

skeletons of serializations of OWL individuals.

It has turned out, that the application of XSL transformations for Semantic Web service

groundings as intended in OWL-S provides only a sub-optimal solution for bridging the gap

between the abstraction levels of the meta-data models of Semantic Web services and WSDL-

based Web services. To overcome the complicated XSLT processing on OWL serializations, a

more powerful language should be applied. This language should be similar to XSLT in terms

of providing means to produce XML Schema based instances but it should as well be able to

understand and work on the triple model of OWL. However, as long as such a language is not

established and well supported (cf. Section 3.4.3) the presented mechanism provides a solid

solution based on widely accepted and standardized languages.

The four above discussed semantic extensions can be found in the following Figure 6-19

showing a process execution report of the extended ActiveBPEL engine. It illustrates the

execution of a semantically enriched BPEL process, which has been generated by the mediated

service composition tool according to the eGovernment scenario discussed in Section 6.4.3. The

semantic extensions for process input handling, semantic information flow and invocation of

Semantic Web services can be seen in the process graph shown in the center:

Figure 6-19 Mediated Process Execution in Semantically Enhanced Process Engine

Chapter 6

172

More details on the design, system architecture and the implementation of this proof-of-concept

including class diagrams, further samples of semantically enriched BPEL processes, test cases

and lessons learned are provided in [225] and are discussed in the paper [185].

6.5.3 Scenario, Validation and Verification

Analog to the previous prototypes, a scenario has been performed to assess whether the aimed

system requirements of the mediated process execution engine could be met and if they have

been implemented correctly. This time again the “Purchase Order Mediation Scenario” already

introduced and modeled in context of the mediated business process modeling prototype (cf.

Section 6.2.3) and the semantic bridge testing prototype (cf. Section 6.3.3) has been exploited.

Having the capabilities of the mediated process execution engine at hand, the full mediation

scenario has been as well performed on the runtime level.

Besides applying semantic bridges to mediate between different information models used by the

interacting systems Blue and Moon, the control flow capabilities of BPEL are used to align as

well their heterogeneous interaction patterns. I.e. the granularity and order of operations of the

two systems in the scenario differs. For example the Blue system issues a RosettaNet-based

purchase order message containing all relevant information such as contact information for the

shipping as well as requested items and their requested quantities in one block. On the other

side, the Moon system expects this information to be sent separately: First the customer

information, then a general order request and - upon acceptance of the order in general - each

ordered item one by one. The following Figure 6-20 Purchase Order Mediation Scenario shows

the full mediation scenario including the interaction patterns. Furthermore, it highlights where

the developed semantic extensions address the key challenges of the scenario:

Figure 6-20 Purchase Order Mediation Scenario and Semantic Extensions

Realization of Semantic Mediation Toolkit

173

The following paragraphs discuss the semantic extensions (1) – (5). This includes as well

references to the previously presented prototypes, as finally they all together contribute to the

solution of the scenario.

(1) Information Model Shift to Domain Ontologies – In order to shift the different information

models to a higher abstraction level and address semantic interoperability on the more

adequate conceptual level for this challenge, the two XML Schema-based information

models (RosettaNet and Moon) have been lifted to domain ontologies. This part of the

scenario solution has been already discussed in detail in context of the semantic bridges

testing prototype and its application to the scenario (cf. Section 6.3.2).

(2) Semantically enriched Web Services – By using the technologies described in Section 5.8.2

the WSDL-based Web services provided in the scenario are annotated with concepts from

the developed domain ontologies, in order to get OWL-S Semantic Web services. For

instance, in the scenario a Semantic Web service for the Moon CRM has been developed,

which expects the defined class Customer which – among others – defines the property

hasBusinessName. This property is used as the search criteria for the customer lookup. If a

customer with the given name is found, an IdentifiedCustomer OWL individual is returned

containing all customer attributes supplied by the CRM system. The corresponding domain

ontology containing these concepts has been already discussed in Section 6.3.3.

Furthermore, OWL-S-based services for creating an order and filling it stepwise with items

have been deployed on top of the existing Web services. Finally, the lifting and lowering

definitions for converting the incoming and outgoing XSD instances to OWL instances have

been defined.

(3) Semantic Mediation – Dedicated semantic bridges for the scenario have been defined and

tested (cf. Section 6.3.3). Based on the semantic bridges and the externalized semantic

mediation engine called from the BPEL process, the heterogeneous conceptualizations

could be aligned and prepared for sound information flow.

(4) Information Flow on Conceptual Level – In order to express the information flow on the

ontology-based conceptual level within the BPEL process, as well SWRL rules are used

similarly to the approach applied for semantic bridges (cf. Section 6.5.2). For this scenario

the information flow rules have been defined manually. They move the necessary input

parameters to the respective variables reserved for each service in the BPEL process. For

instance, the overall process input message contains a Partner instance as part of a

PurchaseOrder instance. After applying a semantic bridge this Partner instance is

polymorph and as well classified as a Customer according to the concept definition in the

Moon ontology. In order to prepare the service call which registers the Customer in the

Moon CRM system, the Customer instance is separated from the variable storing the overall

PurchaseOrder instance by applying an information flow rule that transfers the Customer

instance into the designated input variable for the CRM service. In an analog manner the

other service input variables are prepared by applying accordingly defined information flow

rules.

(5) BPEL Process for Mediating Interaction Patterns – The BPEL process can be seen as the

glue which holds the before discussed components together. Furthermore, as already

outlined above, the BPEL process is utilized to harmonize the interaction patterns, i.e. the

different granularity levels in service calls of both systems. In the scenario the OrderItems

provided by the Blue system are aggregated in a single incoming call. However, the Moon

system requires fine granular calls of the AddLineItem Semantic Web service, i.e. one

service call for each single LineItem. Therefore, a for-each loop available as a BPEL

language construct is used to split of the aggregated OrderItems and invoke the

Chapter 6

174

AddLineItem Semantic Web service one by one. For this scenario the BPEL process has

been designed using the ActiveBPEL process designer. However, the consistent usage of

the ontology level provides the foundation for further semi-automatic tool support in the

design phase by means of matching functionalities etc. as presented with the mediated

service composer prototype in Section 6.4.3. However, for the composer prototype it has

been out of scope to develop comprehensive support for control flow including loops and

conditions which would be necessary to completely design the purchase-order scenario.

Validation

The overall goal has been to incorporate the semantic mediation mechanism into the industry

standard BPEL and corresponding execution engines, in order to demonstrate the practicability

of the developed approach. The four aimed semantic enhancements as outlined in the

requirements Section 6.5.1 could be integrated in a BPEL process engine making use of the

XPath custom function mechanism and the RDF/XML serialization format of OWL instances.

Semantics-based information flow could be realized by exploiting SWRL-based rules that work

consistently on the higher conceptual abstraction level. Furthermore, the semantic mediation

mechanism as well utilized in the previous prototypes has been successfully integrated. The

BPEL process engine could be enabled to invoke OWL-S based Semantic Web services by

customizing the OWL-S API and grounding WSDL-based Web services with XSL

transformations. Finally, by exploiting the same technologies as used for the grounding in a

reverse manner, as well a Semantic Web service interface to the semantically-extended BPEL

process could be provided.

Regarding time consumption the following runtime measurement has been performed during

scenario execution: On a test platform based on a VMware image running on a Intel Core Duo

1.33 GHz CPU with 2 GB RAM the execution of the whole scenario including repeated

classification, semantic bridging, information flow inference and Semantic Web service

invocations takes less than 5 seconds on the server side, where the execution engine runs, and

around 11 seconds including the time consumption for the simulation of the Blue system calling

the deployed process. This good performance could be achieved by avoiding reasoning

processes on big data sets including all involved statements of service parameters, semantic

bridges and information flow rules all at once. Rather, reasoning has been applied in a very

focused manner by means of individual reasoning tasks concentrating just on particular

variables within the BPEL process that contain only service parameters of one service or just

one semantic bridge etc. Thus, the overall time consumption can be regarded as reasonable.

Verification

Consistently to the verification measures applied to the previous prototypes, several tests guided

by the scenario have been run to ensure that the prototype mainly performs correctly and stable

in the expected behavior. Besides the above presented “Purchase Order Scenario” as well the

service composition from the eGovernment scenario presented in Section 6.4.3 has been

deployed and successfully executed in the extended ActiveBPEL engine as highlighted in

Figure 6-19. Additionally, during the development of the composition tool and execution engine

several unit tests have been run to check the correct implementation of the various methods.

Thus, it can be stated that the prototype for mediated process execution generally performs

correctly. Taken as well into account the validation results, it can be concluded that the

implemented prototype demonstrates successfully how the mediated process execution step of

the semantic mediation methodology can be instantiated as a proof-of-concept.

Realization of Semantic Mediation Toolkit

175

6.6 Usage and Extension of the Semantic Mediation Toolkit

For further extension of the prototypes the source code of the toolkit is provided in four open

source projects under the GNU General Public License (GPL). The projects are hosted on the

BerliOS platform [258] of the Fraunhofer Institute for Open Communication Systems:

 the Mediated Business Process Modeling Tool is available at [259];

 the Semantic Bridge Testing Tool is available at [260];

 the Mediated Service Composition Tool is available at [261];

 and the Mediated Process Execution Tool is available at [262].

The source code is documented using Javadoc. Furthermore, the above listed project locations

provide documentations for each prototype including detailed installation guides and usage

descriptions with stepwise walkthroughs for the above presented scenarios.

6.7 Summary and Reflection

The chapter has presented the developed semantic mediation toolkit. The principle idea has been

to provide prototypical instantiations of certain key steps of the semantic mediation

methodology discussed in the previous Chapter 5. Some steps of the methodology could be well

covered with existing work and respective tools. Therefore, it has been abstained from

implementing such functionalities redundantly. Rather, it has been focused on these

methodology steps which cannot be performed adequately with existing work. Therefore, the

methodology steps of mediated business process modeling, semantic bridge testing, mediated

service composition and process execution have been addressed with newly develop prototypes.

They complete the semantic mediation toolkit along with the already existing tools for domain

ontology development, semantic bridge definition and semantic service enrichment.

The chapter has started by discussing the realization of the mediated business process modeling

prototype. Firstly, the goals for mediated business process modeling from the semantic

mediation methodology have been briefly recalled and refined to four system requirements.

Subsequently, the design and realization has been described, whereas the tool‟s system

architecture has been presented and its building blocks have been explained. The tool has been

described as an extension of a client-server based business process modeling tool. Therein the

developed semantic mediation mechanism has been integrated by applying Semantic Web

technology frameworks including an ontology reasoner and a rule engine working on W3C

standardized description logic languages. In particular, the implementation of the semantic

mediation mechanism, i.e. the realization and processing of semantic bridges, has been

discussed in detail. Finally, in order to validate and verify the implemented prototype a scenario

has been performed, namely the “Purchase Order Mediation Scenario” defined in context of the

international Semantic Web Service Challenge. Therefore, the business process between two

heterogeneous systems - Blue and Moon - and corresponding information flow across

heterogeneous conceptualizations has been modeled with the realized prototype. Consequently,

the system requirements have been mapped to the results of the performed scenario and it has

been concluded that the envisioned goals could be demonstrated within the scope of a proof-of-

concept. The essence of the concept and the prototypical implementation of the mediated

Chapter 6

176

business process modeler applied to the “Purchase Order Scenario” have been published in

[245].

The second section has been dedicated to the realization of the semantic bridge testing

prototype. As the underlying ontology mapping of a semantic bridge is a complex and error-

prone process, users and developers need to be enabled to systematically test and determine the

quality of semantic bridges. Therefore, the core system requirements have been derived and

listed including: functionality for semantic bridge test case definition, functionality for test case

execution and reporting functionality. The functional architecture discussed in Section 5.7.2 has

been refined to a concrete system architecture following the model-view-control (MVC) pattern.

The realization of the individual components has been explained, especially the calculation of

test case coverage in order to quantify the test result. The validation and verification of the

prototype has been examined by applying the testing tool to the semantic bridges developed for

the “Purchase Order Scenario” discussed above. Thereby, as well the development of the

domain ontologies “Blue” and “Moon” have been described. As the targeted goals for semantic

bridge testing could be met, it can be stated that the purpose of a proof-of-concept

implementation is fulfilled.

The next described prototypical instantiation of the semantic mediation toolkit has targeted the

methodology step of mediated service composition. The purpose of the composition tool is to

demonstrate how the task of integrating Web services described by heterogeneous ontologies

can be facilitated by semantic matchmaking and incorporation of semantic bridges, which are

expressed in the Semantic Web Rule Language (SWRL). The tool is capable of loading and

composing sequences of Web services that are semantically described by means of the OWL-S

standard. The output is a semantically extended BPEL process orchestration plan, which

contains the sequence-based control flow and rule-based information flow between the

Semantic Web services. The section has started with a recall and refinement of system

requirements followed by a description of the design and realization of the tool in terms of a

stepwise elaborated system architecture. The evaluation of the prototype was again performed in

terms of a scenario, however this time a cross-organizational eGovernment scenario has been

applied. According to the validation and verification results, it can be concluded that the

implemented prototype provides an adequate proof-of-concept for mediated service

composition. The essence of the concept and prototypical implementation of the Semantic Web

service composer applied to the eGovernment scenario has been published in [8].

Finally, the realization of the mediated process execution engine has been discussed. The

developed prototype is based on the existing ActiveBPEL process execution engine. The

ActiveBPEL engine was chosen, since it is open source and supports defined extension points

by implementing so called “custom functions”. However, the concept and developments can be

applied to any BPEL engine supporting this standardized extension mechanism. Firstly, the

system requirements have been specified followed by a detailed discussion of the derived

system architecture. This includes the semantic extensions for the BPEL engine and particularly

the development of components for executing semantic information flows and Semantic Web

services. With the capabilities for mediated process execution on hand, the complete “Purchase

Order Mediation Scenario” could be performed. Thereby, the whole process execution remains

on the ontology-based conceptual level and is just broken down at the latest possible point in

terms of grounding OWL-S Semantic Web service calls to traditional WSDL-based Web

services. According to the successful scenario execution, the validation and verification of the

implemented prototype can be considered as a valid proof-of-concept for the mediated process

execution approach. The essence of the concept and prototypical implementation of the

mediated process execution engine applied to the “Purchase Order Scenario” has been published

in [185].

Realization of Semantic Mediation Toolkit

177

At the end of the chapter, a brief section has presented information about the extension and

usage of the four developed prototypes, including references to a Web-based development

platform where the corresponding open source projects are hosted and from where the realized

semantic mediation toolkit can be downloaded.

179

Chapter 7

Evaluation and Case Study of an Exemplary

Distributed Organization

This chapter targets the evaluation of the presented work. Having discussed the validation and

verification of the semantic mediation toolkit with regard to the system requirements of the

respective prototypes in the previous chapter, this chapter aims at evaluating the overall

approach of the developed concept for semantic mediation between loosely coupled information

models in SOA. The chapter starts by presenting the evaluation methodology, which focuses on

a qualitative approach based on a case study of an exemplary distributed organization.

Accordingly, the subsequent section discusses this case study, which outlines how the German

Chambers of Industry and Commerce have addressed the challenge of internal and external

semantic interoperability. In particular, it is analyzed how the in this work proposed solution for

semantic mediation and its corresponding artifacts (models, methodologies, prototypes) can

optimize effectiveness and efficiency in this process. Based on this analysis, the coverage of

goals and the confirmation of the research hypothesis are discussed, whereas the main idea is to

map the outcome of the case study to the originally set conceptual goals and the claims of the

research hypothesis. Finally, a summary and reflection of the evaluation chapter is provided.

7.1 Evaluation Methodology

The evaluation is performed in two steps: Firstly, the semantic interoperability activities of the

German Chambers of Industry and Commerce are compared in terms of a gap analysis with the

developed approach for semantic mediation. Thus, the comparison is used to outline the

potential of the developed concept, methodology and toolkit. Secondly, the outcome of this

analysis is mapped to the originally set conceptual goals from Section 4.2 and to the claims of

the research hypothesis in Section 1.3.1.

In order to fit for the evaluation, the case study has to target a large-scale SOA landscape which

covers multiple organizations with independent IT management. Taken this into account, the

exemplary character of the German Chambers of Industry and Commerce is given, because

firstly: it is a decentralized organization; and secondly: it has rolled out an organization-wide

SOA covering 80 institutionally-independent chambers, which are served by four different IT

service providers. Moreover, the Chambers are core actors in cross-organizational eGovernment

processes. For instance, the administrative process for the registration of a new business

involves besides the Chambers various further public administrations including the trading

supervision department, statistical offices and fiscal authorities etc. depending on the business

case. As the IT-based communication and interaction between these various involved

institutions is iteratively migrated to service-oriented approaches, the case study of the

Chambers provides as well the required attribute of a large-scale SOA landscape.

Chapter 7

180

However, due to the prototypical nature of the semantic mediation toolkit, it is out of scope of

this evaluation to apply the toolkit directly to operational processes of the Chambers and its

business partners. Rather, a qualitative analysis is performed comparing the Chambers

achievements and challenges with the currently applied practices and technologies for achieving

semantic interoperability with the in this research presented approach. The case study has been

carried out in context of a research transfer project from 2007 to 2010 of the Fraunhofer

Institute FOKUS supporting the introduction of a service-oriented architecture to the German

Chamber of Industry and Commerce.

Based on the outcomes of the case study the fulfillment of the conceptual goals and claims in

the research hypothesis are assessed. According to the applied research methodology of design

research, the main artifacts developed in this work are recalled to structure the evaluation. They

address the core steps of design research: awareness of a problem, conceptual suggestion and

development (cf. Section 1.3.1). The corresponding main artifacts are:

 Framework for Semantic Interoperability in SOA (cf. Section 2.5)

 Concept for Semantic Mediation between Loosely Coupled Information Models (cf.

Section 4.8)

 Semantic Mediation Methodology and Toolkit (cf. Section 5.3 and 6.1)

Furthermore, general implications and derived conditions are discussed, which relate to the

fulfillment of the conceptual goals and the research hypothesis.

7.2 The German Chambers of Commerce and its

eGovernment Context

In Germany the Chambers of Industry and Commerce [263] are a public institution with self-

administration under the inspectorate of the state ministry of economy. All German companies

registered in Germany, with the exception of handicraft businesses, the free professions and

farms, are required by law to join a chamber. Accordingly, the Chambers represent approx. 3.6

million companies. The main duties of the Chambers of Industry and Commerce are [263]:

 Sovereign functions entrusted to business by government to perform upon its own

responsibility

 Services to member companies

 Representation of companies in political decision making

The German Chambers of Industry and Commerce are a decentralized organization with 80

chambers across Germany. The Chambers are legally independent entities with one central

umbrella association DIHK [263]. The decentralized character of the Chambers organization has

direct implications on their IT landscape. The 80 chambers have four different general IT

service providers leading to redundant IT applications for major Chamber business processes

with partially decentralized hosted instantiations and data storage. Besides this geographical

fragmentation along chamber districts as well fragmentation along the business departments has

been part of the “as-is” state of the Chambers IT landscape. Below Figure 7-1 illustrates the

historically grown isolation of IT applications across the major business departments. Most IT

projects and applications have been isolated across business departments, whereas only basic IT

services such as intra- and internet, email, etc. have been provided as shared services. These

Evaluation and Case Study of an Exemplary Distributed Organization

181

redundancies in IT project management, development,

operation and maintenance have caused low cost-

effectiveness and significant integration problems with

external partners due to the internal heterogeneity.

As this is a typical situation for a large distributed

organization, where local entities mainly operate within

their own scope of responsibility, the heterogeneous and

isolated IT landscape can be explained historically.

However, the Chambers organization as a whole had to

meet new requirements posed by European legislation

requiring certain organization-wide IT applications. For

example, according to its main duty to provide sovereign

functions, the Chambers organization was entrusted to

provide a central Web-based registry for insurance

broker, i.e. agents who want to sell insurance contracts to

private clients. The background of this European

consumer protection legislation is that it is sometimes

difficult for customers to know if an insurance broker is

serious and to know if there have been any legal complaints regarding him in the past. Besides

this registry for insurance broker of the legal affairs and fair play department a set of further

organization-wide registries have to be provided. This includes for instance a registry for

companies to register their amount of produced packaging material from the innovation and

environment department. Another example is a Germany-wide apprenticeships registry, in order

to face demographic change and provide companies a focused channel to qualified trainees. This

registry is not motivated by any legislation but corresponds to the second duty of the Chambers

organization to provide their member companies with attractive services.

It turned out that the status-quo situation of geographical and departmental IT fragmentation

could not meet the new requirements for cross-organizational eGovernment projects and

consequently an organization-wide IT integration approach was targeted. In order to anticipate

the legally-bound decentralized structure of the Chambers organization, an SOA-based

integration infrastructure has been favored to enable eGovernment processes across Chamber

districts and departments.

7.2.1 The Chambers Service Bus and Service Hub

The introduced service-oriented architecture consists of two main infrastructures:

 The Chambers Service Bus for internal process-oriented IT integration within the

chambers domain

 The Chambers Service Hub acts as a single point of contact for secured IT-based

communication with external partners

The following Figure 7-2 provides an overview of the Chambers SOA-based integration

infrastructures:

Figure 7-1 Task-oriented Isolated IT

Applications

Chapter 7

182

Figure 7-2 Overview of Chambers Service-Oriented Architecture

The Chambers Service Bus connects 80 local chambers (IHK), which are served by four main

IT service providers (AGU, GFI, IHK Köln and TMG). The IT applications are partly hosted in

external data centers (GFI data center and IHK Köln data center) or the IT service providers

host the IT applications locally at the individual chamber. Despite this decentralization on the

application level the Chambers share a common intranet, which ensures a secure and trusted

communication layer within the Chambers domain. Of major interest are the so called master

data systems, which provide customer relationship management such as functionality to process

the information of member companies of the local chambers. These master data systems provide

the foundation for most additional IT applications of each chamber. Accordingly, in order to

enable cross-organizational integrated processes, the various decentralized master data systems

had to be enriched with Web service adapters to be connected to the Chambers Service Bus. The

Chambers Service Bus has been realized by means of an industry product-based enterprise

service bus [264]. Furthermore, the Chamber Service Bus contains a BPEL-process engine (cf.

Section 3.3.2) that enables the processing of Web service orchestrations between decentralized

IT applications that materialize chamber-wide process chains.

The Chambers Service Hub provides a single point of contact for the integration with external

eGovernment partners such as administrations, associations, justice or companies. As this

communication takes place over the public internet, the Chambers Service Hub is based on a

secured OSCI intermediary [265], which provides a German eGovernment specific transport

protocol stack for secure communication. Incoming service messages are received over the

Chambers Service Hub and are then further routed over the Chambers Service Bus to the

particular chamber and vice versa for outgoing service communications.

In order to establish the SOA of the Chambers, many technical and organizational challenges

had and still have to be addressed. For instance, the IT service providers have to adopt the still

relatively new Web service technology. Furthermore, their different business models have to be

aligned to ensure sustainable competition while keeping the required cooperation. Besides the

technical and organizational dimension, as well semantic challenges have to be addressed. As

this work focuses on semantic interoperability, the following section presents how the

Chambers organization has targeted the aim for seamless cross-organizational process

Evaluation and Case Study of an Exemplary Distributed Organization

183

integration despite the fact that the different IT service providers and external partners use

heterogeneous information models for their applications and the communication between them.

7.2.2 The Data Conference Working Group

One major challenge for achieving semantic interoperability across the Chambers organization

is the fact that the communication partners are autonomous with no legally binding management

authority. Therefore, any commitment depends on consensus and collaboration. However, a

properly functioning SOA requires well defined service interfaces not only on the technical,

protocol and syntactic level but also on the semantic level.

In order to achieve semantic interoperability between the different IT applications connected via

the Chambers Service Bus, the chambers central umbrella association DIHK has established the

so called data conference working group as part of an established eGovernment project office.

The aim of the data conference has been to develop a process-oriented chambers-wide XML-

based data exchange standard, whereby all four IT service providers have been involved

supported by consultation and moderation of the Fraunhofer Institute FOKUS. The idea has

been to focus on a Chambers internal XML standard for message exchange instead of enforcing

one consistent information model to be followed by all applications. The advantage of this

approach is that the existing internal data structures of the different IT applications of the four

IT service providers can remain unchanged. They just have to provide Web service adapters

providing interfaces according to the new established message exchange standard to

communicate across chamber borders.

Consequently, the data conference working group has been defined as an organizational entity

with a methodology and toolbox for the development and evolution of cross-chamber data

exchange standards named XIHK. The name XIHK is aligned to the name of German

eGovernment exchange standards XÖV. The relation between XIHK and the XÖV standards

will be discussed later on in Section 7.2.3. The major task of the data conference is to support

demand-driven cross-chamber IT projects with expertise in data modeling on the one hand and

ensuring the incorporation of existing XIHK data models within new IT projects on the other

hand. The following Figure 7-3 illustrates the basic idea of the data conference working group:

Figure 7-3 General Approach of Data Conference Working Group

The data conference works demand-driven, i.e. its expertise is called from a concrete

eGovernment project of the chambers organization (e.g. the provision of a new cross-chamber

analysis and process modeling, the data conference works on the required information models in

Chapter 7

184

parallel to the conceptualization phase of the initiating project. According to the identified

requirements and derived from the process models, required information or data entities are

identified. In order to foster reuse of existing XIHK data models, which are stored and managed

centrally in the so called XIHK repository, a designated XIHK overall responsible owner has

been established who has been selected among the IT service providers. In order to work in a

consistent way, the data conference applies a defined set of guidelines, data modeling tools and

XML tools. The general methodology describing how these assets come into play is presented

in the following Figure 7-4 and briefly explained in the subsequent paragraphs. Furthermore, the

essence of the methodology of the chambers data conference has been published in [188].

Figure 7-4 Data Conference Methodology

The data conference methodology follows a combined top-down and bottom-up approach. The

idea is to derive the requirements of information models for new individual IT applications and

services from their process models and align them to existing information models. In the first

step of the top-down stream, an information model entity list is derived from a process model to

be used by business experts responsible for the concrete IT project. In a second step, the

information entities are consolidated and then are formalized in UML entity class diagrams. In

order to ensure that the formalized information entities are suitable and complete for the desired

IT application, a so called business test of the information flow is performed. I.e. the

corresponding information flows in the process models are traversed by using the defined UML

information entities. In the third and final step of the top-down stream the concrete XML

schemas are generated based on the UML models.

Reuse and overall consistence of the XIHK information models is ensured by means of the

bottom-up stream of the data conference methodology. In a first bottom-up step existing

schemas of major IT applications are formulated as XML Schema entities, in order to be

available for the alignment with the new requirements. For this purpose additionally a UML

view and entity list view on the existing information entities is provided. Thus, as well the

business expert with a non-technical background is enabled to identify existing information

entities for reuse.

All three views or abstractions levels (entity list, UML and XML) of the information models for

message exchange are stored in the chamber-wide XIHK repository. Furthermore, a lifecycle-

model for the information models along with a versioning framework has been developed, in

Evaluation and Case Study of an Exemplary Distributed Organization

185

order to ensure sound evolution of the internal XIHK message exchange standard. This includes

as well service models and process models and is part of an overall SOA Governance activity of

the Chambers organization. However, due to the focus on semantic interoperability within this

case study, these aspects are not further elaborated in this work.

7.2.3 Achievements and Ongoing Challenges

Based on the data conference methodology and its application to a first set of cross-chamber IT

projects such as the above discussed eGovernment register, a chamber-wide XML-based data

exchange standard could be established. Similar to the in this work developed framework for

semantic interoperability (cf. Section 2.5) the XIHK information model reflects different

abstraction levels for its representation. Even though the Chambers approach does not utilize

ontologies for the conceptual level, the combination of entity lists and UML models allow non-

technical business experts to understand the information models and use them in the process

analysis and conceptualization of new IT applications.

Furthermore, a general goal targeted with the SOA introduction to follow a more process-

oriented approach in the design and realization of IT applications has been anticipated in the

data conference. The top-down and bottom-up approach starts with process models to identify

required information entities and thus provides a starting point for improved alignment between

business and IT perspectives.

However, it turned out that in practice only incomplete representations on the three abstraction

levels of the information models have been maintained, due to shortcomings of the applied

technologies which require significant manual development efforts. In consequence, mostly

only the XML Schema representation exists and is continuously maintained. This leads to

inconsistencies between the different abstraction levels and thus limits the aimed advantages for

business IT alignment.

The overall concept of loose coupling on the semantic level (cf. Section 4.6) has been reflected

insofar, that the developed Chamber-wide message exchange standard XIHK remains

independent from the internal information models within the decentralized IT applications. Each

IT service provider provides XIHK-conform service adapters and performs the semantic

mapping to its internal representation internally. Accordingly, it can also be stated that with

regard to the abstraction levels the semantic integration remains on the logical level or even

physical level as the mappings are performed between the XML level and the internal

application specific representation.

On the other hand, compared to the main strategies for achieving semantic interoperability

discussed in Section 4.4.1, the Chambers approach follows rather the standardization strategy

than the mediation strategy between loosely coupled coexisting domain information models.

Within the chambers organization only one shared XIHK standard for message exchange exists.

The feasibility and effectiveness of this approach can be linked to the existence of the central

umbrella association and the data conference working group of its eGovernment project office.

These organizational conditions could enable the required degree of consensus among the

independent stakeholders and thus establish the XIHK standard within the chambers.

Nevertheless, the mediation strategy between loosely coupled coexisting domain information

models is well reflected when it comes to semantic integration with external partners. The

chambers XIHK standard has been developed and defined completely independent of other

information model standards in the eGovernment domain. In Germany, the XÖV [266]

standards define XML Schemas for electronic message exchange in the public sector as already

presented in the eGovernment scenario for the evaluation of the mediated service composition

Chapter 7

186

prototype (cf. Section 6.4.3). As mentioned there, XÖV consist of several standards for

particular domains in eGovernment such as XGewerbe for business registration, XMeld for

resident registration or XPersonenstand for vital records, e.g. birth certificates, death records,

marriage licenses, etc. Furthermore, in order to achieve semantic interoperability in cross-

domain processes, i.e. between the different XÖV standards, the XÖV AG Datenkonferenz

[267] has suggested the so called XÖV core components [268]. The XÖV core components

specifiy information entities such as Name and Address, which should be consistently reused in

the various XÖV standards. However, it turned out that due to the complexity in the consensus

finding process and due to heterogeneous requirements in the different domains, the feasibility

for reuse of core components is limited (cf. Section 4.4). For instance the core component

Name cannot be consistently defined and reused for XMeld and XPersonenstand because of

contradicting legislative regulations [269]. In the legislation for resident registration, the

surname has to be split obligatory into his subparts for electronic transmission. However, in the

respective legislation for vital records, the whole surname has to be represented in a single

string and cannot be represented in its subparts because for that purpose the existing IT

applications of this domain would require costly reengineering [270].

Based on these experiences, the chambers data conference has decided to stay independent from

any XÖV standards and not derive the Chambers XIHK standard from e.g. XGewerbe or the

XÖV core components. Furthermore, this strategy has allowed the chambers to establish an

efficient standardization process (cf. Section 7.2.2) without interfering into the complex

organizational structures of the XÖV committees. Consequently, the data conference has

anticipated the limited feasibility of cross-domain standardization and has adopted the

mediation strategy based on loosely coupled domain information models, whereas XIHK

represents such a domain information model.

According to the develop concept of semantic mediation, central mediation services have been

planned to provide semantic integration with external business process partners in the larger

eGovernment context. This mediation service should extend the Chambers service hub as the

single point of contact for external electronic communication (cf. Section 7.2.1). It should

provide a message translation from the XIHK standard to the relevant XÖV standards such as

XGewerbe for business registration in which process the Chambers organization plays an active

role. The following Figure 7-5 illustrates the idea of the mediation services:

Figure 7-5 Planned Mediation Services of Chambers Service Hub

The mediation services are based on mappings between the different XML standards, e.g.

realized by means of XSL transformations (cf. Section 3.2.3). They are stored centrally in the

XIHK repository together with the XIHK information models.

Evaluation and Case Study of an Exemplary Distributed Organization

187

However, the mediation services are not yet realized due to high costs for their realization and

investment protection of existing adapters within decentralized IT applications. Thus, it can be

stated that the semantic integration with external partners is successfully performed according to

the approach of loose coupling between information models but not on domain level. The

mapping between e.g. XGewerbe and XIHK is not targeted once by means of organization-wide

mediation services but redundantly within existing decentralized IT applications. I.e. receiving

XGewerbe messages are forwarded through the Chambers service hub and service bus to the

particular Chamber and mapped to the internal information model and then are translated to

XIHK messages when further communication with other Chambers is required. This can be

understood for historical reasons as existing application-internal adapters have been in place and

its consolidation requires a migration roadmap, which is a second step after the general SOA

introduction.

Having discussed above the achievements of the Chambers but also their ongoing challenges

with regard to effective and efficient semantic interoperability, the following section points out

how a more complete adoption of the semantic mediation approach developed in this work

could contribute to overcome the identified shortcomings.

7.2.4 Potential of the Semantic Mediation Approach

The potential of the semantic mediation approach is exemplary demonstrated by traversing

through the semantic mediation methodology steps (cf. Chapter 5) and the respective semantic

mediation toolkit (cf. Chapter 6) to discuss their impact and potential advantages for the

Chambers organization and the semantic integration with its external partners.

The first step of the semantic mediation methodology is the step of domain ontology

development (cf. Section 5.4). The Chambers organization has already developed their own

chambers-wide information model XIHK, which is distinct from information models of other

eGovernment domains (XÖV). However, as argued in the previous Section 7.2.3, the XML-

based XIHK information model remains on the logical abstraction level instead of the

conceptual level as targeted by domain ontologies. The Chambers data conference effort to

establish in parallel a UML representation and textual information entity lists to be used on the

conceptual level could not be achieved completely and consistently (cf. Section 7.2.3). Taken

this into account, the development of domain ontologies for the Chambers domain and the

discussed eGovernment domains provides a consistent conceptual level of the domain

information models. The explicit conceptualization improves the consensus finding and

standardization process within the domains and eases the expression of semantic bridges

between the different domains (cf. Section 4.8.5). Moreover, due to the explicit and formal

nature of description logic based ontologies – such as OWL ontologies as utilized in the

semantic mediation toolkit – the domain ontologies can be consistently linked to underlying

XML-based information representations such as the existing XIHK and XÖV standards. All in

all, the development of domain ontologies establishes the additional semantic layer that

provides the foundation for the further steps of the semantic mediation methodology.

The second step of the methodology is concerned with mediated business process modeling (cf.

Section 5.5). The data conference of the Chambers organization already adopted a process-

oriented approach to requirements engineering and design of new IT applications and

corresponding information models (cf. Section 7.2.2). However, no consistent incorporation of

existing conceptual information models during process modeling could be ensured. Therefore,

applying the mediated business process modeling approach provides advantages regarding

consistent reuse and evolution of existing domain information models. The chambers domain

ontology could be directly used by non-technical business experts during process modeling for

Chapter 7

188

the design of information flows and thus eases the alignment between business and IT

perspectives. Furthermore, the ontology-based approach allows to address semantic integration

with external partners already on the conceptual level. By means of semantic bridges

heterogeneous information representations in e.g. XGewerbe and XIHK could be kept

transparent, which reduces complexity and efforts for process modeling in cross-organizational

scenarios.

The methodology steps of semantic bridge definition (cf. Section 5.6) and testing (cf. Section

5.7) provide the means to realize the planned mediation services of the Chambers service hub.

The declarative rule-based approach to express the mappings could reduce the development

efforts in contrast to the planned but however not adopted technical complex XSL

transformations. Furthermore, maintenance efforts which occur with changing versions of

XIHK and XÖV could be reduced. Current redundantly maintained mappings in the

decentralized IT applications could be replaced by semantic bridges between corresponding

domain ontologies, which bears the already discussed advantage of addressing semantic

integration declaratively on the domain level rather than redundantly and technically more

complex on the application level.

In order to reap the benefits of the additional semantic layer and actually apply the semantic

mediation mechanism, the so called grounding to the existing underlying XML-based Web

services need to be ensured (cf. Section 5.8). This step of the semantic mediation approach does

not directly provide any advantages for the Chambers case. In contrast it requires further efforts

to define the liftings and lowerings between the ontology concepts and the corresponding XIHK

elements for each service parameters. However, the step is required in order to enable the

overall semantic mediation approach.

With regard to the mediated service composition step (cf. Section 5.9), the good starting point

of the Chambers SOA initiative towards process-orientation for improved business-IT

alignment could be further fostered. The during business process modeling utilized and further

derived XIHK ontology concepts could be directly reused for service composition, in order to

design the BPEL processes which instantiate the originally modeled business processes. Here

the full benefits of the semantic mediation approach become an important factor. Firstly, the

mediated service composition tool provides semantics-based matching functionality between

corresponding service parameters and thus eases the design process in general. And secondly, it

transparently integrates the semantic mediation mechanism based on the Chamber-wide

deployed semantic bridges. Consequently, the whole service composition can be performed on

the conceptual level, whereas heterogeneities between XIHK and XÖV information models are

kept transparent to the service composer. Thus, no more technical transformation code has to be

implemented within cross-organizational BPEL processes and high level Chamber business

processes can be mapped to concrete IT infrastructures with less manual efforts.

Finally, the designed service compositions need to be interpreted during runtime in order to

execute the actual business processes. Thereby, the additional semantic layer can be processed

by the mediated business process execution engine (cf. Section 6.5) that enables the information

processing on the conceptual level along with the semantic mediation mechanism during

runtime.

The analysis of the Chambers case study has shown that the developed semantic mediation

approach provides the potential for improving semantic integration within the distributed

Chambers organization and particularly with external business partners. As the highest potential

could be identified in cross-domain integration between XIHK and the various XÖV domain

information standards, the next section provides an additional perspective with a larger scope.

Evaluation and Case Study of an Exemplary Distributed Organization

189

7.2.5 Network Effect

As the overall goal of this work is to provide an effective and efficient approach to reduce

complexity in semantic integration, it has to be analyzed if the subsumption of the proposed

methodology steps, which are complex themselves, altogether indeed conduce the overall goal.

Considering the various proposed activities, it has to be stated that firstly they produce

additional efforts. These include the shifting of existing XML Schema-based domain standards

to the ontology level as well as the additional description of WSDL-based Web services by

wrapping Semantic Web service descriptions. Furthermore, semantic bridges have to be defined

between all domains that are involved in cross-domain processes. And finally, the appropriate

tools for utilizing all these additional artifacts have to be further developed to a mature product

state, in order to exploit them for semantic integration support.

Considering again the Chambers case and the particular integration scenario with XGewerbe, a

conventional approach with XML-based Web service composition would maybe provide a

solution inducing less effort. By manually integrating the involved Web services, the

development of e.g. XPath expressions for the data flow definition and e.g. XSL

transformations for alignment of different representation formats would be probably less

expensive than to develop the various semantic artifacts, including domain ontologies, semantic

bridges and Semantic Web service descriptions.

However, the whole potential of the presented approach becomes only visible on a large scale.

Once the additional semantic artifacts are available, a network effect exceeds the initial costs.

The Chambers and XÖV-based services of public administrations are not only involved within

one particular scenario but are integrated in various eGovernment processes. Once the Semantic

Web service descriptions have been established, they are reused and thus the average costs per

each particular scenario is marginal. Similar, this holds for the domain standards, where

additionally domain ontologies have to be established as well as the semantic bridges that

enable the required mediation between them. These artifacts are reused by all services from one

domain and thus their creation efforts become reasonable. The core benefit of this approach lies

in the fact that semantic interoperability is addressed on the level of domain standards instead of

addressing it on application level. One-time ontology mediation substitutes n-time adaptation of

parameter formats during process integration. Thereby, the high abstraction level of the

information model provided by ontologies eases the maintainability of the semantic bridges.

Consequently, the full potential of the developed approach evolves when applying it to a larger

scope, for instance not only to the distributed Chambers organization but to the overall XÖV

landscape. Accordingly, domain ontologies would be required for each XÖV standard,

including XGewerbe, XMeld, etc. (cf. Figure 7-5). However, the additional development of

domain ontologies replacing the whole set of XÖV standards seems unrealistic especially with

regard to technological path dependency. Nevertheless, when looking at the design process of

XÖV standards as outlined in the XÖV-Framework [271], it includes a top-down approach

similar to the Chambers data conference starting with UML models for conceptualizing the

domain information models. Based on these UML models, which are restricted to well-defined

UML design templates, the actual XÖV XML Schemas are generated. Therefore, one starting

point for the adoption of the presented approach in this larger scope could be the exploitation of

the UML models. Notably, they correspond to the conceptual abstraction level for information

models and allow to generate as well domain ontologies besides the XML Schemas and to

provide as well lifting and lowering transformation code in this process. Thus, the conceptual

language features such as expressive relations between information entities, generalization and

polymorphism could be preserved in the ontology representation. And consequently they could

Chapter 7

190

be exploited for semantic mediation on the conceptual level between the heterogeneous XÖV

domains.

Based on the analysis above, it can be stated that the presented approach for semantic mediation

has the potential to contribute substantially to the reduction of complexity in semantic

integration. Furthermore, in order to finalize the evaluation the following section systematically

refers back to the originally set individual goals for the developed approach and discusses the

research hypothesis and the extent of its coverage.

7.3 Coverage of Goals and Confirmation of Research

Hypothesis

In order to discuss whether and to which extends the conceptual goals could be attained, this

second part of the evaluation systematically maps the outcomes of the analysis above to the

individual goals set in Section 4.2 and then interprets the results with regard to the formulated

research hypothesis and its confirmation.

7.3.1 Coverage of Conceptual Goals

The first conceptual goal has targeted the reduction of complexity in semantic integration by

separation of technical issues from business issues. As outlined in the Chambers case and its

larger scope in the XÖV landscape, the lifting of the semantic integration task to the conceptual

level by means of semantic mediation between domain ontologies addresses this goal.

Separation of concerns is improved as information flows can be designed on an abstraction level

suitable for business experts as demonstrated with the mediated business process modeling

prototype (cf. Section 6.2). Moreover, the transparent incorporation of declarative semantic

bridges enables process experts to concentrate on the cross-organizational business logic

without restrictions caused by heterogeneous information representations. As domain experts

provide the semantic bridges between different domains from an overall domain perspective, the

individual process expert is not required to have in parallel detailed insight into information

models from other domains but can focus on his familiar domain perspective. Independent from

business and conceptual tasks, the technical tasks such as the Semantic Web service enrichment

in terms of liftings and lowerings can be performed by service developers, whereas the required

transformations only concern the local information model. Consequently, the aim for separation

of concerns could be achieved.

The second goal has been the reflection of limited feasibility of semantic standards in cross-

organizational environments caused by differences in business requirements and organizational

boundaries. The analyzed case study has demonstrated this requirement and it could be shown

that the developed concept of loosely coupled information models (cf. Section 4.6.3) directly

addresses this goal. In particular, autonomy and independent evolution of heterogeneous

information models is anticipated and builds the foundation of the developed mediation

mechanism.

A further goal has been to overcome the status quo of complicated and highly technical

transformation coding for bridging heterogeneous information representations in semantic

integration scenarios. As demonstrated with the semantic mediation toolkit, the semantic

bridges are not only used during process modeling but as well during its instantiations in terms

of Semantic Web service composition. Heterogeneous service interfaces are transparently

Evaluation and Case Study of an Exemplary Distributed Organization

191

mediated as service parameter descriptions are transformed to polymorph representations which

correspond to the conceptualizations of different domains. Thus, the semantic integration is

shifted on the domain level, instead of implementing it recurrently within each service

composition realizing a concrete business process.

Another goal has targeted the expressiveness of the semantic mediation mechanism. As

differences in information model representations can be complex, semantic bridges should be

able to cover that complexity in terms of completeness but should as well remain easy to

maintain by domain experts. With regard to this goal it can be stated that the implementation of

the semantic bridges, which is based on description logic rules, provides a computational

complete solution. Any semantic mappings including one-to-one, one-to-many or many-to-

many between corresponding concepts can be covered as well as any transformation of

underlying semantic sub-graphs. Furthermore, granularity differences (such as showcased in

Section 4.8.4 with regard to e.g. Street and StreetNumber as separate fields or in contrast

StreetAddress containing the two values in a combined field) can be mapped. This is enabled by

integrating so called built-in functions which provide procedures such as string concatenation or

unit transformation, etc. within the declarative rules. However, this completeness only covers

mappings and transformations which are based on information contained explicitly or implicitly

within the concepts to be mapped or static information such as unit transformations which can

be embedded in built-ins. Other mappings which require external information such as the

translation of zip codes to location names, which can also cause semantic integration problems,

are not yet covered by the developed semantic mediation mechanism. However, this question is

further discussed in the open issues part of the conclusion in Chapter 8, where starting points for

possible extensions of the semantic mediation mechanism are discussed.

The final two goals have focused on industry suitability and technological path dependency. On

the one hand, the developed conceptual approach should remain consistent to best practice SOA

methodologies. As described in Section 5.3 the semantic mediation methodology has been

strictly aligned to the SOA life cycle, starting with business analysis leading to process models,

followed by service composition and process execution. In particular, no goal-based planning

approaches have been integrated into the semantic mediation mechanism, as its suitability for

achieving semantic interoperability in heterogeneous environments has been identified as less

suitable (cf. Section 3.4.3).

On the other hand, a further goal has been the restriction to build upon existing concepts and

standards of the World Wide Web and thus respect technological path dependency, especially

with regard to Web service technologies which are the dominant instantiation of SOA. As

described in Chapter 6, the semantic mediation toolkit provides its mediation functionality by

means of an additional semantic layer (domain ontologies and semantic bridges) on top of

existing XML-based Web service technology. Lifting and lowering mechanisms of Semantic

Web service technology are used to connect the different abstraction layers between the

semantics-based mediation layer and the existing traditional Web service technology.

Consequently, it can be stated that technological path dependency has been respected.

According to the analysis above, it can be stated that the originally set conceptual goals could be

almost completely covered. Minor aspects have been identified as open issues, which will be

further discussed in the conclusion of the thesis, whereas starting points for further extensions

and future work are pointed out. Finally, the confirmation of the research hypothesis is

discussed in the next section.

Chapter 7

192

7.3.2 Confirmation of Research Hypothesis

In order to confirm the hypothesis, this work has applied the methodology of design research in

information systems as outlined in Section 1.3. Consequently, this section reviews the major

artifacts produced in the methodology steps (awareness of the problem, suggestion,

development and evaluation) to provide the argumentation why the hypothesis can be

confirmed. To better structure this process the research hypothesis can be divided into three

parts:

(1) The goal: … to effectively and efficiently achieve semantic interoperability in large-

scale cross-organizational service-oriented architectures…

(2) The concept to achieve the goal: … the principle of loose coupling can be applied to

information models based on a flexible semantic mediation mechanism …

(3) And the technology to instantiate the concept: … using Semantic Web technology for

autonomous management and integration of domain-specific information models in

terms of self-contained ontologies.

In order to ensure that the goal (1) has been specified and scoped correctly, a systematic

analysis of the problem area of semantic interoperability in SOA has been performed in Chapter

2 with special focus on large-scale cross-organizational environments. The main outcome has

been a framework with its integral part describing the semantic interoperability gap between

heterogeneous information models on different abstraction levels. This framework could be

applied to analyze the state-of-the-art (cf. Chapter 3) and evaluate existing approaches with

regard to their effectiveness and efficiency and to outline limitations and directions for the later

presented concept.

The concept (2) of semantic mediation between loosely coupled information models (cf.

Chapter 4) has been derived from the analysis of limited effectiveness with regard to feasibility

and practicability of semantic standardization in large-scale service-oriented environments (cf.

Section 4.4 and 4.5). This has led to the identification and transfer of key characteristics of loose

coupling (cf. Section 4.6) to information models with its integral part of semantic mediation on

the conceptual level by means of rule-based semantic bridges (cf. Section 4.8). To point out the

advantages regarding effectiveness and efficiency, the developed concept has been compared to

alternative approaches. With regard to an identified inherent trade-off between effectiveness and

efficiency (cf. Section 4.7), it could be shown that the developed concept provides an optimized

balance within this trade-off and thus meets the required adverbs (effectively and efficiently) in

the goal part (1) of the research hypothesis.

Furthermore, it has been shown how Semantic Web technology and its specific features such as

polymorphism, facet analysis classification and declarative rule-based entity manipulation (cf.

Section 4.8) can be exploited to enable the semantic mediation mechanism. In order to

instantiate the concept (3), the semantic mediation mechanism has been implemented based on

independent OWL domain ontologies and SWRL rule-based semantic bridges. The mechanism

has been integrated into several prototypes (cf. Chapter 6) covering the SOA lifecycle according

to a developed semantic mediation methodology (cf. Chapter 5), ranging from business process

modeling, over service composition to run-time process execution.

Combining the results from the case study-oriented evaluation (cf. Section 7.2) including the

qualitative analysis regarding the coverage of the conceptual goals (cf. Section 7.3.1) and the

three steps of the analytical review regarding the research hypothesis, it can be concluded that

the research hypothesis can be confirmed. Identified remaining open issues and future work

including possible further extensions are discussed in the conclusion of this work in Chapter 8.

Evaluation and Case Study of an Exemplary Distributed Organization

193

7.4 Summary and Reflection

This chapter has presented the evaluation of the developed approach for semantic mediation

between loosely coupled information models in SOA. As a quantitative evaluation would be out

of scope for this thesis, the evaluation has followed a qualitative approach. The evaluation has

been structured in two parts: Firstly, a case study about the semantic interoperability activities

of the distributed organization of the German Chambers of Industry and Commerce has been

carried out and compared in terms of a gap analysis with the developed approach for semantic

mediation. Secondly, the outcome of this analysis has been mapped to the originally set

conceptual goals from Section 4.2 and to the claims of the research hypothesis of this work in

Section 1.3.1.

Firstly, the organizational background and IT landscape of the German Chambers of Industry

and Commerce with its 80 decentralized independent entities served by four different IT service

providers has been presented. In particular, the Chambers service bus as the major internal

integration infrastructure has been discussed together with the Chambers service hub providing

a single point of contact for electronic interaction with external partners.

The main focus of the case study then has been put on the Chambers data conference working

group, which has developed a methodology for and an instantiation of a chamber-wide XML-

based data exchange standard (XIHK). This corresponds to the in this work developed concept

of loose coupling on the semantic level insofar, that the developed Chamber-wide message

exchange standard XIHK remains independent from the internal information representations

within the decentralized IT applications. Moreover, the Chambers XIHK information model has

been designed independently from external information models in the eGovernment domain,

namely the XÖV exchange standards. This has ensured an optimal reflection of Chamber

requirements including its evolution without any organizational dependencies to external

entities. Shortcomings could be identified regarding the consistent and complete representation

of the XIHK information model on different abstraction levels, which has constrained the

envisioned gains for alignment between business and IT perspectives. Furthermore, the

semantic integration between the XIHK information model and the external XÖV information

models has not been realized on the domain level as planned but still redundantly on the

application level within decentralized IT applications.

In order to outline the potential of the developed semantic mediation methodology and the

toolkit, it has been shown how they can be mapped to the Chambers context. It has been pointed

out how the additional semantic layer on top of the existing XML-based Web service

technology can reduce complexity in process integration and enable semantic mediation on

domain level. Furthermore, a network effect has been identified which determines the full

potential of the developed approach. Therefore, the scope has been enlarged beyond the

applications within the Chambers organization to the overall XÖV landscape. In this larger

scope, it could be shown how the ontology and semantic bridge-based approach could be

integrated into the existing XÖV framework to improve semantic interoperability between the

various XÖV standards and corresponding domains.

The second part of the evaluation has focused on the coverage of the originally set conceptual

goals and the confirmation of the research hypothesis of this work. The conceptual goals have

been recapitulated and mapped to the results of the case study and they have been discussed

with regard to the developed concept for semantic mediation, the semantic mediation

methodology and the implemented prototypical toolkit. It has been concluded that the

conceptual goals could be accomplished successfully with minor open issues concerning e.g. the

Chapter 7

194

integration of external information within semantic bridges to support sophisticated semantic

integration challenges.

Finally, the confirmation of the research hypothesis of this work has been discussed. Based on a

three-step analysis, the hypothesis has been reviewed, whereas the previous evaluation results

and the major artifacts produced in this work have been considered. Special emphasizes has

been given to the aimed characteristics effectiveness and efficiency of the proposed approach,

whereas the main argumentations of the previous chapters have been recalled. Structured

according to the applied research methodology of design research the major artifacts include:

 the framework of semantic interoperability in SOA and a systematic state-of-the-art

analysis as the analytical step of understanding the problem domain and providing

awareness of the problem;

 the concept of semantic mediation between loosely coupled information models as the

conceptual suggestion step;

 the instantiation of the concept by means of the semantic mediation methodology and

the prototypical semantic mediation toolkit as the development step;

 and finally the evaluation in terms of the case study of the Chambers organization as an

exemplarily distributed organization including the enlarged scope covering as well its

external process partners in the eGovernment domain.

Consequently, an argumentation based on a qualitative analysis for the confirmation of the

research hypothesis could be provided. Moreover, professional reviews of the published papers

([8], [185], [188], [245], [272], [273], [282]) have confirmed the conceptual and technical

quality, originality and impact of the developed artifacts.

195

Chapter 8

Conclusion and Outlook

This chapter concludes the thesis. The aim of this work has been to develop an effective and

efficient approach for semantic interoperability in large-scale service-oriented architectures

based on semantic mediation between loosely coupled information models. The motivation has

been to reflect that the dominant semantic integration approach of developing a single

information model spanning multiple organizational domains has failed. The guiding idea has

been to transfer the principle of loose coupling to the semantic level and consider semantic

mediation between heterogeneous conceptualizations not as a necessary evil but as a silver

bullet to tackle the challenge of semantic interoperability with a more flexible information

architecture pattern. Furthermore, the goal has been to show how emerging semantic

technologies can contribute to the instantiation of this concept based on their capabilities to

explicitly express semantics. The following summarizes the central findings of this work and

points out its scientific contributions. Finally, the evolution of the work is outlined, whereas

open issues, potential advancements and future work and priorities in this area are discussed.

8.1 Summary and Main Contributions

In order to provide a concise overview of the thesis, the following recapitulates the line of

argument and depicts the central aspects of the developed artifacts in a condensed manner.

The thesis has started with an examination of the general research context of semantic

interoperability in the focused domain of service-oriented architectures. The outcome has been a

framework (cf. Chapter 2) which has differentiated four major abstraction levels for the

representation of information models: the initial conceptual idea in the mind, the conceptual

model, the logical model and the underlying physical model in each IT system. Thereby, the so

called semantic interoperability gap increases which each lower abstraction level.

The framework has been used as a common ground for comparison in a systematic state-of-the-

art analysis (cf. Chapter 3) of existing approaches and technologies for achieving semantic

interoperability in SOA. Firstly, traditional Web services along with the existing XML-based

technology stack have been discussed followed by an evaluation describing their capabilities

and limitations. After outlining the need for formally defined semantics of Web service

descriptions, an intermediate step introducing the core concepts and technologies of the

Semantic Web initiative has been provided. Then, it has been described how these concepts can

be applied to Web services in terms of so called Semantic Web services. Furthermore, relevant

related areas have been presented such as semantic information integration in distributed

database systems and distributed object-oriented systems. Additionally, these traditional

approaches have been related to a detailed analysis of ontology-based strategies for semantic

integration including approaches where multiple ontologies and ontology mapping approaches

are involved.

Chapter 8

196

Based on the problem identification and the analysis of the state-of-the-art, Chapter 4 has

presented the developed concept of semantic mediation between loosely coupled information

models in SOA. Firstly, the general idea of the concept has been outlined in order to provide a

condensed overview of the central aspects. These include mainly the shift from monolithic to

loosely coupled information models combined with the approach to address semantic

integration on a higher abstraction level for information representation. Thereby, the notion of a

shift on a higher abstraction level has been elaborated along two dimensions. On the one hand,

semantic integration is shifted from the schema or structure-based logical abstraction level to

the conceptual abstraction level. And on the other hand, semantic mediation is addressed on

domain level instead of recurrently on application or process level.

In the following the general idea has been refined: At first, the limitations of semantic

standardization in large-scale service-oriented architectures with multiple organizational

independent stakeholders have been pointed out. Then the underlying reason for context-

dependency of information models has been deeper analyzed by referring to a model theoretic

approach, namely the model of conception. Based on these findings, the transfer of the principle

of loose coupling to the semantic level has been discussed and a specification of loosely coupled

information models has been provided. According to the research hypothesis claiming for an

effective and efficient approach for the semantic interoperability problem in SOA, the

developed conceptual solution has been reflected including a proposed balance between these as

competing identified sub-goals. Furthermore, the semantic mediation mechanism as the

enabling part of the concept of loosely coupled information models has been specified based on

Semantic Web concepts in terms of ontologies and description logic rules.

The main conceptual conclusions can be summarized as follows:

 Semantic interoperability can be addressed on different abstraction levels for

information representation, whereas the semantic interoperability gap increases with

each lower abstraction level.

 Traditional XML-based Web service approaches for semantic interoperability require

high technical complexity, as they address the semantic interoperability gap on the

lower logical or physical abstraction level and are applied recursively on the

application or process level instead of on the domain level.

 Semantic Web service approaches provide the advantage to address the semantic

interoperability gap on the higher conceptual level; however the dominant approaches

are based on the idea of a common ontology that aims to cover exhaustively different

organizational domains, which has limited feasibility in practice.

 Goal-based Semantic Web services approaches do not ease the semantic interoperability

problem in heterogeneous environments, as different conceptualizations of service pre-

and postconditions cause further semantic heterogeneity to overcome. Furthermore, the

targeted automated planning of service compositions does not match to best practice

SOA approaches, where control should remain with the human process expert.

 The underlying reason for the failure of common ontologies as an effective means for

achieving semantic interoperability lies in the context dependency of information

models. Different organizational domains have different requirements on information

models to serve best for intra-domain information exchange, which results in limitations

for semantic standardization across domains.

 To provide a flexible mediation between independent domain information models the

principle of loose coupling can be transferred to the semantic level in terms of loosely

Conclusion and Outlook

197

coupled information models expressed on the conceptual level. Thereby, a balance

between effectiveness and efficiency can be ensured when semantic mediation is

applied on domain level.

 Semantic Web concepts and technologies including features such as polymorphism,

facet analysis classification and declarative entity manipulation can be exploited to

enable the semantic mediation mechanism based on domain ontologies and description

logic rules.

In order to instantiate the developed concept and provide practical evidence for the formulated

research hypothesis, Chapter 5 has presented the semantic mediation methodology, which

shows how the concept can be applied to the SOA life-cycle. Firstly, the basic steps of the SOA

life-cycle have been recapitulated and an actors model including roles and responsibilities of

stakeholders relevant for semantic mediation has been derived. The following actors could be

identified: domain information model expert, business process expert, service developer, service

composer and service or process consumer. Then the individual methodology steps have been

discussed with regard to their goals, the tasks within each step and the required functionalities

for performing them. The following seven methodology steps have been derived:

 Domain Ontology Development

 Mediated Business Process

Modeling

 Semantic Bridge Definition

 Semantic Bridge Testing

 Semantic Service Enrichment

 Mediated Service Composition

 Mediated Business Process Execution

In order to prepare an experimental confirmation of key steps of the methodology in terms of a

prototypical toolkit for semantic mediation in SOA, as well a high-level view on the functional

architecture for each methodology step has been derived. Some steps of the methodology could

be well covered with existing work and respective tools. Therefore, it has been abstained from

implementing such functionalities redundantly. Rather, it has been focused on these

methodology steps which cannot be performed adequately with existing tools. Accordingly, for

the steps: domain ontology development, semantic bridge definition and semantic service

enrichment existing work has been presented, whereas for the methodology steps: mediated

business process modeling, semantic bridge testing, mediated service composition and mediated

business process execution a dedicated functional architecture has been elaborated.

Based on the semantic mediation methodology and the functional architectures, Chapter 6 then

has presented the developed toolkit, which demonstrates how the semantic mediation

mechanism can be incorporated into key steps of the SOA life-cycle. The prototype for

mediated business process modeling has demonstrated how business process experts can be

enabled to design cross-organizational information flow explicitly on the conceptual level,

whereas different domain conceptualizations are kept transparent and are mediated in the

background. Additionally, it allows to derive and to identify further requirements for the used

information models directly during process modeling. The prototype for semantic bridge testing

has addressed the challenge that the underlying ontology mapping is a complex and error-prone

tasks, so that users and developers need to be enabled to systematically test and determine the

quality of semantic bridges. The purpose of the prototype for mediated service composition,

which instantiates the previously modeled business processes, has been to demonstrate how the

task of composing Web services described by heterogeneous ontologies can be facilitated in

terms of semantic matchmaking and incorporation of rule-based semantic bridges. Finally, the

prototype for mediated process execution has demonstrated how existing industry-proven

Chapter 8

198

process engines can be extended to process the semantic mediation mechanism during runtime.

Thereby, the whole process execution remains on the ontology-based conceptual level and is

just broken down at the latest possible point in terms of grounding Semantic Web service calls

to underlying traditional XML-based Web services. The four prototypes have been described

based on system requirements derived from the semantic mediation methodology followed by

an outline of the design and realization in terms of system architectures. Finally, the developed

prototypes have been validated and verified by applying them to integration scenarios including

two cross-organizational eBusiness and eGovernment processes, which have been highlighted

from different perspectives.

Finally, the evaluation of the developed approach for semantic mediation has been presented in

Chapter 7. The evaluation has been structured in two parts: Firstly, a case study about an

exemplarily distributed organization and its semantic interoperability activities has been carried

out. This included namely an analysis of the service bus and the data conference of the German

Chambers of Industry and Commerce, which then have been compared in terms of a gap

analysis with the developed approach for semantic mediation. Secondly, the outcome of this

analysis has been mapped to the originally set conceptual goals in Chapter 4 and to the claims of

the research hypothesis.

The case study has outlined how several aspects of the approach of loosely coupled information

models have been applied in practice and which benefits could be generated. Furthermore,

shortcomings and missing conceptual and technological aspects have been identified and it has

been shown how a complete application of the developed semantic mediation methodology and

toolkit can provide further improvements. Moreover, it turned out that due to a network effect,

the full potential of the approach becomes visible when mapping it to a larger scope.

Consequently, the application of the approach to a larger context – exemplarily the German

eGovernment landscape – has been discussed and the potential to ensure semantic

interoperability across multiple organizational domains could be pointed out.

Finally, the research hypothesis has been reviewed based on a three step analysis, whereas the

previous evaluation and the major artifacts produced in this work have been considered. Based

on the evaluation and the conceptual argumentation in Chapter 4, the claimed research

hypothesis could be confirmed.

Consequently, the main contributions of this work can be listed as follows:

 the framework of semantic interoperability in SOA mapped to an overview and

evaluation of existing approaches for bridging the semantic interoperability gap;

 the concept of semantic mediation between loosely coupled information models in

SOA;

 the semantic mediation mechanism on domain level based on ontologies and description

logic rules;

 the semantic mediation methodology and the semantic mediation toolkit for the SOA

life-cycle.

Finally, it should be noted that the central aspects of the contributions have been presented and

published at international research conferences and workshops including ([8], [185], [188],

[245], [272], [273], [282]).

Conclusion and Outlook

199

8.2 Evolution and Outlook

Having presented a summary of the thesis and a conclusion of the main contributions, this

section addresses the further evolution of it and provides an outlook on potential advancements

and future work.

Starting from the presented approach, additional questions arise. For example, it has to be

ensured that all stakeholders in cross-organizational business processes covering multiple

domains have access to the required assets, including process models, information models and

particularly the corresponding semantic bridges. The adoption of the concept of loose coupling

on the semantic level prevents from the limitations of cross-domain commitment to an overall

ontology and thus minimizes less practical central structures. However, it still requires a certain

central organizational framework or kind of central clearinghouse to support the stakeholders in

the process of providing and sharing the semantic assets. Such a clearinghouse should include a

repository to publish the various business process models, domain information models and

semantic bridges. It should provide means to categorize them with retrievable and thus

expressive semantics and provide methods for versioning and quality assurance to ensure their

sound evolution. For example in the eGovernment domain, the European Union has established

the semantic interoperability center SEMIC.EU [274] including an assets repository. However,

the focus is put on cross-border semantic interoperability within particular government branches

and less on cross-domain challenges. Moreover, the explicit mission of SEMIC.EU is to

promote reuse and harmonization of data formats and semantic assets [275]. This includes the

aim to establish so called pivot assets for core information entities to be consistently reused in

pan-European eGovernment processes, such as the recently published draft of a core person

model [281]. Consequently, it still follows rather the monolithic information model approach

and the concept of loose coupling on the semantic level and the provision and exchange of

semantic bridges is not yet well reflected. Thus, the incorporation of the developed approach in

this work into existing semantic clearinghouse platforms provides a potential field for future

work.

Another organizational perspective includes the question regarding the scope of domains. The

developed concept addresses semantic mediation on domain level but it leaves it open to the

specific integration scenarios which coverage the domain ontologies should have. Generally, it

has been argued that the scope of a domain ontology and the respective feasible semantic

standardization is limited and depends on certain organizational structures such as the existence

of an umbrella association etc. However, to derive concrete lower and upper bounds for

adequate domain ontology coverage and to identify the relevant factors for it requires dedicated

empirical studies in different organizational scenarios. From a macro-perspective this includes

the question about the adequate granularity of a network of loosely coupled domain ontologies

in an IT ecosystem of multiple organizations. Future research should address these issues with

the goal to develop a certain heuristic on how to map the structure of relations between

organizations or organizational entities to an adequate set of domain ontologies.

Besides these organizational issues, further functional advancements should be addressed in

future work. One point has already been discussed within the evaluation and concerns the

integration of external information into the semantic mediation mechanism. Semantic

heterogeneities may include different representations of information such as zip codes vs.

location names or representing a person with more or less detailed information about e.g. name

and place of birth etc. These differences cannot be semantically bridged without considering

further external information. A starting point for this advancement is to enable the rule-based

semantic bridges to incorporate function calls, which then provide access to the required

Chapter 8

200

external information or functionality. The built-in functions already applied to provide

procedures within the declarative rules such as string concatenation to overcome certain

granularity differences could be reused as an entry point for external function calls. The generic

extension then would consider the incorporation of Semantic Web service calls that process

information on the same abstraction level as used within the description logic based rules and

which then provide access to any kind of external functionality or information.

Regarding the general range of functionality of the semantic mediation toolkit, it has to be

stated that it just provides the basic functions to demonstrate how the approach can be

incorporated into the SOA life-cycle as a proof-of-concept. Thus, in order to evolve towards

mature industry products, the prototypes have to be extended substantially or integrated into

existing products to provide the required comprehensive functionality. E.g. the mediated service

composer has just provided certain control flow and information flow features allowing the

design of relatively simple BPEL-based service orchestrations. For more complex business

processes as well the full set of advanced BPEL constructs including scopes, compensation

handling and events would become necessary.

When discussing the general readiness of the approach for wide industry adoption, it has to be

distinguished between the concept of loosely coupled information models in SOA as a more

effective and efficient information architecture pattern and the proposed instantiation based on

evolving Semantic Web technologies. As shown in the case study of the German Chambers of

Industry and Commerce, it is already possible to apply the concept of loosely coupled

information models with state-of-the-art traditional XML-based technologies to a certain extent.

However, the full potential is just achieved by mapping it to Semantic Web technologies as for

instance complicated technical transformation coding is substituted by the declarative rule-based

approach. As Semantic Web technologies are still an emerging technology, most provided

frameworks and APIs are still academic-driven and at so called beta stage and thus cannot yet

be considered as technically stable and mature. In fact, during development and testing of the

semantic mediation toolkit several problems and bugs did occur. While most of the reported

bugs could be solved by contacting the developers of the provided frameworks (e.g. the

Protégé-API for ontology handling), as well workarounds had to be implemented for some

problems. To minimize these drawbacks and due to the raised claim of being not only of

theoretical but also of practical relevance, the presented approach relies strongly on existing

Web standards. This includes both: State-of-the-art XML-based technologies such as BPEL,

XSLT, XPath and XML Schema on the one hand and Semantic Web technologies such as

OWL, OWL-S and SWRL rules on the other hand. The frameworks and APIs supporting these

existing standards can be regarded as the most mature including available tool support and

stability compared to others. Nevertheless, the evolution of these standards and tools is still in

progress and future versions of the developed semantic mediation toolkit should consider this

evolution.

In this process, the future will show to which extent the Semantic Web technologies exploited

for the additional semantic layer on top of traditional Web service technology will emerge to

mature industry standards. An alternative path could be as well a kind of convergence. In this

perspective, the identified benefits of the underlying language concepts of Semantic Web

technologies such as polymorphism or facet analysis classification will have an impact in terms

of an infusion into the evolution of the XML-based standards itself. For instance in [276] it is

discussed how XML languages can be extended to support a polymorphic type system such as

subtyping, inference of types, etc. However, irrespective of which path will be adopted, the gap

between the two technology layers will be at the focus of further evolution. For instance a new

recommendation for Semantic Web service descriptions namely SAWSDL [277] has recently

been published by the W3C, which directly integrates semantic annotations into WSDL-based

Conclusion and Outlook

201

Web services descriptions. Even though solid tool support is still missing and thus the older and

more mature W3C recommendation OWL-S has been favored for the prototypical toolkit, the

lightweight approach of SAWSDL seems to be promising and suitable to foster industry

adoption. Another candidate for observation is XSPARQL [278], which promises to improve

the translation between the traditional XML and the semantics-oriented RDF world. This is

particularly relevant for grounding of Semantic Web services to existing traditional Web

services. Currently, the developed semantic mediation toolkit performs the required lifting and

lowering translations in terms of XSL transformations, which process the ontology annotations

on the level of their XML serialization. XSPARQL promises a more adequate way as it is

designed to understand both meta-models and to provide powerful means for transformation

between the two in any direction. Consequently, the evolution of the semantic mediation toolkit

should consider these advancements to further ease the provision of the additional semantic

layer and thus facilitate industry adoption.

In the long run these activities should contribute to the vision of performing cross-

organizational IT system integration in a seamless manner with less complexity. In this process,

established SOA concepts and standards combined and advanced with powerful emerging

Semantic Web technologies pave the way for more effective and efficient semantic

interoperability enabled by semantic mediation between loosely coupled information models.

203

Bibliography

[1] Bass and Lee 2003. Building a Business Case for Enterprise Integration. Accenture

Research & Insights, www.accenture.com/xdoc/en/services/insights/building.pdf.

[2] Medjahed, B., et al. 2003. Business-to-business interactions: issues and enabling

technologies. The VLDB Journal, vol. 12, no. 1, pp. 59-85.

[3] European Commission IDABC 2004. The European Interoperability Framework (EIF)

Version 1.0.

[4] Erl, T. 2005. Book: Service-Oriented Architecture: Concepts, Technology, and Design.

Prentice Hall.

[5] Vetere, G. and Lenzerini, M. 2005. Models for Semantic Interoperability in Service-

oriented Architectures. IBM Syst. J.

[6] Doan, A., Noy, N. F. and Halevy, A. Y. 2004. Introduction to the Special Issue on

Semantic Integration. SIGMOD Rec. vol. 33, no. 4, pp. 11-13.

[7] Noy, N. F. 2004. Semantic integration: a survey of ontology-based approaches. SIGMOD

Rec. vol. 33, no. 4, pp. 65-70.

[8] Barnickel, N., Fluegge, M. and Schmidt, K. 2006. Interoperability in eGovernment

through Cross-Ontology Semantic Web Service Composition. Proceedings Workshop

Semantic Web for eGovernment of 3rd European Semantic Web Conference.

[9] Simon, H.A. 1996. Book: The Sciences of the Artificial. 3rd edn. MIT Press.

[10] March, S. T. and Smith, G. F. 1995. Design and natural science research on information

technology. Decis. Support Syst. vol.15, no. 4, pp. 251-266.

[11] Vaishnavi, V. and Kuechler, W. 2004. Design Research in Information Systems,

http://desrist.org/design-research-in-information-systems/

[12] OASIS WSBPEL TC: Web Services Business Process Execution Language Version 2.0.

Committee Specification, 31 January 2007, http://docs.oasis-

open.org/wsbpel/2.0/CS01/wsbpel-v2.0-CS01.html

[13] Miller, P. 2000. Interoperability. What is it and Why should I want it? Ariadne Magazine,

Issue 24, University of Bath UK

[14] European Commission IDABC. 2008. Draft Document as basis for European

Interoperability Framework 2.0.

[15] Sheth, P. 1999. Book: Interoperating Geographic Information Systems, Chapter 2

Changing Focus on Interoperability in Information Systems: From System, Syntax,

Structure to Semantics. Kluwer Academic Publishers Group

[16] Uschold, M. and Gruninger, M. 1996. Ontologies: principles, methods, and

applications. Knowledge Engineering Review, vol. 11, no. 2, pp. 93–155.

[17] Morris, C. W. 1971. Writings on the General Theory of Sign, Approaches to Semiotics.

Vol. 16.

[18] Pokraev, S., Reichert, M. et al. 2005. Semantic and Pragmatic Interoperability: A Model

for Understanding. Proceedings of the Open Interop Workshop on Enterprise Modelling

and Ontologies for Interoperability.

Bibliography

204

[19] World Wide Web Consortium 2002. Web Services Activity, www.w3.org/2002/ws/

[20] Veltman, K. 2001. Syntactic and Semantic Interoperability: New Approaches to

Knowledge and the Semantic Web, The New Review of Information Networking, vol. 7.

[21] Eagleton, T. 1996. Book: Literary Theory: An Introduction. Chapter 3: Structuralism and

Semiotics, University of Minnesota Press.

[22] Bell, D. 2004. UML basics: The class diagram. An introduction to structure diagrams in

UML 2. IBM Technical Library,

www.ibm.com/developerworks/rational/library/content/RationalEdge/sep04/bell/

[23] Chen, P. 1976. The Entity-Relationship Model: Towards a Unified View of Data, ACM

Transactions on Database Systems. ACM-Press, p. 9-36.

[24] Miller, J. and Mukerji, J. 2003. MDA Guide Version 1.0.1, OMG.

[25] Object Management Group (OMG), Model Driven Architecture, www.omg.org/mda

[26] Bieberstein et al. 2006. Book: Service-Oriented Architecture (SOA) Compass. Pearson

plc, pp. 1-19.

[27] Object Management Group. 1993. The common object request broker: Architecture and

specification. Revision 1.1/Omg Document No 91.12.1

[28] DeMichiel, L.G et al. 2002. Enterprise JavaBeans TM Specification, Sun Microsystems,

Inc.

[29] Olsen, G. 2006. From COM to Common. Queue Journal, Vol. 4, No. 5, pp. 20-26.

[30] Feng, G., Wang, C, Li, H. 2005. Web Services Based Cross-Organizational Business

Process Management. In Proc. of the seventh Asia-Pacific Web Conference -

APWeb2005. Springer Berlin/Heidelberg, pp. 548-559.

[31] Gerneth, M., Hansen, H., Mahr, B. et al. 1995. A Framework for Telemedicine Services

in Europe - An Essential Guide for Establishing Telemedicine Services, KIT-Report 123,

Technical University Berlin.

[32] Arsanjani, A. 2004. Service-oriented modeling and architecture. IBM Technical Library,

www.ibm.com/developerworks/library/ws-soa-design1/

[33] Gu, Q. and Lago, P. 2007. A stakeholder-driven service life cycle model for SOA. In

Proc. of IW-SOSWE‟07, pages 1–7.

[34] McBride, G. 2007. The Role of SOA Quality Management in SOA Service Lifecycle

Management, IBM Technical Library,

www.ibm.com/developerworks/rational/library/mar07/mcbride

[35] Barstow, A., Hendler, J. et al. 2004. OWL Web Ontology Language for Services (OWL-

S), W3C Submissions.

[36] Wikipedia article: Function (mathematics),

http://en.wikipedia.org/wiki/Function_(mathematics), accessed 30/05/2010

[37] Fluegge, M. 2008. PhD Thesis: Adaptive Service Composition in a semantics-based

Interoperability Infrastructure. Technical University Berlin.

[38] World Wide Web Consortium 2002. Web Services Activity, www.w3.org/2002/ws/

[39] The Integrity Center, Inc. 2006. Web Services FAQs.

www.integctr.com/WebServices/FAQ105W.asp

[40] Berners Lee, T. et al. 2005. Uniform Resource Identifier. IETF RFC 3986.

Bibliography

205

[41] World Wide Web Consortium 2004. Web Services Architecture Requirements. W3C

Working Group Note, www.w3.org/TR/wsa-reqs/

[42] Bettag, U. 2005. Article: Web Services. Informatiklexikon. Gesellschaft für Informatik,

www.gi-ev.de/no_cache/service/informatiklexikon/informatiklexikon-

detailansicht/meldung/web-services-95/

[43] Cervantes, H. and Hall, R.S. 2004. Autonomous Adaptation to Dynamic Availability

Through A Service-Oriented Component Model. Proceedings of the

International Conference on Software Engineering.

[44] IBM 2004. Web Services Transactions specifications,

www.ibm.com/developerworks/library/ws-coor

[45] OASIS 2006. WS-Security 1.1 OASIS standard. Web Services Security (WSS) TC,

www.oasis-open.org/committees/wss/

[46] World Wide Web Consortium 2007. SOAP Version 1.2 Part 1: Messaging Framework

(Second Edition). W3C Recommendation.

[47] Wide Web Consortium 2007. Web Service Description Language (WSDL) Version 2.0

Part 0: Primer. W3C Recommendation.

[48] IBM 2001. Web Services Flow Language (WSFL) Version 1.0.

[49] Microsoft 2001. XLANG, http://msdn.microsoft.com/en-

us/library/aa577463(BTS.70).aspx

[50] OASIS 2007.Web Service Business Process Execution Language Version 2.0,

http://docs.oasis-open.org/wsbpel/2.0/OS/wsbpel-v2.0-OS.html

[51] World Wide Web Consortium 1999. XML Path Language (XPath) Version 1.0. W3C

Recommendation.

[52] World Wide Web Consortium 1999. XSL Transformations (XSLT) Version 1.0 W3C

Recommendation.

[53] Paolucci, M., Sycara, K. et al. 2003. Toward a Semantic Web e-Commerce. Proc. of 6th

Int. Conf. on Business Information Systems.

[54] Klein, M. and Bernstein, A. 2004. Towards High-Precision Service Retrieval. IEEE

Internet Computing, vol. 8, p. 30-36.

[55] Srivastava, B. and Koehler, J. 2004. Web Service Composition - Current Solutions and

Open Problems. Proceedings Workshop on Planning for Web Services. ICAPS.

[56] Cover, R. 1998. XML and semantic transparency. Technology reports. OASIS cover

pages, http://xml.coverpages.org/xmlAndSemantics.html

[57] World Wide Web Consortium 1998. Document Object Model Level 1. W3C

Recommendation, http://www.w3.org/DOM/Activity

[58] Hitzler P., Kroetzsch M., Rudolph S. and Sure Y. 2008. Book: Semantic Web.

Grundlagen. Springer, 2008. p.10.

[59] Baader, F., Horrocks, I. and Sattler, U. 2001. Description Logics for the Semantic Web.

IEEE Data Engineering Bulletin, vol. 16 no. 25, pp. 4-9.

[60] Berners-Lee, T., Hendler, J. and Lassila, O. 2001. Journal Article: The Semantic Web: A

new form of Web content that is meaningful to computers will unleash a revolution of

new possibilities. Scientific American, www.scientificamerican.com/article.cfm?id=the-

semantic-web

[61] World Wide Web Consortium 2001. Semantic Web Activity, www.w3.org/2001/sw/

Bibliography

206

[62] World Wide Web Consortium 2004. Resource Description Framework (RDF), Concepts

and Abstract Syntax. W3C Recommendation, www.w3.org/RDF/

[63] Dumbill, E. 2001. Building the Semantic Web. Knowledge Technologies Conference.

[64] Gruber, T. R. 1993. A translation approach to portable ontology specifications. Knowl.

Acquis. 5, 2 (Jun. 1993), 199-220.

[65] Davies, J., Fensel, D. and Van Harmelen, F. 2003. Book: Towards the Semantic Web:

Ontology driven Knowledge Management. Sussex: John Wiley and Sons Ltd.

[66] Fensel, D. 2003. Book: Ontologies: A Silver Bullet for Knowledge Management and E-

Commerce. Springer, Berlin, Heidelberg, 2 edition.

[67] Baader, F., Horrocks, I. and Sattler, U. 2002. Description Logics for the Semantic Web,

KI Jorunal, vol. 16, no. 4, pp. 57-59.

[68] Gödel, K. 1931. Über formal unentscheidbare Sätze der Principia Mathematica und

verwandter Systeme I, Monatshefte für Mathematik und Physik, vol. 38, pp. 173-98.

[69] Donini, F. M. et al. 1991. The complexity of concept languages. Proceedings of the

Second International Conference on Principles of Knowledge Representation and

Reasoning.

[70] Hart, L. and Emery, P. 2004. A Description Logic for Use as the ODM Core. AT&T

Government Solutions, Inc.

[71] Stollberg, M. 2008. PhD Thesis: Scalable Semantic Web Service Discovery for Goal-

Driven Service-Oriented Architecture. University of Innsbruck.

[72] de Bruijn, J. 2006. Logics for the Semantic Web. In Cardoses, J., Semantic Web: Theory,

Tools and Applications. Idea Publishing Group.

[73] Horrocks, I., Li, L., Turi, D., and Bechhofer, S. 2004. The In- stance Store: Description

Logic Reasoning with Large Numbers of Individuals. In Proc. of International Workshop

on Description Logics (DL 2004).

[74] Motik, B., Shearer, R., and Horrocks, I. 2007. Optimized Reasoning in Description

Logics using Hypertableaux. In Proc. of the 21st Conference on Automated Deduction

(CADE-21).

[75] Hepp, M., de Leenheer, P., de Moor, A., and Sure, Y. 2007. Book: Ontology

Management: Semantic Web, Semantic Web Services, and Business Applications.

Semantic Web and Beyond. Springer.

[76] Gomez-Perez, A., Corcho, O., and Fernandez-Lopez, M. 2003. Ontological Engineering:

With Examples from the Areas of Knowledge Management, E-Commerce and Semantic

Web. Series of Advanced Information and Knowledge Processing. Springer.

[77] Harth, A. and Decker, S. 2005. Optimized Index Structures for Querying RDF from the

Web. In Proc. of 3rd Latin American Web Congress.

[78] de Leenheer, P. and Mens, T. 2006. Chapter 5 Ontology Evolution: State of the Art and

Future Directions. In Hepp, de Leenheer, de Moor, and Sure, editors, Ontology

Management. Semantic Web, Semantic Web Services, and Business Applications.

Springer.

[79] Noy, N. 2004. Semantic Integration: A Survey of Ontology-based Approaches. ACM

SIGMOD Record, vol. 33, no. 4, pp. 65-70.

[80] Scharffe, F. and de Bruijn, J. 2005. A language to specify mappings between ontologies.

In Proc. of the Internet Based Systems IEEE Conference (SITIS05).

Bibliography

207

[81] Davis, J., Studer, R., and Warren, P. 2006. Book: Semantic Web Technology. Trends and

Research in Ontology-based System. Wiley & Sons

[82] Borkin, S. A. 1980. The Semantic Relation Data Model: Foundation for a User Interface.

ICOD, pp. 47-64.

[83] World Wide Web Consortium 2004. RDF Vocabulary Description Language 1.0: RDF

Schema, W3C Recommendation.

[84] Defense Advanced Research Projects Agency (DARPA). 2000. DARPA Agent Markup

Language, www.daml.org

[85] Fensel, D. et al. 2001. OIL: an ontology infrastructure for the Semantic Web. IEEE

Intelligent Systems, vol. 16, no. 2.

[86] Horrocks, I. et al. 2004. SWRL: A Semantic Web Rule Language Combining OWL and

RuleML. W3C Member Submission.

[87] World Wide Web Consortium. 2005. Rule Interchange Format (RIF) Working Group,

www.w3.org/2005/rules/

[88] Bontas, E. and Mei, J. 2005. Reasoning Paradigms for SWRL-enabled Ontologies.

Technical Report TR-B-05-01. Freie Universitaet Berlin.

[89] Horrocks, I. et al. 2005. Semantic Web Architecture: Stack or Two Towers?. Principles

and Practice of Semantic Web Reasoning, pp. 37-41.

[90] Hobbs, J. R. and Pan, F. 2004. An Ontology of Time for the Semantic Web. ACM

Transactions on Asian Language Information Processing, vol. 3, pp. 66-85.

[91] Chen, H., Fellah, S. and Bishr, Y. 2005. Rules for geospatial semantic web applications.

W3C Workshop on Rule Languages for Interoperability.

[92] Sanchez, E. et al. 2006. Book: Fuzzy Logic and the Semantic Web, Elsevier Science.

[93] Stoilos, G. et al. 2006. Deliverable Analysis of Requirements for Further Language

Extensions, EU-IST Network of Excellence Knowledge Web.

[94] World Wide Web Consortium 2009. OWL 2 Web Ontology Language Profiles. W3C

Recommendation.

[95] Ruttenberg, A. et al. 2007. Advancing translational research with the Semantic Web.

BMC Bioinformatics.

[96] Hyvönen, E. et al. 2007. Elements of a National Semantic Web Infrastructure.

Proceedings of the First International Semantic Computing Conference (IEEE ICSC

2007).

[97] World Wide Web Consortium 2009. Semantic Web Service Interest Group,

www.w3.org/2002/ws/swsig/

[98] McIlraith, S., Son, T. and Zeng, H. 2001. Semantic Web Services. IEEE Intelligent

Systems. Special Issue on the Semantic Web, vol. 16, no. 2, pp.46-53.

[99] Gugliotta, A. 2006. Knowledge Modelling for Service-oriented Applications in the e-

Government Domain. AAAI Spring Symposium Series Stanford University.

[100] Trapero, R. 2008. Deliverable Service Deployment and Management System, IST-FP6-

Project Opuce.

[101] World Wide Web Consortium 2007. Semantic Annotations for WSDL and XML Schema,

Terminology. W3C Recommendation.

[102] Martin, D. 2004. OWL-S: Semantic Markup for Web Services. W3C Member

Submission.

Bibliography

208

[103] Lausen, H., Polleres, A., and Roman (eds.), D. 2005. Web Service Modeling Ontology

(WSMO). W3C Member Submission.

[104] Battle, S., et al. 2005. Semantic Web Services Framework (SWSF). W3C Member

Submission.

[105] Akkiraju, R. et al. 2005. Web Service Semantics - WSDL-S. W3C Member Submission.

[106] Farrell, J. and Lausen, H. 2007. Semantic Annotations for WSDL and XML Schema.

W3C Recommendation.

[107] World Wide Web Consortium 2004. OWL Web Ontology Language for Services

(OWLS). W3C Recommendation.

[108] Sirin, E. 2005. Tools for Semantic Web Services. Developers Day at the 2005 World

Wide Web Conference, Japan.

[109] Horrocks, I. et al. 2004. SWRL: A Semantic Web Rule Language Combining OWL and

RuleML, W3C Member Submission.

[110] de Bruijn, J et al. 2005. The Web Service Modeling Language WSML. Deliverable

D16.1. WSML Working Group, www.wsmo.org/wsml

[111] Haller, A. et al. 2005. WSMX - a semantic service-oriented architecture, in Web services.

In Proceedings of the IEEE international Conference on Web Services (ICWS 2005).

[112] Stollberg, M. and Norton, B. 2007. A Refined Goal Model for Semantic Web Services.

Proceedings of the Second International Conference on Internet and Web Applications

and Services (ICIW 2007).

[113] Johnson, T. and Eigenmann, R. 2006. Context-Sensitive Domain-Independent Algorithm

Composition and Selection. ACM SIGPLAN Notices, vol. 41.

[114] Mocan, A. et al. 2005. WSMO Mediators. D29v0.1. WSMO Final Draft.

[115] Ankolekar, A. et al. 2004. OWL Web Ontology Language for Services (OWLS). OWL-S'

Relationship to Selected Other Technologies, W3C Member Submission.

[116] Gruninger, M. and Menzel, C. 2003. The Process Specification Language (PSL) Theory

and Applications. AI Magazine, vol. 24, no. 3, pp. 63-74.

[117] Akkiraju, R. et al. 2005. Web service Semantics – WSDL-S, W3C Member Submission.

[118] Kopecky, J. 2007. Semantic Annotations for WSDL and XML Schema. Presentation at

W3C track of the WWW 2007 Conference.

[119] Polleres, A. 2004. AI Planning for Semantic Web Service Composition?, Survey of

current approaches, challenges and open questions, Presentation, DERI, Innsbruck.

[120] Srivastava, B. and Koehler, J. 2003. Web Service Composition - Current Solutions and

Open Problems. ICAPS Workshop on Planning for Web Services.

[121] Stollberg, M. 2008. PhD Thesis: Scalable Semantic Web Service Discovery for Goal-

driven Service-Oriented Architectures. University of Innsbruck.

[122] Hendler, J. 2004. Composition as planning, W3C Mailing List Semantic Web Service

Interest Group, http://lists.w3.org/Archives/Public/public-sws-ig/2004Feb/0001.html

[123] The Semantic Dialogues 2009. Progress Software Corporation,

http://web.progress.com/de-de/semanticdialogues/

[124] Noy, N. F. 2004. Semantic integration: a survey of ontology-based approaches. SIGMOD

Rec. vol. 33, no. 4, pp. 65-70.

Bibliography

209

[125] ITU-T, ISO/IEC 1995. Recommendation X.902. International Standard 10746-2. ODP

Reference Model: Descriptive Model.

[126] Kimball, R. and Caserta, J. 2004. Book: The Data Warehouse ETL Toolkit. Chapter

Introduction pp. xxxi, Wiley.

[127] Rahm, E. and Berstein, P. 2001. A survey of approaches to automatic schema matching.

The VLDB Journal, vol. 10, pp.334-350.

[128] Kemper, A. and Eickler, A. 2004. Datenbanksysteme, Oldenburg, pp. 129-133.

[129] Cal, A. et al. 2002. On the Expressive Power of Data Integration Systems. Proceedings of

the 21st International Conference on Conceptual Modeling, pp. 338-350.

[130] Naumann, F. 2006. Lecture Script: Informationsintegration. Humboldt Universität Berlin,

www.ki.informatik.hu-berlin.de/wbi/ii/folien/InfoInt_02_Szenarien.ppt

[131] Vallecillo, A. 2001. RM-ODP: The ISO Reference Model for Open Distributed

Processing, DINTEL Edition on Software Engineering, no. 3, pp. 69-99.

[132] Meyer, B. 1998. Book: Object-oriented software construction, Prentice Hall.

[133] Fluegge, M, Jaeger, M. C. 2006. Adding Value to E-Services: A Business-Oriented

Model, in (Jie Lu et. al. Eds.): E-Service Intelligence - Methodologies, Applications,

Technologies in Springer's book series on Computational Intelligence.

[134] Vallecillo, A. 2001. RM-ODP: The ISO Reference Model for Open Distributed

Processing, DINTEL Edition on Software Engineering no. 3, pp. 69-99.

[135] Kutvonen, L. 1998. PhD thesis: Trading services in open distributed environments.,

Department of Computer Science. University of Helsinki.

[136] Müller, S. et. al. 1996. Global Trader Cooperation in Open Service Markets. Trends in

Distributed Systems, pp.214-227.

[137] Uschold, M. and Gruninger, M. 1996. Ontologies: Principles, methods and applications.

Knowledge Engineering Review, vol. 11, no. 2, pp. 93-155.

[138] van Harmelen, F. 2006. Semantic Web Research anno 2006: main streams, popular

fallacies, current status and future challenges. Proceedings CIA-2006. Number 4149 in

Lecture Notes in Artificial Intelligence.

[139] Wache, H. et al. 2001. Ontology-Based Integration of Information - A Survey of Existing

Approaches. In Stuckenschmidt, H., ed., IJCAI-01 Workshop: Ontologies and

Information Sharing, pp. 108-117.

[140] Ehrig, M.; Staab, S.: 2004. QOM - quick ontology mapping. In Proceedings of the 3rd

International Semantic Web Conference, pp. 683-697.

[141] Euzenat, J. and Shavaiko, P. 2007. Chapter 5: Matching strategies, pp. 133, in Book:

Ontology Matching, Springer.

[142] van Harmelen, F. 2006. Semantic Web Research anno 2006: main streams, popular

fallacies, current status and future challenges. Proceedings CIA-2006. Number 4149 in

Lecture Notes in Artificial Intelligence.

[143] Scharffe, F. 2006. Deliverable WP2: Ontology Management D2.6 Mapping Tool

Background. European Commission Integrated Project: Data, Information, and Process

Integration with Semantic Web Services (DIP).

[144] Euzenat, J. and Shavaiko, P. 2007. Chapter 11 Conclusion in Book: Ontology Matching.

Springer, pp. 274.

Bibliography

210

[145] Calvanese, D., De Giacomo, G. and Lenzerini, M. 2001. A framework for ontology

integration. In Proc. of the First Semantic Web Working Symposium, pp. 303-316.

[146] Maedche, A. et. al. 2002. Mafra - A mapping framework for distributed ontologies.

Proceedings of the EKAW, vol. 2473 of Lecture Notes in Computer Science, Springer.

[147] Dou, D., McDermott, D. and Qi, P. 2004. Ontology Translation on the Semantic Web.

LNCS Journal of Data Semantics, vol. II, LNCS 3360, pp.35-57.

[148] Zhao, G., Zheng J. and Meersman, R. 2003. An Architecture Framework of Ontology

Development and Deployment. Proceedings of the E-Society IADIS International

Conference.

[149] Kalfoglou, Y. and Robertson, D. 2000. Applying experienceware to support ontology

deployment. In Proceedings of the International Conference on Software Engineering and

Knowledge Engineering.

[150] Li, Y. et al. 2003. Beyond Ontology Construction; Ontology Services as Online

Knowledge Sharing Communities. Lecture Notes in Computer Science, vol. 2870/2003,

pp.469-483.

[151] Cali, A. et al. 2008. Tightly Integrated Probalistic Description Logic Programs for

Representing Ontology Mappings. Proceedings of the 5th International Symposium on

Foundations of Information and Knowledge Systems.

[152] Zapletal, M. et al. 2008. UN/CEFACT's Modeling Methodology. Vdm Verlag Dr. Müller.

[153] Damodaran, S. 2004. B2B Integration over the Internet with XML-RosettaNet -

Successes and Challenges. In Proceedings 13th Int‟l Conf. World Wide Web, pp. 188-

195, ACM Press.

[154] Bosak, J. et al. 2006. Universal Business Language 2.0, OASIS.

[155] United Nations Centre for Trade Facilitation and Electronic Business. 2003. Core

Component Technical Specification – Part 8 of the ebXML Framework.

[156] Jakobs, K. 2000. Book: Information technology standards and standardization: a global

perspective, preface p.i, Idea Group Inc (IGI).

[157] McDermott, D. 1993. Book Review of Building Large Knowledge-Based Systems:

Representation and Inference in the Cyc Project (D.B. Lenat and R.V. Guha). Artificial

Intelligence 61, pp.53-63.

[158] Rada, R. 2000. Chapter 2 Consensus Versus Speed. In Book: Information technology

standards and standardization. Idea Group Inc (IGI).

[159] Cox, K. J. 2007. Gloabal B2B Forum, B2B Standards Primer.

[160] Gibb, B. K. and Damodaran, S. 2002. Ebxml: Concepts and Application. John Wiley &

Sons, Inc.

[161] Air Transport Association. 2000. e-Business Specifications. Spec 2000,

http://www.ataebiz.org/specifications/

[162] Open Application Group, Inc. (OAGi). 2009. Chemistry Industry Data eXchange (CIDX),

www.cidx.org

[163] Commerce eXtensible Markup Language (cXML) 1..2. 2008., www.cxml.org

[164] Commerce One. 2000. XML Common Business Library (xCBL), www.xcbl.org

[165] Weisgerber, S. 2008. Interoperabilität und Normung aus Sicht des Deutschen Instituts für

Normung. Workshop: E-Government-Standards für Wirtschaft und Verwaltung. D21-

Kongress.

Bibliography

211

[166] Blanchard, H.E. 1996. Standards: A Look Ahead and an Overview, SIGCHI Bulletin,

vol.28 no.4.

[167] Translated from: Weisgerber, S. 2008. Interoperabilität und Normung aus Sicht des

Deutschen Instituts für Normung. Workshop: E-Government-Standards für Wirtschaft

und Verwaltung. D21-Kongress

[168] Turowski, K. 2000. Establishing Standards for Business Components. In Jakobs, K.

(Hrsg.): Information Technology Standards and Standardisation: A Global Perspective.

Idea Group Publishing, Hershey, pp.131-151

[169] Mahr, B. 2006. Ein Modell der Auffassung, KIT-Report 151, Technical University

Berlin.

[170] Mahr, B. 2009. Information Science and the Logic of Models. Software and Systems

Modeling. Springer Berlin, vol. 8, no. 3, pp. 365-383.

[171] Mahr, B. 1997. Gegenstand und Kontext: Eine Theorie der Auffassung. In Prinzipien der

Kontextualisierung. Technical University Berlin.

[172] Orton, J. D., Weick, K. E. 1990. Loosely Coupled Systems: A Reconceptualization,

Academy of Management Review, vol. 15, no. 2, pp. 203-223.

[173] Wikipedia article: Loose Coupling, http://en.wikipedia.org/wiki/Loose_coupling,

accessed 30/05/2010

[174] Hess A, Humm B, Voß M. 2006. Regeln für serviceorientierte Architekturen hoher

Qualität. Informatik Spektrum, vol. 29, no. 6, pp. 395–411.

[175] Humm, B. 2008. Lecture Script: Service-Oriented Architecture. Darmstadt University of

Applied Sciences, www.fbi.h-

da.de/fileadmin/personal/b.humm/SOA_SS08/Lecture/07_Loose_Coupling.pdf

[176] Translated and adapted from: Hess, A., Humm, B. and Voß, M. 2006. Figure 7. Regeln

für serviceorientierte Architekturen hoher Qualität. Informatik Spektrum, vol. 29, no. 6,

pp. 395–411.

[177] Stevens, W., Myers, G. and Constantine, L. 1997. Structured Design. IBM Systems

Journal, vol. 13, no. 2, pp. 115-139.

[178] Schorlemmer, M. and Kalfoglou, Y. 2008. Institutionalising Ontology-Based Semantic

Integration. Journal of Applied Ontology, vol. 3, no. 3.

[179] Bicer, V. et. al. 2005. Providing Semantic Interoperability in the Healthcare Domain

through Ontology Mapping. Sigmod Record, vol. 34, no. 3.

[180] Ehrig, M. and Staab, S. 2004. QOM - Quick Ontology Mapping. In F. van Harmelen, S.

McIlraith, and D. Plexousakis, editors, Proceedings of the Third International Semantic

Web Conference.

[181] Klein M. et. al. 2000. The Relation between Ontologies and Schema-languages:

Translating OIL-specifications in XML-Schema. In Proceedings of the 14th European

Conference on Artificial Intelligence (ECAI'00).

[182] Taylor, A. G. 1992. Book: Introduction to Cataloging and Classification. 8th ed.

Englewood, Colorado. Libraries Unlimited.

[183] Ranganathan, S.R. 1957. Book: Prolegomena to Library Classification. The Garden City

Press Ltd.

[184] World Wide Web Consortium. 2004. OWL Web Ontology Language Reference. 3.1.2

Property restrictions. W3C Recommendation.

Bibliography

212

[185] Barnickel, N., Weinand R. and Fluegge M. 2008. Semantic System Integration –

Incorporating Rule-based Semantic Bridges into BPEL processes. In Proceedings of

EON-SWSC 2008 Evaluation of Ontology-based Tools and the Semantic Web Service

Challenge (ESWC2008).

[186] Bosch, J. and Kruege, C. 2004. Proceedings Software reuse: methods, techniques, and

tools. 8th International conference (ICSR 2004), pp.28.

[187] Fensel, D. 2003. Book: Ontologies: A Silver Bullet for Knowledge Management and E-

Commerce. Springer, Berlin, Heidelberg, 2 Edition.

[188] Barnickel, N. 2008. Organisationsübergreifende Entwicklung von Prozessorientierten

Datenaustauschstandards. Fachtagung Verwaltungsinformatik der Gesellschaft für

Informatik.

[189] Noy, N. F. and McGuinness, D. L. 2001. Ontology development 101: A guide to creating

your first ontology, Stanford Knowledge Systems Laboratory Technical Report KSL-01-

[190] Sure, Y. et al. 2002. Collaborative Ontology Development for the Semantic Web. Lecture

notes in computer science. Springer.

[191] Noy, N. F., Crubezy, M. and Fergerson, H. 2003. Protégé Ontology Editor. Stanford

University‟s National Center for Biomedical Ontology, AMIA Annu Symp Proc.

[192] Altova 2010. SemanticWorks Semantic Web tool - Visual RDF and OWL editor,

www.altova.com/semanticworks.html

[193] Böttcher, J. 2009. Diploma Thesis: Semantische Datenflussannotation in der

Geschäftsprozessmodellierung. Freie Universität Berlin.

[194] Object Management Group 2008. Specification: The Business Process Modelling

Notation (BPMN 1.1).

[195] van der Aalst, W. 1999. Formalization and Verification of Event-driven Process Chains.

Information & Software Technology, vol. 41, pp. 10, pp. 639-650.

[196] Hepp, M and Roman, D. 2007. An Ontology Framework for Semantic Business Process

Management. Proceedings of Wirtschaftsinformatik 2007.

[197] Semtation GmbH. 2008. SemTalk., www.semtalk.com

[198] Microsoft 2007. Visio 2007, http://office.microsoft.com/en-us/visio/

[199] European Commission 2007. Integrated Project: Semantic Utilised for Process

Management within and between Enterprises (SUPER). 6th Framework Programme,

www.ip-super.org

[200] Ma, Z. et al. 2007. Semantic business process management: A lifecycle based

requirements analysis. In Proc. of Workshops on Semantic Business Process and Product

Lifecycle Management (SBPM 2007) at the 4th European Semantic Web Conference

(ESWC 2007).

[201] 1 a) matching architecture adapted from: Engmann, D. and Massmann, S. 2007. Instance

Matching with COMA++. BTW 2007 Workshop: Model Management und Metadaten-

Verwaltung.

[202] van Harmelen, F. 2006. Semantic Web Research anno 2006: main streams, popular

fallacies, current status and future challenges. Proceedings CIA-2006. Number 4149 in

Lecture Notes in Artificial Intelligence.

[203] Euzenat, J. and Shavaiko, P. 2007. Chapter 11 Conclusion in Book: Ontology Matching,

Springer, pp. 274.

Bibliography

213

[204] Euzenat, J. and Shavaiko, P. 2007. Book: Ontology Matching, Springer.

[205] Falconer, S. M., Noy, N. F. and Storey, M. 2007. Ontology Mapping - A User Survey.

2nd International Workshop on Ontology Matching.

[206] O‟Connor, M. et al. 2005. Writing Rules for the Semantic Web Using SWRL and Jess.

Protégé With Rules workshop collocated with 8th International Protégé Conference.

[207] Horrocks, I. et al. 2004. SWRL: A Semantic Web Rule Language. Combining OWL and

RuleML. W3C Member Submission.

[208] Ressler, J. 2007. Application of Ontology Translation. Proceedings of the Sixth

International Semantic Web Conference (ISWC2007).

[209] BBN Technologies 2007. Snoggle - A Graphical, SWRL-based Ontology Mapper.

[210] Burnstein, I. 2003. Book: Practical Software Testing. Springer.

[211] Spillner, A. and Linz, T. 2005. Basiswissen Softwaretest. 3rd edition. dpunkt.verlag.

[212] Brügge, B., Dutoit, A. 2004. Book: Objektorientierte Softwaretechnik mit UML,

Entwurfsmuster und Java. Pearson Studium.

[213] Paschke, A. et al. 2006. On Self-Validating Rule Bases. Proceedings of 2nd International

Workshop on Semantic Web Enabled Software Engineering (SWESE 2006).

[214] Vrandecic, D. and Gangemi, A. 2006. Unit tests for ontologies. Proceedings of the 1st

International Workshop on Ontology content and evaluation in Enterprise. Springer.

[215] Ontoprise 2007. OntoStudio User Manual. Version 2.0.

[216] Ding, Y. and Fensel, D. 2001. Ontology library systems: The key to successful ontology

re-use. In Int. Semantic Web Working Symposium (SWWS 2001).

[217] Carroll, J. and Roo, J. D. 2004. OWL Web Ontology Language Test Cases. W3C

Recommendation.

[218] Paschke, A 2005. The Contract Log Approach Towards Test-driven Verification and

Validation of Rule Bases – A Homogeneous Integration of Test Cases and Integrity

Constraints into Evolving Logic Programs and Rule Markup Languages (RuleML).

Technical Report 10/2005. IBIS. Technical University Munich.

[219] Evan der Vlist, E. 2002. XSLTunit, http://xsltunit.org

[220] Altova 2007. XMLSpy XSLT Debugger, www.altova.com/xmlspy/xslt-debugger.html

[221] Kerrigan, M. et al. 2007. The Web Service Modeling Toolkit - An Integrated

Development Environment for Semantic Web Services. Proceedings of the 4th European

Semantic Web Conference (ESWC). System Description Track.

[222] Heß, A., Johnston, E. and Kushmerick, N. 2004. ASSAM: A Tool for Semi-

Automatically Annotating Semantic Web Services. Lecture Notes in Computer Science.

3rd International Semantic Web Conference (ISWC 2004).

[223] Sirin, E. and Parsia, B. 2004. The OWL-S Java API. Proceedings of the 3rd International

Semantic Web Conference (ISWC 2004).

[224] Fluegge, M. and Barnickel, N. et al. 2008. Semantic Interoperability – Report about

Technological State-of-the-Art. Working Document wd3.2.1c. European Commission

Integrated Project: QualiPSo.

[225] Fluegge, M. and Barnickel, N. et al. 2008, Working document 3.2.3 – Work package

Semantic Interoperability. European Commission Integrated Project: QualiPSo.

Bibliography

214

[226] European Commission 2004. IST-1-002104-STP, SATINE (Semantic-based

Interoperability Infrastructure for Integrating Web Service Platforms to Peer-to- Peer-

Networks) project.

[227] Flügge, M. and Tourtchaninova, D. 2004. Ontology-derived Activity Components for

Composing Travel Web Services. International Workshop on Semantic Web

Technologies in Electronic Business (SWEB2004).

[228] Dogac, A. et al. 2004. Semantically Enriched Web Services for Travel Industry. Sigmod

Record.

[229] Bicer, V. 2004. OWLmt - OWL Mapping Tool Technical Report. SRDC-METU.

[230] Fluegge, M. 2008. PhD Thesis: Adaptive Service Composition in a Semantics-based

Interoperability Infrastructure. Technical University Berlin.

[231] Ambler, S. W. 2005. The Agile Unified Process (AUP),

www.ambysoft.com/unifiedprocess/agileUP.html

[232] Kruchten, P. et al. 1998. Book: The Rational Unified Process: An Introduction, Addison-

Wesley.

[233] Xiaofan, L., Böttcher, J. and Barnickel, N.2008. Enterprise Ontology-oriented Business

Process Modeling – Tool Evaluation Study. Seminar Paper. OKS Group of Computer

Science Department Technical University Berlin in Cooperation with Fraunhofer Institute

FOKUS.

[234] Decker, G., Overdick, H. and Weske, M. 2008. Oryx - An Open Modeling Platform for

the BPM Community. BPM. Volume 5240 of LNCS, pp. 382-385, Springer.

[235] World Wide Web Consortium 2003. Scalable Vector Graphics (SVG) 1.1 Specification.

W3C Recommendation.

[236] World Wide Web Consortium. 2004. OWL Web Ontology Language Reference. 3.1.2

Property restrictions, W3C Recommendation.

[237] Motik, B. 2008. Reasoning in KAON2, http://kaon2.semanticweb.org/#reasoning

[238] Mindswap 2008. Pellet Reasoner. Release version 1.5.2, http://pellet.owldl.com/

[239] Protégé Community of Practice 2008. Protégé Wiki Article: SWRLExtensionsBuiltIns,

http://protege.cim3.net/cgi-bin/wiki.pl?SWRLExtensionsBuiltIns

[240] Clark & Parsia. 2008. FAQ - Does Pellet support rules?, http://pellet.owldl.com/faq/rules

[241] Stanford Center for Biomedical Informatics Research 1997. Protégé-OWL editor.

http://protege.stanford.edu/overview/protege-owl.html

[242] Friedman-Hill, E. 2008. The Jess Rule Engine. Sandia National Laboratories,

http://herzberg.ca.sandia.gov/jess/

[243] Protégé Community of Practice 2008. Protégé Wiki Article: SWRL-JessBridge,

http://protege.cim3.net/cgi-bin/wiki.pl?SWRLJessBridge

[244] Protégé Community of Practice 2008. Protégé Wiki Article: SWRLFactoryFAQ. How do

I create a rule using the SWRL Factory?, http://protege.cim3.net/cgi-

bin/wiki.pl?SWRLFactoryFAQ#nid6N9

[245] Barnickel, N, Böttcher, J. and Paschke, A. 2010. Semantic Mediation of Information

Flow in Cross-Organizational Business Process Modeling. Proceedings Semantic

Business Process Management Workshop at 7th Extended Semantic Web Conference

(ESWC2010).

Bibliography

215

[246] Lausen, H., Zaremba, M. and Petrie, C. 2008. The Semantic Web Service Challenge.

Scenario: Purchase Order Mediation,

http://sws-challenge.org/wiki/index.php/Scenario:_Purchase_Order_Mediation

[247] Fluegge, M. and Barnickel, N. et al. 2008, Working document 3.2.4 – Semantic

Interoperability Scenarios and Experiments. European Commission Integrated Project:

QualiPSo.

[248] Brun, M. K. and Nielsen, B. 2003. Naming and Design Rules for eGovernment - The

Danish Approach. In Proceedings of XML 2003.

[249] Danish Ministry of Science, technology and Innovation 2005. Danish e-Government

Project: Infostructurebase, http://en.itst.dk/it-architecture-standards/data-

standardisation/infostructurebase/infostructurebase

[250] Deutschland Online Standardisierung 2006. OSCI-XÖV – XML Standards für die

öffentliche Verwaltung, www.xoev.de

[251] Active Endpoints 2008. ActiveBPEL. Release: ActiveBPEL-engine 4.1,

www.activevos.com/community-open-source.php

[252] Apache Software Foundation 2006. Apache ODE (Orchestration Director Engine),

http://ode.apache.org/

[253] Oracle 2005. Oracle BPEL Process Manager,

www.oracle.com/technology/products/ias/bpel

[254] Active Endpoints. 2007. Creating and Using Custom XPath Functions,

www.activebpel.org/samples/samples-4/ custom_functions/doc/index.html

[255] Kelly Thompson. Apache ODE FAQ - Language-Related BPEL Questions,

http://ode.apache.org/faq.html

[256] Antony Reynolds. 2007. Antony Reynolds Blog - Extending XPath,

http://blogs.oracle.com/reynolds/2007/07/18/

[257] Codehaus. Jaxen: Universal Java XPath Engine, http://jaxen.codehaus.org

[258] BerliOS 2010. Open Source Software Development Platform. Fraunhofer Institute

FOKUS. Berlin, www.berlios.de

[259] Barnickel, N and Böttcher, J. 2009. Project: Semantic Dataflow for BPM,

http://developer.berlios.de/projects/semext4oryx/

[260] Barnickel, N and Antonenko, L. 2008. Project: Semantic Mapping Testing,

http://developer.berlios.de/projects/mappingtesting/

[261] Barnickel, N and Weinand, R. 2008. Project: SWSComposer4BPEL,

http://developer.berlios.de/projects/swsc4bpel/

[262] Barnickel, N and Weinand, R. 2008. Project: Semantic BPEL Engine Extension,

http://developer.berlios.de/projects/sembpelext/

[263] Association of German Chambers of Industry and Commerce (DIHK). 2008. About Us,

www.dihk.de/english/aboutus/overview.pdf

[264] Chappell, D. 2004. Book: Enterprise Service Bus. Publisher: O‟Reilly Media Inc.

[265] Bremen Online Services 2001. OSCI – The secure communication standard for E-

Government, www.osci.de/materialien/englisch.pdf

[266] Bundesministerium des Inneren 2009. XÖV-Handbuch. Deutschland Online

Standardisierung, www.xoev.de/sixcms/media.php/13/2010-03-02-Handbuch-final.pdf

[267] XÖV AG Datenkonferenz. 2007, www.standardisierung.deutschland-online.de

Bibliography

216

[268] Deutschland Online Standardisierung 2009. XÖV-Kernkomponenten-Bibliothek.

Deutschland Online XRepository, www.xrepository.deutschland-online.de

[269] Deutschland Online Standardisierung 2007. Bericht der XÖV-AG „Datenkonferenz“ zur

Erprobung der XÖV-Kernkomponenten-Bibliothek, p.3

[270] OSCI-Leitstelle 2008. Interoperabilität von XÖV-Standards der Innenverwaltung –

Bericht an den AK I der IMK, Version 1.1.

[271] Deutschland Online Standardisierung KoopA-ADV 2006. XÖV-Framework V1.0.

Leitlinien für die XÖV-Standardisierung.

[272] Barnickel, N. and Fluegge, M. 2010. Towards a Conceptual Framework for Semantic

Interoperability in Service-Oriented Architectures. Proceedings of ACM International

Conference on Intelligent Semantic Web – Services and Applications (ISWSA2010).

[273] Barnickel, N. and Fluegge, M. 2010. Transferring the Principle of Loose Coupling to the

Semantic Level. Proceedings of ACM International Conference on Intelligent Semantic

Web – Services and Applications (ISWSA2010).

[274] European Commission 2008. The Semantic Interoperability Centre Europe - SEMIC.EU,

www.semic.eu

[275] Klaus Reichling et al. 2008. Orientation Paper: IDABC XML Clearinghouse: SEMIC.EU.

[276] Hosoya, H., Frisch, A. and Castagna, G. 2009. Parametric polymorphism for XML. ACM

Trans. Program. Lang. Syst., vol. 32, no. 1, pp 1-56.

[277] World Wide Web Consortium 2007. Semantic Annotations for WSDL and XML Schema.

W3C Recommendation.

[278] Akhtar, W., Kopecký, J., Krennwallner, T. and Polleres, A. 2008. XSPARQL: Traveling

between the XML and RDF worlds – and avoiding the XSLT pilgrimage. Proceedings of

5th European Semantic Web Conference, vol. 5021 of LNCS, Springer.

[279] The World Wide Web Consortium 2004. OWL Web Ontology Language Reference. 1.3

OWL syntax. W3C Recommendation, www.w3.org/TR/owl-ref/#Syntax

[280] The World Wide Web Consortium 2004. SWRL: A Semantic Web Rule

LanguageCombining OWL and RuleML. 1 2.2. Human Readable Syntax. W3C Member

Submission, www.w3.org/Submission/SWRL/#2.2

[281] European Commission 2010. SEMIC.EU Core Person: Final Draft.The Semantic

Interoperability Centre Europe, www.epractice.eu/en/news/330117

[282] Barnickel, N., Böttcher, J. and Paschke, A. 2010. Incorporating Semantic Bridges into

Information Flow of Cross-Organizational Business Process Models. Proceedings of the

6th International Conference on Semantic Systems (I-Semantics2010).

[283] Folmer, E. and Bastiaans, J. 2008. Quality of Electronic Messaging Standards‟

Specifications. Proceedings of 14th International Conference on Concurrent Enterprising

(ICE2008)

[284] World Wide Web Consortium 2004. Web Services Glossary. W3C Working Group Note

11, www.w3.org/TR/ws-gloss/#choreography

[285] Decker, G. 2009. PhD Thesis: Design and Analysis of Process Choreographies. Hasso

Plattner Institute, University of Potsdam.

217

Appendix

Sample Documents:

Domain Ontology Sample “RosettaNetOntology”

(excerpt in RDF/XML Syntax [279])

<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF xmlns="http://localhost:8080/ontologies/RosettaNetOntology.owl#"

xmlns:rosetta="http://localhost:8080/ontologies/RosettaNetOntology.owl#"

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:xsd="http://www.w3.org/2001/XMLSchema#"

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"

xmlns:owl="http://www.w3.org/2002/07/owl#"

xml:base="http://localhost:8080/ontologies/RosettaNetOntology.owl">

<owl:Ontology rdf:about="http://localhost:8080/ontologies/RosettaNetOntology.owl"/>

<owl:Class rdf:ID="Partner">

<owl:equivalentClass>

<owl:Class>

<owl:intersectionOf rdf:parseType="Collection">

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasBusinessDescription"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasContactInformation"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasPhysicalAddress"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

</owl:intersectionOf>

</owl:Class>

</owl:equivalentClass>

</owl:Class>

…

<owl:FunctionalProperty rdf:about="#hasBusinessDescription">

<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#ObjectProperty"/>

<rdfs:range rdf:resource="#BusinessDescription"/>

</owl:FunctionalProperty>

…

<owl:Class rdf:ID="BusinessDescription ">

<owl:equivalentClass>

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasBusinessName"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

</owl:equivalentClass>

</owl:Class>

…

<owl:FunctionalProperty rdf:about="#hasBusinessName">

<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#DatatypeProperty"/>

<rdfs:range rdf:resource="http://www.w3.org/2001/XMLSchema#string"/>

Appendix

218

</owl:FunctionalProperty>

…

</rdf:RDF>

Domain Ontology Sample “MoonOntology”

(excerpt in RDF/XML Syntax [279])

<?xml version="1.0" encoding="UTF-8"?>

<rdf:RDF xmlns="http://localhost:8080/ontologies/MoonOntology.owl#"

xmlns:moon="http://localhost:8080/ontologies/MoonOntology.owl#"

xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:xsd="http://www.w3.org/2001/XMLSchema#"

xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"

xmlns:owl="http://www.w3.org/2002/07/owl#"

xml:base="http://localhost:8080/ontologies/MoonOntology.owl">

<owl:Ontology rdf:about="http://localhost:8080/ontologies/MoonOntology.owl"/>

<owl:Class rdf:ID="Customer">

<owl:equivalentClass>

<owl:Class>

<owl:intersectionOf rdf:parseType="Collection">

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasCustomerInfo"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasPostalAddress"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

</owl:intersectionOf>

</owl:Class>

</owl:equivalentClass>

</owl:Class>

…

<owl:FunctionalProperty rdf:about="#hasCustomerInfo">

<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#ObjectProperty"/>

<rdfs:range rdf:resource="#CustomerInfo"/>

</owl:FunctionalProperty>

…

<owl:Class rdf:ID="CustomerInfo ">

<owl:equivalentClass>

<owl:Class>

<owl:intersectionOf rdf:parseType="Collection">

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasBusinessName"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

<owl:Restriction>

<owl:onProperty><owl:FunctionalProperty rdf:ID="hasEmail"/></owl:onProperty>

<owl:cardinality rdf:datatype="http://www.w3.org/2001/XMLSchema#int">1</owl:cardinality>

</owl:Restriction>

…

</owl:intersectionOf>

</owl:Class>

</owl:equivalentClass>

</owl:Class>

…

<owl:FunctionalProperty rdf:about="#hasBusinessName">

<rdf:type rdf:resource="http://www.w3.org/2002/07/owl#DatatypeProperty"/>

<rdfs:range rdf:resource="http://www.w3.org/2001/XMLSchema#string"/>

</owl:FunctionalProperty>

…

Appendix

219

</rdf:RDF>

Semantic Bridge Sample “RosettaNetOntology2MoonOntology”

(excerpt in Human Readable Syntax [280])

SWRL Rule “BusinessDescription&ConatctInformation2CustomerInfo”

rosetta:Partner (?partner) ∧

rosetta:hasBusinessDescription (?partner, ?businessDesc) ∧

rosetta:hasBusinessName (?businessDesc, ?businessName) ∧

rosetta:hasConatctInformation (?partner, ?contactInfo) ∧

rosetta:hasEmailAddress (?contactInfo, ?email) ∧

swrlx:makeOWLThing (?newCustomerInfo, ?partner)

⇒

moon:hasCustomerInfo (?partner, ?newCustomerInfo) ∧

moon:hasBusinessName (?newCustomerInfo, ?businessName) ∧

moon:hasEmail (?newCustomerInfo, ?email) ∧

rdfs:label (?partner, "firedRule_BusinessDescription&ConatctInformation2CustomerInfo ")

SWRL Rule “PhysicalAddress2PostalAddress”

(in Human Readable Syntax):

rosetta:PhysicalAddress (?physicalAddress) ∧

rosetta:hasAddressLine1 (?physicalAddress, ?addressLine1) ∧

rosetta:hasCityName (?physicalAddress, ?city) ∧

rosetta:hasGlobalCountryCode (?physicalAddress, ?countryCode) ∧

rosetta:hasNationalPostalCode (?physicalAddress, ?postalCode)

⇒

moon:hasStreet (?physicalAddress, ?addressLine1) ∧

moon:hasCity (?physicalAddress, ?city) ∧

moon:hasCountryCode (?physicalAddress, ?countryCode) ∧

moon:hasPostalCode (?physicalAddress, ?postalCode) ∧

rdfs:label (?physicalAddress,"firedRule_PhysicalAddress2PostalAddress")

Appendix

220

Semantic Web Service Sample “MoonCRMService”

(excerpt in RDF/XML Syntax [279])

<?xml version="1.0" encoding="WINDOWS-1252"?>

<rdf:RDF xmlns:process="http://www.daml.org/services/owl-s/1.1/Process.owl#"

xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns="http://www.example.org/service.owl"

xmlns:service="http://www.daml.org/services/owl-s/1.1/Service.owl#"

xmlns:grounding="http://www.daml.org/services/owl-s/1.1/Grounding.owl#"

…

xmlns:profile="http://www.daml.org/services/owl-s/1.1/Profile.owl#"

xml:base="http://localhost:8080/SemanticWebServices/MoonServices/MoonCRMService.owl#">

<owl:Ontology rdf:about="">

<owl:imports rdf:resource="http://localhost:8080/ontologies/MoonOntology.owl"/>

</owl:Ontology>

<service:Service rdf:ID="MoonCRMServiceService">

<service:presents rdf:resource="#MoonCRMServiceProfile"/>

<service:describedBy rdf:resource="#MoonCRMServiceProcess"/>

<service:supports rdf:resource="#MoonCRMServiceGrounding"/>

</service:Service>

<profile:Profile rdf:ID="MoonCRMServiceProfile">

<profile:serviceName xml:lang="en">MoonCRMService</profile:serviceName>

<profile:textDescription xml:lang="en">

Looks up a Customer in a CRM and returns an IdentifiedCustmer.

</profile:textDescription>

<profile:hasInput rdf:resource="#CustomerLookupRequest"/>

<profile:hasOutput rdf:resource="#CustomerLookupResponse"/>

</profile:Profile>

<process:AtomicProcess rdf:ID="MoonCRMServiceProcess">

<rdfs:label>MoonCRMServiceProcess</rdfs:label>

<process:hasInput rdf:resource="#CustomerLookupRequest"/>

<process:hasOutput rdf:resource="#CustomerLookupResponse"/>

</process:AtomicProcess>

<process:Input rdf:ID="CustomerLookupRequest">

<process:parameterType rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://.../MoonCRMService.owl#CustomerLookupRequest

</process:parameterType>

<rdfs:label>CustomerLookupRequest</rdfs:label>

</process:Input>

<process:Output rdf:ID="CustomerLookupResponse">

<process:parameterType rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://.../MoonCRMService.owl#CustomerLookupResponse

</process:parameterType>

<rdfs:label>CustomerLookupResponse</rdfs:label>

</process:Output>

<grounding:WsdlGrounding rdf:ID="MoonCRMServiceGrounding">

…

<grounding:WsdlAtomicProcessGrounding rdf:ID="MoonCRMServiceAtomicProcessGrounding">

<grounding:owlsProcess rdf:resource="#MoonCRMServiceProcess"/>

<grounding:wsdlDocument rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl

</grounding:wsdlDocument>

<grounding:wsdlOperation>

<grounding:WsdlOperationRef>

<grounding:portType rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl#CRMServicePortType

</grounding:portType>

<grounding:operation rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

Appendix

221

http://sws-challenge.org/services/CRMService?wsdl#search

</grounding:operation>

</grounding:WsdlOperationRef>

</grounding:wsdlOperation>

<grounding:wsdlInputMessage rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl#SearchCustomerRequestMessage

</grounding:wsdlInputMessage>

<grounding:wsdlInput>

<grounding:WsdlInputMessageMap>

<grounding:owlsParameter rdf:resource="#CustomerLookupRequest"/>

<grounding:wsdlMessagePart rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl#SearchCustomerRequest

</grounding:wsdlMessagePart>

<grounding:xsltTransformationString>

<![CDATA[ <xsl:stylesheet version="1.0" xmlns:moon="http://localhost:8080/ontologies/MoonOntology.owl#"

…

<xsl:template match="/">

<xsl:value-of select="/rdf:RDF/rdf:Description/moon:hasBusinessName"/>

</searchString>

</xsl:template>

</xsl:stylesheet> ]]>

</grounding:xsltTransformationString>

</grounding:WsdlInputMessageMap>

</grounding:wsdlInput>

<grounding:wsdlOutputMessage rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl#SearchCustomerResponseMessage

</grounding:wsdlOutputMessage>

<grounding:wsdlOutput>

<grounding:WsdlOutputMessageMap>

<grounding:owlsParameter rdf:resource="#CustomerLookupResponse"/>

<grounding:wsdlMessagePart rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURI">

http://sws-challenge.org/services/CRMService?wsdl#SearchCustomerResponse

</grounding:wsdlMessagePart>

<grounding:xsltTransformationString>

<![CDATA[<xsl:stylesheet version="1.0" … xmlns:xmoon="mooncompany">

<xsl:template match="/">

<rdf:RDF

xmlns:moon="http://localhost:8080/ontologies/MoonOntology.owl#"

…

xmlns="http://localhost:8080/process/MediationProcess/MoonCRMServiceOutput.owl#"

xml:base="http://localhost:8080/process/MediationProcess/MoonCRMServiceOutput.owl">

<moon:CustomerLookupResponse rdf:ID="CustomerLookupResponse_X">

<moon:hasCustomer>

<moon:IdentifiedCustomer rdf:ID="Customer_X">

<moon:hasCustomerID rdf:datatype="http://www.w3.org/2001/XMLSchema#string">

<xsl:value-of select="/xmoon:SearchCustomerResponse/xmoon:customerId"/>

</moon:hasCustomerID>

<moon:hasBusinessName rdf:datatype="http://www.w3.org/2001/XMLSchema#string">

<xsl:value select="/xmoon:SearchCustomerResponse/xmoon:businessName"/>

</moon:hasBusinessName>

</moon:IdentifiedCustomer>

</moon:hasCustomer>

</moon:CustomerLookupResponse>

</rdf:RDF>

</xsl:template>

</xsl:stylesheet>]]>

</grounding:xsltTransformationString>

</grounding:WsdlOutputMessageMap>

</grounding:wsdlOutput>

</grounding:WsdlAtomicProcessGrounding>

</rdf:RDF>