Benjamin N. Grosof
MIT Sloan School of Management
Description Logic Programs: Combining Logic Programs with Description LogicCambridge,
MA, USA
bgrosof@mit.edu
Ian Horrocks
University of Manchester
Manchester, United Kingdom
horrocks@cs.man.ac.uk
Raphael Volz
University of Karlsruhe
Karlsruhe, Germany
volz@fzi.de
Stefan Decker
USC ISI
Los Angeles, CA, USA
stefan@isi.edu
Copyright is held by the
author/owner(s).
WWW2003, May 20-24, 2003, Budapest, Hungary.
ACM 1-58113-680-3/03/0005.
We show how to perform DLP-fusion: the bidirectional translation of premises and inferences (including typical kinds of queries) from the DLP fragment of DL to LP, and vice versa from the DLP fragment of LP to DL. In particular, this translation enables one to ``build rules on top of ontologies'': it enables the rule KR to have access to DL ontological definitions for vocabulary primitives (e.g., predicates and individual constants) used by the rules. Conversely, the DLP-fusion technique likewise enables one to ``build ontologies on top of rules'': it enables ontological definitions to be supplemented by rules, or imported into DL from rules. It also enables available efficient LP inferencing algorithms/implementations to be exploited for reasoning over large-scale DL ontologies.
The challenge we address in this paper is how and why to combine rules with ontologies for the Semantic Web (SW). In this paper, we focus on meeting key requirements for such a combination of rules and ontologies by establishing the basis for a combined logical knowledge representation (KR) formalism. We start from the current draft standards for ontologies (DAML+OIL) [6] and for rules (RuleML) [3] in the Semantic Web context, and show how aspects of each language can be translated to the other. Both standards correspond with established KR formalisms: Description Logic (DL) in the case of DAML+OIL, and (declarative) logic programs (LP) in the case of RuleML.^{1} This correspondence allows us to exploit results w.r.t. the mapping of each KR to classical First Order Logic (FOL).
A mapping between ontology and rule languages is important for many aspects of the Semantic Web:
Language layering The Semantic Web can be viewed as largely about ``KR meets the Web''. Over the last two years or so, a broad consensus has evolved in the Semantic Web community that the vision of the Semantic Web includes, specifically, rules as well as ontologies. A key requirement for the Semantic Web's architecture overall, then, is to be able to layer rules on top of ontologies--in particular to create and reason with rule-bases that mention vocabulary specified by ontology-based knowledge bases--and to do so in a semantically coherent and powerful manner.
Querying The capabilities of ontology languages with respect to instances can be rather low, and even conjunctive queries--the least expressive query language usually considered in database research--are often not supported [4]. This area is a stronghold of rules, which offer extensive facilities for instance reasoning. Hence, it is interesting to consider combining DLs with the rule paradigm in order to state expressive instance queries w.r.t. terminological knowledge bases.
Data integration The majority of today's data resides in relational databases. As the Semantic Web grows in importance, people will probably start exporting their data according to some chosen ontology. This essentially leads to data that is replicated in order to enable ontology-based processing, e.g., by reading the exported files into a classifier such as FaCT [12] or RACER [10]. Logic programming systems such as XSB [16], however, can access databases directly through built-in predicates. Furthermore, restricted variants of logic programs, such as the ones established in this paper, can be directly implemented on top of SQL99-compliant relational databases. Hence, an LP-based implementation of an ontology language allows a closer interaction with live data.
Semantic Web Services A task-oriented motivation for combining rules with ontologies arises from the efforts to design and build Semantic Web Services (SWS). Semantic Web Services attempt to describe services in a knowledge-based manner in order to use them for a variety of purposes, including: discovery and search; selection, evaluation, negotiation, and contracting; composition and planning; execution; and monitoring. Both rules and ontologies are necessary for such service descriptions and play complementary roles: while ontologies are useful for representing hierarchical categorisation of services overall and of their inputs and outputs, rules are useful for representing contingent features such as business policies, or the relationship between preconditions and postconditions.
The established correspondence is used to define a new intermediate KR called Description Horn Logic (DHL), which is contained within the intersection, and the closely related Description Logic Programs (DLP), which can be viewed as DHL with a moderate weakening as to the kinds of conclusion can be drawn.
Figure 1 illustrates the relationship between the various KRs and their expressive classes. DL and Horn are strict (decidable) subsets of FOL. LP, on the other hand, intersects with FOL but neither includes nor is fully included by FOL. For example, FOL can express (positive) disjunctions, which are inexpressible in LP. On the other hand, several expressive features of LP, which are frequently used in practical rule-based applications, are inexpressible in FOL (and consequently also outside of def-Horn). One example is negation-as-failure, a basic kind of logical non-monotonicity. Another example is procedural attachments, e.g., the association of action-performing procedural invocations with the drawing of conclusions about particular predicates.
Description Logic Programs, our newly defined intermediate KR, is contained within the intersection of DL and LP. ``Full'' LP, including non-monotonicity and procedural attachments, can thus be viewed as including an ``ontology sub-language'', namely the DLP subset of DL.
Rather than working from the intersection as we do in this paper, one may instead directly address the expressive union of DL and LP by studying the expressive union of DL and LP within the overall framework of FOL. This is certainly an interesting thing to do. However, to our knowledge, this has not yet been well characterised theoretically, e.g., it is unclear how, if at all, such a union differs from full FOL.
Full FOL has some significant practical and expressive drawbacks as a KR in which to combine DL and rules. First, full FOL has severe computational complexity: it is undecidable in the general case, and intractable even under the Datalog restriction (see Section 3.2). Second, it is not understood even at a basic research level how to expressively extend full FOL to provide non-monotonicity and procedural attachments; yet these are crucial expressive features in many (perhaps most) practical usages of rules. Third, full FOL and its inferencing techniques have severe practicable limitations since it is unfamiliar to the great majority of mainstream software engineers, whereas rules (e.g., in the form of SQL-type queries, or Prolog) are familiar conceptually to many of them.
Via the DLP KR, we give a new technique to combine DL and LP. We show how to perform DLP-fusion: the bidirectional mapping of premises and inferences (including typical kinds of queries) from the DLP fragment of DL to LP, and from the DLP fragment of LP to DL. DLP-fusion allows us to fuse the two logical KRs so that information from each can be used in the other. The DLP-fusion technique promises several benefits. In particular, DLP-fusion enables one to ``build rules on top of ontologies'': it enables the rule KR to have access to DL ontological definitions for vocabulary primitives (e.g., predicates and individual constants) used by the rules. Conversely, the technique enables one to ``build ontologies on top of rules'': it enables ontological definitions to be supplemented by rules, or imported into DL from rules. It also enables efficient LP inferencing algorithms/implementations, e.g., rule or relational database engines, to be exploited for reasoning over large-scale DL ontologies.
In this section we will introduce Horn Logic, Description Logic (DL) and the DL based ontology language DAML+OIL. In particular, we will describe their syntax and formalise their meaning in terms of classical First Order Logic (FOL).
DAML+OIL is an ontology language designed for use on the (semantic) web. Although DAML+OIL is syntactically ``layered'' on top of RDFS, semantically it is layered on a subset of RDFS. This subset does not include RDFS's recursive meta model (i.e., the unrestricted use of the type relation), but instead treats RDFS as a very simple DL supporting only atomic class names. Like other DLs, this ``DAML+OIL subset'' of RDFS corresponds to a fragment of classical FOL, making it much easier to develop mappings to rule languages as well as to DLs. From now on, when we talk about RDFS, we will be referring to the DAML+OIL subset of RDFS.
DAML+OIL is equivalent to a very expressive DL--in fact it is equivalent to the DL [13, 11]. In addition to ``abstract'' classes and individuals, DAML+OIL also supports the use of ``concrete'' datatypes and data values (the in ). In this paper, however, we will restrict our attention to the abstract part of the language, which corresponds to the DL.
Figure 2 shows how DAML+OIL statements correspond to axioms, where (possibly subscripted) is a class, (possibly subscripted) is a property, is the inverse of , is the transitive closure of , (possibly subscripted) is an individual and is an abbreviation for for some class (i.e., the most general class, called ``Thing'' in DAML+OIL).
It can be seen that all DAML+OIL statements can be reduced to class/property inclusion axioms and ground facts (asserted class-instance and instance-property-instance relationships).^{2} In the case of transitiveProperty, however, the axiom is taken to be equivalent to asserting that is a transitive property (like DAML+OIL, does not include the transitive closure operator).
As in any DL, DAML+OIL classes can be names (URIs) or expressions, and a variety of constructors are provided for building class expressions. Figure 3 summarises the available constructors and their correspondence with class expressions.
The meaning of is usually given by a model theory [13]. However, can also be seen in terms of a correspondence to FOL, where classes correspond to unary predicates, properties correspond to binary predicates and subclass/property axioms correspond to implication [7, 4].
To be more precise, individuals are equivalent to FOL constants, classes and class expressions are equivalent to FOL formulae with one free variable, and properties (and property expressions when supported by the DL) are equivalent to FOL formulae with two free variables. Class and property inclusion axioms are equivalent to FOL sentences consisting of an implication between two formulae with the free variables universally quantified at the outer level. E.g., a DL axiom of the form is equivalent to a FOL sentence of the form . DL axioms of the form and correspond to ground atoms and . Finally, DL axioms asserting the transitivity of a property , the functionality of a property and that property is the inverse of property are equivalent to FOL sentences of the form , and respectively.
Figure 4 summarises the above equivalences and shows the FOL formulae corresponding to the DL class expressions described in Figure 3, where are constants, and is the free variable. These formulae can be composed in the obvious way, e.g., .
As a notational convention we will, throughout the paper, use and for constants and , , and for variables.
Declarative logic programs (LP) is the KR whose semantics underlies in a large part the four families of rule systems that are currently most commercially important--SQL relational databases, OPS5-heritage production rules, Prolog, and Event-Condition-Action rules--as well as the proposals for rules in context of the Semantic Web.
As mentioned before, the commonly used expressiveness of full LP includes features, notably negation-as-failure/priorities and procedural attachments, that are not expressible in FOL, much less in DL. We thus concentrate on only an expressive portion of LP.
An ordinary (a.k.a. ``normal''^{3}) LP is a set of
rules each having the form:
where , are
atoms (atomic formulae), and
. Note that no
restriction is placed on the arity of the predicates appearing in
these atoms. Logical variables, and logical functions (with any
arity), may appear unrestrictedly in these atoms.
is called the head (a.k.a.
consequent) of the rule;
is called the body (a.k.a. antecedent) of the
rule.
is to be read as ``if'', so that the overall rule should be read
as ``[head] if [body]'', i.e., ``if [body] then [head]''. If
, then the body is empty, i.e.,
, and notationally the ``
'' is often omitted. A fact is a rule whose body is empty
and whose head is a ground atom.
stands for negation-as-failure, a logically non-monotonic form of
negation whose semantics differs, in general, significantly from
the semantics of classical negation (). Intuitively,
means
`` is not believed'' (i.e., is
unknown or false), whereas
means `` is false''. Intuitively, each rule
can be viewed as universally quantified at the outer level. More
precisely, each rule can be viewed as standing for the set of all
its ground instantiations.
A definite LP is an ordinary LP in which
negation-as-failure does not appear, i.e., a set of rules each
having the form:
where , are
atoms, and .
Definite LP is closely related syntactically and semantically to
the Horn fragment of FOL, a.k.a. Horn-clause logic. A clause in FOL
has the form:
where each is a (classical) literal. A
literal has either the form (1)
or (2)
, where
is an atom. The literal is said to be positive in case
(1), or to be negative in case (2). A clause is said to be
Horn when at most one of its literals is
positive. A Horn clause is said to be definite when
exactly one of its literals is positive. A definite Horn
clause is also known as a Horn rule. A definite Horn
clause, a.k.a. Horn rule, can thus be written in the
form:
where , are
atoms, and . We say that this Horn rule
corresponds to the definite LP rule that has the same
syntactic form, and vice versa. Likewise, we say that a Horn
ruleset and a definite LP ruleset
correspond to each other when
their rules do (isomorphically). We then also say that
is the
LP-correspondent of , and
conversely that is the
Horn-correspondent of .
As mentioned above, it is implicit in this notation that all
logical variables are universally quantified at the outer level,
i.e., over the scope of the whole clause. E.g., the rule
can be written equivalently as:
.
Note the similarity with the FOL equivalent of a DL inclusion
(subClassOf) axiom given in
Figure 4.
An LP rule or Horn clause is said to be equality-free when the equality predicate does not appear in it. Likewise, each is said to be Datalog when no logical functions (of arity greater than zero) appear in it.^{4}
The semantics of an ordinary LP is defined to be a
conclusion set, where each conclusion is a ground atom,
i.e., fact, entailed by the LP. Formally, the semantics of a
definite LP is defined as follows. Let
stand for the Herbrand base of
. The conclusion set
is the smallest (w.r.t. set
inclusion) subset of such that for any rule
,
if
then
.
The relationship of LP semantics to FOL semantics is relatively simple to describe for the case of definite equality-free Datalog LP, which we call def-LP. The syntactically corresponding fragment of FOL is definite equality-free Datalog Horn FOL, which we call def-Horn. Let be a def-LP. Let stand for the corresponding def-Horn ruleset. The conclusion set of then coincides with the minimal (w.r.t. set inclusion) Herbrand model of .
Hence, the def-LP and the def-Horn ruleset entail exactly the
same set of facts. Every conclusion of the def-LP is also a
conclusion of the def-Horn ruleset. Relative to the def-Horn
ruleset, the def-LP is thus sound; moreover, it is complete for
fact-form conclusions, i.e., for queries whose answers amount to
conjunctions of facts. However, the def-LP is a mildly
weaker version of the def-Horn ruleset, in the following
sense. Every conclusion of the def-LP must have the form of a fact.
By contrast, the entailments, i.e., conclusions, of the def-Horn
ruleset are not restricted to be facts. E.g., suppose
consists of the two
rules
and
.
Then it entails
(a non-unit derived clause) whereas does
not. In practical applications, however, quite often only the
fact-form conclusions are desired, e.g., an application might be
interested above only in whether or not
is
entailed. The def-LP has the virtue of conceptual and computational
simplicity. Thinking in terms of expressive classes, we will view
def-LP as an expressive subset of def-Horn--we will call
it the expressive f-subset. def-LP is a mild weakening of
def-Horn along the dimension of entailment power, permitting only
fact-form conclusions--we will call this f-weakening.
In return for this f-weakening, def-LP has some quite attractive computational characteristics (as well as being expressively extensible in directions that FOL is not, as discussed earlier). For the propositional case of def-LP, exhaustive inferencing is where -- i.e., worst-case linear time [8]. For the general case with logical variables, the entire conclusion set of a def-LP can be computed in time , when there is a constant bound on the number of logical variables per rule (this restriction, which we will call VB, is typically met in practise). Inferencing in def-LP is thus tractable (worst-case polynomial time) given VB. In contrast, DLs are generally not tractable (typically ExpTime or even NExpTime complexity for key inference problems), and full FOL is not decidable.
In this section we will discuss how DL languages (e.g., DAML+OIL) can be mapped to def-Horn, and vice versa.
We will first discuss the expressive restrictions of DL and def-Horn as these will constrain the subset of DL and def-Horn for which a complete mapping can be defined.
DLs are decidable subsets of FOL where the decidability is due in large part to their having (a form of) the tree model property [19].^{5} This property says that a DL class has a model (an interpretation in which is non-empty) iff has a tree-shaped model, i.e., one in which the interpretation of properties defines a tree shaped directed graph.
This requirement severely restricts the way variables and
quantifiers can be used. In particular, quantifiers must be
relativised via atomic formulae (as in the guarded fragment
of FOL [9]), i.e., the
quantified variable must occur in a property predicate along with
the free variable (recall that DL classes correspond to formulae
with one free variable). For example, the DL class corresponds to the FOL formula
, where
the property predicate acts as a guard. One obvious
consequence of this restriction is that it is impossible to
describe classes whose instances are related to another anonymous
individual via different property paths. For example, it is
impossible to assert that individuals who live and work at the same
location are ``HomeWorkers''. This is easy with a Horn rule,
e.g.:
Definite Horn FOL requires that all variables are universally
quantified (at the outer level of the rule), and restricts logical
connectives in certain ways. One obvious consequence of the
restriction on quantifiers is that it is impossible to assert the
existence of individuals whose identity might not be known. For
example, it is impossible to assert that all persons have a father
(known or unknown). This is easy with a DL axiom, e.g.:
RDFS provides a subset of the DL statements described in Section 3.1: subclass, subproperty, range, and domain statements (which in a DL setting are often called Tbox axioms); and asserted class-instance (type) and instance-property-instance relationships (which in a DL setting are often called Abox axioms).
As we saw in Section 3.1, a DL inclusion axiom corresponds to an FOL implication. This leads to a straightforward mapping from class and property inclusion axioms to def-Horn rules as follows:
As shown in Figure 2, RDFS range and domain statements correspond to DL axioms of the form (range of is ) and (domain of is ). From Figure 4, we can see that these are equivalent to the FOL sentences and , which can be simplified to and respectively. These FOL sentences are already in def-Horn form, which gives us the following mappings for range and domain:
Finally, asserted class-instance (type) and instance-property-instance relationships, which correspond to DL axioms of the form and respectively (Abox axioms), are equivalent to FOL sentences of the form and , where and are constants. These are already in def-Horn form: they are simply rules with empty bodies (which are normally omitted):
DAML+OIL extends RDF with additional statements about classes and properties (Tbox axioms). In particular, it adds explicit statements about class, property and individual equality and inequality, as well as statements asserting property inverses, transitivity, functionality (unique) and inverse functionality (unambiguous).
As discussed in Section 3.1, class and property equivalence axioms can be replaced with a symmetrical pair of inclusion axioms, so they can be mapped to a symmetrical pair of def-Horn rules as follows:
As we saw in Section 3.1, the semantics of inverse axioms of the form are captured by FOL sentences of the form , and the semantics of transitivity axioms of the form are captured by FOL sentences of the form . This leads to a direct mapping into def-Horn as follows:
As we saw in Section 3.1, DL axioms asserting the functionality of properties correspond to FOL sentences with equality. E.g., a DL axiom ( is a functional property) corresponds to the FOL sentence .^{7} This kind of axiom cannot be dealt with in our current framework (see Section 4.1) as it would require def-Horn rules with equality in the head, i.e., rules of the form .
In the previous section we showed how DL axioms correspond with def-Horn rules, and how these can be used to make statements about classes and properties. In DLs, the classes appearing in such axioms need not be atomic, but can be complex compound expressions built up from atomic classes and properties using a variety of constructors. A great deal of the power of DLs derives from this feature, and in particular from the set of constructors provided.^{8} In the following section we will show how these DL expressions correspond to expressions in the body of def-Horn rules.
In the following we will, as usual, use to denote classes, to denote properties and to denote an integer.
A DL class can be formed by conjoining existing classes, e.g.,
. From
Figure 4 it can be
seen that this corresponds to a conjunction of unary predicates.
Conjunction can be directly expressed in the body of a def-Horn
rule. E.g., when a conjunction occurs on the l.h.s. of a subclass
axiom, it simply becomes conjunction in the body of the
corresponding rule
Similarly, when a conjunction occurs on the r.h.s. of a subclass
axiom, it becomes conjunction in the head of the corresponding
rule:
A DL class can be formed from a disjunction of existing classes,
e.g., . From
Figure 4 it can be
seen that this corresponds to a disjunction of unary predicates.
When a disjunction occurs on the l.h.s. of a subclass axiom it
simply becomes disjunction in the body of the corresponding
rule:
When a disjunction occurs on the r.h.s. of a subclass axiom it becomes a disjunction in the head of the corresponding rule, and this cannot be handled within the def-Horn framework.
In a DL, the universal quantifier can only be used in
restrictions--expressions of the form (see
Section 4.1).
This is equivalent to an FOL clause of the form
(see Figure 4).
must be a single primitive property,
but may be a compound expression.
Therefore, when a universal restriction occurs on the r.h.s. of a
subclass axiom it becomes an implication in the head of the
corresponding rule:
When a universal restriction occurs on the l.h.s. of a subclass axiom it becomes an implication in the body of the corresponding rule. This cannot, in general, be mapped into def-Horn as it would require negation in a rule body.
In a DL, the existential quantifier (like the universal quantifier) can only be used in restrictions of the form . This is equivalent to an FOL clause of the form (see Figure 4). must be a single primitive property, but may be a compound expression.
When an existential restriction occurs on the l.h.s. of a
subclass axiom, it becomes a conjunction in the body of a standard
def-Horn rule:
When an existential restriction occurs on the r.h.s. of a subclass axiom, it becomes a conjunction in the head of the corresponding rule, with a variable that is existentially quantified. This cannot be handled within the def-Horn framework.
These constructors cannot, in general, be mapped into def-Horn. The case of negation is obvious as negation is not allowed in either the head or body of a def-Horn rule. As can be seen in Figure 4, cardinality restrictions correspond to assertions of variable equality and inequality in FOL, and this is again outside of the def-Horn framework.
In some cases, however, it would be possible to simplify the DL expression using the usual rewriting tautologies of FOL in order to eliminate the offending operator(s). For example, negation can always be pushed inward by using a combination of De Morgan's laws and equivalences such as and [1]. Further simplifications are also possible, e.g., using the equivalences , and . For the sake of simplicity, however, we will assume that DL expressions are in a canonical form where all relevant simplifications have been carried out.
As we saw in Section 4.3, some DL constructors (conjunction and universal restriction) can be mapped to the heads of rules whenever they occur on the r.h.s. of an inclusion axiom, while some DL constructors (conjunction, disjunction and existential restriction) can be mapped to the bodies of rules whenever they occur on the l.h.s. of an inclusion axiom. This naturally leads to the definition of two DL languages, classes from which can be mapped into the head or body of LP rules; we will refer to these two languages as and respectively.
The syntax of the two languages is defined as follows. In both languages an atomic name is a class, and if and are classes, then is also a class. In , if is a class and is a property, then is also a class, while in , if are classes and is a property, then and are also classes.
Using the mappings from
Section 4.3, we
can now follow the approach of
[4] and define a recursive
mapping function which takes a DL
axiom of the form
, where
is an -class and
is an -class, and maps it into
an LP rule of the form
. The mapping is defined as follows:
As we saw in Section 4.3, rules of the form are rewritten as two rules and ; rules of the form are rewritten as ; and rules of the form are rewritten as two rules and .
For example, would map the DL
axiom
We call the intersection of
and , i.e., the language where an atomic name
is a class, and if
and are classes, then is also a class. We then extend
to deal with axioms of the
form , where
and are both -classes:
As we saw in
Section 4.2.1,
range and domain axioms
and
are
mapped into def-Horn rules of the form
and
respectively. Moreover, class-instance and
instance-property-instance axioms and
are mapped into
def-Horn facts (i.e., rules with empty bodies) of the form
and
respectively. We therefore extend to
deal with these axioms in the case that is an
-class:
Finally, we extend to deal with the
property axioms discussed in
Section 4.2:
Using the relationships of (full) DL to FOL discussed in Section 3.1, especially Figure 4, it is straightforward to show the following.
DHL can, therefore, be viewed alternatively and precisely as an expressive fragment of def-Horn -- i.e., as the range of .
Although the asymmetry of DHL (w.r.t. classes on different sides of axioms) makes it rather unusual by DL standards, it is easy to see that it includes (the DAML+OIL subset of) RDFS, as well as that part of DAML+OIL which corresponds to a simple frame language.
As far as RDFS is concerned, we saw in Section 4.2.1 that RDFS statements are equivalent to DL axioms of the form , , , , and , where are classes, are properties and are individuals. Given that all RDFS classes are -classes (they are atomic class names), a set of DL axioms corresponding to RDFS statements would clearly satisfy the above definition of a DHL ontology.
DHL also includes the subset of DAML+OIL corresponding to simple frame language axioms, i.e., axioms defining a primitive hierarchy of classes, where each class is defined by a frame. A frame specifies the set of subsuming classes and a set of slot constraints. This corresponds very neatly to a set of DL axioms of the form .
Moreover, DHL supports the extension of this language to include equivalence of conjunctions of atomic classes, and axioms corresponding to DAML+OIL transitive property, and inverse property statements.
A DLP is directly defined as the LP-correspondent of a def-Horn ruleset that results from applying the mapping . Semantically, a DLP is thus the f-weakening of that DHL ruleset (recall subsection 3.2). The DLP expressive class is thus the expressive f-subset of DHL. By Theorem 1, DLP can, therefore, be viewed alternatively and precisely as an expressive subset of DL, not just of def-Horn.
In summary, expressively DLP is contained in DHL which in turn is contained in the expressive intersection of DL and Horn.
As our discussion of expressive relationships has made clear, there is a bi-directional semantic equivalence of (1) the DHL fragment of DL and (2) the DHL fragment of def-Horn. Likewise, there is a bi-directional semantic equivalence of the DLP fragment of DL and the DLP fragment of def-Horn. So far, however, we have mostly concentrated on only one direction of syntactic mapping: from DL syntax to def-Horn syntax (and to the corresponding def-LP), rather than from def-Horn (or def-LP) to DL. Next, we elucidate our reasons for this emphasis.
First, a prime immediate goal for the Semantic Web is to enable rules (in LP / Horn) on top of ontologies (in DL) -- more than vice versa to enable DL ontologies on top of LP or Horn rules. Second, it is desirable to exploit the relatively numerous, mature, efficient, scalable algorithms and implementations (i.e., engines) already available for LP inferencing so as to perform some fragment of DL inferencing -- more than vice versa to perform LP via the fewer available DL engines, which are designed to handle more expressive languages (than DLP) and may not be optimised for DLP ontologies. Third, as compared to def-Horn, DL has a relatively detailed set of quite specific syntactic expressive constructs; it was easier to go through these one by one to define a translation mapping than to do so in the reverse direction where one has to invent more structure/forms.
We do not have space here to give detailed algorithms and computational complexity analyses of the syntactic translations. We will limit ourselves to some relatively high-level observations; these are straightforward to show. The mapping, from DL syntax to def-Horn/def-LP syntax, corresponds immediately to an algorithm whose computational complexity is tractable. This mapping is invertible (e.g., in the usual manner of parsers) from def-Horn/def-LP syntax to DL syntax, again, tractably.
As discussed in the previous section, one of the prime goals of this work is to enable some fragment of DL inferencing to be performed by LP engines. In this section we will discuss the kinds of inference typically of interest in DL and LP, and how they can be represented in each other, i.e., in LP and DL respectively. Although the emphasis is on performing DL inferencing, via our mapping translation, using an LP reasoning engine, the reverse mapping can be used in order to perform LP inferencing using a DL reasoning engine. In particular, we will show how inferencing in (the DHL fragment of) DL can be reduced, via our translation, to inferencing in LP; and how vice versa, inferencing in (the DLP fragment of) LP can be reduced to inferencing in DL.
In a DL reasoning system, several different kinds of query are typically supported w.r.t. a knowledge base . These include queries about classes:
In addition, there are similar queries about properties: property-instance membership, property subsumption, property hierarchy, and property satisfiability. We will call the language defined by the above kinds of DL queries.
In LP reasoning engines, there is one basic kind of query supported w.r.t. a ruleset : atom queries. These include:
We call the language defined by the above kinds of LP queries.
Next, we discuss how to reduce querying in (the DHL fragment of) DL to querying in (the DLP fragment of) LP using the mapping . We will assume that is a ruleset derived from a DL knowledge base via , and that all queries are w.r.t. .
(ground or open) atom queries can be used to answer (ground or open) class-instance membership queries when the class is an -class, i.e., is an instance of iff entails . When is an atomic class name, the mapping leads directly to a atom query. When is a conjunction, the result is a conjunction of atom queries, i.e., is an instance of iff entails and entails . When is a universal restriction, the mapping gives . This can be transformed into a atom query using a simple kind of skolemisation, i.e., is replaced with a constant , where is new in , and we have is an instance of iff entails .
The case of property-instance membership queries is trivial as all properties are atomic: is an instance of iff entails .
Complete information about class-instance relationships, to answer open or ``all-classes'' class-instance queries, can then be obtained via class-instance queries about all possible combinations of individuals and classes in .^{9} (Note that the set of named individuals and classes is known, and its size is worst-case linear in the size of the knowledge/rule base.)
For -classes, class subsumption queries can be reduced to using a similar technique to class-instance membership queries, i.e., is a subclass of iff entails , for new in . For property subsumption queries, is a subproperty of iff entails , for new in .
Complete information about the class hierarchy can be obtained by computing the partial ordering of classes in based on the subsumption relationship.
In the DHL (and DLP) fragment, determining class/property satisfiability is a non-issue as, with the expressive power at our disposal in def-Horn, it is impossible to make a class or a property unsatisfiable.
Now let us consider the reverse direction from to . In the DLP fragment of LP, every predicate is either unary or binary. Every atom query can thus be viewed as about either a named class or a property. Also, generally in LP, any open atom query is formally reducible to a set of ground atom queries--one for each of its instantiations. Thus is reducible to class-instance and property-instance membership queries in DL.
To recap, we have shown the following.
To compute the answer for user queries the translated views are used. This realises a form of bottom up processing, since the queries involved in view definitions are performed on the extensional data and intermediate results are propagated up to a final query, which is the user query. This results in many irrelevant facts being computed in the intermediate steps; more efficient procedures based on sideways information passing, however, have been developed in the deductive database literature.
The above mentioned strategy is, however, not possible for recursively defined rules. Here additional processing is required.
Usually, binary recursive rules such as transitivity can be
rewritten into a linear form. E.g. the mapping for transitive
properties (see 1)
can be rewritten into
The usual strategy to compute the remaining forms of recursive rules in relational databases is in-memory processing using some iterative strategy, e.g. the magic template procedure [15].
We can, however, translate this case into the database by exploiting the observation that this form of recursion decomposes into unions, since no join processing of intermediate results (such as involved in computing the transitive closure of transitive properties) is necessary. This is immediately clear for classes, since they are monadic predicates. A closer look at all axioms where binary predicates (properties) are in the head reveals the same. Hence, these cyclic references can be implemented via an algorithm that detects equivalence classes (each constituted by a cycle) in graphs. All incoming edges to an equivalence class must be duplicated to all members of the equivalence class; this may done by using a new intermediate predicate to collect the incoming edges and deriving the members of the equivalence class from this intermediate predicate. Afterwards, all rules that constitute the cyclic references within the equivalence class may safely be removed. The reader may note that this can also be performed (with appropriate adaptions) on the cyclic references imposed by inverse properties.
We have used the above techniques to realise a prototypical implementation of Description Horn Logic based on the Datalog engine written by Boris Motik in the KAON project. The implementation, called Bubo (after the Latin name of the biological genus of eagle owls), is freely available at http://kaon.semanticweb.org/owl/.
Initial tests of Bubo have been encouraging, but much more work needs to be done in order to determine if the benefits promised by the DLP-fusion approach are delivered by this implementation.
In this paper we have shown how to interoperate, semantically and inferentially, between the leading Semantic Web approaches to rules (RuleML Logic Programs) and ontologies (OWL/DAML+OIL Description Logic). We have begun by studying two new KRs, Description Logic Programs (DLP), which is defined by the expressive intersection of the two approaches, and the closely related Description Horn Logic (DHL).
We have shown that DLP (or DHL) can capture a significant fragment of DAML+OIL, including the whole of the DAML+OIL fragment of RDFS, simple frame axioms and more expressive property axioms. The RDFS fragment of DL permits: stating that a class D is a Subclass of a class E; stating that the Domain of a property P is a class C; stating that the Range of a property P is a class C; stating that a property P is a Subproperty of a property Q; stating that an individual b is an Instance of a class C; and stating that a pair of individuals (a,b) is an Instance of a property P. Additional DLP expressively permits (within DL): using the Intersection connective (conjunction) within class descriptions (i.e., in C, D, or E above); using the Union connective (disjunction) within subclass descriptions (i.e., in D above); using (a restricted form of) Universal quantification within superclass descriptions (i.e., in E above); using (a restricted form of) Existential quantification within subclass descriptions (i.e., in D above); stating that a property P is Transitive; stating that a property P is Symmetric; and stating that a property P is the Inverse of a property Q.
Many of the ontologies in the DAML ontology library are inside this fragment of DAML+OIL. An immediate result of this work is that LP engines could be used for reasoning with these ontologies and for reasoning with (possibly very large numbers of) facts, such as web page annotations, that use vocabulary from these ontologies.
This work represents only a first step in realising a more complete interoperability between rules and ontologies, and the layering of rules on top of ontology languages in the Semantic Web ``stack''. We were able to illustrate its utility both theoretically and within our prototypical implementation. We believe, however, that our study of the expressive intersection will provide a firm foundation for future investigations of more expressive languages up to and including the expressive union of rules and ontologies.
Future work will include extending the mapping to additional DL primitives, in particular those which require the ability to state and derive the equality of individuals, such as cardinality restrictions (including functional properties) and nominals (extensionally defined classes).
Thanks to Tim Berners-Lee, Harold Boley, Dan Connolly, Michael Dean, Richard Fikes, Patrick Hayes, Jim Hendler, Deborah McGuinness, Boris Motik, Daniel Oberle, Peter Patel-Schneider, Jos De Roo, Steffen Staab and members of the DAML+OIL Joint Committee for helpful comments and discussions.
This document was generated using the LaTeX2HTML translator Version 2K.1beta (1.50)
Copyright © 1993, 1994, 1995, 1996,
Nikos
Drakos, Computer Based Learning Unit, University of
Leeds.
Copyright © 1997, 1998, 1999,
Ross Moore,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html -numbered_footnotes -split 0
p117-grosof
The translation was initiated by on 2003-03-28