DELE: Deductive $\mathcal{EL}^{++}$ Embeddings for Knowledge Base Completion
Mashkova, Zhapa-Camacho, Hoehndorf
Ontology embeddings map classes, roles, and individuals in ontologies into $\mathbb{R}^n$, where similarity between entities can be computed or new axioms inferred. For ontologies in the Description Logic $\mathcal{EL}^{++}$, several optimization-based embedding methods have been developed that explicitly generate models of an ontology. However, these methods have two limitations. First, they do not distinguish between statements that are unprovable and statements that are provably false, and therefore they may use entailed statements as negatives. Second, they do not utilize the deductive closure of an ontology to identify statements that are inferred but not asserted. We evaluated a set of embedding methods for $\mathcal{EL}^{++}$ ontologies, incorporating several modifications that aim to make use of the ontology deductive closure. In particular, we designed novel negative losses that account both for the deductive closure and for different types of negatives, and we formulated evaluation methods for knowledge base completion. We demonstrate that our embedding methods improve over the baseline ontology embedding methods in the task of knowledge base or ontology completion.
This paper addresses limitations of ontology embedding methods for the description logic EL++ in knowledge base completion by proposing DELE (Deductive EL++ Embeddings). Existing geometric embedding methods can explicitly generate models of an ontology, but they have two critical issues: (1) they do not distinguish statements that are merely unprovable from statements that are provably false, so entailed statements may be used as negative samples; (2) they do not use the ontology's deductive closure to identify statements that are inferred but not asserted. The paper designs novel negative loss functions and evaluation methods that take the deductive closure into account, and reports improved knowledge base completion performance over the baseline methods.
Ontology embedding maps the classes, roles, and individuals of an ontology into $\mathbb{R}^n$, where entity similarity can be computed and new axioms inferred. For the description logic EL++, several optimization-based geometric embedding methods exist, such as ELEmbeddings, ELBE, and Box2EL.
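To make the geometric idea concrete, here is a minimal sketch of how a ball-based method in the spirit of ELEmbeddings might score a subsumption C ⊑ D: each class is an n-ball (center and radius), and the loss is zero when the ball of C lies inside the ball of D. The function name `ball_inclusion_loss`, the `margin` parameter, and the toy coordinates are assumptions for illustration; regularization terms used by real implementations are omitted, and ELBE and Box2EL use boxes rather than balls.

```python
import math

def ball_inclusion_loss(center_c, radius_c, center_d, radius_d, margin=0.0):
    """Hinge loss that is 0 when ball C is contained in ball D.

    Ball C is inside ball D exactly when
    dist(center_c, center_d) + radius_c <= radius_d.
    `margin` is a hypothetical slack parameter.
    """
    dist = math.dist(center_c, center_d)
    return max(0.0, dist + radius_c - radius_d - margin)

# Toy 2-D embeddings (made-up values, not learned).
dog = ((0.0, 0.0), 1.0)      # center, radius
animal = ((0.5, 0.0), 2.0)

loss_dog_in_animal = ball_inclusion_loss(*dog, *animal)   # Dog ⊑ Animal
loss_animal_in_dog = ball_inclusion_loss(*animal, *dog)   # Animal ⊑ Dog
print(loss_dog_in_animal, loss_animal_in_dog)
```

With these toy values the first loss is zero (the Dog ball fits inside the Animal ball) and the second is positive, which is the asymmetry a subsumption loss needs.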
Negative Sample Selection Problem: When existing methods select negative samples at random, they may pick statements that the ontology actually entails and treat them as negative examples, which degrades training quality.
Insufficient Utilization of Deductive Closure: Existing methods do not consider the ontology's deductive closure (the set of all statements derivable from it), so they cannot identify statements that are inferred but not asserted.
Limited Evaluation Methods: Current evaluation methods primarily derive from knowledge graph completion tasks and do not account for the rich entailment relationships in ontologies.
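The negative sampling issue can be illustrated with a small sketch. It assumes axioms are atomic subsumptions written as pairs `(C, D)` for C ⊑ D, and that a precomputed set `closure` of entailed pairs is available; the function name `sample_negative` and the toy ontology are hypothetical and not taken from the paper. Note that a pair outside the closure is only unprovable, not provably false; treating it as a negative is still a heuristic.

```python
import random

def sample_negative(positive, classes, closure, rng):
    """Corrupt the right-hand side of a subsumption (C, D) into (C, D').

    Only corruptions that are NOT in the deductive closure are allowed,
    so an entailed statement is never used as a negative.
    Returns None if every corruption is entailed.
    """
    sub, _ = positive
    candidates = [d for d in classes if (sub, d) not in closure]
    return (sub, rng.choice(candidates)) if candidates else None

# Toy data (hypothetical): Dog ⊑ Mammal ⊑ Animal, Plant unrelated.
classes = ["Dog", "Mammal", "Animal", "Plant"]
closure = {
    ("Dog", "Dog"), ("Dog", "Mammal"), ("Dog", "Animal"),
    ("Mammal", "Mammal"), ("Mammal", "Animal"),
    ("Animal", "Animal"), ("Plant", "Plant"),
}

rng = random.Random(0)
negative = sample_negative(("Dog", "Mammal"), classes, closure, rng)
print(negative)  # ('Dog', 'Plant') is the only non-entailed corruption
```

Unfiltered random corruption would pick the entailed statement Dog ⊑ Animal about a quarter of the time in this toy setting, which is the failure mode described above.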
Knowledge base completion is an important task requiring prediction of axioms that should be added to the knowledge base but are not yet represented. For formalized knowledge bases, this includes both deductive reasoning (predicting entailed axioms) and inductive reasoning (predicting novel non-entailed axioms). This paper aims to improve geometric embedding methods by better leveraging the deductive closure.
Proposed negative loss functions that consider the deductive closure: Designed new negative loss functions for all EL++ normal forms to avoid treating entailed statements as negative samples.
Developed a fast approximate algorithm for computing the deductive closure: Proposed a sound algorithm (every statement it derives is entailed) that approximates the deductive closure of EL++ theories and is used to improve negative sample selection during training.
Formulated evaluation methods considering deductive closure: Designed new evaluation metrics for knowledge base completion tasks that can distinguish prediction performance between entailed and non-entailed axioms.
Extended multiple geometric embedding methods: Applied improvements to three representative methods—ELEmbeddings, ELBE, and Box2EL—demonstrating generalizability.
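As a rough illustration of what a sound but incomplete closure computation can look like, the sketch below computes only the reflexive-transitive closure of atomic subsumptions C ⊑ D. It is not the paper's algorithm: it ignores conjunctions, existential restrictions, and the other EL++ constructs, and the name `subsumption_closure` is hypothetical. Every pair it returns is entailed (reflexivity and transitivity of ⊑ are valid), but it will miss many entailments of a real ontology.

```python
def subsumption_closure(axioms):
    """Reflexive-transitive closure of atomic subsumptions.

    `axioms` is an iterable of pairs (C, D) meaning C ⊑ D.
    Returns the set of all pairs derivable by reflexivity and transitivity.
    """
    supers = {}      # class -> set of directly asserted superclasses
    classes = set()
    for c, d in axioms:
        supers.setdefault(c, set()).add(d)
        classes.update((c, d))

    closure = set()
    for start in classes:
        seen = {start}           # reflexivity: start ⊑ start
        stack = [start]
        while stack:             # depth-first walk up the hierarchy
            node = stack.pop()
            for parent in supers.get(node, ()):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        closure.update((start, s) for s in seen)
    return closure

# Toy ontology (hypothetical): Dog ⊑ Mammal, Mammal ⊑ Animal.
asserted = [("Dog", "Mammal"), ("Mammal", "Animal")]
closure = subsumption_closure(asserted)
print(("Dog", "Animal") in closure)  # True: inferred but not asserted
```

A closure of this kind can be passed to a negative sampler as a filter, or used at evaluation time to separate entailed from non-entailed predictions.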
Knowledge base completion is defined as: given an EL++ ontology T, predict new axioms that should be added to T. The task can be further subdivided into:
Deductive Completion: Predicting axioms that are in the deductive closure $T^{\vdash}$ but not explicitly asserted in $T$
Inductive Completion: Predicting novel axioms not in the deductive closure
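The two subtasks can be separated mechanically once a deductive closure is available. The sketch below assumes axioms are hashable tuples and that `asserted` and `closure` are sets with `asserted` contained in `closure`; the function name `split_by_entailment` and the example axioms are hypothetical.

```python
def split_by_entailment(candidates, asserted, closure):
    """Partition candidate axioms into deductive and inductive targets.

    deductive: in the closure but not asserted (entailed, unasserted)
    inductive: not in the closure (novel, non-entailed)
    Axioms already asserted are skipped, since there is nothing to predict.
    """
    deductive, inductive = [], []
    for axiom in candidates:
        if axiom in asserted:
            continue
        if axiom in closure:
            deductive.append(axiom)
        else:
            inductive.append(axiom)
    return deductive, inductive

# Toy data (hypothetical): pairs (C, D) stand for C ⊑ D.
asserted = {("Dog", "Mammal"), ("Mammal", "Animal")}
closure = asserted | {("Dog", "Animal")}
candidates = [("Dog", "Mammal"), ("Dog", "Animal"), ("Dog", "Pet")]

deductive, inductive = split_by_entailment(candidates, asserted, closure)
print(deductive)  # [('Dog', 'Animal')]
print(inductive)  # [('Dog', 'Pet')]
```

Reporting ranking metrics separately on the two lists is one way to see whether a model is mostly recovering entailed knowledge or also predicting novel axioms.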
Accurate Problem Identification: Accurately identifies key issues in existing methods regarding negative sample selection and deductive closure utilization.
Reasonable Method Design: Proposed negative loss functions and filtering strategies are theoretically well-motivated.
Comprehensive Experiments: Validates method effectiveness across multiple datasets and tasks, including visualization analysis.
Theoretical Contribution: Provides a sound algorithm for deductive closure computation with theoretical value.
Strong Generalizability: Improvement strategies apply to multiple geometric embedding methods.
Limited Performance Gains: Improvements are modest on some tasks, potentially insufficient to justify additional complexity.
Computational Overhead: Deductive closure computation and negative sample filtering increase training time, but the paper insufficiently analyzes this overhead.
Benchmark Scale: Datasets used are relatively small; effectiveness on large-scale applications remains to be verified.
Insufficient Comparisons: Lacks comparison with recent LLM-based knowledge base completion methods.
The paper cites 50 related references covering important works in description logic, ontology embedding, knowledge graph completion, and related fields, providing a solid theoretical foundation for the research.