SigSPARQL: Signals as a First-Class Citizen When Querying Knowledge Graphs
Schwarzinger, Steindl, Frühwirth et al.
Purpose: Cyber-Physical Systems (CPSs) integrate computation and physical processes, producing time series data from thousands of sensors. Knowledge graphs can contextualize these data, yet current approaches that are applicably to monitoring CPS rely on observation-based approaches. This limits the ability to express computations on sensor data, especially when no assumptions can be made about sampling synchronicity or sampling rates.
Methodology: We propose an approach for integrating knowledge graphs with signals that model run-time sensor data as functions from time to data. To demonstrate this approach, we introduce SigSPARQL, a query language that can combine RDF data and signals. We assess its technical feasibility with a prototype and demonstrate its use in a typical CPS monitoring use case.
Findings: Our approach enables queries to combine graph-based knowledge with signals, overcoming some key limits of observation-based methods. The developed prototype successfully demonstrated feasibility and applicability.
Value: This work presents a query-based approach for CPS monitoring that integrates knowledge graphs and signals, alleviating problems of observation-based approaches. By leveraging system knowledge, it enables operators to run a single query across different system instances within the same domain. Future work will extend SigSPARQL with additional signal functions and evaluate it in large-scale CPS deployments.
academic
SigSPARQL: Signals as a First-Class Citizen When Querying Knowledge Graphs
This paper proposes a novel approach combining knowledge graphs with signal processing to address data querying challenges in Cyber-Physical Systems (CPS) monitoring. Traditional observation-based methods have limitations in handling sensor data computations, particularly when dealing with asynchronous or inconsistent sampling rates. The authors introduce SigSPARQL, a query language that models runtime sensor data as time-to-data functions (signals), enabling unified querying of RDF data and signals. The technical feasibility is validated through a prototype system, and practical effectiveness is demonstrated in typical CPS monitoring use cases.
Core Problem: Cyber-Physical Systems generate large volumes of time-series sensor data that must be analyzed in conjunction with system context information. However, existing observation-based methods exhibit complexity and limitations in expressing sensor data computations.
Significance: With ongoing digitalization, CPS are widely deployed in buildings, energy networks, manufacturing, and other domains. Effective utilization of sensor data is critical for system analysis, monitoring, and control.
Limitations of Existing Methods:
Ontology-Based Data Access (OBDA) methods scatter temporal values from a single sensor across thousands of elements representing independent observations
Query complexity increases, requiring reconstruction of temporal value concepts from independent observations
Challenges in handling asynchronous time series, where observations with different timestamps are difficult to combine
Lack of unified conceptual modeling between observation values
Research Motivation: Introduce the Signal concept as a "first-class citizen" to abstract independent observations and overcome limitations of current methods in expressing sensor data computations.
Proposed a language-agnostic framework: Defined three core operators (Signal, ApplySF, LiftVal) for integrating knowledge graph query languages with signal processing
Designed the SigSPARQL query language: Extended SPARQL syntax and semantics to support signals as first-class citizens in query results
Constructed a prototype system: Implemented based on Oxigraph, validating technical feasibility
Provided comprehensive theoretical foundations: Based on Functional Reactive Programming (FRP) theory, establishing formal relationships between signals and time-series data
Demonstrated practical value: Proved applicability through an electric vehicle charging station monitoring use case
Input: Signal-annotated RDF dataset <D, S, φ>, where D is an RDF dataset, S is a set of RDF signals, and φ is a signal annotation function
Output: Time-stamped solution sequences (TSS) or continuously updated RDF graphs
Constraints: Support continuous queries and handle asynchronous sensor data streams
Bollen et al.'s Approach: Extends graph matching to support measurements and time-series patterns
Hybrid Data Model Research: Fusion of graph and time-series data
Differentiation Advantage of This Work: Models temporal values based on signals, leveraging FRP theory to address limitations of observation-based methods
The paper cites 36 relevant references covering key works in RDF stream processing, temporal data querying, and functional reactive programming, providing solid theoretical foundations and technical background for this research.
Overall Assessment: This is a high-quality database systems research paper making important contributions to knowledge graph query language extensions. It features solid theoretical foundations, reasonable technical solutions, and relatively complete implementation. While there is room for improvement in evaluation and performance aspects, it provides valuable new directions for related field development.