Toward Explainable NILM: Real-Time Event-Based NILM Framework for High-Frequency Data
Gerasimov, Kamyshev, Hoosh et al.
Non-Intrusive Load Monitoring (NILM) is an advanced, and cost-effective technique for monitoring appliance-level energy consumption. However, its adaptability is hindered by the lack of transparency and explainability. To address this challenge, this paper presents an explainable, real-time, event-based NILM framework specifically designed for high-frequency datasets. The proposed framework ensures transparency at every stage by integrating a z-score-based event detector, appliance signature estimation, Fourier-based feature extraction, an XG-Boost classifier, and post hoc SHAP analysis. The SHAP analysis further quantifies the contribution of individual features, such as cosine of specific harmonic phases, to appliance classification. The framework is trained and evaluated on the PLAID dataset, and achieved a classification accuracy of 90% while maintaining low computational requirements and a latency of less than one second.
academic
Toward Explainable NILM: Real-Time Event-Based NILM Framework for High-Frequency Data
Non-intrusive load monitoring (NILM) is an advanced and cost-effective technique for appliance-level energy consumption monitoring. However, its adoption is limited by lack of transparency and explainability. To address this challenge, this paper proposes an explainable, real-time, event-based NILM framework specifically designed for high-frequency datasets. The framework ensures transparency at each stage by integrating a z-score-based event detector, device signature estimation, Fourier-based feature extraction, XGBoost classifier, and post-hoc SHAP analysis. SHAP analysis further quantifies the contribution of individual features (such as cosine values of specific harmonic phases) to device classification. The framework is trained and evaluated on the PLAID dataset, achieving 90% classification accuracy while maintaining low computational requirements and latency below 1 second.
With continuous growth in energy consumption, load monitoring has become critical. In the European Union, buildings account for approximately 41% of electricity consumption. Traditional intrusive load monitoring (ILM) requires installing sensors on each device, which, while accurate, is expensive, difficult to install, and unsuitable for large-scale applications. NILM serves as an alternative approach, monitoring total energy consumption at a single point (typically the main meter) and using decomposition algorithms to estimate energy consumption of individual devices.
Lack of Explainability: Existing NILM methods, particularly deep learning models, exhibit "black-box" characteristics with limited transparency in decision-making processes
High Computational Complexity: Non-event-driven methods require more computational resources and large annotated datasets
Insufficient Real-Time Performance: Existing methods struggle to meet real-time application requirements on edge devices
Limited High-Frequency Data Processing: Existing explainable NILM research primarily focuses on low-frequency data (1 Hz)
This work aims to construct an end-to-end explainable NILM framework specifically for high-frequency data (kHz range), enabling real-time decomposition with minimum latency of 320/380 milliseconds (for 60/50 Hz power systems), to build trust in NILM systems and promote their adoption in practical applications.
First Explainable End-to-End Real-Time NILM Framework for High-Frequency Data: Proposes a complete explainable framework specifically designed for high-frequency datasets
Real-Time Edge Device Applicability: The method is applicable to real-time applications on edge devices with latency of 19/f₀ + τ seconds (f₀ is fundamental frequency, τ is execution time)
Optimized Feature Selection: Selects a small set of minimally correlated and interpretable device signature features, achieving 90% classification accuracy on real datasets
Open-Source Implementation: Publicly releases framework source code on GitHub, promoting research reproducibility and application
Input: High-frequency voltage and current signals (30 kHz sampling rate)
Output: Device on/off event detection and classification results with decision explanations
Constraints: Real-time processing requirements, low computational complexity, high explainability
where iₐ(k) is the current vector after the event and iᵦ is the current from one cycle before the event. Selects 1 cycle before and 18 cycles after the event for computation, then extracts the median cycle to obtain device signature estimation:
PLAID Dataset: Contains 575 high-frequency voltage-current measurements at 30 kHz sampling rate, covering 13 device types and at least 11 different brands.
Training Set: 164,510 single-cycle current signatures
Test Set: 2,347 activation/deactivation current signatures
Impact of Post-Event Cycles: Experiments demonstrate that accuracy reaches maximum with 18 post-event cycles, with additional cycles providing no significant performance improvement.
Confusion Matrix Analysis: The model performs well on most devices but confuses electric irons and light bulbs, as both are resistive loads with similar current consumption magnitudes. Air conditioners may be confused with refrigerators or fans depending on operating mode.
SHAP Explanation Example: For air conditioner classification, low cosine values of 1st, 3rd, 5th, and 9th harmonic phases indicate the signature is unlikely to correspond to an air conditioner, while low cosφ₇ values indicate correspondence to an air conditioner.
Compared to existing work, this paper provides the first complete explainable framework for high-frequency data with higher computational efficiency and real-time performance.
The paper cites 20 important references covering NILM fundamentals, machine learning methods, explainable AI techniques, and related datasets, providing solid theoretical foundation for the research.
Overall Assessment: This is a high-quality technical paper making important contributions to the explainable NILM field. The framework design is sound, experimental validation is comprehensive, and it possesses good practical and academic value. Future work should strengthen cross-dataset validation and comparative analysis with advanced methods.