Optimised neural networks for online processing of ATLAS calorimeter data on FPGAs
Aad, Bertrand, Laatu et al.
A study of neural network architectures for the reconstruction of the energy deposited in the cells of the ATLAS liquid-argon calorimeters under high pile-up conditions expected at the HL-LHC is presented. These networks are designed to run on the FPGA-based readout hardware of the calorimeters under strict size and latency constraints. Several architectures, including Dense, Recurrent (RNN), and Convolutional (CNN) neural networks, are optimised using a Bayesian procedure that balances energy resolution against network size. The optimised Dense, CNN, and combined Dense+RNN architectures achieve a transverse energy resolution of approximately 80 MeV, outperforming both the optimal filtering (OF) method currently in use and RNNs of similar complexity. A detailed comparison across the full dynamic range shows that Dense, CNN, and Dense+RNN accurately reproduce the energy scale, while OF and RNNs underestimate the energy. Deep Evidential Regression is implemented within the Dense architecture to address the need for reliable per-event energy uncertainties. This approach provides predictive uncertainty estimates with minimal increase in network size. The predicted uncertainty is found to be consistent, on average, with the difference between the true deposited energy and the predicted energy.
academic
Optimised neural networks for online processing of ATLAS calorimeter data on FPGAs
This study presents an in-depth investigation of neural network architectures for reconstructing energy deposits in ATLAS liquid argon calorimeter cells under the high-pileup conditions expected at the High-Luminosity Large Hadron Collider (HL-LHC). These networks are designed to operate on FPGA-based calorimeter readout hardware under strict size and latency constraints. Through Bayesian optimization, multiple architectures including Dense networks, Recurrent Neural Networks (RNNs), and Convolutional Neural Networks (CNNs) are optimized to balance energy resolution against network complexity. The optimized Dense, CNN, and Dense+RNN hybrid architectures achieve transverse energy resolution of approximately 80 MeV, significantly outperforming the currently employed Optimal Filtering (OF) method and RNNs of comparable complexity. Detailed comparisons across the full dynamic range demonstrate that Dense, CNN, and Dense+RNN architectures accurately reproduce energy scales, while OF and RNN systematically underestimate energies. Furthermore, Deep Evidential Regression (DER) is implemented within the Dense architecture to provide reliable per-event energy uncertainty estimates.
High-Luminosity LHC Challenges: The HL-LHC upgrade (2026-2030) will produce up to 200 simultaneous proton-proton collisions, resulting in severe signal pileup issues
Hardware Constraints: The ATLAS liquid argon calorimeter contains 182,468 cells, generating hundreds of terabytes of data per second, requiring specialized electronic boards for processing
Latency Requirements: Energy reconstruction algorithms must complete within 125 ns to meet the fast response demands of the trigger system
Limitations of Existing Methods: The currently employed Optimal Filtering (OF) algorithm exhibits significantly degraded performance under high-pileup conditions
Advances in FPGA processing capabilities provide a unique opportunity to implement modern machine learning algorithms at early stages of the data processing pipeline
Development of new methods capable of operating under strict hardware constraints while outperforming OF algorithms
Implementation of per-event energy uncertainty estimation to enhance precision in subsequent data acquisition and reconstruction steps
Multi-Architecture Optimization: Proposes and optimizes four neural network architectures (Dense, RNN, CNN, Dense+RNN) through Bayesian optimization to achieve optimal balance between energy resolution and network complexity
Hardware-Constrained Objective Function: Designs a piecewise penalty objective function accounting for MAC unit counts, effectively controlling network size
Performance Enhancement: Optimal architectures achieve approximately 80 MeV transverse energy resolution, representing ~8% improvement over OF algorithm
Uncertainty Quantification: First implementation of Deep Evidential Regression (DER) under FPGA constraints, providing per-event energy uncertainty estimates
Full Dynamic Range Validation: Validates method effectiveness and energy scale accuracy across 0-130 GeV energy range
This paper cites 28 important references covering ATLAS experiment design, LHC upgrade plans, FPGA neural network implementations, and Deep Evidential Regression theory, providing solid theoretical and technical foundations for the research.
Overall Assessment: This is a high-quality applied research paper achieving good balance between theoretical innovation and engineering practice. The research directly serves major scientific facility upgrade requirements with well-designed methodology and comprehensive experimental validation, offering significant value to both high-energy physics experiments and FPGA application domains.