2025-11-18T12:01:20.412379

Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches

Safarzadeh, Jamali, Moshiri
Ensuring reliable ATM services is essential for modern banking, directly impacting customer satisfaction and the operational efficiency of financial institutions. This study introduces a data fusion approach that utilizes multi-classifier fusion techniques, with a special focus on the Stacking Classifier, to enhance the reliability of ATM networks. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied, enabling balanced learning for both frequent and rare events. The proposed framework integrates diverse classification models - Random Forest, LightGBM, and CatBoost - within a Stacking Classifier, achieving a dramatic reduction in false alarms from 3.56 percent to just 0.71 percent, along with an outstanding overall accuracy of 99.29 percent. This multi-classifier fusion method synthesizes the strengths of individual models, leading to significant cost savings and improved operational decision-making. By demonstrating the power of machine learning and data fusion in optimizing ATM status detection, this research provides practical and scalable solutions for financial institutions aiming to enhance their ATM network performance and customer satisfaction.
academic

Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches

Basic Information

  • Paper ID: 2501.01067
  • Title: Enhancing Precision of Automated Teller Machines Network Quality Assessment: Machine Learning and Multi Classifier Fusion Approaches
  • Authors: Alireza Safarzadeh, Mohammad Reza Jamali, Behzad Moshiri
  • Category: cs.LG (Machine Learning)
  • Publication Date: January 3, 2025
  • Paper Link: https://arxiv.org/abs/2501.01067

Abstract

Ensuring reliable ATM services is critical to modern banking, directly impacting customer satisfaction and operational efficiency of financial institutions. This research introduces a data fusion approach leveraging multi-classifier fusion techniques, with particular emphasis on stacking classifiers, to enhance ATM network reliability. To address class imbalance issues, Synthetic Minority Over-sampling Technique (SMOTE) was applied to achieve balanced learning of frequent and rare events. The proposed framework integrates multiple classification models—Random Forest, LightGBM, and CatBoost—within a stacking classifier, achieving a dramatic reduction in false alarms from 3.56% to 0.71%, while attaining an outstanding overall accuracy of 99.29%.

Research Background and Motivation

Problem Definition

ATM networks play a critical role in modern banking systems, yet their state detection faces significant challenges:

  1. False Alarm Problem: ATM status files frequently generate false positives, resulting in unnecessary maintenance activities and resource misallocation
  2. False Negative Problem: Genuine failures are not detected timely, prolonging downtime and reducing customer satisfaction
  3. Data Processing Complexity: ATM log files contain detailed information but are voluminous, making real-time processing difficult

Problem Significance

  • Customer Satisfaction: ATM availability directly impacts customer experience and banking service quality
  • Operational Efficiency: Accurate state detection reduces unnecessary maintenance and lowers operational costs
  • Financial Impact: Erroneous detection may lead to revenue loss and customer attrition

Limitations of Existing Methods

  • Traditional probabilistic methods (e.g., Bayesian inference) have limitations in handling complex heterogeneous data
  • Single data sources are prone to misclassification
  • Class imbalance causes models to bias toward the majority class (normal operation state)

Core Contributions

  1. Proposed Multi-Classifier Fusion Framework: Integrating stacking classifier with Random Forest, LightGBM, and CatBoost
  2. Innovative Data Fusion Method: Combining ATM status files and transaction records for reliable state detection
  3. Significant Performance Improvement: Reducing false alarm rate from 3.56% to 0.71%, achieving 99.29% overall accuracy
  4. Practical Class Imbalance Solution: Effective application of SMOTE technique for handling rare failure events
  5. Practical Application Value: Providing financial institutions with scalable ATM network performance optimization solutions

Methodology Details

Task Definition

Modeling ATM state detection as a binary classification problem:

  • Input: ATM status files, transaction records, temporal features, etc.
  • Output: ATM state (in-service/out-of-service)
  • Constraints: Real-time requirements, high precision demands, cost-effectiveness considerations

Feature Engineering

Key Feature Extraction

  1. ATM Status File Features:
    • Keyboard, card reader, network connectivity status
    • Any component failure indicates out-of-service state
  2. Temporal Features:
    • Day of month (1-31)
    • Day type (regular weekday/part-time weekday/holiday)
    • Time of day
  3. Transaction-Related Features:
    • Monthly transaction count (percentile stratification)
    • Transaction status features (based on Poisson process assumption)

Transaction Interval Modeling

Validating transaction intervals follow exponential distribution via Kolmogorov-Smirnov test:

  • Hypothesis: ATM transactions follow a Poisson process
  • Validation Result: Exponential distribution KS statistic of 0.1493, superior to Gamma (0.1654), Logistic (0.1906), and Normal (0.2557)
  • Probability Threshold: 99% probability threshold set for anomaly detection

Class Imbalance Handling

  • Problem: Out-of-service samples constitute only 0.85%
  • Solution: SMOTE with k=3 nearest neighbors
  • Effect: Balancing dataset to 50:50 ratio

Model Architecture

Stacking Classifier Design

First Layer (Base Learners):

  • Random Forest: 100 trees, Gini impurity criterion
  • LightGBM: 100 estimators, learning rate 0.1, leaf count 31
  • CatBoost: 100 iterations, learning rate 0.1, depth 6

Second Layer (Meta-Learner):

  • Logistic Regression: L2 regularization, cross-entropy loss optimization

Mathematical Framework

SVM objective function:

min(w,b,ζ) 1/2||w||² + C∑ζᵢ
subject to: yᵢ(w^T xᵢ + b) ≥ 1-ζᵢ, ζᵢ ≥ 0

Technical Innovations

  1. Multi-Source Data Fusion: Innovatively combining status files and transaction data
  2. Probabilistic Modeling Validation: Rigorous statistical testing to verify modeling assumptions
  3. Adaptive Ensemble Learning: Dynamic Classifier Selection (DCS) and Dynamic Ensemble Selection (DES)
  4. Balanced Performance Optimization: Focus on average precision, recall, and F1-score

Experimental Setup

Dataset Characteristics

  • Data Source: Actual operational data from bank ATM networks
  • Privacy Protection: Data used under strict confidentiality agreements
  • Class Distribution: Highly imbalanced (out-of-service: 0.85%)

Evaluation Metrics

Employing averaged metrics to avoid class bias:

  • Average Precision: (Precision_down + Precision_up)/2
  • Average Recall: (Recall_down + Recall_up)/2
  • Average F1-Score: (F1-Score_down + F1-Score_up)/2

Comparison Methods

  • Support Vector Machine (SVM)
  • Decision Tree
  • Bagging Classifier
  • Random Forest
  • LightGBM
  • CatBoost
  • Dynamic Classifier Selection (DCS LA)
  • Dynamic Ensemble Selection (DES KNORAE)

Experimental Results

Primary Results

Model Performance Comparison

ModelOut-of-Service PrecisionOut-of-Service RecallOut-of-Service F1In-Service PrecisionIn-Service RecallIn-Service F1
SVM0.85350.87470.86390.87150.84980.8605
Random Forest0.99610.98920.99260.98920.99610.9927
Stacking Classifier0.99490.99100.99290.99100.99490.9930

Outstanding Achievements

  • Overall Accuracy: 99.29% (highest among all models)
  • Significant False Alarm Reduction: From 3.56% to 0.71%
  • Balanced Performance: F1-scores for both classes exceed 99%

SMOTE Effect Analysis

Pre- and Post-SMOTE Comparison

ModelPre-SMOTE F1Post-SMOTE F1Improvement
SVM0.500.86+72%
Random Forest0.780.99+27%
LightGBM0.730.94+29%

Ablation Study

Validating each component's contribution through incremental addition:

  1. Baseline Features: ATM status file accuracy 96.14%
  2. Transaction Features: Standalone accuracy 85.43%
  3. Feature Fusion: Significant performance improvement
  4. SMOTE Processing: Critical enhancement for minority class detection
  5. Ensemble Learning: Achieving optimal final performance

Data Fusion Research

  • Bachmann et al. (2013): Multi-sensor traffic speed estimation
  • Meng et al. (2020): Machine learning data fusion survey
  • This Paper's Contribution: First application of data fusion to ATM state detection

Ensemble Learning Development

  • Wolpert (1992): Original stacked generalization proposal
  • Klein et al. (2023): Heterogeneous stacking ensembles for time series classification
  • This Paper's Innovation: Customized ensemble method for financial service reliability

Class Imbalance Handling

  • Chawla et al. (2002): Original SMOTE method
  • Khan et al. (2024): Combining ensemble learning with data augmentation
  • This Paper's Application: Effective practice in ATM failure detection

Conclusions and Discussion

Main Conclusions

  1. Technical Effectiveness: Multi-classifier fusion significantly enhances ATM state detection precision
  2. Practical Value: Substantially reduces false alarm rates and operational costs
  3. Method Generalizability: Framework is extensible to other critical system monitoring

Limitations

  1. Data Dependency: Requires high-quality historical training data
  2. Computational Complexity: Ensemble methods increase computational overhead
  3. Domain-Specific: Method is customized for ATM networks with limited generalization capability
  4. Data Privacy: Actual data cannot be disclosed, affecting reproducibility

Future Directions

  1. Real-Time Monitoring: Integration with real-time monitoring technologies
  2. Cross-Domain Applications: Extension to healthcare, transportation, and other critical systems
  3. Deep Learning: Exploration of neural network approaches
  4. Edge Computing: Optimization for edge deployment efficiency

In-Depth Evaluation

Strengths

  1. Strong Problem Relevance: Directly addresses practical pain points in banking
  2. Reasonable Methodology Design: Multi-level validation ensures reliability
  3. Comprehensive Experiments: Thorough comparative and ablation studies
  4. Significant Results: Breakthrough improvements in key metrics
  5. High Practical Value: Provides readily deployable solutions

Weaknesses

  1. Limited Theoretical Contribution: Primarily engineering application of existing techniques
  2. Single Dataset: Validation only on specific bank network
  3. Simple Baseline Comparisons: Lacks comparison with recent deep learning methods
  4. Insufficient Generalization Analysis: Limited discussion of cross-institutional applicability

Impact

  1. Academic Value: Provides practical methodology for fintech domain
  2. Industry Impact: Directly improves banking operational efficiency
  3. Methodological Contribution: Successful case of multi-classifier fusion in critical systems
  4. Reproducibility: Detailed method description facilitates reproduction and improvement

Applicable Scenarios

  1. Financial Institutions: ATM network monitoring and maintenance optimization
  2. Critical Infrastructure: Power and communication system state monitoring
  3. Manufacturing: Equipment failure prediction and maintenance scheduling
  4. Service Industry: Self-service device reliability management

References

  1. Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5, 241-259.
  2. Chawla, N. V., et al. (2002). SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, 321-357.
  3. Meng, T., et al. (2020). A survey on machine learning for data fusion. Information Fusion, 57, 115-129.
  4. Ke, G., et al. (2017). LightGBM: A highly efficient gradient boosting decision tree. NIPS 2017.
  5. Prokhorenkova, L., et al. (2018). CatBoost: unbiased boosting with categorical features. NIPS 2018.

Overall Assessment: This paper addresses the important practical problem of ATM network reliability by proposing an effective solution based on multi-classifier fusion. While theoretical innovation is limited, it demonstrates outstanding performance in engineering practice and performance improvement, possessing significant practical value and industrial impact. The successful application of this method provides valuable reference for reliability monitoring of similar critical systems.