Software Defect Prediction using Autoencoder Transformer Model
Barma, Hariharan, Arvapalli
An AI-ML-powered quality engineering approach uses AI-ML to enhance software quality assessments by predicting defects. Existing ML models struggle with noisy data types, imbalances, pattern recognition, feature extraction, and generalization. To address these challenges, we develop a new model, Adaptive Differential Evolution (ADE) based Quantum Variational Autoencoder-Transformer (QVAET) Model (ADE-QVAET). ADE combines with QVAET to obtain high-dimensional latent features and maintain sequential dependencies, resulting in enhanced defect prediction accuracy. ADE optimization enhances model convergence and predictive performance. ADE-QVAET integrates AI-ML techniques such as tuning hyperparameters for scalable and accurate software defect prediction, representing an AI-ML-driven technology for quality engineering. During training with a 90% training percentage, ADE-QVAET achieves high accuracy, precision, recall, and F1-score of 98.08%, 92.45%, 94.67%, and 98.12%, respectively, when compared to the Differential Evolution (DE) ML model.
academic
Software Defect Prediction using Autoencoder Transformer Model
This paper proposes an AI-ML-based quality engineering methodology to enhance software quality assessment through defect prediction. To address the difficulties of existing ML models in handling noisy data, data imbalance, pattern recognition, feature extraction, and generalization, we develop an Adaptive Differential Evolution (ADE)-based Quantum Variational Autoencoder-Transformer (QVAET) model (ADE-QVAET). This model combines ADE optimization with QVAET architecture to capture high-dimensional latent features while maintaining sequence dependencies, thereby improving defect prediction accuracy. At a 90% training ratio, ADE-QVAET achieves 98.08% accuracy, 92.45% precision, 94.67% recall, and 98.12% F1-score.
High Software Testing Costs: Software testing in enterprise environments faces challenges from complex data and business requirements, with quality engineers spending 30-40% of their time on manual test execution
Insufficient Defect Prevention: Traditional testing methods rely on discovering errors after development completion, overlooking potential failures that could be prevented earlier
In complex enterprise systems such as cloud computing, microservices, IoT, and AI implementations, complex business logic and technical dependencies create exponential complexity, where traditional methods face limitations including hallucinations, insufficient context generation, and loss of critical business relationships during retrieval processes.
Proposed ADE-QVAET Model: First to unify adaptive differential evolution, quantum variational autoencoder, and Transformer architecture into a single framework
Developed ANRA Preprocessing Framework: Adaptive Noise Reduction and data Augmentation framework to improve data quality and class balance
Implemented Dynamic Hyperparameter Optimization: ADE algorithm dynamically adjusts scaling factors and crossover rates based on candidate solution evolution performance
Achieved Significant Performance Improvements: 7.73% improvement in accuracy and 18.63% improvement in precision compared to traditional DE models
Input dataset: D = {x₁, x₂, ..., xₙ}
Preprocessed data: D' = ANRA(D)
The ANRA framework balances defective and non-defective instances through noise reduction, redundant information removal, and synthetic data generation.
Compared to traditional VAE, QVAE leverages quantum computing principles to extract richer high-dimensional latent features, capturing complex patterns difficult for classical methods to identify.
Objective function: θ* = argmin f(θ), where θ includes critical hyperparameters such as learning rate, regularization coefficient, and number of layers.
Data Source: Kaggle Software Defect Prediction Dataset
Features: Lines of Code (LOC), cyclomatic complexity, Depth of Inheritance Tree (DIT), Coupling Between Objects (CBO), and other structural elements
Labels: Binary classification (defect=1, non-defect=0)
Challenge: Class imbalance problem with low frequency of defective modules
This paper cites 21 related references, primarily including:
Machine learning methods for software defect prediction
Applications of evolutionary algorithms in software engineering
Quantum machine learning technologies
Deep learning applications in code analysis
Overall Assessment: This is a technically innovative paper that successfully integrates multiple cutting-edge technologies for software defect prediction. While there is room for improvement in theoretical analysis and validation completeness, its practical value and performance improvements are significant, contributing importantly to advancing AI applications in software quality engineering.