
Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis

Bhatia, de Amorim, De Feo

Regression analysis is employed to examine and quantify the relationships between input variables and a dependent and continuous output variable. It is widely used for predictive modelling in fields such as finance, healthcare, and engineering. However, traditional methods often struggle with real-world data complexities, including uncertainty and ambiguity. While deep learning approaches excel at capturing complex non-linear relationships, they lack interpretability and risk over-fitting on small datasets. Fuzzy systems provide an alternative framework for handling uncertainty and imprecision, with Mamdani and Takagi-Sugeno-Kang (TSK) systems offering complementary strengths: interpretability versus accuracy. This paper presents a novel fuzzy regression method that combines the interpretability of Mamdani systems with the precision of TSK models. The proposed approach introduces a hybrid rule structure with fuzzy and crisp components and dual dominance types, enhancing both accuracy and explainability. Evaluations on benchmark datasets demonstrate state-of-the-art performance in several cases, with rules maintaining a component similar to traditional Mamdani systems while improving precision through improved rule outputs. This hybrid methodology offers a balanced and versatile tool for predictive modelling, addressing the trade-off between interpretability and accuracy inherent in fuzzy systems. In the 6 datasets tested, the proposed approach gave the best fuzzy methodology score in 4 datasets, out-performed the opaque models in 2 datasets and produced the best overall score in 1 dataset with the improvements in RMSE ranging from 0.4% to 19%.

Basic Information

Paper ID: 2510.13437
Title: Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis
Authors: Ashish Bhatia, Renato Cordeiro de Amorim, Vito De Feo (University of Essex, United Kingdom)
Classification: cs.LG (Machine Learning)
Publication Date: October 15, 2025
Paper Link: https://arxiv.org/abs/2510.13437v1

Abstract

Regression analysis is widely applied in predictive modeling across finance, healthcare, and engineering to examine and quantify relationships between input variables and a continuous output variable. However, traditional methods often struggle with the complexity of real-world data, including uncertainty and fuzziness. While deep learning approaches excel at capturing complex nonlinear relationships, they lack interpretability and are prone to overfitting on small datasets. Fuzzy systems provide an alternative framework for handling uncertainty and imprecision, where Mamdani and Takagi-Sugeno-Kang (TSK) systems offer complementary advantages: interpretability versus accuracy. This paper proposes a novel fuzzy regression method that combines the interpretability of Mamdani systems with the precision of TSK models. The method introduces a hybrid rule structure with fuzzy and crisp components and dual dominance types, enhancing both accuracy and interpretability.

Research Background and Motivation

Problem Definition

Primary challenges faced by traditional regression methods when processing real-world data:

Uncertainty and Fuzziness: Inherent uncertainty and linguistic information present in real data
Interpretability-Accuracy Trade-off: Deep learning models are accurate but lack interpretability
Small Dataset Problem: Complex models are prone to overfitting on small datasets

Limitations of Existing Approaches

Traditional Regression Methods: Assume precise, well-defined numerical relationships and struggle to handle uncertainty
Deep Learning Methods: Lack interpretability, have numerous parameters, and are unsuitable for training on small datasets
Mamdani Fuzzy Systems: Strong interpretability but limited precision; coarse-grained partitioning leads to performance degradation
TSK Fuzzy Systems: High precision but lack interpretability, contradicting the purpose of using fuzzy systems

Research Motivation

Develop a hybrid framework that maintains the interpretability of Mamdani systems while achieving the precision of TSK systems, providing a balanced and versatile tool for predictive modeling.

Core Contributions

Hybrid Rule Structure: Proposes a novel fuzzy regression system combining linguistic interpretability of Mamdani systems and numerical precision of TSK models
Dual Dominance Mechanism: Introduces two rule weight calculation methods: dominance based on fuzzy support and confidence, and error-based dominance
Constrained TSK Component: TSK function outputs are constrained within the boundaries of corresponding fuzzy sets, maintaining interpretability
Interval Type-2 Fuzzy Sets: Employs interval type-2 fuzzy sets to better handle uncertainty
ACO Optimization: Utilizes Ant Colony Optimization for rule subset selection, balancing model compactness and accuracy

Methodology Details

Task Definition

Given input variables $x_1, x_2, \ldots, x_n$ and a continuous output variable $y$, the objective is to establish a regression model that is both accurate and interpretable, capable of handling uncertainty and fuzziness in data.

Model Architecture

1. Interval Type-2 Fuzzy Set Fuzzification

Inputs and outputs are fuzzified using interval type-2 fuzzy sets:

$\tilde{F} = \int_{x \in X} \left[ \int_{u \in [\underline{\mu}(x), \overline{\mu}(x)]} 1/u \right] /x$

where $\underline{\mu}(x)$ and $\overline{\mu}(x)$ are the lower and upper membership bounds, respectively.
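As a rough illustration of the lower and upper membership bounds, the sketch below builds an interval type-2 set from a triangular membership function. The triangular shape, the `shrink` parameter, and the function names are assumptions made for this example; the summary does not state which membership shapes the paper uses.

```python
import numpy as np

def triangular(x, left, peak, right):
    """Type-1 triangular membership, zero outside [left, right]."""
    x = np.asarray(x, dtype=float)
    rising = (x - left) / (peak - left)
    falling = (right - x) / (right - peak)
    return np.clip(np.minimum(rising, falling), 0.0, 1.0)

def it2_membership(x, left, peak, right, shrink=0.2):
    """Return (lower, upper) membership bounds for x.

    Assumption: the lower bound is the upper bound scaled by (1 - shrink),
    which is only one of many ways to define a footprint of uncertainty.
    """
    upper = triangular(x, left, peak, right)
    lower = (1.0 - shrink) * upper
    return lower, upper

lower, upper = it2_membership([2.5, 5.0, 11.0], left=0.0, peak=5.0, right=10.0)
```

Every membership grade between `lower` and `upper` is treated as equally possible, which is what the inner integral with secondary grade 1 expresses.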

2. Hybrid Rule Structure

Each rule contains two consequent components:

Rule Form:

IF x1 is F1 AND ... AND xn is Fn 
THEN (y is G, y = f(x1, x2, ..., xn))

Fuzzy Component: Traditional Mamdani consequent, pointing to output fuzzy set
TSK Function Component: Polynomial function of a chosen order (second or third order in the evaluated variants), providing a crisp output value

TSK Function Constraint: $y_{output} \in [LowerBound(F_{upper}), UpperBound(F_{upper})]$

This ensures that the TSK output always remains within the boundaries of the corresponding output fuzzy set.
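A minimal sketch of one hybrid rule follows. It assumes the constraint is enforced by clipping the polynomial output to the bounds of the output fuzzy set; the summary does not say how the paper enforces it, and the class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple

@dataclass
class HybridRule:
    antecedents: Tuple[str, ...]              # linguistic labels, one per input
    consequent_label: str                     # Mamdani part: "y is G"
    bounds: Tuple[float, float]               # (low, high) extent of output set G
    tsk: Callable[[Sequence[float]], float]   # crisp part: y = f(x1, ..., xn)

    def crisp_output(self, x: Sequence[float]) -> float:
        """Evaluate the TSK function and keep the result inside G's bounds.

        Assumption: clipping is used here purely for illustration.
        """
        low, high = self.bounds
        return min(max(self.tsk(x), low), high)

rule = HybridRule(
    antecedents=("low", "high"),
    consequent_label="medium",
    bounds=(0.0, 10.0),
    tsk=lambda x: 2.0 * x[0] + 1.0,   # hypothetical first-order function
)
```

Because the crisp value can never leave the extent of `consequent_label`, the linguistic reading of the rule stays valid for any input.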

3. Dual Weight Mechanism

Fuzzy Rule Weights:

Support: $Support(A_j \to \tilde{C}_j) = \frac{1}{N} \sum_{p=1}^N \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)$
Confidence: $Confidence(A_j \to \tilde{C}_j) = \frac{\sum_{p=1}^N \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)}{\sum_{p=1}^N \mu_{A_j}(x_p)}$
Dominance: $D = [S_{Rule\_lower} \cdot C_{lower}, S_{Rule\_upper} \cdot C_{upper}]$
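The three quantities above can be sketched as follows. The inputs are arrays of antecedent and consequent membership grades, one entry per training sample. Pairing lower grades with the lower dominance bound and upper grades with the upper bound is an assumption of this sketch, as are the function names.

```python
import numpy as np

def support(mu_a, mu_c):
    """Mean product of antecedent and consequent grades over all samples."""
    mu_a, mu_c = np.asarray(mu_a, float), np.asarray(mu_c, float)
    return float(np.sum(mu_a * mu_c) / len(mu_a))

def confidence(mu_a, mu_c):
    """Share of antecedent activation that also matches the consequent."""
    mu_a, mu_c = np.asarray(mu_a, float), np.asarray(mu_c, float)
    total = np.sum(mu_a)
    return float(np.sum(mu_a * mu_c) / total) if total > 0 else 0.0

def dominance_interval(mu_a_lower, mu_a_upper, mu_c_lower, mu_c_upper):
    """Interval [S_lower * C_lower, S_upper * C_upper].

    Assumption: each bound is computed from the matching membership bound.
    """
    low = support(mu_a_lower, mu_c_lower) * confidence(mu_a_lower, mu_c_lower)
    high = support(mu_a_upper, mu_c_upper) * confidence(mu_a_upper, mu_c_upper)
    return low, high
```

For three samples with antecedent grades `[1.0, 0.5, 0.0]` and consequent grades `[0.5, 1.0, 1.0]`, the summed product is 1.0, so the support is 1/3 and the confidence is 1.0 / 1.5 = 2/3.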

Error-Based Dominance: $Dominance_{ri} = \frac{1}{1 + r_i}$ where $r_i$ is the RMSE of the i-th rule.
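The mapping from rule error to dominance is easy to check numerically. The sketch below applies the formula to a few hypothetical per-rule RMSE values; the function name is illustrative.

```python
import numpy as np

def error_dominance(rule_rmse):
    """Dominance = 1 / (1 + RMSE); lower error gives higher dominance."""
    rule_rmse = np.asarray(rule_rmse, dtype=float)
    return 1.0 / (1.0 + rule_rmse)

# Hypothetical per-rule RMSE values, not taken from the paper.
dominance = error_dominance([0.0, 1.0, 3.0])
```

A rule with zero error receives the maximum dominance of 1, and the value decays toward 0 as the error grows, so the weight is always in the interval (0, 1].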

4. ACO Rule Selection

Employs Ant Colony Optimization to select optimal rule subsets:

Each ant constructs rule subset solutions
Solutions are evaluated based on RMSE performance
Pheromone update mechanism guides the search process
Balances model compactness and prediction accuracy
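The steps above can be sketched as a small binary ant colony search. This is not the paper's algorithm: the two-trail pheromone layout, the deposit rule, the parameter defaults, and the toy cost function are all assumptions chosen to keep the example short.

```python
import numpy as np

def aco_select_rules(cost_fn, n_rules, n_ants=20, n_iters=50, rho=0.1, seed=0):
    """Binary ACO sketch that returns (best_mask, best_cost)."""
    rng = np.random.default_rng(seed)
    tau = np.ones((n_rules, 2))  # pheromone per rule for [exclude, include]
    best_mask, best_cost = None, np.inf
    for _ in range(n_iters):
        p_include = tau[:, 1] / tau.sum(axis=1)
        for _ in range(n_ants):
            # Each ant builds a candidate rule subset.
            mask = rng.random(n_rules) < p_include
            if not mask.any():
                continue  # an empty rule base cannot predict anything
            cost = cost_fn(mask)
            if cost < best_cost:
                best_mask, best_cost = mask.copy(), cost
        if best_mask is None:
            continue
        tau *= 1.0 - rho  # evaporation
        # Reinforce the choices made by the best subset found so far.
        tau[np.arange(n_rules), best_mask.astype(int)] += 1.0 / (1.0 + best_cost)
    return best_mask, best_cost

# Hypothetical per-rule errors standing in for a real RMSE evaluation.
RULE_ERRORS = np.array([0.9, 0.2, 0.8, 0.1, 0.7, 0.3])

def toy_cost(mask):
    """Mean error of the selected rules plus a small penalty per rule."""
    return float(RULE_ERRORS[mask].mean() + 0.01 * mask.sum())

mask, cost = aco_select_rules(toy_cost, n_rules=len(RULE_ERRORS))
```

In a real system `cost_fn` would rebuild the rule base from `mask` and return its validation RMSE. The per-rule penalty in `toy_cost` is one simple way to express the preference for compact rule bases.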

5. Defuzzification

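Computes the final output using the TSK weighted averaging method: $y_{final} = \frac{\sum_{i=1}^M \omega_i y_i}{\sum_{i=1}^M \omega_i}$, where $y_i$ is the crisp output of rule $i$, $\omega_i$ is its weight, and $M$ is the number of rules.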
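A minimal sketch of the weighted average is shown below. The summary does not say exactly how each weight is formed from firing strength and dominance, so the function simply takes the weights as given; the name is illustrative.

```python
import numpy as np

def weighted_average_output(weights, rule_outputs):
    """Weighted average of per-rule crisp outputs."""
    w = np.asarray(weights, dtype=float)
    y = np.asarray(rule_outputs, dtype=float)
    total = w.sum()
    if total == 0.0:
        raise ValueError("no rule fired for this input")
    return float(np.dot(w, y) / total)

y_final = weighted_average_output([0.2, 0.8], [10.0, 20.0])
```

With weights 0.2 and 0.8 and rule outputs 10 and 20, the result is (2 + 16) / 1 = 18, pulled toward the more heavily weighted rule.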

Technical Innovations

Constrained TSK Component: Unlike traditional TSK, function outputs are restricted within fuzzy set boundaries
Subset Training: TSK functions are trained only on training samples with positive firing strength
Dual Dominance Mechanism: Combines fuzzy logic and numerical precision in weight calculation
Maintaining Interpretability: Rule form resembles traditional Mamdani systems but with higher precision
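The subset training idea can be illustrated for a single input variable. This sketch assumes an ordinary least-squares polynomial fit via `numpy.polyfit` and a triangular firing strength; the paper's actual fitting procedure and multi-input form are not given in this summary.

```python
import numpy as np

def firing_strength(x, center=2.0, half_width=1.0):
    """Hypothetical triangular firing strength of one rule."""
    return np.maximum(0.0, 1.0 - np.abs(x - center) / half_width)

def fit_rule_polynomial(x, y, strength, order=2):
    """Fit the rule's polynomial only on samples that fire the rule."""
    fired = strength > 0.0
    return np.polyfit(x[fired], y[fired], deg=order)

x = np.linspace(0.0, 4.0, 17)          # step of 0.25
y = x ** 2                             # toy target
strength = firing_strength(x)
coeffs = fit_rule_polynomial(x, y, strength)
prediction = np.polyval(coeffs, 2.2)
```

Only the seven samples strictly between 1 and 3 contribute to the fit, so each rule's function is specialised to the region its antecedent describes.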

Experimental Setup

Datasets

Six benchmark datasets from the KEEL repository:

Concrete Compressive Strength: 1,030 instances, 8 input variables
Diabetes: 768 records, 2 input attributes
ELE-2: 11,105 instances, 4 predictor variables
Mortgage: 1,045 instances, 14 variables
Treasury: 956 records, 16 financial attributes
Wankara: 1,609 records, 4 input variables

Evaluation Metrics

RMSE (Root Mean Squared Error): Primary evaluation metric
Interpretability Metrics: Rule coverage, active rules count, rule base characteristics, robustness
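For reference, the standard definition of RMSE over $N$ test samples is given below, where $y_p$ is the observed output and $\hat{y}_p$ is the model prediction. The notation $\hat{y}_p$ is introduced here for illustration and does not appear in the summary.

$\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{p=1}^{N} (y_p - \hat{y}_p)^2}$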

Comparison Methods

MP: Multilayer Perceptron
SMOreg: Support vector regression trained with Sequential Minimal Optimization
WM: Wang-Mendel Algorithm
CHV: Cordon, Herrera, and Villar Algorithm
GLD-WM: Global Lateral Parameters + WM Rule Base

Model Variants

HIT2-MTSK-D2: 2nd-order TSK function
HIT2-MTSK-D3: 3rd-order TSK function

Experimental Results

Main Results

Across six datasets:

4 datasets: Achieved best fuzzy method scores
2 datasets: Surpassed opaque models
1 dataset: Achieved best overall score
RMSE improvement range: 0.4% - 19%

Specific Performance:

Concrete: HIT2-MTSK-D3 achieves 7.29 RMSE, best overall performance
Mortgage: HIT2-MTSK-D3 achieves 0.13 RMSE, best fuzzy method
Treasury: HIT2-MTSK-D3 achieves 0.27 RMSE, best fuzzy method
Wankara: Both variants achieve 1.58 RMSE, matching SMOreg performance

Case Study: California Housing Price Prediction

Performance on California housing dataset:

Dataset: 20,640 observations, 8 features
HIT2-MTSK RMSE: 0.695
Mamdani FRBS RMSE: 0.751
Number of Rules: 75 rules
Coverage: 100% of test instances activate at least one rule
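As a quick check on the figures above, the relative RMSE reduction over the Mamdani baseline can be computed directly. Whether the paper's reported improvement percentages are computed with exactly this formula is an assumption.

```python
def relative_improvement(baseline_rmse, new_rmse):
    """Percentage reduction in RMSE relative to the baseline."""
    return 100.0 * (baseline_rmse - new_rmse) / baseline_rmse

# RMSE values from the case study: Mamdani FRBS 0.751, HIT2-MTSK 0.695.
improvement = relative_improvement(0.751, 0.695)
```

This gives a reduction of roughly 7.5%, since (0.751 - 0.695) / 0.751 is about 0.0746.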

Interpretability Assessment

Category Coverage: Covers 96% of actual test data range
Active Rules: Average 8.38 rules fired at threshold 0.15
Rule Characteristics: 75 rules, average 2.67 antecedents
Noise Robustness: 12.24% prediction variation at 10% noise level
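The active-rule count and rule coverage can be sketched from a matrix of firing strengths with one row per test instance and one column per rule. The exact definitions used in the paper are not given here, so treating "active" as firing strength at or above the threshold and "covered" as any positive firing strength are assumptions.

```python
import numpy as np

def rule_activity(firing, threshold=0.15):
    """Return (average active rules per instance, fraction of instances covered)."""
    firing = np.asarray(firing, dtype=float)
    active_per_instance = (firing >= threshold).sum(axis=1)
    covered = (firing > 0.0).any(axis=1)
    return float(active_per_instance.mean()), float(covered.mean())

# Hypothetical firing strengths for 3 instances and 3 rules.
firing = [
    [0.2, 0.1, 0.0],
    [0.5, 0.3, 0.0],
    [0.0, 0.0, 0.0],
]
avg_active, coverage = rule_activity(firing)
```

In this toy matrix the instances activate 1, 2, and 0 rules at the 0.15 threshold, and the last instance is not covered by any rule.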

Related Work

Development of Fuzzy Rule Systems

Wang-Mendel Method: Early data-driven rule generation
Genetic Fuzzy Systems (GFS): Integrating genetic algorithms for rule learning optimization
Evolutionary Algorithm Applications: Optimizing knowledge base, rule base, and database

Hybrid Approaches

Gr-MF and GA-WM: Combining rule base and database optimization
GA-COR Method: Integrating genetic algorithms with COR framework
WM+GL Method: Using linguistic 2-tuples for lateral tuning

Advantages of This Work

Compared to existing work, this method improves precision (RMSE improvements of 0.4% to 19% are reported on the benchmark datasets) while maintaining interpretability, addressing the accuracy-interpretability trade-off in traditional methods.

Conclusions and Discussion

Main Conclusions

Effective Balance: Successfully balances the interpretability-accuracy trade-off in fuzzy systems
Significant Improvement: Achieves best or near-best performance on most benchmark datasets
Maintains Interpretability: Rule form resembles traditional Mamdani systems, easy to understand
Strong Robustness: Demonstrates stable performance across different domains and datasets

Limitations

Complexity Trade-off: Higher-order TSK functions may lead to overfitting
Optimization Space: Performance improvement potential remains on some datasets
Computational Complexity: ACO optimization increases computational cost
Parameter Sensitivity: Requires reasonable setting of fuzzy set quantity and TSK function order

Future Directions

Optimization Enhancement: Explore other metaheuristic optimization techniques
Domain Adaptation: Domain-specific preprocessing and feature selection
Extended Evaluation: Validation on more datasets and evaluation metrics
Theoretical Analysis: In-depth convergence and complexity analysis

In-Depth Evaluation

Strengths

Strong Innovation: First to propose constrained TSK component in hybrid fuzzy systems
High Practical Value: Addresses key problems in real-world applications
Comprehensive Experiments: Validation on multiple benchmark datasets with detailed interpretability analysis
Reasonable Methodology: Clear technical design logic and solid theoretical foundation

Weaknesses

Insufficient Theoretical Analysis: Lacks theoretical guarantees for convergence and complexity
Limited Parameter Selection Guidance: No systematic parameter selection strategy provided
Computational Efficiency: Insufficiently detailed analysis of ACO optimization complexity
Limited Applicability Boundaries: Limited discussion on method applicability conditions

Impact

Academic Contribution: Provides new research direction for fuzzy systems field
Practical Value: Important application value in scenarios requiring explainable AI
Reproducibility: Detailed method description facilitates reproduction and extension

Applicable Scenarios

Financial Risk Assessment: Requires interpretable prediction models
Medical Diagnosis: Demands transparent decision-making processes
Engineering Control: Requires understanding system behavior in control applications
Small Dataset Regression: Scenarios where traditional deep learning is unsuitable

References

The paper cites important literature in fuzzy systems, evolutionary algorithms, and machine learning, including foundational work by Zadeh on fuzzy logic, classical fuzzy systems by Mamdani and TSK, and recent research on genetic fuzzy systems and interval type-2 fuzzy sets.

Overall Assessment: This is a high-quality academic paper proposing an innovative hybrid fuzzy system method that improves regression accuracy while maintaining interpretability, achieving the best fuzzy-method score on four of the six benchmark datasets. The methodology is well designed, the experiments are comprehensive, and the work makes a valuable contribution to the fuzzy systems and explainable AI fields.