2025-11-14T07:52:11.150813

Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis

Bhatia, de Amorim, De Feo
Regression analysis is employed to examine and quantify the relationships between input variables and a dependent and continuous output variable. It is widely used for predictive modelling in fields such as finance, healthcare, and engineering. However, traditional methods often struggle with real-world data complexities, including uncertainty and ambiguity. While deep learning approaches excel at capturing complex non-linear relationships, they lack interpretability and risk over-fitting on small datasets. Fuzzy systems provide an alternative framework for handling uncertainty and imprecision, with Mamdani and Takagi-Sugeno-Kang (TSK) systems offering complementary strengths: interpretability versus accuracy. This paper presents a novel fuzzy regression method that combines the interpretability of Mamdani systems with the precision of TSK models. The proposed approach introduces a hybrid rule structure with fuzzy and crisp components and dual dominance types, enhancing both accuracy and explainability. Evaluations on benchmark datasets demonstrate state-of-the-art performance in several cases, with rules maintaining a component similar to traditional Mamdani systems while improving precision through improved rule outputs. This hybrid methodology offers a balanced and versatile tool for predictive modelling, addressing the trade-off between interpretability and accuracy inherent in fuzzy systems. In the 6 datasets tested, the proposed approach gave the best fuzzy methodology score in 4 datasets, out-performed the opaque models in 2 datasets and produced the best overall score in 1 dataset with the improvements in RMSE ranging from 0.4% to 19%.

Basic Information

  • Paper ID: 2510.13437
  • Title: Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis
  • Authors: Ashish Bhatia, Renato Cordeiro de Amorim, Vito De Feo (University of Essex, United Kingdom)
  • Classification: cs.LG (Machine Learning)
  • Publication Date: October 15, 2025
  • Paper Link: https://arxiv.org/abs/2510.13437v1

Abstract

Regression analysis is widely applied in predictive modeling across finance, healthcare, and engineering to examine and quantify relationships between input variables and a continuous output variable. However, traditional methods often struggle with the complexity of real-world data, including uncertainty and ambiguity. While deep learning approaches excel at capturing complex nonlinear relationships, they lack interpretability and are prone to overfitting on small datasets. Fuzzy systems provide an alternative framework for handling uncertainty and imprecision, with Mamdani and Takagi-Sugeno-Kang (TSK) systems offering complementary strengths: Mamdani systems favor interpretability, TSK systems favor accuracy. This paper proposes a novel fuzzy regression method that combines the interpretability of Mamdani systems with the precision of TSK models. The method introduces a hybrid rule structure with fuzzy and crisp components and dual dominance types, enhancing both accuracy and interpretability.

Research Background and Motivation

Problem Definition

Primary challenges faced by traditional regression methods when processing real-world data:

  1. Uncertainty and Fuzziness: Inherent uncertainty and linguistic information present in real data
  2. Interpretability-Accuracy Trade-off: Deep learning models are accurate but lack interpretability
  3. Small Dataset Problem: Complex models are prone to overfitting on small datasets

Limitations of Existing Approaches

  • Traditional Regression Methods: Assume precise, deterministic numerical relationships and handle uncertainty poorly
  • Deep Learning Methods: Lack interpretability and have many parameters, which makes them ill-suited to training on small datasets
  • Mamdani Fuzzy Systems: Highly interpretable but limited in precision; coarse-grained partitioning degrades performance
  • TSK Fuzzy Systems: High precision but low interpretability, which undercuts a main reason for using fuzzy systems

Research Motivation

Develop a hybrid framework that maintains the interpretability of Mamdani systems while achieving the precision of TSK systems, providing a balanced and versatile tool for predictive modeling.

Core Contributions

  1. Hybrid Rule Structure: Proposes a novel fuzzy regression system combining the linguistic interpretability of Mamdani systems with the numerical precision of TSK models
  2. Dual Dominance Mechanism: Introduces two ways of computing rule weights: a fuzzy dominance based on support and confidence, and an error-based dominance
  3. Constrained TSK Component: TSK function outputs are constrained within the boundaries of corresponding fuzzy sets, maintaining interpretability
  4. Interval Type-2 Fuzzy Sets: Employs interval type-2 fuzzy sets to better handle uncertainty
  5. ACO Optimization: Utilizes Ant Colony Optimization for rule subset selection, balancing model compactness and accuracy

Methodology Details

Task Definition

Given input variables $x_1, x_2, \ldots, x_n$ and a continuous output variable $y$, the objective is to build a regression model that is both accurate and interpretable and that can handle uncertainty and fuzziness in the data.

Model Architecture

1. Interval Type-2 Fuzzy Set Fuzzification

Inputs and outputs are fuzzified using interval type-2 fuzzy sets:

$$\tilde{F} = \int_{x \in X} \left[ \int_{u \in [\underline{\mu}(x), \overline{\mu}(x)]} 1/u \right] / x$$

where $\underline{\mu}(x)$ and $\overline{\mu}(x)$ are the lower and upper membership bounds, respectively.
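As a rough illustration of the lower and upper membership bounds, the sketch below evaluates an interval type-2 Gaussian set. The Gaussian shape, the uncertain-width construction, and the names `it2_gaussian`, `sigma_lower`, and `sigma_upper` are assumptions made for this example; the summary does not state which membership function shape the paper uses.

```python
import numpy as np

def it2_gaussian(x, center, sigma_lower, sigma_upper):
    """Return (lower, upper) membership grades of an interval type-2 Gaussian set.

    Assumption: the footprint of uncertainty comes from an uncertain width,
    with sigma_lower <= sigma_upper, so the narrower curve is the lower bound.
    """
    x = np.asarray(x, dtype=float)
    lower = np.exp(-0.5 * ((x - center) / sigma_lower) ** 2)
    upper = np.exp(-0.5 * ((x - center) / sigma_upper) ** 2)
    return lower, upper

# Hypothetical fuzzy set centered at 0.5 on a normalized input.
lower, upper = it2_gaussian([0.0, 0.5, 1.0], center=0.5,
                            sigma_lower=0.15, sigma_upper=0.25)
```

Each input value then maps to an interval of membership grades rather than a single number, and that interval is what the formula above integrates over.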

2. Hybrid Rule Structure

Each rule contains two consequent components:

Rule Form:

IF x1 is F1 AND ... AND xn is Fn
THEN (y is G, y = f(x1, x2, ..., xn))

  • Fuzzy Component: Traditional Mamdani consequent, pointing to an output fuzzy set G
  • TSK Function Component: n-th order polynomial function f, providing a crisp output value

TSK Function Constraint: $y_{output} \in [\mathrm{LowerBound}(F_{upper}), \mathrm{UpperBound}(F_{upper})]$

This ensures that the TSK output always remains within the boundaries of the corresponding fuzzy set.
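One simple way to picture the constraint is to clip the crisp function output to the bounds of the rule's output fuzzy set. The sketch below does this for a first-order function to keep it short. Clipping is an assumption of this example, not necessarily how the paper enforces the constraint, and `constrained_tsk_output` and its parameters are hypothetical names.

```python
import numpy as np

def constrained_tsk_output(x, coeffs, intercept, lower_bound, upper_bound):
    """Evaluate a first-order TSK consequent and keep it inside the
    [lower_bound, upper_bound] range of the rule's output fuzzy set.

    Assumption: the constraint is enforced by clipping.
    """
    raw = intercept + float(np.dot(coeffs, x))
    return float(np.clip(raw, lower_bound, upper_bound))

# Hypothetical rule whose output fuzzy set spans [20, 25].
inside = constrained_tsk_output([0.1, 0.1], [10.0, 10.0], 20.0, 20.0, 25.0)
clipped = constrained_tsk_output([1.0, 2.0], [10.0, 10.0], 0.0, 20.0, 25.0)
```

In the second call the unconstrained value is 30, so the rule reports 25, the upper bound of its fuzzy set. The crisp output therefore never contradicts the linguistic label attached to the rule.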

3. Dual Weight Mechanism

Fuzzy Rule Weights:

  • Support: $Support(A_j \to \tilde{C}_j) = \frac{1}{|N|} \sum_{p=1}^{N} \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)$
  • Confidence: $Confidence(A_j \to \tilde{C}_j) = \frac{\sum_{p=1}^{N} \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)}{\sum_{p=1}^{N} \mu_{A_j}(x_p)}$
  • Dominance: $D = [S_{Rule\_lower} \cdot C_{lower}, S_{Rule\_upper} \cdot C_{upper}]$

Error-Based Dominance: $Dominance_{ri} = \frac{1}{1 + r_i}$, where $r_i$ is the RMSE of the i-th rule.
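The weight formulas above can be sketched directly in NumPy. The function names are hypothetical. The sketch also assumes that the lower and upper dominance bounds are obtained by applying the support and confidence formulas to the lower and upper membership grades respectively, which the summary implies but does not spell out.

```python
import numpy as np

def support(mu_a, mu_c):
    """Mean over samples of antecedent grade times consequent grade."""
    return float(np.mean(mu_a * mu_c))

def confidence(mu_a, mu_c):
    """Joint grade mass divided by antecedent grade mass."""
    denom = float(np.sum(mu_a))
    return float(np.sum(mu_a * mu_c)) / denom if denom > 0 else 0.0

def fuzzy_dominance(mu_a_lower, mu_a_upper, mu_c_lower, mu_c_upper):
    """Interval [S_lower * C_lower, S_upper * C_upper] (assumed construction)."""
    d_lower = support(mu_a_lower, mu_c_lower) * confidence(mu_a_lower, mu_c_lower)
    d_upper = support(mu_a_upper, mu_c_upper) * confidence(mu_a_upper, mu_c_upper)
    return d_lower, d_upper

def error_dominance(rule_rmse):
    """Error-based dominance 1 / (1 + r_i)."""
    return 1.0 / (1.0 + rule_rmse)

# Hypothetical upper membership grades for four training samples.
mu_a = np.array([1.0, 0.5, 0.0, 0.5])
mu_c = np.array([1.0, 0.5, 0.5, 0.0])
s = support(mu_a, mu_c)        # 1.25 / 4 = 0.3125
c = confidence(mu_a, mu_c)     # 1.25 / 2 = 0.625
d_lower, d_upper = fuzzy_dominance(0.8 * mu_a, mu_a, 0.8 * mu_c, mu_c)
```

A rule with zero error gets an error-based dominance of 1, and the value decays toward 0 as the rule's RMSE grows.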

4. ACO Rule Selection

Employs Ant Colony Optimization to select optimal rule subsets:

  • Each ant constructs rule subset solutions
  • Solutions are evaluated based on RMSE performance
  • Pheromone update mechanism guides the search process
  • Balances model compactness and prediction accuracy
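The loop described above can be sketched as a small binary ant colony search over rule subsets. This is a simplified illustration under stated assumptions, not the paper's algorithm: the per-rule pheromone used as an inclusion probability, the `size_penalty` term for compactness, the fallback prediction for uncovered samples, and all names and default values are choices made for this example.

```python
import numpy as np

def subset_rmse(mask, firing, rule_outputs, y):
    """RMSE of the firing-strength-weighted average over the selected rules."""
    w = firing[:, mask]
    total = w.sum(axis=1)
    covered = total > 0
    if not covered.any():
        return np.inf
    pred = np.full(len(y), y.mean())  # fallback where no selected rule fires
    pred[covered] = (w[covered] * rule_outputs[covered][:, mask]).sum(axis=1) / total[covered]
    return float(np.sqrt(np.mean((pred - y) ** 2)))

def aco_select_rules(firing, rule_outputs, y, n_ants=20, n_iter=30,
                     evaporation=0.1, size_penalty=0.01, seed=0):
    rng = np.random.default_rng(seed)
    n_rules = firing.shape[1]
    pheromone = np.full(n_rules, 0.5)  # inclusion probability per rule

    def cost(mask):
        # RMSE plus an assumed penalty that rewards smaller rule bases.
        return subset_rmse(mask, firing, rule_outputs, y) + size_penalty * mask.sum()

    best_mask = np.ones(n_rules, dtype=bool)  # start from the full rule base
    best_cost = cost(best_mask)
    for _ in range(n_iter):
        for _ in range(n_ants):
            mask = rng.random(n_rules) < pheromone  # one ant builds a subset
            c = cost(mask)
            if c < best_cost:
                best_mask, best_cost = mask, c
        # Evaporate, then reinforce the rules used by the best subset so far.
        pheromone = (1 - evaporation) * pheromone + evaporation * best_mask
        pheromone = np.clip(pheromone, 0.05, 0.95)
    return best_mask, best_cost

# Toy problem: six triangular rules with constant outputs on y = 2x.
data_rng = np.random.default_rng(1)
X = data_rng.random(200)
y = 2.0 * X
centers = np.linspace(0.0, 1.0, 6)
firing = np.maximum(0.0, 1.0 - np.abs(X[:, None] - centers[None, :]) / 0.3)
rule_outputs = np.tile(2.0 * centers, (len(X), 1))
full_cost = subset_rmse(np.ones(6, dtype=bool), firing, rule_outputs, y) + 0.01 * 6
best_mask, best_cost = aco_select_rules(firing, rule_outputs, y)
```

Because the search starts from the full rule base, the returned subset never costs more than using every rule under this example's cost function.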

5. Defuzzification

Computes the final output using the TSK weighted-average method: $y_{final} = \frac{\sum_{i=1}^{M} \omega_i y_i}{\sum_{i=1}^{M} \omega_i}$, where $y_i$ is the crisp output of rule $i$, $\omega_i$ is its weight, and $M$ is the number of rules.
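A minimal sketch of the weighted average follows. How the weights are composed (for example, firing strength combined with a dominance value) is not specified in this summary, so the function takes them as given. The name `weighted_average_output` is hypothetical.

```python
import numpy as np

def weighted_average_output(weights, rule_outputs):
    """Weighted average of per-rule crisp outputs."""
    weights = np.asarray(weights, dtype=float)
    rule_outputs = np.asarray(rule_outputs, dtype=float)
    total = weights.sum()
    if total == 0:
        raise ValueError("no rule fired for this input")
    return float(np.dot(weights, rule_outputs) / total)

# Three hypothetical rules: (0.5*10 + 0.25*20 + 0.25*40) / 1.0
y_final = weighted_average_output([0.5, 0.25, 0.25], [10.0, 20.0, 40.0])
print(y_final)  # 20.0
```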

Technical Innovations

  1. Constrained TSK Component: Unlike traditional TSK, function outputs are restricted within fuzzy set boundaries
  2. Subset Training: TSK functions are trained only on training samples with positive firing strength
  3. Dual Dominance Mechanism: Combines fuzzy logic and numerical precision in weight calculation
  4. Maintaining Interpretability: Rule form resembles traditional Mamdani systems but with higher precision

Experimental Setup

Datasets

Six benchmark datasets from the KEEL repository:

  1. Concrete Compressive Strength: 1,030 instances, 8 input variables
  2. Diabetes: 768 records, 2 input attributes
  3. ELE-2: 11,105 instances, 4 predictor variables
  4. Mortgage: 1,045 instances, 14 variables
  5. Treasury: 956 records, 16 financial attributes
  6. Wankara: 1,609 records, 4 input variables

Evaluation Metrics

  • RMSE (Root Mean Squared Error): Primary evaluation metric
  • Interpretability Metrics: Rule coverage, active rules count, rule base characteristics, robustness

Comparison Methods

  • MP: Multilayer Perceptron
  • SMOreg: Support vector regression trained with Sequential Minimal Optimization
  • WM: Wang-Mendel Algorithm
  • CHV: Cordon, Herrera, and Villar Algorithm
  • GLD-WM: Global Lateral Parameters + WM Rule Base

Model Variants

  • HIT2-MTSK-D2: 2nd-order TSK function
  • HIT2-MTSK-D3: 3rd-order TSK function

Experimental Results

Main Results

Across six datasets:

  • 4 datasets: Achieved best fuzzy method scores
  • 2 datasets: Surpassed opaque models
  • 1 dataset: Achieved best overall score
  • RMSE improvement range: 0.4% - 19%

Specific Performance:

  • Concrete: HIT2-MTSK-D3 achieves 7.29 RMSE, best overall performance
  • Mortgage: HIT2-MTSK-D3 achieves 0.13 RMSE, best fuzzy method
  • Treasury: HIT2-MTSK-D3 achieves 0.27 RMSE, best fuzzy method
  • Wankara: Both variants achieve 1.58 RMSE, matching SMOreg performance

Case Study: California Housing Price Prediction

Performance on California housing dataset:

  • Dataset: 20,640 observations, 8 features
  • HIT2-MTSK RMSE: 0.695
  • Mamdani FRBS RMSE: 0.751
  • Number of Rules: 75 rules
  • Coverage: 100% of test instances activate at least one rule
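The two RMSE figures above can be turned into a relative improvement. The percentage below is computed here from those two numbers and is not quoted from the paper; `relative_improvement` is a hypothetical helper name.

```python
def relative_improvement(baseline_rmse, new_rmse):
    """Percentage reduction in RMSE relative to the baseline."""
    return 100.0 * (baseline_rmse - new_rmse) / baseline_rmse

# Mamdani FRBS (0.751) versus HIT2-MTSK (0.695) on California housing.
gain = relative_improvement(0.751, 0.695)
print(f"{gain:.1f}%")  # 7.5%
```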

Interpretability Assessment

  • Category Coverage: Covers 96% of actual test data range
  • Active Rules: Average 8.38 rules fired at threshold 0.15
  • Rule Characteristics: 75 rules, average 2.67 antecedents
  • Noise Robustness: 12.24% prediction variation at 10% noise level
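Two of these metrics can be computed from a matrix of firing strengths, as sketched below. The matrix layout (one row per test instance, one column per rule), the name `rule_activity`, and the reading of coverage as the fraction of instances that activate at least one rule are assumptions of this example. The 0.15 threshold is taken from the list above.

```python
import numpy as np

def rule_activity(firing, threshold=0.15):
    """Return (average rules fired at or above threshold, share of covered instances)."""
    active = firing >= threshold
    avg_active = float(active.sum(axis=1).mean())
    coverage = float((firing > 0).any(axis=1).mean())
    return avg_active, coverage

# Hypothetical firing strengths: 4 test instances, 3 rules.
firing = np.array([[0.9, 0.2, 0.0],
                   [0.1, 0.0, 0.0],
                   [0.0, 0.5, 0.3],
                   [0.0, 0.0, 0.0]])
avg_active, coverage = rule_activity(firing)
```

In this toy matrix, the second instance fires a rule only below the threshold, so it counts toward coverage but not toward the active-rule average.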

Related Work

Development of Fuzzy Rule Systems

  • Wang-Mendel Method: Early data-driven rule generation
  • Genetic Fuzzy Systems (GFS): Integrating genetic algorithms for rule learning optimization
  • Evolutionary Algorithm Applications: Optimizing knowledge base, rule base, and database

Hybrid Approaches

  • Gr-MF and GA-WM: Combining rule base and database optimization
  • GA-COR Method: Integrating genetic algorithms with COR framework
  • WM+GL Method: Using linguistic 2-tuples for lateral tuning

Advantages of This Work

Compared to existing work, this method improves precision (reported RMSE improvements of 0.4% to 19% on the benchmark datasets) while keeping rules in a Mamdani-like form, addressing the accuracy-interpretability trade-off in traditional methods.

Conclusions and Discussion

Main Conclusions

  1. Effective Balance: Successfully balances the interpretability-accuracy trade-off in fuzzy systems
  2. Significant Improvement: Achieves the best fuzzy-method score on 4 of the 6 benchmark datasets and the best overall score on 1
  3. Maintains Interpretability: Rule form resembles traditional Mamdani systems, easy to understand
  4. Strong Robustness: Demonstrates stable performance across different domains and datasets

Limitations

  1. Complexity Trade-off: Higher-order TSK functions may lead to overfitting
  2. Optimization Space: Performance improvement potential remains on some datasets
  3. Computational Complexity: ACO optimization increases computational cost
  4. Parameter Sensitivity: Requires reasonable setting of fuzzy set quantity and TSK function order

Future Directions

  1. Optimization Enhancement: Explore other metaheuristic optimization techniques
  2. Domain Adaptation: Domain-specific preprocessing and feature selection
  3. Extended Evaluation: Validation on more datasets and evaluation metrics
  4. Theoretical Analysis: In-depth convergence and complexity analysis

In-Depth Evaluation

Strengths

  1. Strong Innovation: First to propose constrained TSK component in hybrid fuzzy systems
  2. High Practical Value: Addresses key problems in real-world applications
  3. Comprehensive Experiments: Validation on multiple benchmark datasets with detailed interpretability analysis
  4. Reasonable Methodology: Clear technical design logic and solid theoretical foundation

Weaknesses

  1. Insufficient Theoretical Analysis: Lacks theoretical guarantees for convergence and complexity
  2. Limited Parameter Selection Guidance: No systematic parameter selection strategy provided
  3. Computational Efficiency: Insufficient detailed analysis of ACO optimization complexity
  4. Limited Applicability Boundaries: Limited discussion on method applicability conditions

Impact

  1. Academic Contribution: Provides new research direction for fuzzy systems field
  2. Practical Value: Important application value in scenarios requiring explainable AI
  3. Reproducibility: Detailed method description facilitates reproduction and extension

Applicable Scenarios

  1. Financial Risk Assessment: Requires interpretable prediction models
  2. Medical Diagnosis: Demands transparent decision-making processes
  3. Engineering Control: Requires understanding system behavior in control applications
  4. Small Dataset Regression: Scenarios where traditional deep learning is unsuitable

References

The paper cites important literature in fuzzy systems, evolutionary algorithms, and machine learning, including foundational work by Zadeh on fuzzy logic, classical fuzzy systems by Mamdani and TSK, and recent research on genetic fuzzy systems and interval type-2 fuzzy sets.


Overall Assessment: The paper proposes a hybrid fuzzy system that improves regression accuracy while keeping rules in an interpretable, Mamdani-like form. It achieves the best fuzzy-method score on four of the six benchmark datasets, with reported RMSE improvements of 0.4% to 19%. The methodology is well designed and the experiments cover both accuracy and interpretability, making it a useful contribution to fuzzy systems and explainable AI.