2025-11-14T07:52:11.150813

Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis

Bhatia, de Amorim, De Feo

Regression analysis is employed to examine and quantify the relationships between input variables and a dependent and continuous output variable. It is widely used for predictive modelling in fields such as finance, healthcare, and engineering. However, traditional methods often struggle with real-world data complexities, including uncertainty and ambiguity. While deep learning approaches excel at capturing complex non-linear relationships, they lack interpretability and risk over-fitting on small datasets. Fuzzy systems provide an alternative framework for handling uncertainty and imprecision, with Mamdani and Takagi-Sugeno-Kang (TSK) systems offering complementary strengths: interpretability versus accuracy. This paper presents a novel fuzzy regression method that combines the interpretability of Mamdani systems with the precision of TSK models. The proposed approach introduces a hybrid rule structure with fuzzy and crisp components and dual dominance types, enhancing both accuracy and explainability. Evaluations on benchmark datasets demonstrate state-of-the-art performance in several cases, with rules maintaining a component similar to traditional Mamdani systems while improving precision through improved rule outputs. This hybrid methodology offers a balanced and versatile tool for predictive modelling, addressing the trade-off between interpretability and accuracy inherent in fuzzy systems. In the 6 datasets tested, the proposed approach gave the best fuzzy methodology score in 4 datasets, out-performed the opaque models in 2 datasets and produced the best overall score in 1 dataset with the improvements in RMSE ranging from 0.4% to 19%.

academic

Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis

基本信息

论文ID: 2510.13437
标题: Hybrid Interval Type-2 Mamdani-TSK Fuzzy System for Regression Analysis
作者: Ashish Bhatia, Renato Cordeiro de Amorim, Vito De Feo (University of Essex, United Kingdom)
分类: cs.LG (Machine Learning)
发表时间: 2025年10月15日
论文链接: https://arxiv.org/abs/2510.13437v1

摘要

回归分析被广泛应用于金融、医疗和工程等领域的预测建模，用于检查和量化输入变量与连续输出变量之间的关系。然而，传统方法在处理现实世界数据的复杂性（包括不确定性和模糊性）时往往存在困难。虽然深度学习方法擅长捕捉复杂的非线性关系，但缺乏可解释性且在小数据集上存在过拟合风险。模糊系统为处理不确定性和不精确性提供了替代框架，其中Mamdani和Takagi-Sugeno-Kang (TSK)系统提供了互补的优势：可解释性与准确性。本文提出了一种新颖的模糊回归方法，结合了Mamdani系统的可解释性和TSK模型的精确性。该方法引入了具有模糊和清晰组件以及双重主导类型的混合规则结构，同时增强了准确性和可解释性。

研究背景与动机

问题定义

传统回归方法在处理现实世界数据时面临的主要挑战：

不确定性和模糊性：现实数据中存在的固有不确定性和语言信息
可解释性与准确性的权衡：深度学习模型虽然准确但缺乏可解释性
小数据集问题：复杂模型在小数据集上容易过拟合

现有方法局限性

传统回归方法：假设精确和明确的数值关系，难以处理不确定性
深度学习方法：缺乏可解释性，参数众多，不适合小数据集训练
Mamdani模糊系统：可解释性强但精度有限，粗粒度划分导致性能下降
TSK模糊系统：精度高但缺乏可解释性，违背了使用模糊系统的初衷

研究动机

开发一个既能保持Mamdani系统可解释性又能达到TSK系统精确性的混合框架，为预测建模提供平衡且多功能的工具。

核心贡献

混合规则结构：提出了结合Mamdani系统语言可解释性和TSK模型数值精确性的新型模糊回归系统
双重主导机制：引入了两种规则权重计算方法——基于模糊支持度/置信度和基于误差的主导度
约束TSK组件：TSK函数输出被约束在相应模糊集的边界内，保持可解释性
区间二型模糊集：使用区间二型模糊集更好地处理不确定性
ACO优化：采用蚁群优化算法进行规则子集选择，平衡模型紧凑性和准确性

其中 $\underline{\mu}(x)$ 和 $\overline{\mu}(x)$ 分别是下界和上界隶属度。

2. 混合规则结构

每个规则包含两个后件组件：

规则形式：

IF x1 is F1 AND ... AND xn is Fn 
THEN (y is G, y = f(x1, x2, ..., xn))

模糊组件：传统Mamdani后件，指向输出模糊集
TSK函数组件：n阶多项式函数，提供清晰输出值

TSK函数约束： $y_{output} \in [LowerBound(F_{upper}), UpperBound(F_{upper})]$

确保TSK输出始终在对应模糊集边界内。

3. 双重权重机制

模糊规则权重：

支持度： $Support(A_j \to \tilde{C}_j) = \frac{1}{|N|} \sum_{p=1}^N \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)$
置信度： $Confidence(A_j \to \tilde{C}_j) = \frac{\sum_{p=1}^N \mu_{A_j}(x_p) \cdot \mu_{C_j}(y_p)}{\sum_{p=1}^N \mu_{A_j}(x_p)}$
主导度： $D = [S_{Rule\_lower} \cdot C_{lower}, S_{Rule\_upper} \cdot C_{upper}]$