Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
Marconato, Bortolotti, van Krieken et al.
Neuro-symbolic (NeSy) AI aims to develop deep neural networks whose predictions comply with prior knowledge encoding, e.g., safety or structural constraints. As such, it represents one of the most promising avenues for reliable and trustworthy AI. The core idea behind NeSy AI is to combine neural and symbolic steps: neural networks are typically responsible for mapping low-level inputs into high-level symbolic concepts, while symbolic reasoning infers predictions compatible with the extracted concepts and the prior knowledge. Despite this promise, it was recently shown that, whenever the concepts are not supervised directly, NeSy models can be affected by Reasoning Shortcuts (RSs): they can achieve high label accuracy while grounding the concepts incorrectly. RSs can compromise the interpretability of the model's explanations and its performance in out-of-distribution scenarios, and therefore its reliability. At the same time, RSs are difficult to detect and prevent unless concept supervision is available, which is typically not the case. Moreover, the literature on RSs is scattered, making it difficult for researchers and practitioners to understand and tackle this challenging problem. This overview addresses this issue by providing a gentle introduction to RSs, discussing their causes and consequences in intuitive terms. It also reviews and elucidates existing theoretical characterizations of this phenomenon. Finally, it details methods for dealing with RSs, including mitigation and awareness strategies, and maps their benefits and limitations. By reformulating advanced material in a digestible form, this overview aims to provide a unifying perspective on RSs and to lower the barrier to entry for tackling them. Ultimately, we hope this overview contributes to the development of reliable NeSy and trustworthy AI models.
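To make the idea of a reasoning shortcut concrete, the following is a minimal toy sketch, not taken from the overview itself. It assumes a hypothetical task in which the prior knowledge states that the label is the XOR of two binary concepts, and it replaces the neural concept extractor with hand-written functions (`intended_extractor` and `shortcut_extractor` are illustrative names). The shortcut extractor flips both concepts, so every concept is grounded incorrectly, yet the label predicted through the knowledge is always right.

```python
from itertools import product


def knowledge(c1: int, c2: int) -> int:
    """Assumed prior knowledge for this toy task: label = c1 XOR c2."""
    return c1 ^ c2


def intended_extractor(true_concepts):
    """Stand-in for a concept extractor that grounds concepts correctly."""
    return true_concepts


def shortcut_extractor(true_concepts):
    """Stand-in for a concept extractor that has learned a shortcut:
    it flips both binary concepts."""
    c1, c2 = true_concepts
    return (1 - c1, 1 - c2)


def evaluate(extractor):
    """Return (label accuracy, concept accuracy) over all four inputs."""
    inputs = list(product([0, 1], repeat=2))
    label_hits = 0
    concept_hits = 0
    for true_concepts in inputs:
        predicted = extractor(true_concepts)
        # Labels are obtained by applying the knowledge to the concepts.
        label_hits += knowledge(*predicted) == knowledge(*true_concepts)
        # A concept prediction counts only if both concepts are correct.
        concept_hits += predicted == true_concepts
    n = len(inputs)
    return label_hits / n, concept_hits / n


intended_scores = evaluate(intended_extractor)
shortcut_scores = evaluate(shortcut_extractor)
print("intended (label acc, concept acc):", intended_scores)
print("shortcut (label acc, concept acc):", shortcut_scores)
```

In this sketch both extractors reach perfect label accuracy, so label supervision alone cannot tell them apart; only concept-level information reveals that the second one grounds the concepts incorrectly.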
Authors: Emanuele Marconato, Samuele Bortolotti, Emile van Krieken, Paolo Morettin, Elena Umili, Antonio Vergari, Efthymia Tsamoura, Andrea Passerini, Stefano Teso