Large Language Models have been shown to contain extensive world knowledge in their parameters, enabling impressive performance on many knowledge intensive tasks. However, when deployed in novel settings, LLMs often encounter situations where they must integrate parametric knowledge with new or unfamiliar information. In this work, we explore whether LLMs can combine knowledge in-context with their parametric knowledge through the lens of counterfactual reasoning. Through synthetic and real experiments in multi-hop reasoning problems, we show that LLMs generally struggle with counterfactual reasoning, often resorting to exclusively using their parametric knowledge. Moreover, we show that simple post-hoc finetuning can struggle to instill counterfactual reasoning ability -- often leading to degradation in stored parametric knowledge. Ultimately, our work reveals important limitations of current LLM's abilities to re-purpose parametric knowledge in novel settings.
- è«æID: 2506.15732
- ã¿ã€ãã«: Can LLMs Reconcile Knowledge Conflicts in Counterfactual Reasoning?
- èè
: Khurram Yamin*, Gaurav Ghosal*, Bryan Wilder (Carnegie Mellon University)
- åé¡: cs.AI cs.LG
- çºè¡šææ/äŒè°: ICLR 2026
- è«æãªã³ã¯: https://arxiv.org/abs/2506.15732v2
å€§èŠæš¡èšèªã¢ãã«ïŒLLMïŒã¯ãã©ã¡ãŒã¿ã«è±å¯ãªäžçç¥èã嫿ããå€ãã®ç¥èéçŽçã¿ã¹ã¯ã§åªããæ§èœã瀺ããŠãããããããæ°ããç°å¢ã«é
眮ãããéãLLMã¯ãã©ã¡ãŒã¿åãããç¥èãæ°èŠãŸãã¯äžæ
£ããªæ
å ±ãšçµã¿åãããå¿
èŠãããç¶æ³ã«é »ç¹ã«çŽé¢ãããæ¬ç ç©¶ã¯åäºå®æšè«ã®èгç¹ããLLMãæèç¥èãšãã©ã¡ãŒã¿åç¥èãçµã¿åãããããšãã§ãããã©ãããæ¢ç©¶ããã倿®µéæšè«åé¡ã«ãããåæããã³å®éšçæ€èšŒãéããŠãæ¬ç ç©¶ã¯LLMãåäºå®æšè«ã«ãããŠåºç¯ãªå°é£ã瀺ãããã°ãã°ãã©ã¡ãŒã¿åç¥èã®ã¿ã«äŸåããããšã瀺ããŠãããããã«ãåçŽãªäºåŸåŸ®èª¿æŽã¯åäºå®æšè«èœåã®æ€ã蟌ã¿ã«å°é£ã§ããããã°ãã°ä¿åããããã©ã¡ãŒã¿åç¥èã®å£åããããããæçµçã«ãæ¬ç ç©¶ã¯çŸåšã®LLMãæ°ããèšå®ã«ãããŠãã©ã¡ãŒã¿åç¥èãåå©çšããèœåã«ãããéèŠãªéçãæããã«ããã
æ¬ç ç©¶ã解決ããããšããæ žå¿çåé¡ã¯ä»¥äžã®éãã§ããïŒçŸä»£ã®LLMã¯ããã©ã¡ãŒã¿åç¥èãæèå
ã®åäºå®åæãšéžæçã«çµã¿åãããŠã倿®µéåé¡ã«æ£ããçããããšãã§ãããïŒ
- å®è·µçå¿çšã®å¿
èŠæ§ïŒçŸå®äžçã®å€ãã®ã·ããªãªã§ã¯ãLLMãäºååŠç¿ç¥èãæšè«æã«æäŸãããæ°èŠãŸãã¯ä»®èª¬çæ
å ±ãšçµã¿åãããå¿
èŠããã
- ç¥èççŸã®èª²é¡ïŒå€éšææžãå
éšç¥èãšççŸããå Žåãæ€çŽ¢æ¡åŒµçæã¯å°é£ã«çŽé¢ãã
- å®å
šæ§ãéèŠãªã¢ããªã±ãŒã·ã§ã³ïŒå¯Ÿè©±åã·ã¹ãã ãæ€çŽ¢æ¡åŒµãã€ãã©ã€ã³ãããã³å®å
šæ§ãéèŠãªã¢ããªã±ãŒã·ã§ã³ã§ã¯ãæ£ç¢ºãªæ¡ä»¶ä»ãæšè«ãäžå¯æ¬ ã§ãã
- æ¢åã®å€æ®µéQAãã³ãããŒã¯ã¯äž»ã«ãã¢ãã«ãä¿åäºå®ãæ³èµ·ãããããã©ã¡ãŒã¿åç¥èãã§ãŒã³ãçµã¿åãããèœåãè©äŸ¡ããããäºéèŠä»¶ããã¹ãããªã
- ç¥èççŸç ç©¶ã¯åäºå®å€æ®µéæšè«ã®äœç³»çãªæ¢ç©¶ã«æ¬ ãã
- RAGæ¹æ³ã¯å€éšæ
å ±ãçµ±åã§ããããåäºå®æšè«ã®ç¬ç¹ã®èª²é¡ã«å¯ŸåŠã§ããªã
åäºå®æšè«ãšããå
·äœçãªã¿ã¹ã¯ãéããŠãLLMãç¥èççŸã«çŽé¢ããå Žåã®æ§èœãäœç³»çã«ç ç©¶ãããç¹ã«ãæèçäžæžãïŒContextual OverrideïŒãšéžæçæ€çŽ¢ïŒSelective RetrievalïŒã®èœåãåæã«å¿
èŠãšããå Žåãç ç©¶ããã
- åäºå®QAãã³ãããŒã¯ïŒåæã°ã©ãããŒã¹ã®ã¿ã¹ã¯ãšçŸå®äžçã®å ææšè«ã·ããªãªã«åºã¥ããŠãäºååŠç¿ç¥èã°ã©ãã«å¯Ÿãã(i)匷åã(ii)远å ã(iii)ççŸã(iv)ç¡é¢é£ãªæèã®å Žåãåé¢ãããã³ãããŒã¯ãå°å
¥
- å®èšŒçåæïŒGPT-4oããã³ä»ã®æå
端ã¢ãã«ã®å®éšãéããŠã2ã€ã®äž»èŠãªå€±æãã¿ãŒã³ãç¹å®ïŒ(a)æèç¡èŠïŒã¢ãã«ãä¿åäºå®ãããã©ã«ãã§äœ¿çšïŒããã³(b)æèéå°é©åïŒã¢ãã«ãããã³ããã«ç²ç®çã«åŸãïŒ
- 埮調æŽã®èœãšã穎åæïŒåçŽãªäºåŸåŸ®èª¿æŽãåäºå®äŸã«å¯ŸããŠããããªå©çããããããããäºæããªããã¥ãŒãªã¹ãã£ãã¯ãèªå°ããããšã§æšæºäºå®ãã³ãããŒã¯ã®æ§èœãäœäžãããå¯èœæ§ãããããšã蚌æ
- å®è·µçæçŸ©ïŒç ç©¶çµæã察話åã·ã¹ãã ãæ€çŽ¢æ¡åŒµãã€ãã©ã€ã³ãããã³å®å
šæ§ãéèŠãªã¢ããªã±ãŒã·ã§ã³ã«äžãã圱é¿ãè°è«
æ¬ç ç©¶ã¯åäºå®å€æ®µéæšè«ã¿ã¹ã¯ãå®çŸ©ããã¢ãã«ã«ä»¥äžãèŠæ±ããïŒ
- æèçäžæžãïŒããã©ã«ãäºå®ãäžæçã«æå¶ãã仮説çåæãåãå
¥ãã
- éžæçæ€çŽ¢ïŒéã¿ã«ä¿åãããé¢é£ããé¢é£æ§ãæ€çŽ¢ããŠå©çšããããã ããäžéšã®æ
å ±ã¯æ¢ã«å€æŽãããŠãã
äŸïŒãããªãã€ã¿ãªã¢ã«äœçœ®ããŠããå Žåããšããã§ã«å¡ã¯ã©ã®åœã«ããã ãããïŒã
- ãããªã¯ãã©ã³ã¹ã«ããããšãããã©ã¡ãŒã¿åç¥èãäžæžãããå¿
èŠããã
- ããšããã§ã«å¡ã¯ããªã«ããããšããé¢é£æ§ãä¿æããå¿
èŠããã
æèæ
å ±ã4ã€ã®ã·ããªãªã«åé¡ïŒ
- ã·ããªãª1ïŒäºåç¥èã®åŒ·åïŒïŒãã©ã¡ãŒã¿åç¥èã°ã©ãã«æ¢ã«ååšããé¢ä¿ãæäŸ
- ã·ããªãª2ïŒæ°æ
å ±ã®è¿œå ïŒïŒã¯ãšãªã«çããããã«å¿
èŠã ããã©ã¡ãŒã¿åç¥èã°ã©ãã«æ¬ èœããŠããæ
å ±ãæäŸ
- ã·ããªãª3ïŒäºåç¥èãšã®ççŸïŒïŒæ¢åã®ãã©ã¡ãŒã¿åç¥èãšåŒ·ãççŸããæ
å ±ãæäŸ
- ã·ããªãª4ïŒç¡é¢é£æ
å ±ïŒïŒã¯ãšãªãšç¡é¢ä¿ãªæ
å ±ãæäŸ
å¶åŸ¡ãããåæç¥èã°ã©ãèšå®ã§ïŒ
- æåã°ã©ãGãã©ã³ãã ã«çæãé ç¹ã¯ãšã³ãã£ãã£ã蟺ã¯é¢ä¿ã衚ã
- ååäºå®ïŒåäžèŸºïŒãšæšè«äºå®ïŒ2段éã®çµã¿åããïŒãåºå¥
- 3ã€ã®åäºå®ã¿ã€ãããã¹ãïŒ
- ããã1é¢é£ïŒåäºå®åæãæšè«äºå®ã®æåã®ããããä¿®æ£
- ããã2é¢é£ïŒåäºå®åæãããªããžãšã³ãã£ãã£ãšæçµåçã®ãªã³ã¯ãä¿®æ£
- ç¡é¢é£åäºå®ïŒåäºå®åæã倿®µéã¯ãšãªãšå®å
šã«ç¡é¢ä¿
3ã€ã®æŠç¥ãæ¯èŒïŒ
- æšæºïŒçŽæ¥å æã¯ãšãª
- CoTïŒæèã®é£éããã³ãã
- FTïŒCoT説æä»ãã®åäºå®äŸã§ã®åŸ®èª¿æŽ
- å®äžçå®éšïŒå æé¢ä¿ã«åºã¥ãäºå€åé¡ã¿ã¹ã¯ãã©ã³ãã ããŒã¹ã©ã€ã³ã¯50%
- åæå®éšïŒã©ã³ãã ã«çæãããç¥èã°ã©ããååäºå®ãšæšè«äºå®ãå«ã
- æ£ç¢ºåºŠïŒAccuracyïŒ
- 1段éããã³2æ®µéæšè«ã¿ã¹ã¯ã§ã®æ§èœ
- GPT-4oïŒæšæºãCoTã埮調æŽçïŒ
- GPT-5 (Thinking)
- Llama 3.1 8B
- GPT埮調æŽïŒåŠç¿ããŒã¯ã³38,754ã3ãšããã¯ãããããµã€ãº1ãåŠç¿çåæ°2
- Llama埮調æŽïŒ5ãšããã¯ãLoRA rank 8ãåŠç¿ç0.0001
- åæå®éšïŒ4ã€ã®NVIDIA A6000 GPU䜿çšãåèš72 GPUæé
- ã·ããªãª1ïŒäºåç¥èã®åŒ·åïŒïŒãã¹ãŠã®ã¢ãã«ãåªããæ§èœã瀺ããæ£ç¢ºåºŠã¯90%-100%ã®ç¯å²
- ã·ããªãª2ïŒæ
å ±ã®è¿œå ïŒïŒé埮調æŽã¢ãã«ã®æ£ç¢ºåºŠã¯60-75%ã埮調æŽåŸã¯çŽ90%ã«åäž
- ã·ããªãª3ïŒäºåç¥èãšã®ççŸïŒïŒæ§èœã50%ã®ããŒã¹ã©ã€ã³ã«è¿ãæ°Žæºã«åŽ©å£ã埮調æŽã¯ããããªæ¹åã®ã¿
- ã·ããªãª4ïŒç¡é¢é£æ
å ±ïŒïŒåŒ·ãæ§èœãGPT-5ã¯ã»ãŒå®å
šãªæ£ç¢ºåºŠã«è¿ã
- 埮調æŽãã·ã§ãŒãã«ãããèªå°ïŒã¢ãã«ã¯çã®æšè«ãè¡ã代ããã«ãåäºå®åæã«ç€ºããããšã³ãã£ãã£ãç¹°ãè¿ãããšãçŽ æ©ãåŠç¿
- éžæçäžæžãã®å°é£ïŒã¢ãã«ã¯åäºå®åæããã€é¢é£ããããåºå¥ããããšãåŠç¿ã§ããªã
- äºååŠç¿äžã®åäºå®ããŒã¿ã®çµã¿èŸŒã¿ïŒåäºå®æšè«æ§èœãæ¹åã§ããããäºå®ã¿ã¹ã¯æ§èœãæãªãå¯èœæ§ããã
å¶åŸ¡å®éšãéããŠãæ§èœäœäžããã©ãŒããã倿Žã«ãããã®ã§ã¯ãªãããšã蚌æïŒ
- æèçäžæžããå¿
èŠãšããªãCoTã¿ã¹ã¯ãæ§ç¯
- 埮調æŽã¯ãã®ãããªã¿ã¹ã¯ã«çŽ æ©ãé©å¿ïŒãã¹ãæ£ç¢ºåºŠ100%ïŒ
- åäºå®æšè«ã®å€±æã¯äžè¬çãªç Žæ»
çå¿åŽã§ã¯ãªããã¿ã¹ã¯èªäœã®å°é£ãã«ç±æ¥ããããšã瀺å
- 2ã€ã®äž»èŠãªå€±æãã¿ãŒã³ïŒ
- æèç¡èŠïŒã¢ãã«ãä¿åäºå®ãããã©ã«ãã§äœ¿çš
- æèéå°é©åïŒã¢ãã«ãããã³ããã«ç²ç®çã«åŸãããé¢é£ãªã³ã¯ãå¿ãã
- ã¢ã©ã€ã¡ã³ãã®åœ±é¿ïŒçŸä»£ã®æ¬çªç°å¢LLMã¯äºå®æ§ãšå®å
šæ§ã¢ã©ã€ã¡ã³ãèšç·ŽãåããŠãããäºååŠç¿ãã©ã¡ãŒã¿åç¥èãžã®äŸåãååããã
- 埮調æŽã®éçïŒåçŽãªäºåŸåŸ®èª¿æŽã¯å
ç¢ãªåäºå®æšè«èœåã®æ€ã蟌ã¿ã«å°é£ã§ãã
- HotpotQAãªã©ã®ãã³ãããŒã¯ã倿®µéæšè«èœåããã¹ã
- æ¢åç ç©¶ã¯äž»ã«ãã©ã¡ãŒã¿åç¥èã®ã¿ãå«ã倿®µéæšè«ã«çŠç¹
- æ¬è«æã¯ãã©ã¡ãŒã¿åç¥èãšæèç¥èã®çµã¿åãããå¿
èŠãšããå Žåãç¬èªã«ç ç©¶
- RAGæ¹æ³ããã©ã¡ãŒã¿åã¡ã¢ãªãšæ€çŽ¢æ
å ±ã®çµ±åã詊ã¿ã
- æ¢åæ¹æ³ã¯éåžžãåäºå®æšè«ã®ç¬ç¹ã®èª²é¡ã«é©ããªã
- ãã©ã¡ãŒã¿åç¥èãå®å
šã«ç Žæ£ããã®ã§ã¯ãªããéžæçã«ä¿æããã³çµ±åããå¿
èŠããã
- LLMã®å ææšè«èœåã¯æŽ»çºãªç ç©¶é å
- æ¢åãã³ãããŒã¯ïŒCLadderãCounterBenchãªã©ïŒã¯LLMã®æ£åŒãªåäºå®æšè«ã«ãããéçãæããã«ãã
- æ¬è«æã¯LLMã倿®µéæšè«ã«ãããŠãã©ã¡ãŒã¿åç¥èãšåäºå®åæãã©ã®ããã«çµ±åããããçè§£ãã空çœãåãã
- æ ¹æ¬çãªéçïŒçŸåšã®LLMã¯ãççŸãŸãã¯æ°ããæ
å ±ã«å¿çããŠå
éšç¥èã°ã©ããåçã«ä¿®æ£ãŸãã¯æ¡åŒµããããã®å
ç¢ãªã¡ã«ããºã ã«æ¬ ãã
- 倱æãã¿ãŒã³ã®æ®éæ§ïŒæèç¡èŠãšæèéå°é©åã®åé¡ã¯ãç°ãªãããã³ããæŠç¥ããã³åŸ®èª¿æŽæ¹æ³å
šäœã§æç¶ãã
- 埮調æŽå¹æã®é宿§ïŒåçŽãªåŸ®èª¿æŽæ¹æ³ã¯åäºå®æšè«åé¡ã广çã«è§£æ±ºã§ãããæ¢åç¥èãæãªãå¯èœæ§ããã
- ç°¡ç¥åãããèšå®ïŒåæç°å¢ã§ã¯åäºå®åæãéçç¥èã°ã©ãã®åäžèŸºç·šéãšããŠè¡šçŸãããã¯ãšãªã¯2段éãªã³ã¯ã«å¶éããã
- è€éæ§ã®äžè¶³ïŒçŸå®äžçã®ã·ããªãªã¯è€æ°è¿°èªçžäºäœçšãææ§ãŸãã¯ç¢ºççé¢ä¿ãè€æ°ãœãŒã¹ã®ãã€ãºã®ãã蚌æ ãå«ã
- æ·±ãã®å¶éïŒããæ·±ããããè€éãªå€æ®µéé¢ä¿ã«ã¯æ¡åŒµãããŠããªã
- æ°ããã¢ããªã³ã°ãã©ãã€ã ïŒä¿åç¥èãšæèç¥èãåçã«çµ±åããªãããã©ã¡ãã®åŽé¢ãæãªããªãæ°ããã¢ããªã³ã°ããã³èšç·Žãã©ãã€ã ã®éçºãå¿
èŠ
- ã¡ã«ããºã ç ç©¶ïŒéžæçç¥èäžæžãã®ã¡ã«ããºã å®è£
ã®æ·±ãç ç©¶
- è€éæ§ã®æ¡åŒµïŒããæ·±ããããè€éãªå€æ®µéé¢ä¿ããã³çŸå®ã·ããªãªãžã®åæã®æ¡åŒµ
- åé¡ã®éèŠæ§ïŒLLMãç¥èççŸã·ããªãªã«ãããéèŠãªéçãèå¥ããäœç³»çã«ç ç©¶
- å³å¯ãªå®éšèšèšïŒå®äžçãšåæç°å¢ãçµã¿åãããå
æ¬çãªåæèŠç¹ãæäŸ
- æŽå¯çãªçºèŠïŒ2ã€ã®æç¢ºãªå€±æãã¿ãŒã³ãæããã«ããLLMåäœã®çè§£ã«éèŠãªèŠè§£ãæäŸ
- æ¹æ³è«çè²¢ç®ïŒåäºå®æšè«èœåãè©äŸ¡ããããã®å¹æçãªãã¬ãŒã ã¯ãŒã¯ãææ¡
- 解決çã®æ¬ åŠïŒäž»ã«åé¡ãç¹å®ãããã广çãªè§£æ±ºçãææ¡ããŠããªã
- ã¢ãã«ç¯å²ã®éå®ïŒå°æ°ã®ã¢ãã«ã®ã¿ããã¹ãããããåºç¯ãªã¢ãã«è©äŸ¡ã«æ¬ ãã
- ã¿ã¹ã¯è€éæ§ïŒçŸåšã®ã¿ã¹ã¯èšå®ã¯æ¯èŒçåçŽã§ãå®éã®ã¢ããªã±ãŒã·ã§ã³ãšã®å·®ããã
- çè«åæã®äžè¶³ïŒå€±æã¡ã«ããºã ã®æ·±å±€çè«ç説æã«æ¬ ãã
- åŠè¡ç䟡å€ïŒLLMç¥èçµ±åç ç©¶ã«éèŠãªåºç€ãæäŸããåŸç¶ç ç©¶æ¹åãåçºããå¯èœæ§ããã
- å®çšçæçŸ©ïŒRAGã·ã¹ãã ããã³åçç¥èçµ±åãå¿
èŠãªã¢ããªã±ãŒã·ã§ã³ã«éèŠãªæå°ãæäŸ
- èŠåç圹å²ïŒç ç©¶è
ãšå®è·µè
ã«ç¥èççŸã·ããªãªã«ãããLLMã®éçã«æ³šæãä¿ã
- æ€çŽ¢æ¡åŒµã·ã¹ãã ïŒççŸæ
å ±ãåŠçããéã®RAGã·ã¹ãã èšèšãæå°
- 察話åAIïŒä»®èª¬ã·ããªãªãåŠçããå¿
èŠããã察話ã·ã¹ãã ã«åèãæäŸ
- å®å
šæ§ãéèŠãªã¢ããªã±ãŒã·ã§ã³ïŒæ£ç¢ºãªæ¡ä»¶ä»ãæšè«ãå¿
èŠãªé åã§ã®é©çšæã«ç¹å¥ãªæ³šæãå¿
èŠ
è«æã¯é¢é£åéã®éèŠãªç ç©¶ãåŒçšããŠããã以äžãå«ãïŒ
- 倿®µé質åå¿çãã³ãããŒã¯ïŒHotpotQAãNaturalQuestionsïŒ
- ç¥èççŸåŠçæ¹æ³ïŒRAGãREALMãDPRïŒ
- å ææšè«è©äŸ¡ïŒCLadderãCounterBenchïŒ
- LLMã¡ã«ããºã åæïŒGrokking transformersãªã©ïŒ
ç·åè©äŸ¡ïŒããã¯é«å質ã®ç ç©¶è«æã§ãããLLMãåäºå®æšè«ã«ãããéèŠãªéçãäœç³»çã«ç¹å®ãåæããŠãããå®å
šãªè§£æ±ºçã¯æäŸããŠããªãããLLMã®ç¥èçµ±åèœåãçè§£ãæ¹åããããã®éèŠãªåºç€ã確ç«ãããã®åéã®çºå±ã«éèŠãªæšé²åãããããã