The Curious Case of Curiosity across Human Cultures and LLMs
Borah, Mihalcea
Recent advances in Large Language Models (LLMs) have expanded their role in human interaction, yet curiosity, a central driver of inquiry, remains underexplored in these systems, particularly across cultural contexts. In this work, we investigate cultural variation in curiosity using Yahoo! Answers, a real-world multi-country dataset spanning diverse topics. We introduce CUEST (CUriosity Evaluation across SocieTies), an evaluation framework that measures human-model alignment in curiosity through linguistic (style) and topic-preference (content) analyses, grounding the resulting insights in social science constructs. Across open- and closed-source models, we find that LLMs flatten cross-cultural diversity, aligning more closely with how curiosity is expressed in Western countries. We then explore fine-tuning strategies to induce curiosity in LLMs, narrowing the human-model alignment gap by up to 50%. Finally, we demonstrate the practical value of curiosity for LLM adaptability across cultures, showing its importance for future NLP research.
academic
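This paper studies how curiosity is expressed across cultures in Large Language Models (LLMs). Using the multi-country Yahoo! Answers dataset, the authors propose CUEST (CUriosity Evaluation across SocieTies), an evaluation framework that measures human-model alignment in the expression of curiosity through linguistic style, topic preference, and social science constructs. They find that LLMs flatten cross-cultural differences and lean toward the way curiosity is expressed in Western countries. Using fine-tuning strategies, the authors narrow the human-model alignment gap by up to 50% and demonstrate the practical value of curiosity for the cross-cultural adaptability of LLMs.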