Leading the Follower: Learning Persuasive Agents in Social Deduction Games
Zheng, Ye, Zhao et al.
Large language model (LLM) agents have shown remarkable progress in social deduction games (SDGs). However, existing approaches primarily focus on information processing and strategy selection, overlooking the significance of persuasive communication in influencing other players' beliefs and responses. In SDGs, success depends not only on making correct deductions but also on convincing others to respond in alignment with one's intent. To address this limitation, we formalize turn-based dialogue in SDGs as a Stackelberg competition, where the current player acts as the leader who strategically influences the follower's response. Building on this theoretical foundation, we propose a reinforcement learning framework that trains agents to optimize utterances for persuasive impact. Through comprehensive experiments across three diverse SDGs, we demonstrate that our agents significantly outperform baselines. This work represents a significant step toward developing AI agents capable of strategic social influence, with implications extending to scenarios requiring persuasive communication.
Large language model (LLM) agents have demonstrated significant progress in social deduction games (SDGs). However, existing methods primarily focus on information processing and strategy selection, overlooking the importance of persuasive communication in influencing other players' beliefs and responses. In SDGs, success depends not only on correct reasoning but also on persuading others to act according to one's intentions. To address this limitation, the authors formalize turn-based dialogue in SDGs as a Stackelberg competition, where the current player acts as a leader who strategically influences the follower's response. Based on this theoretical foundation, the authors propose a reinforcement learning framework that trains agents to optimize the persuasive impact of their utterances. Comprehensive experiments on three different SDGs demonstrate that the proposed method significantly outperforms baseline approaches.
Existing LLM agents in social deduction games face the following issues:
Neglecting Persuasive Communication: Existing methods primarily focus on information processing and strategy selection, lacking consideration of persuasiveness
Lack of Influence Modeling: No systematic modeling of how to influence other players' behavior through language
Insufficient Local Optimization: Lack of strategic optimization for each utterance in turn-based dialogue
The paper makes the following contributions:
Theoretical Innovation: Formalizes turn-based dialogue in SDGs as a Stackelberg competition model, providing a systematic theoretical foundation for persuasive communication
Methodological Framework: Proposes a reinforcement learning framework that directly optimizes the influence of utterances on subsequent players' responses
Experimental Validation: Validates the effectiveness and generalizability of the method on three different SDGs: Werewolf, Avalon, and One Night Ultimate Werewolf (ONUW)
Technical Contribution: Develops a complete training pipeline combining the advantages of API-based LLMs and open-source LLMs
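The reinforcement learning contribution listed above can be illustrated with a small sketch of a group-relative advantage, the normalization used by GRPO (Shao et al., 2024), which the paper cites. That the authors use exactly this form is an assumption here, as are the function name and the example rewards; the rewards stand in for a hypothetical score of how favorable the follower's response was to each sampled utterance.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def group_relative_advantages(rewards: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Normalize rewards within one group of sampled utterances.

    Each utterance is scored against the other samples drawn for the same
    game state, so no separate value network is needed. The population
    standard deviation is an illustrative choice.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps keeps the division finite when every sample earns the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four candidate utterances sampled for one turn:
# +1 if the follower responded as the leader intended, -1 if it backfired.
example_rewards = [1.0, 0.0, -1.0, 0.0]
example_advantages = group_relative_advantages(example_rewards)
```

Utterances that drew a better-than-average response receive a positive advantage and are reinforced; those that backfired receive a negative one.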
In social deduction games, players must influence other players' behavior through turn-based dialogue to achieve their respective victory conditions. This paper models each round of dialogue as a Stackelberg competition:
Input: Game rules R, current game state G_t, dialogue history D_t, player role r_t
Output: Optimized persuasive utterance u_t
Objective: Maximize favorable influence on the next player's response
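The leader-follower structure above can be sketched as a toy Stackelberg game with finite action sets. This is a minimal illustration, not the authors' implementation: the utterance labels, response labels, and payoff tables are all hypothetical, and the follower is assumed to best-respond deterministically, whereas in the paper the follower is another player whose response is produced through dialogue.

```python
from typing import Dict, Sequence, Tuple


def follower_best_response(
    utterance: str,
    responses: Sequence[str],
    follower_payoff: Dict[Tuple[str, str], float],
) -> str:
    """The follower observes the leader's utterance, then picks its best reply."""
    return max(responses, key=lambda resp: follower_payoff[(utterance, resp)])


def leader_best_utterance(
    candidates: Sequence[str],
    responses: Sequence[str],
    leader_payoff: Dict[str, float],
    follower_payoff: Dict[Tuple[str, str], float],
) -> Tuple[str, str, float]:
    """The leader anticipates the follower's reply to each candidate utterance
    and commits to the one whose induced reply it values most."""
    best = None
    for u in candidates:
        resp = follower_best_response(u, responses, follower_payoff)
        value = leader_payoff[resp]
        if best is None or value > best[2]:
            best = (u, resp, value)
    return best


# Hypothetical Werewolf-style turn: the leader wants the next player to vote for Bob.
candidates = ["accuse_bob", "defend_self", "stay_vague"]
responses = ["vote_bob", "vote_leader", "abstain"]
leader_payoff = {"vote_bob": 1.0, "abstain": 0.0, "vote_leader": -1.0}
follower_payoff = {
    ("accuse_bob", "vote_bob"): 0.6,
    ("accuse_bob", "vote_leader"): 0.3,
    ("accuse_bob", "abstain"): 0.1,
    ("defend_self", "vote_bob"): 0.2,
    ("defend_self", "vote_leader"): 0.5,
    ("defend_self", "abstain"): 0.3,
    ("stay_vague", "vote_bob"): 0.3,
    ("stay_vague", "vote_leader"): 0.3,
    ("stay_vague", "abstain"): 0.4,
}

choice = leader_best_utterance(candidates, responses, leader_payoff, follower_payoff)
```

With these made-up payoffs the leader commits to `accuse_bob`, because that is the only utterance whose anticipated reply is a vote against Bob. The point of the sketch is the order of play: the utterance is chosen by reasoning about the response it induces, not by its content alone.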
Evaluated on GPT-5 and Qwen3-14B without additional training, the method yields consistent performance improvements, which demonstrates its cross-model generalization capability.
This paper cites important works from multiple domains including social deduction games, reinforcement learning, and game theory, particularly:
Xu et al. (2024): SLA method
Light et al. (2025): Strategist method
Shao et al. (2024): GRPO algorithm
Bakhtin et al. (2022): Cicero system
Overall Assessment: This is a high-quality paper with significant contributions to the field of AI social intelligence. Through innovative theoretical modeling and effective technical implementation, it provides new research directions and practical methods for developing persuasive AI agents.