BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
Pai, Wang, Lu et al.
Multi-LLM systems enhance the creativity of large language models by simulating human collective intelligence but suffer from significant drawbacks, such as high computational costs and inference latency. To address these limitations, we propose BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the benefits of multi-LLM collaboration, i.e. inducing diverse perspectives and specialized expertise, within a single model. BILLY operates by extracting and blending multiple distinct persona vectors directly in the model's activation space. We steer the model's generation process with this merged vector while inference, enabling multi-perspective output without explicit multi-LLM communication. Our experiments across creativity-oriented benchmarks demonstrate that BILLY surpasses single model prompting and traditional multi-LLM approaches, while substantially reducing inference time and computational costs. Our analyses further reveal that distinct persona vectors can be blended to achieve both effective control over complementary aspects of generation and greater interpretability.
academic
BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
Multi-LLM systems enhance the creativity of large language models by simulating collective human intelligence, but suffer from significant drawbacks including high computational costs and substantial inference latency. To address these limitations, this paper proposes BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the advantages of multi-LLM collaboration within a single model by introducing diverse perspectives and expertise. BILLY operates by extracting and merging multiple distinct persona vectors in the model's activation space, using the merged vector at inference time to guide the model's generation process, thereby achieving multi-perspective outputs without explicit multi-LLM communication.
Computational Burden of Multi-LLM Systems: While traditional multi-LLM collaborative frameworks enhance creativity, they suffer from high computational costs and inference latency issues
Limitations of Single-Prompt Methods: Relying solely on prompts to enable models to simultaneously adopt multiple roles and perspectives faces integration difficulties
Trade-off Between Efficiency and Creativity: How to significantly reduce computational overhead while maintaining creative advantages
Creativity is widely recognized as a cornerstone of human progress, driving innovation across various domains. Recent research has explored LLM creativity, positioning it as a promising tool for applications such as story writing, design ideation, and scientific discovery. The multi-LLM paradigm simulates collective human intelligence by enabling multiple LLMs to engage in iterative discussions to obtain more comprehensive solutions.
Multi-LLM Frameworks: High computational costs, long inference times, and process loss
Single-Prompt Methods: Rely on the model's ability to integrate multiple roles, but in practice models may handle each persona separately and struggle to integrate them coherently
Proposes BILLY Framework: A training-free persona vector merging framework that enables multi-perspective creative responses within a single LLM
Efficiency and Simplicity: Completely training-free, requiring no additional fine-tuning or multi-LLM communication, significantly reducing computational and token costs
Interpretable Mechanism: Operates directly in latent activation space, providing an interpretable creativity control mechanism
Performance Improvement: Surpasses single-model prompting and traditional multi-LLM methods on creativity-oriented benchmarks
Given a creative task input, BILLY aims to generate creative outputs that merge multiple professional perspectives (such as environmentalist, creative professional, futurist) while avoiding the computational overhead of multi-LLM systems.
Merged Vector: Achieves true concept fusion, retaining the substantive concepts of the environmentalist while reframing them with the visual style of the creative professional
Through analyzing projections of activation changes onto persona vectors:
Limitations of Prompting Methods: Multi-role prompting shows positive projection on vCRE but negative on vENV, indicating inability to consistently induce all intended personas
BILLY Advantages: Maintains positive projections on both vCRE and vENV, successfully co-activating multiple personas
Vector Fusion Effectiveness: Single vectors precisely control corresponding semantic concepts, while merged vectors achieve complex interactions
Classical creativity psychology research (Torrance, 1966; Wallach & Kogan, 1965)
Recent advances in multi-agent systems (Wu et al., 2023; Lu et al., 2024)
Development of activation steering techniques (Turner et al., 2024; Chen et al., 2025)
Overall Assessment: This is a high-quality paper with significant contributions to the field of LLM creativity enhancement. The BILLY method ingeniously addresses the efficiency problems of multi-LLM systems while maintaining creative advantages. Although there is room for improvement in theoretical analysis and evaluation methods, its practical value and technical innovation make it an important advance in the field.