BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
Pai, Wang, Lu et al.
Multi-LLM systems enhance the creativity of large language models by simulating human collective intelligence but suffer from significant drawbacks, such as high computational costs and inference latency. To address these limitations, we propose BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the benefits of multi-LLM collaboration, i.e. inducing diverse perspectives and specialized expertise, within a single model. BILLY operates by extracting and blending multiple distinct persona vectors directly in the model's activation space. We steer the model's generation process with this merged vector while inference, enabling multi-perspective output without explicit multi-LLM communication. Our experiments across creativity-oriented benchmarks demonstrate that BILLY surpasses single model prompting and traditional multi-LLM approaches, while substantially reducing inference time and computational costs. Our analyses further reveal that distinct persona vectors can be blended to achieve both effective control over complementary aspects of generation and greater interpretability.
academic
BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
多LLM系统通过模拟人类集体智慧增强大语言模型的创造力,但存在计算成本高和推理延迟大等显著缺陷。为解决这些限制,本文提出BILLY(BlendIng persona vectors for Large Language model creativitY),这是一个无需训练的框架,能在单一模型内捕获多LLM协作的优势,即引入多样化视角和专业知识。BILLY通过在模型激活空间中提取和融合多个不同的人格向量来操作,在推理时使用这个合并向量引导模型的生成过程,实现多视角输出而无需显式的多LLM通信。