BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
Pai, Wang, Lu et al.
Multi-LLM systems enhance the creativity of large language models by simulating human collective intelligence but suffer from significant drawbacks, such as high computational costs and inference latency. To address these limitations, we propose BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the benefits of multi-LLM collaboration, i.e. inducing diverse perspectives and specialized expertise, within a single model. BILLY operates by extracting and blending multiple distinct persona vectors directly in the model's activation space. We steer the model's generation process with this merged vector while inference, enabling multi-perspective output without explicit multi-LLM communication. Our experiments across creativity-oriented benchmarks demonstrate that BILLY surpasses single model prompting and traditional multi-LLM approaches, while substantially reducing inference time and computational costs. Our analyses further reveal that distinct persona vectors can be blended to achieve both effective control over complementary aspects of generation and greater interpretability.
複数LLMシステムは人間の集団知性をシミュレートすることで大規模言語モデルの創造性を向上させますが、計算コストが高く推論遅延が大きいという顕著な欠陥があります。これらの制限を解決するため、本論文ではBILLY(BlendIng persona vectors for Large Language model creativitY)を提案します。これは訓練不要なフレームワークであり、単一モデル内で複数LLM協働の利点、すなわち多様な視点と専門知識の導入を捉えることができます。BILLYはモデルの活性化空間から複数の異なるペルソナベクトルを抽出・融合することで動作し、推論時にこの融合ベクトルを使用してモデルの生成プロセスを操作し、明示的な複数LLM通信なしで多視点出力を実現します。