2025-11-22T07:58:16.571206

BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation

Pai, Wang, Lu et al.

Multi-LLM systems enhance the creativity of large language models by simulating human collective intelligence but suffer from significant drawbacks, such as high computational costs and inference latency. To address these limitations, we propose BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the benefits of multi-LLM collaboration, i.e. inducing diverse perspectives and specialized expertise, within a single model. BILLY operates by extracting and blending multiple distinct persona vectors directly in the model's activation space. We steer the model's generation process with this merged vector while inference, enabling multi-perspective output without explicit multi-LLM communication. Our experiments across creativity-oriented benchmarks demonstrate that BILLY surpasses single model prompting and traditional multi-LLM approaches, while substantially reducing inference time and computational costs. Our analyses further reveal that distinct persona vectors can be blended to achieve both effective control over complementary aspects of generation and greater interpretability.

academic

BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation

Basic Information

Paper ID: 2510.10157
Title: BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
Authors: Tsung-Min Pai, Jui-I Wang, Li-Chun Lu, Shao-Hua Sun, Hung-Yi Lee, Kai-Wei Chang
Classification: cs.CL cs.AI
Publication Date: October 11, 2025 (arXiv preprint)
Paper Link: https://arxiv.org/abs/2510.10157v1

Abstract

Multi-LLM systems enhance the creativity of large language models by simulating collective human intelligence, but suffer from significant drawbacks including high computational costs and substantial inference latency. To address these limitations, this paper proposes BILLY (BlendIng persona vectors for Large Language model creativitY), a training-free framework that captures the advantages of multi-LLM collaboration within a single model by introducing diverse perspectives and expertise. BILLY operates by extracting and merging multiple distinct persona vectors in the model's activation space, using the merged vector at inference time to guide the model's generation process, thereby achieving multi-perspective outputs without explicit multi-LLM communication.

Research Background and Motivation

Core Problems

Computational Burden of Multi-LLM Systems: While traditional multi-LLM collaborative frameworks enhance creativity, they suffer from high computational costs and inference latency issues
Limitations of Single-Prompt Methods: Relying solely on prompts to enable models to simultaneously adopt multiple roles and perspectives faces integration difficulties
Trade-off Between Efficiency and Creativity: How to significantly reduce computational overhead while maintaining creative advantages

Research Significance

Creativity is widely recognized as a cornerstone of human progress, driving innovation across various domains. Recent research has explored LLM creativity, positioning it as a promising tool for applications such as story writing, design ideation, and scientific discovery. The multi-LLM paradigm simulates collective human intelligence by enabling multiple LLMs to engage in iterative discussions to obtain more comprehensive solutions.

Limitations of Existing Methods

Multi-LLM Frameworks: High computational costs, long inference times, and process loss
Single-Prompt Methods: Rely on the model's ability to integrate multiple roles, but in practice models may handle each persona separately and struggle to integrate them coherently

Core Contributions

Proposes BILLY Framework: A training-free persona vector merging framework that enables multi-perspective creative responses within a single LLM
Efficiency and Simplicity: Completely training-free, requiring no additional fine-tuning or multi-LLM communication, significantly reducing computational and token costs
Interpretable Mechanism: Operates directly in latent activation space, providing an interpretable creativity control mechanism
Performance Improvement: Surpasses single-model prompting and traditional multi-LLM methods on creativity-oriented benchmarks

Methodology Details

Task Definition

Given a creative task input, BILLY aims to generate creative outputs that merge multiple professional perspectives (such as environmentalist, creative professional, futurist) while avoiding the computational overhead of multi-LLM systems.

Model Architecture

1. Persona Vector Extraction (Contrastive Activation)

For persona P, the persona vector is extracted through a contrastive method:

v⃗(l)_P = (1/|D+_P|) ∑(x∈D+_P) a⃗(l)(x) - (1/|D-_P|) ∑(x∈D-_P) a⃗(l)(x)

Where:

D+_P: Collection of responses exhibiting the persona characteristics
D-_P: Collection of baseline responses lacking the characteristics
a⃗(l)(x): Token-averaged residual stream activation at layer l

2. Offline Fusion

Merging N persona vectors into a single composite vector:

v⃗(l)_merged = (1/N) ∑(i=1 to N) v⃗(l)_i

3. Inference-time Steering

Modifying activations during inference:

a⃗(l)_steered = a⃗(l)_original + α · v⃗(l)_merged

Where α is a scaling coefficient, and layer 20 is selected for steering in experiments.

Technical Innovations

Activation Space Manipulation: Operating directly at the model's internal representation level rather than relying on external prompt engineering
Vector Fusion Strategy: Achieving complex persona combinations through simple averaging of multiple persona vectors
Training-Free Design: Adding vectors only at inference time without modifying model weights
Reusability: Persona vectors extracted once can be used multiple times, amortizing costs

Experimental Setup

Datasets

Extended benchmarks based on the Wallach-Kogan creativity test, enhanced by Lu et al. using GPT-4:

AUT: Requires providing practical innovative uses for everyday objects
INSTANCES: Enumerating objects according to specific criteria
SIMILARITIES: Identifying connections or similarities between objects
SCIENTIFIC: Generating creative solutions for scientific problems

Evaluation Metrics

Adopting two core metrics from the Torrance Tests of Creative Thinking (TTCT):

Originality: Assesses the statistical rarity or unconventionality of responses
Elaboration: Measures the level of detail and supporting information in responses

Using GPT-4o-mini as the LLM judge with a 1-5 scoring scale.

Comparison Methods

Single Agent (SA): Single LLM creative response (temperature 0.7)
SA (T=1.0): Single agent with high-temperature decoding (temperature 1.0)
SA-MRP: Single agent with multi-role prompting
LLM Discussion: Multi-LLM framework with three-stage structured discussion process

Implementation Details

Base Models: Qwen-2.5-7B-Instruct, Llama-3.1-8B-Instruct, Gemma-3-4B-it
Steering Layer: Layer 20
Persona Vectors: Environmentalist, creative professional, futurist, analytical thinker, etc.
Evaluation: 100 queries per benchmark, computing mean and standard deviation

Experimental Results

Main Results

On four creativity benchmarks, BILLY consistently surpasses all baseline methods on the originality metric:

AUT Task Originality Scores:

Qwen-2.5-7B: BILLY (4.71) > LLM Discussion (4.24) > SA-MRP (4.07) > SA (4.03)
Llama-3.1-8B: BILLY (4.38) > LLM Discussion (4.21) > SA-MRP (4.05) > SA (3.73)
Gemma-3-4B: BILLY (4.99) > SA-MRP (4.96) > SA (4.79) > LLM Discussion (3.70)

Efficiency Comparison:

Inference Latency: BILLY (19s) vs LLM Discussion (513s), approximately 25x faster
Token Cost: BILLY reduces token usage by over 95%
Cost per 10,000 queries: BILLY ( $0.30) vs LLM Discussion ($ 25.50)

Ablation Studies

Persona Vector Combination Analysis:

A single creative professional vector already demonstrates excellent creative performance
Increasing from 1 to 3 vectors shows significant improvement
Further increases from 4-7 vectors yield no significant additional benefits

Case Analysis

Using the "redesign a hospital" task as an example:

Environmentalist Vector: Generates practical blueprints (e.g., community gardens)
Creative Professional Vector: Produces artistically rich terminology (e.g., bioluminescent ecological clusters)
Merged Vector: Achieves true concept fusion, retaining the substantive concepts of the environmentalist while reframing them with the visual style of the creative professional

Activation Projection Analysis

Through analyzing projections of activation changes onto persona vectors:

Limitations of Prompting Methods: Multi-role prompting shows positive projection on vCRE but negative on vENV, indicating inability to consistently induce all intended personas
BILLY Advantages: Maintains positive projections on both vCRE and vENV, successfully co-activating multiple personas
Vector Fusion Effectiveness: Single vectors precisely control corresponding semantic concepts, while merged vectors achieve complex interactions

LLM Creativity Enhancement

Role-Playing Prompting: Guiding models to adopt specific creative personas
Multi-Agent Collaborative Frameworks: Multiple agents simulating brainstorming sessions
Comprehensive Reviews: Thorough surveys of creativity research directions

Multi-LLM Collaboration

Structured Dialogue Frameworks: Such as next-generation applications of AutoGen
Role-Playing Dialogue: LLM Discussion and HOLLMWOOD
Limitations: High computational costs, instability, and inefficiency

Activation Steering Techniques

Behavioral Control: Influencing model behavior through modifying internal activations
Contrastive Methods: Computing steering vectors using activation differences
Application Domains: Authenticity enhancement, safety improvement, and personality control

Conclusions and Discussion

Main Conclusions

BILLY successfully captures the creative advantages of multi-LLM collaboration within a single model
Significantly reduces inference time and computational costs while maintaining or exceeding creative performance
Activation steering provides better controllability and interpretability compared to multi-role prompting

Limitations

Vector Combination Method: Currently uses simple averaging; more complex combination techniques are needed in the future
Non-linear Interactions: The fusion effects of persona vectors are not simple linear superposition
Task-Specific Weighting: Lacks mechanisms for learning task-specific weights for each persona vector

Future Directions

Develop complex vector combination frameworks beyond simple averaging
Learn task-specific weights for persona vectors
Design mechanisms that explicitly model functional roles for more precise control

In-Depth Evaluation

Strengths

Methodological Innovation: First application of persona vector merging to creativity enhancement with a novel technical approach
Comprehensive Experiments: Thorough evaluation across multiple models and benchmarks, including efficiency, quality, and interpretability analysis
Practical Value: Significant efficiency improvements make the method highly practical
Theoretical Insights: Activation projection analysis reveals the fundamental differences between prompting methods and activation steering

Weaknesses

Vector Fusion Theory: Lacks in-depth theoretical analysis of why simple averaging is effective
Evaluation Limitations: Primarily relies on LLM judges, which may introduce bias
Generalization Capability: Validated only on creativity tasks; applicability to other tasks remains unknown
Persona Vector Quality: Stability and consistency of the persona vector extraction process require further verification

Impact

Domain Contribution: Provides a new technical pathway for LLM creativity enhancement
Practical Value: Substantially reduces computational costs, promising to facilitate practical deployment of multi-persona AI systems
Reproducibility: The method is relatively simple and easy to reproduce and extend

Applicable Scenarios

Creative Writing: Story creation requiring fusion of multiple perspectives
Design Ideation: Design solutions combining aesthetics, functionality, and sustainability
Scientific Innovation: Research problem-solving requiring interdisciplinary thinking
Educational Applications: Cultivating students' multi-perspective thinking abilities

References

The paper cites abundant related work, including:

Classical creativity psychology research (Torrance, 1966; Wallach & Kogan, 1965)
Recent advances in multi-agent systems (Wu et al., 2023; Lu et al., 2024)
Development of activation steering techniques (Turner et al., 2024; Chen et al., 2025)

Overall Assessment: This is a high-quality paper with significant contributions to the field of LLM creativity enhancement. The BILLY method ingeniously addresses the efficiency problems of multi-LLM systems while maintaining creative advantages. Although there is room for improvement in theoretical analysis and evaluation methods, its practical value and technical innovation make it an important advance in the field.