2025-11-24T04:07:17.991345

Personalized Federated Fine-Tuning of Vision Foundation Models for Healthcare

Tupper, GagnÃ©

Foundation models open up new possibilities for the use of AI in healthcare. However, even when pre-trained on health data, they still need to be fine-tuned for specific downstream tasks. Furthermore, although foundation models reduce the amount of training data required to achieve good performance, obtaining sufficient data is still a challenge. This is due, in part, to restrictions on sharing and aggregating data from different sources to protect patients' privacy. One possible solution to this is to fine-tune foundation models via federated learning across multiple participating clients (i.e., hospitals, clinics, etc.). In this work, we propose a new personalized federated fine-tuning method that learns orthogonal LoRA adapters to disentangle general and client-specific knowledge, enabling each client to fully exploit both their own data and the data of others. Our preliminary results on real-world federated medical imaging tasks demonstrate that our approach is competitive against current federated fine-tuning methods.

academic

Personalized Federated Fine-Tuning of Vision Foundation Models for Healthcare

Basic Information

Paper ID: 2510.12741
Title: Personalized Federated Fine-Tuning of Vision Foundation Models for Healthcare
Authors: Adam Tupper, Christian Gagné (Université Laval, Mila - Quebec AI Institute)
Classification: cs.CV, cs.DC
Conference: SMASH 2025
Paper Link: https://arxiv.org/abs/2510.12741

Abstract

Foundation models open new possibilities for medical AI applications, but require fine-tuning for specific downstream tasks even when pre-trained on health data. Due to patient privacy protection constraints, sharing and aggregating data from different sources remains a challenge. This paper proposes a novel personalized federated fine-tuning method, FedOPAL, which decouples general knowledge and client-specific knowledge by learning orthogonal LoRA adapters, enabling each client to fully leverage both its own data and data from other clients. Preliminary results on real-world federated medical imaging tasks demonstrate that the method is competitive with current federated fine-tuning approaches.

Research Background and Motivation

Core Problems

Data Privacy Constraints: Medical data cannot be centrally aggregated for training due to strict privacy regulations
Data Scarcity: Annotated datasets in medical imaging are scarce and expensive to obtain
Data Heterogeneity: Significant data distribution differences exist across different medical institutions
Communication Overhead: Federated learning of large foundation models faces substantial communication costs

Research Motivation

While federated learning addresses data sharing constraints, it introduces new challenges in foundation model fine-tuning
Existing methods fail to effectively handle data heterogeneity across clients
There is a need for a federated fine-tuning approach that leverages global knowledge while maintaining personalization

Core Contributions

Proposes FedOPAL Method: A novel personalized federated fine-tuning framework using orthogonal LoRA adapters to separate general and client-specific knowledge
Dual Orthogonality Constraint Mechanism: Designs two approaches—weight orthogonalization (FedOPAL-W) and representation orthogonalization (FedOPAL-R)
Real Medical Data Validation: Validates the method's effectiveness on two challenging medical imaging datasets: Fed-ISIC 2019 and Camelyon17-WILDS
Systematic Comparative Analysis: Comprehensive comparison with multiple state-of-the-art LoRA-based federated learning methods

Methodology Details

Task Definition

In federated medical imaging classification tasks, multiple medical institutions (clients) collaborate to train a shared vision foundation model while protecting their respective data privacy and handling data distribution heterogeneity.

Model Architecture

Dual Adapter Design

FedOPAL employs a dual LoRA adapter architecture:

Global Adapter: Collaboratively updated by all clients, learning universal features
Personal Adapter: Private to each client, learning client-specific features

For pre-trained weight matrix $W_0 \in \mathbb{R}^{d \times k}$ , the update formula is: $\Delta W = W_0 + B_{global}A_{global} + B_{personal}A_{personal}$

Orthogonality Constraint Mechanism

1. Weight Orthogonalization (FedOPAL-W) Based on the O-LoRA method, orthogonality constraints are imposed on adapter weights: $\ell_{orth} = \frac{1}{N} \sum_{i=1}^{N} |(A_i_{global})^T A_i_{personal}|$

2. Representation Orthogonalization (FedOPAL-R) Directly constrains learned representations by minimizing cosine similarity: $\ell_{orth} = \frac{1}{NB} \sum_{i=1}^{N} \sum_{j=1}^{B} |\cos(z_{global}^{i,j}, z_{personal}^{i,j})|$

Total loss function: $\ell_{total} = \ell_{task} + \lambda \ell_{orth}$

Technical Innovations

Knowledge Decoupling Mechanism: Orthogonality constraints force the two adapters to learn different feature subspaces
Dual-Level Orthogonalization: Provides orthogonalization strategies at both weight and representation levels
Federated Personalization: Achieves client personalization while maintaining global collaboration

Experimental Setup

Datasets

1. Fed-ISIC 2019

Scale: 6 clients from medical institutions in Australia, Austria, USA, and Spain
Task: Skin lesion classification (9 diagnostic categories)
Characteristics: Significant label shift and uneven sample sizes

2. Camelyon17-WILDS

Scale: 5 clients (hospitals), each with 7-10 patients
Task: Patch-level classification of breast cancer metastasis in lymph node slides
Characteristics: Color variations due to differences in staining procedures

Evaluation Metrics

Balanced accuracy is used to address class imbalance issues

Baseline Methods

FedIT: Trains A and B matrices of a single global adapter
FFA-LoRA: Fixes A matrix, trains only B matrix
FedSA: Shares only A matrix, personalizes B matrix
FedDPA: Trains independent global and personal adapters
FedPAL: Baseline method without orthogonality constraints

Implementation Details

Model: ViT-Tiny (9.7M parameters), pre-trained on ImageNet-21K
Optimizer: SGD, learning rate η ∈ {1e-4, 5e-4, 1e-3, 5e-3, 1e-2}
Federated Setting: FedAvg, 100% client participation, 1 epoch local training per round

Experimental Results

Main Results

Fed-ISIC 2019 Task Results:

FedOPAL-R achieves the best average ranking (3.17) with average accuracy of 0.726
FedPAL baseline performs well (average ranking 3.50, accuracy 0.726)
On client 2, FedPAL achieves the highest accuracy of 0.931

Camelyon17-WILDS Task Results:

FedDPA performs best (average ranking 2.40, accuracy 0.909)
FedOPAL-R achieves the second-best average accuracy of 0.870
All federated methods approach centralized training performance (0.905)

Key Findings

Method Competitiveness: FedOPAL methods are competitive with existing state-of-the-art approaches
Consistency Issues: No single method consistently outperforms across all clients and tasks
Orthogonalization Effects: Representation orthogonalization (FedOPAL-R) generally outperforms weight orthogonalization (FedOPAL-W)
Room for Improvement: Results indicate further improvement is possible

Federated Learning and LoRA Integration

Parameter Efficiency: LoRA reduces communication overhead and computational requirements
Personalization Methods: Partial model personalization alleviates data heterogeneity issues
Recent Advances: Methods like FedDPA and FedSA explore different personalization strategies

Orthogonal Learning Methods

O-LoRA: Uses orthogonality constraints in continual learning to prevent catastrophic forgetting
Subspace Learning: Constrains learning of different feature subspaces

Conclusions and Discussion

Main Conclusions

FedOPAL successfully decouples general and specific knowledge through orthogonal LoRA adapters
The method demonstrates competitiveness with existing approaches on real medical datasets
Representation-level orthogonalization is more effective than weight-level orthogonalization

Limitations

Performance Consistency: Method performance varies across different clients and tasks
Preliminary Results: Current results are preliminary in nature, requiring deeper analysis
Hyperparameter Sensitivity: Selection of λ parameter may affect method performance
Computational Overhead: Dual adapter design increases computational complexity

Future Directions

Adapter Rank Optimization: Explore optimal rank settings for global and local adapters
Knowledge Separation Improvement: Refine separation of client-agnostic and client-specific knowledge
Extended Evaluation: Evaluate on larger models and more tasks
Theoretical Analysis: Provide theoretical guarantees and convergence analysis

In-Depth Evaluation

Strengths

Strong Innovation: First to introduce orthogonality constraints in federated LoRA fine-tuning with novel approach
High Practical Value: Directly addresses real needs and challenges in medical AI
Complete Methodology: Provides two orthogonalization strategies at different levels
Comprehensive Experiments: Systematic evaluation on real medical datasets

Weaknesses

Weak Theoretical Foundation: Lacks theoretical analysis of orthogonality constraint effectiveness
Insufficient Experimental Depth: Missing detailed ablation studies and parameter sensitivity analysis
Limited Performance Improvement: Marginal improvements compared to existing methods
Limited Evaluation Scope: Evaluation on only two datasets

Impact

Academic Contribution: Provides new insights for the intersection of federated learning and medical AI
Practical Prospects: Promising for application in real medical federated learning scenarios
Scalability: Method framework is extensible to other domains and tasks

Applicable Scenarios

Multi-Institution Medical Collaboration: Suitable for federated learning across hospitals
Privacy-Sensitive Tasks: Appropriate for applications requiring strict data privacy protection
Data Heterogeneity Environments: Particularly suitable for scenarios with large data distribution differences across clients

References

This paper primarily references important works in federated learning (McMahan et al., 2017), LoRA adapters (Hu et al., 2022), and orthogonal learning (Wang et al., 2023), providing a novel solution to federated fine-tuning problems in medical AI.

Summary: This is an innovative work addressing federated learning challenges in medical AI, achieving knowledge decoupling through orthogonal LoRA adapters to improve personalization while protecting privacy. While current results are preliminary, the work provides valuable new insights and methodological frameworks for the field.