2025-11-13T11:58:11.146801

RedDino: A foundation model for red blood cell analysis

Zedda, Loddo, Di Ruberto et al.
Red blood cells (RBCs) are essential to human health, and their precise morphological analysis is important for diagnosing hematological disorders. Despite the promise of foundation models in medical diagnostics, comprehensive AI solutions for RBC analysis remain scarce. We present RedDino, a self-supervised foundation model designed for RBC image analysis. RedDino uses an RBC-specific adaptation of the DINOv2 self-supervised learning framework and is trained on a curated dataset of 1.25 million RBC images from diverse acquisition modalities and sources. Extensive evaluations show that RedDino outperforms existing state-of-the-art models on RBC shape classification. Through assessments including linear probing and nearest neighbor classification, we confirm its strong feature representations and generalization ability. Our main contributions are: (1) a foundation model tailored for RBC analysis, (2) ablation studies exploring DINOv2 configurations for RBC modeling, and (3) a detailed evaluation of generalization performance. RedDino addresses key challenges in computational hematology by capturing nuanced morphological features, advancing the development of reliable diagnostic tools. The source code and pretrained models for RedDino are available at https://github.com/Snarci/RedDino, and the pretrained models can be downloaded from our Hugging Face collection at https://huggingface.co/collections/Snarcy/reddino-689a13e29241d2e5690202fc
academic

RedDino: A foundation model for red blood cell analysis

Basic Information

  • Paper ID: 2508.08180
  • Title: RedDino: A foundation model for red blood cell analysis
  • Authors: Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Carsten Marr
  • Categories: eess.IV cs.AI cs.CV
  • Publication Date: August 22, 2025 (arXiv v2)
  • Paper Link: https://arxiv.org/abs/2508.08180

Abstract

Red blood cells (RBCs) are crucial to human health, and precise morphological analysis is essential for diagnosing hematological diseases. Although foundation models have demonstrated significant potential in medical diagnostics, comprehensive AI solutions specifically for RBC analysis remain scarce. This paper proposes RedDino, a self-supervised foundation model specifically designed for RBC image analysis. RedDino employs a DINOv2 self-supervised learning framework tailored for RBCs, trained on a carefully curated dataset containing 1.25 million RBC images from different acquisition modalities and sources. Extensive evaluation demonstrates that RedDino significantly outperforms existing state-of-the-art models on RBC shape classification tasks. The model's strong feature representation and generalization capabilities are validated through linear probing and nearest neighbor classification evaluation methods.

Research Background and Motivation

Problem Definition

Red blood cell morphological analysis is fundamental to hematological diagnostics but faces several key challenges:

  1. Staining and imaging variability: Different staining protocols and imaging devices introduce bias, increasing analysis complexity
  2. Batch effects: Significant systematic differences exist in multi-source, multi-patient scenarios
  3. Professional training requirements: Traditional analysis requires extensive professional training
  4. Lack of specialized AI tools: Compared to white blood cell analysis, RBC analysis lacks mature foundation models

Research Motivation

While foundation models have demonstrated significant advantages in white blood cell analysis, effectively predicting clinical outcomes and addressing batch effects, the RBC analysis field has not yet fully explored the potential of these advanced techniques. This research aims to fill this gap by developing a foundation model specifically tailored for RBC analysis.

Core Contributions

  1. Specialized foundation model: Proposes RedDino, the first self-supervised foundation model family optimized specifically for RBC analysis
  2. In-depth configuration study: Conducts rigorous comparative analysis of DINOv2 configurations for RBC morphological modeling
  3. Comprehensive performance evaluation: Performs extensive benchmarking on multiple RBC datasets, demonstrating superiority over existing state-of-the-art models
  4. Strong generalization capability: Effectively mitigates batch effects challenges, demonstrating excellent cross-domain generalization performance

Methodology Details

Task Definition

RedDino aims to learn universal RBC feature representations supporting downstream RBC shape classification, anomaly detection, and morphological analysis tasks. Input consists of RBC microscopy images, with output being high-dimensional feature vectors applicable to various RBC analysis tasks.

Model Architecture

Base Framework

RedDino is built upon the DINOv2 self-supervised learning framework, employing Vision Transformer (ViT) as the backbone network. The model family includes three versions:

  • RedDino Small: Feature dimension 384, batch size 512, 22 million parameters
  • RedDino Base: Feature dimension 768, batch size 384, 86 million parameters
  • RedDino Large: Feature dimension 1024, batch size 256, 304 million parameters

Key Technical Improvements

  1. Removal of Koleo regularizer: The original DINOv2 uses Koleo regularization to prevent feature collapse. However, in RBC scenarios, due to the natural consistency of RBC shape and color, this regularizer excessively suppresses feature expression of pathological and abnormal RBCs
  2. Sinkhorn-Knopp centering: Replaces moving average centering, improving representation quality
  3. Customized data augmentation: Replaces DINOv2's original augmentation strategy with 32 pixel-level augmentations from the Albumentations library

Data Processing Strategy

Training Data Construction

  • Data scale: 56,712 raw images from 18 datasets, covering over 420 individuals
  • Data extraction: Two methods employed:
    1. Cell segmentation using improved CellPose, producing 3,076,269 segmented cells
    2. Extraction of 224×224 pixel non-overlapping image patches, generating 1,250,781 image patches
  • Data balancing: White blood cell image datasets are incorporated to mitigate natural imbalance between red and white blood cells

Training Strategy Optimization

Systematic experiments reveal:

  1. Training with image patches outperforms single-cell training
  2. Removing local crops significantly improves performance
  3. Customized augmentation pipeline further enhances feature quality

Experimental Setup

Datasets

Training data: 18 public RBC datasets, including different imaging modalities, resolutions, and staining techniques Test data:

  • Elsafty dataset: 240,000 images, 9 classes, from 4 different sources
  • Chula dataset: 20,875 images, 12 RBC classes
  • DSE dataset: 5,659 images, 8 classes

Evaluation Metrics

  • Accuracy (Acc)
  • Balanced Accuracy (bAcc)
  • Weighted F1 Score (wF1)

Comparison Methods

  • ResNet50
  • DINOv2 (Small/Base/Large)
  • DinoBloom (Small/Base/Large) - current state-of-the-art feature extractor for hematological data

Evaluation Methods

  1. Linear probing: Evaluates feature adaptation capability for downstream tasks
  2. K-nearest neighbor classification (1-NN, 20-NN): Evaluates feature robustness under batch effects
  3. Cross-source evaluation: Uses leave-one-source-out validation strategy
  4. Five-fold cross-validation: For imbalanced datasets

Experimental Results

Main Results

Elsafty Dataset Cross-Source Evaluation

In the most challenging cross-source evaluation, RedDino achieves significant advantages:

ModelLinear Probe wF11-NN wF120-NN wF1
ResNet5077.6±8.164.3±4.866.2±4.9
DinoBloom-L85.4±5.274.1±5.077.0±4.5
DINOv2 large86.0±5.673.7±6.276.4±7.0
RedDino base88.1±4.978.8±3.682.6±2.8
RedDino large88.5±5.578.5±4.681.6±4.7

Key Findings:

  • RedDino achieves improvements exceeding 2.1% (linear probing) and 3.0% (nearest neighbor classification) over the best baseline methods
  • Average improvement margins of 4.0-6.5%, demonstrating consistent performance advantages

Performance on Other Datasets

On the Chula and DSE datasets with five-fold cross-validation, RedDino similarly demonstrates excellent performance, surpassing baseline methods on nearly all metrics.

Ablation Studies

Impact of key configuration improvements:

  1. Removal of Koleo regularizer: Significantly improves performance, preventing pathological RBC features from being excessively suppressed
  2. Sinkhorn-Knopp centering: Further performance improvement when replacing moving average centering
  3. Image patches vs. single-cell training: Image patch training strategy outperforms single-cell training
  4. Customized augmentation pipeline: Shows clear improvements compared to original DINOv2 augmentation strategy

Visualization Analysis

PCA Visualization

Three-component PCA visualization validates RedDino feature effectiveness:

  • Capable of distinguishing background, cells, membrane structures, and parasites
  • Demonstrates excellent discrimination ability for abnormal morphologies such as malaria-infected RBCs and acanthocytes

UMAP Visualization

UMAP projection using the Elsafty dataset shows:

  • Different classes form clear clusters with no apparent batch effects
  • Clinically difficult-to-distinguish classes (such as spherical RBCs, elliptocytes, etc.) indeed overlap in feature space
  • Cell aggregates form unique clusters, proving the model can distinguish single cells from aggregates

Current State of Hematological AI Analysis

  • White blood cell analysis: Mature foundation models such as DinoBloom exist, demonstrating excellent performance in clinical outcome prediction
  • Red blood cell analysis: Comparatively underdeveloped, lacking specialized foundation models
  • Computer-aided diagnosis: Gradually becoming an important tool for addressing critical diagnostic challenges in hematology

Application of Self-Supervised Learning in Medical Imaging

Self-supervised methods such as DINOv2 have achieved tremendous success on natural images, but their application in medical imaging, particularly RBC analysis, remains to be fully explored.

Conclusions and Discussion

Main Conclusions

  1. Performance breakthrough: RedDino achieves new state-of-the-art performance on RBC classification tasks
  2. Strong generalization capability: Effectively mitigates batch effects, demonstrating excellent performance in cross-source scenarios
  3. High practical value: Provides reliable foundational tools for automated hematological diagnostics

Limitations

  1. Training data constraints: Despite the large dataset scale, certain rare RBC morphologies may be underrepresented
  2. Computational resource requirements: Large model versions require substantial computational resources
  3. Annotated data dependency: Downstream tasks still require certain amounts of annotated data for fine-tuning

Future Directions

  1. Extended application scenarios: Explore applications in other hematological tasks
  2. Model compression: Develop lighter-weight versions for resource-constrained environments
  3. Multimodal fusion: Incorporate other types of medical data to improve diagnostic accuracy

In-Depth Evaluation

Strengths

  1. Strong problem specificity: Specifically addresses RBC analysis, an important yet overlooked field
  2. Reasonable methodology design: Makes targeted improvements to DINOv2 based on RBC characteristics
  3. Rigorous experimental design: Employs strict evaluation methods such as cross-source validation, ensuring result reliability
  4. Large dataset contribution: Constructs the largest RBC image training collection to date
  5. Open-source friendly: Provides complete code and pre-trained models

Weaknesses

  1. Limited theoretical analysis: Theoretical explanation for why Koleo regularizer removal is effective lacks depth
  2. Insufficient computational cost analysis: Lacks detailed analysis of computational efficiency trade-offs between different model versions
  3. Lack of clinical validation: Absence of validation results in real clinical environments

Impact

  1. Academic value: Provides important foundational tools and benchmarks for the RBC analysis field
  2. Practical value: Has potential to significantly enhance automation of hematological diagnostics
  3. Reproducibility: Provides complete open-source implementation, facilitating use and improvement by the research community

Applicable Scenarios

  • Blood pathology diagnostic assistance
  • Large-scale blood screening
  • RBC morphological research
  • Hematological education and training tool development

Technical Innovation Summary

RedDino's core innovation lies in successfully adapting a general self-supervised learning framework to specialized medical domains. Through removing inappropriate regularization constraints and optimizing training strategies, it achieves significant performance improvements. This provides valuable reference for foundation model development in other medical imaging analysis tasks.


Environmental Impact Statement: The paper reports experimental carbon emissions of 4.15 kg CO2eq, reflecting attention to environmental responsibility.