Post-surgical Endometriosis Segmentation in Laparoscopic Videos

Leibetseder, Schoeffmann, Keckstein et al.
Endometriosis is a common women's condition exhibiting a manifold visual appearance in various body-internal locations. Having such properties makes its identification very difficult and error-prone, at least for laymen and non-specialized medical practitioners. In an attempt to provide assistance to gynecologic physicians treating endometriosis, this demo paper describes a system that is trained to segment one frequently occurring visual appearance of endometriosis, namely dark endometrial implants. The system is capable of analyzing laparoscopic surgery videos, annotating identified implant regions with multi-colored overlays and displaying a detection summary for improved video browsing.

Basic Information

  • Paper ID: 2510.13899
  • Title: Post-surgical Endometriosis Segmentation in Laparoscopic Videos
  • Authors: Andreas Leibetseder, Klaus Schoeffmann (Klagenfurt University), Jörg Keckstein (Ulm University), Simon Keckstein (Ludwig-Maximilians-University Munich)
  • Classification: cs.CV cs.LG cs.MM
  • Publication Date: October 14, 2025 (arXiv preprint)
  • Paper Link: https://arxiv.org/abs/2510.13899

Abstract

Endometriosis is a common gynecological condition that exhibits diverse visual appearances across different anatomical locations within the body. This makes its identification difficult and error-prone, particularly for non-specialist physicians. To assist gynecologists in treating endometriosis, this demonstration paper describes a system trained to segment one frequently occurring visual manifestation of endometriosis, namely dark endometrial implants. The system analyzes laparoscopic surgical videos, annotates identified implant regions with multi-colored overlays, and displays a detection summary to improve video browsing.

Research Background and Motivation

1. Research Problem

This research addresses the automatic identification and segmentation of endometriotic lesions during laparoscopic surgery. Endometriosis is a condition characterized by abnormal growth of uterine-like tissue outside the uterus, affecting women of reproductive age.

2. Problem Significance

  • Diagnostic Difficulty: Endometriosis presents diverse visual appearances at different anatomical locations, increasing identification difficulty
  • Medical Quality: Complete identification and documentation of all lesions is crucial for improving patient symptoms and quality of life
  • Educational Needs: Inexperienced physicians under time pressure face risks of incomplete diagnosis
  • Classification Systems: Two major classification systems (rASRM and Enzian) exist, requiring accurate visual assessment

3. Limitations of Existing Approaches

  • Reliance on subjective visual assessment by surgeons
  • Difficulty of inspecting the large pelvic and peritoneal area completely
  • Increased identification difficulty due to varying colors and appearances of endometrial lesions
  • Diagnostic errors resulting from insufficient training and time pressure

4. Research Motivation

Leveraging the successful applications of deep learning in medical imaging, this work develops a system capable of automatically identifying and segmenting dark endometrial implants to support intraoperative or postoperative analysis and improve educational training outcomes.

Core Contributions

  1. Model Adaptation: Adaptation of Mask R-CNN for binary segmentation of endometrial implants
  2. Visualization System: Provision of spatial and temporal visualization of endometrial implants in laparoscopic surgical videos
  3. Open-Source Tools: Release of tool source code and pre-trained models for academic use
  4. Practical Demonstration: Demonstration of the feasibility of applying an established deep learning detection and segmentation model to a real-world medical use case

Methodology

Task Definition

  • Input: Laparoscopic surgical video
  • Output: Annotations of dark endometrial implants with segmentation masks and confidence scores
  • Constraints: Focus on single-category dark endometrial implant identification

Model Architecture

1. Overall Architecture

The system comprises three main steps:

  • Dataset Creation: Extraction of single-category lesion dataset from GLENDA dataset
  • Model Training: Transfer learning using Mask R-CNN
  • Video Analysis: Model application and result visualization

2. Dataset Construction

  • Base Data: Extracted from Gynecologic Laparoscopy Endometriosis Dataset (GLENDA)
  • Scale: Over 350 region-based endometrial implant annotations covering 160 frames from over 100 patient cases
  • Data Augmentation: Techniques including rotation, blur, perspective transformation, desaturation, and target tracking

3. Model Design

  • Base Network: Mask R-CNN with ResNet-101 as backbone network
  • Loss Function: Multi-task loss function including:
    • Classification loss (log loss)
    • Bounding box loss (smooth L1 loss)
    • Mask segmentation loss (binary cross-entropy loss)
  • Training Parameters: 50 epochs, learning rate 0.001, stochastic gradient descent optimizer
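
The three loss terms listed above can be illustrated with a small, dependency-free sketch. It follows the standard Mask R-CNN formulation, in which the total loss is the plain sum of the classification, box, and mask terms. The function names and toy inputs are illustrative assumptions, not the authors' training code.

```python
import math


def log_loss(prob_true_class):
    """Classification loss: negative log of the probability given to the true class."""
    return -math.log(prob_true_class)


def smooth_l1(pred_box, true_box):
    """Smooth L1 loss summed over box coordinates (quadratic below 1, linear above)."""
    total = 0.0
    for p, t in zip(pred_box, true_box):
        d = abs(p - t)
        total += 0.5 * d * d if d < 1.0 else d - 0.5
    return total


def binary_cross_entropy(pred_mask, true_mask, eps=1e-7):
    """Mean per-pixel binary cross-entropy between probabilities and a 0/1 mask."""
    total, count = 0.0, 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, y in zip(pred_row, true_row):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
            count += 1
    return total / count


def multitask_loss(prob_true_class, pred_box, true_box, pred_mask, true_mask):
    """Sum of the classification, bounding box, and mask terms."""
    return (
        log_loss(prob_true_class)
        + smooth_l1(pred_box, true_box)
        + binary_cross_entropy(pred_mask, true_mask)
    )
```

In a real training run these terms are computed per region of interest by the detection framework; the sketch only shows how each term behaves on a single toy example.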

4. Video Processing Pipeline

Raw surgical video → Frame-by-frame analysis → Extract bounding boxes, masks, and labels → Generate annotated frames → Create detection summary bar → Output annotated video
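
The pipeline above can be sketched as a simple frame loop. The sketch assumes a hypothetical `detect` callable standing in for the trained model, a hypothetical `Detection` record, and a text-only stand-in for the colored summary bar; none of these names come from the paper's tool.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    box: tuple    # (x1, y1, x2, y2) in pixels
    score: float  # detection confidence in [0, 1]
    label: str


def analyze_video(frames, detect, min_score=0.5):
    """Run `detect` on each frame; keep confident detections and a per-frame summary.

    `detect` is a hypothetical callable returning a list of Detection objects.
    The summary holds the highest kept confidence per frame (0.0 if none).
    """
    per_frame, summary = [], []
    for frame in frames:
        kept = [d for d in detect(frame) if d.score >= min_score]
        per_frame.append(kept)
        summary.append(max((d.score for d in kept), default=0.0))
    return per_frame, summary


def render_summary_bar(summary, levels=" .:*#"):
    """Map each frame's peak confidence to one character (text stand-in for the bar)."""
    top = len(levels) - 1
    return "".join(levels[min(top, int(s * len(levels)))] for s in summary)
```

The summary list is what makes browsing fast: a viewer can jump straight to frames whose entry is non-zero instead of scrubbing through the whole recording.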

Technical Innovations

  1. Medical Domain Adaptation: Successful adaptation of generic object detection networks to specific medical scenarios
  2. Temporal Visualization: Innovative provision of temporal indicator bars for detection confidence, facilitating rapid identification of key frames
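  3. Per-frame Processing Speed: Average processing time of roughly 150-250 ms per frame, i.e. about 4-7 frames per second, which is suitable for postoperative analysis but falls short of real-time playback at 25 fps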
  4. Multi-modal Output: Simultaneous provision of visual annotations and structured data in JSON format
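
As a rough illustration of the structured output mentioned in the last item, the sketch below serializes per-frame detections to JSON. The field names (`video`, `fps`, `frames`, `time_s`, and so on) are hypothetical; the summary does not specify the tool's actual JSON schema.

```python
import json


def detections_to_json(video_name, fps, per_frame_detections):
    """Serialize detections to JSON, skipping frames without detections.

    `per_frame_detections` is a list with one entry per frame; each entry is a
    list of (label, score, box) tuples. The schema is an illustrative assumption.
    """
    records = []
    for index, detections in enumerate(per_frame_detections):
        if not detections:
            continue
        records.append({
            "frame": index,
            "time_s": round(index / fps, 3),
            "detections": [
                {"label": label, "score": score, "box": list(box)}
                for (label, score, box) in detections
            ],
        })
    return json.dumps({"video": video_name, "fps": fps, "frames": records}, indent=2)
```

Storing the frame index together with a timestamp lets downstream tools seek directly to a detection without re-running the model.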

Experimental Setup

Dataset

  • Name: Custom single-category dataset based on GLENDA
  • Scale: 350+ annotations, 160 frames, 100+ patient cases
  • Characteristics: Focus on dark endometrial implants
  • Division: Training, validation, and test sets

Evaluation Metrics

  • Primary Metric: Mean Average Precision (mAP) for mask segmentation
  • Threshold Settings: mAP at a fixed IoU threshold of 0.5, and mAP averaged over IoU thresholds from 0.5 to 0.95
  • Confidence: Detection confidence threshold 0.50
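
To make the IoU thresholds concrete, the following sketch computes the intersection over union of two binary masks and checks it against a threshold. Masks are plain nested lists of 0/1 for simplicity. The 0.05 step between 0.5 and 0.95 follows the common COCO convention and is an assumption here, since the summary only gives the range.

```python
def mask_iou(mask_a, mask_b):
    """Intersection over union of two equally sized binary masks (nested lists of 0/1)."""
    inter = union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            inter += 1 if (a and b) else 0
            union += 1 if (a or b) else 0
    return inter / union if union else 0.0


def is_match(pred_mask, gt_mask, iou_threshold=0.5):
    """A prediction counts as a match when its mask IoU reaches the threshold."""
    return mask_iou(pred_mask, gt_mask) >= iou_threshold


def iou_thresholds():
    """Thresholds 0.50, 0.55, ..., 0.95 (COCO-style step, assumed here)."""
    return [round(0.5 + 0.05 * i, 2) for i in range(10)]
```

Averaging over the stricter thresholds rewards precise mask boundaries, which is why the 0.5-0.95 score is typically much lower than the score at 0.5 alone.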

Implementation Details

  • Image Input: Resized to 800 pixels (short edge) and 1333 pixels (long edge)
  • Best Model: Optimal performance achieved after 29 epochs
  • Augmentation Strategy: Rotation and cropping augmentations proved most effective
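
The 800/1333 input sizing can be read as the usual detection-framework rule: scale the short edge to 800 pixels unless that would push the long edge past 1333 pixels, in which case the long edge is capped instead. The sketch below implements that interpretation; it is an assumption about how the resize is applied, and the function name is illustrative.

```python
def resized_shape(width, height, short_edge=800, max_long_edge=1333):
    """Return (width, height) after aspect-preserving resize.

    The short edge is scaled to `short_edge`; if the long edge would then
    exceed `max_long_edge`, the scale is reduced so the long edge fits.
    """
    scale = short_edge / min(width, height)
    if max(width, height) * scale > max_long_edge:
        scale = max_long_edge / max(width, height)
    return round(width * scale), round(height * scale)
```

Under this reading, every 16:9 input ends up at roughly 1333×750 regardless of its original resolution, which may partly explain why the reported processing time grows only modestly from 640×360 to 3840×2160.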

Experimental Results

Main Results

  • Optimal Performance:
    • mAP@0.50IoU: 0.642 (IoU threshold 0.5)
    • mAP@0.50:0.95: 0.324 (IoU threshold 0.5-0.95)
  • Training Efficiency: Model training completed in approximately 2 hours
  • Processing Speed: Average per-frame processing time at different resolutions:

    Resolution    Average Processing Time (ms)
    640×360       153
    1280×720      158
    1920×1080     170
    3840×2160     207

Performance Analysis

  • Processing Estimation: One hour of Full HD (1920×1080) video at 25 fps requires approximately 4 hours 15 minutes of processing (90,000 frames at about 170 ms each)
  • Hardware Requirements: Intel Core i7-5820K, 32GB RAM, GTX 1080
  • Cross-platform Compatibility: Supports Linux and Windows; macOS support is anticipated
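
The processing estimate follows directly from the per-frame timings. The short sketch below reproduces the arithmetic using the reported averages; the function names are illustrative.

```python
# Average per-frame processing times in milliseconds, as reported in the results.
AVG_MS_PER_FRAME = {
    "640x360": 153,
    "1280x720": 158,
    "1920x1080": 170,
    "3840x2160": 207,
}


def processing_hours(duration_s, fps, ms_per_frame):
    """Total processing time in hours for a video of the given length and frame rate."""
    frames = duration_s * fps
    return frames * ms_per_frame / 1000 / 3600


def as_hours_minutes(hours):
    """Split fractional hours into whole hours and rounded minutes."""
    whole = int(hours)
    return whole, round((hours - whole) * 60)
```

For a one-hour video at 25 fps and 170 ms per frame, `processing_hours(3600, 25, 170)` gives 4.25 hours, i.e. 4 hours 15 minutes, so analysis runs at a bit more than four times the video's duration.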

Case Analysis

The paper provides four annotation examples of dark endometrial implants, demonstrating the system's ability to identify pathological regions that are visually distinct from surrounding tissue but similar to blood spots or dark blood vessels.

Related Work

1. Medical Image Segmentation

Widespread application of deep learning in medical imaging provides the technical foundation for this research.

2. Object Detection Networks

  • Faster R-CNN: Provides region proposal network foundation
  • Mask R-CNN: Core segmentation network architecture
  • ResNet: Serves as backbone feature extraction network

3. Endometriosis Classification

  • rASRM Classification: Applicable to peritoneal lesion documentation
  • Enzian Classification: Covers deep infiltrating endometriosis

Conclusions and Discussion

Main Conclusions

  1. Successfully demonstrates the feasibility of Mask R-CNN for endometriosis segmentation tasks
  2. Develops a complete video analysis tool chain supporting postoperative video archive analysis
  3. Provides visualization interface facilitating treatment planning and clinical education

Limitations

  1. Single Type: Addresses only dark endometrial implants, not covering other visual manifestations
  2. Dataset Scale: Relatively small dataset may limit model generalization capability
  3. Demonstration Nature: Current version is a proof-of-concept lacking comprehensive user interface
  4. Processing Speed: Real-time processing capability requires improvement

Future Directions

  1. Extension to multi-category endometriosis lesion detection
  2. Development of interactive postoperative video browsing system
  3. Improvement of user interface and user experience
  4. Creation of larger annotated datasets

In-Depth Evaluation

Strengths

1. Technical Innovation

  • Domain Adaptation: Successful adaptation of generic computer vision techniques to specialized medical scenarios
  • Practical Tools: Provision of complete end-to-end solution from model training to video analysis
  • Open-Source Contribution: Release of source code and pre-trained models promoting academic research

2. Experimental Sufficiency

  • Multi-dimensional Evaluation: Comprehensive analysis including performance metrics, processing time, and hardware requirements
  • Practical Application: Design based on real patient data and clinical requirements
  • Reproducibility: Detailed implementation details and open-source code supporting result reproduction

3. Clinical Value

  • Educational Significance: Facilitates physician training and skill enhancement
  • Diagnostic Support: Could reduce the risk of missed lesions and improve diagnostic accuracy, although this has not yet been validated clinically
  • Efficiency Improvement: Automated analysis saves physician time

Shortcomings

1. Methodological Limitations

  • Single Category: Addresses only one visual manifestation; actual applications require identification of multiple lesion types
  • Data Dependency: Relatively small dataset may impact model generalization across different hospitals and equipment
  • Threshold Sensitivity: Fixed confidence threshold may not be applicable to all scenarios

2. Evaluation Insufficiency

  • Lack of Clinical Validation: No validation studies conducted in actual clinical environments
  • Limited Baseline Comparisons: Lacks detailed comparison with other medical segmentation methods
  • Missing User Studies: Absence of evaluation regarding actual physician use and acceptance of the tool

3. Technical Details

  • Insufficient Real-time Performance: Processing speed falls short of what intraoperative real-time analysis would require
  • Rudimentary Interface: Current version lacks well-designed user interface

Impact

1. Academic Contribution

  • Provides new research direction for medical video analysis field
  • Demonstrates potential of deep learning in gynecological disease diagnosis
  • Provides reusable datasets and tools

2. Practical Value

  • Potential to improve diagnostic accuracy of endometriosis
  • Applicable to medical education and training
  • Establishes foundation for developing more comprehensive medical auxiliary diagnostic systems

3. Reproducibility

  • Provides detailed technical implementation details
  • Open-source code and pre-trained models
  • Clear installation and usage instructions

Applicable Scenarios

  1. Postoperative Analysis: Retrospective analysis of surgical videos ensuring complete lesion identification
  2. Medical Education: Training junior physicians to recognize endometrial lesions
  3. Research Tool: Supporting large-scale clinical research in lesion annotation and analysis
  4. Quality Control: Verifying surgical completeness and diagnostic accuracy

References

  1. Canis, M., et al. "Revised American Society for Reproductive Medicine classification of endometriosis: 1996." Fertility and Sterility, 1997.
  2. He, K., et al. "Mask R-CNN." IEEE Trans. Pattern Anal. Mach. Intell., 2020.
  3. Leibetseder, A., et al. "GLENDA: gynecologic laparoscopy endometriosis dataset." MultiMedia Modeling, 2020.

Summary: This is a demonstration paper showcasing the application of deep learning in gynecological medical video analysis. While the current version has certain limitations, it provides valuable exploration in the field of medical AI-assisted diagnosis with promising development prospects and practical value. The open-source nature of this work will promote further development of related research.