Post Processing of image segmentation using Conditional Random Fields

Dhawan, Bodani, Garg

The output of the image segmentation process is usually not very clear due to the low-quality features of satellite images. The purpose of this study is to find a suitable Conditional Random Field (CRF) to achieve better clarity in a segmented image. We started with different types of CRFs and studied why each is or is not suitable for our purpose. We evaluated our approach on two different datasets: satellite imagery with low-quality features and high-quality aerial photographs. During the study we experimented with various CRFs to find which gives the best results on images, and compared our results on these datasets to show the pitfalls and potential of the different approaches.

Basic Information

Paper ID: 2510.09833
Title: Post Processing of image segmentation using Conditional Random Fields
Authors: Aashish Dhawan (JMIETI, Radaur), Pankaj Bodani (ISRO, Space Applications Center), Vishal Garg (JMIETI, Radaur)
Classification: cs.CV
Publication Date/Conference: INDIACom 2019 (IEEE Conference ID 46181)
Paper Link: https://arxiv.org/abs/2510.09833

Abstract

The output of satellite image segmentation often lacks clarity due to the low-quality characteristics of satellite imagery. The objective of this research is to identify appropriate Conditional Random Fields (CRF) to enhance the clarity of segmented images. The study begins with different types of CRF, analyzing the reasons for their applicability or inapplicability. The method was evaluated on two distinct datasets: satellite images with low-quality characteristics and high-quality aerial photographs. Various CRF models were tested in experiments to identify the best-performing CRF on images, and results were compared across datasets, demonstrating the deficiencies and potential of different approaches.

Research Background and Motivation

Core Problem: The output of the image segmentation process often lacks sufficient clarity, particularly for low-resolution satellite images, where segmentation accuracy is very low due to poor image quality characteristics.
Problem Significance: Image segmentation has broad applications in medical imaging, object detection, and object recognition. For satellite image analysis, clear and accurate segmentation is crucial for applications such as urban planning and environmental monitoring.
Limitations of Existing Methods:
- Traditional segmentation techniques (K-means clustering, region growing algorithms, watershed transformation) have low accuracy
- Results vary depending on the data used, algorithm parameter values, and the efficiency of the personnel performing the task
- Segment boundaries in neural network outputs tend to be jagged
Research Motivation: CRF, as an undirected graphical model, can account for the influence of "adjacent" samples and is applicable to pattern recognition and sequence prediction. It has been used as a post-processing technique in neural network pipelines such as W-Net.

Core Contributions

Systematic Comparison: Conducted systematic analysis and comparison of different types of CRF (linear CRF, grid CRF, dense CRF), determining their applicability in image segmentation post-processing
Optimal CRF Selection: Identified the fully connected CRF, as formulated by Krähenbühl and Koltun, as the best choice for image segmentation post-processing
Multi-dataset Validation: Validated the method's effectiveness on two different types of datasets: low-quality satellite images and high-quality aerial photographs
Parameter Optimization Analysis: Through experiments, determined the impact of critical parameters such as negative probability on result quality

Methodology Details

Task Definition

Input: Segmented image (labeled image)
Output: Clarified segmented image after CRF post-processing
Objective: Improve image segmentation clarity and accuracy, reduce noise and edge jaggedness

CRF Model Analysis

1. Linear CRF

Structure: One-dimensional chain structure, with each node connected to the previous and next nodes
Applicability: Primarily used for NLP tasks (part-of-speech tagging, shallow parsing)
Inapplicability Reason: Image information is distributed in two dimensions; a one-dimensional structure cannot establish effective relationships between image pixel nodes

2. Grid CRF

Structure: Two-dimensional grid structure, with each node connected to four adjacent nodes
Applicability: Can be used for image segmentation tasks, widely used in pattern recognition
Limitations: Node connections are insufficiently dense to solve complex problems; inadequate for the complexity of information distribution in images

3. Dense CRF (Fully Connected CRF)

Structure: Each node is connected to all other n-1 nodes
Advantages:
- Can interpret long-range interactions that may exist in images
- Suitable for image data distribution characteristics
- Optimal CRF model for image segmentation
Challenges: Extremely high computational complexity; processing large images may require days
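
To make the difference in connectivity between the three structures concrete, the following sketch counts pairwise edges for an image of a given size. The function name `crf_edge_counts` and the raster-order chain used for the linear case are illustrative assumptions, not taken from the paper.

```python
def crf_edge_counts(height: int, width: int) -> dict:
    """Count pairwise edges for three CRF topologies over a height x width image."""
    n = height * width
    return {
        # Chain over pixels in raster order: each pixel is linked to the next one.
        "linear": n - 1,
        # 4-connected grid: horizontal edges plus vertical edges.
        "grid": height * (width - 1) + width * (height - 1),
        # Fully connected: one edge per unordered pair of pixels.
        "dense": n * (n - 1) // 2,
    }


counts = crf_edge_counts(6000, 6000)  # size of one Potsdam patch
for name, edges in counts.items():
    print(f"{name:>6}: {edges:,} edges")
```

The dense edge count grows quadratically with the number of pixels, which is why exact inference over all pairs is impractical and an efficient approximate scheme is needed.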

Technical Innovation

Efficient Inference Algorithm: Employs the efficient fully connected CRF inference algorithm proposed by Krähenbühl and Koltun:

Reduces computation time from 36 hours to 0.2 seconds
Maintains output quality essentially unchanged
Significantly reduces computational cost
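
As a rough illustration of what fully connected inference computes, the sketch below runs brute-force mean-field updates on a tiny image with a Potts label model. It is not Krähenbühl and Koltun's algorithm: their speed-up comes from approximating the message-passing step with high-dimensional Gaussian filtering, whereas this version builds the full pixel-by-pixel kernel and is only usable on toy inputs. All function names, weights, and bandwidths here are assumptions chosen for the example.

```python
import numpy as np


def softmax(x):
    """Row-wise softmax with the usual max-subtraction for stability."""
    x = x - x.max(axis=1, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=1, keepdims=True)


def naive_dense_crf(unary, image, n_iters=5, w_app=3.0, w_smooth=1.0,
                    theta_pos=3.0, theta_rgb=0.1, theta_smooth=1.0):
    """Brute-force mean-field inference for a fully connected Potts CRF.

    unary: (H, W, K) negative log-probabilities per pixel and class.
    image: (H, W, C) float array with values in [0, 1].
    Returns an (H, W) array of class indices.
    """
    h, w, k = unary.shape
    n = h * w
    ys, xs = np.mgrid[0:h, 0:w]
    pos = np.stack([ys.ravel(), xs.ravel()], axis=1).astype(float)  # (N, 2)
    feat = image.reshape(n, -1).astype(float)                       # (N, C)

    # Squared distances between every pair of pixels: O(N^2) time and memory.
    d_pos = ((pos[:, None, :] - pos[None, :, :]) ** 2).sum(-1)
    d_rgb = ((feat[:, None, :] - feat[None, :, :]) ** 2).sum(-1)

    # Appearance kernel (position + colour) plus smoothness kernel (position only).
    kernel = (w_app * np.exp(-d_pos / (2 * theta_pos**2) - d_rgb / (2 * theta_rgb**2))
              + w_smooth * np.exp(-d_pos / (2 * theta_smooth**2)))
    np.fill_diagonal(kernel, 0.0)  # a pixel sends no message to itself

    u = unary.reshape(n, k)
    q = softmax(-u)  # initialise marginals from the unary term alone
    for _ in range(n_iters):
        msg = kernel @ q  # (N, K): kernel-weighted sum of neighbour marginals
        # Potts compatibility: label l is penalised by the mass on all other labels.
        pairwise = msg.sum(axis=1, keepdims=True) - msg
        q = softmax(-u - pairwise)
    return q.argmax(axis=1).reshape(h, w)


# Toy example: two flat regions and one mislabelled pixel.
image = np.zeros((8, 8, 1))
image[:, 4:] = 1.0
clean = (image[..., 0] > 0.5).astype(int)
noisy = clean.copy()
noisy[2, 1] = 1  # inject a labelling error

# Unary energies from 70% confidence in the given label (an assumed value).
probs = np.where(np.eye(2, dtype=bool)[noisy], 0.7, 0.3)  # (8, 8, 2)
refined = naive_dense_crf(-np.log(probs), image)
print("pixel (2, 1) before:", noisy[2, 1], "after:", refined[2, 1])
```

The `kernel @ q` product is the step whose cost is quadratic in the number of pixels; replacing it with an approximate filtering operation is what makes the dense model practical on full-size images.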

Conditional Probability Modeling:

P(A|B) = P(A∩B) / P(B)

Where P(A|B) defines the probability of event A given that event B has occurred.
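
In a CRF the same conditioning is applied to an entire labelling X given the observed image I. For reference, the fully connected model is commonly written as follows; the notation is the one used by Krähenbühl and Koltun and is not spelled out in the summary above, so the symbols should be read as assumptions about the model being used.

```latex
P(X \mid I) = \frac{1}{Z(I)} \exp\bigl(-E(X \mid I)\bigr),
\qquad
E(x) = \sum_{i} \psi_u(x_i) + \sum_{i<j} \psi_p(x_i, x_j)

\psi_p(x_i, x_j) = \mu(x_i, x_j) \left[
  w^{(1)} \exp\!\left(-\frac{\lVert p_i - p_j\rVert^2}{2\theta_\alpha^2}
                      -\frac{\lVert I_i - I_j\rVert^2}{2\theta_\beta^2}\right)
  + w^{(2)} \exp\!\left(-\frac{\lVert p_i - p_j\rVert^2}{2\theta_\gamma^2}\right)
\right]
```

Here \psi_u is the unary potential derived from the initial segmentation, \mu is the label compatibility function (for a Potts model, 1 when the labels differ and 0 otherwise), p_i and I_i are the position and colour of pixel i, and Z(I) is the normalising constant. The first kernel encourages nearby pixels of similar colour to share a label; the second removes small isolated regions.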

Experimental Setup

Datasets

1. Urban Dataset

Source: Provided by India's Space Applications Center
Content: High-quality satellite images of several Indian cities (Jodhpur, Rampura, etc.)
Sensor: LISS-3 sensor from India's IRS-2 satellite
Annotation: Annotated using region growing algorithm into two classes (binary classification)
Includes: Original input images and labeled images

2. Potsdam Dataset

Source: International Society for Photogrammetry and Remote Sensing (ISPRS)
Content: 38 image patches of 6000×6000 pixels from Potsdam city
Channels: 4-channel (red, green, blue, infrared)
Classification: 6-class annotation
- Clutter/Background (red)
- Cars (yellow)
- Low vegetation (cyan)
- Buildings (blue)
- Trees (green)
- Impervious surfaces (white)
Includes: Input images, labeled images, and ground truth labels
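
Before a colour-coded label image can be passed to a CRF or compared with ground truth, it has to be converted to integer class indices. The sketch below does this for the six colours listed above. The exact RGB triplets (pure 0/255 values), the class ordering, and the name `rgb_to_class_index` are assumptions made for the example.

```python
import numpy as np

# Colour legend as listed above, assumed to use pure 0/255 channel values.
POTSDAM_COLORS = {
    "impervious_surfaces": (255, 255, 255),  # white
    "buildings": (0, 0, 255),                # blue
    "low_vegetation": (0, 255, 255),         # cyan
    "trees": (0, 255, 0),                    # green
    "cars": (255, 255, 0),                   # yellow
    "clutter": (255, 0, 0),                  # red
}
CLASS_NAMES = list(POTSDAM_COLORS)  # index in this list = class index


def rgb_to_class_index(label_rgb, unknown=-1):
    """Map an (H, W, 3) colour-coded label image to an (H, W) array of class indices."""
    label_rgb = np.asarray(label_rgb)
    out = np.full(label_rgb.shape[:2], unknown, dtype=np.int64)
    for idx, name in enumerate(CLASS_NAMES):
        mask = (label_rgb == np.array(POTSDAM_COLORS[name])).all(axis=-1)
        out[mask] = idx
    return out


tile = np.array([[[255, 255, 255], [0, 0, 255]],
                 [[255, 255, 0], [12, 34, 56]]], dtype=np.uint8)
indices = rgb_to_class_index(tile)
print(indices)
```

Pixels whose colour matches none of the legend entries are marked with `unknown`, so that they can be excluded or inspected rather than silently assigned to a class.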

Evaluation Metrics

Visual quality assessment (due to lack of ground truth labels in the urban dataset)
Comparison with ground truth labels (Potsdam dataset)
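
Where ground truth is available, the comparison could also be expressed numerically. The following sketch computes overall pixel accuracy and per-class intersection over union (IoU) from two index maps. These metrics are a suggestion for illustration; the summary does not state that the paper reports them.

```python
import numpy as np


def pixel_accuracy(pred, truth):
    """Fraction of pixels whose predicted class equals the ground-truth class."""
    pred, truth = np.asarray(pred), np.asarray(truth)
    return float((pred == truth).mean())


def per_class_iou(pred, truth, n_classes):
    """Intersection over union for each class; NaN if a class is absent from both maps."""
    pred, truth = np.asarray(pred), np.asarray(truth)
    ious = []
    for c in range(n_classes):
        inter = np.logical_and(pred == c, truth == c).sum()
        union = np.logical_or(pred == c, truth == c).sum()
        ious.append(float(inter) / float(union) if union else float("nan"))
    return ious


truth = np.array([[0, 0], [1, 1]])
pred = np.array([[0, 1], [1, 1]])
acc = pixel_accuracy(pred, truth)
ious = per_class_iou(pred, truth, n_classes=2)
print("accuracy:", acc, "IoU per class:", ious)
```

Computing these values before and after CRF post-processing would give a direct measure of how much the refinement helps on the Potsdam data.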

Implementation Details

Key Parameters: Negative probability (adjusted from 70% to 99%)
Algorithm: Efficient fully connected CRF by Krähenbühl and Koltun
Optimization Strategy: Improve results by adjusting negative probability and potential function parameters
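
The summary does not define how "negative probability" enters the model. One plausible reading is that it controls how much the CRF trusts the labels of the input segmentation when building unary potentials. The sketch below illustrates that reading only: `label_conf` is a hypothetical parameter, and its correspondence to the paper's "negative probability" is an assumption.

```python
import numpy as np


def unary_from_hard_labels(labels, n_classes, label_conf):
    """Build (H, W, K) unary energies from a hard label map.

    The labelled class gets probability `label_conf`; the remainder is
    split evenly over the other classes. Energies are negative log-probabilities.
    `label_conf` is a hypothetical stand-in for the tuned parameter.
    """
    labels = np.asarray(labels)
    if not 0.0 < label_conf < 1.0:
        raise ValueError("label_conf must lie strictly between 0 and 1")
    other = (1.0 - label_conf) / (n_classes - 1)
    probs = np.full(labels.shape + (n_classes,), other)
    np.put_along_axis(probs, labels[..., None], label_conf, axis=-1)
    return -np.log(probs)


labels = np.array([[0, 1], [1, 0]])
gaps = []
for conf in (0.70, 0.80, 0.90, 0.95, 0.99):
    unary = unary_from_hard_labels(labels, n_classes=2, label_conf=conf)
    # Energy penalty for contradicting the input label at pixel (0, 0).
    gap = unary[0, 0, 1] - unary[0, 0, 0]
    gaps.append(gap)
    print(f"label_conf={conf:.2f}  energy gap={gap:.3f}")
```

Under this reading, raising the value widens the energy gap between keeping and changing a pixel's input label, so the pairwise terms need stronger evidence to override the initial segmentation.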

Experimental Results

Main Results

Urban Dataset Experiments

Initial Results (70% negative probability):
- Corrected some under-predicted regions
- Corrected patchy regions (over-prediction) to some extent
- Completely missed certain regions
Optimized Results (95% negative probability):
- Most prediction errors in initial segmentation were corrected
- Results were satisfactory and met expected standards

Potsdam Dataset Experiments

Initial Results: Severe output distortion, with many regions missed by the model
After Parameter Optimization: Output quality improved significantly as negative probability increased
Best Results: When using labeled images and ground truth labels, output was very close to ground truth

Parameter Impact Analysis

Experiments demonstrated that the negative probability parameter has a significant impact on result quality:

80% → 90% → 95% → 99%: Quality progressively improved
Higher negative probability produces better results
This pattern was verified across all experiments

Experimental Findings

Fully Connected CRF Most Suitable: Compared to linear CRF and grid CRF, fully connected CRF is most suitable for image segmentation post-processing
Parameter Sensitivity: The negative probability parameter has a decisive impact on result quality
Dataset Adaptability: The method performs well on different types of datasets (binary and multi-class classification)
Computational Efficiency: Efficient algorithms make practical application feasible

Related Work

W-Net: An unsupervised image segmentation pipeline using CRF as post-processing technique, employing two convolutional neural networks for encoding and decoding
SVM-CRF: Plath et al. used support vector machine-based CRF for image feature segmentation
Deep CRF: Alam et al. combined deep CRF with convolutional neural networks for hyperspectral image segmentation
Markov Random Fields: Other graphical models such as MRF are also used for image segmentation post-processing

Advantages of This Work

Systematically compared the applicability of different CRF types
Employed computationally more efficient fully connected CRF implementation
Validated method generalizability across multiple datasets

Conclusions and Discussion

Main Conclusions

Optimal CRF Identified: The fully connected CRF, as formulated by Krähenbühl and Koltun, is the best choice for image segmentation post-processing
Parameter Importance: Adjustment of the negative probability parameter is critical to result quality
Generalizability Verified: The method is applicable to both binary and multi-class image segmentation tasks
Efficiency Improvement: Efficient algorithms make fully connected CRF feasible for practical applications

Limitations

Parameter Tuning: Requires manual parameter adjustment for different datasets
Insufficient Quantitative Evaluation: The urban dataset lacks ground truth labels, preventing quantitative accuracy assessment
Computational Resources: Despite optimization, still requires considerable computational resources
Scope of Applicability: Primarily targeted at satellite and aerial imagery; applicability to other image types remains insufficiently validated

Future Directions

The paper proposes developing a complete image segmentation pipeline:

Preprocessing
Unsupervised segmentation techniques
CRF model post-processing

The goal is to obtain more accurate and clear segmented images.

In-depth Evaluation

Strengths

Systematic Study: Comprehensive applicability analysis of different CRF types
High Practical Value: Addresses practical problems in satellite image segmentation
Multi-dataset Validation: Validates method effectiveness across different dataset types
Computational Efficiency Consideration: Employs efficient algorithms to address computational complexity
Detailed Parameter Analysis: Determines the impact of critical parameters through experiments

Weaknesses

Limited Innovation: Primarily application and comparison of existing methods; insufficient technical innovation
Incomplete Evaluation: Lacks quantitative metrics and statistical significance testing
Small Experimental Scale: Relatively small dataset scale; generalizability remains to be verified
Lack of Theoretical Analysis: Lacks theoretical explanation for why fully connected CRF performs best
Limited Comparison Methods: Primarily compares with traditional methods; lacks comparison with modern deep learning approaches

Impact

Field Contribution: Provides practical solutions for satellite image segmentation post-processing
Practical Value: Offers a useful reference for the remote sensing image processing field
Reproducibility: Method description is relatively clear, but lacks code and detailed parameter settings

Applicable Scenarios

Satellite Image Analysis: Urban planning, land use analysis
Remote Sensing Applications: Environmental monitoring, agricultural analysis
Image Segmentation Post-processing: Any application requiring improved segmentation clarity
Resource-limited Environments: Lower computational resource requirements compared to deep learning methods

References

The paper cites 14 related references, primarily including:

Krähenbühl & Koltun (2011): Efficient fully connected CRF inference algorithm
W-Net related work: Unsupervised image segmentation
CRF applications in medical imaging and computer vision
Traditional image segmentation methods and evaluation

Overall Assessment: This is an application-oriented paper that systematically compares the performance of different CRF models in image segmentation post-processing, providing valuable guidance for practical applications. Although technical innovation is limited, the research methodology is solid, the conclusions are practical, and the work contributes to the remote sensing image processing field.