🔒 Challenges
(1) Pixel-level Diffusion process is computing resource-consuming during training.
(2) Multi-step reverse process and multiple-sample average scheme are time-consuming during inference.
🌟 Stable not only shows that SDSeg is built on Stable Diffusion but also indicates its remarkable stability.
🌟 SDSeg only requires a single-step reverse process to generate segmentation results.
🌟 SDSeg has remarkable stability and doesn't need to sample multiple times for average.
(1) Pixel-level Diffusion process is computing resource-consuming during training.
(2) Multi-step reverse process and multiple-sample average scheme are time-consuming during inference.
(1) Using Latent-level Diffusion model (Stable Diffusion, SD).
Also, SD's Autoencoder can generalize to segmentation maps. No fintune needed.
(2) Designing a single-step reverse process with strong stability against initial noise.
👉 Dataset-level Stability: performs repeated inferences on test data to measure variability.
👉 Instance-level Stability: examines the model’s consistency under varying initial noise.
@InProceedings{lin2024stable,
author="Lin, Tianyu
and Chen, Zhiguang
and Yan, Zhonghao
and Yu, Weijiang
and Zheng, Fudan",
title="Stable Diffusion Segmentation for Biomedical Images with Single-Step Reverse Process",
booktitle="Medical Image Computing and Computer Assisted Intervention -- MICCAI 2024",
year="2024",
publisher="Springer Nature Switzerland",
address="Cham",
pages="656--666",
isbn="978-3-031-72111-3"
}