DRUM: Diffusion-based Raydrop-aware Unpaired Mapping for Sim2Real LiDAR Segmentation

ICRA 2026
Tomoya Miyawaki1     Kazuto Nakashima1     Yumi Iwashita2     Ryo Kurazume1
1Kyushu University, Japan   2Jet Propulsion Laboratory, USA
Paper Code Demo
TL;DR: Sim2Real domain translation of LiDAR data using diffusion models.

Overview

LiDAR-based semantic segmentation is a key component for autonomous mobile robots, yet large-scale annotation of LiDAR point clouds is prohibitively expensive and time-consuming. Although simulators can provide labeled synthetic data, models trained on synthetic data often underperform on real-world data due to a data-level domain gap. To address this issue, we propose DRUM, a novel Sim2Real translation framework. We leverage a diffusion model pre-trained on unlabeled real-world data as a generative prior and translate synthetic data by reproducing two key measurement characteristics: reflectance intensity and raydrop noise. To improve sample fidelity, we introduce a raydrop-aware masked guidance mechanism that selectively enforces consistency with the input synthetic data while preserving realistic raydrop noise induced by the diffusion prior. Experimental results demonstrate that DRUM consistently improves Sim2Real performance across multiple representations of LiDAR data.

Goal: Sim2Real LiDAR Segmentation

LiDAR simulators provide perfect segmentation labels but lack realistic sensor characteristics such as reflectance intensity and raydrop noise, leading to a performance drop in real-world deployment. Our method DRUM bridges this sim2real gap by synthesizing pseudo-real training samples that combine simulator-provided annotations with learned realistic sensor characteristics.

Simulation Pseudo-real (ours) Real
Range sim-depth pseudoreal-depth real-depth
Reflectance
intensity
N/A pseudoreal-reflectance real-reflectance
Raydrop noise N/A pseudoreal-raydrop real-raydrop
Semantic label label label N/A

Method: DRUM

Posterior Sampling using Diffusion Models

We formulate Sim2Real LiDAR translation as a posterior sampling. First, we pre-train a LiDAR diffusion model[Nakashima+ 2024] on real-world data to capture its underlying distribution, which serves as a generative prior. We then condition the diffusion sampling process with the simulation sample via the proposed raydrop-aware masked guidance.

Posterior Sampling using Diffusion Models

Raydrop-aware Masked Guidance

Naïve conditioning suppresses realistic raydrop noise. In this work, we first generate the raydrop-aware mask \(m_t\) from the tentative Tweedie's estimate \(\hat{x}_t\) and then compute the sim-real discrepancy based on the pseudoinverse method [Song+ 2023]. The operator \(H\) corrupts the reflectance modality.

Raydrop-aware Masked Guidance

Results

Pseudo-real Samples

sim real
Simulation
Real

Semantic Segmentation

19-class semantic segmentation on real-world dataset [Behley+ 2019].

Citation

@inproceedings{miyawaki2026drum,
    title     = {{DRUM}: Diffusion-based Raydrop-aware Unapaired Mapping for Sim2Real {LiDAR} Segmentation},
    author    = {Tomoya Miyawaki and Kazuto Nakashima and Yumi Iwashita and Ryo Kurazume},
    booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
    pages     = {},
    year      = 2026
}

Acknowledgments

This work was supported by JSPS KAKENHI Grant Number JP23K16974 and JSPS KAKENHI Grant Number JP20H00230

LiDAR Data Synthesis with Denoising Diffusion Probabilistic Models
Kazuto Nakashima, Ryo Kurazume
ICRA 2024
https://arxiv.org/abs/2309.09256
Pseudoinverse-Guided Diffusion Models for Inverse Problems
Jiaming Song, Arash Vahdat, Morteza Mardani, Jan Kautz
ICLR 2020
https://arxiv.org/abs/2006.11239
Transfer learning from synthetic to real {LiDAR} point cloud for semantic segmentation
Aoran Xiao, Jiaxing Huang, Dayan Guan, Fangneng Zhan, Shijian Lu
AAAI 2022
https://arxiv.org/abs/2109.13410
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D
Yiyi Liao, Jun Xie, Andreas Geiger
TPAMI 2022
https://arxiv.org/abs/2109.13410
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
Jens Behley, Martin Garbade, Andres Milioto, Jan Quenzel, Sven Behnke, Cyrill Stachniss, Juergen Gall
ICCV 2019
https://arxiv.org/abs/2109.13410
Development of a Realistic LiDAR Simulator based on Deep Generative Models
Kazuto Nakashima
Grant-in-Aid for Early-Career Scientists,
The Japan Society for the Promotion of Science (JSPS)
https://kaken.nii.ac.jp/en/grant/KAKENHI-PROJECT-23K16974/
Development of garbage collecting robot for marine microplastics
Ryo Kurazume, Akihiro Kawamura, Qi An, Shoko Miyauchi, Kazuto Nakashima
Grant-in-Aid for Scientific Research (A),
The Japan Society for the Promotion of Science (JSPS)
https://kaken.nii.ac.jp/en/grant/KAKENHI-PROJECT-20H00230/