FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting

Under submission

Anonymous authors

Abstract

We present FlashSLAM, a novel SLAM approach that leverages 3D Gaussian Splatting for efficient and robust 3D scene reconstruction. Existing 3DGS-based SLAM methods often fall short in sparse view settings and during large camera movements due to their reliance on gradient descent-based optimization, which is both slow and inaccurate. FlashSLAM addresses these limitations by combining 3DGS with a fast vision-based camera tracking technique, utilizing a pretrained feature matching model and point cloud registration for precise pose estimation in under 80 ms - a 90% reduction in tracking time compared to SplaTAM - without costly iterative rendering. In sparse settings, our method achieves up to a 92% improvement in average tracking accuracy over previous methods. Additionally, it accounts for noise in depth sensors, enhancing robustness when using unspecialized devices such as smartphones. Extensive experiments show that FlashSLAM performs reliably across both sparse and dense settings, in synthetic and real-world environments. Evaluations on benchmark datasets highlight its superior accuracy and efficiency, establishing FlashSLAM as a versatile and high-performance solution for SLAM, advancing the state-of-the-art in 3D reconstruction across diverse applications.

Architecture overview

RELICA

Room 0 (r0)
Room 1 (r1)
Room 2 (r2)
Office 0 (o0)
Office 1 (o1)
Office 2 (o2)
Office 3 (o3)
Office 4 (o4)

TUM RGB-D

Freiburg1 desk (fr1/ desk)
Freiburg1 desk2 (fr1/ desk2)
Freiburg1 room (fr1/ room)
Freiburg2 xyz (fr2/ xyz)
Freiburg3 long office (fr3/ office)

ScanNet++

8b5caf3398
b20a261fdf

Self-captured dataset