Semi-direct RGB-D SLAM Algorithm for Dynamic Indoor Environments
GAO Chengqiang1, ZHANG Yunzhou1,2, WANG Xiaozhe1, DENG Yi1, JIANG Hao1
1. College of Information Science and Engineering, Northeastern University, Shenyang 110819, China;
2. Faculty of Robot Science and Engineering, Northeastern University, Shenyang 110819, China
Abstract:To solve the accurate positioning problem of mobile robots in dynamic indoor environments, a semi-direct RGBD visual SLAM (simultaneous localization and mapping) algorithm integrating a motion detection algorithm, is proposed. It consists of three parts, including motion detection, camera pose estimation and dense map building based on TSDF (truncated signed distance function) model. Firstly, the primary camera pose can be estimated with the sparse image alignment algorithm by minimizing the photometric residual. Then, the pose estimation of visual odometry is used to compensate for image movement. A Gaussian model is built based on real-time update of the image patch. The moving object in the image can be segmented according to the variance change of each patch. By eliminating the local map points projected on the motion region and minimizing the reprojection error, the camera pose can be optimized again to improve its estimation accuracy. Finally, the dense TSDF map can be built with the camera pose and RGB-D image. Furthermore, the map can be updated in real time in dynamic environments by using the motion detection result and and the color change of voxel patches in the map. Experimental results show that the proposed algorithm can effectively improve the precision of camera pose estimation and implement the real-time update of the dense map in dynamic indoor environments. Therefore, it can improve the system robustness and the scene reconstruction accuracy.
[1] Kerl C, Sturm J, Cremers D. Robust odometry estimation for RGB-D cameras[C]//IEEE International Conference on Robotics and Automation. Piscataway, USA:IEEE, 2013:3748-3754.
[2] Mur-Artal R, Tardós J D. ORB-SLAM2:An open-source slam system for monocular, stereo, and RGB-D cameras[J]. IEEE Transactions on Robotics, 2017, 33(5):1255-1262.
[3] Newcombe R A, Izadi S, Hilliges O, et al. KinectFusion:Realtime dense surface mapping and tracking[C]//10th IEEE/ACM International Symposium on Mixed and Augmented Reality. Piscataway, USA:IEEE, 2011:127-136.
[4] 付梦印,吕宪伟,刘彤,等.基于RGB-D数据的实时SLAM算法[J].机器人, 2015, 37(6):683-692. Fu M Y, Lü X W, Liu T, et al. Real-time SLAM algorithm based on RGB-D data[J]. Robot, 2015, 37(6):683-692.
[5] Nistér D, Naroditsky O, Bergen J. Visual odometry[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway, USA:IEEE, 2004:652-659.
[6] Tan W, Liu H M, Dong Z L, et al. Robust monocular SLAM in dynamic environments[C]//12th IEEE/ACM International Symposium on Mixed and Augmented Reality. Piscataway, USA:IEEE, 2013:209-218.
[7] Hahnel D, Triebel R, Burgard W, et al. Map building with mobile robots in dynamic environments[C]//IEEE International Conference on Robotics and Automation. Piscataway, USA:IEEE, 2003:1557-1563.
[8] Bibby C, Reid I. Simultaneous localisation and mapping in dynamic environments (SLAMIDE) with reversible data association[C]//Robotics:Science and Systems Ⅲ. 2007. DOI:10.15607/RSS.2007.Ⅲ.014.
[9] Wang Y B, Huang S D. Motion segmentation based robust RGB-D SLAM[C]//11th World Congress on Intelligent Control and Automation. Piscataway, USA:IEEE, 2014:3122-3127.
[10] Sun Y X, Liu M, Meng M Q H. Improving RGB-D SLAM in dynamic environments:A motion removal approach[J]. Robotics and Autonomous Systems, 2017, 89:110-122.
[11] Forster C, Pizzoli M, Scaramuzza D. SVO:Fast semi-direct monocular visual odometry[C]//IEEE International Conference on Robotics and Automation. Piscataway, USA:IEEE, 2014:15-22.
[12] Engel J, Schöps T, Cremers D. LSD-SLAM:Large-scale direct monocular SLAM[C]//13th European Conference on Computer Vision. Cham, Switzerland:Springer, 2014:834-849.
[13] 魏彤,金砺耀.基于双目ORB-SLAM的障碍物记忆定位与去噪算法[J].机器人, 2018, 40(3):266-272. Wei T, Jin L Y. Obstacle memory localization and denoising algorithm based on binocular ORB-SLAM[J]. Robot, 2018, 40(3):266-272.
[14] Newcombe R A, Fox D, Seitz S M. Dynamicfusion:Reconstruction and tracking of non-rigid scenes in real-time[C]//IEEE Conference on Computer Vision and Pattern Recognition. Piscataway, USA:IEEE, 2015:343-352.
[15] Fehr M, Furrer F, Dryanovski I, et al. TSDF-based change detection for consistent long-term dense reconstruction and dynamic object discovery[C]//IEEE International Conference on Robotics and Automation. Piscataway, USA:IEEE, 2017:5237-5244.
[16] Sturm J, Engelhard N, Endres F, et al. A benchmark for the evaluation of RGB-D SLAM systems[C]//IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway, USA:IEEE, 2012:573-580.
[17] Rublee E, Rabaud V, Konolige K, et al. ORB:An efficient alternative to SIFT or SURF[C]//IEEE International Conference on Computer Vision. Piscataway, USA:IEEE, 2011:2564-2571.
[18] Baker S, Matthews I. Lucas-Kanade 20 years on:A unifying framework[J]. International Journal of Computer Vision, 2004, 56(3):221-255.
[19] van Droogenbroeck M, Barnich O. ViBe:A disruptive method for background subtraction[M]//Background Modeling and Foreground Detection for Video Surveillance. Boca Raton, USA:CRC, 2014. DOI:10.1201/b17223-10.
[20] Yi K M, Yun K, Kim S W, et al. Detection of moving objects with non-stationary cameras in 5.8 ms:Bringing motion detection to your mobile device[C]//IEEE Conference on Computer Vision and Pattern Recognition Workshops. Piscataway, USA:IEEE, 2013:27-34.
[21] Bouguet J Y. Pyramidal implementation of the affine Lucas Kanade feature tracker description of the algorithm[EB/OL].[2018-05-01]. https://pdfs.semanticscholar.org/aa97/2b40c0f8e20b07e02d1fd320bc7ebadfdfc7.pdf.
[22] Rosten E, Porter R, Drummond T. Faster and better:A machine learning approach to corner detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(1):105-119.
[23] Klingensmith M, Dryanovski I, Srinivasa S S, et al. CHISEL:Real time large scale 3D reconstruction onboard a mobile device using spatially hashed signed distance fields[C/OL]//Robotics:Science and Systems. 2015. http://www.roboticsproceedings.org/rss11/p40.pdf.
[24] Li S, Lee D. RGB-D SLAM in dynamic environments using static point weighting[J]. IEEE Robotics and Automation Letters, 2017, 2(4):2263-2270.
[25] Kim D H, Kim J H. Effective background model-based RGB-D dense visual odometry in a dynamic environment[J]. IEEE Transactions on Robotics, 2016, 32(6):1565-1573.