TY - JOUR
T1 - Motion segmentation of RGB-D sequences
T2 - Combining semantic and motion information using statistical inference
AU - Muthu, Sundaram
AU - Tennakoon, Ruwan
AU - Rathnayake, Tharindu
AU - Hoseinnezhad, Reza
AU - Suter, David
AU - Bab-Hadiashar, Alireza
N1 - Funding Information:
Manuscript received September 2, 2019; revised February 3, 2020; accepted March 17, 2020. Date of publication April 7, 2020; date of current version April 20, 2020. The work was supported by the Australian Research Council through an ARC Linkage Project under Grant LP160100662. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Shuicheng Yan. (Corresponding author: Sundaram Muthu.) Sundaram Muthu, Tharindu Rathnayake, Reza Hoseinnezhad, and Alireza Bab-Hadiashar are with the School of Engineering, RMIT University, Melbourne, VIC 3000, Australia (e-mail: [email protected]).
Publisher Copyright:
© 2020 IEEE.
PY - 2020/4/7
Y1 - 2020/4/7
N2 - This paper presents an innovative method for motion segmentation in RGB-D dynamic videos with multiple moving objects. The focus is on finding static, small or slow moving objects (often overlooked by other methods) that their inclusion can improve the motion segmentation results. In our approach, semantic object based segmentation and motion cues are combined to estimate the number of moving objects, their motion parameters and perform segmentation. Selective object-based sampling and correspondence matching are used to estimate object specific motion parameters. The main issue with such an approach is the over segmentation of moving parts due to the fact that different objects can have the same motion (e.g. background objects). To resolve this issue, we propose to identify objects with similar motions by characterizing each motion by a distribution of a simple metric and using a statistical inference theory to assess their similarities. To demonstrate the significance of the proposed statistical inference, we present an ablation study, with and without static objects inclusion, on SLAM accuracy using the TUM-RGBD dataset. To test the effectiveness of the proposed method for finding small or slow moving objects, we applied the method to RGB-D MultiBody and SBM-RGBD motion segmentation datasets. The results showed that we can improve the accuracy of motion segmentation for small objects while remaining competitive on overall measures.
AB - This paper presents an innovative method for motion segmentation in RGB-D dynamic videos with multiple moving objects. The focus is on finding static, small or slow moving objects (often overlooked by other methods) that their inclusion can improve the motion segmentation results. In our approach, semantic object based segmentation and motion cues are combined to estimate the number of moving objects, their motion parameters and perform segmentation. Selective object-based sampling and correspondence matching are used to estimate object specific motion parameters. The main issue with such an approach is the over segmentation of moving parts due to the fact that different objects can have the same motion (e.g. background objects). To resolve this issue, we propose to identify objects with similar motions by characterizing each motion by a distribution of a simple metric and using a statistical inference theory to assess their similarities. To demonstrate the significance of the proposed statistical inference, we present an ablation study, with and without static objects inclusion, on SLAM accuracy using the TUM-RGBD dataset. To test the effectiveness of the proposed method for finding small or slow moving objects, we applied the method to RGB-D MultiBody and SBM-RGBD motion segmentation datasets. The results showed that we can improve the accuracy of motion segmentation for small objects while remaining competitive on overall measures.
KW - dynamic SLAM
KW - EVT
KW - Kolmogorov-Smirnov test
KW - multibody structure and motion
KW - RGB-D motion segmentation
UR - http://www.scopus.com/inward/record.url?scp=85084150364&partnerID=8YFLogxK
U2 - 10.1109/TIP.2020.2984893
DO - 10.1109/TIP.2020.2984893
M3 - Article
C2 - 32275594
AN - SCOPUS:85084150364
SN - 1057-7149
VL - 29
SP - 5557
EP - 5570
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
ER -