Submission: RDPN/YCB-V/cir

Submission name cir
Submission time (UTC) Jan. 29, 2024, 5:21 p.m.
User r10922190
Task 6D localization of seen objects
Dataset YCB-V
Description
Evaluation scores
AR: 0.883
AR_MSPD: 0.878
AR_MSSD: 0.921
AR_VSD: 0.850
average_time_per_image: 2.189
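
As a quick sanity check, the overall AR above matches the mean of the three per-metric recalls (AR_VSD, AR_MSSD, AR_MSPD), which is how the BOP benchmark defines it. The snippet below only recomputes that average from the listed scores; the variable names are illustrative.

```python
# Per-metric average recalls copied from the evaluation scores above.
scores = {"AR_VSD": 0.850, "AR_MSSD": 0.921, "AR_MSPD": 0.878}

# Overall AR is the unweighted mean of the three recalls.
ar = sum(scores.values()) / len(scores)

print(round(ar, 3))
```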

Method: RDPN

User r10922190
Publication None
Implementation None
Training image modalities RGB-D
Test image modalities RGB-D
Description

In this work, we present a novel method for estimating the 6DoF pose of an object from a single RGB-D image. Unlike existing methods that either predict the object’s pose directly or rely on sparse keypoints for pose recovery, our approach uses dense correspondences, i.e., it regresses the object coordinates for each visible pixel. It builds on readily available object detection methods. A re-projection mechanism adjusts the camera intrinsic matrix to account for cropping in RGB-D images. Moreover, we transform the 3D object coordinates into a residual representation, which reduces the output space and yields better performance. Extensive experiments validate the effectiveness of our approach for 6D pose estimation: it outperforms most previous methods, especially under occlusion, and shows notable improvements over state-of-the-art methods.
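
The description does not spell out the re-projection mechanism, so the following is only a minimal sketch of the standard way to adjust a pinhole intrinsic matrix when a detection box is cropped and resized. It is not the authors' implementation. The function names `crop_intrinsics` and `project`, the zero-skew assumption, and the numeric intrinsics are all assumptions made for illustration.

```python
def crop_intrinsics(K, box, out_size):
    """Intrinsics of a crop `box` after resizing it to `out_size`.

    K        : 3x3 nested list [[fx, 0, cx], [0, fy, cy], [0, 0, 1]] (zero skew assumed)
    box      : (x0, y0, w, h) of the detection in the full image, in pixels
    out_size : (out_w, out_h) of the resized crop
    """
    x0, y0, w, h = box
    out_w, out_h = out_size
    sx, sy = out_w / w, out_h / h  # resize factors along x and y
    fx, fy = K[0][0], K[1][1]
    cx, cy = K[0][2], K[1][2]
    # Cropping shifts the principal point; resizing scales focal length and principal point.
    return [
        [fx * sx, 0.0, (cx - x0) * sx],
        [0.0, fy * sy, (cy - y0) * sy],
        [0.0, 0.0, 1.0],
    ]


def project(K, point):
    """Project a 3D camera-frame point (X, Y, Z) to pixel coordinates with intrinsics K."""
    X, Y, Z = point
    return (K[0][0] * X / Z + K[0][2], K[1][1] * Y / Z + K[1][2])


# Illustrative values only (not the YCB-V camera parameters).
K_full = [[600.0, 0.0, 320.0], [0.0, 600.0, 240.0], [0.0, 0.0, 1.0]]
box = (200, 100, 128, 128)
K_crop = crop_intrinsics(K_full, box, (256, 256))

# Projecting with K_crop gives the same pixel as projecting with K_full
# and then mapping that pixel into the resized crop.
p = (0.1, 0.05, 1.0)
u, v = project(K_full, p)
u_crop, v_crop = project(K_crop, p)
```

With the adjusted matrix, per-pixel predictions made on the crop can be related to 3D camera coordinates without mapping them back to the full image first. This sketch ignores half-pixel center conventions.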

Computer specifications RTX3090