Submission: Pix2Pose-BOP20-ICCV19/TUD-L

Download submission
Submission name
Submission time (UTC) Aug. 18, 2020, 7:36 a.m.
User kirumang
Task 6D localization of seen objects
Dataset TUD-L
Training model type Default
Training image type Real
Description
Evaluation scores
AR:0.420
AR_MSPD:0.641
AR_MSSD:0.364
AR_VSD:0.255
average_time_per_image:0.419

Method: Pix2Pose-BOP20-ICCV19

User kirumang
Publication Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation, ICCV 2019
Implementation https://github.com/kirumang/Pix2Pose
Training image modalities RGB
Test image modalities RGB
Description

Poses are estimated using RGB images only without refinement. Results are derived after the following modifications from the original implementation of the paper. Other setups are the same as performed in BOP 2019.

1) Replaced the encoder part with the first three blocks of Resnet-50 with pre-trained weights using ImageNet.

2) Increased a threshold for inlier pixels during PnP-Ransac operation (3->5).

3) A minor bug that causes bad detection results for the T-Less dataset is fixed (different image resolutions were used during training and inference)

4) Increased the number of RPN proposals and NMS thresholds in Mask-RCNN (1000/0.7 to 2000/0.9), which produces more detection proposals

All updates will be shared in our public repository (checkout bop2020 branch after the deadline)

Computer specifications CPU: i7-9700K, GPU: Titan V, RAM: 32GB