BOP: Benchmark for 6D Object Pose Estimation

Submission: Pix2Pose-BOP20-ICCV19/YCB-V

Submission name

Submission time (UTC)

Aug. 18, 2020, 7:35 a.m.

User

kirumang

Task

Model-based 6D localization of seen objects

Dataset

YCB-V

Training model type

Default

Training image type

Synthetic + real

Description

Evaluation scores

AR:	0.457
AR_MSPD:	0.571
AR_MSSD:	0.429
AR_VSD:	0.372
average_time_per_image (s):	1.025
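
The overall AR score in the BOP Challenge is the mean of the three per-metric average recalls (AR_VSD, AR_MSSD, AR_MSPD). The short check below is only a sketch; the function name `average_recall` is chosen here for illustration and is not part of the BOP toolkit. It reproduces the reported AR from the three values listed above.

```python
def average_recall(ar_vsd, ar_mssd, ar_mspd):
    """Mean of the three per-metric average recalls (illustrative helper)."""
    return (ar_vsd + ar_mssd + ar_mspd) / 3.0


# Scores copied from this submission's evaluation table.
scores = {"AR_VSD": 0.372, "AR_MSSD": 0.429, "AR_MSPD": 0.571}

ar = average_recall(scores["AR_VSD"], scores["AR_MSSD"], scores["AR_MSPD"])
print(f"AR = {ar:.3f}")  # AR = 0.457
```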

Method: Pix2Pose-BOP20-ICCV19

User	kirumang
Publication	Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation, ICCV 2019
Implementation	https://github.com/kirumang/Pix2Pose
Training image modalities	RGB
Test image modalities	RGB
Description	Poses are estimated from RGB images only, without refinement. The results were obtained with the following modifications to the original implementation from the paper; all other settings are the same as in BOP 2019.
1) Replaced the encoder with the first three blocks of ResNet-50, initialized with weights pre-trained on ImageNet.
2) Increased the inlier threshold for pixels in the PnP-RANSAC step (3 -> 5).
3) Fixed a minor bug that caused bad detection results on the T-LESS dataset (different image resolutions were used during training and inference).
4) Increased the number of RPN proposals and the NMS threshold in Mask R-CNN (from 1000/0.7 to 2000/0.9), which produces more detection proposals.
All updates will be shared in our public repository (check out the bop2020 branch after the deadline).
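
To illustrate the effect of modification 2, the sketch below counts inliers for a single pose hypothesis under the two thresholds. It is not taken from the Pix2Pose code: it assumes the threshold is a reprojection error in pixels, that the 3D points are already expressed in the camera frame, and it uses made-up camera intrinsics and toy correspondences. The helper names `project` and `count_inliers` are hypothetical.

```python
import math


def project(point_3d, fx, fy, cx, cy):
    """Pinhole projection of a camera-frame 3D point to pixel coordinates."""
    x, y, z = point_3d
    return (fx * x / z + cx, fy * y / z + cy)


def count_inliers(points_3d, pixels, fx, fy, cx, cy, threshold_px):
    """Count correspondences whose reprojection error is within the threshold."""
    inliers = 0
    for p3d, (u, v) in zip(points_3d, pixels):
        pu, pv = project(p3d, fx, fy, cx, cy)
        if math.hypot(pu - u, pv - v) <= threshold_px:
            inliers += 1
    return inliers


# Toy intrinsics and correspondences (assumed values, for illustration only).
fx = fy = 500.0
cx, cy = 320.0, 240.0
points_3d = [(0.0, 0.0, 2.0), (0.5, 0.0, 2.0), (0.0, 0.5, 2.0), (0.5, 0.5, 2.0)]
# Observed pixels are offset from the exact projections by 0, 2, 4 and 6 px.
pixels = [(320.0, 240.0), (447.0, 240.0), (320.0, 369.0), (451.0, 365.0)]

strict = count_inliers(points_3d, pixels, fx, fy, cx, cy, threshold_px=3.0)
relaxed = count_inliers(points_3d, pixels, fx, fy, cx, cy, threshold_px=5.0)
print(strict, relaxed)  # 2 3
```

With the looser threshold, the correspondence that is off by 4 px also counts as an inlier, so more predicted pixels can support a given pose hypothesis. In a real pipeline this threshold would typically be passed to a PnP-RANSAC routine, for example the `reprojectionError` argument of OpenCV's `solvePnPRansac`.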
Computer specifications	CPU: i7-9700K, GPU: Titan V, RAM: 32GB