BOP: Benchmark for 6D Object Pose Estimation

Submission: ZebraPoseSAT-EffnetB4 (PBR_Only)/T-LESS

Download submission

Submission name

Submission time (UTC)

Oct. 16, 2022, 8:03 p.m.

User

zebrapose

Task

Model-based 2D segmentation of seen objects

Dataset

T-LESS

Training model type

CAD

Training image type

Synthetic (only PBR images provided for BOP Challenge 2020 were used)

Description

Evaluation scores

AP:	0.629
AP50:	0.829
AP75:	0.763
AP_large:	0.667
AP_medium:	0.600
AP_small:	0.066
AR1:	0.602
AR10:	0.718
AR100:	0.720
AR_large:	0.800
AR_medium:	0.632
AR_small:	0.091
average_time_per_image:	0.080

Method: ZebraPoseSAT-EffnetB4 (PBR_Only)

User	zebrapose
Publication	ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation, CVPR2022
Implementation	https://github.com/suyz526/ZebraPose
Training image modalities	RGB
Test image modalities	RGB
Description	Based on the paper "ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation", CVPR 2022. Training images: PBR image Setting: One network per object was trained 2D Detector: detection results provided by CDPNv2. The Detector also trained only with PBR images Modifications to the original ZebraPose paper: Added Symmetry-Aware Training (SAT). The network and loss functions are not changed. There will be a new ground truth for the sym. objects, details can be found in the Github Repository. Special thanks to Yongliang Lin for his contribution. We replace the Resnet34 backbone with EffnetB4 in ZebraPose. (Only replace the backbone in the pose estimation part) About the submission to segmentation challenge 2022: For every 2D bounding box provided by a 2D detector, we use ZebraPose network to infer the object visible mask and binary codes (as we did for object pose estimation). And we save 1) the confidence score from the 2D detector 2) as well as the predicted visible object mask into the json file for the segmentation evaluation. The reported inference time included 2D detection time. List of contributors: German Research Center for Artificial Intelligence (DFKI), Augmented Vision department: Yongzhi Su, Praveen Nathan, Torben Fetzer, Jason Rambach, Didier Stricker Technical University Munich (TUM), CAMPAR: Mahdi Saleh, Yan Di, Nassir Navab, Benjamin Busam, Federico Tombari Zhejiang University (ZJU): Yongliang Lin, Yu Zhang
Computer specifications	Intel(R) Xeon(R) E-2146G CPU @ 3.50GHz, Nvidia RTX2080Ti