BOP: Benchmark for 6D Object Pose Estimation

Submission: RCVPose 3D_SingleModel_VIVO_PBR/T-LESS

Download submission

Submission name

Submission time (UTC)

Oct. 4, 2022, 3:12 p.m.

User

aaronwool

Task

Model-based 6D localization of seen objects

Dataset

T-LESS

Training model type

CAD

Training image type

Synthetic + real

Description

Evaluation scores

AR:	0.708
AR_MSPD:	0.711
AR_MSSD:	0.710
AR_VSD:	0.704
average_time_per_image:	0.979

Method: RCVPose 3D_SingleModel_VIVO_PBR

User	aaronwool
Publication	Yangzheng Wu, Alireza Javaheri, Mohsen Zand and Michael Greenspan: Keypoint Cascade Voting for Point Cloud Based 6DoF Pose Estimation, 3DV 2022.
Implementation	https://github.com/aaronWool/rcvpose3d.git
Training image modalities	RGB-D
Test image modalities	RGB-D
Description	A single model is trained for both semantic segmentation and pose estimation per dataset. Only provided PBR images are used for training. One model estimates all poses for all objects(MIMO) inside the scene of one dataset. The hyperparameters are consistent among all core datasets with a batch size of 8, an initial lr=1e-4, and an SGD optimizer. The implementation is mostly the same as described in the paper except the networks are extended to MIMO, i.e. estimating poses for all objects in the scene simultaneously in a single model. Three keypoints are used for each of the objects.
Computer specifications	Validdation - CPU: Intel i7-11700F, GPU: RTX3090; Training - CPU: Intel(R) Xeon(R) Gold 5218, GPU: 8*RTX6000