BOP: Benchmark for 6D Object Pose Estimation

Submission: PoZe-CNOS/HB

Download submission

Submission name

Submission time (UTC)

Sept. 26, 2023, 6:44 p.m.

User

acaraffa@fbk.eu

Task

Model-based 6D localization of unseen objects

Dataset

HB

Description

Evaluation scores

AR:	0.712
AR_MSPD:	0.710
AR_MSSD:	0.709
AR_VSD:	0.718
average_time_per_image:	159.671

Method: PoZe-CNOS

User	acaraffa@fbk.eu
Publication
Implementation
Training image modalities	None
Test image modalities	RGB-D
Description	Submitted to: BOP Challenge 2023 Training data: 3D point clouds of indoor scenes from the 3DMatch dataset [A] Onboarding data: No need for onboarding Used 3D models: Default point clouds provided in: "models_reconst" for T-LESS, "models_eval" for ITODD, "models" for the other datasets. Notes: PoZe performs pose estimation of unseen objects through zero-shot learning. PoZe takes as input a colored 3D point cloud that represents the object and an RGB-D image capturing the scene. PoZe consists of five modules: 2D object segmentation: We segment the region that the object occupies in the RGB image. We use the segmentation masks predicted by CNOS [B]. 3D lifting: We crop the input image around the segmentation mask and back-project the cropped scene in the 3D space by using the camera intrinsic parameters. Feature extraction: We extract point-wise features from the point clouds of the object and the cropped scene. We use a frozen GeDi [C] model trained on 3DMatch for the point cloud registration task. We extract features at three different scales. Pose estimation: We estimate the 6D pose of the object with respect to the cropped scene using feature-matching RANSAC on the features extracted at each scale. Thereafter, we select the best estimated pose according to the number of RANSAC inliers. Pose refinement: We refine the pose estimated by the previous module by using the ICP algorithm. [A] Zeng et al.: 3DMatch: Learning the matching of local 3D geometry in range scans, CVPR 2017 [B] Nguyen et al.: CNOS: A Strong Baseline for CAD-based Novel Object Segmentation, arXiv 2023 [C] Poiesi et al.: Learning general and distinctive 3D local deep descriptors for point cloud registration, IEEE PAMI 2023
Computer specifications	A40