BOP: Benchmark for 6D Object Pose Estimation

Submission: TF6D (Default, CNOS) + Megapose (5 hypothesis)/LM-O/TF6D (CNOS) + Megapose (5 hypothesis)

Download submission

Submission name

TF6D (CNOS) + Megapose (5 hypothesis)

Submission time (UTC)

Nov. 29, 2024, 12:40 p.m.

User

tf6d

Task

Model-based 6D localization of unseen objects

Dataset

LM-O

Description

1) Submitted to: BOP Challenge 2024 2) Training-free method: Our method does not require any task-specific training 3) Pose refinement: Yes, the pose refinement network is from MegaPose with 5 hypotheses 4) Onboarding data: 40 rendered templates 5) Used 3D models: Default, CAD 6) Testing data: RGB 7) Segmentation method: Default, CNOS

Evaluation scores

AR:	0.560
AR_MSPD:	0.684
AR_MSSD:	0.551
AR_VSD:	0.445
average_time_per_image:	7.237

Method: TF6D (Default, CNOS) + Megapose (5 hypothesis)

User	tf6d
Publication
Implementation
Training image modalities	None
Test image modalities	RGB
Description	Submitted to: BOP Challenge 2024 Training data: None, our method does not require any task-specific training Onboarding data: Model-based rendering, 40 rendered templates Used 3D models: Default, CAD Notes: 1) Testing data: RGB 2) Segmentation method: Default, CNOS 3) Our method does not introduce any additional or task-specific training 4) Pose refinement: Yes, the pose refinement network is from MegaPose with 5 hypotheses 5) Key idea: Our method leverages 3D-consistent features derived from 3D foundation models for training-free 6D pose estimation, rather than relying on 2D foundation models (e.g., DINO).
Computer specifications	NVIDIA A100 80GB PCIe + Intel(R) Xeon(R) Gold 6342 CPU @ 2.80GHz