Submission: TF6D (Default, CNOS) + Megapose (5 hypothesis)/LM-O/TF6D (CNOS) + Megapose (5 hypothesis)

Download submission
Submission name TF6D (CNOS) + Megapose (5 hypothesis)
Submission time (UTC) Nov. 29, 2024, 12:40 p.m.
User tf6d
Task Model-based 6D localization of unseen objects
Dataset LM-O
Description 1) Submitted to: BOP Challenge 2024 2) Training-free method: Our method does not require any task-specific training 3) Pose refinement: Yes, the pose refinement network is from MegaPose with 5 hypotheses 4) Onboarding data: 40 rendered templates 5) Used 3D models: Default, CAD 6) Testing data: RGB 7) Segmentation method: Default, CNOS
Evaluation scores
AR:0.560
AR_MSPD:0.684
AR_MSSD:0.551
AR_VSD:0.445
average_time_per_image:7.237

Method: TF6D (Default, CNOS) + Megapose (5 hypothesis)

User tf6d
Publication
Implementation
Training image modalities None
Test image modalities RGB
Description

Submitted to: BOP Challenge 2024

Training data: None, our method does not require any task-specific training

Onboarding data: Model-based rendering, 40 rendered templates

Used 3D models: Default, CAD

Notes:

1) Testing data: RGB

2) Segmentation method: Default, CNOS

3) Our method does not introduce any additional or task-specific training

4) Pose refinement: Yes, the pose refinement network is from MegaPose with 5 hypotheses

5) Key idea: Our method leverages 3D-consistent features derived from 3D foundation models for training-free 6D pose estimation, rather than relying on 2D foundation models (e.g., DINO).

Computer specifications NVIDIA A100 80GB PCIe + Intel(R) Xeon(R) Gold 6342 CPU @ 2.80GHz