Method: GenFlow-coarse

User sp9103
Publication https://arxiv.org/abs/2403.11510
Implementation -
Training image modalities RGB
Test image modalities RGB
Description

Submitted to: BOP Challenge 2023

Training data: MegaPose-GSO and MegaPose-ShapeNetCore

Onboarding data: No

Used 3D models: Default, CAD

Notes:

In this submission, CNOS_fastSAM [A] detections are used as the input to our pose estimation method. Our pose estimation method uses the coarse-to-fine strategy following the MegaPose [B] structure. A single model is used for all datasets.

Our coarse network is based on the MegaPose [B] coarse network. The main differences from the original MegaPose paper are as follows:

  • To bypass the computation of rendering tons of images, we render fewer images and run the coarse network. Then we create a GMM with the top-k hypotheses and run the coarse network to the sampled hypotheses from the GMM.

[A] Nguyen et al.: CNOS: A Strong Baseline for CAD-based Novel Object Segmentation, arXiv 2023
[B] Labbé et al.: MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare, CoRL 2022

List of contributors: Sungphill Moon (sungphill.moon@naverlabs.com), Hyeontae Son (son.ht@naverlabs.com)

If you have any questions, feel free to contact us.

Computer specifications GPU V100; CPU Intel Xeon Gold6248@2.5G

Public submissions

Date Submission name Dataset
2023-09-18 07:27 - TUD-L
2023-09-18 07:28 - HB
2023-09-18 07:29 - IC-BIN
2023-09-18 07:30 - YCB-V
2023-09-18 07:31 - T-LESS
2023-09-18 07:41 - LM-O
2023-09-18 08:28 - ITODD
2023-09-26 07:51 - LM-O
2023-09-26 07:52 - T-LESS
2023-09-26 07:52 - TUD-L
2023-09-26 07:53 - IC-BIN
2023-09-26 07:53 - ITODD
2023-09-26 07:54 - HB
2023-09-26 07:54 - YCB-V