User | sp9103 |
---|---|
Publication | Not yet |
Implementation | - |
Training image modalities | RGB-D |
Test image modalities | RGB |
Description | Submitted to: BOP Challenge 2024 Training data: MegaPose-GSO and MegaPose-ShapeNetCore Onboarding data: 42 rendered templates Used 3D models: Default, CAD Notes: Our coarse estimator is based on local feature matching between the query image and multiple pre-rendered templates. We model the query and rendered images as aggregation of multiple patches. The coarse network finds the matchings between patch centers of input crop and rendered templates. From the 42 templates, the best template is selected and pose hypothesis is generated by RANSAC-PnP for RGB and MAGSAC++ [A] for RGB-D case. We use CroCo [B] pretraining for our coarse estimator. Note that the inputs to our neural networks are the rgb images only. The 2D detector used is specified in parentheses in the title, and it uses the FastSAM object proposals. [A] Barath et al.: MAGSAC++, a fast, reliable and accurate robust estimator, CVPR 2020 List of contributors: Sungphill Moon (sungphill.moon@naverlabs.com), Hyeontae Son (son.ht@naverlabs.com) If you have any questions, feel free to contact us. |
Computer specifications |
Date | Submission name | Dataset | ||
---|---|---|---|---|
2024-09-03 12:23 | coarse only | LM-O | ||
2024-09-03 12:24 | coarse only | T-LESS | ||
2024-09-03 12:24 | coarse only | TUD-L | ||
2024-09-03 12:25 | coarse only | IC-BIN | ||
2024-09-03 12:25 | coarse only | ITODD | ||
2024-09-03 12:26 | coarse only | HB | ||
2024-09-03 12:26 | coarse only | YCB-V | ||
2024-11-28 08:08 | BOP 2024 | LM-O | ||
2024-11-28 08:09 | BOP 2024 | T-LESS | ||
2024-11-28 08:09 | BOP 2024 | TUD-L | ||
2024-11-28 08:10 | BOP 2024 | IC-BIN | ||
2024-11-28 08:11 | BOP 2024 | ITODD | ||
2024-11-28 08:11 | BOP 2024 | HB | ||
2024-11-28 08:12 | BOP 2024 | YCB-V |