Submission: OfficialDet-PFA-Mixpbr-RGB-D/YCB-V

Download submission
Submission name
Submission time (UTC) Sept. 25, 2023, 3:59 p.m.
User CVIA_Lab
Task 6D localization of seen objects
Dataset YCB-V
Description
Evaluation scores
AR:0.899
AR_MSPD:0.894
AR_MSSD:0.932
AR_VSD:0.872
average_time_per_image:0.945

Method: OfficialDet-PFA-Mixpbr-RGB-D

User CVIA_Lab
Publication Yinlin Hu et, at: Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation, ECCV, 2022
Implementation
Training image modalities RGB
Test image modalities RGB-D
Description

Training data: Official PBR + Real (if possible).

Used 3D models: Official ones.

Object detection: Official detection results

Pose estimation: WDR

Pose refinement: PFA

Setting: Single model per dataset

We use the standard initialization-and-refinement framework, and train a single model for all objects in the same dataset. We use the official detection results as the preprocessing, WDR-Pose as the pose initialization, and PFA as our refinement.

All the mentioned models are trained purely on RGB data, and we only use the depth image during the inference to get the final pose in RGBD track by RANSAC-Kabsch.

Most of the framework is the same as the version we submitted last year .

The main differences include:

using the official detection;optimized batch size, training time, and other hyper-parameters;better Ransac-Kabsch

List of contributors:

Xidian University: Xinyao Fan, Fengda Hao, Yang Hai, Jiaojiao Li, Rui Song

EPFL: Haixin Shi, Mathieu Salzmann

Magic Leap: David Ferstl, Yinlin Hu

Computer specifications NVIDIA-3090,Intel xeon 4310