BOP: Benchmark for 6D Object Pose Estimation

Submission: GMatch-Research/HOPE/GMatch-SPP

Download submission

Submission name

GMatch-SPP

Submission time (UTC)

May 3, 2025, 11:22 a.m.

User

yang2019901

Task

Model-based 6D localization of seen objects

Dataset

HOPE

Description

NOTE: masks are manually classified with their object id. it's ONLY for performance test. Pipeline: SuperPoint -> GMatch -> RANSAC -> ICP. bbox is used instead of segmentation.

Evaluation scores

AR:	0.373
AR_MSPD:	0.340
AR_MSSD:	0.283
AR_VSD:	0.497
average_time_per_image:	70.738

Method: GMatch-Research

User	yang2019901
Publication
Implementation
Training image modalities	None
Test image modalities	RGB-D
Description	Researchs on the pose estimation pipeline based on feature matching, which is compose of a keypoint descriptor, a feature matcher, a geometric solver and a pose refiner. GMatch is the name of our proposed matcher. Possible descriptor: SIFT [1], ORB [2], SuperPoint [3]. Possible matcher: GMatch, NN (Nearest Neighbour with Lowe's Ratio Test [1]), LightGlue [4]. Solver: RANSAC from open3d. Refiner: ICP from open3d. [1] David G Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60:91–110, 2004. [2] Ethan Rublee, Vincent Rabaud, Kurt Konolige, and Gary Bradski. Orb: An efficient alternative to sift or surf. In 2011 International conference on computer vision, pages 2564–2571. Ieee, 2011. [3] Daniel DeTone, Tomasz Malisiewicz, and Andrew Rabinovich. Superpoint: Self-supervised interest point detection and description. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 224–236, 2018. [4] Philipp Lindenberger, Paul-Edouard Sarlin, and Marc Pollefeys. Lightglue: Local feature matching at light speed. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 17627–17638, 2023.
Computer specifications	i5-12400F (+ RTX4060, for lightglue and superpoint only) + 8G RAM, Ubuntu20.04 (WSL)