Method: MUSE_dinov3

User csm8167
Publication Not Yet
Implementation TBD
Views Single
Test image modalities RGB
Description

Submitted to: BOP Challenge 2025

the version of muse dinov3 MUSE : Model-based Uncertainty-aware Similarity Estimation In this work, we present MUSE (Model-based Uncertainty-aware Similarity Estimation), a training-free framework for model-based zero-shot 2D object detection and segmentation. First, MUSE incorporates 2D multi-view templates from 3D unseen objects and 2D object proposals from the input query image, respectively. In the embedding stage, we propose a new feature embedding scheme which integrates class and patch embeddings. Specifically, the patch embeddings are normalized using the generalized mean pooling (GeM). In the matching stage, a joint similarity score is introduced, which integrates an absolute score and a relative score. Finally, we update the similarity score using an uncertainty-aware object prior. MUSE achieves state-of-the-art performance on the BOP Challenge 2025, ranking first in the Classic Core, H3, and Industrial tracks—without any additional training or fine-tuning. Therefore, we believe that MUSE is a promising framework for zero-shot 2D object detection and segmentation.

Computer specifications RTX 4090

Public submissions

No submissions yet.