Submission: lcc-fastsam/TUD-L/3rd stage

Download submission
Submission name 3rd stage
Submission time (UTC) Sept. 27, 2023, 4:49 p.m.
User felix.stillger
Task Model-based 2D segmentation of unseen objects
Dataset TUD-L
Description
Evaluation scores
AP:0.153
AP50:0.288
AP75:0.146
AP_large:0.142
AP_medium:0.161
AP_small:0.012
AR1:0.249
AR10:0.291
AR100:0.291
AR_large:0.395
AR_medium:0.276
AR_small:0.175
average_time_per_image:0.356

Method: lcc-fastsam

User felix.stillger
Publication
Implementation
Training image modalities RGB
Test image modalities RGB
Description

Training: There is no training step.

Onboarding: For each object, 43 random pre-rendered images from the "train_pbr" dataset are selected. The masks of these objects are then extracted and encoded using CLIP. This data is input into a simple expert binary classifier, trained for each object.

Test: During testing, Fastsam-s extracts masks from a test image, and the object's ID is determined by the expert binary classifiers.

Computer specifications RTX 3090, AMD Ryzen 9 3900X 12-Core Processor