Question about frozen encoder and decoder architecture in Figure 2

<img width="1916" height="798" alt="Image" src="https://github.com/user-attachments/assets/d082f576-8b55-479f-b842-f5e01e90ac8f" />

First of all, I'd like to commend the authors on the excellent work presented in SSS!

I have a quick question regarding the model architecture, specifically related to the frozen image encoder and feature decoder described in Figure 2 of the paper:

1. Is the frozen Image Encoder identical in structure to the fine-tuned Image Encoder?
2. Does the Feature Decoder follow the same architecture as the MedSAM-2 Decoder?

To summarize my question: Is the architecture of the frozen backbone (Image Encoder + Feature Decoder) the same as that of MedSAM-2? If not, could you kindly provide a brief description of its structure?

Thank you in advance for your clarification. I look forward to your response!
