📈enabling fast, scalable annotation with minimal supervision.
See the qualitative results 👇
📈enabling fast, scalable annotation with minimal supervision.
See the qualitative results 👇
SAM2 features are rich, but optimized for tracking.
🧠 Insert bottleneck adapters into frozen SAM2
📉 These restructure feature space to disentangle semantics
📈 Result: features cluster semantically—even for unseen classes (see PCA👇)
SAM2 features are rich, but optimized for tracking.
🧠 Insert bottleneck adapters into frozen SAM2
📉 These restructure feature space to disentangle semantics
📈 Result: features cluster semantically—even for unseen classes (see PCA👇)
We turn SAM2 into a semantic few-shot segmenter:
🧠 Unlocks latent semantics in frozen SAM2
✏️ Supports any prompt: fast and scalable annotation
📦 No extra encoders
📎 github.com/ClaudiaCutta...
We turn SAM2 into a semantic few-shot segmenter:
🧠 Unlocks latent semantics in frozen SAM2
✏️ Supports any prompt: fast and scalable annotation
📦 No extra encoders
📎 github.com/ClaudiaCutta...
🔹 Textual Prompts for SAM2: Early fusion of visual-text cues via a novel adapter
🔹 Temporal Modeling: Essential for video understanding, beyond frame-by-frame object tracking
🔹 Tracking Bias: Correcting tracking bias in SAM2 for text-aligned object discovery
🔹 Textual Prompts for SAM2: Early fusion of visual-text cues via a novel adapter
🔹 Temporal Modeling: Essential for video understanding, beyond frame-by-frame object tracking
🔹 Tracking Bias: Correcting tracking bias in SAM2 for text-aligned object discovery
We make #SegmentAnything wiser, enabling it to understand textual prompts—training only 4.9M parameters! 🧠
💻 Code, models & demo: github.com/ClaudiaCutta...
Why SAMWISE?👇
We make #SegmentAnything wiser, enabling it to understand textual prompts—training only 4.9M parameters! 🧠
💻 Code, models & demo: github.com/ClaudiaCutta...
Why SAMWISE?👇