You can also find me at threads: @sung.kim.mw
Swap arxiv → quickarxiv on any paper URL to get an instant blog with figures, insights, and explanations.
Swap arxiv → quickarxiv on any paper URL to get an instant blog with figures, insights, and explanations.
DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. They find that:
- A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture.
DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. They find that:
- A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture.
A full Terminal UI (TUI) for live, interactive W&B monitoring right in your terminal.
wandb.ai/wandb_fc/pro...
A full Terminal UI (TUI) for live, interactive W&B monitoring right in your terminal.
wandb.ai/wandb_fc/pro...
developers.googleblog.com/en/google-co...
developers.googleblog.com/en/google-co...
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%