https://www.fregu856.com/
I actually wrote "The one proper method change that seems to have the biggest effect is probably adding the KoLeo regularization loss term?" in my notes, so it would be nice to read more about how that works.
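As far as I understand it, the KoLeo regularizer (derived from the Kozachenko-Leonenko differential entropy estimator) pushes the features within a batch to spread out: for n L2-normalized feature vectors it is -(1/n) * sum_i log(d_i), where d_i is the distance from x_i to its nearest neighbor in the batch. Below is a minimal NumPy sketch of that formula, not the DINOv2 implementation; the function name `koleo_loss` and the `eps` stabilizer are my own assumptions.

```python
import numpy as np


def koleo_loss(features, eps=1e-8):
    """Sketch of the KoLeo term: -(1/n) * sum_i log(min_{j != i} ||x_i - x_j||).

    `features` is an (n, d) array holding one feature vector per image.
    `eps` is an assumed small constant for numerical stability.
    """
    x = np.asarray(features, dtype=np.float64)
    # L2-normalize each feature vector before measuring distances.
    x = x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)
    # Pairwise Euclidean distances between all vectors in the batch.
    dist = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    # Exclude each point's zero distance to itself.
    np.fill_diagonal(dist, np.inf)
    # Distance from every point to its nearest neighbor.
    nn_dist = dist.min(axis=1)
    return -np.mean(np.log(nn_dist + eps))


# Evenly spread features versus nearly collapsed features.
spread = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])
clustered = np.array([[1.0, 0.00], [1.0, 0.01], [1.0, 0.02], [1.0, 0.03]])
print(koleo_loss(spread), koleo_loss(clustered))
```

The loss is lower for the evenly spread batch than for the nearly collapsed one, so minimizing it penalizes features that sit close to their nearest neighbor.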
Their model distillation approach is also interesting, distilling their ViT-g down to ViT-L and smaller models.
DINOv2: Learning Robust Visual Features without Supervision (TMLR, 2024)
DINOv2 doesn't really differ much methodologically from iBOT, and they give a good summary of what they do: