DINOv2 Vision Demo

Explore Meta's DINOv2 self-supervised vision transformer — image classification, patch-level feature visualization, and embedding similarity. Everything runs on CPU using dinov2-small (~86 MB).

Upload an image to classify it against ImageNet-1k labels using facebook/dinov2-small-imagenet1k-1-layer.