Happy to share my first @Gradio demo hosted as a @huggingface Space! It showcases @facebookai's new DINO self-supervised method, which allows Vision Transformers to segment objects within an image without ever being trained to do so! Try it yourself!
I've also converted all ViTs trained with DINO from the official repository and uploaded them to the hub: huggingface.co/models?other=d…. Just load them into a ViTModel or ViTForImageClassification ;)
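For anyone who wants to try the loading step, here's a minimal sketch, assuming the `transformers` library is installed. The helper names `pick_vit_class` and `load_dino_vit` are made up for illustration, and the checkpoint id is left as a parameter: pick one from the hub listing above.

```python
def pick_vit_class(with_classifier: bool) -> str:
    """Return the name of the transformers class to load the weights into."""
    return "ViTForImageClassification" if with_classifier else "ViTModel"


def load_dino_vit(checkpoint: str, with_classifier: bool = False):
    """Load a DINO-trained ViT checkpoint from the hub (hypothetical helper).

    `checkpoint` is a hub model id chosen by the caller; nothing is hardcoded.
    """
    # Imported lazily so the helper above works without transformers installed.
    import transformers

    model_cls = getattr(transformers, pick_vit_class(with_classifier))
    # from_pretrained downloads the weights on first use and caches them.
    return model_cls.from_pretrained(checkpoint)


# Example (needs network access; the id below is an assumption, check the hub):
# model = load_dino_vit("facebook/dino-vits16")
```

One caveat, hedged: DINO is self-supervised, so loading into ViTForImageClassification will probably leave you with a freshly initialized classification head that still needs fine-tuning.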
Also, amazed at how ridiculously easy @Gradio and @huggingface Spaces are: I got everything set up in 10 minutes.