Karan Desai (KD)
I fight the devil in the details 🧐 Prev: CS PhD @ University of Michigan
Nov 23, 2021 · 11 tweets · 6 min read
📢New dataset!📢 RedCaps: 12M image-text pairs from Reddit for vision and vision-and-language applications.
Website: redcaps.xyz
Paper: arxiv.org/abs/2111.11431

Check out captions from a RedCaps-trained model!⬇️
Try more here: huggingface.co/spaces/umichVi…
What's new?🧵 1/8

Conversational flavor of data: RedCaps data is created with the specific intent of human interaction on social media. Reddit users have an incentive (upvotes) to upload high-quality data, sometimes witty or emotional, unlike HTML alt-text. 2/8