📢New dataset!📢 RedCaps: 12M image-text pairs from Reddit for vision and vision-and-language applications.
Website: redcaps.xyz
Paper: arxiv.org/abs/2111.11431

Check out captions from a RedCaps-trained model!⬇️
Try more here: huggingface.co/spaces/umichVi…
What's new?🧵1/8
Conversational flavor of data: RedCaps data is created with the specific intent of human interaction on social media. Reddit users have an incentive (upvotes) to upload high-quality data, sometimes witty or emotional, unlike HTML alt-text. 2/8
Subreddits: We collect data from 350 manually chosen subreddits. The largest subreddits show that Reddit users like to share pets, hobbies, and photography! These subreddits let us steer the data distribution and provide image labels even when captions don’t mention the objects in the image. 3/8
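
A rough sketch of the subreddit idea above, not the RedCaps pipeline itself: subreddit names are used as weak image labels, and a chosen set of subreddits steers the data distribution. The records, the field names (`subreddit`, `caption`), and the helper functions are all hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical records; field names and contents are assumptions, not real RedCaps data.
samples = [
    {"subreddit": "cats", "caption": "she finally sat still for a photo"},
    {"subreddit": "itookapicture", "caption": "sunset over the lake"},
    {"subreddit": "cats", "caption": "my roommate for the last ten years"},
    {"subreddit": "woodworking", "caption": "first attempt at a dovetail joint"},
]


def weak_labels(records):
    """Pair each caption with its subreddit name, used here as a weak image label."""
    return [(r["caption"], r["subreddit"]) for r in records]


def filter_by_subreddit(records, allowed):
    """Keep only records from the chosen subreddits to steer the data distribution."""
    allowed = set(allowed)
    return [r for r in records if r["subreddit"] in allowed]


# Count how many samples each weak label covers.
label_counts = Counter(label for _, label in weak_labels(samples))

# Restrict the data to a single subreddit.
pets_only = filter_by_subreddit(samples, ["cats"])
```

Neither caption from the hypothetical "cats" subreddit mentions a cat, yet both still receive the label "cats". This is the case the tweet describes, where the subreddit supplies a label the caption lacks.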