Zahere Profile picture
Jun 14 9 tweets 2 min read Twitter logo Read on Twitter
How Canva efficiently removes duplicates from billions of images

T𝘄𝗶𝗻 𝗧𝗿𝗼𝘂𝗯𝗹𝗲𝘀: 𝗛𝗼𝘄 𝗖𝗮𝗻𝘃𝗮 𝗖𝗼𝗻𝗾𝘂𝗲𝗿𝘀 𝗕𝗶𝗹𝗹𝗶𝗼𝗻𝘀 𝗼𝗳 𝗜𝗺𝗮𝗴𝗲𝘀 𝘄𝗶𝘁𝗵 𝗘𝗳𝗳𝗶𝗰𝗶𝗲𝗻𝗰𝘆
🎯 #Refactor75 ➡Day41 #systemdesign #content #engineers Image
Canva has more than 20 billion user-uploaded images, with 50 million uploaded daily.

Dealing with a large variety of media creates difficulties when it comes to moderating and minimizing unnecessary duplicate content
To solve this problem, Canva uses perceptual hashing with an internally built reverse image search system
Perceptual hashing, also known as visual hashing or image fingerprinting, is a technique used to create a compact digital representation of an image or video frame based on its visual content.
The goal of perceptual hashing is to generate a hash value that is unique to the image and remains relatively unchanged even if the image undergoes minor transformations such as compression, color correction, and brightness
To perform a reverse image search using perceptual hashing, we can compare two hash values by calculating the Hamming distance between them.
Hamming distance is a metric used to measure, compare and evaluate the similarity or dissimilarity between two binary strings
Benefits for Canva

1. Reduced Storage Costs
2. Near-unique images served on lookup
3. Content Moderation: Takedown of known illegal images within seconds
Please follow @zahiruddin_t for more such content

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Zahere

Zahere Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(