Hilde Kuehne @ CVPR2022

Jun 17 • 7 tweets • 6 min read

Happy to finally share our paper about differentiable Top-K Learning by Sorting, which didn’t make it into #CVPR2022 but was accepted at #ICML2022! We show that you can improve classification by actually considering the top-1 prediction plus the runners-up… 1/6🧵

#ComputerVision #AI #MachineLearning

@FHKPetersen

Paper: arxiv.org/abs/2206.07290

Great work by @FHKPetersen in collaboration with Christian Borgelt and @OliverDeussen. 2/6🧵

@MITIBMLab @goetheuni @UniKonstanz

Idea: Top-k accuracy is used as a metric in many ML tasks, but training usually optimizes for a single k, typically top-1. We propose a differentiable top-k classification loss that allows training on any combination of top-k predictions, e.g., top-2 and top-5. 3/6🧵
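To make the metric concrete, here is a minimal sketch of top-k accuracy evaluated for several values of k. The function name `topk_accuracy` and the toy scores are illustrative assumptions, not taken from the paper or its code.

```python
def topk_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices ordered by score, highest first.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy data (made up for illustration): 3 samples, 4 classes.
scores = [
    [0.10, 0.60, 0.20, 0.10],  # true class 1 is ranked first
    [0.40, 0.10, 0.30, 0.20],  # true class 2 is ranked second
    [0.50, 0.20, 0.25, 0.05],  # true class 3 is ranked last
]
labels = [1, 2, 3]

for k in (1, 2, 4):
    print(f"top-{k} accuracy: {topk_accuracy(scores, labels, k):.3f}")
```

The second sample counts as a miss under top-1 but as a hit under top-2, which is exactly the kind of near-miss a top-1-only training objective ignores.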

To this end, we leverage recent advances in differentiable sorting and ranking. We capture the probability that a class is among the top-k predictions for a given input, e.g., an image. 4/6🧵
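A rough sketch of how such a top-k probability could feed a loss, assuming the differentiable sorter returns a relaxed permutation matrix `P` where `P[i][c]` is the probability that class `c` holds rank `i`. The names `topk_probability`, `topk_loss`, and `k_weights`, the clamping, and the weighted negative log-likelihood form are assumptions made for illustration; the thread does not spell out the exact formulation.

```python
import math


def topk_probability(P, label, k):
    """Relaxed probability that `label` is among the top-k ranks.

    P[i][c] is the probability that class c holds rank i (rank 0 = highest score).
    """
    p = sum(P[i][label] for i in range(k))
    # Clamp for numerical safety (an assumption of this sketch).
    return min(max(p, 1e-12), 1.0)


def topk_loss(P, label, k_weights):
    """Weighted negative log-likelihood over several k.

    k_weights maps k -> weight, e.g. {1: 0.5, 5: 0.5}; weights should sum to 1.
    """
    return -sum(w * math.log(topk_probability(P, label, k))
                for k, w in k_weights.items())


# Hand-made relaxed permutation matrix for 3 classes (illustrative values only).
P = [
    [0.7, 0.2, 0.1],  # rank 0: most likely class 0
    [0.2, 0.6, 0.2],  # rank 1: most likely class 1
    [0.1, 0.2, 0.7],  # rank 2: most likely class 2
]

print(topk_probability(P, label=1, k=1))      # chance that class 1 is ranked first
print(topk_probability(P, label=1, k=2))      # chance that class 1 is in the top 2
print(topk_loss(P, label=1, k_weights={1: 0.5, 2: 0.5}))
```

In a real training setup the same sums would be computed on tensors inside an autodiff framework so that gradients flow back through `P` to the network's scores.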

@CuturiMarco

This works with any differentiable sorting framework:
NeuralSort: arxiv.org/abs/1903.08850
SoftSort: arxiv.org/abs/2006.16038
SinkhornSort: arxiv.org/abs/1905.11885
DiffSortNets: arxiv.org/abs/2203.09630 arxiv.org/abs/2105.04019
5/6🧵
@CuturiMarco @adityagrover_ @SebastianPrillo
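As one example of the frameworks listed above, a SoftSort-style relaxation can be sketched in a few lines: each row of the relaxed permutation matrix is a softmax over the negative absolute distances between one sorted score and all scores. This is a plain-Python illustration that follows the formula from the SoftSort paper as I understand it; the function name `softsort`, the default `tau`, and the toy scores are assumptions, and this is not the authors' released code.

```python
import math


def softsort(scores, tau=1.0):
    """Relaxed permutation matrix P for descending sort.

    P[i][c] approximates the probability that class c holds rank i.
    Smaller tau gives a sharper (closer to hard) permutation.
    """
    sorted_scores = sorted(scores, reverse=True)
    P = []
    for s_i in sorted_scores:
        # Row i: softmax over -|sorted_score_i - score_c| / tau.
        logits = [-abs(s_i - s_c) / tau for s_c in scores]
        m = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(l - m) for l in logits]
        z = sum(exps)
        P.append([e / z for e in exps])
    return P


# Toy scores (made up): class 1 has the highest score, then class 0, then class 2.
P = softsort([2.0, 5.0, 1.0], tau=0.1)
for row in P:
    print([round(p, 3) for p in row])
```

Each row sums to one by construction, but the columns need not, so the summed top-k probability for a class can slightly exceed one with this relaxation; how to handle that (for instance by clamping) is a design choice left open here.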

We evaluate the top-k loss on state-of-the-art architectures. We find that relaxing k not only produces better top-5 accuracies but also improves top-1 accuracy, and that fine-tuning publicly available ImageNet models with it can achieve a new state of the art. 6/6🧵

Hope you enjoy the paper! Feel free to leave comments or contact us if you have questions. Code will be available soon!

More from @HildeKuehne

Jun 16

@MITIBMLab

Check out our #CVPR2022 paper! We improve multimodal zero-shot text-to-video retrieval on YouCook2/MSR-VTT by leveraging a fusion transformer and a combinatorial loss. 1/🧵

#ComputerVision #AI #MachineLearning

@MITIBMLab @goetheuni @MIT_CSAIL @IBMResearch

@ninashv__

If you want to go directly to the paper/code, please check out:
paper: arxiv.org/abs/2112.04446
Github link: github.com/ninatu/everyth…

Great work by @ninashv__, @Brian271828, @arouditchenko, Samuel Thomas, Brian Kingsbury, @RogerioFeris, David Harwath, and James Glass.

We propose a modality-agnostic multimodal fusion transformer that learns to exchange information between multiple modalities, e.g., video, audio, and text, and builds an embedding that aggregates multimodal information.
