πŸŽ‰πŸŽŠπŸ₯³Day 30/30: Labeling data and training object detection models are time consuming and expensive.

Here is a Survey of Self-Supervised and Few-Shot Object Detection (FSOD). Those technique aim at alleviating those issues.
The authors have categorized, reviewed, and compared several few-shot and self-supervised object detection methods

πŸ“Œ FSOD is about training a model on novel (unseen) object classes with little data. It still requires prior training on many labeled data of base (seen) classes
πŸ“Œ Self-Supervised Learning (SSL) methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection

πŸ“Œ Combining few-shot and self-supervised object detection is a promising research direction
The authors summarized their main takeaways, made future best practice recommendations, highlighted trends to follow, and given pointers to related tasks.

πŸ”Ž TAKEAWAYS & TRENDS

πŸ“Œ Finetuning is a strong baseline

πŸ“Œ Impact of self-supervision for object detection
πŸ“Œ Using heuristics to generate weak labels (e.g. data augmentation)

πŸ“Œ Rise of transformers

πŸ“Œ Problems with current evaluation procedures
πŸ”— RELATED TASKS

πŸ“Œ Weakly-supervised object detection

πŸ“Œ Self-supervision using other modalities

πŸ“Œ Low-data object detection

πŸ“Œ Few-shot semantic segmentation

πŸ“Œ Zero-shot object detection
πŸ“° Paper: A Survey of Self-Supervised and Few-Shot Object Detection

abs: arxiv.org/abs/2110.14711
pdf: arxiv.org/pdf/2110.14711…
⭐️ If you find this thread helpful, feel free to follow
@ai_fast_track for more OD / CV demystified content in your feed

⭐️ If you could give this thread a quick retweet, it would help others discover this content. πŸ™

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with AI Fast Track (30/30)

AI Fast Track (30/30) Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ai_fast_track

26 Nov
Day 26/30: General Gaussian Heatmap Labeling is Arbitrary-Oriented Object Detection AOOD

πŸ“Œ It uses an anchor-free object-adaptation label assignment strategy to define positive candidates based on 2D GH, reflecting shape and direction features of arbitrary-oriented objects.
πŸ“Œ GGHL improves the AOOD performance with low parameter-tuning and time costs.

πŸ“Œ It is also applicable to most AOOD methods to improve their performance including lightweight models.
πŸ“° Paper: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object Detection

abs: arxiv.org/abs/2109.12848…
pdf: arxiv.org/pdf/2109.12848…
Read 4 tweets
24 Nov
Day 24/30: πŸ₯‡ EfficientDet is a very popular object detection model for a good reason!

Let’s see why

πŸ“Œ EfficientDet achieved State-Of-The-Art (SOTA) accuracy while reducing both the size of parameters, and the FLOPS, when it was released. It’s still a very good contender.
πŸ“Œ Before introducing EfficientDet, models were getting impressively big to achieve SOTA results

❓ The authors asked the following question:
Is it possible to build a scalable detection architecture with both higher accuracy and better efficiency across # resource constraints?
So, they systematically studied neural network architecture design choices for object detection, and proposed several key optimizations to improve efficiency:

1- A weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion
Read 8 tweets
23 Nov
I’m at Day 23 of my 30 posts (on Object Detection) in 30 days challenge

I gathered 12 visual summaries on OD Modeling 🎁

A lot of people find those posts helpful, follow @ai_fast_track to catch the upcoming posts, and give this tweet a quick retweet πŸ™

Summary of summariesπŸ‘‡
1- Common Object Detector Architecture you should be familiar with:

2- Four Feature Pyramid Network (FPN) Designs you should know:

Read 14 tweets
20 Nov
FCOS is an an anchor-free object detector.

It was one of first competitors of anchor-based single/two stage object detectors.

Understanding FCOS will help understanding other model inspired by FCOS.

Summary ...πŸ‘‡
πŸ“Œ FCOS reformulates object detection in a per-pixel prediction fashion

πŸ“Œ It uses multi-level prediction to improve the recall and resolve the ambiguity resulted from overlapped bounding boxes
πŸ“Œ It proposes β€œcenter-ness” branch, which helps suppress the low-quality detected bounding boxes and improves the overall performance by a large margin

πŸ“Œ It avoids complex computation such as the intersection-over-union (IoU)
Read 6 tweets
18 Nov
How to create a robustness evaluation dataset?

"Natural Adversarial Objects" (NAO) dataset is a challenging robustness evaluation dataset for models trained on MSCOCO

πŸ“Œ Models generally perform well on large scale training sets
πŸ“Œ They generalize on test sets coming from the same distribution

πŸ“Œ When using NAO dataset, EfficientDet-D7 mAP reduced by 74.5% compared to MSCOCO

πŸ“Œ Faster RCNN reduced by 36.3% compared to MSCOCO
πŸ“Œ They evaluated 7 SOTA models, and showed they consistently fail to perform accurately on NAO, comparing to MSCOCO

πŸ“Œ The drop is present on both in-distribution and out-of-distribution objects
Read 6 tweets
17 Nov
❇VFNet: A very interesting model that isn’t under the radar. You should give it a try :)

VariFocalNet: An IoU-aware Dense Object Detector

🧊 Background:
πŸ“Œ Accurately ranking candidate detections is crucial for dense object detectors to achieve high performance
...
πŸ“Œ Prior work uses the classification score or a combination of classification and predicted localization scores (centerness) to rank candidates.

πŸ“Œ Those 2 scores are still not optimal

🧊 Novelty:
πŸ“Œ VFNet proposes to learn an IoU-Aware Classification Score (IACS)
πŸ“ŒIACS is used as a joint representation of object presence confidence and localization accuracy using IoU

πŸ“Œ VFNet introduces the VariFocal Loss

πŸ“Œ The VariFocal Loss down-weights only negative examples for addressing the class imbalance problem during training
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(