AI Fast Track (30/30)

30 Nov, 8 tweets, 2 min read

🎉🎊🥳Day 30/30: Labeling data and training object detection models are time consuming and expensive.

Here is a Survey of Self-Supervised and Few-Shot Object Detection (FSOD). These techniques aim to alleviate those issues.

The authors have categorized, reviewed, and compared several few-shot and self-supervised object detection methods

📌 FSOD is about training a model on novel (unseen) object classes with little data. It still requires prior training on a large amount of labeled data from base (seen) classes

📌 Self-Supervised Learning (SSL) methods aim to learn representations from unlabeled data that transfer well to downstream tasks such as object detection

📌 Combining few-shot and self-supervised object detection is a promising research direction
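
To make the base/novel distinction concrete, here is a minimal sketch of how a K-shot training set could be carved out of a labeled dataset. The annotation format, the class names, and the helper `make_few_shot_split` are hypothetical and are not taken from the survey.

```python
import random
from collections import defaultdict

def make_few_shot_split(annotations, novel_classes, k, seed=0):
    """Split (image_id, class_name) pairs into base data and a K-shot novel set.

    Base classes keep all of their labeled instances; each novel class keeps
    at most k randomly chosen instances.
    """
    rng = random.Random(seed)
    base = []
    per_novel = defaultdict(list)
    for image_id, cls in annotations:
        if cls in novel_classes:
            per_novel[cls].append((image_id, cls))
        else:
            base.append((image_id, cls))
    novel = []
    for cls in sorted(per_novel):
        items = per_novel[cls]
        novel.extend(rng.sample(items, min(k, len(items))))
    return base, novel

# Toy annotations: two abundant base classes and one rare novel class.
annotations = (
    [(i, "car") for i in range(100)]
    + [(i, "person") for i in range(100, 180)]
    + [(i, "okapi") for i in range(180, 200)]
)
base, novel = make_few_shot_split(annotations, novel_classes={"okapi"}, k=5)
print(len(base), len(novel))  # 180 5
```

A detector would typically be trained on `base` first and then adapted with `novel`; how that adaptation is done is exactly where the surveyed methods differ.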

The authors summarized their main takeaways, made future best practice recommendations, highlighted trends to follow, and gave pointers to related tasks.

🔎 TAKEAWAYS & TRENDS

📌 Finetuning is a strong baseline

📌 Impact of self-supervision for object detection

📌 Using heuristics to generate weak labels (e.g. data augmentation)

📌 Rise of transformers

📌 Problems with current evaluation procedures

🔗 RELATED TASKS

📌 Weakly-supervised object detection

📌 Self-supervision using other modalities

📌 Low-data object detection

📌 Few-shot semantic segmentation

📌 Zero-shot object detection

📰 Paper: A Survey of Self-Supervised and Few-Shot Object Detection

abs: arxiv.org/abs/2110.14711
pdf: arxiv.org/pdf/2110.14711…

@ai_fast_track

⭐️ If you find this thread helpful, feel free to follow @ai_fast_track for more OD / CV demystified content in your feed

⭐️ If you could give this thread a quick retweet, it would help others discover this content. 🙏

https://twitter.com/ai_fast_track/status/1465773297837215750

More from @ai_fast_track

AI Fast Track (30/30)

@ai_fast_track

26 Nov

Day 26/30: General Gaussian Heatmap Labeling (GGHL) for Arbitrary-Oriented Object Detection (AOOD)

📌 It uses an anchor-free object-adaptation label assignment strategy to define positive candidates based on 2D GH, reflecting shape and direction features of arbitrary-oriented objects.

📌 GGHL improves the AOOD performance with low parameter-tuning and time costs.

📌 It is also applicable to most AOOD methods to improve their performance including lightweight models.
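
To give an intuition for a heatmap that reflects both the shape and the direction of an oriented box, here is a generic rotated 2D Gaussian. This is only an illustrative sketch: the function `oriented_gaussian` and the `scale` factor that maps box size to standard deviation are assumptions, not the labeling rule from the GGHL paper.

```python
import math

def oriented_gaussian(x, y, cx, cy, w, h, theta, scale=0.25):
    """Value in (0, 1] of a rotated 2D Gaussian centered on an oriented box.

    (cx, cy): box center; w, h: box width and height; theta: rotation in radians.
    scale maps box size to standard deviation (an assumption for illustration).
    """
    dx, dy = x - cx, y - cy
    # Rotate the offset into the box's own coordinate frame.
    u = dx * math.cos(theta) + dy * math.sin(theta)
    v = -dx * math.sin(theta) + dy * math.cos(theta)
    sigma_u, sigma_v = scale * w, scale * h
    return math.exp(-0.5 * ((u / sigma_u) ** 2 + (v / sigma_v) ** 2))

# A long, thin box rotated by 45 degrees.
theta = math.pi / 4
along = oriented_gaussian(10 * math.cos(theta), 10 * math.sin(theta), 0, 0, 40, 10, theta)
across = oriented_gaussian(-10 * math.sin(theta), 10 * math.cos(theta), 0, 0, 40, 10, theta)
print(along > across)  # True: the heatmap is stretched along the box's long axis
```

Locations with a high heatmap value would be natural positive candidates, which is the general idea behind assigning labels from a Gaussian rather than from anchors.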

📰 Paper: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object Detection

abs: arxiv.org/abs/2109.12848…
pdf: arxiv.org/pdf/2109.12848…

AI Fast Track (30/30)

@ai_fast_track

24 Nov

Day 24/30: 🥇 EfficientDet is a very popular object detection model for a good reason!

Let’s see why

📌 When it was released, EfficientDet achieved State-Of-The-Art (SOTA) accuracy while reducing both the number of parameters and the FLOPs. It’s still a very good contender.

📌 Before EfficientDet was introduced, models were getting impressively big to achieve SOTA results

❓ The authors asked the following question:
Is it possible to build a scalable detection architecture with both higher accuracy and better efficiency across a wide spectrum of resource constraints?

So, they systematically studied neural network architecture design choices for object detection, and proposed several key optimizations to improve efficiency:

1- A weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion
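
The "weighted" part of BiFPN can be sketched with the fast normalized fusion rule from the EfficientDet paper: each input gets a learnable non-negative weight, and the weighted sum is divided by the sum of the weights. The sketch below assumes the inputs were already resized to the same resolution and flattened to plain lists; the function name and the toy values are illustrative only.

```python
def fast_normalized_fusion(features, weights, eps=1e-4):
    """Fuse same-shaped feature maps with learnable non-negative weights.

    out = sum_i(w_i * f_i) / (eps + sum_j w_j), with each w_i clipped at 0 (ReLU).
    features: list of equally long lists of floats (flattened feature maps).
    weights: one scalar per input feature map.
    """
    w = [max(0.0, wi) for wi in weights]
    total = eps + sum(w)
    size = len(features[0])
    return [sum(wi * f[k] for wi, f in zip(w, features)) / total for k in range(size)]

# Two toy feature maps with equal weights: the result is close to their average.
fused = fast_normalized_fusion([[1.0, 2.0], [3.0, 4.0]], weights=[1.0, 1.0])
print([round(v, 3) for v in fused])  # [2.0, 3.0]
```

In the real network this fusion is followed by a convolution, and the weights are trained along with the rest of the model; avoiding a softmax over the weights is what makes the fusion cheap.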

AI Fast Track (30/30)

@ai_fast_track

23 Nov

I’m at Day 23 of my challenge: 30 posts on Object Detection in 30 days

I gathered 12 visual summaries on OD Modeling 🎁

A lot of people find those posts helpful. Follow @ai_fast_track to catch the upcoming posts, and give this tweet a quick retweet 🙏

Summary of summaries👇

1- Common Object Detector Architecture you should be familiar with:

https://twitter.com/ai_fast_track/status/1453368771285032971

2- Four Feature Pyramid Network (FPN) Designs you should know:

https://twitter.com/ai_fast_track/status/1460297658472574982

AI Fast Track (30/30)

@ai_fast_track

20 Nov

FCOS is an anchor-free object detector.

It was one of the first competitors to anchor-based single- and two-stage object detectors.

Understanding FCOS will help you understand other models inspired by FCOS.

Summary ...👇

📌 FCOS reformulates object detection in a per-pixel prediction fashion

📌 It uses multi-level prediction to improve the recall and resolve the ambiguity resulting from overlapping bounding boxes

📌 It proposes a “center-ness” branch, which helps suppress low-quality detected bounding boxes and improves the overall performance by a large margin

📌 It avoids the complex computation tied to anchor boxes, such as calculating the intersection-over-union (IoU) during training
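
The per-pixel formulation and the center-ness score summarized above can be sketched in a few lines. Each pixel inside a ground-truth box regresses its distances to the four sides, and center-ness is computed from those distances as defined in the FCOS paper. The helper names and the toy box are illustrative.

```python
import math

def fcos_targets(px, py, box):
    """Distances (l, t, r, b) from a pixel to the four sides of a box (x0, y0, x1, y1)."""
    x0, y0, x1, y1 = box
    return px - x0, py - y0, x1 - px, y1 - py

def centerness(l, t, r, b):
    """FCOS center-ness: 1.0 at the box center, decaying toward 0 near the edges.

    Only meaningful for pixels strictly inside the box (all distances positive).
    """
    return math.sqrt((min(l, r) / max(l, r)) * (min(t, b) / max(t, b)))

box = (0, 0, 100, 50)
print(centerness(*fcos_targets(50, 25, box)))            # 1.0 at the center
print(round(centerness(*fcos_targets(10, 25, box)), 3))  # 0.333 near the left edge
```

At inference time the center-ness is multiplied with the classification score, so boxes predicted from pixels far from an object's center rank lower and tend to be filtered out by NMS.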

AI Fast Track (30/30)

@ai_fast_track

18 Nov

How to create a robustness evaluation dataset?

"Natural Adversarial Objects" (NAO) dataset is a challenging robustness evaluation dataset for models trained on MSCOCO

📌 Models generally perform well on large scale training sets

📌 They generalize to test sets coming from the same distribution

📌 On the NAO dataset, EfficientDet-D7's mAP drops by 74.5% compared to MSCOCO

📌 Faster RCNN's mAP drops by 36.3% compared to MSCOCO

📌 They evaluated 7 SOTA models and showed that they consistently fail to perform accurately on NAO compared to MSCOCO

📌 The drop is present on both in-distribution and out-of-distribution objects

AI Fast Track (30/30)

@ai_fast_track

17 Nov

❇VFNet: A very interesting model that is still under the radar. You should give it a try :)

VariFocalNet: An IoU-aware Dense Object Detector

🧊 Background:
📌 Accurately ranking candidate detections is crucial for dense object detectors to achieve high performance
...

📌 Prior work uses the classification score or a combination of classification and predicted localization scores (centerness) to rank candidates.

📌 Those 2 scores are still not optimal

🧊 Novelty:
📌 VFNet proposes to learn an IoU-Aware Classification Score (IACS)

📌 IACS is used as a joint representation of object presence confidence and localization accuracy using IoU

📌 VFNet introduces the VariFocal Loss

📌 The VariFocal Loss down-weights only negative examples for addressing the class imbalance problem during training
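
Here is a minimal per-prediction sketch of the Varifocal Loss as formulated in the VarifocalNet paper. The target `q` is the IoU with the ground-truth box for a positive example and 0 for a negative one. The default values `alpha=0.75` and `gamma=2.0` are assumed here as commonly quoted settings; treat them as tunable.

```python
import math

def varifocal_loss(p, q, alpha=0.75, gamma=2.0):
    """Varifocal loss for a single prediction.

    p: predicted IoU-aware classification score, strictly between 0 and 1.
    q: target score; the IoU with the ground truth for a positive, 0 for a negative.
    """
    if q > 0:
        # Positive example: binary cross-entropy weighted by the target q,
        # so high-IoU positives contribute more to the loss.
        return -q * (q * math.log(p) + (1 - q) * math.log(1 - p))
    # Negative example: down-weighted by alpha * p**gamma,
    # so easy negatives (small p) contribute very little.
    return -alpha * (p ** gamma) * math.log(1 - p)

easy_negative = varifocal_loss(p=0.1, q=0.0)
hard_negative = varifocal_loss(p=0.9, q=0.0)
print(easy_negative < hard_negative)  # True
```

The asymmetry is the point: unlike the focal loss, positives are not down-weighted, because they are rare and carry the localization signal.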
