Detecting small object detection is one of the most challenging tasks in object detection.

❓ Want to know an effective way to overcome that❓

Check out the SAHI integration in IceVision🧊
👇
All IceVision models transparently support SAHI (MMDetection, EfficientDet, Torchvision, and YOLOv5). The results are quite impressive!

SAHI is a very cool library that drastically improves detection of small objects.
📌 It is simply used at inference time

📌 It leverages high resolution images.

📌 Your training process remains the same as usual.

🔎 At the high level, this is how SAHI works:

1- It takes a high resolution (e.g.1920 x 1024 px), and divide it in patches (e.g. 128 x 128 px)
2- It performs an inference on that patch

3- It slides the window, and repeat step 2. You can choose the amount (percentage) of the overlapping part

4- It combines the boxes from all the patches

5- Project those boxes on the original image
Integrating SAHI in IceVision was led by our amazing core-developer @Fra_Pochetti 🙏

Francesco added a new notebook showing the step-by-step process to help you using SAHI in your project!

🎁 NB: github.com/airctic/icevis…
He also posted a great article here:

🎁 francescopochetti.com/icevision-sahi…

Kudos to the SAHI creator team for their amazing job! 🙏
@fcakyon

SAHI Repo: github.com/obss/sahi
⭐️ If you are interested in mastering object detection, and standing out from the crowd, feel free to follow me
@ai_fast_track

⭐️ If you could give this thread a quick retweet, it would help others discover this content 🙏

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with AI Fast Track (33/45)

AI Fast Track (33/45) Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ai_fast_track

4 Dec
💡 Prop Tip to 🚀 fast track your object detection training

All @fastdotai users use this method for free!

It's fine_tune()

We noticed an important increase in the COCO Metric when we train object detection models using this method even with the simple default values Image
Fine tuning has 2 phases:

🧊 Phase 1: Freeze your model's backbone and neck. Train your model for a couple of epochs (e.g. 1 to 5)

📈 Phase 2: Unfreeze the whole model. Train for higher number of periods (depending on your dataset, and how hard/easy your data is)
Phase 1 is a kind of warm-up for your model head.

Here is an example showing how to call the fine_tune() method.

In this case, we freeze both the backbone and the neck for 1 epoch in phase 1, and train for 20 epochs in phase 2. Image
Read 5 tweets
30 Nov
🎉🎊🥳Day 30/30: Labeling data and training object detection models are time consuming and expensive.

Here is a Survey of Self-Supervised and Few-Shot Object Detection (FSOD). Those technique aim at alleviating those issues.
The authors have categorized, reviewed, and compared several few-shot and self-supervised object detection methods

📌 FSOD is about training a model on novel (unseen) object classes with little data. It still requires prior training on many labeled data of base (seen) classes
📌 Self-Supervised Learning (SSL) methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection

📌 Combining few-shot and self-supervised object detection is a promising research direction
Read 8 tweets
26 Nov
Day 26/30: General Gaussian Heatmap Labeling is Arbitrary-Oriented Object Detection AOOD

📌 It uses an anchor-free object-adaptation label assignment strategy to define positive candidates based on 2D GH, reflecting shape and direction features of arbitrary-oriented objects.
📌 GGHL improves the AOOD performance with low parameter-tuning and time costs.

📌 It is also applicable to most AOOD methods to improve their performance including lightweight models.
📰 Paper: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object Detection

abs: arxiv.org/abs/2109.12848…
pdf: arxiv.org/pdf/2109.12848…
Read 4 tweets
24 Nov
Day 24/30: 🥇 EfficientDet is a very popular object detection model for a good reason!

Let’s see why

📌 EfficientDet achieved State-Of-The-Art (SOTA) accuracy while reducing both the size of parameters, and the FLOPS, when it was released. It’s still a very good contender.
📌 Before introducing EfficientDet, models were getting impressively big to achieve SOTA results

❓ The authors asked the following question:
Is it possible to build a scalable detection architecture with both higher accuracy and better efficiency across # resource constraints?
So, they systematically studied neural network architecture design choices for object detection, and proposed several key optimizations to improve efficiency:

1- A weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion
Read 8 tweets
23 Nov
I’m at Day 23 of my 30 posts (on Object Detection) in 30 days challenge

I gathered 12 visual summaries on OD Modeling 🎁

A lot of people find those posts helpful, follow @ai_fast_track to catch the upcoming posts, and give this tweet a quick retweet 🙏

Summary of summaries👇
1- Common Object Detector Architecture you should be familiar with:

2- Four Feature Pyramid Network (FPN) Designs you should know:

Read 14 tweets
20 Nov
FCOS is an an anchor-free object detector.

It was one of first competitors of anchor-based single/two stage object detectors.

Understanding FCOS will help understanding other model inspired by FCOS.

Summary ...👇
📌 FCOS reformulates object detection in a per-pixel prediction fashion

📌 It uses multi-level prediction to improve the recall and resolve the ambiguity resulted from overlapped bounding boxes
📌 It proposes “center-ness” branch, which helps suppress the low-quality detected bounding boxes and improves the overall performance by a large margin

📌 It avoids complex computation such as the intersection-over-union (IoU)
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(