✨ Tips & Tricks for Training Object Detection Models ✨

🧊 Data Labeling

📌 Avoid adding low-quality data: it confuses your model

📌 Prevent data leakage between your training and validation sets

📌 Dataset size: a smaller dataset can work with pretrained models; training from scratch needs a bigger one

📌 Use prototypical (representative) data for each class

📌 Identify incorrectly labelled classes

📌 Identify ambiguously labelled images

📌 Balance your data distribution

📌 Train from scratch if your dataset is very different from COCO

📌 When your model stops improving, adding more data can move the needle:
📌 Use soft-labelling: label new data with pretrained models => free labels

📌 Use self-training: label new data with the model you are currently training, add the newly labelled data to your training set, and loop
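The self-training loop hinges on keeping only confident pseudo-labels. A minimal sketch (the threshold value and the `(box, class_id, score)` prediction format are assumptions for illustration, not from the thread):

```python
def select_pseudo_labels(predictions, conf_threshold=0.9):
    """Keep only high-confidence predictions as pseudo-labels.

    `predictions` is a list of (box, class_id, score) tuples. The
    threshold trades label noise against the amount of new data:
    lower it and you get more labels, but more wrong ones too.
    """
    return [(box, cls) for box, cls, score in predictions
            if score >= conf_threshold]


# One round of self-training, sketched with hypothetical helpers:
# model = train(model, labelled_data)
# preds = model.predict(unlabelled_images)
# labelled_data += select_pseudo_labels(preds)
```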

🧊 Modeling

📌 Use larger models: they outperform smaller ones

📌 Use smaller models 😀 when training on a small dataset

📌 Use Focal Loss for the classification head (RetinaNet, EfficientDet, ...)

📌 Use GIoU Loss for the regression head (box location): needs a separate post 😄
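Until that separate post, here is a plain-Python sketch of the GIoU loss for one pair of `(x1, y1, x2, y2)` boxes; real training code would use a batched tensor version (e.g. torchvision's `generalized_box_iou_loss`):

```python
def giou_loss(pred, target):
    """GIoU loss for two axis-aligned boxes in (x1, y1, x2, y2) form.

    GIoU = IoU - |C \ (A ∪ B)| / |C|, where C is the smallest box
    enclosing both. Loss = 1 - GIoU, which stays informative even for
    non-overlapping boxes (plain IoU would be flat at 0 there).
    """
    ax1, ay1, ax2, ay2 = pred
    bx1, by1, bx2, by2 = target
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    # Intersection and union
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = area_a + area_b - inter
    iou = inter / union
    # Smallest enclosing box C
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c_area = cw * ch
    giou = iou - (c_area - union) / c_area
    return 1.0 - giou
```

For a perfect match the loss is 0; for disjoint boxes it grows with the distance between them, which is exactly the gradient signal IoU alone lacks.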
📌 Pretraining on ImageNet isn't always effective

📌 The YOLO-ReT paper showed that transferring the whole pretrained backbone can harm model performance

📌 Use backbone truncation (like YOLO-ReT): only ~60% of the backbone is initialised with pretrained weights.
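One way to implement backbone truncation is to load pretrained weights only for the early layers and let the rest initialise randomly. A sketch over a plain name-to-tensor dict (the 60% cut-off and the ordered layer list are illustrative; YOLO-ReT chooses the split per architecture):

```python
def truncate_pretrained(state_dict, layer_names, fraction=0.6):
    """Return pretrained weights for only the first `fraction` of layers.

    `layer_names` lists the backbone parameters in depth order. Early
    layers keep their pretrained (generic, transferable) weights; the
    later, more task-specific layers are dropped so the framework
    initialises them randomly instead.
    """
    keep = set(layer_names[: int(len(layer_names) * fraction)])
    return {k: v for k, v in state_dict.items() if k in keep}
```

In PyTorch this partial dict would then be loaded with `model.load_state_dict(..., strict=False)`.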

🧊 Anchor Boxes

📌 Use anchor boxes with sizes/ratios close to your target boxes
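A common way to get anchors that match your targets is to cluster the ground-truth box sizes. A toy Euclidean k-means sketch (YOLO-style pipelines usually cluster with a 1 − IoU distance instead; the deterministic init and `len(wh) >= k` assumption are mine):

```python
def kmeans_anchors(wh, k, iters=20):
    """Cluster ground-truth (width, height) pairs into k anchor sizes.

    Plain Euclidean k-means over box dimensions; initial centers are
    spread across the sorted sizes so the result is deterministic.
    """
    centers = sorted(wh)[:: max(1, len(wh) // k)][:k]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for w, h in wh:
            # Assign each box to its nearest center
            i = min(range(len(centers)),
                    key=lambda j: (w - centers[j][0]) ** 2
                                  + (h - centers[j][1]) ** 2)
            groups[i].append((w, h))
        # Move each center to the mean of its group
        centers = [
            (sum(w for w, _ in g) / len(g), sum(h for _, h in g) / len(g))
            if g else c
            for g, c in zip(groups, centers)
        ]
    return centers
```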

📌 Make sure you know what your anchor boxes look like

📌 Try anchor-free OD models (e.g., VFNet): no anchor boxes, and some perform even better than anchor-based models

🧊 Data augmentation

📌 Oversample images with small boxes
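One simple way to oversample is per-image sampling weights. A sketch (the 2x weight and the 32x32 px "small" threshold, borrowed from COCO's size buckets, are assumptions you should tune):

```python
def oversampling_weights(per_image_boxes, small_area=32 * 32):
    """Per-image sampling weights that favour images with small boxes.

    `per_image_boxes` holds, for each image, a list of (w, h) box
    sizes. Images containing at least one box under `small_area` get
    double weight; feed the result to a weighted sampler such as
    torch.utils.data.WeightedRandomSampler.
    """
    weights = []
    for boxes in per_image_boxes:
        has_small = any(w * h < small_area for w, h in boxes)
        weights.append(2.0 if has_small else 1.0)
    return weights
```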

📌 Use transforms close to your use case

📌 Use Copy & Paste boxes data augmentation

📌 Use mosaic data augmentation

📌 Some suggest heavy data augmentation at the beginning of training and light data augmentation at the end
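The heavy-to-light idea can be as simple as a decaying strength factor that scales your transforms' probability or magnitude. A linear sketch (the linear shape and the 1.0 → 0.1 range are assumptions; step or cosine decays work too):

```python
def aug_strength(epoch, total_epochs, start=1.0, end=0.1):
    """Linearly decay augmentation strength over training.

    Returns a factor in [end, start]: multiply your transforms'
    probabilities/magnitudes by it so early epochs see heavy
    augmentation and the last epochs see nearly clean images.
    """
    t = epoch / max(1, total_epochs - 1)
    return start + (end - start) * t
```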

🧊 Training

📌 Don’t worry too much about overfitting at the beginning of a project: adding more data / data augmentation should mitigate it

📌 Train for longer (more epochs), as long as your loss keeps decreasing

📌 In transfer learning, when freezing NN layers, leave the BatchNorm layers trainable
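A name-based sketch of that freezing policy (matching "bn" in parameter names is a naming-convention heuristic; robust PyTorch code would check module types with `isinstance(m, nn.BatchNorm2d)` and set `param.requires_grad` from the resulting mask):

```python
def trainable_mask(param_names, frozen_prefix="backbone."):
    """Decide which parameters stay trainable when freezing a backbone.

    Everything under `frozen_prefix` is frozen *except* BatchNorm
    parameters: keeping their affine weights trainable lets the
    normalisation adapt to the new data distribution even while the
    rest of the backbone is frozen.
    """
    return {
        name: (not name.startswith(frozen_prefix)) or ("bn" in name)
        for name in param_names
    }
```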

📌 Train using progressive resizing
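Progressive resizing just means an epoch-to-resolution schedule, small images first. A sketch (the three sizes and the equal stage lengths are example values, not from the thread):

```python
def progressive_sizes(total_epochs, sizes=(256, 384, 512)):
    """Assign an input resolution to each epoch, small to large.

    Early epochs train fast on small images and learn coarse features;
    later epochs fine-tune at the final resolution. Leftover epochs
    (when total_epochs isn't divisible) run at the largest size.
    """
    per_stage = total_epochs // len(sizes)
    schedule = []
    for size in sizes:
        schedule += [size] * per_stage
    schedule += [sizes[-1]] * (total_epochs - len(schedule))
    return schedule
```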

📌 Use a discriminative learning rate: a low learning rate for the backbone, a higher one for the head
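In PyTorch this is expressed as optimiser parameter groups. A sketch building them from `model.named_parameters()`-style pairs (the 0.1 backbone factor and the "backbone." name prefix are assumptions about your model):

```python
def lr_param_groups(named_params, base_lr=1e-3, backbone_factor=0.1):
    """Build optimiser parameter groups with a lower backbone LR.

    Pretrained backbone weights only need small updates, so they get
    base_lr * backbone_factor; the randomly initialised head gets the
    full base_lr. The returned dicts use the per-parameter-group
    format that torch.optim optimisers accept.
    """
    backbone = [p for n, p in named_params if n.startswith("backbone.")]
    head = [p for n, p in named_params if not n.startswith("backbone.")]
    return [
        {"params": backbone, "lr": base_lr * backbone_factor},
        {"params": head, "lr": base_lr},
    ]
```

You would then pass the result straight to the optimiser, e.g. `torch.optim.AdamW(lr_param_groups(model.named_parameters()))`.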

📌 Use the LR scheduler recommended for your model

🧊 Inference

📌 Use the same image size at inference as the one you trained your model with

📌 With high-resolution images, run inference on patches/slices and stitch the results together: e.g., Slicing-Aided Hyper Inference (SAHI)
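The core of sliced inference is computing overlapping tile coordinates. A sketch (the 512 px tile and 64 px overlap are example values; SAHI itself adds details like per-tile resizing and merging strategies):

```python
def slice_boxes(img_w, img_h, tile=512, overlap=64):
    """Compute overlapping tile coordinates for sliced inference.

    Run the detector on each (x1, y1, x2, y2) tile, shift detections
    back by the tile's (x1, y1) offset, then merge everything with
    NMS. The overlap keeps objects that straddle a tile border fully
    visible in at least one tile.
    """
    step = tile - overlap
    tiles = []
    for y in range(0, max(1, img_h - overlap), step):
        for x in range(0, max(1, img_w - overlap), step):
            tiles.append((x, y, min(x + tile, img_w), min(y + tile, img_h)))
    return tiles
```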
📌 Don’t forget to put the model in evaluation mode (model.eval()): it switches Dropout and BatchNorm to inference behaviour. Note it does not disable gradient tracking; wrap inference in torch.no_grad() for that.
🎉 This is my longest thread since I joined 🐦

If you like this kind of content, follow @ai_fast_track for more demystified OD / CV content in your feed

πŸ™If you could give the thread a quick retweet, it would help others discover this content! Thanks!


