🎉🎊🥳Day 30/30: Labeling data and training object detection models are time consuming and expensive.
Here is a Survey of Self-Supervised and Few-Shot Object Detection (FSOD). Those technique aim at alleviating those issues.
The authors have categorized, reviewed, and compared several few-shot and self-supervised object detection methods
📌 FSOD is about training a model on novel (unseen) object classes with little data. It still requires prior training on many labeled data of base (seen) classes
📌 Self-Supervised Learning (SSL) methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection
📌 Combining few-shot and self-supervised object detection is a promising research direction
Day 26/30: General Gaussian Heatmap Labeling is Arbitrary-Oriented Object Detection AOOD
📌 It uses an anchor-free object-adaptation label assignment strategy to define positive candidates based on 2D GH, reflecting shape and direction features of arbitrary-oriented objects.
📌 GGHL improves the AOOD performance with low parameter-tuning and time costs.
📌 It is also applicable to most AOOD methods to improve their performance including lightweight models.
📰 Paper: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object Detection
Day 24/30: 🥇 EfficientDet is a very popular object detection model for a good reason!
Let’s see why
📌 EfficientDet achieved State-Of-The-Art (SOTA) accuracy while reducing both the size of parameters, and the FLOPS, when it was released. It’s still a very good contender.
📌 Before introducing EfficientDet, models were getting impressively big to achieve SOTA results
❓ The authors asked the following question:
Is it possible to build a scalable detection architecture with both higher accuracy and better efficiency across # resource constraints?
So, they systematically studied neural network architecture design choices for object detection, and proposed several key optimizations to improve efficiency:
1- A weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion