๐๐๐ฅณDay 30/30: Labeling data and training object detection models are time consuming and expensive.
Here is a Survey of Self-Supervised and Few-Shot Object Detection (FSOD). Those technique aim at alleviating those issues.
The authors have categorized, reviewed, and compared several few-shot and self-supervised object detection methods
๐ FSOD is about training a model on novel (unseen) object classes with little data. It still requires prior training on many labeled data of base (seen) classes
๐ Self-Supervised Learning (SSL) methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection
๐ Combining few-shot and self-supervised object detection is a promising research direction
Day 26/30: General Gaussian Heatmap Labeling is Arbitrary-Oriented Object Detection AOOD
๐ It uses an anchor-free object-adaptation label assignment strategy to de๏ฌne positive candidates based on 2D GH, re๏ฌecting shape and direction features of arbitrary-oriented objects.
๐ GGHL improves the AOOD performance with low parameter-tuning and time costs.
๐ It is also applicable to most AOOD methods to improve their performance including lightweight models.
๐ฐ Paper: A General Gaussian Heatmap Labeling for Arbitrary-Oriented Object Detection
Day 24/30: ๐ฅ EfficientDet is a very popular object detection model for a good reason!
Letโs see why
๐ EfficientDet achieved State-Of-The-Art (SOTA) accuracy while reducing both the size of parameters, and the FLOPS, when it was released. Itโs still a very good contender.
๐ Before introducing EfficientDet, models were getting impressively big to achieve SOTA results
โ The authors asked the following question:
Is it possible to build a scalable detection architecture with both higher accuracy and better efficiency across # resource constraints?
So, they systematically studied neural network architecture design choices for object detection, and proposed several key optimizations to improve efficiency:
1- A weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multiscale feature fusion