Tweet

@ai_fast_track

More from @ai_fast_track

AI Fast Track

@ai_fast_track

18 Nov

How to create a robustness evaluation dataset?

"Natural Adversarial Objects" (NAO) dataset is a challenging robustness evaluation dataset for models trained on MSCOCO

📌 Models generally perform well on large scale training sets

📌 They generalize on test sets coming from the same distribution

📌 When using NAO dataset, EfficientDet-D7 mAP reduced by 74.5% compared to MSCOCO

📌 Faster RCNN reduced by 36.3% compared to MSCOCO

📌 They evaluated 7 SOTA models, and showed they consistently fail to perform accurately on NAO, comparing to MSCOCO

📌 The drop is present on both in-distribution and out-of-distribution objects

Read 6 tweets

AI Fast Track

@ai_fast_track

16 Nov

@OpenMMLab

📢 The amazing @OpenMMLab just released a new project:

MMFlow: an open-source optical flow toolbox written in Pytorch

OpenMMLab hosts several impressive open-source projects for both academic research and industrial applications.

OpenMMLab covers a wide range of research topics of computer vision, e.g., classification, detection, segmentation and super-resolution.

📌 MMCV: Foundational library for computer vision.

📌 MIM: MIM Installs OpenMMLab Packages.

📌 MMClassification: Image classification toolbox and benchmark.

📌 MMDetection: Detection toolbox and benchmark.

📌 MMDetection3D: Next-generation platform for general 3D object detection.

📌 MMSegmentation: Semantic segmentation toolbox and benchmark.

Read 6 tweets

AI Fast Track

@ai_fast_track

15 Nov

4 Feature Pyramid Network (FPN) Design you should know:

FPN, PANet, NAS-FPN, and BiFPN

📌 (a) FPN uses a top-down pathway to fuse multi-scale features from level 3 to 7 (P3 - P7);

📌 (b) PANet adds an additional bottom-up pathway on top of FPN;

📌 (c) NAS-FPN uses neural architecture search to ﬁnd an irregular feature network topology and then repeatedly apply the same block;

📌 (d) BiFPN is a bit similar to PANet, adds shortcut fusing, and then repeatedly apply the same block

📝 Some other observations:

📌 The model diagram corresponds to the One-Stage Object Detection Architecture

📌 The FPN illustration is extracted from the EfficientDet paper

📌The (P3-P5) layers are referred as the Convolutional (C3-C5) Layers in other papers

Read 5 tweets

AI Fast Track

@ai_fast_track

5 Nov

https://twitter.com/ai_fast_track/status/1407333442145206280

🤔 How to increase your Small Object Detection Average Precision APs?

💡 By increasing both image and backbone sizes when training your model:

📌 Increasing both image and backbone sizes in EfficientDet jumped APs by 14+%

📌 Increasing backbone size in RFBNet increased APs

https://twitter.com/ai_fast_track/status/1407333442145206280

📌 Increasing image size from 320 to 608 in PP-YOLO led to 10+% increase in APs

For more tips and tricks to improve small object detection tips & tricks, check out the list I shared in my first tweet.

Benchmarks are extracted from the PP-YOLO paper:

📰 Paper: PP-YOLO: An Effective and Efficient Implementation of Object Detector

PDF: arxiv.org/pdf/2007.12099…

Read 4 tweets

AI Fast Track

@ai_fast_track

4 Nov

🥇 FCOS3D won the 1st place out of all the vision-only methods in the nuScenes 3D Detection Challenge of NeurIPS 2020.

Here is a brief description:

📌 FCOS3D is a monocular 3D object detector

📌 It’s an anchor-free model based on FCOS (2D) counterpart

📌 It replaces the FCOS regression branch by 6 branches

📌 The center-ness is redeﬁned with a 2D Gaussian distribution based on the 3D-center

📌 The authors showed some failure cases, mainly focused on the detection of large objects and occluded objects.

⏹ Source code and models are shared in the MMDetection3D repo:
github.com/open-mmlab/mmd…

⏹ MMDetection3D also has many other 3D detection models:

Read 6 tweets

AI Fast Track

@ai_fast_track

2 Nov

YOLO Real-Time (YOLO-ReT) architecture targets edge devices.

It achieves 68.75 mAP on Pascal VOC and 34.91 mAP on COCO using MobileNetV2×0.75 backbone.

Here is a brief description of the YOLO-ReT 👇

Both model accuracy and execution time (Frame Per Second) are crucial when deploying a model on edge device. YOLO-ReT is based on these 2 ideas:

⏹ Backbone Truncation: Only 60% of the backbone is initialised with pretrained weights. Using all the weights harms model accuracy

⏹ Raw Feature Collection and Redistribution (RFCR):

📌 Fuse {C2, C3, C4} into C5 layer (fused feature map)

📌 Discard last CNN layers

📌 Pass the fused feature map through a 5x5 Mobile Convolution block (MBConv)

Read 6 tweets

Share this page!

AI Fast Track

Try unrolling a thread yourself!

More from @ai_fast_track

AI Fast Track

AI Fast Track

AI Fast Track

AI Fast Track

AI Fast Track

AI Fast Track

Did Thread Reader help you today?

Like this author's thread?