Many open-world applications require the detection of novel objects.
but state-of-the-art object detection and instance segmentation models are unable to do so.
β’ Itβs because models learn to suppress any unannotated objects by treating them as background
β’ To address that issue, the authors propose a simple yet surprisingly powerful data augmentation and training scheme they call Learning to Detect Every Thing (LDET)
β’ To avoid suppressing hidden (unannotated) objects, background objects that are visible but unlabeled, they paste annotated objects on a background image sampled from a small region of the original image (see figure)
β’ Trained a VFNet model using IceVision and @fastdotai
β’ Reached 73%πin the COCO metric score
Blog posts: π
π Main takeaway of this story is: You can learn object detection very quickly if You:
β’ Are determined
β’ Follow the optimal learning path
β’ Embrace the 80-20, and the KISS principles.
β’ Have access to high curated content, and libraries
β’ Know how to avoid roadblocks
β’ Stay focus, and avoid distraction
β¨ Like many things in life, object detection is:
β’ neither too hard
β’ nor too easy
β’ right in between ... when you have the right ingredients