My Authors
Read all threads
Check out our paper today at #CVPR2020 🎉🎉🎉
cvpr20.com/event/context-…

Context R-CNN: Long Term Temporal Context for Per Camera Object Detection

Come ask questions at the live Q&As!

First session: June 18, 3-5 PM PDT
Second session: June 19, 3-5 AM PDT

Our @CVPR spotlight vid:
In static cameras (like #cameratraps), relevant context for identifying objects can be spread out across long time horizons. For example, these two images come from the same camera and are uncannily similar, but they were taken a month apart! Turns out animals are pretty habitual
We propose a simple and flexible method for aggregating context from up to a month of data, using attention! We first build an (unsupervised) “memory bank” for each location. We add context for each object by finding features in the memory bank that help us identify that object.
The best news is, it really helps! When testing this method on the #SnapshotSerengeti Dataset (lila.science/datasets/snaps…), we saw a 47.6% relative improvement in mAP from the baseline Faster R-CNN model 🎉🎉🎉

Check out the results in our paper: openaccess.thecvf.com/content_CVPR_2…
The biggest improvements are on challenging cases, ones that even a human might need some context to identify correctly. Static cameras collect lots of difficult-to-ID images because they can't rely on a human photographer to make sure the objects of interest are easy to see & ID Here we identify an object that's moving out of the frame, tHere we found an elephant hidden behind a tree!In this one we found and identified a second monkey, even thHere we found all the impala in the image, despite the fact
Huge thanks to my co-authors @jonathanhuang11, @GuanhangW, @vivekrathod, and Ronny Votel! And to the @WildInsights AI team, the Visual Dynamics team at @GoogleAI, and Pietro's lab at @Caltech for letting me bounce ideas off of you all 🥰
We also find that increasing the time horizon that we use when providing context increases performance! When we use longer time horizons we are better able to take advantage of how habitual animals are. That warthog that uses the same game trail every day? Got em. 🐗
If you want to see how it works, we've set up a demo in @GoogleColab with our Snapshot Serengeti trained models: colab.research.google.com/github/tensorf…

We provided a few test images, but feel free to try it on other sets of camera trap data from Snapshot Serengeti or elsewhere in East Africa!
The model was trained on Season 1-6, with the train split locations recommended by lila.science/datasets/snaps…. Any images without ground truth bounding box labels got given 'weakly supervised' bounding box labels using the @Microsoft_Green #MegaDetector prior to training.
The code is open-sourced in the @TensorFlow Object Detection API, release notes here: github.com/tensorflow/mod…
Missing some Tweet in this thread? You can try to force a refresh.

Keep Current with Sara Beery | #CVPR2020

Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!