The following tips may boost model performance across different network structures by up to 5% mAP (mean Average Precision) without increasing computational cost.

#computervision #pytorch #deeplearning #deeplearningai #100daysofmlcode #neuralnetworks #AI
Visually Coherent Image Mix-up for Object Detection. This technique has already proven successful at reducing adversarial vulnerability in classification networks, and it was tested for detection on COCO 2017 and PASCAL VOC with YOLOv3 models.
#computervision #pytorch
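A minimal sketch of the mix-up idea for detection, assuming grayscale images stored as nested lists and boxes stored as `(x1, y1, x2, y2, class_id)` tuples. The name `mixup_detection`, the Beta(1.5, 1.5) mixing ratio, and the zero padding are illustrative assumptions, not the exact recipe evaluated on COCO or PASCAL.

```python
import random


def mixup_detection(img_a, boxes_a, img_b, boxes_b, lam=None, rng=random):
    """Blend two images pixel-wise and keep the boxes of both.

    Images are lists of rows of floats (grayscale, for brevity).
    Neither image is resized or cropped: the output canvas is as large
    as the larger image, and pixels missing from an image count as 0.
    """
    if lam is None:
        # Mixing ratio drawn from a Beta distribution (assumed shape 1.5, 1.5).
        lam = rng.betavariate(1.5, 1.5)

    height = max(len(img_a), len(img_b))
    width = max(len(img_a[0]), len(img_b[0]))

    def pixel(img, y, x):
        # Positions outside an image contribute nothing to the blend.
        if y < len(img) and x < len(img[0]):
            return img[y][x]
        return 0.0

    mixed = [
        [lam * pixel(img_a, y, x) + (1.0 - lam) * pixel(img_b, y, x)
         for x in range(width)]
        for y in range(height)
    ]
    # No box was moved or cropped, so the labels are simply concatenated.
    return mixed, list(boxes_a) + list(boxes_b), lam
```

Because the geometry of both images is preserved, the boxes stay valid without any coordinate changes, which is what keeps the blended sample visually coherent.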
🤖 Optical Character Recognition (OCR) is one of the most important applications of #ComputerVision in the real world.

In this thread, we cover some of our popular free tutorials on #OCR, which will get you started in no time. 🚀

What is Optical Character Recognition?

✨Accept an image
✨Detect the text
✨Convert the text to machine-readable format
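The three steps above can be sketched in a few lines, assuming the Tesseract engine plus the `pytesseract` and Pillow packages are installed. The function name `image_to_text` and the default language are illustrative choices.

```python
def image_to_text(image_path, lang="eng"):
    """Run the three OCR steps on one image file and return the text."""
    # Imported inside the function so this file still loads before the
    # OCR packages (assumed: Pillow and pytesseract) are installed.
    from PIL import Image
    import pytesseract

    image = Image.open(image_path)  # 1. accept an image
    # 2 and 3. Tesseract detects the text and returns it as a Python string.
    return pytesseract.image_to_string(image, lang=lang)
```

Calling `image_to_text("receipt.png")` (a hypothetical file) would return whatever text Tesseract recognises in that image.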


Installing #Tesseract, #PyTesseract, and #Python OCR Packages On Your System

✨Install the Tesseract OCR engine on your machine
✨Create a Python virtual environment for installation
✨Install necessary Python packages


#Highlights2021 for me: our #survey on efficient processing of #sparse and compressed tensors of #ML/#DNN models on #hardware accelerators published in @ProceedingsIEEE.
RT/sharing appreciated. 🧵
Context: Tensors of ML/DNN models are compressed by leveraging #sparsity, #quantization, and shape reduction. We summarize several such sources of sparsity & compression (§3). Sparsity can be induced in a structured form through pruning, and it is inherently unstructured for various applications or sources. Various sources induce stru... Common structures of sparsi...
Likewise, leveraging value similarity or approximate operations could yield irregularity in processing. Also, techniques for size-reduction make tensors asymmetric-shaped. Hence, special mechanisms can be required for efficient processing of sparse and irregular computations.
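To make the irregularity concrete, here is a small sketch of one widely used compressed format, CSR (compressed sparse row), and a matrix-vector product over it. CSR is chosen here purely as an illustration; the names `to_csr` and `csr_matvec` are hypothetical.

```python
def to_csr(dense):
    """Encode a dense 2-D list as (values, column_indices, row_pointers)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for col, value in enumerate(row):
            if value != 0:
                values.append(value)
                col_idx.append(col)
        # row_ptr[r + 1] marks where row r ends inside `values`.
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_matvec(csr, vector):
    """Multiply a CSR matrix by a dense vector, skipping all zeros."""
    values, col_idx, row_ptr = csr
    result = []
    for r in range(len(row_ptr) - 1):
        start, end = row_ptr[r], row_ptr[r + 1]
        # The amount of work per row depends on its non-zero count,
        # and the reads from `vector` are indirect (via col_idx).
        result.append(sum(values[k] * vector[col_idx[k]]
                          for k in range(start, end)))
    return result
```

The uneven per-row work and the indirect indexing are typical of the irregular computation that sparse processing has to handle.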
GauGAN2 combines segmentation mapping, inpainting and text-to-image generation in a single model

#Computervision #AI #ArtificialIntelligence #TensorFlow #PyTorch #DeepLearning #DataScience #MachineLearning #100DaysOfMLCode #Python #DataScientist #Statistics #Mathematics

Unlike GauGAN1, GauGAN2 can translate natural-language descriptions into landscape images. Typing a phrase like “sunset at a beach” generates the scene.

#Computervision #AI #ArtificialIntelligence #TensorFlow #PyTorch #DeepLearning #DataScience #MachineLearning #Math
Image interpolation occurs when you resize or distort your image from one pixel grid to another.


#computervision #IMAGE #DataScience #MachineLearning #DeepLearning #100DaysOfMLCode #Python #programming #Math #Stat #dataviz #DataAnalytics #AI #ArtificialIntelligence #data
Image interpolation works in two directions, and tries to achieve a best approximation of a pixel's intensity based on the values at surrounding pixels.


#computervision #IMAGE #DataScience #MachineLearning #DeepLearning #100DaysOfMLCode #Python #programming #Math #Stat
Image resizing is necessary when you need to increase or decrease the total number of pixels, whereas remapping can occur when you are correcting for lens distortion or rotating an image.

#computervision #DataScience #MachineLearning #DeepLearning #100DaysOfMLCode #Python
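A minimal sketch of one common interpolation scheme, bilinear interpolation, which blends the four surrounding pixels along x and then along y. It assumes a grayscale image stored as a nested list; the corner-aligned mapping in `resize_bilinear` is one of several possible conventions, and both function names are illustrative.

```python
def bilinear_sample(img, x, y):
    """Estimate the intensity at fractional position (x, y)."""
    height, width = len(img), len(img[0])
    # Clamp so that the four neighbours always lie inside the grid.
    x = min(max(x, 0.0), width - 1.0)
    y = min(max(y, 0.0), height - 1.0)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    fx, fy = x - x0, y - y0
    # Interpolate horizontally on the two rows, then vertically between them.
    top = (1 - fx) * img[y0][x0] + fx * img[y0][x1]
    bottom = (1 - fx) * img[y1][x0] + fx * img[y1][x1]
    return (1 - fy) * top + fy * bottom


def resize_bilinear(img, new_height, new_width):
    """Resample img onto a new pixel grid (corner pixels stay aligned)."""
    height, width = len(img), len(img[0])
    sy = (height - 1) / (new_height - 1) if new_height > 1 else 0.0
    sx = (width - 1) / (new_width - 1) if new_width > 1 else 0.0
    return [[bilinear_sample(img, c * sx, r * sy) for c in range(new_width)]
            for r in range(new_height)]
```

Remapping for rotation or lens correction would call `bilinear_sample` with the source coordinates produced by the geometric transform instead of a regular grid.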
Incredibly happy that our @RoboStack paper has been accepted to the @ieeeras Robotics & Automation Magazine 🥳. @RoboStack brings together #ROS @rosorg with @condaforge and @ProjectJupyter. Preprint: Find out some key benefits in this 🧵: 1/n
You can now easily use ROS (both ROS1 #Noetic and ROS2 #Galactic) on a wide range of platforms: @linuxfoundation (not just a specific Ubuntu, any Linux!), @Apple #MacOS and @Microsoft @Windows - even on ARM processors including the new M1 (work still in progress, though). 2/n
Thanks to the tight coupling to @condaforge, this enables you to (very easily) install ROS side-by-side with thousands of scientific libraries, including recent #computervision and #machinelearning ones (think @TensorFlow, @opencvlibrary, @PyTorch and more). 3/n
@amaarora explained #convolutions really well in the #fastbook week 12 session, which can be viewed here

I wasn't able to write a blog post explaining my learnings from the stream, so I'm writing a 🧵 instead.

After going through the first part of the #convolutions chapter, I cleared up one concept and was introduced to two new ones.

1. How depthwise convolutions work (3/n)
2. Dilated convolutions (7/n)
3. Alternate interpretation of #stride (9/n)

When we have an n-channel input and an m-channel output, we need to convolve not only over two dimensions (W x H) but also across the depth D.

An RGB image, for example, has 3 channels.

Say we want to derive 10 feature maps from this input.
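A small sketch of what this means for the weights, assuming a standard convolution (every output channel sums over all input channels) and a 3 x 3 kernel. The kernel size is an assumption, since the thread does not state one, and the helper names are hypothetical.

```python
def conv_weight_shape(in_channels, out_channels, kernel_size):
    """Shape of the weight tensor: (out, in, kernel_height, kernel_width)."""
    return (out_channels, in_channels, kernel_size, kernel_size)


def conv_at(patch, weights, biases):
    """Apply the layer at one spatial position.

    patch[c][i][j]      : input window, one k x k slice per input channel
    weights[o][c][i][j] : one k x k kernel per (output, input) channel pair
    Returns one value per output channel (one pixel of each feature map).
    """
    outputs = []
    for kernels, bias in zip(weights, biases):
        total = bias
        # Sum over the depth (input channels) as well as over W x H.
        for window, kernel in zip(patch, kernels):
            for window_row, kernel_row in zip(window, kernel):
                for value, weight in zip(window_row, kernel_row):
                    total += value * weight
        outputs.append(total)
    return outputs


shape = conv_weight_shape(in_channels=3, out_channels=10, kernel_size=3)
n_weights = shape[0] * shape[1] * shape[2] * shape[3]
print(shape, n_weights)  # (10, 3, 3, 3) 270
```

So 10 feature maps from an RGB input need 10 kernels that are each 3 channels deep, plus one bias per feature map.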
"torch.manual_seed(3407) is all you need"!
Sorry for the title. I promise it's not (entirely) just for trolling. It's my little spare time project of this summer to investigate unaccounted randomness in #ComputerVision and #DeepLearning.
🧵👇 1/n
The idea is simple: after years of reviewing deep learning stuff, I am frustrated at never seeing a paragraph that shows how robust the results are w.r.t. randomness (initial weights, batch composition, etc.). 2/n
After seeing several videos by @skdh about how experimental physics claims tend to disappear through repetition, I got the idea of gauging the influence of randomness by scanning a large number of seeds. 3/n
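A toy sketch of such a seed scan using only the standard library. `noisy_experiment` is a hypothetical stand-in for "train a model and return its accuracy", and the 0.90 baseline and noise level are made up for illustration; in a PyTorch run the seed would instead be set with `torch.manual_seed(seed)` before training.

```python
import random
import statistics


def noisy_experiment(seed):
    """Toy stand-in for one full training run."""
    rng = random.Random(seed)  # with PyTorch: torch.manual_seed(seed)
    # Hypothetical outcome: a fixed accuracy plus seed-dependent noise.
    return 0.90 + rng.gauss(0.0, 0.005)


def scan_seeds(experiment, seeds):
    """Run the experiment once per seed and summarise the spread."""
    scores = {seed: experiment(seed) for seed in seeds}
    values = list(scores.values())
    return {
        "best_seed": max(scores, key=scores.get),
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
    }


report = scan_seeds(noisy_experiment, range(100))
print(report)
```

Reporting the mean and standard deviation next to the best score shows how much of a claimed improvement could be explained by the seed alone.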
Quick Tweet Storm ⛈

How does AI bounding box detection work?

🧠 Learn in 30 seconds

#100DaysOfCode #CodeNewbie #MadeWithTFJS #MachineLearning #ComputerVision
It looks so simple when #AI does it, right?

But #machinelearning doesn't give you an image, it gives you data. It's up to you to make it look simple.
You might think a #FrontEnd box gives you four values, and you're right, but it only gives you TWO points. From that you can infer a box to draw with #html5.
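A minimal sketch of that inference, assuming the model returns two opposite corners as `(x1, y1, x2, y2)`. The helper name `corners_to_box` is hypothetical; the `(left, top, width, height)` layout is what rectangle-drawing calls such as the HTML5 canvas typically expect.

```python
def corners_to_box(x1, y1, x2, y2):
    """Turn two opposite corner points into (left, top, width, height)."""
    # min/abs make the result independent of which corner comes first.
    left, top = min(x1, x2), min(y1, y2)
    return left, top, abs(x2 - x1), abs(y2 - y1)


print(corners_to_box(10, 20, 110, 70))  # (10, 20, 100, 50)
```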
I'm really looking forward to participating in the forthcoming @markcubanai Boot Camps this Fall! I will be participating as a mentor and collaborating with @Caltech and the @AppAcademyPHS! cc @mcuban #AI #artificialintelligence #machinelearning #ML
The Mark Cuban Foundation works with local companies to host Introduction to #AI bootcamps for underserved high school students (9th-12th grade) at no cost.
The program does not have any prerequisites or require any prior experience with coding. Students with any level of interest in technology will walk away from the @markcubanai program with a greater understanding of #AI
#CVPR2021 #cvpr2021_cv4aec is Live!
@li_fuxin is opening the 1st "Workshop & Challenge on #ComputerVision in the #BuiltEnvironment for the #Design, #Construction and #Operation of #Buildings"
The winner of the 2D challenge is the Institute of Automation, Chinese Academy of Sciences

The winner of the 3D challenge is Purdue University

Congrats! #CVPR2021 #cvpr2021_cv4aec
First Keynote talk by Prof. Derek Hoiem @Illinois_Alma @reconstructinc #CVPR2021 #cvpr2021_cv4aec
This paper shares 56 stories of researchers in Computer Vision, young and old, scientists and engineers. Reading it was a cocktail of emotions as you simultaneously relate to the stories of joy, excitement, cynicism, and fear. Give it a read!

Some quotes from the stories - it was a "tough and hopeless time" in computer vision "before 2012, [when] the annual performance improvements over ImageNet are quite marginal."
"she told me you should solve the problem purely based on deep learning... I did not think the occlusion problem can be solved without explicitly reasoning of shape priors and depth ordering"
Excited for my first re:Invent and getting ready for the @ajassy keynote coming up!

#awsreinvent #AWS #cloud
Loving the opening music act 🎸
Zach Person is the musician playing, so awesome...

Here is his Instagram -
💡The challenge: teach my iPhone to recognize sharks and fish in images without writing a single line of code.

Let's see how close we can get to #NoCode #mobile #ComputerVision in 2020. A how-to thread 👇 1/13
Step 1: Collecting images.

I went to the Omaha Zoo and spent a few hours wandering around the aquarium taking photos. 2/13
Step 2: Curating.

After looking through what I collected, I decided to train a model to detect the following critters:

🐠 fish
🦑 jellyfish
🐧 penguins
🦈 sharks
🐣 puffins
🐝☀️ stingrays
⭐️🐟 starfish
🧑 humans, and
🐢 turtles.

Many of us are distracted by phones, tech, too many notifications etc, so what's the solution?

Is it really more tech? Such as these new smart glasses that use #computervision?… #Mindfulness #DigitalHealth
At least they are able to offer a higher standard of privacy
I do wonder how accurate the activity tracking will be, given the accelerometer is at eye level. The last time I tried smart glasses, the distance walked and step counts were far higher than they should have been (I am guessing because the accelerometer sits above the shoulders).
Please help us welcome our next curator Darryl Takudzwa Griffiths. @BlaqNinja completed his Bachelor's degree in Computer Engineering at DUT, graduating in 2011. After struggling to find suitable employment, he went on to earn multiple certificates from bodies such as Microsoft.
He has certificates in N+ (Computer Networking), A+ (Computer Technician & Technical Support), Certified Ethical Hacking V7 (CEH v7), and Offensive Security Certified Professional (OSCP). Sadly, even with these, he could not secure his desired post, so in 2016 he moved to the USA.
Darryl was able to secure a job as a system analyst & security architect at a corporation that owns casinos. Within the same year he embarked on a Master's degree in Robotics & Artificial Intelligence Engineering. In 2017 he resigned from his post and started his own company...
5 Reasons to Learn Python:

1. Easy to Learn.😀

2. Versatility:

▪ You can use it for 🌐Web Development, 📊Data Science, 💻Machine Learning, Computer 🔎Vision, 📈Data Analysis and Visualization, Scripting, 💻Gaming and Robotics.

3. High Salary.💰

4. Scalability:
▪ It is so 👊powerful that you can build real-world🌐 applications.

5. Job Market:

▪ High📈 Demand and 📉Low Supply of Python🐍 Developers.💻

▪ Python is number 1 Programming language for ML and AI.

▪ By Learning Python you are actually investing💰 in your future.
Introducing RepNet, a model that counts repetitions in videos of *any* action

w @yusufaytar, @JonathanTompson, @psermanet and Andrew Zisserman


#CVPR2020 #computervision #deeplearning
Here's an overview of RepNet's architecture.
An integral part of RepNet is the temporal self-similarity matrix (TSM) which not only makes it easy to count repetitions but also drives generalization to actions and domains not seen during training.
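A rough sketch of how a temporal self-similarity matrix can be built from per-frame embeddings. The similarity used here (negative squared Euclidean distance followed by a row-wise softmax) and the `temperature` parameter are assumptions for illustration, not a confirmed description of RepNet's implementation; the function name is hypothetical.

```python
import math


def self_similarity_matrix(embeddings, temperature=1.0):
    """Return an N x N matrix comparing every frame with every other frame."""
    n = len(embeddings)
    matrix = []
    for i in range(n):
        # Similarity = negative squared distance between frame i and frame j.
        sims = [
            -sum((a - b) ** 2 for a, b in zip(embeddings[i], embeddings[j]))
            / temperature
            for j in range(n)
        ]
        # Row-wise softmax, shifted by the max for numerical stability.
        peak = max(sims)
        exps = [math.exp(s - peak) for s in sims]
        total = sum(exps)
        matrix.append([e / total for e in exps])
    return matrix


# Toy 1-D "embeddings" of an action that repeats every two frames.
tsm = self_similarity_matrix([[0.0], [1.0], [0.0], [1.0]])
```

A repeating action shows up as a periodic pattern in the matrix, regardless of what the frames actually depict, which is one intuition for why such a representation can generalize.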
Three days participating in the #CVPR2020 conference. Excited about a lot of interesting subjects covered in computer vision: adversarial learning, effective training and inference, representation learning...

Will do a write-up later.
#CVPR20 #computervision
Some preferred papers so far 👇
1. Dynamic Graph Message Passing Networks…
It addresses the problem of modelling long-range dependencies by treating feature-map locations as feature-vector nodes and dynamically sampling the neighborhood of each node from the feature graph.
2. Semantic Pyramid for Image Generation…
A generative image model that can leverage the feature space from different semantic levels learned by a pretrained classification network. Many generative applications to play with.
Thanks to @turo I'm renting a #TeslaModel3 during my visit to Silicon Valley - certainly a different experience than traditional #carrental agencies #future #zeroemission #electriccars
Gone to Tesla's HQ in Fremont, California to try the supercharging there. It's supposed to be much faster than before.
I had hoped the Model 3 would charge at the new rate but it didn't happen
Now reading the #TopolReview report - Preparing the healthcare workforce to deliver the digital future. It's an illuminating read #DigitalHealth
It's clear that a lot of effort has gone into the #TopolReview - it paints a glorious vision of what healthcare would look like over the next 20 years (for both staff and patients) - enabled by technology such as #genomics, #AI and #robotics
For me the #Topolreview has too narrow a focus. It's been commissioned by healthcare, yet we know that with an aging population, many people with long term conditions have to rely upon #socialcare as well. We need joined up thinking, imho.
#PillPack #Acquisition:

Hi #entrepreneur #startup: One of the best businesses to start - building up the best #distribution pipeline and providing awesome #CustomerExperience .

#Technology #ideas

If someone else is building the distribution network with awesome #CustomerExperience in your space, then be very scared - Ask the Hollywood studios about Netflix and Amazon Prime

#Amazon #Technology #PillPack #acquisition #entrepreneur #startups

Amazon to buy (for $1bn) online pharmacy @PillPack which ships prescriptions around the US, and overnight.

Under threat: $400 billion pharmacy business. @Walmart @Walgreens @cvspharmacy stocks dropped.

#Amazon #Technology #PillPack #acquisition #entrepreneur #startup
