We build models optimized for a specific type of dataset like:
- text
- audio
- computer vision
- etc.
Is it possible to create a general model? @DeepMind unveils the answer⬇️
Recently, DeepMind published two papers about general-purpose architectures that can process different types of input datasets.
1) Perceiver supports any kind of input. 2) Perceiver IO supports any kind of output.
More⬇️
Perceivers can handle new types of data with only minimal modifications.
They process inputs using domain-agnostic Transformer-style attention.
Perceiver IO matches a Transformer-based BERT baseline on the GLUE language benchmark.
3/7
Unlike Transformers, Perceivers first map inputs to a small latent space where processing is cheap and doesn't depend on the input size.
See the architectures of both Perceiver (pic 1) and Perceiver IO (pic 2). 4/7
Results:
Perceiver outperforms strong, specialized models on classification tasks across various modalities:
- images
- point clouds
- audio
- video
- video+audio.
5/7
Perceiver IO achieves strong results on tasks with highly structured output spaces, such as:
- natural language
- visual understanding
- StarCraft II
- multi-task and multi-modal domains.
6/7
Thanks for learning ML and AI with us!
If you are curious about general-purpose architectures, here is the link for you: github.com/deepmind/deepm…
Share this thread with your friends and spread the open ML knowledge!
7/7
• • •
Missing some Tweet in this thread? You can try to
force a refresh
NAS is one of the most promising areas of deep learning.
But it remains super difficult to use.
Archai = an open-source framework that enables the execution of state-of-the-art NAS methods in PyTorch.⬇️
Archai enables the execution of modern NAS methods from a simple command-line interface.
Archai developers are striving to rapidly update the list of algorithms.
Current deck:
- PC-DARTS
- Geometric NAS
- ProxyLess NAS
- SNAS
- DATA
- RandNAS
2/5
Benefits for the adopters of NAS techniques:
- Declarative Approach and Reproducibility
- Search-Space Abstractions
- Mix-and-Match Techniques
- & more!
There are many challenges teams encounter while performing data labeling.
That's why we decided to discuss 3 real-world use cases.
Find one that fits your project⬇️
1) Object detection and image classification
1. Select Object Detection with Bounding Boxes template 2. Modify it to include image classification options to suit your case
It is straightforward to customize the labeling interface using XML-like tags on Label Studio.
2) Correct predictions while labeling
Using Label Studio, you can:
- Display predictions in the labeling interface
- Allow their annotators to focus on validating or correcting the lowest-confidence predictions
.@OpenAI ImageGPT is one of the first transformer architectures applied to computer vision scenarios.👇
In language, unsupervised learning algorithms that rely on word prediction (like GPT-2 and BERT) are extremely successful.
One possible reason for this success is that instances of downstream language tasks appear naturally in the text.
2/4
In contrast, sequences of pixels do not clearly contain labels for the images they belong to.
However, OpenAI believes that sufficiently large transformer models:
- could be applied to 2D image analysis
- learn strong representations of a dataset
3/4