.@OpenAI ImageGPT is one of the first transformer architectures applied to computer vision scenarios.👇
In language, unsupervised learning algorithms that rely on word prediction (like GPT-2 and BERT) are extremely successful.

One possible reason for this success is that instances of downstream language tasks appear naturally in the text.
2/4
In contrast, sequences of pixels do not clearly contain labels for the images they belong to.

However, OpenAI believes that sufficiently large transformer models:
- could be applied to 2D image analysis
- learn strong representations of a dataset
3/4
Find more about ImageGPT here: openai.com/blog/image-gpt/

Thanks for learning ML and AI with us! This is a thread from Edge#117 – our series about transformers.
4/4

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with TheSequence

TheSequence Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @TheSequenceAI

29 Oct
Forecasting high-dimensional time series plays a crucial role in many applications like:
- demand forecasting
- financial predictions

You can use @AmazonScience's DeepGLO for these problems.⬇️
The challenge with multi-dimensional time-series datasets is a serious one.

1) Traditional methods (like ARIMA) can't scale to large datasets with millions of time series.

2) Deep neural networks have been proven to handle scalability more effectively. BUT⬇️
BUT many deep neural nets:

- only forecast values from the same dimension
- require different time series to be normalized on a single scale

DeepGLO addresses these challenges.
3/6
Read 6 tweets
29 Oct
There are a handful of frameworks to implement basic NLP.

And what about implementing models like BERT or GPT-3? A framework that does not require monumental development efforts.

@allen_ai created one for you. It's AllenNLP.⬇️
AllenNLP provides a simple & modular programming model for:

1. Applying advanced deep learning techniques to NLP research
2. Streamlining the creation of NLP experiments
3. Abstracting the core building blocks of NLP models

2/5
Portfolio of NLP tasks under AllenNLP:

- Text Generation
- Language Modeling
- Multiple Choice
- Pair Classification
- Structured Prediction
- Sequence Tagging
- Text + vision
3/5
Read 5 tweets
27 Oct
3 big AI industry insights🔥

1) Companies are big spenders on AI but lack confidence
2) AI is a cloud-native world
3) Budgets are growing, despite challenges

Fascinating details👀⬇️
1) Big spenders, but a lack of confidence

- 38% of companies have a budget of more than $1M per year for AI infrastructure alone!

- However, for 77% of companies, less than half of models make it to production 38% of companies have a budget of more than $1M per year for
3) AI is a cloud-native world

- 81% of companies use containers and cloud technologies for their AI workloads

- Nearly 1/2 of them are using @kubernetesio

=> AI is a leader in cloud-native adoption 81% of companies use containers and cloud technologies for t
Read 5 tweets
9 Oct
3 ML frameworks you should check:

1. VISSL
2. AdaNet
3. Archai neural architecture search

Here they are. Boom!⬇️
AdaNet for neural networks discovery
2/3
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(