Matrix Multiplication (MatMul) is one of the root-node problems: any speedup carries over to many areas, including matrix inversion, factorization, and neural networks.
In industry, MatMul is used for image processing, speech recognition, computer graphics, etc. 2/
We formulated the problem of finding MatMul algorithms as a single-player game. In each episode, the agent starts from a tensor representing a MatMul operator and has to find the shortest path to an all-zero tensor. The length of this path corresponds to the # of multiplications. 3/
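To make the game concrete, here's a minimal sketch for the 2x2 case (illustrative only, not AlphaTensor's code): each "move" subtracts a rank-1 tensor u⊗v⊗w from the MatMul tensor, and Strassen's classic 7 factors form a 7-move winning episode that reaches the all-zero tensor.

```python
# Minimal sketch of the tensor game for 2x2 MatMul (not AlphaTensor's code).
# T[p][q][r] = 1 iff output entry c_r receives the product a_p * b_q,
# with matrix entries flattened row-major: a = (a11,a12,a21,a22), etc.
def matmul_tensor():
    T = [[[0] * 4 for _ in range(4)] for _ in range(4)]
    for i in range(2):
        for j in range(2):
            for k in range(2):
                # c_{ij} += a_{ik} * b_{kj}
                T[2 * i + k][2 * k + j][2 * i + j] = 1
    return T

def play(T, factors):
    """Each move subtracts the rank-1 tensor u⊗v⊗w from T."""
    for u, v, w in factors:
        for p in range(4):
            for q in range(4):
                for r in range(4):
                    T[p][q][r] -= u[p] * v[q] * w[r]
    return T

# Strassen's 7 factors: a 7-move winning episode of the game.
STRASSEN = [
    ([1, 0, 0, 1],  [1, 0, 0, 1],  [1, 0, 0, 1]),   # m1 = (a11+a22)(b11+b22)
    ([0, 0, 1, 1],  [1, 0, 0, 0],  [0, 0, 1, -1]),  # m2 = (a21+a22)b11
    ([1, 0, 0, 0],  [0, 1, 0, -1], [0, 1, 0, 1]),   # m3 = a11(b12-b22)
    ([0, 0, 0, 1],  [-1, 0, 1, 0], [1, 0, 1, 0]),   # m4 = a22(b21-b11)
    ([1, 1, 0, 0],  [0, 0, 0, 1],  [-1, 1, 0, 0]),  # m5 = (a11+a12)b22
    ([-1, 0, 1, 0], [1, 1, 0, 0],  [0, 0, 0, 1]),   # m6 = (a21-a11)(b11+b12)
    ([0, 1, 0, -1], [0, 0, 1, 1],  [1, 0, 0, 0]),   # m7 = (a12-a22)(b21+b22)
]

residual = play(matmul_tensor(), STRASSEN)
assert all(x == 0 for plane in residual for row in plane for x in row)
print("reached the all-zero tensor in", len(STRASSEN), "moves")
```

Any sequence of rank-1 factors that zeroes the tensor is an exact MatMul algorithm, and a shorter sequence means fewer multiplications.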
The main RL challenges are:
1) Gigantic action space, e.g., 10^23 for 4x4 MatMul.
2) Many symmetries in observations & actions, e.g., actions are order-invariant.
3) A single wrong move results in a sub-optimal algorithm.
4/
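The order-invariance symmetry is easy to see in code: each move subtracts a rank-1 tensor, and subtractions commute, so every permutation of a winning move sequence is also winning. A toy illustration with small random tensors (dimensions and values here are arbitrary, just for the demo):

```python
import itertools
import random

def apply_moves(T, moves):
    # Each move subtracts the rank-1 tensor u⊗v⊗w; because subtraction
    # commutes, the final tensor is independent of the order of the moves.
    T = [[row[:] for row in plane] for plane in T]  # deep copy
    for u, v, w in moves:
        for p in range(len(u)):
            for q in range(len(v)):
                for r in range(len(w)):
                    T[p][q][r] -= u[p] * v[q] * w[r]
    return T

random.seed(0)
n = 2
T0 = [[[random.randint(-2, 2) for _ in range(n)] for _ in range(n)] for _ in range(n)]
moves = [tuple([random.randint(-1, 1) for _ in range(n)] for _ in range(3))
         for _ in range(3)]

# All 3! = 6 orderings of the same moves leave the same residual tensor.
results = {str(apply_moves(T0, list(p))) for p in itertools.permutations(moves)}
assert len(results) == 1
```

For the agent this means the number of *distinct* decompositions is far smaller than the number of move sequences, which a naive search would not exploit.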
To tackle the problem, we developed #AlphaTensor, an #AlphaZero-based agent with novel capabilities for handling the above RL challenges. For example, we employed a risk-seeking distributional value head to improve exploration. 5/
For many matrix sizes, #AlphaTensor discovered MatMul algorithms that are faster than the state of the art. These algorithms can be applied recursively to larger matrix sizes, effectively resulting in a bank of new, exact, and faster MatMul algorithms! 6/
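Recursion is what makes a small base case pay off at scale. A minimal sketch of the standard divide-and-conquer scheme using Strassen's classic 2x2 base case on plain Python lists (a discovered base algorithm would plug into the same scheme with different factors):

```python
def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def sub(X, Y):
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def strassen(A, B):
    """Multiply n x n matrices (n a power of 2) with 7 recursive products."""
    n = len(A)
    if n == 1:
        return [[A[0][0] * B[0][0]]]
    h = n // 2
    A11 = [r[:h] for r in A[:h]]; A12 = [r[h:] for r in A[:h]]
    A21 = [r[:h] for r in A[h:]]; A22 = [r[h:] for r in A[h:]]
    B11 = [r[:h] for r in B[:h]]; B12 = [r[h:] for r in B[:h]]
    B21 = [r[:h] for r in B[h:]]; B22 = [r[h:] for r in B[h:]]
    m1 = strassen(add(A11, A22), add(B11, B22))
    m2 = strassen(add(A21, A22), B11)
    m3 = strassen(A11, sub(B12, B22))
    m4 = strassen(A22, sub(B21, B11))
    m5 = strassen(add(A11, A12), B22)
    m6 = strassen(sub(A21, A11), add(B11, B12))
    m7 = strassen(sub(A12, A22), add(B21, B22))
    C11 = add(sub(add(m1, m4), m5), m7)
    C12 = add(m3, m5)
    C21 = add(m2, m4)
    C22 = add(sub(add(m1, m3), m2), m6)
    return ([r1 + r2 for r1, r2 in zip(C11, C12)] +
            [r1 + r2 for r1, r2 in zip(C21, C22)])

# Sanity check against the naive O(n^3) definition on a random 4x4 case.
import random
random.seed(1)
A = [[random.randint(-5, 5) for _ in range(4)] for _ in range(4)]
B = [[random.randint(-5, 5) for _ in range(4)] for _ in range(4)]
expected = [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]
assert strassen(A, B) == expected
```

Because the base case is exact, the recursion is exact too; the speedup comes purely from spending 7 recursive products instead of 8 at every level.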
#AlphaTensor invented a beautiful algorithm for 4x4 modular arithmetic MatMul, which surpassed two-level Strassen’s algorithm for the first time in 50 years!
This result was so significant that at first we thought there was a bug 😅 7/
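To put the improvement in context (using the published figure of 47 multiplications for 4x4 in modular arithmetic, versus 7^2 = 49 for two-level Strassen and 4^3 = 64 for the standard method), here is how recursive application compounds a small saving in the base case:

```python
import math

def recursive_mults(n, block, rank):
    """Scalar multiplications when a block x block base algorithm using
    `rank` multiplications is applied recursively to an n x n matrix
    (n must be a power of `block`)."""
    levels = round(math.log(n, block))
    assert block ** levels == n
    return rank ** levels

n = 1024
naive_mults = n ** 3                              # standard algorithm: 8 per 2x2 level
strassen_mults = recursive_mults(n, 2, 7)         # Strassen's 2x2, 7-mult base
alphatensor_mults = recursive_mults(n, 4, 47)     # reported 4x4, 47-mult base (mod 2)
assert alphatensor_mults < strassen_mults < naive_mults
```

Saving 2 multiplications out of 49 at the base level turns into a constant-factor gap at every recursion depth, which is why beating two-level Strassen on 4x4 matters well beyond the 4x4 case.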