Latest Twitter Threads by @tunguz on Thread Reader App

Apr 24 • 11 tweets • 2 min read

I am pleased to announce that my report for the American Enterprise Institute (@aei) on the transformative power of the reasoning AI tools has been published. AI tools have already had a major impact on education, but most of it has been unwitting and reactive.

1/11

I argue that we need to rethink education from the ground up with AI playing a central role in it. If understood properly, AI can be a major driving force enabling better, more accessible, and more impactful education and training.

2/11

Jan 13 • 8 tweets • 2 min read

Python π is out - the latest version of Python, Python 3.14, has just been released. Python is one of the top programming languages, and it is particularly widely used in Data Science and Machine Learning, where it has become a de facto standard.

1/8

In recent years changes to the language have largely focused on the under-the-hood improvements, focusing on such things as stability, portability, performance, etc. Some of the top improvements:

2/8

Dec 31, 2024 • 13 tweets • 3 min read

Recent debates on X have , among other things, brought forth the purported underachievement attitude of the American education and children's upbringing. I have voiced my serious reservations regarding that point, to put it mildly.

Many of my own ideas on this matter have been heavily influenced by @DavidEpstein's book "Range". It is one of my favorite books on education and skill development in general.

1/13

David Epstein argues against the prevailing cultural narrative that specialization is the only path to success. Instead, he champions the idea of "range" — the benefits of having a broad set of experiences, skills, and knowledge. Epstein uses a variety of examples from sports, science, music, and business to illustrate how generalists often outperform specialists in complex and unpredictable environments. He suggests that learning broadly before (or even instead of) specializing can lead to greater creativity, adaptability, and success.

2/13

Dec 23, 2024 • 9 tweets • 2 min read

Today @SemiAnalysis published their extremely comprehensive, detailed, and honest report on performance comparison between @NVIDIA's H100/H200 GPUs and @AMD's MI300X.

1/9

On paper, MI300X has many advantages compared to the H100/H200, but in practice AMD's hardware is effectively nerfed by their catastrophically weak software. TL;DR: out of the box you will not be able to use MI300X for ML/AI training.

2/9

Dec 10, 2024 • 4 tweets • 1 min read

I've been holding off saying more about Google's purported quantum computing breakthrough until I read a bit more about it. (You should try doing something like that too!) It turns out, as I had suspected, it was waaaaay overhyped.

1/4

https://twitter.com/skdh/status/1866352680899104960

Yes, it's good science, but in terms of any kind of practical applications we are probably at least a decade away. Even then it will most likely be very specialized areas of application, like molecular dynamics.

2/4

Oct 14, 2024 • 7 tweets • 2 min read

At the end of the last week @DarioAmodei, co-founder and the CEO of @AnthropicAI, one of the top AI labs, published a long essay on his vision of what an advanced AI in the upcoming years could potentially accomplish. There have been several other similar essays over the past few months from other top AI voices, but in my opinion this one is the most thoughtful and most detailed so far. It stirs away from many ideological squabbles that have become all too common these days, and provides ample citations that bolster his points, and help with further reading and self-guided research. Dario's own scientific background is in Biophysics and Neuroscience, and his takes on potential in those fields are particularly insightful.

1/7

Some key takeaways:

Biology and health

AI could drastically accelerate biological and medical progress, compressing 50-100 years of advancements into just 5-10 years. This could lead to the elimination of most diseases, significant extensions of the human lifespan, and greater control over biological functions like reproduction and aging. Key breakthroughs such as CRISPR, mRNA vaccines, and genome sequencing are examples of innovations AI could multiply, transforming human health.

2/7

Sep 9, 2023 • 7 tweets • 2 min read

.@NVIDIA has just announced TensorRT-LLM, open source software designed to accelerate Large Language Model inference on H100s.

1/7

This software has been developed though close collaboration with many leading AI companies, such as @Meta, @anyscalecompute , @cohere , Deci AI, @Grammarly, @databricks and many others.

2/7

Aug 28, 2023 • 6 tweets • 2 min read

By now most of us well aware of transformer-based large language model capabilities and, in many instances, failures. The failures in particular can seem extremely head-scratching, as they often involve the kind of mental reasoning that even a young schoolboy cold do.

1/6

A new paper tries to investigate the nature of these failures, and understand the limits of LLM-based reasoning. It seems that the failures primarily arise from the tasks with low in-domain knowledge and high compositional complexity.

2/6

Aug 23, 2023 • 4 tweets • 1 min read

Very exciting news - Python is now available in the official version of Excel! Excel is the most widely used analytics tool in the World, and Python has become the most popular programming language for Data Science and Machine Learning tasks.

1/4

It is a very intuitive and easy to learn programming language. The merger of these two tools will open new opportunities and use cases.

This merger is the culmination of years-long effort and collaboration between Microsoft and the open source Python community.

2/4

Jun 5, 2023 • 7 tweets • 3 min read

Large Language Models (LLMs) have emerged as the cornerstone of the current Generative AI revolution. The big problem with LLMs is that they are, well, large. Really, really, large.

1/7

They require an enormous amount of high-quality data to train and even more unfathomably large amount of computational power.

2/7

Mar 14, 2023 • 7 tweets • 5 min read

Would you like to win an RTX 4080? You are in luck, because at @nvidia we are giving away one (1) for GTC 2023. All you have to do is:

1. Like and share this tweet

2. Register for GTC: nvda.ws/3j6gw41

3. Post a screenshot of you in a session as a response below

1/7

A few points:

1. I am working with the NVIDIA marketing team to promote one giveaway; there are other influencers who are giving away more GPUs in their own giveaways.

2. GTC registration is completely free and open to the general public. All sessions are online.

2/7

Feb 8, 2023 • 4 tweets • 3 min read

Things seem to be moving at a breakneck speed in the world of generative AI and large language models. In a surprise press event yesterday, @Microsoft announced a wide integration of @OpenAI tools into a couple of their major products,

1/4

Bing search engine and Edge web browser. In particular, this seems to be the first time that we'll see anywhere a public use of OpenAI's next generation LLM, GPT4. Most of the new features are still relatively limited, and you'll need to join the waitlist for the full access. 2/4

Feb 7, 2023 • 5 tweets • 3 min read

In a highly anticipated move, @Google yesterday announced that they are launching Bard, a conversational AI app that is based on their LaMDA model.

1/5

LaMDA - Language Model for Dialogue Applications - has been around for at least a year, but due to variety of considerations it has never been accessible to to the public.

2/5

Jan 30, 2023 • 5 tweets • 3 min read

Deep Learning and Neural Networks have become the default approaches to Machine Learning in recent years. However, despite their spectacular success in certain domains (vision and NLP in particular),

1/5

their use across the board for all ML problems and with all datasets is problematic, to say the least. Oftentimes better and more robust results can be obtained with simpler, easier to train and deploy, classical ML algorithms.

2/5

Jan 29, 2023 • 5 tweets • 1 min read

There was nothing that shocked me more when I entered the industry from academia than this kind of attitude. I came from an environment where teaching and learning were the norm, to the one where giving help to “underperformers” was viewed with disdain as a liability.

1/5

Fortunately not all organizations and managers are this cutthroat, but this kind of mindset is pervasive, especially at startups. There is a widespread attitude that *it’s someone else’s responsibility to do the educating*: yours, your previous job’s, your college’s etc.

2/5

Dec 12, 2022 • 4 tweets • 4 min read

Last week @DeepMind’s research on AlphaCode - a competative programming system - has been published in Science. AlphaCode has been able to beat 54% of humans on a competative coding challenges, putting it on par with many junior-level developers.

1/4

The original announcement from DeepMind came out in February, which in the fast-paced world of AI is already ancient history.

2/4

Dec 5, 2022 • 7 tweets • 3 min read

Last week @OpenAI released ChatGPT - a Large Language AI Model that interacts with users in a natural conversational way. The chatbot is able to answer complex questions, even in highly technically demanding categories.

1/7

It is also able to answer the follow up question, backtrack on wrong assumptions, and provide other detailed resources, including code fragments.

2/7

Dec 3, 2022 • 6 tweets • 3 min read

PyTorch 2.0 is out! This major release upgrade brings about many new features, but the main improvements are under the hood.

1/6

The three main principles behind PyTorch

1. High-Performance eager execution
2. Pythonic internals
3. Good abstractions for Distributed, Autodiff, Data loading, Accelerators, etc.

PyTorch 2.0 is fully backward compatible with the previous versions of PyTorch.

2/6

Oct 10, 2022 • 4 tweets • 3 min read

Decision trees based Machine Learning models are some of the best performant algorithms in eras of predictive capability, especially on small and heterogenous datasets.

1/4

They also provide an unparalleled level of interpretability compared to all other non-linear algorithms. However, they are very hard to optimize on Von Neumann architecture machines due to their non-uniform memory access patterns.

2/4

Oct 8, 2022 • 6 tweets • 3 min read

This past week I came across another paper that purports to get the SOTA for NNs for tabular data. Due to the extreme penchant for exaggeration in this community, I have given up on checking most of these claims, but decided to take a look at this particular work.

1/6

I decided to check how does XGBoost *really* perform on the datasets used in the paper, and the results were not pretty.

2/6

Oct 1, 2022 • 4 tweets • 3 min read

This week @NVIDIA open sourced the 3D object generation AI model, GET3D. GET3D is a generative model of high quality 3D textured shapes learned from images.

1/4

Trained using only 2D images, GET3D generates 3D shapes with high-fidelity textures and complex geometric details.

2/4

Share this page!

Enter URL or ID to Unroll