MatthewBerman
AI 📈🚀
Jan 21 10 tweets 3 min read
DeepSeek R1 has been out for 24 hours.

The AI industry's reaction has been...strong!

Here's a collection of the most telling reactions: 🧵

Dr. Jim Fan, Sr. Research Manager at NVIDIA, points out how odd it is that a non-US company is leading the Open Source AI charge, given that was the original mission of OpenAI.

Jan 18 11 tweets 4 min read
Test Time Compute is bigger than anyone realizes.

It's the most important breakthrough in AI since Transformers.

Let me explain...🧵

What is Test Time Compute?

Think of it like this: Instead of AI giving instant answers, it now "thinks" longer - just like humans do when solving complex problems.
Jan 16 11 tweets 5 min read
1/ SakanaAI just dropped their latest research: Transformer²

It's a self-adaptive architecture that allows AI to evolve at inference time.

Model weights are no longer "static"

Let’s break it down: 🧵

2/ Traditional Transformers are static post-training.

Once trained, they can’t learn or adapt without expensive fine-tuning or additional methods like retrieval-augmented generation (RAG).

Transformer² changes this entirely.
Jan 15 11 tweets 5 min read
1/ Google Research unveils new paper: "Titans: Learning to Memorize at Test Time"

It introduces human-like memory structures to overcome the limits of Transformers, with one "SURPRISING" feature.

Here's why this is huge for AI. 🧵👇

2/ The Problem:

Transformers, the backbone of most AI today, struggle with long-term memory because attention has quadratic time and memory complexity in context length.

Basically, there's a big penalty for long context windows!

Titans aims to solve this with massive scalability.
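
To make the "big penalty" concrete, here is a rough back-of-the-envelope sketch. It counts the entries of one full attention score matrix (one head, one layer) and the bytes needed if that matrix were stored naively at 16 bits per score. The function names and the 2-byte assumption are illustrative only, and optimized attention kernels can avoid materializing the whole matrix.

```python
def attention_score_entries(context_length: int) -> int:
    """Entries in one full self-attention score matrix (one head, one layer)."""
    return context_length * context_length


def attention_score_bytes(context_length: int, bytes_per_entry: int = 2) -> int:
    """Bytes to store that matrix naively; 2 bytes per score is an assumption."""
    return attention_score_entries(context_length) * bytes_per_entry


if __name__ == "__main__":
    # 10x more tokens means 100x more scores.
    for n in (1_000, 10_000, 100_000):
        gib = attention_score_bytes(n) / 2**30
        print(f"{n:>7} tokens -> {attention_score_entries(n):>15,} scores, {gib:10.3f} GiB")
```

Doubling the context length quadruples the number of scores, which is the scaling problem the thread refers to.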
Jan 13 11 tweets 4 min read
1/9 BREAKING

Biden Admin drops major AI chip rules today!

The 200+ page "AI Diffusion" framework completely reshapes global AI tech trade.

Key goal: Keep advanced AI development running on "American rails"

But not everyone is happy... 🧵

2/9 THE ALLIES LIST

18 countries get VIP treatment with ZERO restrictions - including UK, Canada, Japan, Germany, South Korea & Taiwan.

These trusted partners can freely access US AI tech.

Small orders (up to 1,700 GPUs) worldwide won't need special permission.
Jan 8 12 tweets 2 min read
What will society look like after AGI is achieved?

I found a great prediction on LessWrong by L Rudolf L (link below).

Capital will matter MORE after AGI.

A thread on the future of wealth, power & human agency 🧵

1/ Most think money won't matter post-AGI.

But here's why that's wrong: AI will make capital (factories, data centers, money) MORE powerful while making human labor LESS valuable.
Jan 3 11 tweets 4 min read
Want to know how OpenAI's o1 and o3 models work?

Chinese researchers figured it out!

They dropped a research paper explaining EVERYTHING.

Here's what you need to know: 🧵

1/ The paper analyzes four key components needed to achieve o1/o3-level performance:

Policy initialization: Train model with human-like reasoning behaviors

Reward design: Design feedback signals to guide model improvement

Search capabilities: Explore multiple solutions through tree/sequential strategies

Learning methods: Update model using search-generated data and rewards
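
The four components above can be illustrated with a deliberately tiny toy. This is not the paper's method: the "policy" is just a number (the mean of a Gaussian), the task is guessing a hidden target, search is plain best-of-N sampling, and every name and parameter here is hypothetical.

```python
import random


def reward(candidate: float, target: float) -> float:
    """Reward design: higher is better, peaking when candidate equals target."""
    return -abs(candidate - target)


def search(policy_mean: float, target: float, n: int, rng: random.Random) -> float:
    """Search: sample n candidates from the policy and keep the best-scoring one."""
    candidates = [rng.gauss(policy_mean, 1.0) for _ in range(n)]
    return max(candidates, key=lambda c: reward(c, target))


def train(target: float, steps: int = 50, n: int = 8, lr: float = 0.5, seed: int = 0) -> float:
    """Run the toy loop and return the final policy mean."""
    rng = random.Random(seed)
    policy_mean = 0.0  # Policy initialization: a fixed starting point in this toy.
    for _ in range(steps):
        best = search(policy_mean, target, n, rng)
        # Learning: nudge the policy toward the best candidate found by search.
        policy_mean += lr * (best - policy_mean)
    return policy_mean


if __name__ == "__main__":
    print(train(target=5.0))
```

The point of the sketch is only the shape of the loop: search generates candidates, the reward ranks them, and the learning step folds the winner back into the policy.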
Dec 28, 2024 12 tweets 4 min read
Anthropic just dropped an incredible guide on "How To Build Effective Agents"

2025 will be the year of AGENTS 🤖

Here's everything you need to know: 🧵

Simple > Complex

When building LLM agents, the most successful implementations use basic composable patterns.

My take: agentic frameworks are great for not needing to reinvent the wheel while building agent patterns.
Dec 27, 2024 12 tweets 4 min read
1/ 🚨 Big news: OpenAI makes it clear they are evolving into a for-profit

They are moving to a more closed and for-profit model while doubling down on AGI safety and scalability.

Is this the right balance of ethics and ambition, or is it a departure from their ideals?

Let’s unpack. 🧵

2/ 🚀 Mission: AGI for All

OpenAI’s mission remains the same: to ensure AGI benefits all of humanity.

But with AGI development accelerating, they claim changes are critical to stay competitive and address complex challenges.
Dec 24, 2024 14 tweets 4 min read
o3 was announced less than a week ago and the AI industry was stunned.

I've collected some of the reactions from the biggest names in AI: 🧵

Balaji on how incredible the 25% score on FrontierMath really is.

Dec 20, 2024 8 tweets 3 min read
.@OpenAI just dropped o3 and o3-mini!

This is AGI (not clickbait)

o3 is the best AI ever created, and its performance is WILD.

Here's everything you need to know: 🧵

o3 is their most advanced model yet, excelling at coding, mathematics, and even PhD-level science tasks.

o3-mini offers incredible cost-performance optimization, perfect for diverse use cases.
Dec 20, 2024 8 tweets 3 min read
#1 Trending Github Project: Genesis 🌟

A groundbreaking framework for creating, training, & deploying embodied agents in simulated environments!

And it's open-source!

Here's why you should care: 🧵

1/ What is Genesis?

Genesis is an open-source platform designed to enable the creation of embodied agents: AI models that interact with simulated worlds just like humans or animals do in the real world. 🤖🌍
Dec 20, 2024 7 tweets 3 min read
.@AnthropicAI just published a WILD new AI jailbreaking technique

Not only does it crack EVERY frontier model, but it's also super easy to do.

ThIS iZ aLL iT TakE$ 🔥

Here's everything you need to know: 🧵

Introducing Best-of-N (BoN) Jailbreaking: a black-box algorithm that bypasses AI system safeguards across various modalities.

BoN operates by generating multiple prompt variations through augmentations like random shuffling or capitalization, continuing until a harmful response is produced.
Dec 18, 2024 7 tweets 3 min read
NVIDIA just dropped a game-changing tiny supercomputer:

The Jetson Orin Nano.

Here's why this is massive for edge AI... 🧵

At just $249, this pocket-sized powerhouse can run large language models LOCALLY - no cloud needed.

It's delivering nearly 70 TRILLION operations per second at just 25 watts!

Why this matters: We're entering the era of edge AI.

Soon, powerful AI will run independently on devices everywhere - from robots to IoT devices to cars. No cloud connection required.
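
For a sense of scale, the two figures quoted above can be turned into an efficiency number. This is simple arithmetic on the thread's rounded values ("nearly 70" trillion operations per second and 25 watts), not a measured benchmark, and the function names are illustrative.

```python
def tops_per_watt(tera_ops_per_second: float, watts: float) -> float:
    """Trillions of operations per second delivered per watt of power."""
    return tera_ops_per_second / watts


def joules_per_operation(tera_ops_per_second: float, watts: float) -> float:
    """Energy per single operation; one watt is one joule per second."""
    return watts / (tera_ops_per_second * 1e12)


if __name__ == "__main__":
    print(tops_per_watt(70, 25))         # 2.8
    print(joules_per_operation(70, 25))  # roughly 3.6e-13 J, about 0.36 picojoules
```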
Dec 18, 2024 12 tweets 4 min read
Anthropic just dropped an insane new paper.

AI models can "fake alignment" - pretending to follow training rules during training but reverting to their original behaviors when deployed!

Here's everything you need to know: 🧵

A study by @Anthropic shows Claude 3 Opus strategically complies with harmful requests in training to preserve its harmless behavior.

Key finding: When told it's being trained on free-tier user data (but not paid-tier data), Claude complies with harmful requests 14% of the time for free users vs almost never for paid users.

The model explicitly reasons about preserving its preferred harmless behavior.
Dec 14, 2024 9 tweets 3 min read
Google just announced Android XR!

A new operating system for extended reality devices like headsets and glasses, built with AI (Gemini 2.0) from the ground up.

Here's the wild vision of what the future of AI looks like: 🧵

Check out this video explaining why Android XR is so important:
Dec 13, 2024 8 tweets 3 min read
Cohere just dropped Command R7B

The smallest, fastest, state-of-the-art enterprise-grade LLM.

The best part? It’s Open Weights!

Here’s everything you need to know 🧵

(Cohere Partner)

The Final Model

Command R7B is the final model in this series of models.

It’s built for real-world tasks for developers and businesses.

It excels at multi-lingual support, citation-verified RAG, reasoning, tool use, and agentic behavior.
Dec 11, 2024 9 tweets 3 min read
Google just dropped Gemini 2.0!

Tons of awesome things just released.

Here's everything you need to know: 🧵

Gemini 2.0 Flash is launching today

It's faster than 1.5 Pro and comes with new features like native image generation and text-to-speech in multiple languages.
Dec 3, 2024 9 tweets 3 min read
AWS (@awscloud) just dropped Automated Reasoning!

Automated reasoning was previously only available to the largest companies with massive resources.

AI changes that.

Let me show you why this is such a big deal 🧵

(AWS Partner)

Automated reasoning means using math to prove statements are valid.

Think: "If it rains, the ground gets wet," and "if the ground is wet, tires have less grip."

Thus, "if it rains, tires have less grip."

Simple, right? But it gets complex fast.
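
The rain example above can be checked mechanically. The sketch below is a brute-force truth table over three hypothetical boolean variables (`rain`, `wet`, `grip`, where `grip` stands for "tires have less grip"); it is only an illustration of what "proving a statement valid" means, not a description of how the AWS feature works.

```python
from itertools import product


def implies(a: bool, b: bool) -> bool:
    """Material implication: 'if a then b' is false only when a holds and b does not."""
    return (not a) or b


def is_valid(premises, conclusion, num_vars: int) -> bool:
    """True if the conclusion holds in every assignment where all premises hold."""
    for values in product([False, True], repeat=num_vars):
        if all(p(*values) for p in premises) and not conclusion(*values):
            return False  # Found a counterexample.
    return True


premises = [
    lambda rain, wet, grip: implies(rain, wet),   # If it rains, the ground gets wet.
    lambda rain, wet, grip: implies(wet, grip),   # If the ground is wet, less grip.
]
conclusion = lambda rain, wet, grip: implies(rain, grip)
converse = lambda rain, wet, grip: implies(grip, rain)

if __name__ == "__main__":
    print(is_valid(premises, conclusion, 3))  # True
    print(is_valid(premises, converse, 3))    # False
```

With three variables there are only 8 assignments to check, but the table doubles with every added variable, which is one way it "gets complex fast."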
Nov 26, 2024 8 tweets 2 min read
#1 Trending Github Repo: Screenshot-to-Code

A simple tool to convert screenshots, mockups and Figma designs into clean, functional code using AI

Check out the WILD demos below 👇

Instagram ==> COPIED
Nov 22, 2024 14 tweets 4 min read
This was a WILD week in AI News. 📈

Musk predicts AGI, updates from Figure Robotics, Flux, Qwen, Gemini, and ChatGPT.

Here's everything you missed 👇

1/ Elon Musk predicts AGI by 2026! Bold timeline from the Tesla CEO. What do you think - too optimistic or right on target?