Founder of the world’s most read daily AI newsletter @therundownai. Sharing the latest developments in the world of artificial intelligence.
Dec 16 • 8 tweets • 4 min read
Google just released Veo 2, a new state-of-the-art AI video model.
In testing, Veo 2 beat OpenAI's Sora in BOTH quality and prompt adherence.
The video compilation below is 100% created by AI (more details in thread):
Veo 2 can generate 8-second videos in up to 4K resolution (720p at launch).
The model also features:
— Better understanding of physics for more natural movement, lighting, etc.
— Enhanced clarity and sharpness of outputs
— Reduced hallucinated objects and details
Dec 11 • 7 tweets • 3 min read
I've been an early tester + had in-person demos for most of Google’s AI projects announced today.
I found several practical use cases that will benefit everyday people.
12 use cases of Project Astra/Deep Research Agents/Project Mariner (beyond the hype):
Project Astra: Google's AI agent that can 'see the world' using your phone camera
Use cases that stood out to me:
> Summarizing a book page in seconds and chatting with it for follow-ups on complex topics (professor-in-your-pocket)
> Identifying a rash: just seasonal hives or something more serious?
> Real-time translation of languages, sign language, and books (worked great for Japanese writing → English summary).
> Locating landscapes in photos and estimating their distance using the Google Maps integration.
> Remembering cookbook recipes and recommending wine pairings based on the recipe and budget.
> Summarizing thousands of Amazon/Airbnb reviews in seconds using mobile screen sharing, with highlights of any negative feedback.
Dec 11 • 11 tweets • 4 min read
AI NEWS: OpenAI just rolled out 'ChatGPT Canvas' to all users.
Plus, new AI coding agents, Sora, Google Willow, xAI's Grok image generator, Amazon's AGI lab, Lindy agent phone calls, and Meta COCONUT.
Here's what you need to know:
OpenAI just made Canvas, the collaborative split-screen writing and coding interface, available to all users.
The feature also now has:
-Native integration with gpt-4o
-Python integration for direct code execution
-Integration within custom GPTs
Dec 6 • 10 tweets • 4 min read
AI NEWS: Meta just unexpectedly dropped Llama 3.3—a 70B model that's ~25x cheaper than GPT-4o.
Plus, Google released a new Gemini model, OpenAI reinforcement finetuning, xAI's Grok is available for free, Copilot Vision, ElevenLabs GenFM, and more.
Here's what you need to know:
Meta just dropped Llama 3.3 — a 70B open model that offers similar performance to Llama 3.1 405B, but significantly faster and cheaper.
It's also ~25x cheaper than GPT-4o.
Text only for now, and available to download at llama.com/llama-downloads
Nov 26 • 6 tweets • 2 min read
OpenAI’s Sora video model appears to have leaked.
And it's even more advanced than the February demo.
A group of early beta testers just dropped access to what looks to be Sora's ‘turbo’ variant on Hugging Face, citing concerns about OpenAI's early access program and artist compensation.
The leaked version generates 1080p 10-second clips and seems to be processing WAY faster than the previously reported 10-minute render times.
Nov 20 • 5 tweets • 2 min read
NEW: ChatGPT’s ‘Live Camera’ video features were just found in the code of the latest beta.
Six months after their initial demo, OpenAI's visual AI could be ready for testing — and Advanced Voice Mode could soon be getting ‘eyes’
The code discovered in v1.2024.317 reveals:
—Live camera functionality
—Real-time processing
—Voice mode integration
—Visual recognition capabilities
The tech was initially showcased in an OpenAI demo in May, with Advanced Voice Mode interacting with a dog in real time:
@AndroidAuth first spotted the code in the latest beta:
Beta: Tap the camera icon to let ChatGPT view and chat about your surroundings.
Live camera: Don't use for live navigation or decisions that may impact your health or safety.
Sep 25 • 8 tweets • 4 min read
Meta just made a ton of new AI announcements across Meta AI, Llama, Ray-Bans, and more.
Here’s everything important announced live from here @ Meta Connect:
1. Meta AI is getting its own voice mode!
2. Meta AI can now ‘see’ images!
Similar to ChatGPT, you can now share photos and have Meta AI reply to any photo in chat.
But Meta is going a step further by letting users actually edit photos (removing an object, adding a hat, changing backgrounds, etc.), all within the chat.
Rolling out in the US only for now (unlike voice mode, which is rolling out to the US, Canada, Australia, and New Zealand over the next month).
Sep 23 • 10 tweets • 4 min read
Sam Altman and Jony Ive just confirmed they are working on a secret, new AI-powered hardware device.
Plus, more developments from Microsoft, Electronic Arts, Disney, Pudu Robotics, and Alibaba.
Here's everything that happened in AI over the weekend:
Jony Ive is back with his first major tech project since leaving Apple, teaming up with Sam Altman for an AI-powered hardware device.
The project has already raised private funding and plans to raise up to $1 billion by the end of the year.
iPhone meets o1?
Sep 20 • 14 tweets • 5 min read
AI NEWS: Sam Altman just confirmed o1 has reached level 2 of OpenAI's 5-tier path to AGI.
Plus, more developments from Apple, Amazon, YouTube, Meta, Google, Alibaba, and Nvidia.
Here's everything going on in AI right now:
In a recent interview with T-Mobile, Sam Altman compared o1’s current state to the ‘GPT-2 stage’ of reasoning models
He also revealed that the development of o1 unlocks a much quicker path to fully capable AI agents
Hear it from the man himself:
Sep 18 • 12 tweets • 5 min read
AI NEWS: Snap just announced new AR/AI glasses and an AI video generator.
Plus, more developments from 1X, Google, Microsoft, Sakana, OpenAI, and Perplexity.
Here's everything going on in AI right now:
Snap's Spectacles feature a suite of cameras and sensors for multimodal AI and contextual understanding.
They're powered by AI and can create shared immersive experiences with people nearby.
BUT the battery only lasts for 45 minutes, and they cost $99/month.
Sep 17 • 11 tweets • 4 min read
AI NEWS: A smarter, faster version of Microsoft's AI assistant, Copilot, is coming to Excel and Word.
Plus, more developments from Slack, Groq, Google, Luma Labs, Runway, OpenAI, and Star Wars.
Here's everything going on in AI right now:
Microsoft just unveiled its next wave of Copilot, a smarter, faster AI assistant.
Copilot is coming to Excel, PowerPoint, Teams, Outlook, Word, and OneDrive soon.
It's 2x faster and 3x better thanks to its GPT-4 integration, according to early users
Sep 16 • 10 tweets • 4 min read
AI NEWS: The "Godmother of AI" just launched World Labs, teaching AI to create and understand 3D worlds.
Plus, more developments from OpenAI, Tencent, Runway, Google, and Meta.
Here's everything that happened in AI over the weekend:
Fei-Fei Li announced a new startup, World Labs, to develop AI models capable of understanding and crafting 3D environments.
Instead of focusing on LLMs, World Labs will build LWMs (Large World Models).
AI NEWS: The CEO of OpenAI Japan just said an AI called 'GPT-Next' is coming, and it's 100x better than GPT-4.
Plus, more developments from Altera, Anthropic, Nvidia, and Google DeepMind.
Here's everything going on in AI right now:
At KDDI Summit 2024, OpenAI Japan's CEO revealed that GPT-Next will be 100x more powerful than GPT-4
The model will reportedly use a smaller version of OpenAI's mysterious Project Strawberry
The boost in performance comes from better architecture, not a ton of computing resources
Aug 29 • 9 tweets • 4 min read
AI NEWS: Google just developed an AI system that recreated the popular DOOM game at 20 fps.
Plus, more developments from OpenAI, Bland AI, Gemini, Midjourney, and XPENG
Here's everything going on in AI right now:
Google researchers just developed GameNGen, an AI that can simulate DOOM at over 20 fps
It works by predicting each frame in real time with a diffusion model
At scale this could mean AI will be able to create games on the fly, personalized to each player
Aug 28 • 10 tweets • 4 min read
AI NEWS: OpenAI's Project Strawberry is reportedly coming this fall.
Plus, more developments from Anthropic, Google DeepMind, Apple, Cerebras, and xAI.
Here's everything going on in AI right now:
According to a new report from The Information, OpenAI researchers are finally preparing to launch a new AI model, code-named Strawberry (previously Q*), this fall
If it lives up to the leaks, it could advance OpenAI towards Level 2 of its five-level roadmap to AGI
Aug 23 • 12 tweets • 4 min read
AI NEWS: Salesforce just introduced two autonomous AI agents for sales.
Plus, more developments from Amazon, Boston Dynamics, Synthflow, AI21 Labs, NVIDIA, Mistral, D-ID, and Krea AI.
Here's everything going on in AI right now:
Salesforce just introduced two fully autonomous, AI-powered sales agents:
> Einstein SDR Agent can engage with inbound leads 24/7
> Einstein Sales Coach Agent can offer real-time sales suggestions during calls
We're headed into a new era of sales.
Aug 22 • 9 tweets • 3 min read
NEWS: Neuralink's second patient can now play video games with no controller.
Plus, more AI developments from Ideogram, Midjourney, CodeSignal, Disney, ETH Zurich, and xAI.
Here's everything going on in AI right now:
Neuralink released a progress report on its second human patient, Alex.
TLDR:
- Discharged the next day
- Broke the prev. world record for BCI cursor control on day one
- Can play Counter-Strike 2 using just the Neuralink + his brain (no controller) 🤯