Designed for deep reasoning, both can use all ChatGPT tools agentically:
- Intelligent use of tools (Python, search, image analysis)
- Chaining tools for complex solutions
- Quick, well-formatted results, usually in under a minute
Key information you need to know:1. o3 is a deep reasoning model excelling in:
- Coding and debugging
- Advanced math
- Visual problem-solving (charts, sketches, blurry photos)
- Scientific and business analysis
It achieves new state-of-the-art scores on major benchmarks like Codeforces, SWE-Bench, and MMMU.
Mar 31 • 6 tweets • 2 min read
Social media is filled with Ghibli-style AI images this week.
And it's definitely impressive.
But it raises more difficult questions:
Are we honoring artistic legacies or reducing them to mere prompts?
Where is the line between admiration and appropriation?
A deep reverence for nature, humanity, and emotion.
Turning that into an instant filter misses the point.
1/5
Feb 6 • 5 tweets • 2 min read
Hugging Face has launched AI Search within @huggingface Spaces, simplifying the discovery of AI-powered apps.
Finding AI tools used to be difficult, but with Spaces Search, you can instantly discover AI solutions.
With 400K+ tools, you can explore solutions for:
- Image editing
- Speech transcription
- Virtual try-ons
- Text analysis & more
A quick breakdown 🧵👇
What are Spaces?
Hugging Face Spaces lets you explore and use AI apps without coding.
It's like an app store for AI tools where you can:
- Instantly try apps like image editing and text analysis
- Discover AI innovations globally
- Use AI with no installation needed
Search, click, and start using AI!
Dec 28, 2024 • 12 tweets • 2 min read
10 Apple Intelligence Features and How To Use Them
🧵 1. Enable Apple Intelligence
Update your device to the latest iOS, iPadOS, or macOS. Then, go to Settings > Apple Intelligence & Siri and follow the prompts to get started.
Autopilot is an AI tool designed to make your work easier by learning how you work.
How it works:
- Watches your task handling: emails, file organization, problem-solving
- Learns your style to fit your workflow
- Uses familiar tools like Google Drive, Slack, SharePoint
- Manages tasks like document creation and emails, saving you time
It solves key challenges in scaling image generation!
FLUID innovates with continuous and random tokens 👇🏼 1. FLUID vs. Other Models
- Autoregressive Models: Generate images in a fixed sequence, limiting global adjustments
- Diffusion Models: Great quality but require many steps. FLUID delivers both quality and efficiency, refining images faster than diffusion models
Jul 15, 2024 • 7 tweets • 3 min read
We have gotten a lot of questions about role prompting.
To clarify, we ran ~12 role prompts against 2K MMLU questions on gpt-4-turbo. This is *not yet* in The Prompt Report.
In particular, I created a "Genius" prompt, something like: "You are a Harvard educated scientist..." and... 🧵
an "Idiot" prompt, something like: "You are a dumb person..."
Full prompts here:
We found no significant differences in scores from them. In fact, the Idiot prompt outperformed the Genius prompt.gist.github.com/trigaten/17183…
Jul 14, 2024 • 8 tweets • 2 min read
🚨 Role Prompting doesn't work...
Our team at @learnprompting led a year-long study with co-authors from @OpenAI & @Microsoft, analyzing over 1,500 prompting papers. We narrowed it down to 58 different prompting techniques and we analyzed every one.
Here's what we found...
@OpenAI @Microsoft 🚫 Role Prompting was shockingly ineffective. Here's why:
For older models, it seems they could access improved responses/reasoning by being moved by the prompt into a better parameter space. However, newer models are likely already in that improved parameter space.*
Jul 5, 2024 • 4 tweets • 2 min read
Human Prompt Engineer VS AI Prompt Engineer? Who wins?
@sanderschulhoff, Founder of @learnprompting, was defeated💀 🪦...
DPSy performed 40% better on a novel classification benchmark.
Here's how 🧵
@SanderSchulhoff 1)
🌟 We gave @sanderschulhoff a task description and some training data.
🕒 He spent 20 hours testing out different techniques over 47 steps of development.
🐞 He dealt with issues like LLM refusal to respond and performance regressions
Jun 12, 2024 • 9 tweets • 4 min read
🚨Announcing The Prompt Report🚨
A 76-page survey of 1,500+ prompting papers, analyzing EVERY prompting technique, Agents, & GenAI
Led by @learnprompting, and folks from @OpenAI, @Microsoft, & @UofMaryland
Here’s what we found & the 58 prompting techniques you should know👇🧵
🤝Our team of 31 people + GenAI searched the internet for the latest prompting papers and broke them down into 7 Categories:
✏️ Text-based Prompting
🌐 Multilingual Techniques
🎨 Multimodal Techniques
🤖 Agents
📊 Evaluation
🔒 Security
⚖️ Alignment
ChatGPT is great, but still struggling to find the right prompt?
Just launched to solve this 🧵👇
(1/4)learnprompting.org
What is learnprompting?
A course that teaches you prompting techniques. It also has real world examples that take you through the end-to-end prompt engineering process⚡