Updates for developers building with the OpenAI Platform and API • Service status: https://t.co/kZwnwdYqOS • Support: https://t.co/qCi6M5ESZU
6 subscribers
Mar 11 • 7 tweets • 4 min read
We're launching new tools to help developers build reliable and powerful AI agents. 🤖🔧
Timestamps:
01:54 Web search
02:41 File search
03:22 Computer use
04:07 Responses API
10:17 Agents SDK
Our new API primitive: the Responses API. Combining the simplicity of Chat Completions with the tool-use of Assistants, this new foundation provides more flexibility in building agents. Web search, file search, or computer use are a couple lines of code!
We’ve put together some best practices on using our o-series models:
🤔 Differences between our reasoning and non-reasoning models
💡 When to use reasoning models
💬 How to approach prompting reasoning models effectively
GPT models focus on fast, cost‑efficient, straightforward task execution—“the workhorses.” 🐎
o‑series models excel at handling ambiguous, multi‑step tasks in domains such as math, engineering, legal, and finance—“the planners.” 🧠
Jan 17 • 4 tweets • 1 min read
We’ve put together a reference implementation for building and orchestrating agentic patterns using the Realtime API. You can use this repo to prototype a voice app using multi-agent flows in less than 20 minutes!
github.com/openai/openai-…
Building with the Realtime API can be complex because of the low-latency, synchronous nature of voice interactions. This repo includes best practices we’ve learned for managing this complexity, like:
- Orchestrating agent handoffs (inspired by Swarm)
- Background escalation to o1 for advanced decision making
- Improving model instruction following by defining a state machine in the prompt
- Demos of applying these patterns to customer service and front desk use cases
Oct 11, 2024 • 12 tweets • 3 min read
We announced the Realtime API last week at DevDay SF. It's been amazing to see its adoption—here are some of the coolest examples we’ve seen so far. Let us know if we missed any in this thread!
Today at DevDay SF, we’re launching a bunch of new capabilities to the OpenAI platform:
🗣️ Introducing the Realtime API—build speech-to-speech experiences into your applications. Like ChatGPT’s Advanced Voice, but for your own app. Rolling out in beta for developers on paid tiers. openai.com/index/introduc…
Apr 17, 2024 • 4 tweets • 2 min read
Introducing a series of updates to the Assistants API 🧵
With the new file search tool, you can quickly integrate knowledge retrieval, now allowing up to 10,000 files per assistant. It works with our new vector store objects for automated file parsing, chunking, and embedding.
New token controls allow you to set maximum input and output tokens per run to manage costs. You can also choose how many recent messages to use for context truncation.
GPT-4 Turbo with Vision is now generally available in the API. Vision requests can now also use JSON mode and function calling.
Below are some great ways developers are building with vision. Drop yours in a reply 🧵platform.openai.com/docs/models/gp…
Devin, built by @cognition_labs, is an AI software engineering assistant powered by GPT-4 Turbo that uses vision for a variety of coding tasks.