OpenRouter Profile picture
Jun 13 11 tweets 4 min read Read on X
Introducing the Fusion API, the smartest compound model in the market.

Fusion achieves Fable-level intelligence at half the price.

How it works 👇 Image
We benchmarked Fusion on 100 hard research tasks and found:

1. Panels of models consistently outperform individual models
2. Beyond-frontier performance can be achieved with frontier panels
3. Panels of budget models can surpass frontier models at a much lower cost
By testing different combinations of models, we found that roughly three quarters of the lift that Fusion provides comes from synthesis, and one quarter from diversity. Image
Notably, the budget panel was comparable with Claude Fable 5 in performance.

A panel of Gemini 3 Flash, Kimi K2.6, and DeepSeek V4 Pro, fused together, beat solo GPT-5.5 and solo Opus 4.8 outright.

And it landed within 1% of Fable 5 while costing roughly half the price.
How does it work?

When you send a prompt to Fusion, we fan it out to a panel of models in parallel, each with web search and bash tools enabled.

A judge model reads every response and extracts the structure: consensus points, contradictions, partial coverage, unique insights, blind spots.

Chatroom: openrouter.ai/fusion
Then a synthesizer writes the final answer grounded in that analysis

Fusion runs server-side, so developers can call it exactly like a single model slug: "openrouter/fusion"

Or let the model decide when to reach for it by adding {"type": "openrouter:fusion"} to your tools array.
We ran it on the DRACO deep research benchmark by Perplexity: 100 deep research tasks across 10 domains, from law and medicine to finance and product comparison.

Each task is graded against ~39 weighted criteria, and wrong answers carry negative weight. (You can't bluff your way to a high score by being verbose.)

arxiv.org/abs/2602.11685
One detail we want to call out: when we first gave the panel web search, models started surfacing the DRACO rubric online.

We excluded those domains across every model with a one-line config change to the OpenRouter web search tool config, then re-ran everything. All published numbers come from the clean setup.
Want to customize the panel? Pass your own participant models and synthesizer: Image
Fusion is neurodiversity, but for models. Try it now!

💬 Chatroom: openrouter.ai/fusion (pick a preset or build a custom panel)

⚙️ API: docs at openrouter.ai/docs/guides/fe…

ℹ️ More info on the blog post: openrouter.ai/blog/announcem…
Note: we have only evaluated one deep research benchmark so far, which did not include long-horizon tasks.

Fable's long-horizon abilities were extremely impressive, and it calls for future work to benchmark both on long-horizon tasks.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with OpenRouter

OpenRouter Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @OpenRouter

May 2
Introducing Response Caching: save tons of money and time on tests and agent retries.

Blog post:

Available for free. Learn more 👇 openrouter.ai/announcements/…Image
Add one header (`X-OpenRouter-Cache: true`) and identical requests come back in milliseconds with zero tokens billed.

The first call hits the provider as normal, but every identical call after that is free.
A typical uncached request to Gemini 2.5 Flash takes about 1.3 seconds, Kimi K2.6 about 4.6 seconds, GPT-5.5 around 9.1 seconds.

Cache hits return in 80 to 300ms. The lookup itself averages 4ms.
Read 8 tweets
Apr 24
Introducing "create-agent-tui"

A skill for building your own agent harness + terminal UI (TUI). The skill walks you through 4 different ways of customizing the look, and supports dozens of optional features 👇
Supports custom banners, plus many tool display styles: Image
The input field can be customized as well, to match Codex's style or Claude Code / Pi's style.

Or you can describe your own look! Image
Read 8 tweets
Dec 19, 2025
You can now use Claude Code with OpenRouter 🎊

Code with over 320 LLMs, including 39 free ones! Image
Here's a guide to getting set up: openrouter.ai/docs/guides/gu…
You can also now use any model on OpenRouter with Anthropic's API shapes and types.

Your code should look the way you want it to!
Read 4 tweets
Dec 4, 2025
We collaborated with @a16z to publish the **State of AI** - an empirical report on how LLMs have been used on OpenRouter.

After analyzing more than 100 trillion tokens across hundreds of models and 3+ million users (excluding 3rd party) from the last year, we have a lot of insights to share.Image
@AnjneyMidha @MaikaThoughts @xanderatallah @cclark One finding: we observe a Cinderella "Glass Slipper" effect for new models.

Early users a new LLM either churn quickly or become part of a foundational cohort, with much higher retention than others. They are early adopters who can "lead" the rest of the market (more details 👇)
Our dataset: anonymized request-level metadata from OpenRouter, including classifications.

We used this to study behavior at scale without reading any prompts or completions directly.
Read 21 tweets
Nov 27, 2025
Live now on OpenRouter!

INTELLECT-3 pushes the frontier forward by opening up how high quality models are trained. Weights, code, environments, and a detailed writeup are available to all.
Try it now on OpenRouter: openrouter.ai/prime-intellec…
For API users, note that INTELLECT-3’s reasoning should be preserved between turns. Learn more in our docs:

openrouter.ai/docs/guides/be…
Read 4 tweets
Apr 28, 2025
🟣 New model family: Qwen3, by @Alibaba_Qwen

- available today from 8B to 235B, with many free variants!

- unique prompt-based toggle for chain-of-thought reasoning Image
@Alibaba_Qwen 🚀The headliners:

• Qwen3-235B-A22B - MoE with rich reasoning and generation power: openrouter.ai/qwen/qwen3-235…

• Qwen3-30B-A3B - speed & cost-efficiency: openrouter.ai/qwen/qwen3-30b…

• Qwen3-32B - dense model: openrouter.ai/qwen/qwen3-32b
@Alibaba_Qwen 🎁 Try for free:

• Qwen3-30B-A3B Free: openrouter.ai/qwen/qwen3-30b…

• Qwen3-235B-A22B Free: openrouter.ai/qwen/qwen3-235…
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(