Thread by @NVIDIAAI on Thread Reader App

Today we're shipping Nemotron 3 Ultra.

A 550B MoE frontier-intelligence open model built for long-running agents.

It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.

Ultra excels at complex tasks like coding and deep research.

Long-running agents spend their time planning, using tools, recovering from failures, and deciding what to do next.

The model’s hybrid Mamba-Transformer MoE architecture enables more reasoning cycles within the same time budget, allowing the agent to accomplish more in less time.

Nemotron 3 Ultra delivers leading accuracy for agentic tasks, including agent productivity, coding, and long horizon planning.

Beyond benchmark performance, Ultra can work through large codebases, reason across long chains of tool calls, and synthesize information gathered from hundreds of sources.

We post-trained Ultra for popular agent harnesses like @openclaw, @NousResearch Hermes Agent, and @Langchain.

The result is an open frontier model developers can customize for specialized agents across domains. Read more: nvda.ws/4adkn6J

@openclaw @NousResearch @LangChain As always, Nemotron 3 Ultra is fully open.

This includes model weights, synthetic data, and post-training recipes. Available now on @huggingface → nvda.ws/4v1iBhi

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Share this page!

Enter URL or ID to Unroll