Nous Research
Aug 26 · 5 tweets · 3 min read
Nous Research presents Hermes 4, our latest line of hybrid reasoning models.

hermes4.nousresearch.com

Hermes 4 builds on our legacy of user-aligned models with expanded test-time compute capabilities.

Special attention was given to making the models creative and interesting to interact with, unencumbered by censorship, and neutrally aligned, while maintaining state-of-the-art math, coding, and reasoning performance for open-weight models.
You can try Hermes 4 in the new, revamped Nous Chat UI.

chat.nousresearch.com

Nous Chat has been reworked to include parallel interactions, completions mode, and a memory system, which is slowly being rolled out. We now provide a suite of open and closed models for this experience, from Hermes 4 to GPT-5.

For the first week, all Hermes 4 inference in Nous Chat is free of charge.
Alongside these models, Nous Research is releasing a technical report that details the entirety of their creation process.

arxiv.org/abs/2508.18255

The technical report includes a thorough set of evaluations of Hermes 4 and other popular LLMs, complete with the actual text results of each test. We believe this report sets a new standard for transparency in benchmarking.
In pursuit of producing models that are open, steerable and capable of producing the full range of human expression, we created a new benchmark, RefusalBench, that tests a model’s willingness to be helpful in a variety of scenarios commonly disallowed by both closed and open models.

Hermes 4 achieves SOTA against all popular closed and open models in conforming to your values, without censorship.
Special thanks to our launch day partners - @chutes_ai, @nebiusai, and @luminal_ai - for serving these models and powering our new chat experience.

Check out their platforms for additional options for API inference.

And if you’re looking for support or a great AI community to join, check out our Discord at discord.gg/NousResearch

More from @NousResearch

Mar 13
Announcing the latest DeepHermes Preview models, DeepHermes 24B and 3B!

huggingface.co/collections/No…

These new models are hybrid reasoners: you can toggle long chain-of-thought reasoning ON or OFF, depending on whether you want a short, intuitive answer or a longer, well-reasoned, higher-accuracy one. They are now available on our API and for download on HuggingFace.
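When reasoning mode is ON, client code usually wants to separate the chain of thought from the final answer. The thread does not specify the output format, so the sketch below assumes the reasoning is wrapped in `<think>...</think>` tags; the tag name and the `split_reasoning` helper are illustrative assumptions, not a documented API.

```python
import re
from typing import NamedTuple


class ParsedReply(NamedTuple):
    reasoning: str  # text found inside the reasoning tags ("" if none)
    answer: str     # everything outside the reasoning tags


# Assumption: the model wraps its chain of thought in <think>...</think>.
# Change the pattern if your model uses a different delimiter.
_THINK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> ParsedReply:
    """Split a raw model reply into (reasoning, answer)."""
    blocks = _THINK.findall(text)
    reasoning = "\n".join(block.strip() for block in blocks)
    answer = _THINK.sub("", text).strip()
    return ParsedReply(reasoning, answer)
```

With reasoning mode OFF the reply contains no tags, so `reasoning` is simply empty and the same code path handles both modes.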
DeepHermes 24B Preview performs extremely well on reasoning tasks with reasoning mode ON, jumping over 4x in accuracy on hard math problems, and 43% on GPQA, a STEM-based QA benchmark.

Built on @MistralAI's excellent Mistral-Small-24B open model, it's a perfect size for quantization on consumer GPUs.

With reasoning mode OFF, it performs comparably to Mistral's own instruct variant.
The DeepHermes models scale quite well with size, improving rapidly from 3B to 24B. And they're not just great at objective tasks: they're also great for any question that demands deep thought, and they are completely transparent about their thinking process.
Feb 13
Introducing DeepHermes-3 Preview, a new LLM that unifies reasoning and intuitive language model capabilities.

huggingface.co/NousResearch/D…

DeepHermes 3 is built from the Hermes 3 datamix, with new reasoning data, creating a model that can toggle on and off long chains of thought for improved accuracy at the cost of more test-time compute!
This is our first work on reasoning models, and we hope our unique approach to a user-controlled, toggleable reasoning mode furthers our mission of giving those who use DeepHermes more steerability for whatever need they have.

These early benchmarks show extreme improvement in mathematical reasoning capabilities when reasoning is enabled, as well as a modest improvement on the GPQA (Graduate-Level Google-Proof Q&A) benchmark.
Here are some example outputs in reasoning mode, where it thinks longer for harder problems and shows the full, raw chain of thought to arrive at the answer, allowing insight, transparency, observability, and access.
Jan 27
Recent AI breakthroughs challenge the status quo narrative that only closed, mega labs have the ability to push the frontier of superintelligence.

Today we announce Nous Psyche built on @Solana - a cooperative training network for generative AI. Psyche coordinates heterogeneous hardware to join a run and train open-source models.

We retell the myth of Psyche — a mortal’s quest for retribution against divine odds:
Read more in our blog post:

nousresearch.com/nous-psyche/
You can now experiment with Psyche’s DisTrO-enabled training code on our GitHub, and the larger open-sourced distributed training stack will be released alongside testnet.

github.com/PsycheFoundati…
Dec 2, 2024
Nous Research announces the pre-training of a 15B parameter language model over the internet, using Nous DisTrO and heterogeneous hardware contributed by our partners at @Oracle, @LambdaAPI, @NorthernDataGrp, @CrusoeCloud, and the Andromeda Cluster.

This run presents a loss curve and convergence rate that meet or exceed those of centralized training.

Our paper and code on DeMo, the foundational research that led to Nous DisTrO, are now available (linked below).
You can watch the run LIVE here: distro.nousresearch.com

We harness both Nous DisTrO, our novel networking stack that reduces inter-GPU communication by up to 10,000x during pretraining, and the testnet code for Psyche, a decentralized network that builds on Nous DisTrO to autonomously coordinate compute for model training and more.

Psyche details coming soon.
DeMo was created in March 2024 by Bowen Peng (@bloc97_ ) and Jeffrey Quesnelle (@theemozilla) and has been published on arXiv in collaboration with Diederik P. Kingma (@dpkingma), co-founder of OpenAI and inventor of the Adam optimizer and VAEs.

Paper Link: arxiv.org/abs/2411.19870
Code: github.com/bloc97/DeMo
Nov 12, 2024
Today we are launching the Forge Reasoning API Beta, an advancement in inference time scaling that can be applied to any model or a set of models, for a select group of people in our community.

nousresearch.com/introducing-th…

The Forge Reasoning engine is capable of dramatically improving Hermes 70B to reach parity in some categories with OpenAI's o1 (full), at the cost of more inference compute.
The API is built upon three architectures developed at Nous:

1. Monte Carlo Tree Search (MCTS)
2. Chain of Code (CoC)
3. Mixture of Agents (MoA)

Together, these three techniques create a powerful reasoning system that outputs complex, flexible, and nuanced responses from LLMs. Elevating open-source AI to the level of frontier models has been a core principle of Nous since its inception.
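To make the third technique concrete, here is a minimal single-layer Mixture of Agents sketch: several "proposer" agents each answer the prompt, and an "aggregator" agent synthesizes their drafts into one reply. This is not Forge's implementation, which the thread does not describe; the `mixture_of_agents` function, the `Agent` type, and the prompt wording are all hypothetical, and each agent is just a plain callable standing in for an LLM call.

```python
from typing import Callable, List

# Hypothetical stand-in for an LLM call: takes a prompt, returns text.
Agent = Callable[[str], str]


def mixture_of_agents(prompt: str, proposers: List[Agent], aggregator: Agent) -> str:
    """Single-layer MoA: collect drafts, then ask one agent to merge them."""
    # Step 1: every proposer answers the original prompt independently.
    drafts = [agent(prompt) for agent in proposers]

    # Step 2: show the aggregator the question plus all numbered drafts.
    numbered = "\n".join(f"[{i}] {draft}" for i, draft in enumerate(drafts, start=1))
    combined = (
        f"Question: {prompt}\n"
        f"Candidate answers:\n{numbered}\n"
        "Synthesize one final answer."
    )
    return aggregator(combined)
```

In practice each callable would wrap a request to a different model or sampling configuration; keeping them as plain functions makes the control flow easy to test without any network access.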
We’re inviting a small group of beta users to try out the Forge Reasoning API over the next month. This inference technology requires battle testing and user feedback in order to determine what areas it uniquely excels at in the real world.

Sign up for research updates here: nousresearch.typeform.com/FORGEAPI?typef…

Try Nous Chat today here for free: hermes.nousresearch.com