Over the past months, Cohort I of our RL Residency has been shipping.
Highlights:
- continual learning
- automating AI research (from GPU programming to RL itself)
- embodied environments
- multi-agent systems
- materials science discovery
CARLA-Env – @myainotez
An open-source embodied RL environment based on the CARLA simulator. It provides high-fidelity physics, sensors, and configurable urban scenarios for training and evaluating decision-making agents.
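To make the training loop concrete, here is a minimal sketch of the reset/step interface such embodied environments typically expose. The toy env below is purely illustrative (its class name, observations, and reward are invented for this sketch); the real CARLA-Env provides high-fidelity sensors and physics rather than a one-dimensional route.

```python
import random

class ToyDrivingEnv:
    """Toy stand-in for an embodied driving environment.

    Illustrates the reset/step loop agents train against; a real
    simulator-backed env returns camera/lidar observations instead
    of a scalar position.
    """

    def __init__(self, route_length=10):
        self.route_length = route_length
        self.position = 0

    def reset(self, seed=None):
        random.seed(seed)
        self.position = 0
        return {"position": self.position, "speed": 0.0}

    def step(self, action):
        # action: 0 = brake, 1 = accelerate
        self.position += action
        observation = {"position": self.position, "speed": float(action)}
        reward = 1.0 if action == 1 else 0.0  # reward forward progress
        terminated = self.position >= self.route_length
        return observation, reward, terminated

env = ToyDrivingEnv()
obs = env.reset(seed=0)
total, done = 0.0, False
while not done:
    obs, r, done = env.step(1)  # trivial policy: accelerate every step
    total += r
```

The same reset/step contract is what makes an environment pluggable into different RL trainers.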
A dataset and RL environment based on the book “Programming Massively Parallel Processors,” focused on CUDA and GPU programming skills. It includes verifiable coding exercises and a frontier eval built from the same material.
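"Verifiable" here means a submission can be scored automatically against a trusted reference. A minimal sketch of that grading pattern, using a SAXPY-style exercise (the classic GPU warm-up, shown as a CPU reference; the function names and tolerance are assumptions for this sketch, not the environment's actual API):

```python
import random

def reference_saxpy(a, x, y):
    """Trusted reference: y <- a*x + y, elementwise (CPU version)."""
    return [a * xi + yi for xi, yi in zip(x, y)]

def verify(candidate, trials=100, n=64, seed=0):
    """Score a submitted implementation against the reference.

    Sketch of a verifiable exercise: the candidate must agree with the
    reference on many randomized inputs to earn reward.
    """
    rng = random.Random(seed)
    for _ in range(trials):
        a = rng.uniform(-2, 2)
        x = [rng.uniform(-1, 1) for _ in range(n)]
        y = [rng.uniform(-1, 1) for _ in range(n)]
        got = candidate(a, x, y)
        want = reference_saxpy(a, x, y)
        if any(abs(g - w) > 1e-9 for g, w in zip(got, want)):
            return 0.0  # a failed case means no reward
    return 1.0  # all randomized cases pass
```

Randomized reference checks like this give an unambiguous reward signal for RL without any human in the loop.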
First introduced by @a1zhang in Oct 2025, the RLM (Recursive Language Model) has access to its inputs through a variable in a persistent Python REPL.
The model can inspect & transform that variable with code, and pipe parts of it into sub-LLMs via tool calls, without ever loading the potentially huge input into its own context.
RLMs are a simple, flexible form of context folding that doesn't depend on lossy summarization.
Instead, the model proactively delegates context to:
- Python scripts (search, filter, transform)
- Sub-LLMs (fresh instances) for parallel work
- Iterative answer edits until the answer is actually correct
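The delegation pattern above can be sketched in a few lines. This is an illustrative toy, not the RLM implementation: `call_sub_llm` is a hypothetical stand-in for a real model call, and the "document" is synthetic, but the shape of the loop matches the description: inspect the variable with code, filter to the relevant slice, and hand only that slice to a fresh sub-LLM.

```python
# Hypothetical stand-in for a real sub-LLM call; here it just counts
# the lines that mention "error" in whatever slice it is given.
def call_sub_llm(prompt: str) -> str:
    return str(sum("error" in line for line in prompt.splitlines()))

# The potentially huge input lives in a REPL variable; the model never
# sees it in full inside its own context window.
document = "\n".join(
    f"line {i}: {'error' if i % 7 == 0 else 'ok'}" for i in range(1000)
)

# Step 1: inspect the variable cheaply with code.
n_lines = document.count("\n") + 1

# Step 2: filter/transform down to the relevant slice.
relevant = [line for line in document.splitlines() if "error" in line]

# Step 3: pipe only that small slice into a fresh sub-LLM instance.
answer = call_sub_llm("\n".join(relevant))
```

Because only the filtered slice crosses into a model context, the pattern scales to inputs far larger than any context window, with no lossy summarization step.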
Introducing INTELLECT-3: Scaling RL to a 100B+ MoE model on our end-to-end stack
Achieving state-of-the-art performance for its size across math, code and reasoning
Built using the same tools we put in your hands, from environments & evals to RL frameworks, sandboxes & more
INTELLECT-3 is a 106B parameter Mixture-of-Experts model trained with both SFT and RL on top of the GLM 4.5 Air Base model.
Both stages, including multiple ablations, were carried out on a 512-GPU H200 cluster over the course of two months.
Our Training Stack
+ PRIME-RL: Our scalable, asynchronous RL trainer
+ Verifiers: Our unified library used for hundreds of envs and evals on the Environments Hub
+ Sandboxes: Custom container infra optimized for agentic RL
+ Compute: Orchestration & observability for 512 H200s
We're scaling our Open-Source Environments Program
As part of this, we're committing hundreds of thousands of dollars in bounties, and we're looking for partners who want to join our mission to accelerate open superintelligence
Join us in building the global hub for environments and evals
Over the past 2 months, we've crowdsourced 400+ environments and 80+ verified implementations through our bounties and RL residency across:
+ Autonomous AI Research
+ Browser Automation
+ Theorem Proving
+ Subject-Specific QA
+ Legal/Finance Tasks
+ Many more...
Thank you to everyone who's claimed a bounty or joined the residency!