Post

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @omarsar0

elvis

@omarsar0

Aug 7

BREAKING: OpenAI introduces GPT-5

Here's everything you need to know:

Altman claims that with GPT-5, it is now like talking to an expert.

It can write entire programs from scratch. Software-on-demand is a defining characteristic.

PhD-level experts in your pockets.

GPT-5 brings a higher level of reasoning.

It thinks just the perfect amount to generate the perfect answer.

Good for math, physics, law, and many other domains.

They claim that GPT-5 is the best coding model today.

Read 30 tweets

elvis

@omarsar0

Aug 3

The Agentic Web is upon us!

If you want to learn about the Agentic Web, look no further.

This new report is a banger!

It presents a detailed framework to understand and build the agentic web.

Here is everything you need to know:

Agentic Web

This paper introduces the concept of the Agentic Web, a transformative vision of the internet where autonomous AI agents, powered by LLMs, act on behalf of users to plan, coordinate, and execute tasks.

It proposes a structured framework for understanding this shift, situating it as a successor to the PC and Mobile Web eras.

It's defined by a triplet of core dimensions (intelligence, interaction, and economics) and involves fundamental architectural and commercial transitions.

Read 15 tweets

elvis

@omarsar0

Aug 2

Hierarchical Reasoning Model

This is one of the most interesting ideas on reasoning I've read in the past couple of months.

It uses a recurrent architecture for impressive hierarchical reasoning.

Here are my notes:

The paper proposes a novel, brain-inspired architecture that replaces CoT prompting with a recurrent model designed for deep, latent computation.

It moves away from token-level reasoning by using two coupled modules: a slow, high-level planner and a fast, low-level executor.

The two recurrent networks operate at different timescales to collaboratively solve tasks

Leads to greater reasoning depth and efficiency with only 27M parameters and no pretraining!

Read 9 tweets

elvis

@omarsar0

Jul 30

Graph-R1

New RAG framework just dropped!

Combines agents, GraphRAG, and RL.

Here are my notes:

Introduces a novel RAG framework that moves beyond traditional one-shot or chunk-based retrieval by integrating graph-structured knowledge, agentic multi-turn interaction, and RL.

Graph-R1 is an agent that reasons over a knowledge hypergraph environment by iteratively issuing queries and retrieving subgraphs using a multi-step “think-retrieve-rethink-generate” loop.

Unlike prior GraphRAG systems that perform fixed retrieval, Graph-R1 dynamically explores the graph based on evolving agent state.

Read 7 tweets

elvis

@omarsar0

Jul 28

GLM-4.5 looks like a big deal!

> MoE Architecture
> Hybrid reasoning models
> 355B total (32B active)
> GQA + partial RoPE
> Multi-Token Prediction
> Muon Optimizer + QK-Norm
> 22T-token training corpus
> Slime RL Infrastructure
> Native tool use

Here's all you need to know:

Model Architecture & Pre-Training

GLM-4.5 is 355B total parameters (32B active); deeper model with narrower width; optimized for reasoning via more layers and 96 attention heads.

GLM-4.5-Air is 106B (12B active).

22T-token training corpus that combines 15T general data with 7T code/reasoning-focused data.

Grouped-Query Attention + partial RoPE to enhance long-context efficiency and accuracy in reasoning tasks.

Mid-training looks like a key part of this model

"Unlike the earlier pre-training stage on large-scale universal documents, these stages leverage medium-sized domain-specific datasets, including instruction data."

Read 14 tweets

elvis

@omarsar0

Jul 27

Claude Code is more than a coding agent.

It's more like a super smart orchestrator agent.

Watch this evaluator loop agent I just built using sub agents and / commands.

This is one of the fastest ways to build custom agentic workflows.

Claude Code is no joke!

I'm impressed to see how easy it is to control how the sub agents communicate with each other (i.e., chain, loop, hierarchical, critic, etc.).

Claude Code is good out of the box, but customization gives you a clear advantage.

Custom sub agents + / commands solve that.

It's worth spending the time optimizing instructions, tool use, agent definitions, and more.

Claude Code, on its own, somehow likes to use a lot of tokens and perform unnecessary tasks/tool calls.

You can max out credits or hit rate limits really fast if you are not careful.

Read 6 tweets

Share this page!

Enter URL or ID to Unroll

elvis

Try unrolling a thread yourself!

More from @omarsar0

elvis

elvis

elvis

elvis

elvis

elvis

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!