TuringPost
Jun 26 · 7 tweets · 4 min read
Models, datasets and benchmarks to pay attention to:

▪️ Gemini 2.5 Flash and Pro, plus Gemini 2.5 Flash-Lite
▪️ MiniMax-M1
▪️ Kimi-Dev-72B

▪️ SHADE-Arena benchmark
▪️ ESSENTIAL-WEB V1.0 dataset

🧵
1. @Google introduced Gemini 2.5 Flash and Pro as stable and production-ready, and launched Gemini 2.5 Flash-Lite in preview, its fastest and most cost-efficient 2.5 model.

Flash-Lite outperforms 2.0 Flash-Lite on coding, math, science, reasoning, and multimodal benchmarks. It has lower latency, supports a 1 million-token context window and multimodal input, and connects to tools like Google Search and code execution.

storage.googleapis.com/deepmind-media…
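A minimal sketch of calling Flash-Lite with Search grounding via the google-genai Python SDK (the preview model id below is an assumption; check Google's model list for the current alias):

```python
# Hedged sketch: calling Gemini 2.5 Flash-Lite with Google Search grounding
# via the google-genai Python SDK. The preview model id below is an assumption;
# the current alias may differ.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # or set the GEMINI_API_KEY env var

response = client.models.generate_content(
    model="gemini-2.5-flash-lite-preview-06-17",  # assumed preview model id
    contents="Summarize this week's open-source LLM releases.",
    config=types.GenerateContentConfig(
        tools=[types.Tool(google_search=types.GoogleSearch())],  # Search grounding
        max_output_tokens=512,
    ),
)
print(response.text)
```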
3. Kimi-Dev-72B by Moonshot AI

It's a 72.7B-parameter open-source coding LLM fine-tuned from Qwen2.5-72B. It sets a new SOTA on SWE-bench Verified with 60.4% accuracy. It is optimized with large-scale RL to patch real GitHub issues in Docker environments, rewarded only when the full test suite passes.

Available on Hugging Face and GitHub
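A minimal sketch of loading it with transformers, assuming the Hugging Face repo id moonshotai/Kimi-Dev-72B (the prompt is just a placeholder):

```python
# Hedged sketch: loading Kimi-Dev-72B with transformers. The repo id
# "moonshotai/Kimi-Dev-72B" is an assumption; a 72.7B-parameter model needs
# several GPUs (or quantization) to fit in memory.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "moonshotai/Kimi-Dev-72B"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # shard the weights across available GPUs
)

messages = [{"role": "user", "content": "Fix the failing test in utils/date_parse.py"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```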
4. @AnthropicAI, @scale_AI, and @redwood_ai developed SHADE-Arena, a suite of 17 complex evaluations testing if LLMs can secretly complete sabotage tasks alongside benign ones.

Models had to complete the sabotage task without being caught by an AI monitor. No model succeeded more than 30% of the time, and evasion rates topped out around 60%. Claude Sonnet 3.7 was better at concealing its reasoning, and Gemini 2.5 Pro outperformed human monitors at detection but produced many false positives.

anthropic.com/research/shade…
5. ESSENTIAL-WEB V1.0 dataset by @essential_ai

It's a 24-trillion-token Common Crawl corpus annotated with a 12-category taxonomy across 23.6B documents.

Labels made with Qwen2.5-32B-Instruct were distilled into a 0.5B model, making annotation 50x faster with less than 3% quality loss.

Filters built from these labels produced domain datasets that beat or closely approach SOTA: math (-8%), code (+14.3%), STEM (+24.5%), medical (+8.6%). All data and tools are open source.

arxiv.org/abs/2506.14111
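A minimal sketch of streaming the corpus and filtering by taxonomy label. The dataset id EssentialAI/essential-web-v1.0 and the field names are assumptions, so check the dataset card for the real schema:

```python
# Hedged sketch: streaming ESSENTIAL-WEB V1.0 and keeping documents from a
# target domain. The dataset id and field names ("text", a per-document
# taxonomy annotation) are assumptions; check the dataset card for the real schema.
from datasets import load_dataset

stream = load_dataset("EssentialAI/essential-web-v1.0", split="train", streaming=True)

def is_stem(example):
    # Hypothetical schema: a taxonomy dict with a top-level category per document.
    return example.get("taxonomy", {}).get("category") in {"Science", "Engineering", "Math"}

stem_docs = (ex["text"] for ex in stream if is_stem(ex))
for _, doc in zip(range(3), stem_docs):
    print(doc[:200])
```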
Stay ahead with other fascinating AI/ML news in our free weekly digest: turingpost.com/p/fod106


More from @TheTuringPost

Jun 19
Models and datasets to pay attention to:

▪️ Institutional Books 1.0 - a 242B token dataset
▪️ o3-pro from @OpenAI
▪️ FGN from @GoogleDeepMind
▪️ Magistral by @MistralAI
▪️ Resa: Transparent Reasoning Models via SAEs
▪️ Multiverse (Carnegie+NVIDIA)
▪️ Ming-Omni
▪️ Seedance 1.0 by ByteDance
▪️ Sentinel

🧵
1. Institutional Books 1.0: A 242B token dataset from Harvard Library's collections, refined for accuracy and usability

Sourced from 1,075,899 scanned books across 250+ languages via the Google Books project, the dataset includes both raw and post-processed text and detailed metadata.

arxiv.org/abs/2506.08300
2. o3-pro from @OpenAI

A high-reliability LLM for math, science, and coding. It beats o1-pro and o3 in expert evaluations of clarity, instruction-following, and accuracy. It includes tool access (web search, code execution, vision) but responds more slowly.

It replaces o1-pro for Pro/Team users (OpenAI also cut the price of o3 by 80%).

help.openai.com/en/articles/96…
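A minimal sketch of calling it through OpenAI's Responses API (the "o3-pro" model name and access depend on your account tier):

```python
# Hedged sketch: calling o3-pro through OpenAI's Responses API. The "o3-pro"
# model name and availability depend on your account tier.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.responses.create(
    model="o3-pro",
    input="Prove that the sum of the first n odd numbers is n^2.",
)
print(response.output_text)
```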
Jun 18
The latest AI/ML news of the week:

▪️ @HuggingFace helps to find the best model based on size
▪️ NVIDIA’s Jensen Huang and @ylecun disagree with Anthropic’s Dario Amodei predictions
▪️ @AIatMeta’s Superintelligence Gambit
▪️ @Google adds a voice to Search
▪️ Mattel and @OpenAI: brains to Barbie
▪️ Projects in ChatGPT

Details 🧵
1. Hugging Face insists, “Bigger isn’t better”
2. @Nvidia’s Jensen Huang: “I disagree with almost everything he says”
At VivaTech in Paris, he took aim at Anthropic’s Dario Amodei, scoffing at his dire predictions about AI replacing half of entry-level jobs.

Huang argues for open, responsible development – not “dark room” AI monopolies. @ylecun agrees 👇
Jun 10
The freshest research papers:

▪️ Self-Challenging Language Model Agents
▪️ Reflect, Retry, Reward
▪️ ProRL
▪️ Beyond the 80/20 Rule
▪️ REASONING GYM
▪️ AlphaOne
▪️ Unleashing the Reasoning Potential...Critique Fine-Tuning
▪️ ARIA
▪️ Incentivizing Reasoning...Instruction Following
▪️ OThink-R1

▪️ Reasoning Like an Economist
▪️ A Controllable Examination for Long-Context LLMs
▪️ SuperWriter

▪️ Protocol Models
▪️ AReaL
▪️ StreamBP
▪️ Taming LLMs by Scaling Learning Rates

▪️ Diagonal Batching
▪️ Inference-Time Hyper-Scaling with KV Cache Compression
▪️ Unified Scaling Laws for Compressed Representations

▪️ GUI-Actor
▪️ Surfer-H Meets Holo1

▪️ Qwen3 Embedding
▪️ Aligning Latent Spaces with Flow Priors
▪️ Large Language Models are Locally Linear Mappings

▪️ Establishing Trustworthy LLM Evaluation
▪️ Evaluation is All You Need
▪️ Datasheets Aren't Enough

🧵
1. Self-Challenging Language Model Agents by @AIatMeta, @UCBerkeley

Trains agents to create and solve their own tool-use tasks using code-based problem generation and RL

arxiv.org/abs/2506.01716
2. Reflect, Retry, Reward

Enhances model performance by rewarding useful self-reflection after incorrect answers, using only binary feedback

arxiv.org/abs/2505.24726
Jun 7
Log-linear attention is a new type of attention proposed by researchers at @MIT. It is:

- as fast and efficient as linear attention
- as expressive as softmax attention

It uses a small but growing number of memory slots that increases logarithmically with the sequence length.

Here's how it works:
1. Input:

At each time step t, you have:

- Query vector (Q): what the model is asking
- Key vector (K): what the model remembers
- Value vector (V): what the model retrieves

They are computed from the input using learned linear projections.
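
A minimal PyTorch sketch of those projections (illustrative only, not the paper's code):

```python
# Illustrative sketch (not the paper's code): the learned linear projections
# that turn each token embedding into a query, key, and value vector.
import torch
import torch.nn as nn

d_model, d_head = 512, 64

q_proj = nn.Linear(d_model, d_head, bias=False)
k_proj = nn.Linear(d_model, d_head, bias=False)
v_proj = nn.Linear(d_model, d_head, bias=False)

x = torch.randn(1, 16, d_model)            # (batch, seq_len, d_model) token embeddings
Q, K, V = q_proj(x), k_proj(x), v_proj(x)  # each is (1, 16, d_head)
```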
2. Partition past tokens into buckets:

Using Fenwick tree-style hierarchical memory partitioning, the system divides the past tokens into logarithmically many disjoint buckets:

• Each bucket size is a power of two.
• The most recent token forms its own smaller bucket
• Older tokens are grouped into larger buckets

And here's why 👇
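
A toy sketch of that Fenwick-style decomposition (illustrative, not the authors' implementation): the bucket sizes for step t are exactly the set bits of t, so there are at most floor(log2 t) + 1 buckets and the most recent positions always fall in the smallest one.

```python
# Toy sketch of the Fenwick-style decomposition (illustrative, not the authors'
# code): positions 1..t are split into disjoint buckets whose sizes are the set
# bits of t, so there are at most floor(log2(t)) + 1 buckets and the most recent
# positions always land in the smallest one.
def fenwick_buckets(t: int):
    """Return (start, end) position ranges covering 1..t, oldest bucket first."""
    buckets = []
    while t > 0:
        size = t & (-t)                  # lowest set bit = size of the newest remaining bucket
        buckets.append((t - size + 1, t))
        t -= size
    return list(reversed(buckets))       # oldest (largest) bucket first

print(fenwick_buckets(13))  # [(1, 8), (9, 12), (13, 13)] -> bucket sizes 8, 4, 1
```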
Jun 6
.@JeffDean's interview at @Sequoia’s AI Ascent is a must-watch. He gives a real look at where AI is headed and what’s actually happening in the field, sharing insights on:

• Specialized hardware
• Evolution of models
• Future of computing infrastructure
• AI's role in science and more

Here are the key takeaways:
1. Where is AI going these days?

Models are improving fast and solving more problems each year. Hardware, training algorithms, and RL techniques have brought us here — and multimodal is a big focus for what’s next.
2. What about agents?

Jeff Dean sees huge potential in both virtual and robotic agents. With more training and experience, we’ll soon see them doing ~20 useful real-world tasks — unlocking a cycle of usefulness, cost reduction, and further improvements.
May 29
Latent reasoning lets the model do more of its "thinking" internally.

This internal information is continuous, unlike the model's discrete output text.

To efficiently mix this info, researchers from @UofIllinois proposed HRPO (Hybrid Reasoning Policy Optimization) – an RL-based hybrid latent reasoning framework.

Here's how it works:
1. HRPO uses reinforcement learning (RL) to train LLMs to reason internally without needing CoT training data.

It integrates hidden states into token sampling using a learnable gating mechanism.
2. A gating mechanism "decides" how much to use internal hidden states vs. regular token info.

At first, the model sticks mostly to word-level input. Over time, it learns to include more of the hidden state features.
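
A toy sketch of such a gate (an illustration, not HRPO's actual code): a per-feature learnable gate blends the previous hidden state with the sampled token's embedding, initialized so the token pathway dominates at first.

```python
# Toy sketch of the gating idea (an illustration, not HRPO's actual code):
# a per-feature learnable gate blends the previous hidden state with the
# sampled token's embedding. A strongly negative initial gate logit means the
# model starts from (almost) pure token embeddings and can learn, via RL,
# how much latent information to mix in.
import torch
import torch.nn as nn

class LatentTokenGate(nn.Module):
    def __init__(self, d_model: int):
        super().__init__()
        # sigmoid(-4.0) ~= 0.018, so the token pathway dominates at initialization
        self.gate_logit = nn.Parameter(torch.full((d_model,), -4.0))

    def forward(self, token_emb: torch.Tensor, hidden_state: torch.Tensor) -> torch.Tensor:
        g = torch.sigmoid(self.gate_logit)
        return g * hidden_state + (1.0 - g) * token_emb

d_model = 768
gate = LatentTokenGate(d_model)
token_emb = torch.randn(1, d_model)     # embedding of the token just sampled
hidden_state = torch.randn(1, d_model)  # hidden state from the previous decoding step
next_input = gate(token_emb, hidden_state)  # fed back into the model as the next input
```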
