A group of astronomers have found phosphine in the atmosphere of Venus, which is hard to explain other than by the presence of life. This is not at all conclusive, but should prompt further investigation.
The scientists don’t suggest intelligent life; we are probably talking about microbes. But this could still be a big deal. It would mean life either started independently there or was transported between bodies in our Solar System. Let’s focus on the former. 2/6
The possibility that it is extremely hard and rare for life to begin is currently the best explanation for why we don’t see signs of life elsewhere in the cosmos, despite the presence of so many stars in our galaxy and galaxies in the observable universe. 3/6
It is thus often seen as a downer. But many of the alternative explanations for the silence in the skies are worse. One prominent alternative is that technological civilisations inevitably destroy themselves. 4/6
If we did find independent life on other planets it would shift our credences away from the hypothesis that life is hard to start and towards the hypothesis that it is all too easy to end. This would be bad news for our prospects. 5/6
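To see the shape of that update, here is a minimal Bayesian sketch, with purely illustrative numbers and the two explanations treated as exhaustive for simplicity (none of this is from the thread itself). Let $H_{\text{hard}}$ be "life is hard to start" and $H_{\text{end}}$ be "civilisations tend to destroy themselves", with prior credences 0.7 and 0.3, and let $E$ be the discovery of independently arisen life on Venus. If $P(E \mid H_{\text{hard}}) = 0.01$ while $P(E \mid H_{\text{end}}) = 0.5$, then

$$P(H_{\text{hard}} \mid E) = \frac{0.01 \times 0.7}{0.01 \times 0.7 + 0.5 \times 0.3} \approx 0.04,$$

so most of our credence would shift to the grimmer explanation of the silence.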
Evidence That Recent AI Gains Are Mostly from Inference Scaling
🧵
Here's a thread about my latest post on AI scaling...
1/14
Scaling up AI using next-token prediction was the most important trend in modern AI. It stalled out over the last couple of years and has been replaced by RL scaling.
This has two parts:
1. Scaling RL training
2. Scaling inference compute at deployment
2/
Many people focus on (1). This is the bull case for RL scaling — it started off small compared to internet-scale pre-training, so can be scaled 10x or 100x before doubling overall training compute.
3/
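To make the headroom arithmetic concrete, here is a tiny sketch in Python. The compute split is an illustrative assumption (RL at roughly 1% of pre-training compute), not a figure from the post.

```python
# Illustrative sketch of the RL-scaling headroom argument.
# The compute split below is an assumption for the example, not a real training budget.

pretrain_compute = 1.0   # normalise pre-training compute to 1
rl_compute = 0.01        # assume RL currently uses ~1% as much compute as pre-training

for rl_multiplier in [10, 100, 1000]:
    total = pretrain_compute + rl_compute * rl_multiplier
    print(f"RL scaled {rl_multiplier:>4}x -> total training compute {total:.2f}x today's pre-training budget")

# Under this assumption, even a 100x scale-up of RL only doubles total training compute.
```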
Evaluating the Infinite
🧵
My latest paper tries to solve a longstanding problem afflicting fields such as decision theory, economics, and ethics — the problem of infinities.
Let me explain a bit about what causes the problem and how my solution avoids it.
1/20
Decision theory, economics and ethics all involve comparing different options. Sometimes the relevant sums or integrals that we'd hope would provide the value of an option instead diverge to +∞, making comparison between such options impossible.
2/
This problem arises because the standard approach to evaluating infinite sums and integrals uses a very coarse-grained system of infinite numbers, where there is only one positive infinite number (+∞). To assign values to such options, we need a more fine-grained system.
3/
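As a toy illustration of what goes wrong and what a finer-grained system buys you (my example, not necessarily the construction used in the paper): consider two options that each deliver utility forever, where option $A$ gives 2 units per period and option $B$ gives 1 unit per period. On the standard approach,

$$\sum_{t=1}^{\infty} 2 = +\infty = \sum_{t=1}^{\infty} 1,$$

so both options are assigned the same value even though $A$ beats $B$ in every period. A finer-grained system of infinite numbers can separate them, for instance by assigning values on the order of $2\omega$ and $\omega$ (where $\omega$ is an infinite number larger than every natural number), so that the comparison $A \succ B$ is recoverable from the values themselves.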
The Extreme Inefficiency of RL for Frontier Models
🧵
The switch from training frontier models by next-token prediction to training them by reinforcement learning (RL) requires thousands to millions of times as much compute per bit of information the model gets to learn from.
1/11
Next-token prediction (aka pre-training) gives the model access to one token of ground-truth information after each token the model produces.
RL requires the model to produce an entire chain of thought (often >10,000 tokens) before finding out a single bit of information.
2/
So the shift from scaling up compute used for pre-training to scaling up compute used for RL comes with a major reduction in the information-efficiency of the training method.
3/
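Here is a back-of-the-envelope version of that comparison, using generated tokens as a rough proxy for compute. All the numbers below are illustrative assumptions rather than measurements.

```python
import math

# Rough information-density comparison between pre-training and RL.
# All numbers are illustrative assumptions, not measurements.

vocab_size = 100_000                                # assume a ~100k-token vocabulary
bits_per_pretrain_token = math.log2(vocab_size)     # up to ~17 bits of ground truth per generated token

tokens_per_rl_episode = 10_000                      # assumed chain-of-thought length before any feedback
bits_per_rl_episode = 1                             # a single pass/fail signal at the end

pretrain_density = bits_per_pretrain_token          # bits learned per generated token
rl_density = bits_per_rl_episode / tokens_per_rl_episode

print(f"pre-training: ~{pretrain_density:.1f} bits per generated token")
print(f"RL:           ~{rl_density:.4f} bits per generated token")
print(f"ratio:        ~{pretrain_density / rl_density:,.0f}x")

# With these assumptions the gap is roughly 170,000x, comfortably inside the
# thousands-to-millions range, and it grows with longer chains of thought.
```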
The fact that frontier AI agents subvocalise their plans in English is an absolute gift for AI safety — a quirk of the technology development which may have done more to protect us from misaligned AGI than any technique we've deliberately developed.
Don't squander this gift.
While @balesni's thread asks developers to:
"Consider architectural choices that preserve transparency"
I don't think that goes nearly far enough.
If someone works out how to trade away this transparency in exchange for more efficiency and ushers in a new era of opaque thoughts, they may have done more than any other individual to lower the chance humanity survives this century.
Is there a half-life for the success rates of AI agents?
I show that the success rates of AI agents on longer-duration tasks can be explained by an extremely simple mathematical model — a constant rate of failing during each minute a human would take to do the task.
🧵
1/
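Here is a minimal sketch of that model in Python. The variable names and the example hazard rate are mine, chosen for illustration rather than taken from the paper.

```python
import math

def success_rate(task_minutes, fail_per_minute):
    """Constant-hazard model: the agent fails independently, with the same
    probability, in each minute a human would need for the task."""
    return (1 - fail_per_minute) ** task_minutes

def half_life_minutes(fail_per_minute):
    """Task length (in human-minutes) at which the success rate falls to 50%."""
    return math.log(2) / -math.log(1 - fail_per_minute)

p = 0.02   # assumed 2% chance of failing in any given human-minute
for minutes in [5, 15, 30, 60]:
    print(f"{minutes:>3} min task: {success_rate(minutes, p):.0%} success")
print(f"half-life ≈ {half_life_minutes(p):.0f} human-minutes")
```

Because the hazard is constant, success decays exponentially with task length, which is what gives the model its half-life.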
METR recently released an intriguing report showing that on a suite of tasks related to doing AI research, the length of tasks that frontier AI agents can complete has been doubling every 7 months. 2/
They measure task length by how long, on average, it takes a human to complete the task. And they measure the length of task an AI agent can complete by the longest task at which the agent still has a ≥50% success rate.
3/
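Under the constant-hazard sketch above, METR's measure lines up neatly with a half-life (this is my framing of the connection, following the model rather than quoting the paper):

$$P(\text{success on a task of human-length } t) = 2^{-t/T_{1/2}},$$

so the success rate is exactly 50% when $t = T_{1/2}$. The longest task at which an agent keeps a ≥50% success rate is then just its half-life, and a task horizon that doubles every 7 months corresponds to a half-life that doubles every 7 months.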
New results for o3 and o4-mini have been added to the @arcprize leaderboard. Here are some key takeaways: 1/ 🧵
1. The released version of o3 is much less capable than the preview version announced with a lot of fanfare 4 months ago, though it is also much cheaper. People who buy access to it are not getting the general reasoning performance @OpenAI was boasting about in December.
2/
2. The @arcprize team tried to test a high-compute version of o3, but it kept failing to answer. They spent >$50,000 trying to get it to work, but couldn't, so those December results can't realistically be replicated with the released model.