d6n
May 9 5 tweets 2 min read
How can a 284B-parameter open model run locally on a 128GB Mac?

The answer is not magic.

It's systems engineering: sparse routing, aggressive quantization, and treating the SSD as part of inference.
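To see why quantization alone may not be enough, here is a rough back-of-the-envelope sketch. The 284B parameter count and 128GB of RAM come from the thread; the bit widths are illustrative assumptions, not the formats ds4 actually uses, and the estimate covers weights only (no KV cache, activations, or OS overhead).

```python
def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in decimal GB."""
    return n_params * bits_per_weight / 8 / 1e9

N_PARAMS = 284e9   # headline parameter count from the thread
RAM_GB = 128       # unified memory on the Mac from the thread

# Bit widths below are illustrative assumptions, not ds4's real formats.
for bits in (16, 8, 4, 2):
    gb = weight_footprint_gb(N_PARAMS, bits)
    verdict = "fits in" if gb < RAM_GB else "exceeds"
    print(f"{bits:>2}-bit: {gb:6.1f} GB ({verdict} {RAM_GB} GB RAM)")
```

Under these assumptions, a uniform 4-bit checkpoint is about 142 GB, still above 128 GB. That gap is one way to read the thread's point about going below 4 bits or leaning on the SSD.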

Here's the programmer version of Antirez's ds4.

First: DeepSeek V4 Flash is a mixture-of-experts (MoE) model.

For each token, the model routes work to a subset of experts instead of activating the whole network.

So the checkpoint can be huge, while the per-token compute path is much smaller than the headline parameter count suggests.
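A minimal toy sketch of top-k expert routing makes the idea concrete. This is not DeepSeek's or ds4's code: `NUM_EXPERTS`, `TOP_K`, and the scalar "experts" are hypothetical stand-ins, and real models route per layer and often add shared experts.

```python
import math
import random

def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their gates."""
    chosen = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in chosen)  # subtract max for stability
    weights = [math.exp(router_logits[i] - peak) for i in chosen]
    total = sum(weights)
    return [(i, w / total) for i, w in zip(chosen, weights)]

def moe_forward(x, experts, router_logits, k):
    """Run only the selected experts and mix their outputs by gate weight."""
    routes = top_k_route(router_logits, k)
    out = 0.0
    for idx, gate in routes:
        out += gate * experts[idx](x)  # the other experts are never evaluated
    return out, [idx for idx, _ in routes]

# Toy setup: these sizes are hypothetical, not DeepSeek V4 Flash's values.
NUM_EXPERTS, TOP_K = 64, 4
rng = random.Random(0)
experts = [lambda x, s=rng.uniform(0.5, 1.5): s * x for _ in range(NUM_EXPERTS)]
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]

y, used = moe_forward(2.0, experts, logits, TOP_K)
print(f"experts touched: {len(used)} of {NUM_EXPERTS} ({TOP_K / NUM_EXPERTS:.1%})")
```

All 64 toy experts have to be stored somewhere, but each token only runs 4 of them. That is the gap between checkpoint size and per-token compute.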