What happens when you freeze a process *perfectly*? RAM, VRAM, network, everything.
Imagine:
- Live-migrations of LLM training jobs
- time-travel debugging
- Surgical repairs of a crash moments before segfault
It’s called CRIU, and it’s already here:
It starts with a mad Russian.
At least, that’s what a lead linux kernel dev called it:
“a project by various mad Russians to perform c/r mainly from userspace, with various oddball helper code...I'm less confident than the developers that it will all eventually work”
Despite the criticism, Pavel Emelyanov, head of the OpenVZ kernel team, pushed on.
By 2012, Linus Torvalds merged the first wave of patches into Linux.
Previous attempts “failed miserably”, mostly out of insane complexity.