New preprint! When reasoning LLMs deliberate over possible futures, are they actually planning?
arxiv.org/abs/2605.06840
We extract search trees from chain-of-thought reasoning traces in the four-in-a-row board game and find that LLMs generate the surface structure of tree search, but their decisions are driven by something much shallower.