How to get URL link on X (Twitter) App


I had to run Kimi-K2.5 twice, because last time I tested LisanBench with Qwen3 I had to add a /nothink tag, that I then forgot.https://twitter.com/scaling01/status/2008387917899546671again, not saying it's not smart

Decomposing helps the model to focus more on reasoning as it keeps the problem size smaller but it will basically get lost in the algorithm and repeat steps.



Full Leaderboard: 


https://x.com/scaling01/status/1866268414517387299



Paper Link: ai.meta.com/research/publi…