How to get URL link on X (Twitter) App
🧵2/n. The 3 steps used to train Trading-R1.
🧵2/n. ⚙️ The Core Concepts
Average accuracy and range across 10 runs for five different tones
🧵2/n. In summary how LIMI (Less Is More for Intelligent Agency) can score so high with just 78 examples.
🧵2/n. ⚙️ The Core Idea
🧵2/n. The below figure tells us that high scores on medical benchmarks can mislead, because stress tests reveal that current models often rely on shallow tricks and cannot be trusted for reliable medical reasoning.
🧵2/n. ⚙️ The Core Concepts
🧵2/n. 🧠 The idea
🧵2/n. The below figure tells us that high scores on medical benchmarks can mislead, because stress tests reveal that current models often rely on shallow tricks and cannot be trusted for reliable medical reasoning.
🧵2/n. 🧩 Why MCP
🧵2/n. 🧩 Quick outline
🧵2/n. 🧩 The problem with group-based training
🧵2/n. The 3 steps used to train Trading-R1.
🧵2/n. 🔁 GRPO, not PPO
🧪 How R1-Zero is trained
🧵2/n. ⚙️ The Core Concepts
🧵 2/n. 🧩 NSO (Nullary Second Order), the self‑referential core
🧵2/n. 🧠 The idea
2/n. Rarer facts are even more prone to hallucinations.
🧠 The idea for the Human-brain-inspired linear or hybrid-linear LLMs for the SpikingBrain architecture.