LLM360
LLM360 is an open research lab enabling community-owned AGI through open-source large model research and development.
Dec 5 • 6 tweets • 4 min read
To mark the 2nd anniversary of LLM360, we are proud to release K2-V2: a 70B reasoning-centric foundation model that delivers frontier capabilities.

As part of our push for "360-open" transparency, we are releasing not only the weights but the full recipe: data composition, training code, logs, and intermediate checkpoints.

About K2-V2:

🧠 70B params, reasoning-optimized
🧊 512K context window
🔓 "360-Open" (Data, Logs, Checkpoints)
📈 SOTA on olympiad math and complex logic puzzles

We evaluated K2 across general knowledge, STEM, coding, and agentic tool use.

The goal? To show open models need not be smaller, weaker versions of closed ones.

K2 outperforms models of similar size and performs close to much larger ones.

🔗 Check out the model here: huggingface.co/LLM360/K2-V2

Generously sponsored by @mbzuai.
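
If you want to try it, here is a minimal sketch of loading the checkpoint with the Hugging Face transformers library. The repo id LLM360/K2-V2 comes from the link above; the dtype/device settings are illustrative, and a 70B model needs multiple GPUs or heavy offloading:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LLM360/K2-V2"  # repo id taken from the link in this thread

tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" shards the model across available GPUs (requires accelerate);
# torch_dtype="auto" reuses the dtype stored in the checkpoint.
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

prompt = "Prove that the sum of two even integers is even."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))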
Apr 11 • 8 tweets • 4 min read
We proudly present MegaMath, the largest open-source math reasoning pretraining corpus: 371B tokens of high-quality mathematical web, code, and synthetic data, designed to build the data foundation for next-generation math-proficient LLMs like o1 and R1. 🧵👇 #LLM #OpenSource #MegaMath #Math #Data4LLMs #Pretraining

Trending now on Hugging Face: huggingface.co/datasets/LLM36…

🔍 Why is this important?

Mathematical reasoning is a key capability of advanced LLMs. Training math-proficient models like o1 and DeepSeek-R1 requires large-scale, high-quality, diverse math data. Proprietary corpora, such as those behind Qwen-2.5-Math (1T tokens) and DeepSeekMath (120B tokens), yield strong mathematical abilities but are closed source, and existing open corpora lack comparable size and quality. MegaMath aims to bridge this gap.
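
To poke at the data, here is a minimal sketch using the Hugging Face datasets library. Two assumptions: the dataset id LLM360/MegaMath is inferred from the truncated link above, and the web/code/synthetic parts may be exposed as separate configs, so the exact names may differ:

from datasets import load_dataset

# Stream rather than download: a 371B-token corpus is far too large to pull eagerly.
# Dataset id is assumed from the truncated link in the thread; a config name for the
# web/code/synthetic subsets may also be required.
ds = load_dataset("LLM360/MegaMath", split="train", streaming=True)

# Peek at one record without materializing the corpus.
for example in ds.take(1):
    print(example)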