To mark the 2nd anniversary of LLM360, we are proud to release K2-V2: a 70B reasoning-centric foundation model that delivers frontier capabilities.
Continuing our push for "360-open" transparency, we are releasing not just the weights, but the full recipe: data composition, training code, logs, and intermediate checkpoints.
About K2-V2:
🧠 70B params, reasoning-optimized
🧊 512K context window
🔓 "360-Open" (Data, Logs, Checkpoints)
📈 SOTA on olympiad math and complex logic puzzles
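For anyone who wants to try the released weights, here is a minimal loading sketch with 🤗 transformers. The repo id "LLM360/K2-V2" is an assumption for illustration; check the LLM360 org page on Hugging Face for the actual name.

```python
# Minimal sketch -- assumes a Hugging Face repo id like "LLM360/K2-V2" (hypothetical).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "LLM360/K2-V2"  # hypothetical repo id; check the LLM360 org page

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # 70B params: load in bf16 and shard across available GPUs
    device_map="auto",
)

prompt = "Prove that the sum of two even integers is even."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```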
We evaluated K2 across general knowledge, STEM, coding, and agentic tool use.
The goal? To show open models need not be smaller, weaker versions of closed ones.
K2 outperforms models of similar size and comes close to the performance of larger ones.
A huge thank you to the OSS ecosystem! @huggingface @wandb @github @lmsysorg @AiEleuther @allen_ai @BigCodeProject @PyTorch @nvidia @cerebras @mbzuai and many more.
• • •
We proudly present MegaMath, the largest open-source math reasoning pretraining corpus: 371B tokens of high-quality mathematical web, code, and synthetic data, designed to build the data foundation for next-generation math-proficient LLMs like o1 and R1. 🧵👇 #LLM #OpenSource #MegaMath #Math #Data4LLMs #Pretraining
Mathematical reasoning is a key capability of advanced LLMs. Training math-proficient models like o1 and DeepSeek-R1 requires large-scale, high-quality, diverse math data. Models built on proprietary corpora such as Qwen-2.5-Math (1T tokens) and DeepSeekMath (120B tokens) show strong mathematical ability, but those corpora are closed source, and existing open corpora lack comparable size and quality. MegaMath aims to bridge this gap.
💡 What’s in MegaMath?
MegaMath is a comprehensive 371B-token collection built for top data quality. It is composed of:
📚 279B tokens of math-rich web data
🧑‍💻 28B tokens of math-relevant code
🧠 64B tokens of high-quality synthetic data (QA pairs, translated code, text+code blocks)
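A minimal sketch of streaming one of these subsets with 🤗 datasets follows. The repo id "LLM360/MegaMath" and the config name "megamath-web" are assumptions for illustration; check the dataset card for the actual names.

```python
# Minimal sketch -- repo id and config name are assumed, not confirmed by this thread.
from datasets import load_dataset

# Stream one subset instead of downloading the full ~371B-token corpus up front.
ds = load_dataset(
    "LLM360/MegaMath",    # assumed repo id; check the dataset card
    name="megamath-web",  # assumed config name for the math-rich web subset
    split="train",
    streaming=True,
)

for i, example in enumerate(ds):
    print(example)  # inspect the schema of a single document
    if i >= 2:
        break
```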