Kimi.ai Profile picture
Oct 30 1 tweets 1 min read Read on X
Kimi Linear Tech Report is dropped! 🚀
huggingface.co/moonshotai/Kim…

Kimi Linear: A novel architecture that outperforms full attention with faster speeds and better performance—ready to serve as a drop-in replacement for full attention, featuring our open-sourced KDA kernels! Kimi Linear offers up to a 75% reduction in KV cache usage and up to 6x decoding throughput at a 1M context length.

Key highlights:
🔹 Kimi Delta Attention: A hardware-efficient linear attention mechanism that refines the gated delta rule.
🔹 Kimi Linear Architecture: The first hybrid linear architecture to surpass pure full attention quality across the board.
🔹 Empirical Validation: Scaled, fair comparisons + open-sourced KDA kernels, vLLM integration, and checkpoints.

The future of agentic-oriented attention is here! 💡

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Kimi.ai

Kimi.ai Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Kimi_Moonshot

Jul 11
🚀 Hello, Kimi K2! Open-Source Agentic Model!
🔹 1T total / 32B active MoE model
🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models
🔹Strong in coding and agentic tasks
🐤 Multimodal & thought-mode not supported for now

With Kimi K2, advanced agentic intelligence is more open and accessible than ever. We can't wait to see what you build!

🔌 API is here: platform.moonshot.ai
- $0.15 / million input tokens (cache hit)
- $0.60 / million input tokens (cache miss)
- $2.50 / million output tokens

🔗 Tech blog: moonshotai.github.io/Kimi-K2/
🔗 Weights & code: huggingface.co/moonshotai
🔗 Github: github.com/MoonshotAI/Kim…
Try it now at Kimi.ai or via API!Image
Here are some vibe tests we ran:

1. Interactive 3D Mountain Scene
2. A ball bouncing in hexagon
Read 6 tweets
Jun 20
Meet Kimi-Researcher - an autonomous agent that excels at multi-turn search and reasoning. Powered by k 1.5 and trained with end-to-end agentic RL.

Achieved 26.9% pass@1 on Humanity's Last Exam, 69% pass@1 on xbench.

🔗 Tech blog: moonshotai.github.io/Kimi-Researche…Image
Benchmarks aside, It thinks:
→ 23 reasoning steps per task (avg.)
→ 200+ URLs explored
→ Multi-turn tool use of search, browser, and code
→ Inline citations

Beta access is rolling out at kimi.com — get on the waitlist 👉 [docs.google.com/forms/d/e/1FAI…]
Join the discussion & share feedback in our Discord.👉


To facilitate more research efforts in the field, we are planning on open-sourcing the base pretrained model as well as the reinforcement-learned model underlying Kimi-Researcher in the following months.discord.gg/uGqNmXhNhM
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(