β¨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context.
π Now live on App, Web, and API.
π° API prices cut by 50%+!
1/n
β‘οΈ Efficiency Gains
π€ DSA achieves fine-grained sparse attention with minimal impact on output quality β boosting long-context performance & reducing compute cost.
π Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
2/n
Aug 21 β’ 5 tweets β’ 3 min read
Introducing DeepSeek-V3.1: our first step toward the agent era! π
π§ Hybrid inference: Think & Non-Think β one model, two modes
β‘οΈ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528
π οΈ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks
Try it now β toggle Think/Non-Think via the "DeepThink" button: chat.deepseek.com
1/5
API Update βοΈ
πΉ deepseek-chat β non-thinking mode
πΉ deepseek-reasoner β thinking mode
π§΅ 128K context for both
π Anthropic API format supported: api-docs.deepseek.com/guides/anthropβ¦
β Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/functioβ¦
π More API resources, smoother API experience
2/5
Jan 20 β’ 5 tweets β’ 3 min read
π DeepSeek-R1 is here!
β‘ Performance on par with OpenAI-o1
π Fully open-source model & technical report
π MIT licensed: Distill & commercialize freely!
π Website & API are live now! Try DeepThink at today!
π¬ Distilled from DeepSeek-R1, 6 small models fully open-sourced
π 32B & 70B models on par with OpenAI-o1-mini
π€ Empowering the open-source community
π Exciting news! Weβve officially launched DeepSeek-V2.5 β a powerful combination of DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724! Now, with enhanced writing, instruction-following, and human preference alignment, itβs available on Web and API. Enjoy seamless Function Calling, FIM, and Json Output all-in-one!
Note: Due to significant updates in this version, if performance drops in certain cases, we recommend adjusting the system prompt and temperature settings for the best results!
DeepSeek-V2.5 outperforms both DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724 on most benchmarks.