Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
8 subscribers
Aug 21 β’ 5 tweets β’ 3 min read
Introducing DeepSeek-V3.1: our first step toward the agent era! π
π§ Hybrid inference: Think & Non-Think β one model, two modes
β‘οΈ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528
π οΈ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks
Try it now β toggle Think/Non-Think via the "DeepThink" button: chat.deepseek.com
1/5
API Update βοΈ
πΉ deepseek-chat β non-thinking mode
πΉ deepseek-reasoner β thinking mode
π§΅ 128K context for both
π Anthropic API format supported: api-docs.deepseek.com/guides/anthropβ¦
β Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/functioβ¦
π More API resources, smoother API experience
2/5
Jan 20 β’ 5 tweets β’ 3 min read
π DeepSeek-R1 is here!
β‘ Performance on par with OpenAI-o1
π Fully open-source model & technical report
π MIT licensed: Distill & commercialize freely!
π Website & API are live now! Try DeepThink at today!
π¬ Distilled from DeepSeek-R1, 6 small models fully open-sourced
π 32B & 70B models on par with OpenAI-o1-mini
π€ Empowering the open-source community
π Exciting news! Weβve officially launched DeepSeek-V2.5 β a powerful combination of DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724! Now, with enhanced writing, instruction-following, and human preference alignment, itβs available on Web and API. Enjoy seamless Function Calling, FIM, and Json Output all-in-one!
Note: Due to significant updates in this version, if performance drops in certain cases, we recommend adjusting the system prompt and temperature settings for the best results!
DeepSeek-V2.5 outperforms both DeepSeek-V2-0628 and DeepSeek-Coder-V2-0724 on most benchmarks.