You can download it and run the model locally via Ollama/LM Studio, or host it on platforms like Fireworks AI, Groq, etc. that support Qwen3 models. 2/
We used Dr. GRPO on Qwen3-4B given existing function calling abilities, w/ VeRL + SGLang (i.e. training for multi-turn tool-calling). As seen above, the results are pretty good! 3/
If you end up using the model, let us know what you think!
And if you’re interested in a continuously improving custom version (for this model or any open source model), please reach out 🙂 4/