Post

Kasey Zhang

@_WEEXIAO

May 8 • 5 tweets • 2 min read • Read on X

We used RL to train a model for MCP!

Connect any MCP client to any MCP server - you can run MCP workflows fully with local models (+ tune it further).

It works with Ollama / any MCP client that supports Qwen3 models - download it below 👇1/

HF link: huggingface.co/osmosis-ai/osm…

Quickstart example link: github.com/Gulp-AI/Osmosi…

You can download it and run the model locally via Ollama/LM Studio, or host it on platforms like Fireworks AI, Groq, etc. that support Qwen3 models. 2/

We used Dr. GRPO on Qwen3-4B given existing function calling abilities, w/ VeRL + SGLang (i.e. training for multi-turn tool-calling). As seen above, the results are pretty good! 3/

If you end up using the model, let us know what you think!

And if you’re interested in a continuously improving custom version (for this model or any open source model), please reach out 🙂 4/

https://x.com/willccbb/status/1917286908192870841

ty @willccbb for the idea :)

https://x.com/willccbb/status/1917286908192870841

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

Share this page!

Enter URL or ID to Unroll

Kasey Zhang

Try unrolling a thread yourself!

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!