Cua Profile picture
Cua
Open-source infrastructure for Computer-Use Agents // YC X25
Dec 16 6 tweets 3 min read
Today we're announcing cua-bench: a framework for benchmarking, training data, and RL environments for computer-use AI agents. Why? Current agents show 10x variance across minor UI changes. Here's how we're fixing it. 1/ The problem: Claude Computer-Use, OpenAI CUA, and Gemini 2.5 Computer-Use show 10x performance difference across minor UI changes.

Completes tasks on a clean desktop, fails when windows overlap. Works on default themes, breaks on high contrast mode.

The root cause: training data lacks visual diversity.Image
Jun 1 6 tweets 2 min read
Introducing App-Use - create virtual desktops for AI agents to focus on specific apps.

1/6 App-Use lets you scope agents to just the apps they need. Instead of full desktop access, say "only work with Safari and Notes" or "just control iPhone Mirroring" - visual isolation without new processes for perfectly focused automation.

2/6
May 29 6 tweets 2 min read
Launching public preview of C/ua Cloud Containers today - Docker for Computer-Use Agents. Zero local setup. Same Computer and Agent interfaces. Scale 1-100 agents instantly.

1/6 Image First cloud platform built for Computer-Use Agents. Open-source backbone. Linux/Windows/macOS desktops in your browser. Works with OpenAI, Anthropic, or any LLM. Pay only for compute time.

2/6
May 2 5 tweets 1 min read
Want to help make AI better at using computers? We just released a guide on creating human trajectory datasets with C/ua Image Our new @Gradio UI makes it easy to record, organize and share high-quality trajectories of how humans navigate interfaces, recover from errors, and complete real tasks.
Apr 10 7 tweets 1 min read
Cua loves MCP. Launching official MCP Server support today - enabling our Computer-Use Agent to run through Claude Desktop, Cursor, and other MCP clients. An example use case? How about using Claude as a tutor to learn how to use Tableau?