Windsurf Profile picture
Jul 16 2 tweets 1 min read Read on X
Claude Sonnet 4 is back via first party support from @AnthropicAI!

Available at 2x credits (limited time discount) per request for Pro and Teams users. That’s 250 requests a month! Image
Try it now: windsurf.com

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Windsurf

Windsurf Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @windsurf_ai

May 16
To train SWE-1, we had to create a data model and training recipe that took all of the complex states, tasks, and surfaces into consideration.

We then ran evals and experiments to evaluate performance against open and foundation models.

Here's what we did ↴
First, we evaluated how well the model could handle a user query mid-session.

Seamless collaboration with users on partially completed tasks is a crucial benchmark for model usefulness.

SWE-1 achieves near-parity with frontier models in helpfulness, accuracy, and edit quality. Image
We then measured the ability of the model to independently solve a problem end to end.

From a new conversation, how well does Cascade address input intent by passing a set of tests?

SWE-1 competes with frontier models, and surpasses mid-sized and open-weight alternatives. Image
Read 6 tweets
Apr 30
We asked our devs at Windsurf to share their thoughts on their favorite models and what they actually use them for.

Read their answers in the thread ↓
3.7. Sonnet:

It’s proactive and confident but can do too much at times. Regardless, it is generally seen as the most capable.

“3.7 is just super agentic and eager to use tools and do things. I prefer stopping an over-eager model vs. coaxing an under-eager one.”
Gemini 2.5 Pro:

Preferred for tasks that require clean, structured responses.

It’s less proactive than Claude 3.7, but more consistent and less likely to introduce unrelated or duplicate code.

“Its code quality is similar to Sonnet 3.7, but it’s more consistent.”
Read 6 tweets
Apr 8
Here our some of our favorite tips and tricks from the @windsurf_ai community!

Bookmark this and thank yourself later ↓
Slow Vibe Coding: Think, Plan, Prompt, Review, Validate and Start Again
Keep your prompts clear and focused. Start a fresh chat as you start a new task.
Read 9 tweets
Feb 23
Let's discuss how Large Language Models (LLMs) handle codebase structure and parsing, and what makes Windsurf particularly cracked in this area.

While most AI code tools treat code as unstructured text, Windsurf leverages Abstract Syntax Trees (ASTs) to comprehend code at the syntactic level.

Here's why this results in faster, more accurate suggestions: 🧵👇
Unlike other tools that rely on embedding indexes—a one-size-fits-all retrieval method that doesn't scale well for large repos—Windsurf's agent employs strategies akin to human developers to locate necessary context:

- Grep and file search
- File relation traversal (e.g., AST parsing)
- Web search and online documentation
- Parallel LLM-based searches

This approach ensures efficient and scalable context retrieval.
What is an AST?

During compilation, code is parsed into an Abstract Syntax Tree—a hierarchical representation of the code's syntax.

This structure allows extraction of scopes, variable bindings, and function definitions—elements that text-based models might overlook.
Read 7 tweets
Nov 17, 2024
Copilots + Agents = Flows

The reason why Cascade feels like magic is because it combines the collaborative nature of copilots with the independently powerful capacity of agents.

Both Copilots and Agents are valuable, but not as much as Flows.

Let's break this down 🧵
Before the year 2022, humans and keyboards worked in unison, and code development was done completely manually. Every single line of code was a direct result of human input.
In 2022, LLM’s took the world by storm and Copilots were introduced. If you started typing out a line, it would suggest a completion. Or if you asked a question, you would receive an answer.

But they worked on scoped tasks because of single LLM calls.
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(