It's a multi-agent AI system built with Gemini 2.0 to help accelerate scientific breakthroughs.
2025 is truly the year of multi-agents!
Let's break it down:
What's the goal of this AI co-scientist?
It can serve as a "virtual scientific collaborator to help scientists generate novel hypotheses and research proposals, and to accelerate the clock speed of scientific and biomedical discoveries."
How is it built?
It uses a coalition of specialized agents inspired by the scientific method.
Many devs ask me which LLMs work best for AI agents.
The new Agent Leaderboard (by @rungalileo) was built to provide insights and evaluate LLMs on real-world tool-calling tasks—crucial for building AI agents.
Let's go over the results:
1️⃣ Leader
Here are the key findings from evaluating 17 leading LLMs across 14 diverse datasets:
Google's Gemini-2.0-flash leads with a 0.94 score at a remarkably low cost.
2️⃣ Pricing
The top 3 models span a 10x price difference with only a 4% performance gap. Many of you might be overpaying.
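Want to run the same price-vs-performance check on your own shortlist? Here's a minimal sketch. Everything in it is an assumption for illustration: the model names, scores, and prices are placeholders rather than leaderboard data, `ModelResult` and `price_vs_performance` are hypothetical helpers, and the gap is measured in absolute score points.

```python
from dataclasses import dataclass


@dataclass
class ModelResult:
    name: str
    score: float           # benchmark score in [0, 1]
    price_per_mtok: float  # USD per million tokens (placeholder values below)


def price_vs_performance(results, top_n=3):
    """Return (price_ratio, score_gap) across the top_n models by score."""
    top = sorted(results, key=lambda r: r.score, reverse=True)[:top_n]
    prices = [r.price_per_mtok for r in top]
    price_ratio = max(prices) / min(prices)   # most expensive vs. cheapest
    score_gap = top[0].score - top[-1].score  # best vs. worst, in score points
    return price_ratio, score_gap


# Placeholder numbers, NOT real leaderboard results or real prices.
results = [
    ModelResult("model-a", 0.94, 0.5),
    ModelResult("model-b", 0.92, 5.0),
    ModelResult("model-c", 0.90, 2.0),
    ModelResult("model-d", 0.80, 1.0),
]

ratio, gap = price_vs_performance(results)
print(f"price ratio: {ratio:.1f}x, score gap: {gap:.2f}")
```

Swap in the scores and prices you actually care about. If the ratio is large and the gap is small, the cheaper model is probably worth a trial on your own tasks.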