AI Agents, RAG, and LLMs just had their biggest week.
And the releases are mind-blowing.
Here's everything you need to know:
1. DeepSeek V3 Base model shows impressive performance
• 671B-parameter Mixture-of-Experts model
• 37B parameters activated per token
• Reported to outperform Claude 3.5 Sonnet, GPT-4, and Llama 3 on several benchmarks
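
To make the "activated per token" figure concrete, here is a minimal Python sketch. The two parameter counts come from the bullets above; the expert count, router scores, and the `top_k_route` helper are hypothetical and chosen only for illustration, not taken from DeepSeek's actual configuration.

```python
# Figures quoted in the post; everything else below is illustrative.
TOTAL_PARAMS_B = 671   # total parameters, in billions
ACTIVE_PARAMS_B = 37   # parameters activated per token, in billions

# Share of the model that does work for any single token.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
print(f"Active per token: {active_fraction:.1%}")  # prints "Active per token: 5.5%"


def top_k_route(router_scores, k):
    """Toy top-k routing: return the indices of the k highest-scoring experts."""
    ranked = sorted(
        range(len(router_scores)),
        key=lambda i: router_scores[i],
        reverse=True,
    )
    return sorted(ranked[:k])


# Hypothetical router scores for 8 experts; only 2 of them run for this token.
scores = [0.1, 0.7, 0.05, 0.3, 0.9, 0.2, 0.15, 0.4]
chosen = top_k_route(scores, k=2)
print(chosen)  # prints [1, 4]
```

The point of the sketch: a Mixture-of-Experts model stores many experts but runs only a few per token, which is how a 671B model can do roughly 5.5% of its full compute per token.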
2. OASIS launches social media simulator
• Simulates interactions among up to 1 million AI agents
• Studies agent behavior on Twitter-like and Reddit-like platforms
• Analyzes information spread and group dynamics
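
For a rough feel of what "information spread" means in an agent simulation, here is a toy Python sketch. It is not OASIS's API: the `simulate_spread` function, the random follower graph, and the fixed share probability are all assumptions made for illustration.

```python
import random


def simulate_spread(num_agents, follows_per_agent, share_prob, steps, seed=0):
    """Toy cascade: each step, informed agents may pass a post to their followers.

    Returns the number of informed agents after each step (index 0 is the start).
    """
    rng = random.Random(seed)  # seeded so runs are repeatable

    # followers[a] lists the agents who see what agent a posts (hypothetical graph).
    followers = {
        a: rng.sample([b for b in range(num_agents) if b != a], follows_per_agent)
        for a in range(num_agents)
    }

    informed = {0}  # agent 0 posts the original message
    history = [len(informed)]
    for _ in range(steps):
        newly_informed = set()
        for agent in informed:
            for follower in followers[agent]:
                # Each exposure converts with a fixed probability.
                if follower not in informed and rng.random() < share_prob:
                    newly_informed.add(follower)
        informed |= newly_informed
        history.append(len(informed))
    return history


history = simulate_spread(num_agents=200, follows_per_agent=5, share_prob=0.3, steps=10)
print(history)
```

A real simulator would replace the fixed probability with LLM-driven agents deciding whether to like, repost, or ignore; the bookkeeping of who has seen what stays similar.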
3. Cheshire Cat AI is an open-source framework for production AI agents
• Built with Docker at its core
• Processes documents and connects to external APIs
• Supports both commercial and open-source LLMs
4. Humanloop releases comprehensive RAG evaluation suite
• Automated evaluations through code or UI
• Human feedback collection capabilities
• Real-time performance monitoring
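
As an idea of what an automated, code-based RAG evaluation can look like, here is a minimal Python sketch of recall@k over a tiny evaluation set. It does not use Humanloop's SDK; the `recall_at_k` function, the document IDs, and the data layout are hypothetical.

```python
def recall_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of the relevant documents that appear in the top-k retrieved results."""
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & set(relevant_ids)) / len(relevant_ids)


# Hypothetical evaluation set: retriever output vs. hand-labeled relevant documents.
eval_set = [
    {"retrieved": ["d3", "d1", "d7"], "relevant": ["d1", "d2"]},
    {"retrieved": ["d4", "d5", "d6"], "relevant": ["d5"]},
]

scores = [recall_at_k(row["retrieved"], row["relevant"], k=2) for row in eval_set]
mean_recall = sum(scores) / len(scores)
print(scores)       # prints [0.5, 1.0]
print(mean_recall)  # prints 0.75
```

Retrieval metrics like this cover only half of a RAG pipeline; answer quality usually needs a separate evaluator or the human feedback mentioned above.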
5. Hercules launches, billed as the first open-source testing agent
• Converts Gherkin steps to end-to-end tests
• No coding skills required
• Handles complex testing automation
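
For readers who have not seen Gherkin, the Python sketch below shows the kind of plain-language steps such a tool takes as input and how they break down into structured steps. The `parse_steps` helper and the sample scenario are hypothetical; this is not how Hercules itself is implemented.

```python
# Standard Gherkin step keywords.
GHERKIN_KEYWORDS = ("Given", "When", "Then", "And", "But")


def parse_steps(scenario_text):
    """Split a Gherkin scenario into (keyword, step text) pairs."""
    steps = []
    for raw_line in scenario_text.splitlines():
        line = raw_line.strip()
        for keyword in GHERKIN_KEYWORDS:
            if line.startswith(keyword + " "):
                steps.append((keyword, line[len(keyword) + 1:]))
                break
    return steps


# Hypothetical scenario written by a non-programmer.
scenario = """
Scenario: Successful login
  Given the user is on the login page
  When the user submits valid credentials
  Then the dashboard is displayed
"""

steps = parse_steps(scenario)
for keyword, text in steps:
    print(f"{keyword}: {text}")
```

A testing agent would then map each step to browser or API actions; that mapping is the part the agent automates.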