Alvaro Cintas Profile picture
Educating about AI, Cybersecurity and Technology | Professor | PhD in Computer Science & Engineering | 👨‍🏫@therundownai
16 subscribers
Jun 1 7 tweets 3 min read
Genspark AI just released the world’s first full Agentic Download Agent.

You can now ask to download anything and the agent automatically searches, downloads, and organizes files instantly.

5 powerful use cases + how to try👇:

1. Download all papers from a LinkedIn post 2. “Download this Instagram video: [URLs]”
May 30 11 tweets 4 min read
What a crazy week in AI 🤯

- Mistral AI Agents
- Claude Voice mode
- Google SignGemma
- Factory AI SWE Agents
- Perplexity launches Labs
- Flux.1 Image Editing Kontext
- SpAItial AI Foundation Models
- Opera’s first AI Agentic Browser

Here’s EVERYTHING you need to know: 1. Mistral releases Agents API with built-in code execution, web search, and image generation.

Web search boosted accuracy from 23% to 75% on benchmarks.
May 26 6 tweets 3 min read
🚨 Freepik just dropped its new AI Assistant.

You can now generate images, edit using GPT-4o, upscale, and convert them into video all in one place.

It also lets you pick color palettes, styles, and ready-made workflows.

Here’s how: 1. Freepik Assistant has so many different tools in one place. 

From AI image generation to editing to AI videos.

To access it, head over to: freepik.com/pikaso
May 25 11 tweets 3 min read
Google Veo 3 is a major shift.

People are sharing tons of new crazy AI videos.

Here are some of the best ones🧵:

1. AI characters realize they can talk 2. 3D animation done all by AI
May 20 11 tweets 4 min read
Tons of crazy releases from Google I/O 🤯

- Agent Mode
- Google Veo 3
- Jules Code Assistant
- AI Filmmaking Tool Flow
- Project Astra live abilities
- AI Image Google Imagen 4
- Think mode & native audio
- Gemini 2.5 new capabilities

Here’s EVERYTHING you need to know: 1. Agent Mode in the Gemini App

It lets you delegate complex planning and tasks to Gemini to get stuff done.
May 12 7 tweets 3 min read
The first AI design agent just dropped.

It combines major models for image, video, and 3D asset generation all in one place.

You can also edit the images, generate music, and voiceovers!

Step-by-step tutorial 👇 The Design Agent is called @lovart_ai and you can access it here:

Once there, you’ll notice a nice UI with a centered chat box. Describe your design there to enter into the talk, tab, and tune workflow. lovart.ai
May 11 9 tweets 4 min read
These might be the best guides on:

- Prompt Engineering
- Building Agents
- AI integration strategies
- Working with AI

So much free value by OpenAI, Anthropic, and Google.

All the links below. Image 1. Prompting Guide by Google

🔗 services.google.com/fh/files/misc/…Image
May 8 6 tweets 3 min read
The new HeyGen Avatar IV model is crazy.

You can now produce unlimited content with just one photo, a script, and a cloned voice. It does:

• Head tilts
• Pauses
• Expressions

I already saw accounts with +1M views using this method 🤯

Here’s how (explained by my AI avatar): ✍️ Written Steps

Head over to:

Select “Photo to Video with Avatar IV” HeyGen.comImage
May 5 13 tweets 4 min read
🚨BREAKING: New top AI agent just launched.

It’s called Fellou, and it combines deep research, browser use, and workflow automation, all in one place.

10 powerful use-cases (plus link to beta test): Image 1. Product Hunt to Notion

Add names and intros of the top 8 Product Hunt products to the open Notion page
Apr 25 10 tweets 4 min read
What a crazy week in AI 🤯

- Grok’s AI Vision
- Genspark AI Slides
- Perplexity Assistant
- OpenAI gpt-image-1
- Tavus SoTA lipsync model
- Dia SoTA speech AI model
- Dreamina AI's Top AI Image
- ChatGPT Deep Research Mini

Here's EVERYTHING you need to know: 1. Grok Vision launches with multimodal capabilities, letting users point their phone cameras at objects or environments for real-time analysis.

This comes alongside multilingual audio support and real-time search capabilities.
Apr 24 4 tweets 2 min read
This is such a powerful AI coding workflow.

You can now use Cursor or Windsurf for coding, and let CodeRabbit’s AI agent refactor messy code, help you debug, and find security vulnerabilities.

Here’s how: 1. Visit , click "Sign in with GitHub”, and authorize CodeRabbit coderabbit.ai
Apr 23 12 tweets 4 min read
Genspark might be the best AI agent I’ve tried yet.

It can agentically conduct research, create web pages, generate videos, and even have the AI call and make reservations for you.

10 powerful use cases: 2. AI Slides (@genspark_ai just launched this)

This Super Agent conducts research and creates polished slides and icons, and users can also edit them. 

To try it out, you can go here: genspark.ai
Apr 23 8 tweets 3 min read
🚨BREAKING: Top AI image model just launched.

It’s called Seedream 3.0 by Dreamina AI and it ranks #1 at creating photorealistic images up to 2k resolution.

Dreamina AI can also upscale, inpaint, expand and even generate videos.

It’s now fully available. Link to try free below Image This is the link to access @dreamina_ai:

First, select where it says “Image Generator” dreamina.capcut.com/ai-tool/home/?…
Apr 21 11 tweets 4 min read
AI videos are getting scary good.

So I tested some of the hardest prompts for all the major leading models:

• Kling 2.0
• Sora
• Runway Gen-4
• Google Veo 2

10 side-by-side examples: 2. A lion driving an open jeep in the tanzania safari
Apr 17 12 tweets 4 min read
What a crazy week in AI 🤯

- Kling 2.0 AI video
- Canva Visual Suite 2.0
- Microsoft Copilot Vision
- Grok Studio and Memories
- ChatGPT 4.1, o3, & o4-mini
- OpenAI’s new coding agent
- ByteDance Seaweed AI video
- Claude Autonomous Research

Here’s EVERYTHING you need to know: 1. Kling AI released its new model Kling 2.0.

This launch features improved prompt understanding, enhanced character motion dynamics for more natural fluid movements, and a Multi-Elements Editor for easier video editing.
Apr 14 6 tweets 3 min read
🚨 BREAKING: OpenAI just launched the GPT-4.1 family of models.

New benchmarks, bigger context windows, and the first-ever nano model.

Here’s everything you need to know: Image OpenAI is rolling out 3 new models via API:

- GPT‑4.1
- GPT‑4.1 mini
- GPT‑4.1 nano

Each one beats GPT-4o and GPT-4o mini across the board, especially on coding, instruction following, and long-context tasks. Image
Apr 11 10 tweets 3 min read
What a wild week in AI 🤯

- Google AI Agents
- Meta Llama 4 models
- AI 2027 forecast report
- Amazon AI Voice model
- Gemini 2.5 Deep Research
- ChatGPT memory upgrade
- Firebase Studio rivals Cursor
- Nvidia/Stanford 1-min AI cartoons

Here’s everything you need to know: 1. Google introduces Agent2Agent (A2A) protocol for AI interoperability.

It enables AI agents from different vendors to communicate and collaborate seamlessly.
Apr 9 5 tweets 2 min read
🚨 BREAKING: Google just announced Agent2Agent.

This protocol enables AI agents to communicate across platforms regardless of framework or vendor.

Here’s how it works: A2A facilitates communication between "client" and "remote" agents through four key capabilities:

Secure Collaboration, Task Management, User Experience Negotiation, and Capability Discovery

All built popular standards like HTTP, JSON-RPC standards with enterprise auth. Image
Apr 4 10 tweets 3 min read
What a wild week in AI 🤯

- Midjourney v7
- Runway Gen-4
- Apple AI Health Coach
- Lindy AI Agent Swamps
- Amazon Browser Agent
- LLMs passing the Turing Test
- Meta MoCha AI talking characters
- AI brain signal to speech breakthrough

Here’s everything you need to know: Midjourney just released v7, the latest upgrade to the AI art generator loved by creatives.

Its new “Draft Mode” slashes costs by 50% and speeds up generation 10x, perfect for quick sketches.
Mar 27 14 tweets 5 min read
What a wild week in AI 🤯

- Reve Image
- Ideogram 3.0
- Qwen new models
- ARC-AGI-2 launch
- Alibaba LHM model
- Microsoft Researcher
- Google Gemini 2.5 Pro
- Perplexity Answer Tabs
- DeepSeek’s V3 AI model
- OpenAI’s Image Generator

Here’s everything you need to know: 1. Reve has launched Reve Image 1.0.

Fresh out of stealth, it has claimed the top spot in global image model rankings and outperforming big names like Midjourney and Google’s Imagen.

It provides stunning photorealism, best-in-class prompt accuracy, and wild text rendering.
Mar 26 11 tweets 3 min read
OpenAI’s native Al image generation is insane.

The image and text quality are so good that it has unlocked unlimited possibilities.

10 crazy use cases:

1. Thumbnail maker Image 2. Create product marketing images Image