Gemini’s Stream Realtime has unlocked so many incredible use cases.
You now have an AI assistant that can see your screen and chat with you in real time to help you learn, work, and research faster.
7 powerful ideas:
1. Research Assistant
- Highlight a dense paragraph from a white paper and ask for a non-technical summary.
- Hover over a tricky term, formula, or diagram and ask for a simple explanation.
- Open multiple tabs on a research topic and ask it to synthesize the key points side by side.
2. Learning New Software
- Ask for help navigating an unfamiliar menu.
- Ask how to locate or enable a hidden setting, e.g., "Draw" in Word.
- Hover over a complex toolbar icon and see how Gemini interprets its function in real time.
3. Interactive Troubleshooting and Instant Feedback Loop
- Run a piece of code, show the error, and ask for a likely root cause.
- Attempt a quick fix and ask if it sees any remaining issues.
- Share your code editor and ask for help, e.g., "how to create a Next.js project".
4. Live Document Editing
- Write a paragraph and ask for improvements.
- Ask for alternative headlines or titles for a section you're drafting.
- Go through your document and ask for synonyms or paraphrased sentences.
5. On-the-Fly Translation
- Open a web page in a foreign language and ask for a real-time translation.
- Present an idiomatic expression and request a culturally accurate interpretation.
- Share a post and ask for clarification.
6. Collaborative Brainstorming
- "Give me ideas to improve my website."
- "This is my PowerPoint presentation on AI, should I add a note to this slide regarding...?"
7. Content Creation
- Share a video/post and ask for improvements.
- Show it a bunch of videos and ask for video ideas based on your content.
Bonus: Use Grounding for up-to-date information.
Make sure to say “Search the Internet to find X.”
Follow me @dr_cintas for more practical AI tutorials.
These types of posts take a bit of time to make, so if you enjoyed this one, consider leaving a like or repost on the post below :)
- Midjourney V1 Video
- ChatGPT Record Mode
- Higgsfield new AI Canvas
- Claude Code MCP Servers
- Google Search Live AI Mode
- MIT Study ChatGPT’s Impact
- MiniMax M1 model & AI Agent
- Tencent open-source 3D model
Here’s EVERYTHING you need to know:
1. Midjourney enters the video game with V1, its first AI video model, which turns any image into 5-second clips.
Users can also extend videos up to 20 seconds, and it's available on the $10/month plan starting now.
2. OpenAI launches Record Mode, a new meeting assistant that records, transcribes, and summarizes your conversations.
Available on ChatGPT Pro, Team, Enterprise and Edu for Mac users.
Other updates include: OpenAI podcast, Image Generation on WhatsApp, and Canvas Downloads.