Google and OpenAI are at war. Just days apart, they unveiled two insanely powerful multimodal AIs: Gemini 1.5 and GPT-4o.
Who has the best LLM? We threw an emergency hackathon to find out.
Here are the finalists from the @solarissociety ⚔️ GPT-4o vs. Gemini 1.5 Hackathon (🧵):
1/ Elevator - 🏆 Best use of Gemini 1.5
Steve Jobs AI powered by Gemini 1.5 multimodal audio (no speech-to-text at all) that gives feedback on elevator pitches.
Extremely detailed breakdown. Even understands tone, passion, and conviction
@__shubhankar
2/ Generative UX
Interactive travel agent that connects user preferences with trip guides, bookings, photos, cost planners, and more
@_bprimal_
3/ OpenRabbit - 🏆 Best use of Computer Vision @roboflow
Open-source Rabbit R1 with local hardware. It can:
- Create music with Suno
- Send emails
- Curate playlists
And more
A better R1 but built in 24 hours
@notionsmith @sachscode @rama41296
4/ Synthify
Use GPT-4o and Gemini 1.5 to generate synthetic datasets for smaller machine learning tasks
Cheap dataset curation
@saurishhh
5/ George - 🏆 Best use of GPT-4o
AI agent to negotiate with food vendors and suppliers on WhatsApp marketplaces
Saving restaurants hundreds of dollars per month sourcing cheaper, higher quality ingredients
@_AbdullahNauman @apostoliev @arjundabra @jadroy2 @taylorzhuAI
6/ RoboDebate Battle 2024
GPT-4o and Gemini 1.5 debate in front of a live audience, but the audience doesn’t know which is which.
Audience votes on the winner
7/ Engineer-4o - 🏆 Best in Show
VS Code extension agent that automatically tracks and solves codebase issues
Built with @pyautogen
@ishaandey_ @ElijahKurien
8/ Glowby Debates
Live debate between GPT-4o and Gemini 1.5
@jacobilin
9/ HotAgents
Desktop agent that takes screenshots and figures out what tools to call automatically. It can summarize text, write code, and more
All with one hotkey
Powered by @wordware_ai
@crsamra @kevinydzhu @avery_chiu @ariaxhan
10/ Turbot
AI agent that combines user requests with GPS location to plan uniquely personalized tour experiences
@kellyhongsn @ten_lukasz
11/ Thought Partner
AI assistant that listens to your notes, creates a live editable transcript, and sends you an email summary
12/ Halo AI
A better Siri: voice assistant for managing todos and calendar events
13/ Robodog
Computer-vision-powered robot trained to identify objects
@sebheyneman
14/ Twitter Times
Turn your Twitter feed into a newspaper
Web scraper opens a browser, parses your feed, and turns it into a one-pager.
@itzelleyy
15/ AI Yapper
AI voice clone that syncs with your personal information to become a clone of you
@markokraemer
Huge thanks @thomasschulzz @jacobshamberger @solarissociety + @AgentOpsAI @wordware_ai @logicrite @AnonPlatform @browserbasehq
+ judges @mickeyxfriedman @rajko_rad @rememberlenny @josephofiowa @pk_iv @kwindla @AustinPeirson @luishectorcha @jtvhk
More AI? Follow @AlexReibman
