Alex Reibman 🖇️ Profile picture
Hyper Engineering @agentopsai Vibes @cerebral_valley Hack reporter

May 20, 2024, 19 tweets

Google and OpenAI are at war. Just days apart, both unveiled 2 insanely powerful multimodal AIs— Gemini 1.5 and GPT-4o.

Who has the best LLM? We threw an emergency hackathon to find out.

Here are the finalists from the @solarissociety ⚔️ GPT-4o vs. Gemini 1.5 Hackathon (🧵):

1/ Elevator - 🏆 Best use of Gemini 1.5

Steve Jobs AI powered by Gemini 1.5-multimodal audio (no speech to text at all) that gives feedback on elevator pitches.

Extremely detailed breakdown—Even understands tone, passion, and conviction
@__shubhankar

2/ Generative UX

Interactive travel agent that connects user preferences with trip guides, bookings, photos, cost planners, and more

@_bprimal_

3/ OpenRabbit - 🏆 Best use of Computer Vision @roboflow

Open sourced Rabbit R1 with local hardware. It can:
- Create music with Suno
- Send emails
- Curate playlists
And more

A better R1 but built in 24 hours

@notionsmith @sachscode @rama41296

4/ Synthify

Use GPT-4o and Gemini 1.5 to generate synthetic datasets for smaller machine learning tasks

Cheap dataset curation

@saurishhh

5/ George - 🏆 Best use of GPT-4o

AI agent to negotiate with food vendors and suppliers on WhatsApp marketplaces

Saving restaurants hundreds of dollars per month sourcing cheaper, higher quality ingredients

@_AbdullahNauman @apostoliev @arjundabra @jadroy2 @taylorzhuAI

6/ RoboDebate Battle 2024

GPT-4o and Gemini 1.5 debate in front of a live audience— but the audience doesn’t know who is who.

Audience votes on the winner

7/ Engineer-4o - 🏆 Best in Show

VS code extension agent to automatically track issues and solve codebase issues

Built with @pyautogen

@ishaandey_ @ElijahKurien

8/ Glowby Debates

Live debate between GPT-4o and Gemini 1.5

@jacobilin

9/ HotAgents

Desktop agent that takes screenshots and figures out what tools to call automatically. It can summarize text, write code, and more

All with one hot key

Powered by @wordware_ai

@crsamra @kevinydzhu @avery_chiu @ariaxhan

10/ Turbot

AI agent that combines user requests with GPS location to plan uniquely personalized tour experiences

@kellyhongsn @ten_lukasz

11/ Thought Partner

AI assistant that listens to your notes, creates a live editable transcript, and sends you an email summary

12/ Halo AI

Better Siri voice assistant for managing todos and calendar events

13/ Robodog

Computer vision powered robot trained to identify objects

@sebheyneman

14/ Twitter Times

Turn your Twitter feed into a newspaper

Web scraper opens a browser, parses your feed, and turns it into a 1 pager.

@itzelleyy

15/ AI Yapper

AI voice clone that synchronizes with your personal information and becomes a synchronized clone of you

@markokraemer

Huge thanks @thomasschulzz @jacobshamberger @solarissociety + @AgentOpsAI @wordware_ai @logicrite @AnonPlatform @browserbasehq

+ judges @mickeyxfriedman @rajko_rad @rememberlenny @josephofiowa @pk_iv @kwindla @AustinPeirson @luishectorcha @jtvhk

More AI? Follow @AlexReibman

@pyautogen @ishaandey_ @ElijahKurien here’s the video

7/ Engineer-4o - 🏆 Best in Show
(Reposting because upload didn't work)

VS code extension agent to automatically track issues and solve codebase issues. Built with @pyautogen

@ishaandey_
@ElijahKurien

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling