Shubham Sharma
@HappyyPablo
likes to reason with humans | lives at @babayagalabs | IIT Bombay '23
May 19 • 6 tweets • 2 min read
open sourcing Marlin-2B 🐟
a tiny VLM to extract structured information from videos
Marlin is finetuned for the two questions devs want to ask about their videos: what is happening, and when?
Best open model in its weight class, competitive with Gemini-2.5-flash at only 2B params 🧵
Marlin was trained on two modes:
1. marlin.caption() returns a structured Scene + Events JSON with second-precise timestamps.
You can use it to caption Instagram Reels, index a video library, or give your agent context on what happened and when in a video feed.
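A rough sketch of how the caption output could be consumed downstream. The thread doesn't show the actual schema, so the field names here (`scene`, `events`, `start`, `end`, `description`), the sample payload, and the `events_at` helper are all assumptions for illustration, not Marlin's real output format:

```python
import json

# Hypothetical Scene + Events payload. The real marlin.caption() schema
# is not shown in the thread; every field name below is an assumption.
SAMPLE = """
{
  "scene": "A person cooking in a home kitchen",
  "events": [
    {"start": 0, "end": 4, "description": "person chops onions"},
    {"start": 4, "end": 9, "description": "person adds onions to a pan"},
    {"start": 9, "end": 15, "description": "person stirs the pan"}
  ]
}
"""


def events_at(caption: dict, second: float) -> list[str]:
    """Return descriptions of events whose [start, end) window covers `second`."""
    return [
        event["description"]
        for event in caption["events"]
        if event["start"] <= second < event["end"]
    ]


caption = json.loads(SAMPLE)
print(caption["scene"])
print(events_at(caption, 5))
```

With timestamps in whole seconds, answering "what was happening at second N" reduces to a simple interval lookup like this, which is roughly what an agent or a video index would need.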