Alexander Martin Profile picture
Apr 2 7 tweets 3 min read Read on X
Wish you could get a Wikipedia style article for unfolding events?

Introducing WikiVideo: a new multimodal task and benchmark for Wikipedia-style article generation from multiple videos! Image
WikiVideo was annotated by experts in a multistep annotation process to provide multimodal grounding of articles in our video corpus!

Paper: arxiv.org/abs/2504.00939
Dataset: huggingface.co/datasets/hltco…
Repo: github.com/alexmartin1722… Image
WikiVideo is a challenging task that VideoLLMs can’t do!

It requires inference across multiple videos (avg 8 per topic) and requires models recognize low-level semantic features, like entities, and draw higher-level inferences about the unfolding event.
To tackle this challenge, we present a collaborative, test-time scalable method: Collaborative Article Generation (CAG). CAG involves the collaboration between a VideoLLM and reasoning model to iterate through video content and synthesize it into an article Image
We find that CAG performs better than existing methods across all metrics, but still has a long way to go! There is plenty of future work in efficient and multi-video inference, high-level understanding, and improving video retrieval performance! Image
If you’re interested in article generation from videos and other tasks that require understanding events in videos, checkout our ACL Workshop MAGMAR and our related work!

MAGMAR: nlp.jhu.edu/magmar/

MultiVENT: nlp.jhu.edu/multivent/
This work was done in collaboration w/ colleagues at Johns Hopkins University: Reno Kriz, William Walden, @kesnet50, Hannah Recknor, @EYangTW, Francis Ferraro, and @ben_vandurme

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Alexander Martin

Alexander Martin Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(