Happy to release GOAL ⚽️, a multimodal dataset based on football highlights that includes 1) videos, 2) human transcriptions, and 3) Wikidata-based KB with statistics about players and teams for every match.

arxiv.org/abs/2211.04534

Interested? Read the thread below! #NLProc
[1/7] Previous video benchmarks consider movies or TV series that typically involve scripted interaction between characters instead of visually grounded language. On the other hand, in GOAL we focus on football commentaries because they involve visually grounded language Image
[2/7]: GOAL pushes the boundaries of current multimodal models because it requires the encoding of 1) videos; 2) commentary; 3) KB information. All these elements are essential when generating a sound and coherent commentary for a football video. Image
[3/7]: GOAL contains highlights derived from several football leagues (e.g., Serie A, Premier League, etc.), professional human transcriptions, as well as information about the match derived from Wikidata. Just like real commentators, models should use all these modalities. Image
[4/7]: We set up a multi-faceted evaluation based on several tasks: 1) commentary retrieval, 2) frame reordering, 3) moment retrieval and 4) commentary generation. We use the HERO model for our evaluation and demonstrate that the commentary generation task is really challenging. Image
[5/7]: We analysed both HERO and BART predictions when considering just the language or the multimodal context as a reference. This highlighted the poor visual grounding ability of this model. We also show that it’s important to incorporate the KB information to boost performance Image
[6/7]: GOAL can also serve as resource a resource for visual context-aware speech recognition; multi-modal fact-checking, and multi-modal activity recognition. Additionally, GOAL represents an interesting benchmark for models that directly solve the task using audio information.
[7/7]: This work was the last chapter of my PhD thesis and is a result of a great collaboration with @zedavid @andrea_vanzo @emabastiano @verena_rieser @sinantie @MNikandrou Lu Yu and @shubhamag1992

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Alessandro Suglia

Alessandro Suglia Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(