Ravi Kiran S Profile picture
Feb 3, 2021 β€’ 7 tweets β€’ 4 min read β€’ Read on X
πŸ“’ Introducing SynSE, a language-guided approach for generalized zero shot learning of pose-based action representations! Great effort by @bublaasaur and @divyanshu1709 #actionrecognition

Paper: arxiv.org/abs/2101.11530…
Code: github.com/skelemoa/synse…

πŸ§΅πŸ‘‡ Image
For enabling compositional generalization to novel action-object combinations, the action description is transformed into individual Part-of-Speech based embeddings. Image
The PoS-based embeddings are aligned with action sequence embedding via a VAE-based generative space. This alignment is optimized using within and cross modality constraints. Image
The default ZSL paradigm is biased towards seen classes. We use the elegant gating approach by Atzmon&co. for Generalized ZSL. Essentially, we learn a binary classifier which distinguishes between seen and unseen class samples. Image
SynSE obtains state of the art ZSL and GZSL performance on the large-scale NTU-RGBD skeleton action dataset. Image
πŸ“£ JPoSE (mwray.github.io/FGAR/), CADA-VAE (github.com/edgarschnfld/C…) which inspired our work. JPoSE: alignment of per-PoS language embedding with visual counterpart but in non-generative setting. CADA-VAE: visuo-lingual alignment in VAE-based setting, but no PoS-awareness.
For details, read our paper arxiv.org/abs/2101.11530… and browse our code, pre-trained models at github.com/skelemoa/synse… 🌟

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with Ravi Kiran S

Ravi Kiran S Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @vikataravi

Sep 8, 2021
Presenting BoundaryNet - a resizing-free approach for high-precision weakly supervised document layout parsing. BoundaryNet will be an ORAL presentation (Oral Session 3) today at @icdar2021 . Project page: ihdia.iiit.ac.in/BoundaryNet/ . Details πŸ‘‡ Image
Precise boundary annotations can be crucial for downstream applications which rely on region-class semantics. Some document collections contain irregular and overlapping region instances. Fully automatic approaches require resizing and often produce suboptimal parsing results. Image
Our semi-automatic approach takes region bounding box as input and predicts boundary polygon as output. Importantly, BoundaryNet can handle variable sized images without any need for resizing. Image
Read 23 tweets
Aug 20, 2021
πŸ“’ In our #ACMMM21 paper, we highlight issues with training and evaluation of π—°π—Ώπ—Όπ˜„π—± π—°π—Όπ˜‚π—»π˜π—Άπ—»π—΄ deep networks. πŸ§΅πŸ‘‡
For far too long, π—°π—Ώπ—Όπ˜„π—± π—°π—Όπ˜‚π—»π˜π—Άπ—»π—΄ works in #CVPR, #AAAI, #ICCV, #NeurIPS have reported only MAE, but not standard deviation.
Looking at MAE and standard deviation from MAE, a very grim picture emerges. E.g. Imagine a SOTA net with MAE 71.7 but deviation is a whopping 376.4 !
Read 17 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(