David W. Romero Profile picture
Jul 5 โ€ข 10 tweets โ€ข 6 min read
VGG, U-Net, TCN, ... CNNs are powerful but must be tailored to specific problems, data-types, -lenghts & -resolutions.

Can we design a single CNN that works well on all these settings?๐Ÿค”Yes! Meet the ๐‚๐‚๐๐, a single CNN that achieves SOTA on several datasets, e.g., LRA!๐Ÿ”ฅ
๐Œ๐š๐ข๐ง ๐ˆ๐๐ž๐š: Architecture changes are needed to model long range dependencies for signals of different length, res & dims, e.g., pooling, depth, kernel sizes.

To solve all tasks with a single CNN, it must model long range deps. at every layer: use Continuous Conv Kernels!
๐‚๐จ๐ง๐ญ๐ข๐ง๐ฎ๐จ๐ฎ๐ฌ ๐‚๐จ๐ง๐ฏ. ๐Š๐ž๐ซ๐ง๐ž๐ฅ๐ฌ allow you to create conv. kernels of arbitrary length and dimensionlity that generalize across resolutions by parameterizing them with a neural net.
๐’๐Ÿ’ ๐ฏ๐ฌ ๐‚๐‚๐๐. Inspired by the powerful S4 model (arxiv.org/abs/2111.00396), we use a variation of their residual blocks, which we call S4 Blocks.

However, in contrast to S4, which only works with 1D signals, CCNNs easily model long range dependencies in ND.
๐‘๐ž๐ฌ๐ฎ๐ฅ๐ญ๐ฌ ๐Ÿ๐ƒ. CCNNs obtain SOTA on several sequential benchmarks, e.g., Long Range Arena, Speech Recognition, 1D img classification, all with a single architecture.

CCNNs are often smaller and simpler than other methods.
๐‘๐ž๐ฌ๐ฎ๐ฅ๐ญ๐ฌ ๐Ÿ๐ƒ. With a single architecture, the CCNN matches & surpasses much deeper CNNs!

๐‹๐จ๐ง๐  ๐‘๐š๐ง๐ ๐ž ๐€๐ซ๐ž๐ง๐š ๐ข๐ง ๐Ÿ๐ƒ. Some LRA tasks are defined on 2D data. Using this info -not possible for other methods, eg S4- CCNNs easily get much getter results faster!
๐‚๐จ๐ง๐œ๐ฅ๐ฎ๐ฌ๐ข๐จ๐ง. The CCNN is a single CNN architecture that works well on several tasks on data with different lenghts, resolutions & dimensionality.

๐…๐ฎ๐ญ๐ฎ๐ซ๐ž ๐–๐จ๐ซ๐ค. We plan to extend our results to 3D data & other tasks, e.g., segmentation, generative modelling.
๐‚๐จ๐ฅ๐ฅ๐š๐›๐จ๐ซ๐š๐ญ๐ข๐จ๐ง๐ฌ. Would you like to collaborate with us on this project? Ping me!๐Ÿ˜We want to have an extensive list of experiments in this project & we are sure we can use your expertise and use case!
๐€๐œ๐ค๐ง๐จ๐ฐ๐ฅ๐ž๐๐ ๐ž๐ฆ๐ž๐ง๐ญ๐ฌ. This is part of my Qualcomm Innovetion Fellowship series. I'd like to thank @QCOMResearch & my mentor @danielewworrall for their support!

Details w.r.t. the fellowship:
qualcomm.com/research/univeโ€ฆ

โ€ข โ€ข โ€ข

Missing some Tweet in this thread? You can try to force a refresh
ใ€€

Keep Current with David W. Romero

David W. Romero Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @davidwromero

Oct 19, 2021
CNNs ask you to predefine the size of conv. kernels. But, isn't it better to let the network decide?

Our new work w/ @rjbruin let CNNs learn full-res. kernels during training, leading to SOTA on several datasets!๐Ÿ˜€

Code & slides here!
w/ @jmtomczak @erikjbekkers & @jan_gemert
๐…๐ฅ๐ž๐ฑ๐‚๐จ๐ง๐ฏ - ๐ฆ๐š๐ข๐ง ๐ข๐๐ž๐š: We model conv. kernels as continuous functions of compact support. This is done by using an MLP to model the kernel and a mask to determine its size.

By making the mask params. learnable, the network can decide how big each kernel should be.
Read 11 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(