Deep learning models for tabular data continue to improve. What are the latest methods and recent progress?
Let’s have a look ↓
1) Wide&Deep jointly trains wide linear models and deep neural networks to combine the benefits of memorization and generalization for real-world recommender systems. The model was productionized and evaluated on Google Play.
2) TaBERT is a pretrained LM that jointly learns representations for natural language sentences and (semi-)structured tables. TaBERT works well for semantic parsing and is trained on a large corpus of 26 million tables and their English contexts.
3) TabTransformer is a deep tabular data modeling architecture for supervised and semi-supervised learning. It is built upon self-attention based Transformers. The model learns robust contextual embeddings to achieve higher prediction accuracy.
4) SAINT is a recent hybrid deep learning approach for tabular data. It performs attention over both rows and columns, and it includes an enhanced embedding method. It outperforms gradient boosting methods like CatBoost on a variety of benchmark tasks.
5) FT-Transformer is a Transformer-based architecture for the tabular domain. The model transforms all features (categorical and numerical) to tokens and runs a stack of Transformer layers over the tokens. It outperforms other DL models on several tasks.
To track the latest deep learning models applied to tabular data, here is an extended list of methods, including associated papers, open-source code, benchmark datasets, and trends.
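A common ingredient of the Transformer-based methods above (TabTransformer, SAINT, FT-Transformer) is turning each tabular feature into a token. Here is a minimal NumPy sketch of FT-Transformer-style feature tokenization — a toy illustration under assumed sizes (2 numerical features, 1 categorical feature with 5 levels, token dimension 8), with random stand-ins for what would be learned parameters, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

d = 8                              # token (embedding) dimension, chosen for illustration
W_num = rng.normal(size=(2, d))    # one embedding vector per numerical feature
b_num = rng.normal(size=(2, d))    # per-feature bias
E_cat = rng.normal(size=(5, d))    # embedding table for the categorical feature
cls = rng.normal(size=(1, d))      # [CLS] token, as used in FT-Transformer

def tokenize(x_num, x_cat):
    """Map one row of tabular features to a sequence of tokens."""
    num_tokens = x_num[:, None] * W_num + b_num      # scale each feature's embedding by its value
    cat_tokens = E_cat[x_cat]                        # look up categorical embeddings
    return np.vstack([cls, num_tokens, cat_tokens])  # (1 + 2 + 1, d)

tokens = tokenize(np.array([0.5, -1.2]), np.array([3]))
print(tokens.shape)  # (4, 8)
```

A stack of standard Transformer layers then runs over this token sequence, and the final [CLS] representation feeds the prediction head.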
Graph neural networks are driving lots of progress in machine learning by extending deep learning approaches to complex graph data and applications.
Let’s take a look at a few methods ↓
1) A Graph Convolutional Network, or GCN, is an approach for semi-supervised learning on graph-structured data. It’s based on an efficient variant of CNNs which operates directly on graphs and is useful for semi-supervised node classification.
2) Diffusion-convolutional neural networks (DCNN) introduce a diffusion-convolution operation to extend CNNs to graph data. This enables learning of diffusion-based representations. It's used as an effective basis for node classification.
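The GCN propagation rule from item 1 is compact enough to sketch in a few lines of NumPy — a toy illustration, not the authors' implementation, with a random weight matrix standing in for learned parameters:

```python
import numpy as np

def gcn_layer(A, H, W):
    """One GCN step: H' = ReLU(D^{-1/2} (A + I) D^{-1/2} H W)."""
    A_hat = A + np.eye(A.shape[0])            # add self-loops
    d = A_hat.sum(axis=1)                     # node degrees
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))    # D^{-1/2}
    A_norm = D_inv_sqrt @ A_hat @ D_inv_sqrt  # symmetric normalization
    return np.maximum(A_norm @ H @ W, 0.0)    # linear transform + ReLU

# Toy graph: 3 nodes in a path 0-1-2
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
H = np.eye(3)                                  # one-hot node features
W = np.random.default_rng(0).normal(size=(3, 4))  # random stand-in for learned weights
H1 = gcn_layer(A, H, W)
print(H1.shape)  # (3, 4)
```

Stacking a couple of these layers and training the weights with a cross-entropy loss on the labeled nodes gives the semi-supervised node classifier described above.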
StyleGAN3 (Alias-Free GAN) proposes architectural changes that suppress aliasing and force the model to implement a more natural hierarchical refinement, which improves its ability to generate video and animation.
In the cinemagraph below, we can see that in StyleGAN2 the texture (e.g., wrinkles and hairs) appears to stick to the screen coordinates. In comparison, StyleGAN3 (right) transforms details coherently:
The following example shows the same issue with StyleGAN2: textural details appear fixed to the screen. With alias-free StyleGAN3, the details transform smoothly along with the rest of the scene.
In a new paper from @wightmanr et al., a traditional ResNet-50 is retrained using a modern training protocol. It achieves a very competitive 80.4% top-1 accuracy on ImageNet without using extra data or distillation.
The paper catalogues the exact training settings to provide a robust baseline for future experiments:
It also reports training costs and inference times on ImageNet classification for other architectures trained with the proposed optimized ResNet-50 training procedure:
🚨 Newsletter Issue #3. Featuring a new state-of-the-art on ImageNet, a trillion-parameter language model, 10 applications of transformers you didn’t know about, and much more! Read on below:
⏪ Papers with Code: Year in Review. We’re ending the year by taking a look back at the top trending papers, libraries and benchmarks for 2020. Read on below!