Tweet

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @tunguz

Bojan Tunguz

@tunguz

Oct 8

This past week I came across another paper that purports to get the SOTA for NNs for tabular data. Due to the extreme penchant for exaggeration in this community, I have given up on checking most of these claims, but decided to take a look at this particular work.

1/6

I decided to check how does XGBoost *really* perform on the datasets used in the paper, and the results were not pretty.

2/6

The main takeaway: for all three datasets used in the paper, the reported performance of XGBoost was widely inaccurate and the real performance was much better than their best results.

3/6

Read 6 tweets

Bojan Tunguz

@tunguz

Oct 1

@NVIDIA

This week @NVIDIA open sourced the 3D object generation AI model, GET3D. GET3D is a generative model of high quality 3D textured shapes learned from images.

1/4

Trained using only 2D images, GET3D generates 3D shapes with high-fidelity textures and complex geometric details.

2/4

These 3D objects are created in the same format used by popular graphics software applications, allowing users to immediately import their shapes into 3D renderers and game engines for further editing.

3/4

Read 4 tweets

Bojan Tunguz

@tunguz

Sep 29

I have just done something really cool - I've managed to *train* XGBoost in browser completely within an HTML file! This has been possible thanks to the PyScript project that allows running Python inside of HTML, similar to how JavaScript works.

trainxgb.com

1/5

The example below is very simple - the script loads the small Iris dataset from sklearn. With a slider you are able to adjust the number of XGBoost trees, and the script will train different XGBoost models accordingly and print out accuracy.

2/5

PyScript is still in very early stages of development. Getting all the relevant components to work together is still tricky, and there are not many detailed tutorials. Hence, this example is *very* rudimentary. I'll try to make it more powerful and snazzy does the road.

3/5

Read 5 tweets

Bojan Tunguz

@tunguz

Sep 20

https://twitter.com/javisantana/status/1572296064211861507

All right, here is one trick for using XGBoost for *data analysis*.

1/5

https://twitter.com/javisantana/status/1572296064211861507

First, you create a simple model with XGBoost. It doesn't have to be fancy, or even too accurate, it's just for reference purposes. Use that model to calculate the Shapley values for your training set. Here is an example:

kaggle.com/code/tunguz/tp…

2/5

Next, use those Shapley values for some simple clustering, dimensionality reduction and visualization:

kaggle.com/code/tunguz/tp…

3/5

Read 5 tweets

Bojan Tunguz

@tunguz

Sep 19

NVIDIA GTC starts today! There are tons of exciting topics and webinars covered. This year again the whole conference is online and free, so go and register if you have not done so already.

Here are a few special highlight sessions:

1/4

GTC 2022 Keynote - September: lnkd.in/gYNqxsnr

How CUDA Programming Works: lnkd.in/gKmdjZub

Building the Future of Work with AI-powered Digital Humans: lnkd.in/gXJWk6vz

Building Future-Ready Intelligence for Cars: lnkd.in/gJ9BJMGM

2/4

A Deep Dive into RAPIDS for Accelerated Data Science and Data Engineering: lnkd.in/gM7mquwc

A Deep Dive into the Latest HPC Software: lnkd.in/ghXxGmar

Cross-Framework Model Evaluation and Accelerated Training with NVIDIA Merlin: lnkd.in/gXUEdajH

3/4

Read 4 tweets

Bojan Tunguz

@tunguz

Aug 5

@DeepMind

A very good paper I came across this morning by the @DeepMind researchers. For the past five years Transformers have been one of the most dominant approaches to Deep Learning problems, especially in the #NLP domain.

1/5

However, despite many interesting papers on the topic, and lots of good open code, there has been a noticeable lack of *formal* definition of what transformed are, especially on the level of pseudocode.

2/5

This paper aims to rectify that. It provides pseudocode for almost all major Transformer architectures, including training algorithms.

3/5

Read 5 tweets

Share this page!

Bojan Tunguz

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @tunguz

Bojan Tunguz

Bojan Tunguz

Bojan Tunguz

Bojan Tunguz

Bojan Tunguz

Bojan Tunguz

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!