Tweet

@huggingface

@numpy

@_ScottCondron

@huggingface

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @TheZachMueller

Zach Mueller

@TheZachMueller

Jul 6

New article on #python decorators is out! Specifically this shows you how decorators are written, what they do, and the power you can do with them. I even show an example of when you'd use the strange "nonlocal" 1/3
muellerzr.github.io/fastblog/pytho…

Context manager sequel should be out in the next few days. This one will take a bit longer because in some cases decorators are context managers, and they also have a few more rules so it'll take some time for me to get that how I want it :) 2/3

The other aim with these two is to give you easy-to-view boilerplate examples of decorators and context managers to play with, and explain how they work.

Why? Because I've been wanting those for many months now, and could really use them myself for reference 3/3

Read 4 tweets

Zach Mueller

@TheZachMueller

Jul 5

@huggingface

Listened to everyone's response with the new `no_sync` wrapper in @huggingface's Accelerate and I took it to heart.

Here's our new gradient accumulation context manager available in Accelerate dev now! A thread on design choices and the struggles 1/4🧵

@huggingface

@huggingface The goal with Accelerate is abstract as very little as we possibly can for you to perform what you want on any training device (CPU, multi-gpu, etc). As a result, it came to a decision of "how can we simplify gradient accumulation, without hiding anything?" 2/4

@huggingface

@huggingface A compromise was found, where instead we focus on deleting your duplicated code that would come from performing gradient accumulation and also help with the loss as well. It doesn't reduce the clarity of the code, and lets it be consistent across platforms 3/4

Read 4 tweets

Zach Mueller

@TheZachMueller

May 19

@Docker

A few tips and tricks I learned about @Docker today and keeping image sizes small 🧵

Use a multi-stage approach to keep the resulting image lightweight by pre-compiling all of the installs and then just bringing in those installed files to the end image. I could save 500mbs + in some cases by doing this

The second trick I learned (which should be an obvious one!) is to install the direct torch wheel based on what you're using. For example, if you're using CPU but don't specify the CPU wheel, your docker image can be 2gb when in reality it only needs to be 800mb's or so!

Read 4 tweets

Zach Mueller

@TheZachMueller

Feb 6

@fastdotai

Tonight we're talking about @fastdotai's `tabular_learner`, and more specifically the TabularModel 🧵

The role of the `tabular_learner` is to mostly build a `TabularModel` for your data. This tabular model is a series of embedding matrices and some batch normalization, before going through a few rounds of LinBnDrop, as shown below 2/

@fastdotai

What makes this model different from all other models that @fastdotai has is that it splits our inputs into **two** separate groups, the categorical and continuous, meaning the model expects a tuple:

3/

Read 5 tweets

Zach Mueller

@TheZachMueller

Feb 4

@fastdotai

What is @fastdotai's `cnn_learner`, and what magic does it do? 🧵

@fastdotai

The `cnn_learner` builds a fastai Learner designed for specifically vision transfer learning, using some of the best practical practices.

We start with a baseline `arch`, such as a resnet34, cut off the last layer, and introduce a @fastdotai head (such as below) for our task 2/

Along with this, we freeze the backbone of the architecture (which means set the params to not trainable) and only train the head (that Custom Head) of the model. 3/

Read 5 tweets

Zach Mueller

@TheZachMueller

Nov 13, 2021

https://twitter.com/TheZachMueller/status/1459315548160933889

Gave it a second read through (I had the opportunity to read the first draft a while ago), below you can find a thread of my review, and some bits I enjoyed from it:

https://twitter.com/TheZachMueller/status/1459315548160933889

@fastdotai

This book is an excellent companion to something like the @fastdotai book, course, or Walk with fastai. It explores some areas differently than what is presented in the course, which can perhaps help folks get a better grasp of some concepts. 1/

This is a small detail, but I really liked the fact that each dataset referenced in the book HAD an actual reference. It was small, I'm not sure how commonplace that is normally, but it was something that surprised me (in a good way) 2/

Read 7 tweets

Share this page!

Zach Mueller

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @TheZachMueller

Zach Mueller

Zach Mueller

Zach Mueller

Zach Mueller

Zach Mueller

Zach Mueller

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?