Let's talk about setting up our Python/CUDA environment!
Our goals:
- Easily specify exact Python and CUDA versions
- Humans should not be responsible for finding mutually-compatible package versions
- Production and dev requirements should be separate
1/N
Here's a good way to achieve these goals:
- Use `conda` to install Python/CUDA as specified in `environment.yml`
- Use `pip-tools` to lock in mutually compatible versions from `requirements/prod.in` and `requirements/dev.in`
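As a sketch of what that layout might look like (the project name and pinned versions here are illustrative, not prescriptive):

```yaml
# environment.yml — conda owns Python + CUDA; everything else goes through pip-tools
name: my-project        # illustrative name
channels:
  - defaults
dependencies:
  - python=3.10         # pin the exact Python you want
  - cudatoolkit=11.8    # pin the exact CUDA toolkit you want
  - pip                 # pip itself, so pip-tools can manage the rest
```

With `pip-tools` installed, `pip-compile requirements/prod.in` resolves the loose requirements in `prod.in` into a fully pinned `requirements/prod.txt`, and the same for `dev.in` — so no human ever has to find mutually-compatible versions by hand.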
Dagster describes itself as a "data orchestrator for machine learning, analytics, and ETL"
Let's break that down 👇
2/ When you work with real-world data, your pipelines can get complex.
E.g., to train a language model on Twitter data, you might:
- Download data
- Strip out offensive tweets
- Preprocess the data
- Fit models
- Summarize training performance
- Deploy the best model to production
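As a hypothetical sketch, the steps above chain together like a plain Python pipeline (all function names and data here are illustrative stand-ins, not Dagster's actual API — an orchestrator's job is to manage exactly this kind of dependency chain for you):

```python
# Illustrative stand-ins for each pipeline step; a real pipeline would
# hit Twitter's API, run a toxicity classifier, train real models, etc.

def download_data():
    # Pretend these are raw tweets
    return ["great model!", "offensive tweet", "hello world"]

def strip_offensive(tweets):
    # Toy filter; a real step would use a trained classifier
    return [t for t in tweets if "offensive" not in t]

def preprocess(tweets):
    # Lowercase + whitespace tokenization
    return [t.lower().split() for t in tweets]

def fit_models(examples):
    # Pretend we trained two candidates and got validation scores
    return {"model_a": 0.8, "model_b": 0.9}

def summarize(scores):
    # Pick the best-scoring candidate
    best = max(scores, key=scores.get)
    return best, scores[best]

def deploy(model_name):
    return f"deployed {model_name}"

# Run the whole chain end to end
raw = download_data()
clean = strip_offensive(raw)
examples = preprocess(clean)
scores = fit_models(examples)
best, _ = summarize(scores)
print(deploy(best))  # -> deployed model_b
```

Each step consumes the previous step's output — which is exactly the dependency graph an orchestrator like Dagster tracks, schedules, and re-runs for you.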
3/ In production settings, pipelines can be even more complicated.
All well and good, but running those steps manually every time you update your model is painful, resource-intensive, and hard to scale.
And what happens if you have hundreds of these pipelines you need to manage?