Lightning AI ⚡️
May 23 · 3 tweets · 2 min read
Last week, we discussed techniques to speed up the training of large language models 🔥💨

How about saving memory during inference? 🧠💾 Check out int8 & int4 quantization, which are supported in Lit-LLaMA 👉github.com/Lightning-AI/l…

🧵1/3

#LLMs #ML #DeepLearning
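
To make the memory savings concrete, here's a back-of-the-envelope sketch for the weights of a hypothetical 7B-parameter model (it ignores activations, the KV cache, and quantization metadata such as scales):

```python
# Rough weight-memory estimate for a hypothetical 7B-parameter model.
# Ignores activations, the KV cache, and per-group scales/zero-points.
n_params = 7e9

for name, bits in [("float16", 16), ("int8", 8), ("int4", 4)]:
    gib = n_params * bits / 8 / 1024**3
    print(f"{name}: ~{gib:.1f} GiB")
# prints roughly 13.0, 6.5, and 3.3 GiB respectively
```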
How does int8 quantization work? 🤔

It's a two-part procedure:
1) 8-bit (vector-wise absmax) quantization for the bulk of the matrix multiplications
2) 16-bit matmuls for the outlier feature dimensions

For details, check out the paper "LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale": arxiv.org/abs/2208.07339

🧵2/3
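
As a rough illustration (not the actual kernels Lit-LLaMA relies on in practice, e.g. bitsandbytes), here is a minimal PyTorch sketch of that decomposition: input columns whose magnitude exceeds a threshold are kept in 16-bit, and everything else goes through vector-wise absmax int8 quantization. The function name, the threshold value, and the int32 emulation of the int8 matmul are simplifications for illustration.

```python
import torch

def int8_mixed_matmul(x: torch.Tensor, w: torch.Tensor, threshold: float = 6.0):
    """Sketch of an LLM.int8()-style matmul (x @ w.T) with high-precision outliers.

    x: (tokens, in_features) activations, w: (out_features, in_features) weights.
    """
    # 1) Outlier feature dimensions: input columns with unusually large values.
    outliers = x.abs().amax(dim=0) > threshold

    # 2) Outlier part: ordinary full-precision matmul over the few outlier columns.
    out_hi = x[:, outliers] @ w[:, outliers].T

    # 3) Regular part: vector-wise absmax int8 quantization; the int8 matmul is
    #    emulated here with int32 tensors (real kernels use dedicated int8 GEMM).
    xs, ws = x[:, ~outliers], w[:, ~outliers]
    sx = (xs.abs().amax(dim=1, keepdim=True) / 127.0).clamp_min(1e-8)  # per token
    sw = (ws.abs().amax(dim=1, keepdim=True) / 127.0).clamp_min(1e-8)  # per out-row
    xq = (xs / sx).round().clamp(-127, 127).to(torch.int32)
    wq = (ws / sw).round().clamp(-127, 127).to(torch.int32)
    out_lo = (xq @ wq.T).to(x.dtype) * (sx * sw.T)

    return out_hi + out_lo
```

For example, `int8_mixed_matmul(torch.randn(4, 64), torch.randn(32, 64))` should closely match the full-precision `x @ w.T`.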
And how about int4? 🤔

It's a one-shot weight quantization method based on approximate second-order information⚙️📉

For more details, see "GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers" 📚🔍 arxiv.org/abs/2210.17323

🧵3/3
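
For intuition, here is a heavily simplified sketch of the GPTQ idea: quantize one weight column at a time and use inverse-Hessian information to spread the rounding error over the columns that have not been quantized yet. It omits blocking, lazy batched updates, grouping, calibration-data collection, and the actual int4 bit-packing, and all names are illustrative rather than taken from the paper or Lit-LLaMA.

```python
import torch

def gptq_like_quantize(W: torch.Tensor, H: torch.Tensor, bits: int = 4):
    """Heavily simplified GPTQ-style one-shot weight quantization.

    W: (out_features, in_features) float weights.
    H: (in_features, in_features) Hessian approximation, e.g. 2 * X @ X.T
       accumulated from calibration inputs, plus a small damping term.
    """
    W = W.clone()
    Q = torch.zeros_like(W)
    qmax = 2 ** (bits - 1) - 1                  # 7 for symmetric int4
    scale = W.abs().amax(dim=1) / qmax          # per-output-row scale

    # Upper Cholesky factor of the inverse Hessian, as in the GPTQ paper.
    Hinv = torch.linalg.cholesky(
        torch.cholesky_inverse(torch.linalg.cholesky(H)), upper=True
    )

    for i in range(W.shape[1]):
        # Round-to-nearest quantization of column i.
        w = W[:, i]
        Q[:, i] = (w / scale).round().clamp(-qmax - 1, qmax) * scale

        # Second-order correction: distribute the rounding error of column i
        # over the not-yet-quantized columns, weighted by the inverse Hessian.
        err = (w - Q[:, i]) / Hinv[i, i]
        W[:, i:] -= err.unsqueeze(1) * Hinv[i, i:].unsqueeze(0)

    return Q, scale
```

A real GPTQ implementation (like the one Lit-LLaMA's int4 path is based on) also groups scales and packs the quantized weights into 4-bit storage; this sketch only shows the error-compensation loop.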

More from @LightningAI

Jan 19
Train a 20-billion parameter GPT model for text prediction on 3 GPU nodes with Lightning. 🤯

The entire training process is contained in a simple script that you can scan, read, and understand in just a few seconds.✅

🧵(1/4)
You can customize any part of this process: the dataset, model, hyperparameters, and training strategy.

Also, you can easily swap out the 🛠️hardware you're using. With a single flag, you can choose to run on 3 nodes (as in this example) or more, to fit your model.

🧵(2/4)
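
As an illustration of what that single flag looks like in code, here is a minimal multi-node sketch with the Lightning Trainer (assuming a recent Lightning 2.x release; the tiny model, random data, and FSDP strategy choice are placeholders, not the 20B script from the thread):

```python
import torch
import lightning as L

class TinyGPT(L.LightningModule):
    """Placeholder model; stands in for the real 20B GPT from the thread."""
    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(128, 128)

    def training_step(self, batch, batch_idx):
        return self.layer(batch).pow(2).mean()

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters())

train_loader = torch.utils.data.DataLoader(torch.randn(1024, 128), batch_size=32)

trainer = L.Trainer(
    num_nodes=3,            # the "single flag": how many machines to run on
    devices=8,              # GPUs per node
    strategy="fsdp",        # shard large models across all GPUs
    precision="16-mixed",
    max_epochs=1,
)
trainer.fit(TinyGPT(), train_loader)
```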
The Lightning Platform also enables you to perform multi-node training from scratch without the hassle of setting up infrastructure or worrying about managing multi-node communication. 🤩

🧵(3/4)
