Benjamin Bossan
Apr 14 · 6 tweets · 3 min read
Today, we released PEFT v0.19.0 and it's a big one. Not only did we add 9 new PEFT methods, the release also contains a bunch of improvements to make PEFT more useful. Check the thread for details:
The release contains new functions to convert non-LoRA weights into LoRA weights. This allows them to be used in packages like Diffusers and vLLM that only support LoRA. Find more details here: huggingface.co/docs/peft/main…
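To give a rough idea of what such a conversion can look like, here is a minimal NumPy sketch that approximates a dense weight update with LoRA factors via truncated SVD. This is an illustration only, not the PEFT API: the helper `delta_to_lora` and the toy matrices are made up for this example, and the actual PEFT conversion functions may work differently.

```python
import numpy as np


def delta_to_lora(delta_w: np.ndarray, rank: int):
    """Approximate a dense weight update by LoRA factors so that lora_B @ lora_A ≈ delta_w.

    Hypothetical helper for illustration; not part of PEFT.
    """
    u, s, vt = np.linalg.svd(delta_w, full_matrices=False)
    sqrt_s = np.sqrt(s[:rank])
    # Split each singular value evenly between the two factors.
    lora_B = u[:, :rank] * sqrt_s          # shape (out_features, rank)
    lora_A = sqrt_s[:, None] * vt[:rank]   # shape (rank, in_features)
    return lora_A, lora_B


rng = np.random.default_rng(0)
# Toy weight update that is exactly rank 4, so a rank-4 LoRA can reproduce it.
delta_w = rng.normal(size=(16, 4)) @ rng.normal(size=(4, 32))

lora_A, lora_B = delta_to_lora(delta_w, rank=4)
rel_err = np.linalg.norm(delta_w - lora_B @ lora_A) / np.linalg.norm(delta_w)
print(lora_A.shape, lora_B.shape, rel_err)
```

For updates that are not exactly low rank, the chosen rank trades adapter size against approximation error.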
LoRA fine-tuning can introduce so-called "intruder dimensions", which contribute to forgetting (huggingface.co/papers/2410.21…). We now have a utility function to reduce intruder dimensions, `reduce_intruder_dimension`. Call it on a fine-tuned LoRA model to reduce forgetting.
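For intuition, an intruder dimension can be thought of as a high-ranking singular vector of the fine-tuned weight that has low cosine similarity to every singular vector of the pre-trained weight. The NumPy sketch below only detects such vectors on a toy example. It does not use or reproduce `reduce_intruder_dimension`; the helper name `find_intruder_dimensions`, the threshold `eps=0.5`, and the toy matrices are assumptions made for illustration.

```python
import numpy as np


def find_intruder_dimensions(w_pre, w_tuned, top_k=4, eps=0.5):
    """Indices of the top_k left singular vectors of w_tuned whose largest
    |cosine similarity| with any left singular vector of w_pre is below eps.

    Hypothetical helper for illustration; not part of PEFT.
    """
    u_pre, _, _ = np.linalg.svd(w_pre, full_matrices=False)
    u_tuned, _, _ = np.linalg.svd(w_tuned, full_matrices=False)
    # Singular vectors have unit norm, so dot products are cosine similarities.
    cos = np.abs(u_pre.T @ u_tuned[:, :top_k])
    max_cos = cos.max(axis=0)
    return [int(j) for j in np.flatnonzero(max_cos < eps)]


# Toy pre-trained weight with singular values 4, 3, 2, 1.
w_pre = np.zeros((6, 4))
w_pre[np.arange(4), np.arange(4)] = [4.0, 3.0, 2.0, 1.0]

# Toy rank-1 LoRA update that writes into an output direction unused by w_pre.
lora_B = np.zeros((6, 1))
lora_B[5, 0] = 10.0
lora_A = np.zeros((1, 4))
lora_A[0, 0] = 1.0
w_tuned = w_pre + lora_B @ lora_A

intruders = find_intruder_dimensions(w_pre, w_tuned)
print(intruders)  # → [0]
```

Here the top singular vector of the fine-tuned weight points mostly along the new direction, so it is flagged, while the remaining singular vectors match the pre-trained ones.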
A selection of improvements to LoRA:

- support for Tensor Parallelism
- Tensor Engine quantization
- better handling of tied weights
- support for fp8 dtypes
- LoRA-GA initialization by @sambhavdixitpro

Moreover, for prefix tuning, we provide better initialization options.
We also added 9 new PEFT methods. There is not enough room here to describe them in detail, but here is a list:

- GraLoRA by github.com/yeonjoon-jung01
- BD-LoRA by github.com/Conzel
- Cartridges and TinyLoRA by @krasul
- PVeRA by @leofillioux
- PSOFT by github.com/fei407
- Lily and PEANut by @tmux_1
- AdaMSS by github.com/LonglongaaaGo

Find a short description of these PEFT methods and the full release notes here: github.com/huggingface/pe…
