Wasi Ahmad Profile picture
May 4 4 tweets 2 min read Twitter logo Read on Twitter
#ACL2023NLP
Do causal language models (CLMs) yield good representations with good isotropy and discrimination?

The answer is not always! To address the issue, our ACL2023 paper (arxiv.org/pdf/2210.01185…) proposes ContraCLM.

Joint work with @DejiaoZhang @nihalj_
We show that CodeGen (350M to 16B) pretrained on source code, and text-based CLMs (smaller than GPT2-Large) generated representations suffer from anisotropy and poor discrimination. Image
We show that ContraCLM enhances both isotropy and discrimination, regardless of whether the original CLMs suffer from the degenerated representations. Image
ContraCLM attains 44% relative improvement on the Semantic Textual Similarity tasks and 34% on Code-to-Code Search tasks. Furthermore, ContraCLM boosts source code generation capability with a 9% relative improvement in execution accuracy on the HumanEval benchmark.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Wasi Ahmad

Wasi Ahmad Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(