Andy Chen
Founder @ https://t.co/menIudcKI0 Boston native, @BrownUniversity CS, former ML tech lead @Meta @RealityLabs
Jan 7, 2023
1/ GLM-130B outperforms OpenAI's GPT-3 175B and Google's PaLM 540B on critical benchmarks.

AND it's open source, which means you can run this model on your own machine, for free.

2/ GLM-130B is instruction-finetuned, leverages Chinchilla scaling laws, and has bells and whistles like 4-bit quantization and bidirectional attention.

With 4-bit quantization, the model can run on a single 80 GB A100 or a consumer GPU rig.
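
A rough back-of-the-envelope check on why 4-bit quantization matters here. This sketch counts weight storage only, assuming exactly 130 billion parameters and ignoring activations, quantization scales, and framework overhead, so real memory use will be higher. The function name `weight_memory_gb` is just an illustrative helper, not part of any GLM-130B tooling.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    bytes_total = num_params * bits_per_weight / 8  # 8 bits per byte
    return bytes_total / 1e9


# Assumption: treat GLM-130B as exactly 130e9 parameters.
GLM_130B_PARAMS = 130e9

for bits in (16, 8, 4):
    gb = weight_memory_gb(GLM_130B_PARAMS, bits)
    print(f"{bits}-bit weights: ~{gb:.0f} GB")
```

Under these assumptions the weights alone take about 260 GB at 16-bit, 130 GB at 8-bit, and 65 GB at 4-bit, which is why only the 4-bit version comes in under the 80 GB of a single A100.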