building backend ai engineer → https://t.co/02SMuhZNpF, previously @tiktok_us
@devv_ai
Mar 8, 2024 • 11 tweets • 3 min read
I just learned about an exciting development from Answer.AI (by @jeremyphoward): they just released an open-source system that can train a 70B large language model on a regular desktop computer with two or more standard gaming GPUs (RTX 3090 or 4090).
This could be a game-changer!
1/n
The system combines FSDP (Fully Sharded Data Parallel) and QLoRA (Quantized Low-Rank Adaptation) techniques, allowing users to train large models on consumer-grade hardware.
It's the result of a collaboration between Answer.AI, Tim Dettmers (U Washington), and Hugging Face.
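A rough back-of-the-envelope sketch of why the combination matters, looking at weight memory only. The assumptions here are mine, not from the thread: 24 GB of memory per RTX 3090/4090, 4-bit storage for the quantized base weights (as in QLoRA), and no accounting for activations, LoRA adapters, optimizer state, or quantization overhead. The helper name `weight_memory_gb` is hypothetical.

```python
# Weight-only memory estimate for a 70B-parameter model.
# Assumed, not stated in the thread: 24 GB per GPU, 1 GB = 1e9 bytes,
# and everything except the base weights is ignored.

PARAMS = 70e9        # 70B parameters
GPU_MEM_GB = 24      # assumed memory of one RTX 3090 / 4090
NUM_GPUS = 2         # "two or more" gaming GPUs; take the minimum


def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Gigabytes needed to store the weights alone at a given precision."""
    return num_params * bits_per_param / 8 / 1e9


fp16_gb = weight_memory_gb(PARAMS, 16)   # unquantized 16-bit weights
int4_gb = weight_memory_gb(PARAMS, 4)    # 4-bit quantized weights (QLoRA-style)
total_gpu_gb = GPU_MEM_GB * NUM_GPUS     # combined memory when sharded across GPUs

fits_fp16 = fp16_gb <= total_gpu_gb
fits_int4_one_gpu = int4_gb <= GPU_MEM_GB
fits_int4_sharded = int4_gb <= total_gpu_gb

print(f"16-bit weights: {fp16_gb:.0f} GB, fits on {NUM_GPUS} GPUs: {fits_fp16}")
print(f"4-bit weights:  {int4_gb:.0f} GB, fits on 1 GPU: {fits_int4_one_gpu}")
print(f"4-bit weights sharded over {NUM_GPUS} GPUs ({total_gpu_gb} GB): {fits_int4_sharded}")
```

Under these assumptions, quantization alone does not fit the weights on one card, and sharding alone does not fit 16-bit weights on two; only the two together leave headroom, which is the gap FSDP plus QLoRA is meant to close.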
- a thread -
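1/ For the final project of my freshman C programming course, while everyone else was writing some "xx management system", I wrote an interpreter. Along the way I read the famous SICP (Structure and Interpretation of Computer Programs) and Wang Yin's interpreter tutorial, worked through part of CS 61A, skimmed a few other books on compiler principles, and picked up Emacs while I was at it. In the end I wrote a very simple Lisp interpreter (writing it in C was exhausting).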