Alex Ratner Profile picture
@SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.
Apr 2, 2023 10 tweets 4 min read
1/ Prediction: Everyone will soon be using foundation models (FMs) like GPT-4.

However, they'll be using FMs trained on their own data & workloads:

"GPT-You", not GPT-X

Tl/dr:
- Closed APIs aren't defensible
- The durable moat is data
- The last mile generates the real value 2/ *Closed APIs aren't defensible*

- Recent examples like @StanfordCRFM Alpaca tinyurl.com/yc78bnct shows that cloning closed API-based FMs like ChatGPT can be done for a few $100 on top of small OSS base FMs (e.g. here, fine-tuning LLaMa 7B via exs from the ChatGPT API).
Dec 18, 2022 8 tweets 2 min read
1/ 2023 AI prediction: the gap between generative and predictive AI will widen.

Despite product & business model innovation in generative AI, real-world ROI will remain concentrated around predictive AI- leading to frustrated expectations.

This gap will all come down to data... 2/ First, basic definitions:

- Generative (ie. LLMs / foundation models): Goal is to output a data point (e.g. an image)

- Predictive (or "discriminative"): Goal is to label a data point (e.g. predict whether an image contains offensive content).