I just published my big Medium article about GPT. This was a labor of love & hate that I have been writing for a while. It's got a collection of examples of GPT doing funny things, which for those who don't want to deal with a 40-min read, I'll put here 🧵 medium.com/@colin.fraser/…
It also asks and tries to answer
- What are language models?
- What happens if gpt passes a bar exam?
- Is scale all you need?
- ChatGPT is based on GPT... what does that mean, exactly?
- What are fine tuning and RLHF?
- How exactly do teams of contractors contribute to GPT?
The Dumb Monty Hall Problem
The actual Monty Hall Problem
Acrostics (I think ELISTHAR is a very pretty name for a girl)
ChatGPT is a little coy about its ideas on gender roles (I explain why in the piece), but if you're clever enough (not that clever) you can trick it into telling you what it really thinks.
An absolutely bizarre response that left me confused and baffled
"Let's think step by step" doesn't always get you a better answer. The first response is right and the second is wrong. Your move, prompt engineers.
Rin Tin Tin IV, the dog that swam across the Atlantic Ocean in 1970. RIP.
Math
My overall thesis here is that none of this is very surprising; in general we should not expect the output of an LLM to correspond to the truth in any reliable way. They are bullshit emitters, in the "On Bullshit" sense. Every one of them.
Ventures that rely on an LLM to produce anything other than bullshit are doomed to fail. We've already seen it, and we'll see it again. futurism.com/cnet-bankrate-…
This is an example of such a venture. Making things up is not a bug to squash, it's the single defining feature of a large language model. vice.com/en/article/wxn…
I am tired of the extreme charity afforded to the Cerebral Valley guys. Everything has one minor bug that's fixed in the next version that is coming very soon. In the mean time we are supposed to be in awe of the machine that adds 175 billion numbers together to output 2 + 2 = 5.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
My most unpopular data opinion is that alerts for metrics are usually useless and bad, and you're much better off scheduling regular time to look at a dashboard with your human eyes. Everyone always gets mad at me when I say this.
One of two things **always** happens. Either the alert is too sensitive and becomes spam, or the alert is not sensitive enough and misses important stuff. It's hard (impossible, even!) to find the sweet spot where the alert emails you if and only if an important thing happens.
It's also impossible to build an alert that captures every type of anomaly that you want to be made aware of.
Someone on here (I forget who I'm sorry) linked to this paper and it derives this statistical identity that is completely mind blowing and I want to tweet about it.
bias = data quality × data quantity × problem difficulty
I read a really bad paper yesterday and got pissed off and tweeted about it, but I read a really good paper today and got happy and so I'm going to tweet about that
I'm really cookin' up a thread on this one. It has applications to the Twitter Bot Measurement Debate so buckle up
I reached the limit of how many tweets you're allowed to put in a thread at once
Setting aside that the premise is horrifying it's just absolutely bad worthless research. Basically:
We built a multi-class classifier to classify users into one of the three categories of LGBT person that we made up: 1. person 2. organization 3. sexual worker/porn
We find that adding more features increases the performance of the classifier, until it doesn't anymore
"If FB has a dial that can turn hateful content down, why doesn't it turn it down all the time?" is a good and important question. The answer is exactly the precision recall tradeoff en.wikipedia.org/wiki/Receiver_…
You can catch all hate speech by deleting every post on Facebook, but you'll have a lot of false positives. You can eliminate all false positives by never deleting a post, but you'll miss all the hate speech. Facebook has to choose a point along that continuum.
Turning down the hateful content knob is exactly the same as turning up the false positives knob. And depending on where you are on that curve, it can be a bad bargain—the number of false positives that get created might be a lot higher than the hateful posts that get detected