Joe Hellerstein
Berkeley CS Prof, focused on data and computation.

Dec 9, 2022, 11 tweets

OK people, I did the thing. #ChatGPT can hallucinate relational databases. With full credit to Jonas Degrave's creative prompt for hallucinating a Linux prompt.

I had to push it harder to generate some long-tail real-world data, but it got there more or less. In this case, I was looking for high-end handmade trumpets. Took a few tries.

These are real trumpet manufacturers!

And finally we're getting the kind of stuff I was looking for. David Monette is the most famous independent trumpet craftsman. Warburton is another. Kanstul and Edwards also arguably qualify in some fashion. The rest are bigger brands. This is some long-tail data!

The gendered prompt was intentional, actually -- brass instrument manufacturing is a male-dominated field and I figured I'd play to the empirical biases to get the data I was looking for.

It's not so good at composing complicated SQL queries on this database though.

If I write a (buggy) version of my query it produces something that looks sensible (tho the prices are about 4x too low). The query has both logic and syntax errors, but ChatGPT figured out more or less what I meant.

Prices are ~40x too low. (Man, I am so unreliable, I don't think I'll ever achieve GI.)
