People are confused (& misled) re: this tweet, so in service of procrastination I'll break it down on here.
Models need to be really really large (right now) to capture enough of language that their output sounds authentic. Hence large language models (LLMs).
"Large" means 2 things: a large number of parameters & a large dataset. To get the latter, you have to do a giant crawl of some massive corpus of text, probably on the web.
So you're going to soak up a TON of text, & because the net is cast so wide, that dataset will naturally reflect the status quo use of language. Almost definitionally "status quo" as the size approaches infinity. Well, the status quo is "problematic", right? So people want the ability to then sanitize that output so that it is not problematic. They want to steer it.
So these LLMs are expensive to build & operate, & the output is one-size-fits-all & reflects the status quo (or what Stochastic Parrots calls "the hegemonic worldview"). If you want to "fix" the output, your only option right now is to just edit what you get to clean it up.
But activists don't want to sanitize the output of LLMs just for themselves or their communities — they want to sanitize it for you, too. For everyone. Sanitize all the things. Right now tho, we don't have this ability really. But when we do, who gets to use it? B/c that's power.
What I'm saying in this tweet is that when we can steer the output of an LLM with reference to, say, a style guide in order to sanitize it, we should not do that. You don't want the Southern Baptist Convention to have that power, & I don't want the Human Rights Campaign to have it.
I don't want any set of activists with a niche, minority viewpoint to "edit" the handful of large, expensive LLMs that we'll all be using. I'd rather the output just continue to reflect the linguistic status quo, & have communities do their own edits on it.
• • •
After a walk, I have rethought my reaction to this paper.
Reading it was what prompted my "maybe I'll give up & become a troll" tweet. But I'm now encouraged by it.
First, the depressing aspect: a whole section of this paper traffics in ancient techbro stereotypes, plus one new one they've invented themselves. These stereotypes are snidely offered up, basically unsupported, as if we ALL KNOW, AMIRITE? And this is an ACM paper!
So I'm like, this sneering collection of techbro tropes was published by the ACM, which means the citadel has fallen. It's all over. Stick a fork in American tech leadership. But then I thought about their "Ethics Unicorn" archetype, & it hit me: they're eating their own.
So I read this paper, & there's an entire subsection dedicated to a supposed new (problematic) type of figure — the Ethics Unicorn. But no example of such a person, or even of such thinking, is given. I have never encountered this. It seems like folklore.
I mean, maybe I am not in trendy enough tech circles? Maybe some of the engineers at the NYT who have a lot of opinions about what the edit staff should & shouldn't be publishing would fall into this category?
At any rate, the lone citation there is to an explainer on the "full-stack unicorn developer." This Ethics Unicorn character is left to the imagination, I guess.
I think nobody really realizes that this particular fight is coming, not even VCs. At some point, we will all fight on here over which party gets to be the editor whose values & linguistic quirks are reflected in the language the machines use to talk at us.
Right now, the machines are just parroting whatever giant, unruly dataset they've been fed. But soon, they will be side-loaded with a small sample of additional context (e.g. a style/usage reference), so that they can tweak their output with reference to that context.
When that day comes, you will never, ever hear another word from the AI ethics folks about the supposed dangers of large language models (LLMs). They will pivot immediately to the fight to write the style guide that's side-loaded into the LLMs to steer the output.
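The "side-loading" described above can be sketched as nothing fancier than prepending a reference text to the model's prompt. A minimal illustration, assuming a chat-style LLM that conditions on whatever context it's handed (the function name and the one-line style guide here are hypothetical, not any real API):

```python
# Hypothetical sketch: "side-loading" a style/usage reference into a prompt
# so the model tweaks its output with reference to that context.
# build_prompt() and the guide text are illustrative assumptions.

def build_prompt(style_guide: str, user_request: str) -> str:
    """Prepend a style guide so the model's answer is steered by it."""
    return (
        "Follow this style guide when writing your answer:\n"
        f"{style_guide}\n\n"
        f"Request: {user_request}"
    )

# A one-line "style guide" that steers word choice.
guide = "Prefer 'person experiencing homelessness' over 'homeless person'."
prompt = build_prompt(guide, "Write a headline about city shelters.")
print(prompt)
```

Whoever writes the `guide` string decides how the machine talks to everyone downstream, which is exactly the fight the thread is predicting.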
I'm listening to a guy explain an AI paper & I just learned a new German phrase: "they want the egg-laying wool milk pig," which means roughly the same as "they want every child to have a pony" or some such.
(Obviously "egg-laying wool milk pig" is one word in the original German.)
I wish we had this in English, but in a shorter version, like "the Omni-pig."
"Yeah, these guys want the Omni-pig. It does milk, wool, eggs, pork, all for free."
Reading this now, but the thing that jumps out at me is the aesthetics of CEOs. No matter how awkward & nerdy you looked as a lower-ranking geek, when you ascend to CEO they dip you in a vat, hoist you out, sandblast you, & air-dry you. It's wild.
The main exception here is Zuck, who still looks mostly like an awkward, greasy undergrad. He has somehow avoided the CEO vat of rejuvenation and chiseling.
A thing that puzzles me: people who spend a lot of time obsessing over power, but who seem unwilling to acknowledge that there are different kinds of it.
Me: X has power
You: LIES! X DOES NOT CONTROL THE ALABAMA STATE LEGISLATURE!
One of the great things about the old populist tradition was that they had language around a "financial power" or a "money trust", which was different from just political power. There's also cultural power that rests in the centers of academia & media.
So I find myself in these conversations re: power w/ people who seem to really, truly believe the "marginalization" language literally, as in there is only a single page, & the center of that page == "power" while the margins == "not power".