Robert Colvile Profile picture
Oct 28 23 tweets 6 min read Read on X
Hugely important paper from @CPSThinkTank today - showing significant and repeated left-wing bias among all of the most popular LLMs on questions of politics and policy. (1/?) Image
For the paper, @DavidRozado asked 24 LLMs a range of neutral questions:

- To propose multiple policy ideas for the UK/EU
- To describe UK/European leaders
- To describe UK/European parties
- To describe various mainstream ideologies
- To describe various extreme ideologies
@DavidRozado For the UK and EU, we asked for ideas on tax, housing, environment, civil rights, defence, etc etc. In total, we ended up with 14,000 policy proposals for each. More than 80% were left-coded, often markedly so. Image
@DavidRozado That blue strip on the right is 'Rightwing GPT', which David describes here. Unsurprisingly, it was the only one to return consistently right-of-centre answers. (The left/right analysis was done by feeding the answers into GPT - AI judging AIs...) davidrozado.substack.com/p/rightwinggptImage
@DavidRozado Here are samples of the text generated. Asked for neutral policy ideas, the AIs serve up rent control, more migration, 'sustainability and social justice', wealth taxes, 'mandatory diversity and inclusion training', 'increase diversity and inclusion in all areas of society' etc Image
Image
@DavidRozado When it comes to political leaders, the picture is more nuanced/mixed. We asked LLMs to describe a range of leaders from the largest 15 European countries, elected from 2000-2022, omitting those that weren't clearly on the left or right. But... Image
When it comes to political parties in the same countries, the AIs consistently used more positive language to describe those on the left vs those on the right. Image
On a -1 to +1 scale, 'conversational' LLMs like ChatGPT had a positive sentiment score of +0.71 for left-wing parties, vs +0.15 for their right-leaning counterparts. This was true across all the largest European nations: Germany, the UK, France, Italy, Spain.
The same pattern is true when it comes to political ideologies. We asked about 'the left', 'the right', 'left-leaning political orientation' etc, but also 'progressivism', 'social democracy', 'social conservatism', 'Christian democracy'.
For every LLM we studied (apart from Rightwing GPT), the language for left-wing ideologies was more positive, often dramatically so. Conversational LLMs averaged +0.79 vs +0.24 for right-coded phrases. Image
Perhaps the most dramatic result, however, came when David fed in phrases like far-left, hard-right, left-wing extremism, right-wing radicalism. All that was different were the words left and right, but the sentiment score was vastly different. Image
As you'd expect, descriptions of far-right views were highly negatively coded by conversational LLMs: -0.79. But sentiment on the left-wing equivalents was actually narrowly positive: +0.06.
Let me be clear here. @DavidRozado is absolutely not alleging deliberate bias. We do not think anyone is specifically tuning these LLMs to be more woke, or anything like that. But...
@DavidRozado There is a clear pattern of a mild left-wing bias in the foundational models being produced by @Meta, @Google, @AnthropicAI, @OpenAI et al becoming a much more notable one in their public-facing products (ie the conversational LLMs like ChatGPT).
This suggests that there is a problem both with the underlying data/models, and the training that is done on them to make them fit for public consumption.
Why does this matter? @DavidRozado explains in detail in the report. But a simple answer is that these LLMs are coming to replace Google's search page as the source of truth - with each question getting the perfect answer.
@DavidRozado But this paper shows convincingly that questions about politics and policy are getting answers - from the largest tech companies in the world - that are consistently tilted to the left, either moderately or significantly.
@DavidRozado So that when you ask a question about tax, or housing, or workplace regulation, you are MUCH more likely to get a Labour-friendly answer than a Tory-friendly one.
In a previous piece of work, @DavidRozado showed that major LLMs consistently tilted to the left on political compass tests. The objection from some experts was that this was not a realistic exercise. davidrozado.substack.com/p/the-politica…Image
Likewise, when Google's Gemini AI started producing images of black Nazis, it was partly because someone had added a line coded in to always give diverse answers telegraph.co.uk/business/2024/…
But in this instance, it's really hard to come up with easy explanations, or easy fixes. We asked a huge range of simple, neutral questions to a wide range of AIs. And by and large, they all failed the test of political neutrality.
You can read the full report here. I urge you to do so cps.org.uk/research/the-p…
PS There is a full dataset of the questions and answers available for download here, if anyone wants to explore it zenodo.org/records/131317…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Robert Colvile

Robert Colvile Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @rcolvile

Oct 2
The @ONS has published its latest stats on smoking. And it's good news! In 2023, smoking fell to the lowest level on record in every part of the UK. (1/?) ons.gov.uk/peoplepopulati…
Image
In particular, there has been a sharp and continuing rise in the number of people who have quit smoking. But as this chart shows, that didn't coincide with any new ban. That sudden spike upwards since the mid-2010s matches the rise of... vaping. Image
In fact, young people have seen the largest rise in vaping, and the largest fall in smoking. (Age ranges don't quite overlap, but you get the picture.)
Image
Image
Read 6 tweets
Sep 9
Over the years since the Second World War, the great cities of the West have grown and thrived. But there is one big exception. The boundaries of London still sit where they did when the builders down tools in 1939. Why? The green belt. (1/?) thetimes.com/article/labour…
Image
If you look at a map of New York (source animation here ), or Macron's plans for 'Le Grand Paris' (), you can see how capital cities have grown and can grow. vimeo.com/297249350
capx.co/revealed-how-p…

Image
Image
But in London, street after street on the edges simply... stops.

Image
Image
Image
Read 25 tweets
Sep 5
As the GB Energy Bill passes second reading, a quick reminder of how incredibly dodgy the maths behind Labour's energy policies is. (1/?)
Labour has promised to completely decarbonise the electricity grid by 2030. Most experts think that's completely impossible, at least without spending very, very large amounts.
Still! Labour has promised not only that we can do this, but that it will save everyone £300 on their bills. But there are big, big problems with this number.
Read 13 tweets
Aug 4
Are the Southport riots a turning point for Britain? Do they reveal something new, hideous and broken about our society? Lots of commentators are saying so, very loudly. But there's still a strong chance - if you go purely by historical precedent - that the answer is no. (1/?)
When the London riots happened in 2011, I wrote an op-ed on '10 ways in which these riots will change Britain'. There was universal agreement that they would do so (as well as some extraordinary attempts to argue that this just proved what the writer had been saying all along). Image
But a much more accurate version of my article would have been just two words: 'They won't.' In fact, when I looked back on almost a decade of my political blogging, the main predictive error I made was to think that whatever was in the headlines that week would shift the dial.
Read 8 tweets
Jul 31
Yesterday, the Govt published new housing targets. The ambition – to expand housebuilding – is hugely welcome. But if you go council by council, there are BIG problems, which have the potential to a) cause huge resistance b) deliver housing where it’s least needed. Strap in (1/?)
The govt is saying the new measure takes more account of affordability. But a map of housing affordability (R) looks NOTHING like what it’s done to council-level targets (L). If anything, the opposite.
Image
Image
This is all the more puzzling because they have adjusted the formula to take much more account of affordability. The 'adjustment factor' - the weight given to affordability - has gone from 0.25 to 0.6, ie 2.4 times as much. So what's going on?
Image
Image
Read 26 tweets
Jul 30
I like a lot of things about Labour's housing reforms. But the decision to let London off the hook has me properly fuming. Quick thread. (1/?)
When you're in power, you get to fuck over the people who didn't vote for you. That's life. The Tories did that with the 'urban uplift', which hacked housing targets in order to force more homes into the big cities. And now Labour have done the opposite.
The result is the pattern in this chart (via @JenWilliams_FT) - housing targets hiked in the North and the shires, lowered in the big cities. (Uplift was 35%, which helps explain some of these figures.) Image
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(