Here's a network graph for a popular hashtag. Since the graph has no labels, you can't tell what hashtag it is, or what anything in the graph actually means, but it's colorful and pretty and weird and therefore incredibly tempting to retweet, right?
The hashtag in question is #CatsOfTwitter, and here's a more boring-looking version of the same graph with more context. The interaction being graphed is retweets, with the more frequently-retweeted accounts shown larger on the graph, and the date range is also included.
One can alter the apparent meaning of a graph via manual editing. Here, three of the accounts have been dragged off to the top left, suggesting a relationship between them that isn't supported by the underlying data. It's technically still "correct", but it's misleading.
Graphs can also be misleading if one highlights a particular aspect without exploring it. Here, the retweets from bots (automated accounts) have been colored pink, revealing what appears to be a cluster of automated activity. Is this some kind of nefarious #CatsOfTwitter botnet?
Nope. Changing settings so that the label size is proportional to the number of times the account retweeted a #CatsOfTwitter tweet (rather than the number of times the account was retweeted) reveals that almost all of the automated activity is from just two accounts.
Overall point: while data visualizations are very useful and effective (and sometimes pretty) ways of summarizing lots of data, context is vital to interpreting them and should always be included.
Footnote: for this portion of the thread, automation was determined based on the source app used to post the tweet/retweet as provided by the Twitter API.
Some thoughts on perennial pitfalls in news coverage of social media manipulation that frequently result in reporting on fake accounts/bots/etc being far less accurate and informative than it ought to be...
The most common problem with news articles about fake accounts: failure to include any examples of fake accounts or evidence of their inauthenticity. Any or all of these headlines might be accurate, but you can't tell from the articles, due to absence of evidence.
A related issue: articles like the "Nearly Half of Biden/Trump's Followers Are Fake" and "Nearly Half Of Accounts Tweeting About Coronavirus Are Bots" pieces base their numbers on closed-source third party tools, which may or may not actually be detecting anything useful.
Does thanking, praising, or insulting an LLM-based chatbot affect the speed or accuracy of its responses to questions involving basic arithmetic? Let's find out!
For this experiment, Meta’s Llama 3.1 model was asked to add and multiply random numbers between 10 and 100, with six different wordings: polite, rude, obsequious, urgent, and short and long neutral forms. Each combination of math operation and wording was tested 1000 times.
Results: asking the questions neutrally yielded a faster response than asking politely, rudely, obsequiously, or urgently, even if the neutral prompt was longer. Overall, obsequious math questions took the longest to process, followed by urgent, rude, and polite questions.
Just for fun, I decided to search Amazon for books about cryptocurrency a couple days ago. The first result that popped up was a sponsored listing for a book series by an "author" with a GAN-generated face, "Scott Jenkins".
cc: @ZellaQuixote
Alleged author "Scott Jenkins" is allegedly published by publishing company Tigress Publishing, which also publishes two other authors with GAN-generated faces, "Morgan Reid" and "Susan Jeffries". (A fourth author uses a photo of unknown origin.)
As is the case with all unmodified StyleGAN-generated faces, the facial feature positioning is extremely consistent between the three alleged author images. This becomes obvious when the images are blended together.
The people in these Facebook posts have been carving intricate wooden sculptures and baking massive loaves of bread shaped like bunnies, but nobody appreciates their work. That's not surprising, since both the "people" and their "work" are AI-generated images.
cc: @ZellaQuixote
In the last several days, Facebook's algorithm has served me posts of this sort from 18 different accounts that recycle many of the same AI-generated images. Six of these accounts have been renamed at least once.
The AI-generated images posted by these accounts include the aforementioned sculptures, sad birthdays, soldiers holding up cardboard signs with spelling errors, and farm scenes.
The common element: some sort of emotional appeal to real humans viewing the content.
As Bluesky approaches 30 million users, people who run spam-for-hire operations are taking note. Here's a look at a network of fake Bluesky accounts associated with a spam operation that provides fake followers for multiple platforms.
cc: @ZellaQuixote
This fake follower network consists of 8070 Bluesky accounts created between Nov 30 and Dec 30, 2024. None has posted, although some have reposted here and there. Almost all of their biographies are in Portuguese, with the exception of a few whose biographies only contain emoji.
The accounts in this fake follower network use a variety of repeated or otherwise formulaic biographies, some of which are repeated dozens or hundred of times. Some of the biographies begin with unnecessary leading commas, and a few consist entirely of punctuation.
It's presently unclear why, but over the past year someone has created a network of fake Facebook accounts pretending to be employees of the Los Angeles Dodgers. Many of the accounts in this network have GAN-generated faces.
cc: @ZellaQuixote
This network consists of (at least) 80 Facebook accounts, 48 of which use StyleGAN-generated faces as profile images. The remaining 32 all use the same image, a real photograph of a random person sitting in an office.
As is the case with all unmodified StyleGAN-generated faces, the main facial features (especially the eyes) are in the same position on all 48 AI-generated faces used by the network. This anomaly becomes obvious when the faces are blended together.