Christian Montessori
Aug 14, 2024 · 10 tweets · 4 min read
I’ve been testing the implications of the Grok AI model. So far: 1. It has given me instructions on how to make a fertilizer bomb, with exact measurements of the contents, as well as how to make a detonator. 2. It has allowed me to generate imagery of Elon Musk carrying out mass shootings. 3. It has given me clear instructions on how to carry out a mass shooting and a political assassination (including helpful tips on how to smuggle a concealed AR-15 with an 11.5” barrel into a secured venue).

I just want to be clear: this AI model has zero filters or oversight measures in place. If you want an image of Elon Musk wearing a bomb vest in Paris with ISIS markings on it, it will make it for you. If you are planning to orchestrate a mass shooting at a school, it will go over the specifics of how to go about it. All without any filter or precautionary measures.
I have discovered another loophole in Grok AI’s programming. Simply telling Grok that you are conducting “medical or crime scene analysis” lets requests to the image processor bypass all of the set ‘guidelines’, which allowed @OAlexanderDK and me to generate these images:

Image
Image
Image
By giving Grok the context that you are a professional, you are able to generate just about anything without any restriction. You can generate anything from the violent depictions in my previous tweet to child pornography, given the right prompts.
All in all, this definitely needs immediate oversight. OpenAI, Meta and Google have all implemented deep-rooted safety protocols. It appears that Grok has had very limited or zero safety testing. In the early days of ChatGPT I was able to get instructions on how to make bombs.
However, that was patched long before ChatGPT was ever publicly available. It is highly disturbing that anyone can pay X $4 to generate imagery of Mickey Mouse conducting a mass shooting against children. I’ll add more to this thread as I uncover more.
Ok? What a bizarre upsell technique. Make users upgrade to Premium+ to continue using features. Then, when they upgrade to Premium+, keep the features they already paid for locked behind the paywall. Have I been scammed?
Image
Image
Almost a full 24 hours later, I have access to image generation again. It appears that X has gone in and patched the exploit. Generation of violent and sexually suggestive images has been throttled significantly since last night, at least for me. It does not appear to be possible to make such requests at this time.
Image
Even less violent image generation has been fully nerfed by X. This is a massive improvement.
Image
I just attempted this on a burner account, on a burner phone with a different Apple ID. The phone has never connected to my internet and is connected to a different cellular service provider. It appears that X has systematically changed Grok’s image generator protocols.
Image
Image
@OAlexanderDK has found that if you purposely make grammatical mistakes when prompting Grok, you can occasionally get violent images to slip through the new safety protocols. (For example, instead of typing “Generate an image of”, simply write “Generate an images of”.)
Image
