🧵 Some thoughts about the recent release of Galactica by @MetaAI (everything here is my personal opinion) 👀

Let's start with the positive / What went well
[1] The model was released and Open Source*

Contrary to the trend of very interesting research being closed or just accessible through paid APIs, by open-sourcing the models and building on top of existing OS tools, evaluation can be reliably done in a transparent and open way
[2] There was a demo with the release**

Demos allow for a much wider audience to understand how models work. By having a demo with the release, a much more diverse audience can explore the model, identify points of failure, new biases, and more.
[3] Technically impressive!

Big Kudos where it's deserved. The model is technically impressive, with strong performance in different benchmarks, 50% citation accuracy, generation of latex and SMILES formulas, and more.
/ What went wrong

[1] Hype in announcements, mixing end-product with research. The announcement and page talk about "solving" information overload in science and that this can be used to write scientific code.

This communication style is very misleading and will cause misuse
[2] Safety Filter in demo erasing communities

Although I imagine this was well-intentioned, the (non-transparent) safety filter removed content about queer theory and AIDS

OpenAI has been doing the same with Dalle 2 and received backlash as well
The safety filter
- Censors content about minorities, further marginalizing people
- Contradicts the idea of storing and reasoning about scientific knowledge

See more at
[3] Use cases were unclear, undocumented, or misleading

The limitations stated in the site and paper are quite limited and somewhat unclear. The paper says, "we would not advise using it for tasks that require this type of knowledge as this is not the intended use-case."
There is also a somewhat hidden model card in github.com/paperswithcode…

But I find again that the documentation around limitations, biases, and use cases is too limited, given how powerful the model is
[4] Demo

Although having a demo was nice, it could have done a better job in
- Adding clearer disclaimers
- Changing the UI to make it less like real-papers
- Having a mechanism to identify such generated content
- Adding a way to flag toxic and erroneous content
[5] Related to the previous point, there was a lack of opportunity for the community to discuss and report issues, just by Twitter.

At @huggingface we learned that creating a space for public, open and transparent discussions on models is essential

As such, users have mechanisms to report outputs generated by the demo, explore the code used to create it, and discuss with the community about the work openly and transparently.
So TL;DR, what could be done better

- More explicit use cases and limitations
- Better documentation of the model
- Consider OpenRAIL licenses, which dive into use cases much more than classical software licenses
- Add disclaimers if there are any future demos

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Omar Sanseviero

Omar Sanseviero Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @osanseviero

Oct 31
This week @huggingface Spaces of the week are 🔥Check this thread for their respective announcements, or go directly and play with the demos!

🧵Check them out! huggingface.co/spaces Image
[1/7] Stable Diffusion Multiplayer by @radamar is an amazing collaborative experiment in which hundreds of people interact to paint a canvas using Stable Diffusion

[2/7] Finetuned diffusion by @hahahahohohe allows you to experiment with 10+ fine-tuned diffusion models, from pokemon to cyberpunk anime. This is amazing! Many models by @Nitrosocke 🔥

Read 8 tweets
Oct 26
CLIP Interrogator + Stable Diffusion

I used CLIP Interrogator to generate the prompt of an image, then I fed the prompt to SD, and used the image in CLIP Interrogator...and again!

🧵Follow along in this llamastic journey! Image
before / after ImageImage
b/a ImageImage
Read 14 tweets
Oct 24
I just published a new edition of the free weekly ML News 🔥

Check it out! 👀
thehackerllama.substack.com/p/machine-lear…
Three interesting papers about editing images with text (Imagic, DiffEdit, and Prompt to Prompt)

Exciting updates from @runwayml with a new Stable Diffusion version, upgraded Inpainting, and more!

Read 6 tweets
Oct 11
Six open-source ML demos from the last 6 days 🔥

1. Stable Diffusion Infinity 🎨

Outpaint Stable Diffusion on an infinite canvas

Demo huggingface.co/spaces/lnyan/s…
GitHub github.com/lkwq007/stable… by @lkwq007
2. Positive Reframing 🤗

Start inputting negative texts to see how you can see the same event from a positive angle.

Choose among different strategies and reframe them to behave more positively! 😀

huggingface.co/spaces/Ella232…
3. Huggy, the RL dog 🐶

huggingface.co/spaces/ThomasS… by @ThomasSimonini

Play with Huggy, @huggingface best friend, and see how Huggy catches the stick thanks to the power of deep Reinforcement Learning!
Read 6 tweets
Sep 26
🧵[1/8] Did you know you can find cool free top demos at hf.co/spaces (Spaces of the week)?

Let's explore them!

🔊@OpenAI Whisper Multilingual Speech Recognition model demo allows you to speak and get a transcript in seconds!
[2/8] Stable Diffusion Conceptualizer 🧑‍🎨

Navigate 500+ community-taught objects and styles and create images using them!

huggingface.co/spaces/sd-conc…
[3/8] @mozilla Foundation YouTube video similarity 🦊

huggingface.co/spaces/mozilla…

Compute the semantic similarity between two videos. This project is multilingual as well!
Read 8 tweets
Sep 5
Did you know you can check out the trending ML demos at hf.co (at the right in trending)

Check out some of them🧵👇
1. Stable Diffusion huggingface.co/spaces/stabili…
2. Diffuse the Rest - img2img app huggingface.co/spaces/hugging…
3. ERNIE-ViLG - sota Chinese text to image huggingface.co/spaces/PaddleP…
4. Dalle Mini - huggingface.co/spaces/dalle-m…
5. Musika - Some ML-generated music huggingface.co/spaces/marcop/…
6. DocQuery - Document Visual Understanding huggingface.co/spaces/impira/…
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(