Luca Soldaini 🎀
they/them 🦝 data-centric NLP, LLMs, open science, IR @allen_ai 🧑‍💻Building Dolma to feed OLMo 🍇 @QueerInAI organizer 🏳️‍🌈 On Bluesky 🦋 DMs open!
May 10, 2023 9 tweets 3 min read
PaLM v2 is out! Join me as I read the technical report (ai.google/static/documen…) for pretraining data insights 👇🧵

First, PaLM v2 is trained on a mixture of web/books/code/conversational data; it uses English and non-English text.

[Images: "The PaLM 2 pre-training cor...", "Distribution of Languages i..."]
Jul 8, 2020 4 tweets 2 min read
Look, I appreciate the spirit of this work, but non-binary erasure shouldn't have any place at #acl2020nlp

This work makes my blood boil.

aclweb.org/anthology/2020…

NB folx are **not** a variable that you can just throw away for the sake of simplifying your analysis.

And don't get me started on gender-labeling individuals based on their names.

#acl2020nlp