@DamianRomero@sigmoid.social
💬 He/him/they
💼 PhD Candidate @UArizona
🤓 Corpus & computational linguistics
💥 Intern @explosion_ai
⚠️ Opinions are my own
Jun 20, 2022 • 9 tweets • 4 min read
Annotator bias 💫
There are many reasons why Machine Learning models sometimes do not generalize well in production.
One of them may be hidden in your annotation process 🧵
Annotator bias is a type of data bias that comes from a lack of diversity in the annotator population 👥
The bias can result in low-quality datasets if the data does not fit the intended use case
Models trained on biased datasets do not perform well in production
May 2, 2022 • 8 tweets • 3 min read
Annotation guidelines for ML 101 (basic concepts) 🧵
✨ The results of supervised learning approaches are only as good as the annotations they are based on.
✨ Annotations are only as good as the guidelines that annotators rely upon to direct their efforts.
⬇️
What is data annotation? 🪄
✨It is the task of associating entries in your data with additional information
✨It is also known as "coding" or "labeling"
✨It is crucial for both qualitative and quantitative analyses