Want to follow along as we post lectures publicly?👇
2/ Sign up to receive updates on our lectures as they're released (and to optionally participate in a synchronous learning community): forms.gle/zqE2rjkfqex2AQ…
3/ We cover the full stack, from project management to MLOps:
- Formulating the problem and estimating cost
- Managing, labeling, and processing data
- Making the right HW and SW choices
- Troubleshooting and reproducing training
- Deploying the model at scale
4/ Deep Learning doesn’t stand still and neither do we. Some of what's new for 2021:
- Module on AI ethics and explainability
- Much more on data infra: spark, airflow, feature stores
- Deep dive on ML testing and monitoring
- Updates to infra and tooling (🤗, new GPUs, etc)
5/ Let us know if you have any other requests for topics to cover, and hope to see you online!
• • •
Missing some Tweet in this thread? You can try to
force a refresh
1/ Last week's Production ML meetup featured Peter Gao and Princeton Kwong, former Engineering Managers at Cruise and Aquabyte. Below, their insights on data quality and its downstream effects for computer vision use cases:
2/ What is your experience with data quality?
- Cruise: (1) poorly-labeled data confuses the model; (2) models may perform poorly on edge case objects
3/ Data quality cont'd
- Aquabyte: No public datasets to work with. Our engineers went onsite to collect ground-truth data, built a huge labeling pipeline to get the data to human labelers, and designed our own labeling interface that enabled labelers to properly label fishes.