- “example spark config” stackoverflow post
- sklearn documentation
- hatred for Airflow DAGs
- awareness of k8s and containers but no idea how to actually use them
- “the illustrated transformer” blog post
- cursing US-West-2 for not having any instances available
- reviewing data scientists’ code & wishing it was cleaner
- reviewing software engineers’ code & wishing your code could be half as good as theirs
- weekly emails from ML tooling startups trying to sell their products
- spending 10x as much time cleaning data as training models on it
- model.save(“./checkpoints/final_model_v14”)
- thinks “i need another intern” multiple times a day
- can’t reproduce the data scientist’s results but productionizes the model anyways
- googles “how does t-sne work” every few months