How to get URL link on X (Twitter) App
https://twitter.com/gantry_ml/status/1600508983814537222Split it into train/val/test, iterated on the feature set a bit, and eventually got a good test accuracy. Then I “productionized” it, i.e., put it in a dataswarm pipeline (precursor to Airflow afaik). Then I went back to school before the pipeline ran more than once.
https://twitter.com/eugeneyan/status/1563007015295008769(1) "quality" is not easily defined by a human---are you gonna comb through every feature column and create bounds?---and (2) the distribution matters; it's hard to look at one record alone and know whether it's "broken"
https://twitter.com/mhajabri/status/14600604334826659901. Convince yourself that operationalizing ML, even as a 1-person team, is a hard problem. What are some differences between a kaggle project and a production ML service? Do some tutorials -- Here's a more-than-hello-world toy ML pipeline I've built: github.com/shreyashankar/…
https://twitter.com/AndrewYNg/status/1396922136808202241In the thread @AndrewYNg (accurately) argues that iterating on the data is more fruitful in developing ML applications. This begs the question, what really is the biggest difference between academic ML and industry ML tasks? (2/7)