16 questions that I really like to ask during interviews for machine learning candidates.
These will help you practice, but more importantly, they will help you think and find ways to improve!
Let's do this! ☕️👇
1. How do you handle an imbalanced dataset and avoid the majority class completely overtaking the other?
2. How do you deal with out-of-distribution samples in a classification problem?
3. How would you design a system that minimizes Type II errors?
4. Sometimes, the validation loss of your model is consistently lower than your training loss. Why could this be happening, and how can you fix it?
5. Explain what you would expect to see if we use a learning rate that's too large to train a neural network.
6. What are the advantages and disadvantages of using a single sample (batch = 1) on every iteration when training a neural network?
7. What's the process you follow to determine how many layers and nodes per layer you need for the neural network you are designing.
8. What do you think about directly using the model's accuracy as the loss function to train a neural network?
9. Batch normalization is a popular technique when training deep neural networks. Why do you think this is the case? What benefits does it provide?
10. Why do you think combining a few weak learners into an ensemble would give us better results than any of the individual models alone?
11. Early stopping is a popular regularization technique. How does it work, and what are some of the triggers that you think could be useful?
12. Explain how one-hot-encoding the features of your dataset works. What are some disadvantages of using it?
13. Explain how Dropout works as a regularizer of your deep learning model.
14. How would you go about reducing the dimensionality of a dataset?
15. Can you achieve translation invariance when processing images using a fully-connected neural network? Why is this the case?
16. Why are deep neural networks usually more powerful than shallower but wider networks?
I'll use these questions to guide some of the threads that I'll be posting in the coming days.
Stay tuned for that!
And don't forget to follow me so we can keep working on this machine learning stuff together!
🦕
I usually try to work really hard in avoiding trivia questions. This is why I prefer to ask open-ended questions that allow candidates to show off their ability to reason.
I feel these are in that category. Do you feel differently about some of these?
Let's talk about how you can build your first machine learning solution.
(And let's make sure we piss off half the industry in the process.)
Grab that ☕️, and let's go! 🧵
Contrary to popular belief, your first attempt at deploying machine learning should not use TensorFlow, PyTorch, Scikit-Learn, or any other fancy machine learning framework or library.
Your first solution should be a bunch of if-then-else conditions.
Regular, ol' conditions make for a great MVP solution to a machine learning wannabe system.
Pair those conditions with a human, and you have your first system in production!
Conditions handle what they can. Humans handle the rest.