DL is constructing networks of parameterized functional modules & training them from examples using gradient-based optimization.... facebook.com/722677142/post…
Don't say "DL can't do X" when what you really mean is "supervised learning needs too much data to do X"....
[2 year old post about differentiable programming facebook.com/story.php?stor… ]
(1) what architecture? (it needs to be compatible with backprop)
(2) what learning paradigm and objective function? (RL, SL, SSL...)
(3) what training data and training protocol?
But trainable networks of differentiable modules are here to stay