On the one hand it presents important technical results.
On the other hand, so many people interpret it as "yo, let's replace all RNNs with FF nets". This is wrong. This is NOT the result.
- ha cool we can do language models with feed-forward nets instead of RNNs!
- if we do LM well we will model all of language and achieve AGI!
It doesn't work this way. These are conflicting.