(* in the spirit of @iamtrask and @FelixHill84, exclusively from other groups :)).
[1] Tomáš Mikolov et al: Interspeech 2010, fit.vutbr.cz/research/group…
@RichardSocher @chrmanning @AndrewYNg
nlp.stanford.edu/pubs/2010Soche…
[3] Alex Graves, Generating Sequences With Recurrent Neural Networks
arxiv.org/abs/1308.0850
@DeepMindAI
[4] T Mikolov, I Sutskever, K Chen, GS Corrado, J Dean. Distributed representations of words and phrases and their compositionality arxiv.org/abs/1310.4546
@JeffDean
[5] Cho, Kyunghyun et al.: Learning Phrase Representations ... arxiv.org/abs/1406.1078
@kchonyc
[6] Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio: Neural Machine Translation by Jointly Learning to Align and Translate arxiv.org/abs/1409.0473
@kchonyc @MILAMontreal
[7] Awni Hannun et al. : Deep Speech: Scaling up end-to-end speech recognition arxiv.org/abs/1412.5567
[8] Bowman, Potts & Manning: arxiv.org/abs/1406.1827
@sleepinyourhat @ChrisGPotts
@elikiper @yoavgo
[10] R Sennrich, B Haddow, A Birch, Edinburgh Neural Machine Translation Systems for WMT 16
aclweb.org/anthology/W16-…
@alexandrabirch1 @RicoSennrich
[11] Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin arxiv.org/abs/1706.03762
[12] Peters, Neumann, Iyyer, Gardner, Clark, Lee, & Zettlemoyer : Deep contextualized word representations: arxiv.org/abs/1802.05365
@nlpmattg
@toutanova