DeepSpeed is a new open-source framework focused on optimizing the training of extremely large deep learning models.
It includes the first implementation of ZeRO as well as other optimization methods. 6/
TheSequence Edge covers:
+An ML concept you should learn
+A review of an impactful research paper
+A new ML framework or platform and how you can use it
thesequence.substack.com/subscribe
7/
• • •
❓AllenNLP:
+includes key building blocks for NLU
+offers state-of-the-art NLU methods
+facilitates the work of researchers
thesequence.substack.com/p/-edge22-mach…
2/
AllenNLP is built on top of @PyTorch and designed with experimentation in mind.
Key contribution: it maintains implementations of new models for tasks such as:
+text generation
+question answering
+sentiment analysis
+and many others
3/