TensorFlow Serving = the first mainstream model serving architecture in ML frameworks
+It serves ML models inside Google
+It is available in the cloud and via open-source
How it was created and how it works?
Thread⬇️
Deep dive into "TensorFlow-Serving: Flexible, High-Performance ML Serving" by @JeremiahHarmsen, @FangweiLi, @sukritiramesh, Christopher Olston, Noah Fiedel, Kiril Gorovoy, Li Lao, Vinu Rajashekhar, Jordan Soyke
2/⬇️
Paper outlined the architecture of a serving pipeline for @TensorFlow models
Capabilities of TensorFlow serving:
+model lifecycle management;
+experiments with multiple algorithms;
+efficient use of GPU resources
3/⬇️
Fundamental building blocks of TensorFlow Serving:
+A C++ library consisting of APIs and modules from which to construct an ML server
+An assemblage of the library modules into a canonical server binary
+A hosted service
4/⬇️
The TensorFlow Serving paper was published together with the framework’s first implementation as part of the TensorFlow project.
TheSequence Edge covers:
+ML concept you should learn
+Review of an impactful research paper
+New ML framework or platform and how you can use it thesequence.substack.com/p/-so-what-is-…
7/7
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Model serving goes a bit beyond deployment, given the unique nature of the lifecycle of ML programs.
ML models operate in a circular lifecycle, where phases such as training and optimization are continuously repeated.
2/⬇️
Some of the most important aspects of any model serving pipeline:
+API interface
+real-time vs. batch execution
+versioning
+A/B testing
+scalability
3/⬇️
It presents a Private Aggregation of Teacher Ensembles (PATE) method to ensure privacy in training datasets
Thread👇🏼 🔎
Imagine that two different models, trained on two different datasets produce similar outputs
Then, their decision does not reveal information about any single training example
And this is another way to say it ensures the privacy of the training data
2/⬇️
PATE uses a perturbation technique that structures the learning process using an ensemble of teacher models communicating their knowledge to a student model
3/⬇️
❓AllenNLP:
+includes key building blocks for NLU
+offers state of the art NLU methods
+facilitates the work of researchers thesequence.substack.com/p/-edge22-mach…
2/
AllenNLP is built on top of @PyTorch and designed with experimentation in mind
Key contribution = maintains implementations of new models:
+text generation,
+question answering,
+sentiment analysis
+& many others
3/