Which text types are processed here? Medical literature, patient notes, electronic health records, clinical reports etc.
But how to start?
First you need to identify the different entities such as compounds, diseases, adverse drug effects and receptor bindings.
This is achieved through Natural Language Processing (NLP) and there are suitable pre-trained models for processing biomedical, scientific or clinical text like scispaCy
The next challenge is to extract the different relations! Diseases are related to genes which are related to receptors and compounds can bind to these receptors.
Sounds simple at first but there are several problems that need to be solved
Problems to solve
1. Difficult to ingest and integrate complex networks of text mined outputs 2. Difficult to contextualize knowledge extracted from text with existing knowledge 3. Difficult to investigate insights in a scalable and efficient way
In fall 2020, Twitter users raised concerns that the automated image cropping system on Twitter favored light-skinned over dark-skinned individuals, as well as concerns that the system favored cropping woman's bodies instead of their heads
Did you think bringing your machine learning model to production was the hard part?
What about model drift?
Now MLOps comes into play but how does it work and what are good tools?
What is:
- Continuous integration (CI)
- Continuous deployment (CD)
- Continuous training (CT)
The full MLOps life cycle
- Data Engineering: Get and clean the data recurring if necessary
- Model Engineering: Model training, evaluation, testing, and packaging
- Model Deployment: integrating the trained model. Model serving, performance monitoring
Why is MLOps important?
Just because your model is hitting now doesn't mean it will be doing so 6 months from now
Where the big ones like OneNote, Google Keep and Evernote fail is that the brain does not work like an index, thoughts are linked and associatively this is where the next generation of note taking apps show their strength.
Your open source project is ready for deployment? Documentation is still missing?
Good documentation and its presentation is an art!
A case study with 4 examples on awesome documentation
What makes good documentation?
- No prosaic texts! Choose a practical approach with code snippets
- Good structure and overview with a quick entry then in depth
- Good search is everything
- Good code examples
Why?
-Extremely good search
-These diagrams eye candy everywhere!
-Interactivity
-Live code examples that can be customized and run in a Binder container