The past decade has witnessed a groundbreaking rise of machine learning for human language analysis, with current methods capable of automatically accurately recovering various aspects of syntax and semantics - including sentence
structure and grounded word meaning - from large data collections.
Recent research showed the promise of such tools for analyzing acoustic communication in nonhuman species.
They posit that machine learning will be the cornerstone of future collection, processing, and analysis of multimodal streams of data in animal communication studies, including bioacoustic, behavioral, biological, and environmental data.
nbdev is a library that allows you to develop a python library in Jupyter Notebooks, putting all your code, tests and documentation in one place. That is: you now have a true literate programming environment, as envisioned by Donald Knuth back in 1983!
Does BERT Pretrained on Clinical Notes Reveal Sensitive Data? • Large Transformers pretrained over clinical notes from Electronic Health Records (EHR) have afforded substantial gains in performance on predictive clinical tasks.
The cost of training such models and the necessity of data access to do so is coupled with their utility motivates parameter sharing, i.e., the release of pretrained models such as ClinicalBERT.
↓ 2/4
While most efforts have used deidentified EHR, many researchers have access to large sets of sensitive, non-deidentified EHR with which they might train a BERT model (or similar).
Would it be safe to release the weights of such a model if they did?
At its core it consists of a population of Kenyon cells, which receive inputs from multiple sensory modalities.
These cells are inhibited by the anterior paired lateral neuron, thus creating a sparse high dimensional representation of the inputs.
In this work they study a mathematical formalization of this network motif and apply it to learning the correlational structure between words and their context in a corpus of unstructured text, a common natural language processing (NLP) task.
Hey NLP is not only machine learning it also has to do with language.
But you are not alone!
Here are some of my classics that deal with linguistics
Linguistic Fundamentals for Natural Language Processing
by @emilymbender
May all-time favorite & silver bullet! A must read.
The beauty of this book is that it explains the most important linguistic concepts in short chunks and illustrates this with easy to understand examples