Matthew Honnibal
Computational linguist from Sydney and Berlin. 💫 Author of the @spacy_io NLP tools. 💥 Founder @explosion_ai
Oct 15, 2020 7 tweets 2 min read
This release was SO much work! Glad to finally have it out 🍻
The big impact for users will definitely be the transformer models and config system, but I want to talk about a feature I wasn't expecting to build until a few months ago: the new workflow system, spaCy Projects 🧵
spaCy Projects was inspired by @DVCorg, and has an easy integration for DVC users. But it's also standalone: you can write a single YML file and spaCy will get your data, trigger your processing steps, and use remote cache. It even generates readmes: github.com/explosion/proj…
Nov 28, 2018 10 tweets 2 min read
If you try out the new spacy-nightly (v2.1.0a3), you might be surprised to see it's single-threaded. This actually took a tonne of work! I've spent many hours getting the Blis linear algebra routines into a stand-alone, wheel-installable package. Why? A thread on threads 🧵 1/10
In spaCy 2 we switched over to neural network models, so the bottleneck in spaCy comes down to matrix multiplication. Most Python libraries delegate CPU matrix multiplication to numpy, which then delegates it to a low-level library. Which library? Well, that depends. 2/10
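To see which low-level library a particular numpy install delegates to, one option is to run a small matrix product and print numpy's build configuration. This is a minimal sketch, assuming numpy is installed; the array names are illustrative only, and the output will differ from machine to machine.

```python
import numpy as np

# Two small float32 matrices. For floating-point arrays like these,
# numpy typically hands the product to the BLAS library it was built against.
a = np.arange(6, dtype=np.float32).reshape(2, 3)
b = np.ones((3, 4), dtype=np.float32)
product = a @ b

# Print numpy's build information, which includes its BLAS/LAPACK details.
# What appears here depends on how numpy was built and installed.
np.show_config()
```

The product itself is the same whichever library does the work; only the speed and threading behaviour vary.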
Jun 12, 2018 8 tweets 2 min read
Interesting discussion of developer compensation on HN, especially in the Bay Area: news.ycombinator.com/item?id=172869…
tl;dr: Total compensation for the largest companies is often >300k. If this surprises you, you might find @patio11's take useful.
Some thoughts: 1/8
First, there's much discussion on HN about how startups can compete with this level of compensation. Probably the best answer is they shouldn't. And I don't mean start-ups should pay less and make up the difference with stories, promises and dreams. 2/8