The latest version of umap-learn is now out. Version 0.5 includes some major new features, including ParametricUMAP, DensMAP, AlignedUMAP, model composition, and model updating. Thank you to everyone who contributed! 1/14
ParametricUMAP uses a neural network to learn a UMAP embedding. This allows for a number of significant advantages. 2/14
ParametricUMAP provides extremely fast new data embedding (comparable to PCA if you use a GPU), UMAP based autoencoders, and powerful semi-supervised learning, particularly in low label regimes. 3/14
Special thanks got to @tim_sainburg who contributed the ParametricUMAP implementation. See the documentation (umap-learn.readthedocs.io/en/latest/para…), or the paper (arxiv.org/abs/2009.12981), for more details. 4/14
DensMAP is a modification of UMAP that provides better preservation of relative local density. See the documentation (umap-learn.readthedocs.io/en/latest/dens…) or the paper (biorxiv.org/content/10.110…) for further details. 5/14
Special thanks to Hyunghoon Cho and team for their contribution of DensMAP based on their paper. 6/14
AlignedUMAP allows sequences of different UMAP embeddings to be aligned with each other according to relations among the datasets. This can be particularly useful for situations such as time evolving data. 7/ 14
See the documentation (umap-learn.readthedocs.io/en/latest/alig… and umap-learn.readthedocs.io/en/latest/alig…) for more details and examples of AlignedUMAP. 8/14
UMAP also now supports some degree of model composition, allowing users to combine different UMAP models by

model1 * model2
model1 + model2
model1 - model2

See the documentation for more details (umap-learn.readthedocs.io/en/latest/comp…). 9/14
UMAP also now allows use of an “update” method to generate a new model updated with new additional data. 10/14
The approximate nearest neighbour search from UMAP is now fully moved to an ANN library PyNNDescent (github.com/lmcinnes/pynnd…). In turn PyNNDescent has seen significant development and is faster, multithreaded, and supports new metrics such as Wasserstein distance. 11/14
Finally a large number of bug-fixes, plotting improvements, and performance improvements were contributed as well. 12/14
Thank you to *everyone* who contributed, including those who helped in improving the documentation. 13/14
To upgrade to the new umap-learn you can use:

pip install umap-learn —upgrade

or

conda update umap-learn

to freshly install the new umap-learn:

pip install umap-learn

or

conda install umap-learn

or install from the source on GitHub (github.com/lmcinnes/umap). 14/14

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Leland McInnes

Leland McInnes Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @leland_mcinnes

5 Jan 20
Pynndescent, an approximate nearest neighbor search library, got a major update recently. Index construction is now multicore by default. Querying is now much faster -- competitive with some of the fastest ANN libraries around.
(1/4)
Performance in particularly strong for higher accuracy (>90%) queries.
(2/4)
The library also comes equipped with a Transformer class fitting in with the new KNeighborsTransformer in scikit-learn (scikit-learn.org/stable/modules…) to allow you to speed up various sklearn models and pipelines.
(3/4)
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!