Discover and read the best of Twitter Threads about #fingercrossed

Most recents (1)

So excited to share our latest preprint!
TLDR:
We present NIQKI, a Mash-like tool able to index large sequence databases.
Its main feature is to be fast!
We were able to index all bacterial genomes of GenBank (>1million) and query all pairwise distances in a couple of days(!)
Behind the scenes, we rely on a novel index structure to query partition-based fingerprints that we call NIQKI (Next Index to Query Kmer Intersection) which use an inverted index for each partition
Concretely, we associate to each <partition/fingerprint_value>pair the list of genomes that got this fingerprint value in this partition.
This means that a query will only generate one costly "Random Access" per partition.
Read 11 tweets

Related hashtags

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!