Tweet

Martin Larralde

Follow @althonos

Dec 4 • 8 tweets • 4 min read

After a few quiet months, the next release of #PyHMMER is out today with some new major features!

@felixmlanger

You can now use a target database for search methods without having to load it entirely in memory, just pass a SequenceFile object directly (s/o @felixmlanger):

Also, there's now an `hmmscan` implementation! Target profiles can be either loaded from a pressed file (like the original), or pre-fetched into memory, which gives massive performance improvements if you can afford the extra RAM. A benchmark of 4,489 proteins against Pfam v33.1:

@satriaphd

The API for filtering hits and domains was also improved, with a shortcut for iterating only on hits and domains under inclusion or report thresholds (s/o @satriaphd):

I added a new guide to the documentation (pyhmmer.readthedocs.io/en/stable/exam…) showing how to manage HMM files inside a Python package (for instance for using custom HMMs inside a CLI tool), and some additional performance tips for achieving top performance.

This release also has several bugfixes (small memory leaks in `nhmmer`, potential segfaults in `hmmsearch`, etc.) and some changes in the lower-level interface to make the API more Pythonic overall. Full release notes on GitHub: github.com/althonos/pyhmm…

Finally, I am working on the manuscript for submitting an application note. If my main projects don't take too much time, i hope to submit in the next couple of months!

... (forgot the mandatory #bioinformatics and #Python tags, how are the auto-retweet bots gonna find out) ...

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @althonos

Martin Larralde

@althonos

Aug 5

@sdeorowicz

Just released another #Python package for #bioinformatics: #Cython bindings to FAMSA, a very fast algorithm for #MultipleSequenceAlignment by @sdeorowicz, Agnieszka Debudaj-Grabysz and @AdamGudys! Get it here: github.com/althonos/pyfam…

Just create an Aligner object (with additional configuration if you want), then give it some sequences to align:

It's missing some small things that I'll add over time, but for now the key features are there, and you could use it in your Python or #snakemake workflows in place of the FAMSA binary.

Read 4 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Martin Larralde

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @althonos

Martin Larralde

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!