Martin Larralde Profile picture
Dec 4 8 tweets 4 min read
After a few quiet months, the next release of #PyHMMER is out today with some new major features!
You can now use a target database for search methods without having to load it entirely in memory, just pass a SequenceFile object directly (s/o @felixmlanger):
Also, there's now an `hmmscan` implementation! Target profiles can be either loaded from a pressed file (like the original), or pre-fetched into memory, which gives massive performance improvements if you can afford the extra RAM. A benchmark of 4,489 proteins against Pfam v33.1:
The API for filtering hits and domains was also improved, with a shortcut for iterating only on hits and domains under inclusion or report thresholds (s/o @satriaphd):
I added a new guide to the documentation (pyhmmer.readthedocs.io/en/stable/exam…) showing how to manage HMM files inside a Python package (for instance for using custom HMMs inside a CLI tool), and some additional performance tips for achieving top performance.
This release also has several bugfixes (small memory leaks in `nhmmer`, potential segfaults in `hmmsearch`, etc.) and some changes in the lower-level interface to make the API more Pythonic overall. Full release notes on GitHub: github.com/althonos/pyhmm…
Finally, I am working on the manuscript for submitting an application note. If my main projects don't take too much time, i hope to submit in the next couple of months!
... (forgot the mandatory #bioinformatics and #Python tags, how are the auto-retweet bots gonna find out) ...

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Martin Larralde

Martin Larralde Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @althonos

Aug 5
Just released another #Python package for #bioinformatics: #Cython bindings to FAMSA, a very fast algorithm for #MultipleSequenceAlignment by @sdeorowicz, Agnieszka Debudaj-Grabysz and @AdamGudys! Get it here: github.com/althonos/pyfam…
Just create an Aligner object (with additional configuration if you want), then give it some sequences to align:
It's missing some small things that I'll add over time, but for now the key features are there, and you could use it in your Python or #snakemake workflows in place of the FAMSA binary.
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(