Serratus is now published in @Nature :) nature.com/articles/s4158…

We searched 5.7M seq libraries (10.2 petabases) for all 15,000 known RNA viruses. In 11 days, we uncovered 130,000+ new RNA viruses (incl 9 new CoV, with a twist). That’s nearly an order-of-magnitude bump.
[1/N] 🧵👇
[2] For the Scientific Conclusions, @rayanchikhi has a great thread from the preprint.
[3] As the pandemic hit, like many scientists we wanted to help. The idea was simple: analyze all public sequencing data to ensure every possible Coronavirus sequence ever sampled is identified and freely available. And do it fast.

(aka Eye of SRAn)
[4] By luck, @NIHDataScience STRIDES had just finished mirroring the massive Sequence Read Archive (SRA) to cloud platforms. An opportunity!

See their recent update paper! pubmed.ncbi.nlm.nih.gov/34850094/
[5] The world’s DNA/RNA sequencing was at our fingertips as an Open Dataset on @awscloud. Accessing 20 million gigabytes of sequencing data was no longer a bottleneck; we eventually searched all of it in under 11 days.

Take a look under the hood:
[6] 🌍Computationally efficient access to planetary-scale sequencing data will forever change genetics🌎
#Bioinformatics #OpenScience #BigData #GannaNeedABiggerPipe
[7] The coolest part of open-source projects is teaming up with awesome devs who improve their tools too. We got a tailored version of SPAdes, coronaSPAdes (protip: you can use it for any RNA virus), and a significant boost in small-query alignment for DIAMOND v2. Stay tuned for MUSCLE v5!
[9] Serratus is a volunteer project. We started out at the #hacksqRNA hackathon (ty: @RNASociety / @UBC MedGen) and continue to have an open-door collaboration policy (cough*you should join*)

#hackathon #openscience
[10] We took part in COVID19 #bioHackathon, @EUvsVirus, @hackzurich, and @redhat Team19, sent out tweets, and emailed bioinformaticians and virologists. Eventually we got an amazing and passionate crew together. <3 <3
[11] Huge thanks to the long list of people who took the time to discuss, share insights, or just pop in for a few days to help. And to the team at @UBC #CIC and AWS who helped make this possible.

And of course, what matters most is the friends we made along the way…
[13] All Serratus data is free and public (cc0) immediately. Our goal is to catalyze research into Earth’s virome as intuitively as possible. Reach out if any help is needed :)

Data Explorer: serratus.io | Experimental RdRP interface: serratus.io/palmid

[Image: Frank and Ginger, Serratus mascots]

• • •

Thread by hackseq / Artem Babaian
