What if we could universally recombine, insert, delete, or invert any two pieces of DNA?
In back-to-back @Nature papers, we report the discovery of bridge RNAs and 3 atomic structures of the first natural RNA-guided recombinase - a new mechanism for programmable genome design
See our previous thread on preprint here:
Bridge RNA recombinases encode a mechanistically new class of guide RNAs
They are bispecific, unlike the CRISPR (2012) or RNA interference (1998) guide RNAs that specify only 1 target for a simple scissors cut
In new collaborative work with @hnisimasu, we reveal the IS110 bridge mechanism in 3 distinct stages
The bispecific bridge RNA recruits two dimers of the IS110 recombinase, forming an elegant tetramer to exchange and religate 4 distinct DNA strands in a single step!
CRISPR creates DNA nicks/breaks and requires complex DNA repair pathways in the cell to make a genome edit (like an insertion or deletion) beyond a cut
Bridge recombination works in vitro, without relying on cellular DNA repair mechanisms. This could lead to safer genome edits
Cre-Lox recombination has been a foundational tool for molecular biology and mouse genetics since its discovery in 1981. However, it is not programmable.
Scientists have spent decades trying to engineer and reprogram Cre to recognize DNA sites other than the LoxP sequence
Unlike Cre, bridge recombination is fully programmable simply by changing the target binding loop (left) or donor binding loop (right)
These two loops can be modularly and separably reprogrammed
Inspired by Cre engineering, we show that reprogrammed bridge RNAs can direct insertion, excision, or inversion of DNA
This is a universal mechanism that makes the desired DNA rearrangement end-to-end in a single step
How does this enable large-scale genome design beyond editing individual bases or genes?
The experimental ability to universally rearrange/recombine DNA must be paired with the computational ability to design sequences at genome-scale
That's Evo, our AI foundation model with @BrianHie trained on genomes:
Together, I think these studies open up entirely new directions and we are excited to explore them through many collaborations
The bridge discovery was co-led by the amazing duo of @mgdurrant and @ntperry13 with important contributions from @SKonermann, @jjwp1011, @Adi_R_Jangid, @hnisimasu, Masa, @AprilPawluk, @JSAthukoralage, @johnmcspeedy
The structural mechanism studies were led by @hnisimasu and Masahiro Hiraizumi. Hiroshi and I first started working together over 10 years ago, on the first atomic structure of Cas9 bound to guide RNA and target DNA ()
One of the great pleasures of a life in science is long-term collaboration with wonderful friends and colleagues, and I'm grateful to continue teaming up with Hiroshi and also the brilliant @SKonermann (whom I've worked with for nearly 15 years now across dozens of papers and, of course, starting @arcinstitute)cell.com/fulltext/S0092…
This was a 2+ year detective story uniquely enabled by the @arcinstitute model, blending Arc's technical staff, PhD students from our partner universities, and hybrid computational & experimental science.
Matt, Nick, and the team have moved mountains to bring you this tour-de-force work, which combines ideas, scholarship, and methods across the fields of computational biology, genetics, biochemistry, molecular biology, microbiology, structural biology, and bioengineering - all in a single project.
Moving forward, I'm particularly interested in programming biology across multiple lengthscales, including tissues and physiology.
We're hiring across ML and experimental positions for folks interested in combining biology and AI. Come join us at @arcinstitute!
In new work, we report Evo, a genomic foundation model that learns across the fundamental languages of biology: DNA, RNA, and proteins. Evo is capable of both prediction tasks and generative design, from molecular to whole genome scale.
This was a wonderful collaboration with @BrianHie led by @exnx, @MichaelPoli6, and @mgdurrant
Evo is a 7B model that uses a context length of 131k tokens 👀 and is based on StripedHyena, a deep signal processing architecture designed to improve efficiency and quality over Transformers. It's trained at a single-nucleotide (byte) resolution on 2.7M whole prokaryotic genomes
Just shared at @KeystoneSymp a new @ArcInstitute discovery of the bridge RNA recombinase mechanism: a new class of natural RNA-guided systems that retains the key property of programmability from RNAi and CRISPR while enabling large-scale genome design beyond RNA and DNA cuts
This was a remarkable detective story 🕵️that starts with transposons. These mobile genetic elements (MGEs) jump around genomes 🧬 and play a central and ancient role in evolution and speciation. IS110 is a poorly understood family of MGEs with unusually long noncoding sequences!
We discovered that these noncoding flanks hide a noncoding RNA that directs the IS110 DNA element to integrate into its target site.
A striking computational analysis uncovered that this RNA acts as a "bridge" between target and donor DNA to enable programmable recombination!
yongzi ji for amazingly authentic HK-style wonton noodle soup. less than 10 things on the menu so you know it’s good. a hole in the wall run by an impossibly sweet older couple
pro-tip: add ~3:1 vinegar:soy sauce to taste
saap ver (“damn good” in Thai) is the best Thai place in the city. the ambiance is boisterous while the food comes correct: an “asian spicy” order puts you straight into a fragrant hot ones episode
don’t sleep on the green beans and tofu or crispy basil duck (or pork belly)
Food is my core competency so I will periodically add to this thread over time. I love ernest, san ho wan, flour and water, benu etc. too but I want to highlight the places you’re less likely to find via yelp or google maps. (think Q* search vs only next token prediction 😂)
Delighted to announce the Arc Institute @arcinstitute, a new, independent scientific institution dedicated to the study of complex human disease. Working with our partners @UCBerkeley@Stanford@UCSF, our mission is to accelerate progress in biomedical research and therapeutics.
Learn more about Arc's headquarters, Core Investigators, Technology Centers, Translation Program, and more in this blog post written with my co-founders @SKonermann and @PatrickC: arcinstitute.org/blog/introduci…
@arcinstitute's donors will contribute more than $650M to fund Arc investigators and their labs with complete freedom to pursue curiosity-driven research agendas. Our PIs may hold faculty positions @Stanford@UCBerkeley@UCSF, whose PhD students can also train in Arc labs.
The best coronavirus/SARS-CoV-2 biology lecture I have seen so far, by Britt Glaunsinger @UCBerkeley@berkeleyMCB@HHMINEWS@igisci. Covers where it comes from, how it gets into the cell, replicates in the host, and exploits the immune system against you.
An exonuclease (ExoN) is encoded in coronaviruses to enable their super large (30 kb) genomes. ExoN protects CoVs from inhibition by nucleoside analogs (e.g. Remdesivir @GileadSciences), suggesting a combo treatment w/ Remdesivir and an ExoN inhibitor could be effective
CoVs form these insane looking replication-transcription complexes in the host cell that are protected by double membrane vesicles. This shields the replicating virus from antiviral that could damage it and concentrates its transcriptional machinery so it can grow faster
German study on viral load kinetics and seroconversion timescale, out in @nature. Highlights huge time sensitivity of swab testing, viral loads in upper respiratory tract tank after ~5 days of symptoms (swabs are yellow line, orange is sputum).
Seroconversion is also slow! Only 50% after 7 days. This is important for applications of commercially available rapid serology tests, e.g. for back to work programs.
Lots of interesting data here. Viral sequencing indicates distinct genotypes in throat and sputum, indicating active viral replication in throat. Importantly, this study does not observe severe cases. This is a strength IMO, most papers have studied blazing hot patients