Today in @ScienceMagazine, we report a new DNA editing technology to seamlessly write massive changes into the right place in the human genome.
The reason gene editing hasn't transformed human health is that current gene editing technologies like CRISPR are very limited.
The problem with CRISPR is that it cuts up your DNA, and then hopes that unreliable cellular DNA repair will make the wanted edit. @geochurch famously called it genome vandalism. More precise versions of CRISPR only edit less than 100 bases - often only a single base. Therefore, it's not suited to make large changes safely.
However, most diseases are not the result of mutations in one location. Instead, their causes are spread all across the 3 billion base pairs in the genome.
We found bridge RNAs in bacterial “jumping genes” that allow us to make safe and arbitrary changes (insert, cut out, or flip) to every nucleotide within (up to) a 1 million bp sequence in your DNA.
In the paper, we show that we can correct the disease-causing DNA repeats that cause Friedreich's ataxia (which is a rare neurological disease). The same approach could be applied to Huntington’s and other repeat expansion disorders.
At @arcinstitute, we're working towards a full Turing machine for biology. Evo, our DNA foundation model, helps us design the optimal healthy DNA sequences. And Bridge recombination gives us the ability to seamlessly write these changes into the right place in the genome.
This work was a wonderful collaboration with my @arcinstitute cofounder @SKonermann and led by the indefatigable @ntperry13, alongside our amazing bridge editing team: @BartieLiam @dhruvakatrekar @Gabogonzalez515 @mgdurrant @james_jw_pai @AlisonFanton Juliana Martins Masa Hiraizumi @chiaroscurale @hnisimasu
Here's the paper itself. If you're interested in joining the team to turn this into a medicine, please reach out science.org/doi/10.1126/sc…
• • •
Missing some Tweet in this thread? You can try to
force a refresh
AI provides a universal framework that leverages data and compute at scale to uncover higher-order patterns
Today, @arcinstitute in collaboration with @nvidia releases Evo 2—a fully open source biological foundation model trained on genomes spanning the entire tree of life 🧵
Evo 2 is trained on 9.3T tokens of DNA with single-base resolution at 1M token context length 🥳
We release two models with 7B and 40B parameters along with weights, training and inference code, and pretraining data—making this one of the largest fully open AI models available
This enables it to reason about and understand biological interactions across diverse length scales, from individual molecules to entire bacterial genomes or eukaryotic chromosomes
🧵Epigenetics require the coordinated interplay of diverse proteins and mechanisms. To systematically explore this, we invented the COMBINE platform to quantify the effect of >50,000 pairs🧦of long epigenetic effectors on endogenous human transcription biorxiv.org/content/10.110…
COMBINE revealed diverse synergistic 🤝 and antagonistic 🤬 interactions between epigenetic protein domains, which we were able to study by calculating "synergy scores" for each individual pair and broader domain class (HDACs shown below)
Among many new and known interactions, we identified a potent KRAB-L3MBTL3 fusion that enhanced gene silencing up to 34-fold in dose-limited conditions (which are most in vivo therapeutic applications ‼️)...
What if we could universally recombine, insert, delete, or invert any two pieces of DNA?
In back-to-back @Nature papers, we report the discovery of bridge RNAs and 3 atomic structures of the first natural RNA-guided recombinase - a new mechanism for programmable genome design
See our previous thread on preprint here:
Bridge RNA recombinases encode a mechanistically new class of guide RNAs
They are bispecific, unlike the CRISPR (2012) or RNA interference (1998) guide RNAs that specify only 1 target for a simple scissors cut
In new collaborative work with @hnisimasu, we reveal the IS110 bridge mechanism in 3 distinct stages
The bispecific bridge RNA recruits two dimers of the IS110 recombinase, forming an elegant tetramer to exchange and religate 4 distinct DNA strands in a single step!
In new work, we report Evo, a genomic foundation model that learns across the fundamental languages of biology: DNA, RNA, and proteins. Evo is capable of both prediction tasks and generative design, from molecular to whole genome scale.
This was a wonderful collaboration with @BrianHie led by @exnx, @MichaelPoli6, and @mgdurrant
Evo is a 7B model that uses a context length of 131k tokens 👀 and is based on StripedHyena, a deep signal processing architecture designed to improve efficiency and quality over Transformers. It's trained at a single-nucleotide (byte) resolution on 2.7M whole prokaryotic genomes
Just shared at @KeystoneSymp a new @ArcInstitute discovery of the bridge RNA recombinase mechanism: a new class of natural RNA-guided systems that retains the key property of programmability from RNAi and CRISPR while enabling large-scale genome design beyond RNA and DNA cuts
This was a remarkable detective story 🕵️that starts with transposons. These mobile genetic elements (MGEs) jump around genomes 🧬 and play a central and ancient role in evolution and speciation. IS110 is a poorly understood family of MGEs with unusually long noncoding sequences!
We discovered that these noncoding flanks hide a noncoding RNA that directs the IS110 DNA element to integrate into its target site.
A striking computational analysis uncovered that this RNA acts as a "bridge" between target and donor DNA to enable programmable recombination!
yongzi ji for amazingly authentic HK-style wonton noodle soup. less than 10 things on the menu so you know it’s good. a hole in the wall run by an impossibly sweet older couple
pro-tip: add ~3:1 vinegar:soy sauce to taste
saap ver (“damn good” in Thai) is the best Thai place in the city. the ambiance is boisterous while the food comes correct: an “asian spicy” order puts you straight into a fragrant hot ones episode
don’t sleep on the green beans and tofu or crispy basil duck (or pork belly)
Food is my core competency so I will periodically add to this thread over time. I love ernest, san ho wan, flour and water, benu etc. too but I want to highlight the places you’re less likely to find via yelp or google maps. (think Q* search vs only next token prediction 😂)