New programmable gene editing proteins found outside of CRISPR systems

Within the last decade, scientists have adapted CRISPR systems from microbes into gene editing technology, a precise and programmable system for modifying DNA. Now, scientists at MIT’s McGovern Institute and the Broad Institute of MIT and Harvard have discovered a new class of programmable DNA modifying systems called OMEGAs (Obligate Mobile Element Guided Activity), which may naturally be involved in shuffling small bits of DNA throughout bacterial genomes.

These ancient DNA-cutting enzymes are guided to their targets by small pieces of RNA. While they originated in bacteria, they have now  been engineered to work in human cells, suggesting they could be useful in the development of gene editing therapies, particularly as they are small (~30% the size of Cas9), making them easier to deliver to cells than bulkier enzymes. The discovery, reported September 9, 2021, in the journal Science, provides evidence that natural RNA-guided enzymes are among the most abundant proteins on earth, pointing toward a vast new area of biology that is poised to drive the next revolution in genome editing technology.

The research was led by McGovern Investigator Feng Zhang, who is the James and Patricia Poitras Professor of Neuroscience at MIT, a Howard Hughes Medical Institute investigator, and a Core Institute Member of the Broad Institute. Zhang’s team has been exploring natural diversity in search of new molecular systems that can be rationally programmed.

“We are super excited about the discovery of these widespread programmable enzymes, which have been hiding under our noses all along,” says Zhang. “These results suggest the tantalizing possibility that there are many more programmable systems that await discovery and development as useful technologies.”

Natural adaptation

Programmable enzymes, particularly those that use an RNA guide, can be rapidly adapted for different uses. For example, CRISPR enzymes naturally use an RNA guide to target viral invaders, but biologists can direct Cas9 to any target by generating their own RNA guide. “It’s so easy to just change a guide sequence and set a new target,” says graduate student and co-first author of the paper, Soumya Kannan. “So one of the broad questions that we’re interested in is trying to see if other natural systems use that same kind of mechanism.”

Zhang lab graduate student Han Altae-Tran, co-author of the Science paper with Soumya Kannan. Photo: Zhang lab

The first hints that OMEGA proteins might be directed by RNA came from the genes for proteins called IscBs. The IscBs are not involved in CRISPR immunity and were not known to associate with RNA, but they looked like small, DNA-cutting enzymes. The team discovered that each IscB had a small RNA encoded nearby and it directed IscB enzymes to cut specific DNA sequences. They named these RNAs “ωRNAs.”

The team’s experiments showed that two other classes of small proteins known as IsrBs and TnpBs, one of the most abundant genes in bacteria, also use ωRNAs that act as guides to direct the cleavage of DNA.

IscB, IsrB, and TnpB are found in mobile genetic elements called transposons. Graduate student Han Altae-Tran, co-first author on the paper, explains that each time these transposons move, they create a new guide RNA, allowing the enzyme they encode to cut somewhere else.

It’s not clear how bacteria benefit from this genomic shuffling—or whether they do at all.  Transposons are often thought of as selfish bits of DNA, concerned only with their own mobility and preservation, Kannan says. But if hosts can “co-opt” these systems and repurpose them, hosts may gain new abilities, as with CRISPR systems which confer adaptive immunity.

“A lot of the things that we have been thinking about may already exist naturally in some capacity,” says Altae-Tran.

IscBs and TnpBs appear to be predecessors of Cas9 and Cas12 CRISPR systems. The team suspects they, along with IsrB, likely gave rise to other RNA-guided enzymes, too—and they are eager to find them. They are curious about the range of functions that might be carried out in nature by RNA-guided enzymes, Kannan says, and suspect evolution likely already took advantage of OMEGA enzymes like IscBs and TnpBs to solve problems that biologists are keen to tackle.

Comparison of Ω (OMEGA) systems with other known RNA-guided systems. In contrast to CRISPR systems, which capture spacer sequences and store them in the locus within the CRISPR array, Ω systems may transpose their loci (or trans-acting loci) into target sequences, converting targets into ωRNA guides. Image courtesy of the researchers.

“A lot of the things that we have been thinking about may already exist naturally in some capacity,” says Altae-Tran. “Natural versions of these types of systems might be a good starting point to adapt for that particular task.”

The team is also interested in tracing the evolution of RNA-guided systems further into the past. “Finding all these new systems sheds light on how RNA-guided systems have evolved, but we don’t know where RNA-guided activity itself comes from,” Altae-Tran says. Understanding those origins, he says, could pave the way to developing even more classes of programmable tools.

This work was made possible with support from the Simons Center for the Social Brain at MIT; National Institutes of Health Intramural Research Program; National Institutes of Health grants 1R01-HG009761 and 1DP1-HL141201; Howard Hughes Medical Institute; Open Philanthropy; G. Harold and Leila Y. Mathers Charitable Foundation; Edward Mallinckrodt, Jr. Foundation; Poitras Center for Psychiatric Disorders Research at MIT; Hock E. Tan and K. Lisa Yang Center for Autism Research at MIT; Yang-Tan Center for Molecular Therapeutics at MIT; Lisa Yang; Phillips family; R. Metcalfe; and J. and P. Poitras.

RNA-targeting enzyme expands the CRISPR toolkit

Researchers at MIT’s McGovern Institute have discovered a bacterial enzyme that they say could expand scientists’ CRISPR toolkit, making it easy to cut and edit RNA with the kind of precision that, until now, has only been available for DNA editing. The enzyme, called Cas7-11, modifies RNA targets without harming cells, suggesting that in addition to being a valuable research tool, it provides a fertile platform for therapeutic applications.

“This new enzyme is like the Cas9 of RNA,” says McGovern Fellow Omar Abudayyeh, referring to the DNA-cutting CRISPR enzyme that has revolutionized modern biology by making DNA editing fast, inexpensive, and exact. “It creates two precise cuts and doesn’t destroy the cell in the process like other enzymes,” he adds.

Up until now, only one other family of RNA-targeting enzymes, Cas13, has extensively been developed for RNA targeting applications. However, when Cas13 recognizes its target, it shreds any RNAs in the cell, destroying the cell along the way. Like Cas9, Cas7-11 is part of a programmable system; it can be directed at specific RNA targets using a CRISPR guide. Abudayyeh, McGovern fellow Jonathan Gootenberg, and their colleagues discovered Cas7-11 through a deep exploration of the CRISPR systems found in the microbial world. Their findings are reported today in the journal Nature.

Exploring natural diversity

DNA tools in the CRISPR toolkit (red) are approaching capacity, but researchers are now beginning to find new tools to edit RNA (blue). Image: Steven Dixon

Like other CRISPR proteins, Cas7-11 is used by bacteria as a defense mechanism against viruses. After encountering a new virus, bacteria that employ the CRISPR system keep a record of the infection in the form of a small snippet of the pathogen’s genetic material. Should that virus reappear, the CRISPR system is activated, guided by a small piece of RNA to destroy the viral genome and eliminate the infection.

These ancient immune systems are widespread and diverse, with different bacteria deploying different proteins to counter their viral invaders.

“Some target DNA, some target RNA. Some are very efficient in cleaving the target but have some toxicity, and others do not. They introduce different types of cuts, they can differ in specificity—and so on,” says Eugene Koonin, an evolutionary biologist at the National Center for Biotechnology Information.

Abudayyeh, Gootenberg, and Koonin have been scouring genome sequences to learn about the natural diversity of CRISPR systems—and to mine them for potential tools. The idea, Abudayyeh says, is to take advantage of the work that evolution has already done in engineering protein machines.

“We don’t know what we’ll find,” Abudayyeh says, “but let’s just explore and see what’s out there.”

As the team was poring through public databases to examine the components of different bacterial defense systems, a protein from a bacterium that had been isolated from Tokyo Bay caught their attention. Its amino acid sequence indicated that it belonged to a class of CRISPR systems that use large, multiprotein machines to find and cleave their targets. But this protein appeared to have everything it needed to carry out the job on its own. Other known single-protein Cas enzymes, including the Cas9 protein that has been widely adopted for DNA editing, belong to a separate class of CRISPR systems—but Cas7-11 blurs the boundaries of the CRISPR classification system, Koonin says.

The enzyme, which the team eventually named Cas7-11, was attractive from an engineering perspective, because single proteins are easier to deliver to cells and make better tools than their complex counterparts. But its composition also signaled an unexpected evolutionary history. The team found evidence that through evolution, the components of a more complex Cas machine had fused together to make the Cas7-11 protein. Gootenberg equates this to discovering a bat when you had previously assumed that birds are the only animals that fly, thereby recognizing that there are multiple evolutionary paths to flight. “It totally changes the landscape of how these systems are thought about, both functionally and evolutionarily,” he says.

Precision editing

McGovern Fellows Jonathan Gootenberg and Omar Abudayyeh in their lab. Photo: Caitlin Cunningham

When Gootenberg and Abudayyeh produced the Cas7-11 protein in their lab and began experimenting with it, they realized this unusual enzyme offered a powerful means to manipulate and study RNA. When they introduced it into cells along with an RNA guide, it made remarkably precise cuts, snipping its targets while leaving other RNA undisturbed. This meant they could use Cas7-11 to change specific letters in the RNA code, correcting errors introduced by genetic mutations. They were also able to program Cas7-11 to either stabilize or destroy particular RNA molecules inside cells, which gave them the ability to adjust the levels of the proteins encoded by those RNAs.

Abudayyeh and Gootenberg also found that Cas7-11’s ability to cut RNA could be dampened by a protein that appeared likely to also be involved in triggering programmed cell death, suggesting a possible link between CRISPR defense and a more extreme response to infection.

The team showed that a gene therapy vector can deliver the complete Cas7-11 editing system to cells and that Cas7-11 does not compromise cells’ health. They hope that with further development, the enzyme might one day be used to edit disease-causing sequences out of a patient’s RNA so their cells can produce healthy proteins, or to dial down the level of a protein that is doing harm due to genetic disease.

“We think that the unique way that Cas7-11 cuts enables many interesting and diverse applications,” Gootenberg says, noting that no other CRISPR tool cuts RNA so precisely. “It’s yet another great example of how these basic-biology driven explorations can yield new tools for therapeutics and diagnostics,” he adds. “And we’re certainly still just scratching the surface of what’s out there in natural diversity.”

Scientists harness human protein to deliver molecular medicines to cells

Researchers from MIT, the McGovern Institute for Brain Research at MIT, the Howard Hughes Medical Institute, and the Broad Institute of MIT and Harvard have developed a new way to deliver molecular therapies to cells. The system, called SEND, can be programmed to encapsulate and deliver different RNA cargoes. SEND harnesses natural proteins in the body that form virus-like particles and bind RNA, and it may provoke less of an immune response than other delivery approaches.

The new delivery platform works efficiently in cell models, and, with further development, could open up a new class of delivery methods for a wide range of molecular medicines — including those for gene editing and gene replacement. Existing delivery vehicles for these therapeutics can be inefficient and randomly integrate into the genome of cells, and some can stimulate unwanted immune reactions. SEND has the promise to overcome these limitations, which could open up new opportunities to deploy molecular medicine.

“The biomedical community has been developing powerful molecular therapeutics, but delivering them to cells in a precise and efficient way is challenging,” said CRISPR pioneer Feng Zhang, senior author on the study, core institute member at the Broad Institute, investigator at the McGovern Institute, and the James and Patricia Poitras Professor of Neuroscience at MIT. “SEND has the potential to overcome these challenges.” Zhang is also an investigator at the Howard Hughes Medical Institute and a professor in MIT’s Departments of Brain and Cognitive Sciences and Biological Engineering.

SEND packages are introduced to diseased cells to deliver therapeutic mRNA and restore health. Image: McGovern Institute

Reporting in Science, the team describes how SEND (Selective Endogenous eNcapsidation for cellular Delivery) takes advantage of molecules made by human cells. At the center of SEND is a protein called PEG10, which normally binds to its own mRNA and forms a spherical protective capsule around it. In their study, the team engineered PEG10 to selectively package and deliver other RNA. The scientists used SEND to deliver the CRISPR-Cas9 gene editing system to mouse and human cells to edit targeted genes.

First author Michael Segel, a postdoctoral researcher in Zhang’s lab, and Blake Lash, second author and a graduate student in the lab, said PEG10 is not unique in its ability to transfer RNA. “That’s what’s so exciting,” said Segel. “This study shows that there are probably other RNA transfer systems in the human body that can also be harnessed for therapeutic purposes. It also raises some really fascinating questions about what the natural roles of these proteins might be.”

Inspiration from within

The PEG10 protein exists naturally in humans and is derived from a “retrotransposon” — a virus-like genetic element — that integrated itself into the genome of human ancestors millions of years ago. Over time, PEG10 has been co-opted by the body to become part of the repertoire of proteins important for life.

Four years ago, researchers showed that another retrotransposon-derived protein, ARC, forms virus-like structures and is involved in transferring RNA between cells. Although these studies suggested that it might be possible to engineer retrotransposon proteins as a delivery platform, scientists had not successfully harnessed these proteins to package and deliver specific RNA cargoes in mammalian cells.

Knowing that some retrotransposon-derived proteins are able to bind and package molecular cargo, Zhang’s team turned to these proteins as possible delivery vehicles. They systematically searched through these proteins in the human genome for ones that could form protective capsules. In their initial analysis, the team found 48 human genes encoding proteins that might have that ability. Of these, 19 candidate proteins were present in both mice and humans. In the cell line the team studied, PEG10 stood out as an efficient shuttle; the cells released significantly more PEG10 particles than any other protein tested. The PEG10 particles also mostly contained their own mRNA, suggesting that PEG10 might be able to package specific RNA molecules.

Developing a modular system

To develop the SEND technology, the team identified the molecular sequences, or “signals,” in PEG10’s mRNA that PEG10 recognizes and uses to package its mRNA. The researchers then used these signals to engineer both PEG10 and other RNA cargo so that PEG10 could selectively package those RNAs. Next, the team decorated the PEG10 capsules with additional proteins, called “fusogens,” that are found on the surface of cells and help them fuse together.

By engineering the fusogens on the PEG10 capsules, researchers should be able to target the capsule to a particular kind of cell, tissue, or organ. As a first step towards this goal, the team used two different fusogens, including one found in the human body, to enable delivery of SEND cargo.

“By mixing and matching different components in the SEND system, we believe that it will provide a modular platform for developing therapeutics for different diseases,” said Zhang.

Advancing gene therapy

SEND is composed of proteins that are produced naturally in the body, which means it may not trigger an immune response. If this is demonstrated in further studies, the researchers say SEND could open up opportunities to deliver gene therapies repeatedly with minimal side effects. “The SEND technology will complement viral delivery vectors and lipid nanoparticles to further expand the toolbox of ways to deliver gene and editing therapies to cells,” said Lash.

Next, the team will test SEND in animals and further engineer the system to deliver cargo to a variety of tissues and cells. They will also continue to probe the natural diversity of these systems in the human body to identify other components that can be added to the SEND platform.

“We’re excited to keep pushing this approach forward,” said Zhang. “The realization that we can use PEG10, and most likely other proteins, to engineer a delivery pathway in the human body to package and deliver new RNA and other potential therapies is a really powerful concept.”

This work was made possible with support from the Simons Center for the Social Brain at MIT; National Institutes of Health Intramural Research Program; National Institutes of Health grants 1R01-HG009761 and 1DP1-HL141201; Howard Hughes Medical Institute; Open Philanthropy; G. Harold and Leila Y. Mathers Charitable Foundation; Edward Mallinckrodt, Jr. Foundation; Poitras Center for Psychiatric Disorders Research at MIT; Hock E. Tan and K. Lisa Yang Center for Autism Research at MIT; Yang-Tan Center for Molecular Therapeutics at MIT; Lisa Yang; Phillips family; R. Metcalfe; and J. and P. Poitras.

Exploring the unknown

View the interactive version of this story in our Summer 2021 issue of BrainScan.

 

McGovern Investigator Ed Boyden.

McGovern Investigator Ed Boyden says his lab’s vision is clear.

“We want to understand how our brains take our sensory inputs, generate emotions and memories and decisions, and ultimately result in motor outputs. We want to be able to see the building blocks of life, and how they go into disarray in brain diseases. We want to be able to control the signals of the brain, so we can repair it,” Boyden says.

To get there, he and his team are exploring the brain’s complexity at every scale, from the function and architecture of its neural networks to the molecules that work together to process information.

And when they don’t have the tools to take them where they want to go, they create them, opening new frontiers for neuroscientists everywhere.

Open to discovery

Boyden’s team is highly interdisciplinary and collaborative. Its specialty, Boyden says, is problem solving. Creativity, adaptability, and deep curiosity are essential, because while many of neuroscience’s challenges are clear, the best way to address them is not. In its search for answers, Boyden’s lab is betting that an important path to discovery begins with finding new ways to explore.

They’ve made that possible with an innovative imaging approach called expansion microscopy (ExM). ExM physically enlarges biological samples so that minute details become visible under a standard laboratory microscope, enabling researchers everywhere to peer into spaces that once went unseen (see video below).

To use the technique, researchers permeate a biological sample with an absorbent gel, then add water, causing the components of the gel to spread apart and the tissue to expand.

This year, postdoctoral researcher Ruixuan Gao and graduate student Chih-Chieh (Jay) Yu made the method more precise, with a new material that anchors a sample’s molecules within a crystal-like lattice, better preserving structure during expansion than the irregular mesh-like composition of the original gel. The advance is an important step toward being able to image expanded samples with single-molecule precision, Gao says.

A revealing look

By opening space within the brain, ExM has let Boyden’s team venture into those spaces in new ways.

Areas of research and brain disorders page
Graduate student Oz Wassie examines expanded brain tissue. Photo: Justin Knight

In work led by Deblina Sarkar (who is now an assistant professor at MIT’s Media Lab), Jinyoung Kang, and Asmamaw (Oz) Wassie, they showed that they can pull apart proteins in densely packed regions like synapses so that it is easier to introduce fluorescent labels, illuminating proteins that were once too crowded to see. The process, called expansion revealing, has made it possible to visualize in intact brain tissue important structures such as ion channels that help transmit signals and fine-scale amyloid clusters in Alzheimer’s model mice.

Another reaction the lab has adapted to the expanded-brain context is RNA sequencing—an important tool for understanding cellular diversity. “Typically, the first thing you do in a sequencing project is you grind up the tissue, and you lose the spatial dimension,” explains Daniel Goodwin, a graduate student in Boyden’s lab. But when sequencing reactions are performed inside cells instead, new information is revealed.

Confocal image showing targeted ExSeq of a 34-panel gene set across a slice of mouse hippocampus. Green indicates YFP, magenta indicates reads identified with ExSeq, and white indicates reads localized within YFP-expressing cells. Image courtesy of the researchers.

Goodwin and fellow Boyden lab members Shahar Alon, Anubhav Sinha, Oz Wassie, and Fei Chen developed expansion sequencing (ExSeq), which copies RNA molecules, nucleotide by nucleotide, directly inside expanded tissue, using fluorescent labels that spell out the molecules’ codes just as they would in a sequencer.

The approach shows researchers which genes are turned on in which cells, as well as where those RNA molecules are—revealing, for example, which genes are active in the neuronal projections that carry out the brain’s communications. A next step, Sinha says, is to integrate expansion sequencing with other technologies to obtain even deeper insights.

That might include combining information revealed with ExSeq with a topographical map of the same cells’ genomes, using a method Boyden’s lab and collaborators Chen (who is now a core member of the Broad Institute) and Jason Buenrostro at Harvard have developed for DNA sequencing. That information is important because the shape of the genome varies across cells and circumstances, and that has consequences for how the genetic code is used.

Using similar techniques to those that make ExSeq possible, graduate students Andrew Payne, Zachary Chiang, and Paul Reginato figured out how to recreate the steps of commercial DNA sequencing within the genome’s natural environment.

By pinpointing the location of specific DNA sequences inside cells, the new method, called in situ genome sequencing (IGS) allows researchers to watch a genome reorganize itself in a developing embryo.

They haven’t yet performed this analysis inside expanded tissue, but Payne says integrating in situ genome sequencing (IGS) with ExM should open up new opportunities to study genomes’ structure.

Signaling clusters

Alongside these efforts, Boyden’s team is working to give researchers better tools to explore how molecules move, change, and interact, including a modular system that lets users assemble sets of sensors into clusters to simultaneously monitor multiple cellular activities.

Molecular sensors use fluorescence to report on certain changes inside cells, such as the calcium that surges into a neuron after it fires. But they come in a limited palette, so in most experiments only one or two things can be seen at once.

Graduate student Shannon Johnson and postdoctoral fellow Changyang Linghu solved this problem by putting different sensors at different points throughout a cell so they can report on different signals. Their technique, called spatial multiplexing, links sensors to molecular scaffolds designed to cling to their own kind. Sensors built on the same scaffold form islands inside cells, so when they light up their signals are distinct from those produced by other sensor islands.

Eventually, as new sensors and scaffolds become available, Johnson says the technique might be used to simultaneously follow dozens of molecular signals in living cells. The more precise information they can help people uncover, the better, Boyden says.

“The brain is so full of surprises, we don’t know where the next big discovery will come from,” he says. With new support from the recently established K. Lisa Yang and Hock E. Tan Center for Molecular Therapeutics in Neuroscience, the Boyden lab is positioned to make these big discoveries.

“My dream would be to image the signaling dynamics of the brain, and then perturb the dynamics, and then use expansion methods to make a map of the brain. If we can get those three data sets—the dynamics, the causality, and the molecular organization—I think stitching those together could potentially yield deep insights into how the brain works, and how we can repair it in disease states.”

New technique corrects disease-causing mutations

Gene editing, or purposefully changing a gene’s DNA sequence, is a powerful tool for studying how mutations cause disease, and for making changes in an individual’s DNA for therapeutic purposes. A novel method of gene editing that can be used for both purposes has now been developed by a team led by Guoping Feng, the James W. (1963) and Patricia T. Poitras Professor in Brain and Cognitive Sciences at MIT.

“This technical advance can accelerate the production of disease models in animals and, critically, opens up a brand-new methodology for correcting disease-causing mutations,” says Feng, who is also a member of the Broad Institute of Harvard and MIT and the associate director of the McGovern Institute for Brain Research at MIT. The new findings publish online May 26 and in print June 10 in the journal Cell.

Genetic models of disease

A major goal of the Feng lab is to precisely define what goes wrong in neurodevelopmental and neuropsychiatric disorders by engineering animal models that carry the gene mutations that cause these disorders in humans. New models can be generated by injecting embryos with gene editing tools, along with a piece of DNA carrying the desired mutation.

In one such method, the gene editing tool CRISPR is programmed to cut a targeted gene, thereby activating natural DNA mechanisms that “repair” the broken gene with the injected template DNA. The engineered cells are then used to generate offspring capable of passing the genetic change on to further generations, creating a stable genetic line in which the disease, and therapies, are tested.

Although CRISPR has accelerated the process of generating such disease models, the process can still take months or years. Reasons for the inefficiency are that many treated cells do not undergo the desired DNA sequence change at all, and the change only occurs on one of the two gene copies (for most genes, each cell contains two versions, one from the father and one from the mother).

In an effort to increase the efficiency of the gene editing process, the Feng lab team initially hypothesized that adding a DNA repair protein called RAD51 to a standard mixture of CRISPR gene editing tools would increase the chances that a cell (in this case a fertilized mouse egg, or one-cell embryo) would undergo the desired genetic change.

As a test case, they measured the rate at which they were able to insert (“knock-in”) a mutation in the gene Chd2 that is associated with autism.  The overall proportion of embryos that were correctly edited remained unchanged, but to their surprise, a significantly higher percentage carried the desired gene edit on both chromosomes. Tests with a different gene yielded the same unexpected outcome.

“Editing of both chromosomes simultaneously is normally very uncommon,” explains postdoctoral fellow Jonathan Wilde.  “The high rate of editing seen with RAD51 was really striking and what started as a simple attempt to make mutant Chd2 mice quickly turned into a much bigger project focused on RAD51 and its applications in genome editing,” said Wilde, who co-authored the Cell paper with research scientist Tomomi Aida.

A molecular copy machine

The Feng lab team next set out to understand the mechanism by which RAD51 enhances gene editing. They hypothesized that RAD51 engages a process called interhomolog repair (IHR), whereby a DNA break on one chromosome is repaired using the second copy of the chromosome (from the other parent) as the template.

To test this, they injected mouse embryos with RAD51 and CRISPR but left out the template DNA. They programmed CRISPR to cut only the gene sequence on one of the chromosomes, and then tested whether it was repaired to match the sequence on the uncut chromosome. For this experiment, they had to use mice in which the sequences on the maternal and paternal chromosomes were different.

They found that control embryos injected with CRISPR alone rarely showed IHR repair. However, addition of RAD51 significantly increased the number of embryos in which the CRISPR-targeted gene was edited to match the uncut chromosome.

“Previous studies of IHR found that it is incredibly inefficient in most cells,” says Wilde. “Our finding that it occurs much more readily in embryonic cells and can be enhanced by RAD51 suggest that a deeper understanding of what makes the embryo permissive to this type of DNA repair could help us design safer and more efficient gene therapies.”

A new way to correct disease-causing mutations          

Standard gene therapy strategies that rely on injecting a corrective piece of DNA to serve as a template for repairing the mutation engage a process called homology-directed repair (HDR).

“HDR-based strategies still suffer from low efficiency and carry the risk of unwanted integration of donor DNA throughout the genome,” explains Feng. “IHR has the potential to overcome these problems because it relies upon natural cellular pathways and the patient’s own normal chromosome for correction of the deleterious mutation.”

Feng’s team went on to identify additional DNA repair-associated proteins that can stimulate IHR, including several that not only promote high levels of IHR, but also repress errors in the DNA repair process. Additional experiments that allowed the team to examine the genomic features of IHR events gave deeper insight into the mechanism of IHR and suggested ways that the technique can be used to make gene therapies safer.

“While there is still a great deal to learn about this new application of IHR, our findings are the foundation for a new gene therapy approach that could help solve some of the big problems with current approaches,” says Aida.

This study was supported by the Hock E. Tan and K. Lisa Yang Center for Autism Research at MIT, the Poitras Center for Psychiatric Disorders Research at MIT, NIH/NIMH Conte Center Grant (P50 MH094271) and NIH Office of the Director (U24 OD026638).

A high-resolution glimpse of gene expression in cells

Using a novel technique for expanding tissue, MIT and Harvard Medical School researchers have devised a way to label individual molecules of messenger RNA within a tissue sample and then sequence the RNA.

This approach offers a unique snapshot of which genes are being expressed in different parts of a cell, and could allow scientists to learn much more about how gene expression is influenced by a cell’s location or its interactions with nearby cells. The technique could also be useful for mapping cells in the brain or other tissues and classifying them according to their function.

“Gene expression is one of the most fundamental processes in all of biology, and it plays roles in all biological processes, both healthy and disease-related. However, you need to know more than just whether a gene is on or off,” says Ed Boyden, the Y. Eva Tan Professor in Neurotechnology and a professor of biological engineering, media arts and sciences, and brain and cognitive sciences at MIT.

“You want to know where the gene products are located. You care what cell types they’re in, which individual cells they play roles in, and even which parts of cells they work in,” says Boyden.

In a study appearing today in Science, the researchers showed that they could use this technique to locate and then sequence thousands of different messenger RNA molecules within the mouse brain and in human tumor samples.

The senior authors of the study are Boyden, an investigator at the MIT McGovern Institute and the Howard Hughes Medical Institute; George Church, a professor of genetics at Harvard Medical School; and Adam Marblestone, a former MIT research scientist. The paper’s lead authors are Shahar Alon, a former MIT postdoc who is now a senior lecturer at Bar-Ilan University; Daniel Goodwin, an MIT graduate student; Anubhav Sinha ’14 MNG ’15, an MIT graduate student; Asmamaw Wassie ’12, PhD ’19; and Fei Chen PhD ’17, who is an assistant professor of stem cell and regenerative biology at Harvard University and a member of the Broad Institute of MIT and Harvard.

Tissue expansion

The new sequencing technique builds on a method that Boyden’s group devised in 2015 for expanding tissue samples and then imaging them. By embedding water-absorbent polymers into a tissue sample, researchers can swell the tissue sample while keeping its overall organization intact. Using this approach, tissues can be expanded by a factor of 100 or more, allowing scientists to obtain very high-resolution images of the brain or other tissues using a regular light microscope.

In 2014, Church’s lab developed an RNA sequencing technique known as FISSEQ (fluorescent in situ sequencing), which allows thousands of mRNA molecules to be located and sequenced within cells grown in a lab dish. The Boyden and Church labs decided to join forces to combine tissue expansion and in situ RNA sequencing, creating a new technique they call expansion sequencing (ExSeq).

Expanding the tissue before performing RNA sequencing has two main benefits: It offers a higher-resolution look at the RNA in cells, and it makes it easier to sequence those RNA molecules. “When you separate these molecules in the expanding sample, and move them away from each other, that gives you more room to actually perform the chemical reactions of in situ sequencing,” Marblestone says.

Once the tissue is expanded, the researchers can label and sequence thousands of RNA molecules in a sample, at a resolution that allows them to pinpoint the molecules’ locations not only within cells but within specific compartments such as dendrites — the tiny extensions of neurons that receive communications from other neurons.

“We know that the location of RNA in these small regions is important for learning and memory, but until now, we didn’t have any way to measure these locations because they are very small, on the order of nanometers,” Alon says.

Using an “untargeted” version of this technique, meaning that they are not looking for specific RNA sequences, the researchers can turn up thousands of different sequences. They estimate that in a given sample, they can sequence between 20 and 50 percent of all of the genes present.

In the mouse hippocampus, this technique yielded some surprising results. For one, the researchers found mRNA containing introns, which are sections of RNA that are normally edited out of mRNA in the nucleus, in dendrites. They also discovered mRNA molecules encoding transcription factors in the dendrites, which may help with novel forms of dendrite-to-nucleus communication.

“These are just examples of things that we never would have gone looking for intentionally, but now that we can sequence RNA exactly where it is in the neuron, we’re able to explore a lot more biology,” Goodwin says.

Cellular interactions

The researchers also showed that they could explore gene expression in a more targeted way, looking for a specific set of RNA sequences that correspond to genes of interest. In the visual cortex of the mouse, the researchers used this approach to classify neurons into different types based on an analysis of 42 different genes that they express.

In situ sequencing of physically expanded specimens enables multiplexed mapping of RNAs at nanoscale, subcellular resolution throughout intact tissues. Top: schematics of physical expansion and in situ sequencing (left), and image analysis (right). Bottom: characterization of nanoscale transcriptomic compartmentalization in mouse hippocampal neuron dendrites and spines (left, middle), and maps of cell types and states in a metastatic human breast cancer biopsy (right). Image courtesy of the researchers.

This technology could also be useful to analyze many other kinds of tissues, such as tumor biopsies. In this paper, the researchers studied breast cancer metastases, which contain many different cell types, including cancer cells and immune cells. The study revealed that these cell types can behave differently depending on their location within a tumor. For example, the researchers found that B cells that were near tumor cells expressed certain inflammatory genes at a higher level than B cells that were farther from tumor cells.

“The tumor microenvironment has been studied in many different contexts for a long time, but it’s been difficult to study it with any depth,” Sinha says. “A cancer biologist can give you a list of 20 or 30 marker genes that will identify most of the cell types in the tissue. Here, since we interrogated 297 different RNA transcripts in the sample, we can ask and answer more detailed questions about gene expression.”

The researchers now plan to further study the interactions between cancer cells and immune cells, as well as gene expression in the brain in healthy and disease states. They also plan to extend their techniques to allow them to map additional types of biomolecules, such as proteins, alongside RNA.

The research was funded, in part, by the National Institutes of Health and the National Science Foundation, as well as by Lisa Yang, John Doerr, the Open Philanthropy Project, Cancer Research UK, the Chan Zuckerberg Initiative Human Cell Atlas pilot program, and HHMI.

Four MIT scientists honored with 2021 National Academy of Sciences awards

Four MIT scientists are among the 20 recipients of the 2021 Academy Honors for major contributions to science, the National Academy of Sciences (NAS) announced at its annual meeting. The individuals are recognized for their “extraordinary scientific achievements in a wide range of fields spanning the physical, biological, social, and medical sciences.”

The awards recognize: Pablo Jarillo-Herrero, for contributions to the fields of nanoscience and nanotechnology through his discovery of correlated insulator behavior and unconventional superconductivity in magic-angle graphene superlattices; Aviv Regev, for using interdisciplinary information or techniques to solve a contemporary challenge; Susan Solomon, for contributions to understanding and communicating the causes of ozone depletion and climate change; and Feng Zhang, for pioneering achievements developing CRISPR tools with the potential to diagnose and treat disease.

Pablo Jarillo-Herrero: Award for Scientific Discovery

Pablo Jarillo-Herrero, a Cecil and Ida Green Professor of Physics, is the recipient of the NAS Award for Scientific Discovery for his pioneering developments in nanoscience and nanotechnology, which is presented to scientists in the fields of astronomy, materials science, or physics. His findings expand nanoscience by demonstrating for the first time that orientation can be used to dramatically control nanomaterial properties and to design new nanomaterials. This work lays the groundwork for developing a whole new family of 2D materials and has had a transformative impact on the field and on condensed-matter physics.

The biannual award recognizes “an accomplishment or discovery in basic research, achieved within the previous five years, that is expected to have a significant impact on one or more of the following fields: astronomy, biochemistry, biophysics, chemistry, materials science, or physics.”

In 2018, his research group discovered that by rotating two layers of graphene relative to each other by a magic angle, the bilayer material can be turned from a metal into an electrical insulator or even a superconductor. This discovery has fostered new theoretical and experimental research, inspiring the interest of technologists in nanoelectronics. The result is a new field in condensed-matter physics that has the potential to result in materials that conduct electricity without resistance at room temperature.

Aviv Regev: James Prize in Science and Technology Integration

Aviv Regev, who is a professor of biology, a core member of the Broad Institute of Harvard and MIT, a member of the Koch Institute, and a Howard Hughes Medical Institute investigator has been selected for the inaugural James Prize in Science and Technology Integration, along with Harvard Medical School Professor Allon Kelin, for “their concurrent development of now widely adopted massively parallel single-cell genomics to interrogate the gene expression profiles that define, at the level of individual cells, the distinct cell types in metazoan tissues, their developmental trajectories, and disease states, which integrated tools from molecular biology, engineering, statistics, and computer science.”

The prize recognizes individuals “who are able to adopt or adapt information or techniques from outside their fields” to “solve a major contemporary challenge not addressable from a single disciplinary perspective.”

Regev is credited with forging new ways to unite the disciplines of biology, computational science, and engineering as a pioneer in the field of single-cell biology, including developing some of its core experimental and analysis tools, and their application to discover cell types, states, programs, environmental responses, development, tissue locations, and regulatory circuits, and deploying these to assemble cellular atlases of the human body that illuminate mechanisms of disease with remarkable fidelity.

Susan Solomon: Award for Chemistry in Service to Society

Susan Solomon, the Lee and Geraldine Martin Professor of Environmental Studies in the Department of Earth, Atmospheric and Planetary Sciences who holds a secondary appointment in the Department of Chemistry, is the recipient of the Award for Chemistry in Service to Society for “influential and incisive application of atmospheric chemistry to understand our most critical environmental issues — ozone layer depletion and climate change — and for her effective communication of environmental science to leaders to facilitate policy changes.”

The award is given biannually for “contributions to chemistry, either in fundamental science or its application, that clearly satisfy a societal need.”

Solomon is globally recognized as a leader in atmospheric science, notably for her insights in explaining the cause of the Antarctic ozone “hole.” She and her colleagues have made important contributions to understanding chemistry-climate coupling, including pioneering research on the irreversibility of global warming linked to anthropogenic carbon dioxide emissions, and on the influence of the ozone hole on the climate of the southern hemisphere.

Her work has had an enormous effect on policy and society, including the transition away from ozone-depleting substances and to environmentally benign chemicals. The work set the stage for the Paris Agreement on climate, and she continues to educate policymakers, the public, and the next generation of scientists.

Feng Zhang: Richard Lounsbery Award

Feng Zhang, who is the James and Patricia Poitras Professor of Neuroscience at MIT, an investigator at the McGovern Institute for Brain Research and the Howard Hughes Medical Institute, a professor of brain and cognitive sciences and biological engineering at MIT, and a core member of the Broad Institute of MIT and Harvard, is the recipient of the Richard Lounsbery Award for pioneering CRISPR-mediated genome editing.

The award recognizes “extraordinary scientific achievement in biology and medicine” as well as stimulating research and encouraging reciprocal scientific exchanges between the United States and France.

Zhang continues to lead the field through the discovery of novel CRISPR systems and their development as molecular tools with the potential to diagnose and treat disease, such as disorders affecting the nervous system. His contributions in genome engineering, as well as his earlier work developing optogenetics, are enabling a deeper understanding of behavioral neural circuits and advances in gene therapy for treating disease.

In addition, Zhang has championed the open sharing of the technologies he has developed through extensive resource sharing. The tools from his lab are being used by thousands of scientists around the world to accelerate research in nearly every field of the life sciences. Even as biomedical researchers around the world adopt Zhang’s discoveries and his tools enter the clinic to treat genetic diseases, he continues to innovate and develop new technologies to advance science.

The National Academy of Sciences is a private, nonprofit society of distinguished scholars, established in 1863 by the U.S. Congress. The NAS is charged with providing independent, objective advice to the nation on matters related to science and technology as well as encouraging education and research, recognize outstanding contributions to knowledge, and increasing public understanding in matters of science, engineering, and medicine. Winners received their awards, which include a monetary prize, during a virtual ceremony at the 158th NAS Annual Meeting.

This story is a modified compilation from several National Academy of Sciences press releases.

Sequencing inside cells

By bringing DNA sequencing out of the sequencer and directly to cells, MIT scientists have revealed an entirely new view of the genome. With a new method for in situ genome sequencing reported December 31, 2020, in the journal Science, researchers can, for the first time, see exactly how DNA sequences are organized and packed inside cells.

The approach, whose development was led by Ed Boyden, the Y. Eva Tan Professor in Neurotechnology at MIT, and Harvard University Stem Cell and Regenerative Biology faculty members Jason Buenrostro and Fei Chen, integrates DNA sequencing technology with microscopy to pinpoint exactly where specific DNA sequences are located inside intact cells.

While alternative methods allow scientists to reconstruct structural information about the genome, this is the first sequencing technology to give scientists a direct look.

The technology creates new opportunities to investigate a broad range of biology, from fundamental questions about how DNA’s three-dimensional organization affects its function to the structural changes and chromosomal rearrangements associated with aging, cancer, brain disorders, and other diseases.

Seeing is believing

“How structure yields function is one of the core themes of biology,” says Boyden, who is also an investigator at the McGovern Institute and the Howard Hughes Medical Institute.“And the history of biology tells us that when you can actually see something, you can make lots of advances.” Seeing how an organism’s genome is packed inside its cells could help explain how different cell types in the brain interpret the genetic code, or reveal structural patterns that mean the difference between health and disease, he says. Additionally, the researchers note, the technique also makes it possible to directly see how proteins and other factors interact with specific parts of the genome.

The new method builds on work underway in Boyden and Chen’s laboratories focused on sequencing RNA inside cells. Buenrostro collaborated with Boyden and Chen, who is also a core member of the Broad Institute, to adapt the technique for use with DNA. “It was clear the technology they had developed would be an extraordinary opportunity to have a new perspective on cells’ genomes,” Boyden says.

Their approach begins by fixing cells onto a glass surface to preserve their structure. Then, after inserting small DNA adapters into the genome, thousands of short segments of DNA—about 20 letters of code apiece—are amplified and sequenced in their original locations inside the cells. Finally, the samples are ground up and put into a sequencer, which sequences all of the cells’ DNA about 300 letters at a time. By finding the location-identified short sequences within those longer segments, the method pinpoints each one’s position within the three-dimensional structure of the cell.

Sequencing inside the cells is done more or less the same way DNA is sequenced inside a standard next-generation sequencer, Boyden explains, by watching under a microscope as a DNA strand is copied using fluorescently labeled building blocks. As in a traditional sequencer, each of DNA’s four building blocks, or nucleotides, is tagged with a different color so that they can be visually identified as they are added to a growing DNA strand.

A collaborative effort

Boyden, Buenrostro, and Chen, who began their collaboration several years ago, say the new technology represents a heroic effort on the part of MIT and Harvard graduate students Andrew Payne, Zachary Chiang, and Paul Reginato, who took the lead in developing and integrating its many technical steps and computational analyses. That involved both recapitulating the methods used in commercial sequencers and introducing several key innovations. “Some advances on the technology side have taken this from impossible to do to being possible,” Chen says.

The team has already used the method to visualize a genome as it reorganizes itself during the earliest moments of life. Brightly colored representations of DNA that they sequenced inside a mouse embryo show how genetic information inherited from each parent remains distinct and compartmentalized immediately after fertilization, then gradually intertwines as development progresses. Their sequencing also reveals how patterns of genome organization, which very early in life vary from cell to cell, are passed on as cells divide, generating a memory of each cell’s developmental origins. Being able to watch these processes unfold across entire cells instead of piecing them together through less direct means offered a dramatic new view of development, the researchers say.

While the team continues to improve the spatial resolution of the technique and adapt it to a broader range of cell types, they have made their method and associated software freely available to other labs. The researchers hope this new approach to DNA sequencing will change the way people think about studying the structure of the genome and will help illuminate patterns and consequences of genome organization across a variety of contexts.

A large-scale tool to investigate the function of autism spectrum disorder genes

Scientists at Harvard University, the Broad Institute of MIT and Harvard, and MIT have developed a technology to investigate the function of many different genes in many different cell types at once, in a living organism. They applied the large-scale method to study dozens of genes that are associated with autism spectrum disorder, identifying how specific cell types in the developing mouse brain are impacted by mutations.

The “Perturb-Seq” method, published in the journal Science, is an efficient way to identify potential biological mechanisms underlying autism spectrum disorder, which is an important first step toward developing treatments for the complex disease. The method is also broadly applicable to other organs, enabling scientists to better understand a wide range of disease and normal processes.

“For many years, genetic studies have identified a multitude of risk genes that are associated with the development of autism spectrum disorder. The challenge in the field has been to make the connection between knowing what the genes are, to understanding how the genes actually affect cells and ultimately behavior,” said co-senior author Paola Arlotta, the Golub Family Professor of Stem Cell and Regenerative Biology at Harvard. “We applied the Perturb-Seq technology to an intact developing organism for the first time, showing the potential of measuring gene function at scale to better understand a complex disorder.”

The study was also led by co-senior authors Aviv Regev, who was a core member of the Broad Institute during the study and is currently Executive Vice President of Genentech Research and Early Development, and Feng Zhang, a core member of the Broad Institute and an investigator at MIT’s McGovern Institute.

To investigate gene function at a large scale, the researchers combined two powerful genomic technologies. They used CRISPR-Cas9 genome editing to make precise changes, or perturbations, in 35 different genes linked to autism spectrum disorder risk. Then, they analyzed changes in the developing mouse brain using single-cell RNA sequencing, which allowed them to see how gene expression changed in over 40,000 individual cells.

By looking at the level of individual cells, the researchers could compare how the risk genes affected different cell types in the cortex — the part of the brain responsible for complex functions including cognition and sensation. They analyzed networks of risk genes together to find common effects.

“We found that both neurons and glia — the non-neuronal cells in the brain — are directly affected by different sets of these risk genes,” said Xin Jin, lead author of the study and a Junior Fellow of the Harvard Society of Fellows. “Genes and molecules don’t generate cognition per se — they need to impact specific cell types in the brain to do so. We are interested in understanding how these different cell types can contribute to the disorder.”

To get a sense of the model’s potential relevance to the disorder in humans, the researchers compared their results to data from post-mortem human brains. In general, they found that in the post-mortem human brains with autism spectrum disorder, some of the key genes with altered expression were also affected in the Perturb-seq data.

“We now have a really rich dataset that allows us to draw insights, and we’re still learning a lot about it every day,” Jin said. “As we move forward with studying disease mechanisms in more depth, we can focus on the cell types that may be really important.”

“The field has been limited by the sheer time and effort that it takes to make one model at a time to test the function of single genes. Now, we have shown the potential of studying gene function in a developing organism in a scalable way, which is an exciting first step to understanding the mechanisms that lead to autism spectrum disorder and other complex psychiatric conditions, and to eventually develop treatments for these devastating conditions,” said Arlotta, who is also an institute member of the Broad Institute and part of the Broad’s Stanley Center for Psychiatric Research. “Our work also paves the way for Perturb-Seq to be applied to organs beyond the brain, to enable scientists to better understand the development or function of different tissue types, as well as pathological conditions.”

“Through genome sequencing efforts, a very large number of genes have been identified that, when mutated, are associated with human diseases. Traditionally, understanding the role of these genes would involve in-depth studies of each gene individually. By developing Perturb-seq for in vivo applications, we can start to screen all of these genes in animal models in a much more efficient manner, enabling us to understand mechanistically how mutations in these genes can lead to disease,” said Zhang, who is also the James and Patricia Poitras Professor of Neuroscience at MIT and a professor of brain and cognitive sciences and biological engineering at MIT.

This study was funded by the Stanley Center for Psychiatric Research at the Broad Institute, the National Institutes of Health, the Brain and Behavior Research Foundation’s NARSAD Young Investigator Grant, Harvard University’s William F. Milton Fund, the Klarman Cell Observatory, the Howard Hughes Medical Institute, a Center for Cell Circuits grant from the National Human Genome Research Institute’s Centers of Excellence in Genomic Science, the New York Stem Cell Foundation, the Mathers Foundation, the Poitras Center for Psychiatric Disorders Research at MIT, the Hock E. Tan and K. Lisa Yang Center for Autism Research at MIT, and J. and P. Poitras.

RNA “ticker tape” records gene activity over time

As cells grow, divide, and respond to their environment,  their gene expression changes; one gene may be transcribed into more RNA at one time point and less at another time when it’s no longer needed. Now, researchers at the McGovern Institute, Harvard, and the Broad Institute of MIT and Harvard have developed a way to determine when specific RNA molecules are produced in cells.  The method, described today in Nature Biotechnology, allows scientists to more easily study how a cell’s gene expression fluctuates over time.

“Biology is very dynamic but most of the tools we use in biology are static; you get a fixed snapshot of what’s happening in a cell at a given moment,” said Fei Chen, a core institute member at the Broad, an assistant professor at Harvard University, and a co-senior author of the new work. “This will now allow us to record what’s happening over hours or days.”

To find out the level of RNA a cell is transcribing, researchers typically extract genetic material from the cell—destroying the cell in the process—and use RNA sequencing technology to determine which genes are being transcribed into RNA, and how much. Although researchers can sample cells at various times, they can’t easily measure gene expression at multiple time points.

To create a more precise timestamp, the team added strings of repetitive DNA bases to genes of interest in cultured human cells. These strings caused the cell to add repetitive regions of adenosine molecules—one of four building blocks of RNA — to the ends of RNA when the RNA was transcribed from these genes. The researchers also introduced an engineered version of an enzyme called adenosine deaminase acting on RNA (ADAR2cd), which slowly changed the adenosine molecules to a related molecule, inosine, at a predictable rate in the RNA. By measuring the ratio of inosines to adenosines in the timestamped section of any given RNA molecule, the researchers could elucidate when it was first produced, while keeping cells intact.

“It was pretty surprising to see how well this worked as a timestamp,” said Sam Rodriques, a co-first author of the new paper and former MIT graduate student who is now founding the Applied Biotechnology Laboratory at the Crick Institute in London. “And the more molecules you look at, the better your temporal resolution.”

Using their method, the researchers could estimate the age of a single timestamped RNA molecule to within 2.7 hours. But when they looked simultaneously at four RNA molecules, they could estimate the age of the molecules to within 1.5 hours. Looking at 200 molecules at once allowed the scientists to correctly sort RNA molecules into groups based on their age, or order them along a timeline with 86 percent accuracy.

“Extremely interesting biology, such as immune responses and development, occurs over a timescale of hours,” said co-first author of the paper Linlin Chen of the Broad. “Now we have the opportunity to better probe what’s happening on this timescale.”

The researchers found that the approach, with some small tweaks, worked well on various cell types — neurons, fibroblasts and embryonic kidney cells. They’re planning to now use the method to study how levels of gene activity related to learning and memory change in the hours after a neuron fires.

The current system allows researchers to record changes in gene expression over half a day. The team is now expanding the time range over which they can record gene activity, making the method more precise, and adding the ability to track several different genes at a time.

“Gene expression is constantly changing in response to the environment,” said co-senior author Edward Boyden of MIT, the McGovern Institute for Brain Research, and the Howard Hughes Medical Institute. “Tools like this will help us eavesdrop on how cells evolve over time, and help us pinpoint new targets for treating diseases.”

Support for the research was provided by the National Institutes of Health, the Schmidt Fellows Program at Broad Institute, the Burroughs Wellcome Fund, John Doerr, the Open Philanthropy Project, the HHMI-Simons Faculty Scholars Program, the U. S. Army Research Laboratory and the U. S. Army Research Office, the MIT Media Lab, Lisa Yang, the Hertz Graduate Fellowship and the National Science Foundation Graduate Research Fellowship Program.