The Society for Neuroscience will present its highest honor, the Ralph W. Gerard Prize in Neuroscience, to McGovern Institute Director Robert Desimone at its annual meeting today.
The Gerard Prize is named for neuroscientist Ralph W. Gerard who helped establish the Society for Neuroscience, and honors “outstanding scientists who have made significant contributions to neuroscience throughout their careers.” Desimone will share the $30,000 prize with Vanderbilt University neuroscientist Jon Kaas.
Desimone is being recognized for his career contributions to understanding cortical function in the visual system. His seminal work on attention spans decades, including the discovery of a neural basis for covert attention in the temporal cortex and the creation of the biased competition model, suggesting that attention is biased towards material relevant to the task. More recent work revealed how synchronized brain rhythms help enhance visual processing. Desimone also helped discover both face cells and neural populations that identify objects even when the size or location of the object changes. His long list of contributions includes mapping the extrastriate visual cortex, publishing the first report of columns for motion processing outside the primary visual cortex, and discovering how the temporal cortex retains memories. Desimone’s work has moved the field from broad strokes of input and output to a more nuanced understanding of cortical function that allows the brain to make sense of the environment.
At its annual meeting, beginning today, the Society will honor Desimone and other leading researchers who have made significant contributions to neuroscience — including the understanding of cognitive processes, drug addiction, neuropharmacology, and theoretical models — with this year’s Outstanding Achievement Awards.
“The Society is honored to recognize this year’s awardees, whose groundbreaking research has revolutionized our understanding of the brain, from the level of the synapse to the structure and function of the cortex, shedding light on how vision, memory, perception of touch and pain, and drug
addiction are organized in the brain,” SfN President Barry Everitt, said. “This exceptional group of neuroscientists has made fundamental discoveries, paved the way for new therapeutic approaches, and introduced new tools that will lay the foundation for decades of research to come.”
Robots can deliver food on a college campus and hit a hole-in-one on the golf course, but even the most sophisticated robot can’t perform basic social interactions that are critical to everyday human life.
MIT researchers have now incorporated certain social interactions into a framework for robotics, enabling machines to understand what it means to help or hinder one another, and to learn to perform these social behaviors on their own. In a simulated environment, a robot watches its companion, guesses what task it wants to accomplish, and then helps or hinders this other robot based on its own goals.
The researchers also showed that their model creates realistic and predictable social interactions. When they showed videos of these simulated robots interacting with one another to humans, the human viewers mostly agreed with the model about what type of social behavior was occurring.
Enabling robots to exhibit social skills could lead to smoother and more positive human-robot interactions. For instance, a robot in an assisted living facility could use these capabilities to help create a more caring environment for elderly individuals. The new model may also enable scientists to measure social interactions quantitatively, which could help psychologists study autism or analyze the effects of antidepressants.
“Robots will live in our world soon enough, and they really need to learn how to communicate with us on human terms. They need to understand when it is time for them to help and when it is time for them to see what they can do to prevent something from happening. This is very early work and we are barely scratching the surface, but I feel like this is the first very serious attempt for understanding what it means for humans and machines to interact socially,” says Boris Katz, principal research scientist and head of the InfoLab Group in MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) and a member of the Center for Brains, Minds, and Machines (CBMM).
Joining Katz on the paper are co-lead author Ravi Tejwani, a research assistant at CSAIL; co-lead author Yen-Ling Kuo, a CSAIL PhD student; Tianmin Shu, a postdoc in the Department of Brain and Cognitive Sciences; and senior author Andrei Barbu, a research scientist at CSAIL and CBMM. The research will be presented at the Conference on Robot Learning in November.
A social simulation
To study social interactions, the researchers created a simulated environment where robots pursue physical and social goals as they move around a two-dimensional grid.
A physical goal relates to the environment. For example, a robot’s physical goal might be to navigate to a tree at a certain point on the grid. A social goal involves guessing what another robot is trying to do and then acting based on that estimation, like helping another robot water the tree.
The researchers use their model to specify what a robot’s physical goals are, what its social goals are, and how much emphasis it should place on one over the other. The robot is rewarded for actions it takes that get it closer to accomplishing its goals. If a robot is trying to help its companion, it adjusts its reward to match that of the other robot; if it is trying to hinder, it adjusts its reward to be the opposite. The planner, an algorithm that decides which actions the robot should take, uses this continually updating reward to guide the robot to carry out a blend of physical and social goals.
“We have opened a new mathematical framework for how you model social interaction between two agents. If you are a robot, and you want to go to location X, and I am another robot and I see that you are trying to go to location X, I can cooperate by helping you get to location X faster. That might mean moving X closer to you, finding another better X, or taking whatever action you had to take at X. Our formulation allows the plan to discover the ‘how’; we specify the ‘what’ in terms of what social interactions mean mathematically,” says Tejwani.
Blending a robot’s physical and social goals is important to create realistic interactions, since humans who help one another have limits to how far they will go. For instance, a rational person likely wouldn’t just hand a stranger their wallet, Barbu says.
The researchers used this mathematical framework to define three types of robots. A level 0 robot has only physical goals and cannot reason socially. A level 1 robot has physical and social goals but assumes all other robots only have physical goals. Level 1 robots can take actions based on the physical goals of other robots, like helping and hindering. A level 2 robot assumes other robots have social and physical goals; these robots can take more sophisticated actions like joining in to help together.
Evaluating the model
To see how their model compared to human perspectives about social interactions, they created 98 different scenarios with robots at levels 0, 1, and 2. Twelve humans watched 196 video clips of the robots interacting, and then were asked to estimate the physical and social goals of those robots.
In most instances, their model agreed with what the humans thought about the social interactions that were occurring in each frame.
“We have this long-term interest, both to build computational models for robots, but also to dig deeper into the human aspects of this. We want to find out what features from these videos humans are using to understand social interactions. Can we make an objective test for your ability to recognize social interactions? Maybe there is a way to teach people to recognize these social interactions and improve their abilities. We are a long way from this, but even just being able to measure social interactions effectively is a big step forward,” Barbu says.
Toward greater sophistication
The researchers are working on developing a system with 3D agents in an environment that allows many more types of interactions, such as the manipulation of household objects. They are also planning to modify their model to include environments where actions can fail.
The researchers also want to incorporate a neural network-based robot planner into the model, which learns from experience and performs faster. Finally, they hope to run an experiment to collect data about the features humans use to determine if two robots are engaging in a social interaction.
“Hopefully, we will have a benchmark that allows all researchers to work on these social interactions and inspire the kinds of science and engineering advances we’ve seen in other areas such as object and action recognition,” Barbu says.
“I think this is a lovely application of structured reasoning to a complex yet urgent challenge,” says Tomer Ullman, assistant professor in the Department of Psychology at Harvard University and head of the Computation, Cognition, and Development Lab, who was not involved with this research. “Even young infants seem to understand social interactions like helping and hindering, but we don’t yet have machines that can perform this reasoning at anything like human-level flexibility. I believe models like the ones proposed in this work, that have agents thinking about the rewards of others and socially planning how best to thwart or support them, are a good step in the right direction.”
This research was supported by the Center for Brains, Minds, and Machines; the National Science Foundation; the MIT CSAIL Systems that Learn Initiative; the MIT-IBM Watson AI Lab; the DARPA Artificial Social Intelligence for Successful Teams program; the U.S. Air Force Research Laboratory; the U.S. Air Force Artificial Intelligence Accelerator; and the Office of Naval Research.
The lateral prefrontal cortex is a particularly well-connected part of the brain. Neurons there communicate with processing centers throughout the rest of the brain, gathering information and sending commands to implement executive control over behavior. Now, scientists at MIT’s McGovern Institute have mapped these connections and revealed an unexpected order within them: The lateral prefrontal cortex, they’ve found, contains maps of other major parts of the brain’s cortex.
The researchers, led by postdoctoral researcher Rui Xu and McGovern Institute Director Robert Desimone, report that the lateral prefrontal cortex contains a set of maps that represent the major processing centers in the other parts of the cortex, including the temporal and parietal lobes. Their organization likely supports the lateral prefrontal cortex’s roles managing complex functions such as attention and working memory, which require integrating information from multiple sources and coordinating activity elsewhere in the brain. The findings are published November 4, 2021, in the journal Neuron.
Topographic maps
The layout of the maps, which allows certain regions of the lateral prefrontal cortex to directly interact with multiple areas across the brain, indicates that this part of the brain is particularly well positioned for its role. “This function of integrating and then sending back control signals to appropriate levels in the processing hierarchies of the brain is clearly one of the reasons that prefrontal cortex is so important for cognition and executive control,” says Desimone.
In many parts of the brain, neurons’ physical organization has been found to reflect the information represented there. For example, individual neurons’ positions within the visual cortex mirror the layout of the cells in the retina from which they receive input, such that the spatial pattern of neuronal activity in this part of the brain provides an approximate view of the image seen by the eyes. For example, if you fixate on the first letter of a word, the next letters in the word will map to sequential locations in the visual cortex. Likewise, the arm and hand are mapped to adjacent locations in the somatic cortex, where the brain receives sensory information from the skin.
Topographic maps such as these, which have been found primarily in brain regions involved in sensory and motor processing, offer clues about how information is stored and processed in the brain. Neuroscientists have hoped that topographic maps within the lateral prefrontal cortex will provide insight into the complex cognitive processes that are carried out there—but such maps have been elusive.
Previous anatomical studies had given little indication how different parts of the brain communicate preferentially to specific locations within the prefrontal cortex to give rise to regional specialization of cognitive functions. Recently, however, the Desimone lab identified two areas within the lateral prefrontal cortex of monkeys with specific roles in focusing an animal’s visual attention. Knowing that some spots within the lateral prefrontal cortex were wired for specific functions, they wondered if others were, too. They decided they needed a detailed map of the connections emanating from this part of the brain, and devised a plan to plot connectivity from hundreds of points within the lateral prefrontal cortex.
Cortical connectome
To generate a wiring diagram, or connectome, Xu used functional MRI to monitor activity throughout a monkey’s brain as he stimulated specific points within its lateral prefrontal cortex. He moved systematically through the brain region, stimulating points spaced as close as one millimeter apart, and noting which parts of the brain lit up in response. Ultimately, the team collected data from about 100 sites for each of two monkeys.
As the data accumulated, clear patterns emerged. Different regions within the lateral prefrontal cortex formed orderly connections with each of five processing centers throughout the brain. Points within each of these maps connected to sites with the same relative positions in the distant processing centers. Because some parts of the lateral prefrontal cortex are wired to interact with more than one processing centers, these maps overlap, positioning the prefrontal cortex to integrate information from different sources.
The team found significant overlap, for example, between the maps of the temporal cortex, a part of the brain that uses visual information to recognize objects, and the parietal cortex, which computes the spatial relationships between objects. “It is mapping objects and space together in a way that would integrate the two systems,” explains Desimone. “And then on top of that, it has other maps of other brain systems that are partially overlapping with that—so they’re all sort of coming together.”
Desimone and Xu say the new connectome will help guide further investigations of how the prefrontal cortex orchestrates complex cognitive processes. “I think this really gives us a direction for the future, because we now need to understand the cognitive concepts that are mapped there,” Desimone says.
Already, they say, the connectome offers encouragement that a deeper understanding of complex cognition is within reach. “This topographic connectivity gives the lateral prefrontal some specific advantage to serve its function,” says Xu. “This suggests that lateral prefrontal cortex has a fine organization, just like the more studied parts of the brain, so the approaches that have been used to study these other regions may also benefit the studies of high-level cognition.”
In the past few years, artificial intelligence models of language have become very good at certain tasks. Most notably, they excel at predicting the next word in a string of text; this technology helps search engines and texting apps predict the next word you are going to type.
The most recent generation of predictive language models also appears to learn something about the underlying meaning of language. These models can not only predict the word that comes next, but also perform tasks that seem to require some degree of genuine understanding, such as question answering, document summarization, and story completion.
Such models were designed to optimize performance for the specific function of predicting text, without attempting to mimic anything about how the human brain performs this task or understands language. But a new study from MIT neuroscientists suggests the underlying function of these models resembles the function of language-processing centers in the human brain.
Computer models that perform well on other types of language tasks do not show this similarity to the human brain, offering evidence that the human brain may use next-word prediction to drive language processing.
“The better the model is at predicting the next word, the more closely it fits the human brain,” says Nancy Kanwisher, the Walter A. Rosenblith Professor of Cognitive Neuroscience, a member of MIT’s McGovern Institute for Brain Research and Center for Brains, Minds, and Machines (CBMM), and an author of the new study. “It’s amazing that the models fit so well, and it very indirectly suggests that maybe what the human language system is doing is predicting what’s going to happen next.”
Joshua Tenenbaum, a professor of computational cognitive science at MIT and a member of CBMM and MIT’s Artificial Intelligence Laboratory (CSAIL); and Evelina Fedorenko, the Frederick A. and Carole J. Middleton Career Development Associate Professor of Neuroscience and a member of the McGovern Institute, are the senior authors of the study, which appears this week in the Proceedings of the National Academy of Sciences.
Martin Schrimpf, an MIT graduate student who works in CBMM, is the first author of the paper.
Making predictions
The new, high-performing next-word prediction models belong to a class of models called deep neural networks. These networks contain computational “nodes” that form connections of varying strength, and layers that pass information between each other in prescribed ways.
Over the past decade, scientists have used deep neural networks to create models of vision that can recognize objects as well as the primate brain does. Research at MIT has also shown that the underlying function of visual object recognition models matches the organization of the primate visual cortex, even though those computer models were not specifically designed to mimic the brain.
In the new study, the MIT team used a similar approach to compare language-processing centers in the human brain with language-processing models. The researchers analyzed 43 different language models, including several that are optimized for next-word prediction. These include a model called GPT-3 (Generative Pre-trained Transformer 3), which, given a prompt, can generate text similar to what a human would produce. Other models were designed to perform different language tasks, such as filling in a blank in a sentence.
As each model was presented with a string of words, the researchers measured the activity of the nodes that make up the network. They then compared these patterns to activity in the human brain, measured in subjects performing three language tasks: listening to stories, reading sentences one at a time, and reading sentences in which one word is revealed at a time. These human datasets included functional magnetic resonance (fMRI) data and intracranial electrocorticographic measurements taken in people undergoing brain surgery for epilepsy.
They found that the best-performing next-word prediction models had activity patterns that very closely resembled those seen in the human brain. Activity in those same models was also highly correlated with measures of human behavioral measures such as how fast people were able to read the text.
“We found that the models that predict the neural responses well also tend to best predict human behavior responses, in the form of reading times. And then both of these are explained by the model performance on next-word prediction. This triangle really connects everything together,” Schrimpf says.
“A key takeaway from this work is that language processing is a highly constrained problem: The best solutions to it that AI engineers have created end up being similar, as this paper shows, to the solutions found by the evolutionary process that created the human brain. Since the AI network didn’t seek to mimic the brain directly — but does end up looking brain-like — this suggests that, in a sense, a kind of convergent evolution has occurred between AI and nature,” says Daniel Yamins, an assistant professor of psychology and computer science at Stanford University, who was not involved in the study.
Game changer
One of the key computational features of predictive models such as GPT-3 is an element known as a forward one-way predictive transformer. This kind of transformer is able to make predictions of what is going to come next, based on previous sequences. A significant feature of this transformer is that it can make predictions based on a very long prior context (hundreds of words), not just the last few words.
Scientists have not found any brain circuits or learning mechanisms that correspond to this type of processing, Tenenbaum says. However, the new findings are consistent with hypotheses that have been previously proposed that prediction is one of the key functions in language processing, he says.
“One of the challenges of language processing is the real-time aspect of it,” he says. “Language comes in, and you have to keep up with it and be able to make sense of it in real time.”
The researchers now plan to build variants of these language processing models to see how small changes in their architecture affect their performance and their ability to fit human neural data.
“For me, this result has been a game changer,” Fedorenko says. “It’s totally transforming my research program, because I would not have predicted that in my lifetime we would get to these computationally explicit models that capture enough about the brain so that we can actually leverage them in understanding how the brain works.”
The researchers also plan to try to combine these high-performing language models with some computer models Tenenbaum’s lab has previously developed that can perform other kinds of tasks such as constructing perceptual representations of the physical world.
“If we’re able to understand what these language models do and how they can connect to models which do things that are more like perceiving and thinking, then that can give us more integrative models of how things work in the brain,” Tenenbaum says. “This could take us toward better artificial intelligence models, as well as giving us better models of how more of the brain works and how general intelligence emerges, than we’ve had in the past.”
The research was funded by a Takeda Fellowship; the MIT Shoemaker Fellowship; the Semiconductor Research Corporation; the MIT Media Lab Consortia; the MIT Singleton Fellowship; the MIT Presidential Graduate Fellowship; the Friends of the McGovern Institute Fellowship; the MIT Center for Brains, Minds, and Machines, through the National Science Foundation; the National Institutes of Health; MIT’s Department of Brain and Cognitive Sciences; and the McGovern Institute.
Other authors of the paper are Idan Blank PhD ’16 and graduate students Greta Tuckute, Carina Kauf, and Eghbal Hosseini.
The National Academy of Medicine (NAM) has announced the election of 100 new members for 2021, including two MIT faculty members and three additional Institute affiliates.
Faculty honorees include Linda G. Griffith, a professor in the MIT departments of Biological Engineering and Mechanical Engineering; and Feng Zhang, a professor in the MIT departments of Brain and Cognitive Sciences and Biological Engineering. Guillermo Antonio Ameer SCD ’99, a professor of biomedical engineering and surgery at Northwestern University; Darrell Gaskin SM ’87, a professor of health policy and management at Johns Hopkins University; and Vamsi Mootha, an institute member of the Broad Institute of MIT and Harvard and former student in the Harvard-MIT Program in Health Sciences and Technology, were also honored.
The new inductees were elected through a process that recognizes individuals who have made major contributions to the advancement of the medical sciences, health care, and public health. Election to the academy is considered one of the highest honors in the fields of health and medicine and recognizes individuals who have demonstrated outstanding professional achievement and commitment to service.
Griffith, the School of Engineering Professor of Teaching Innovation and director of the Center for Gynepathology Research at MIT, is credited for her longstanding leadership in research, education, and medical translation. Specifically, the NAM recognizes her pioneering work in tissue engineering, biomaterials, and systems biology, including the development of the first “liver chip” technology. Griffith is also recognized for inventing 3D biomaterials printing and organotypic models for systems gynopathology, and for the establishment of the biological engineering department at MIT.
The academy recognizes Zhang, the Patricia and James Poitras ’63 Professor in Neuroscience at MIT, for revolutionizing molecular biology and powering transformative leaps forward in our ability to study and treat human diseases. Zhang, who also is an investigator at the Howard Hughes Medical Institute and the McGovern Institute for Brain Research, and a core member of the Broad Institute of MIT and Harvard, is specifically credited for the discovery of novel microbial enzymes and their development as molecular technologies, including optogenetics and CRISPR-mediated genome editing. The academy also commends Zhang for his outstanding mentoring and professional services.
Ameer, the Daniel Hale Williams Professor of Biomedical Engineering and Surgery at the Northwestern University Feinberg School of Medicine, earned his Doctor of Science degree from the MIT Department of Chemical Engineering in 1999. A professor of biomedical engineering and of surgery who is also the director of the Center for Advanced Regenerative Engineering, he is cited by the NAM “For pioneering contributions to regenerative engineering and medicine through the development, dissemination, and translation of citrate-based biomaterials, a new class of biodegradable polymers that enabled the commercialization of innovative medical devices approved by the U.S. Food and Drug Administration for use in a variety of surgical procedures.”
Gaskin, the William C. and Nancy F. Richardson Professor in Health Policy and Management, Bloomberg School of Public Health at Johns Hopkins University, earned his Master of Science degree from the MIT Department of Economics in 1987. A health economist who advances community, neighborhood, and market-level policies and programs that reduce health disparities, he is cited by the NAM “For his work as a leading health economist and health services researcher who has advanced fundamental understanding of the role of place as a driver in racial and ethnic health disparities.”
Mootha, the founding co-director of the Broad Institute’s Metabolism Program, is a professor of systems biology and medicine at Harvard Medical School and a professor in the Department of Molecular Biology at Massachusetts General Hospital. An alumnus of the Harvard-MIT Program in Health Sciences and Technology and former postdoc with the Whitehead Institute for Biomedical Research, Mootha is an expert in the mitochondrion, the “powerhouse of the cell,” and its role in human disease. The NAM cites Mootha “For transforming the field of mitochondrial biology by creatively combining modern genomics with classical bioenergetics.”
Established in 1970 by the National Academy of Sciences, the NAM addresses critical issues in health, science, medicine, and related policy and inspires positive actions across sectors. NAM works alongside the National Academy of Sciences and National Academy of Engineering to provide independent, objective analysis and advice to the nation and conduct other activities to solve complex problems and inform public policy decisions. The National Academies of Sciences, Engineering, and Medicine also encourage education and research, recognize outstanding contributions to knowledge, and increase public understanding of STEMM. With their election, NAM members make a commitment to volunteer their service in National Academies activities.
With the tools of modern neuroscience, data accumulates quickly. Recording devices listen in on the electrical conversations between neurons, picking up the voices of hundreds of cells at a time. Microscopes zoom in to illuminate the brain’s circuitry, capturing thousands of images of cells’ elaborately branched paths. Functional MRIs detect changes in blood flow to map activity within a person’s brain, generating a complete picture by compiling hundreds of scans.
“When I entered neuroscience about 20 years ago, data were extremely precious, and ideas, as the expression went, were cheap. That’s no longer true,” says McGovern Associate Investigator Ila Fiete. “We have an embarrassment of wealth in the data but lack sufficient conceptual and mathematical scaffolds to understand it.”
Fiete will lead the McGovern Institute’s new K. Lisa Yang Integrative Computational Neuroscience (ICoN) Center, whose scientists will create mathematical models and other computational tools to confront the current deluge of data and advance our understanding of the brain and mental health. The center, funded by a $24 million donation from philanthropist Lisa Yang, will take a uniquely collaborative approach to computational neuroscience, integrating data from MIT labs to explain brain function at every level, from the molecular to the behavioral.
“Driven by technologies that generate massive amounts of data, we are entering a new era of translational neuroscience research,” says Yang, whose philanthropic investment in MIT research now exceeds $130 million. “I am confident that the multidisciplinary expertise convened by this center will revolutionize how we synthesize this data and ultimately understand the brain in health and disease.”
Data integration
Fiete says computation is particularly crucial to neuroscience because the brain is so staggeringly complex. Its billions of neurons, which are themselves complicated and diverse, interact with one other through trillions of connections.
“Conceptually, it’s clear that all these interactions are going to lead to pretty complex things. And these are not going to be things that we can explain in stories that we tell,” Fiete says. “We really will need mathematical models. They will allow us to ask about what changes when we perturb one or several components — greatly accelerating the rate of discovery relative to doing those experiments in real brains.”
By representing the interactions between the components of a neural circuit, a model gives researchers the power to explore those interactions, manipulate them, and predict the circuit’s behavior under different conditions.
“You can observe these neurons in the same way that you would observe real neurons. But you can do even more, because you have access to all the neurons and you have access to all the connections and everything in the network,” explains computational neuroscientist and McGovern Associate Investigator Guangyu Robert Yang (no relation to Lisa Yang), who joined MIT as a junior faculty member in July 2021.
Many neuroscience models represent specific functions or parts of the brain. But with advances in computation and machine learning, along with the widespread availability of experimental data with which to test and refine models, “there’s no reason that we should be limited to that,” he says.
Robert Yang’s team at the McGovern Institute is working to develop models that integrate multiple brain areas and functions. “The brain is not just about vision, just about cognition, just about motor control,” he says. “It’s about all of these things. And all these areas, they talk to one another.” Likewise, he notes, it’s impossible to separate the molecules in the brain from their effects on behavior – although those aspects of neuroscience have traditionally been studied independently, by researchers with vastly different expertise.
The ICoN Center will eliminate the divides, bringing together neuroscientists and software engineers to deal with all types of data about the brain. To foster interdisciplinary collaboration, every postdoctoral fellow and engineer at the center will work with multiple faculty mentors. Working in three closely interacting scientific cores, fellows will develop computational technologies for analyzing molecular data, neural circuits, and behavior, such as tools to identify pat-terns in neural recordings or automate the analysis of human behavior to aid psychiatric diagnoses. These technologies will also help researchers model neural circuits, ultimately transforming data into knowledge and understanding.
“Lisa is focused on helping the scientific community realize its goals in translational research,” says Nergis Mavalvala, dean of the School of Science and the Curtis and Kathleen Marble Professor of Astrophysics. “With her generous support, we can accelerate the pace of research by connecting the data to the delivery of tangible results.”
Computational modeling
In its first five years, the ICoN Center will prioritize four areas of investigation: episodic memory and exploration, including functions like navigation and spatial memory; complex or stereotypical behavior, such as the perseverative behaviors associated with autism and obsessive-compulsive disorder; cognition and attention; and sleep. The goal, Fiete says, is to model the neuronal interactions that underlie these functions so that researchers can predict what will happen when something changes — when certain neurons become more active or when a genetic mutation is introduced, for example. When paired with experimental data from MIT labs, the center’s models will help explain not just how these circuits work, but also how they are altered by genes, the environment, aging, and disease.
These focus areas encompass circuits and behaviors often affected by psychiatric disorders and neurodegeneration, and models will give researchers new opportunities to explore their origins and potential treatment strategies. “I really think that the future of treating disorders of the mind is going to run through computational modeling,” says McGovern Associate Investigator Josh McDermott.
In McDermott’s lab, researchers are modeling the brain’s auditory circuits. “If we had a perfect model of the auditory system, we would be able to understand why when somebody loses their hearing, auditory abilities degrade in the very particular ways in which they degrade,” he says. Then, he says, that model could be used to optimize hearing aids by predicting how the brain would interpret sound altered in various ways by the device.
Similar opportunities will arise as researchers model other brain systems, McDermott says, noting that computational models help researchers grapple with a dauntingly vast realm of possibilities. “There’s lots of different ways the brain can be set up, and lots of different potential treatments, but there is a limit to the number of neuroscience or behavioral experiments you can run,” he says. “Doing experiments on a computational system is cheap, so you can explore the dynamics of the system in a very thorough way.”
The ICoN Center will speed the development of the computational tools that neuroscientists need, both for basic understanding of the brain and clinical advances. But Fiete hopes for a culture shift within neuroscience, as well. “There are a lot of brilliant students and postdocs who have skills that are mathematics and computational and modeling based,” she says. “I think once they know that there are these possibilities to collaborate to solve problems related to psychiatric disorders and how we think, they will see that this is an exciting place to apply their skills, and we can bring them in.”
With the tools of modern neuroscience, researchers can peer into the brain with unprecedented accuracy. Recording devices listen in on the electrical conversations between neurons, picking up the voices of hundreds of cells at a time. Genetic tools allow us to focus on specific types of neurons based on their molecular signatures. Microscopes zoom in to illuminate the brain’s circuitry, capturing thousands of images of elaborately branched dendrites. Functional MRIs detect changes in blood flow to map activity within a person’s brain, generating a complete picture by compiling hundreds of scans.
This deluge of data provides insights into brain function and dynamics at different levels – molecules, cells, circuits, and behavior — but the insights often remain compartmentalized in separate research silos. An innovative new center at MIT’s McGovern Institute aims to leverage them into powerful revelations of the brain’s inner workings.
The center, funded by a $24 million donation from philanthropist Lisa Yang and led by McGovern Institute Associate Investigator Ila Fiete, will take a collaborative approach to computational neuroscience, integrating cutting-edge modeling techniques and data from MIT labs to explain brain function at every level, from the molecular to the behavioral.
“Our goal is that sophisticated, truly integrated computational models of the brain will make it possible to identify how ‘control knobs’ such as genes, proteins, chemicals, and environment drive thoughts and behavior, and to make inroads toward urgent unmet needs in understanding and treating brain disorders,” says Fiete, who is also a brain and cognitive sciences professor at MIT.
“Driven by technologies that generate massive amounts of data, we are entering a new era of translational neuroscience research,” says Yang, whose philanthropic investment in MIT research now exceeds $130 million. “I am confident that the multidisciplinary expertise convened by the ICoN center will revolutionize how we synthesize this data and ultimately understand the brain in health and disease.”
Connecting the data
It is impossible to separate the molecules in the brain from their effects on behavior – although those aspects of neuroscience have traditionally been studied independently, by researchers with vastly different expertise. The ICoN Center will eliminate the divides, bringing together neuroscientists and software engineers to deal with all types of data about the brain.
“The center’s highly collaborative structure, which is essential for unifying multiple levels of understanding, will enable us to recruit talented young scientists eager to revolutionize the field of computational neuroscience,” says Robert Desimone, director of the McGovern Institute. “It is our hope that the ICoN Center’s unique research environment will truly demonstrate a new academic research structure that catalyzes bold, creative research.”
To foster interdisciplinary collaboration, every postdoctoral fellow and engineer at the center will work with multiple faculty mentors. In order to attract young scientists and engineers to the field of computational neuroscience, the center will also provide four graduate fellowships to MIT students each year in perpetuity. Interacting closely with three scientific cores, engineers and fellows will develop computational models and technologies for analyzing molecular data, neural circuits, and behavior, such as tools to identify patterns in neural recordings or automate the analysis of human behavior to aid psychiatric diagnoses. These technologies and models will be instrumental in synthesizing data into knowledge and understanding.
Center priorities
In its first five years, the ICoN Center will prioritize four areas of investigation: episodic memory and exploration, including functions like navigation and spatial memory; complex or stereotypical behavior, such as the perseverative behaviors associated with autism and obsessive-compulsive disorder; cognition and attention; and sleep. Models of complex behavior will be created in collaboration with clinicians and researchers at Children’s Hospital of Philadelphia.
The goal, Fiete says, is to model the neuronal interactions that underlie these functions so that researchers can predict what will happen when something changes — when certain neurons become more active or when a genetic mutation is introduced, for example. When paired with experimental data from MIT labs, the center’s models will help explain not just how these circuits work, but also how they are altered by genes, the environment, aging, and disease. These focus areas encompass circuits and behaviors often affected by psychiatric disorders and neurodegeneration, and models will give researchers new opportunities to explore their origins and potential treatment strategies.
“Lisa Yang is focused on helping the scientific community realize its goals in translational research,” says Nergis Mavalvala, dean of the School of Science and the Curtis and Kathleen Marble Professor of Astrophysics. “With her generous support, we can accelerate the pace of research by connecting the data to the delivery of tangible results.”
On Oct. 5, the National Institutes of Health announced the names of 106 scientists who have been awarded grants through the High-Risk, High-Reward Research program to advance highly innovative biomedical and behavioral research. Seven of the recipients are MIT faculty members.
The High-Risk, High-Reward Research program catalyzes scientific discovery by supporting research proposals that, due to their inherent risk, may struggle in the traditional peer-review process despite their transformative potential. Program applicants are encouraged to pursue trailblazing ideas in any area of research relevant to the NIH’s mission to advance knowledge and enhance health.
“The science put forward by this cohort is exceptionally novel and creative and is sure to push at the boundaries of what is known,” says NIH Director Francis S. Collins. “These visionary investigators come from a wide breadth of career stages and show that groundbreaking science can happen at any career level given the right opportunity.”
New innovators
Four MIT researchers received New Innovator Awards, which recognize “unusually innovative research from early career investigators.” They are:
Pulin Li is a member at the Whitehead Institute for Biomedical Research and an assistant professor in the Department of Biology. Li combines approaches from synthetic biology, developmental biology, biophysics and systems biology to quantitatively understand the genetic circuits underlying cell-cell communication that creates multicellular behaviors.
Seychelle Vos, the Robert A. Swanson (1969) Career Development Professor of Life Sciences in the Department of Biology, studies the interplay of gene expression and genome organization. Her work focuses on understanding how large molecular machineries involved in genome organization and gene transcription regulate each others’ function to ultimately determine cell fate and identity.
Xiao Wang, the Thomas D. and Virginia Cabot Assistant Professor of Chemistry and a member of the Broad Institute of MIT and Harvard, aims to develop high-resolution and highly-multiplexed molecular imaging methods across multiple scales toward understanding the physical and chemical basis of brain wiring and function.
Alison Wendlandt is a Cecil and Ida Green Career Development Assistant Professor of Chemistry. Wendlandt focuses on the development of selective, catalytic reactions using the tools of organic and organometallic synthesis and physical organic chemistry. Mechanistic study plays a central role in the development of these new transformations.
Transformative researchers
Two MIT researchers have received Transformative Research Awards, which “promote cross-cutting, interdisciplinary approaches that could potentially create or challenge existing paradigms.” The recipients are:
Manolis Kellis is a professor of computer science at MIT in the area of computational biology, an associate member of the Broad Institute, and a principal investigator with MIT’s Computer Science and Artificial Intelligence Laboratory. He aims to further our understanding of the human genome by computational integration of large-scale functional and comparative genomics datasets.
Myriam Heiman is the Latham Family Career Development Associate Professor of Neuroscience in the Department of Brain and Cognitive Sciences and an investigator in the Picower Institute for Learning and Memory. Heiman studies the selective vulnerability and pathophysiology seen in two neurodegenerative diseases of the basal ganglia, Huntington’s disease, and Parkinson’s disease.
Together, Heiman, Kellis and colleagues will launch a five-year investigation to pinpoint what may be going wrong in specific brain cells and to help identify new treatment approaches for amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration with motor neuron disease (FTLD/MND). The project will bring together four labs, including Heiman and Kellis’ labs at MIT, to apply innovative techniques ranging from computational, genomic, and epigenomic analyses of cells from a rich sample of central nervous system tissue, to precision genetic engineering of stem cells and animal models.
Pioneering researchers
Polina Anikeeva received a Pioneer Award, which “challenges investigators at all career levels to pursue new research directions and develop groundbreaking, high-impact approaches to a broad area of biomedical, behavioral, or social science.” Anikeeva is an MIT professor of materials science and engineering, a professor of brain and cognitive sciences, and a McGovern Institute for Brain Research associate investigator. She has established a research program that uniquely combines materials synthesis, device fabrication, neurophysiology, and animal models of behavior. Her group carries out projects that understand, invent, and design materials from the level of atoms to functional devices with applications in fundamental neuroscience.
This year, NIH issued 10 Pioneer awards, 64 New Innovator awards, 19 Transformative Research awards (10 general, four ALS-related, and five Covid-19-related), and 13 Early Independence awards for 2021. Funding for the awards comes from the NIH Common Fund, the National Institute of General Medical Sciences, the National Institute of Mental Health, and the National Institute of Neurological Disorders and Stroke.
Using machine learning, a computer model can teach itself to smell in just a few minutes. When it does, researchers have found, it builds a neural network that closely mimics the olfactory circuits that animal brains use to process odors.
Animals from fruit flies to humans all use essentially the same strategy to process olfactory information in the brain. But neuroscientists who trained an artificial neural network to take on a simple odor classification task were surprised to see it replicate biology’s strategy so faithfully.
“The algorithm we use has no resemblance to the actual process of evolution,” says Guangyu Robert Yang, an associate investigator at MIT’s McGovern Institute, who led the work as a postdoctoral fellow at Columbia University. The similarities between the artificial and biological systems suggest that the brain’s olfactory network is optimally suited to its task.
Yang and his collaborators, who reported their findings October 6, 2021, in the journal Neuron, say their artificial network will help researchers learn more about the brain’s olfactory circuits. The work also helps demonstrate artificial neural networks’ relevance to neuroscience. “By showing that we can match the architecture [of the biological system] very precisely, I think that gives more confidence that these neural networks can continue to be useful tools for modeling the brain,” says Yang, who is also an assistant professor in MIT’s Departments of Brain and Cognitive Sciences and Electrical Engineering and Computer Science and a member of the Center for Brains, Minds and Machines.
Mapping natural olfactory circuits
For fruit flies, the organism in which the brain’s olfactory circuitry has been best mapped, smell begins in the antennae. Sensory neurons there, each equipped with odor receptors specialized to detect specific scents, transform the binding of odor molecules into electrical activity. When an odor is detected, these neurons, which make up the first layer of the olfactory network, signal to the second-layer: a set of neurons that reside in a part of the brain called the antennal lobe. In the antennal lobe, sensory neurons that share the same receptor converge onto the same second-layer neuron. “They’re very choosy,” Yang says. “They don’t receive any input from neurons expressing other receptors.” Because it has fewer neurons than the first layer, this part of the network is considered a compression layer. These second-layer neurons, in turn, signal to a larger set of neurons in the third layer. Puzzlingly, those connections appear to be random.
For Yang, a computational neuroscientist, and Columbia University graduate student Peter Yiliu Wang, this knowledge of the fly’s olfactory system represented a unique opportunity. Few parts of the brain have been mapped as comprehensively, and that has made it difficult to evaluate how well certain computational models represent the true architecture of neural circuits, they say.
Building an artificial smell network
Neural networks, in which artificial neurons rewire themselves to perform specific tasks, are computational tools inspired by the brain. They can be trained to pick out patterns within complex datasets, making them valuable for speech and image recognition and other forms of artificial intelligence. There are hints that the neural networks that do this best replicate the activity of the nervous system. But, says Wang, who is now a postdoctoral researcher at Stanford University, differently structured networks could generate similar results, and neuroscientists still need to know whether artificial neural networks reflect the actual structure of biological circuits. With comprehensive anatomical data about fruit fly olfactory circuits, he says: “We’re able to ask this question: Can artificial neural networks truly be used to study the brain?”
Collaborating closely with Columbia neuroscientists Richard Axel and Larry Abbott, Yang and Wang constructed a network of artificial neurons comprising an input layer, a compression layer, and an expansion layer—just like the fruit fly olfactory system. They gave it the same number of neurons as the fruit fly system, but no inherent structure: connections between neurons would be rewired as the model learned to classify odors.
The scientists asked the network to assign data representing different odors to categories, and to correctly categorize not just single odors, but also mixtures of odors. This is something that the brain’s olfactory system is uniquely good at, Yang says. If you combine the scents of two different apples, he explains, the brain still smells apple. In contrast, if two photographs of cats are blended pixel by pixel, the brain no longer sees a cat. This ability is just one feature of the brain’s odor-processing circuits, but captures the essence of the system, Yang says.
It took the artificial network only minutes to organize itself. The structure that emerged was stunningly similar to that found in the fruit fly brain. Each neuron in the compression layer received inputs from a particular type of input neuron and connected, seemingly randomly, to multiple neurons in the expansion layer. What’s more, each neuron in the expansion layer receives connections, on average, from six compression-layer neurons—exactly as occurs in the fruit fly brain.
“It could have been one, it could have been 50. It could have been anywhere in between,” Yang says. “Biology finds six, and our network finds about six as well.” Evolution found this organization through random mutation and natural selection; the artificial network found it through standard machine learning algorithms.
The surprising convergence provides strong support that the brain circuits that interpret olfactory information are optimally organized for their task, he says. Now, researchers can use the model to further explore that structure, exploring how the network evolves under different conditions and manipulating the circuitry in ways that cannot be done experimentally.
As we interact with the world, we are constantly presented with information that is unreliable or incomplete – from jumbled voices in a crowded room to solicitous strangers with unknown motivations. Fortunately, our brains are well equipped to evaluate the quality of the evidence we use to make decisions, usually allowing us to act deliberately, without jumping to conclusions.
Now, neuroscientists at MIT’s McGovern Institute have homed in on key brain circuits that help guide decision-making under conditions of uncertainty. By studying how mice interpret ambiguous sensory cues, they’ve found neurons that stop the brain from using unreliable information.
“One area cares about the content of the message—that’s the prefrontal cortex—and the thalamus seems to care about how certain the input is.” – Michael Halassa
The findings, published October 6, 2021, in the journal Nature, could help researchers develop treatments for schizophrenia and related conditions, whose symptoms may be at least partly due to affected individuals’ inability to effectively gauge uncertainty.
Decoding ambiguity
“A lot of cognition is really about handling different types of uncertainty,” says McGovern Associate Investigator Michael Halassa, explaining that we all must use ambiguous information to make inferences about what’s happening in the world. Part of dealing with this ambiguity involves recognizing how confident we can be in our conclusions. And when this process fails, it can dramatically skew our interpretation of the world around us.
“In my mind, schizophrenia spectrum disorders are really disorders of appropriately inferring the causes of events in the world and what other people think,” says Halassa, who is a practicing psychiatrist. Patients with these disorders often develop strong beliefs based on events or signals most people would dismiss as meaningless or irrelevant, he says. They may assume hidden messages are embedded in a garbled audio recording, or worry that laughing strangers are plotting against them. Such things are not impossible—but delusions arise when patients fail to recognize that they are highly unlikely.
Halassa and postdoctoral researcher Arghya Mukherjee wanted to know how healthy brains handle uncertainty, and recent research from other labs provided some clues. Functional brain imaging had shown that when people are asked to study a scene but they aren’t sure what to pay attention to, a part of the brain called the mediodorsal thalamus becomes active. The less guidance people are given for this task, the harder the mediodorsal thalamus works.
The thalamus is a sort of crossroads within the brain, made up of cells that connect distant brain regions to one another. Its mediodorsal region sends signals to the prefrontal cortex, where sensory information is integrated with our goals, desires, and knowledge to guide behavior. Previous work in the Halassa lab showed that the mediodorsal thalamus helps the prefrontal cortex tune in to the right signals during decision-making, adjusting signaling as needed when circumstances change. Intriguingly, this brain region has been found to be less active in people with schizophrenia than it is in others.
Study authors (from left to right) Michael Halassa, Arghya Mukherjee, Norman Lam and Ralf Wimmer.
Working with postdoctoral researcher Norman Lam and research scientist Ralf Wimmer, Halassa and Mukherjee designed a set of animal experiments to examine the mediodorsal thalamus’s role in handling uncertainty. Mice were trained to respond to sensory signals according to audio cues that alerted them whether to focus on either light or sound. When the animals were given conflicting cues, it was up to them animal to figure out which one was represented most prominently and act accordingly. The experimenters varied the uncertainty of this task by manipulating the numbers and ratio of the cues.
Division of labor
By manipulating and recording activity in the animals’ brains, the researchers found that the prefrontal cortex got involved every time mice completed this task, but the mediodorsal thalamus was only needed when the animals were given signals that left them uncertain how to behave. There was a simple division of labor within the brain, Halassa says. “One area cares about the content of the message—that’s the prefrontal cortex—and the thalamus seems to care about how certain the input is.”
Within the mediodorsal thalamus, Halassa and Mukherjee found a subset of cells that were especially active when the animals were presented with conflicting sound cues. These neurons, which connect directly to the prefrontal cortex, are inhibitory neurons, capable of dampening downstream signaling. So when they fire, Halassa says, they effectively stop the brain from acting on unreliable information. Cells of a different type were focused on the uncertainty that arises when signaling is sparse. “There’s a dedicated circuitry to integrate evidence across time to extract meaning out of this kind of assessment,” Mukherjee explains.
As Halassa and Mukherjee investigate these circuits more deeply, a priority will be determining whether they are disrupted in people with schizophrenia. To that end, they are now exploring the circuitry in animal models of the disorder. The hope, Mukherjee says, is to eventually target dysfunctional circuits in patients, using noninvasive, focused drug delivery methods currently under development. “We have the genetic identity of these circuits. We know they express specific types of receptors, so we can find drugs that target these receptors,” he says. “Then you can specifically release these drugs in the mediodorsal thalamus to modulate the circuits as a potential therapeutic strategy.”
This work was funded by grants from the National Institute of Mental Health (R01MH107680-05 and R01MH120118-02).