James DiCarlo

Rapid Recognition

DiCarlo’s research goal is to reverse engineer the brain mechanisms that underlie human visual intelligence. He and his collaborators have revealed how population image transformations carried out by a deep stack of interconnected neocortical brain areas — called the primate ventral visual stream — are effortlessly able to extract object identity from visual images. His team uses a combination of large-scale neurophysiology, brain imaging, direct neural perturbation methods, and machine learning methods to build and test neurally-mechanistic computational models of the ventral visual stream and its support of cognition and behavior. Such an engineering-based understanding is likely to lead to new artificial vision and artificial intelligence approaches, new brain-machine interfaces to restore or augment lost senses, and a new foundation to ameliorate disorders of the mind.

More Research

We take for granted our ability to recognize vast numbers of objects rapidly and effortlessly, but this ability is based on a complex network of brain regions. DiCarlo is interested in how this remarkable system works. Our visual system enables us to tell within a fraction of a second whether, for example, a visual scene contains a dog, despite the fact that no two dogs are exactly alike and that the dog’s image on the retina is constantly changing depending on its location, size, pose, and illumination. Somehow, our brains create a representation of “dog-ness” that allows us to recognize an unfamiliar dog based on prior experiences with other dogs. We learn thousands of such categories in early childhood, and we continue to acquire them throughout life.

Using electrophysiological recordings from animals and neuroimaging techniques with animal and human subjects, DiCarlo is studying the patterns of brain activity that underlie our ability to recognize visual objects. In collaboration with McGovern colleague Nancy Kanwisher, DiCarlo has shown that the highest stage of this ventral stream – the inferior temporal (IT) cortex – contains clusters of neurons that respond to similar types of objects. DiCarlo has shown that the brain’s ability to recognize objects under different conditions is altered by experience. As we gain experience with visual objects, the activity of IT neurons and our perception of objects change – pointing to how the ventral stream might “learn” to represent objects in the first place. DiCarlo believes that this ventral stream transforms pixel-based images of the world into patterns of nerve activity that emphasize object identity and discount potentially confusing variables like the object’s position and size.

Biography

Jim DiCarlo joined the McGovern Institute in 2002, and is currently the Peter de Florez Professor of Brain and Cognitive Sciences as well as Director of the MIT Quest for Intelligence. For nearly nine years, DiCarlo was also the head of MIT’s Brain and Cognitive Sciences Department. He received his MD and PhD in Biomedical Engineering from Johns Hopkins University in 1998 and did his postdoctoral work at Baylor College of Medicine from 1998 to 2002. He is a past recipient of a Sloan fellowship, a Pew Scholar Award, and a McKnight Scholar Award.

Honors and Awards

Recent Publications

Cao, R, Wang, J, Lin, C, De Falco, E, Peter, A, Rey, HG et al.. Feature-based encoding of face identity by single neurons in the human amygdala and hippocampus. Nat Hum Behav. 2025; :. doi: 10.1038/s41562-025-02218-1. PubMed PMID:40481217 PubMed Central PMC12240612.

Kar, K, DiCarlo, JJ. The Quest for an Integrated Set of Neural Mechanisms Underlying Object Recognition in Primates. Annu Rev Vis Sci. 2024;10 (1):91-121. doi: 10.1146/annurev-vision-112823-030616. PubMed PMID:38950431 .

Margalit, E, Lee, H, Finzi, D, DiCarlo, JJ, Grill-Spector, K, Yamins, DLK et al.. A unifying framework for functional organization in early and higher ventral visual cortex. Neuron. 2024;112 (14):2435-2451.e7. doi: 10.1016/j.neuron.2024.04.018. PubMed PMID:38733985 PubMed Central PMC11257790.

ALL PUBLICATIONS

Faculty

Principal Research Scientists

Focus Areas

Disorder Areas

Rapid Recognition

More Research

Biography

Honors and Awards

A visual pathway in the brain may do more than recognize objects

When computer vision works more like a brain, it sees more like people do

Key Publications

Recent Publications