Evidence for efficient inverse graphics in the human brain using large-scale ECoG data

Poster Presentation 63.418: Wednesday, May 22, 2024, 8:30 am – 12:30 pm, Pavilion
Session: Face and Body Perception: Models

Daniel Calbick1, Hakan Yilmaz1, Greg McCarthy1, Ilker Yildirim1; 1Yale University

The efficient inverse graphics (EIG) hypothesis states that the initial few hundred milliseconds of visual processing unfold a cascade of computations that reverse generative models of how scenes form and project to images, thereby mapping retinal inputs to three-dimensional scene percepts. Here we ask: Is EIG a computational motif of face processing shared between non-human primates and the human brain? In two key domains of vision, the perception of faces and bodies, EIG models have been shown to closely recapitulate processing hierarchies in macaque inferotemporal cortex. These studies used representational similarity analysis (RSA) and high spatio-temporal resolution recordings of single neurons, enabling effective model evaluation. To test EIG in the human brain, we turned to a large-scale electrocorticography (ECoG) dataset from epilepsy patients (N=60). All patients passively viewed a common screening task with faces and other stimulus categories. In addition, each patient viewed a faces-only stimulus set, with different subsets of patients seeing one of several distinct stimulus sets. Analyzing these data is challenging due to the idiosyncratic distribution of electrodes across patients and the differences across stimulus sets. We address these challenges with a data-driven pipeline for functional and anatomical identification and registration of electrodes in ECoG: we exploit the shared screening task to identify face-informative electrodes in each patient and register electrodes to regions of interest (ROIs; e.g., fusiform face area). Using RSA, we find that the similarity structure of EIG on a given stimulus set significantly correlates with the empirical similarity matrices computed from the stimulus-evoked time-frequency patterns of electrodes in individual patients. We confirm the significance of this correlation against a null distribution obtained by randomly shuffling the data. Finally, EIG explains the time-frequency patterns most strongly between 200 and 400 ms after stimulus onset. These results suggest EIG as a shared motif of visual computation across non-human primates and humans.
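To make the RSA and shuffle-based significance test concrete, the sketch below illustrates one way such an analysis could be set up. It is a minimal, hypothetical example, not the authors' implementation: the function names, the choice of correlation distance for the dissimilarity matrices, the Spearman correlation between model and neural RDMs, and the permutation count are all illustrative assumptions.

```python
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr


def rdm(features):
    """Condensed representational dissimilarity matrix:
    pairwise correlation distance between stimulus feature vectors."""
    return pdist(features, metric="correlation")


def rsa_with_permutation(model_features, neural_responses, n_perm=1000, seed=0):
    """Correlate a model RDM with a neural RDM and estimate significance
    by shuffling stimulus labels (an illustrative sketch).

    model_features   : (n_stimuli, n_model_units), e.g., EIG-layer activations
    neural_responses : (n_stimuli, n_neural_features), e.g., stimulus-evoked
                       time-frequency power from face-informative electrodes
                       of one patient or ROI
    """
    rng = np.random.default_rng(seed)
    model_rdm = rdm(model_features)
    neural_rdm = rdm(neural_responses)
    observed, _ = spearmanr(model_rdm, neural_rdm)

    # Null distribution: permute the stimulus rows of the neural data,
    # which shuffles the pairing between model and neural dissimilarities.
    n_stim = neural_responses.shape[0]
    null = np.empty(n_perm)
    for i in range(n_perm):
        perm = rng.permutation(n_stim)
        null[i], _ = spearmanr(model_rdm, rdm(neural_responses[perm]))

    p_value = (np.sum(null >= observed) + 1) / (n_perm + 1)
    return observed, p_value
```

Shuffling stimulus labels preserves the marginal structure of the neural dissimilarities while destroying their correspondence with the model, so the resulting null distribution tests only the model-to-brain alignment.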

Acknowledgements: Yale Center for Research Computing