Dynamic face-related eye movement representations in the human ventral pathway

doi:10.21203/rs.3.rs-5835383/v1

Dynamic face-related eye movement representations in the human ventral pathway

2025 · doi:10.21203/rs.3.rs-5835383/v1

preprint OA: closed

Full text JSON View at publisher

Full text 140,049 characters · extracted from preprint-html · click to expand

Dynamic face-related eye movement representations in the human ventral pathway | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Dynamic face-related eye movement representations in the human ventral pathway Lihui Wang, Zhongbin Su, Xiaolin Zhou, Stefan Pollmann This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-5835383/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 24 Nov, 2025 Read the published version in Communications Biology → Version 1 posted You are reading this latest preprint version Abstract Multiple brain areas along the ventral pathway have been known to represent face images. Here, in a magnetoencephalography (MEG) experiment, we show dynamic representations of face-related eye movements in the ventral pathway in the absence of image perception. Participants followed a dot presented on a uniform background, the movement of which represented gaze tracks acquired previously during their free-viewing of face and house pictures. We found a dominant role of the ventral stream in representing face-related gaze tracks, starting from the orbitofrontal cortex (OFC) and anterior temporal lobe (ATL), and extending to the medial temporal and ventral occipitotemporal cortex. Our findings show that the ventral pathway represents the gaze tracks used to explore faces, by which top-down prediction of face category in OFC and ATL may guide, via the medial temporal cortex or directly, face perception in the ventral occipitotemporal cortex. Biological sciences/Neuroscience/Cognitive neuroscience Biological sciences/Neuroscience/Visual system/Object vision eye movements human ventral pathway object representation face magnetoencephalography (MEG) Figures Figure 1 Figure 2 Figure 3 Figure 4 Introduction Ventral occipitotemporal cortex is well known for its role in high-level object perception 1–6 . In particular, the fusiform face area (FFA) is activated by face images 7 and the parahippocampal place area (PPA) by house images 8 . In a recent study, however, it was shown that FFA and PPA exhibited distinct neural activation patterns to face- and house-related gaze tracks, elicited in the absence of face or house image perception 9 . In this study, participants followed a sequence of dots on a uniform background with eye movements, where the dot sequence replayed gaze tracks previously recorded during face or house viewing. The face- and house-related gaze tracks could be decoded by the activation patterns in the FFA and PPA, thus indicating category-specific representations of gaze tracks in areas that were known to be activated by the respective image categories. Furthermore, the category-selective activation patterns were more sensitive to self-generated gaze tracks than gaze tracks generated by other observers 9 , in line with known individual differences in looking at faces 10 . Here, we asked what the function of these gaze-track representations in a high-level perceptual area such as the FFA might be. Multiple areas along the ventral pathway are dedicated to face processing. In addition to the already mentioned FFA, face-responsive patches along the rostrocaudal extent of the temporal lobes as well as in the orbitofrontal cortex have been found in both human and non-human primates 11–13 . Notably, all of these studies investigated the processing of face images, without recourse to eye movements. Recently, however, modulations of neural activity by eye movements have been reported in the orbitofrontal cortex, particularly involving face viewing 14,15 . Moreover, neuronal activity in the medial temporal lobes can be modulated by eye movements, and lesions in this area may lead to changes in eye movement patterns during active sampling of the environment 16,17 . Medial temporal structures also interact with PPA during scene exploration 18 . Taken together with the evidence of gaze-sequence representation in the FFA and PPA 9 , these findings may lead to the hypothesis that the ventral stream, from orbitofrontal cortex via the uncinate fascicle to anterior temporal cortex and, via medial temporal cortex or directly, further to ventral occipitotemporal cortex, might be involved in representing face-specific gaze tracks. However, how might face-specific gaze track representations be processed along the ventral stream? A top-down hypothesis is that eye movement sequences, e.g., during face viewing, initially activate prefrontal cortex, creating a categorical prediction (in this example of the 'face' category) which is fed back via the ventral stream to guide recognition in posterior perceptual areas 19–21 . To test this hypothesis, we investigated the spatiotemporal gradient of the MEG signal changes during face-related (vs. house-related) gaze track following. We expected dynamic signal changes specific to face-related gaze tracks to occur along the ventral stream, from orbitofrontal via anterior temporal and medial temporal cortex to ventral occipitotemporal cortex. Considering the known interindividual differences in the eye movement patterns of faces 10 , we also expected that following self-generated gaze tracks would optimally stimulate face-specific neural representations 9 , leading to earlier and stronger MEG signal changes than other-generated gaze tracks. Top-down processing in the prefrontal cortex has been particularly observed when there is a lack of rich and unambiguous visual information 20,22 , as is the case in our dot-following task. Specifically, the categorical prediction in the prefrontal cortex was activated by ambiguous visual information in early visual areas 20 . To test this constraint, we presented actual images of faces and houses in the MEG scanner as a control condition. In contrast to the dot-following task, here, we expected a dominance of feedforward processing along the ventral pathway. Results The study consisted of a behavioral experiment and an MEG experiment, with a one-week interval between the two experiments. In the behavioral experiment (Fig. 1a), gaze tracks of all participants were recorded while they were looking at images of faces and houses (see Table S1 in Supplementary Information for gaze parameters). In the MEG experiment, participants followed dot sequences of their own (self-face, SF; self-house, SH) or another participant’s gaze tracks (other-face, OF; other-house, OH) with eye movements (Gaze Session, Fig. 1b). After the Gaze Session, participants took part in an Image Session where they viewed images of faces and houses while maintaining the eyes on a central fixation point (Fig. 1c). The behavioral performance is presented in Behavioral results of Supplementary Information. Distinct patterns between face- and house-related gaze tracks We first analyzed if the face- and house-related fixation sequences obtained in the behavioral experiment showed distinct patterns. A machine-learning classification analysis over the spatiotemporal parameters of the fixations ( x , y coordinates, and fixation duration) showed a high prediction accuracy in discriminating the two categories of gaze sequences, 70.5 ± 11.4% (M ± SD), with above-chance significance ( p < 10 5 ; permutation-based significance testing). The same analysis was also performed on the eye movements collected in the MEG-Gaze Session, yielding a high prediction accuracy in discriminating between SF and SH, 66.3 ± 10.2%, p < 10 5 , and between OF and OH, 66.5 ± 9.7%, p < 10 5 . We also performed cross-experiment classifications where the classifier was trained with the gaze patterns from the behavioral experiment and was used to predict the gaze patterns in the Gaze Session of the MEG experiment. The above-chance cross-experiment prediction accuracies confirmed that the distinct patterns of the online eye movement in the Gaze Session were related to the face vs. house categories in the behavioral experiment. Moreover, the cross-experiment prediction accuracies were higher for self-generated gaze tracks than for other-generated gaze tracks (Fig. 1e), indicating that participants followed their own gaze tracks better (see Cross-experiment classification of gaze patterns in Supplementary Information for the statistics). Neural face-related gaze pattern representations To reveal the face-related gaze representations, we compared the MEG signals of face-related gaze tracks with the MEG signals of house-related gaze tracks. Here the house-related gaze tracks were taken as a control for face-related gaze tracks because there were eye movements and visual stimulation but a lack of a structural pattern 23 (for statistical evidence, see The structural pattern of face-related gaze trac ks in Supplementary Information). For each condition (SF, SH, OF, OH) in the Gaze Session, the ERF signal from 0-1500 ms relative to the onset of the gaze track was calculated. The difference in the estimated cortical current maps was calculated between the following conditions: ‘SF – SH’, ‘OF – OH’, and ‘(SF – SH) – (OF – OH)’. These results would reveal the temporal development of the brain networks involved in the face-related gaze tracks, respectively areas that were sensitive to self-generated face-gaze tracks. The face-related gaze tracks elicited stronger ERF signals than the house-related gaze tracks in the orbitofrontal cortex (OFC) and the ventral anterior temporal lobe (ATL) extending to the medial temporal lobe (‘SF – SH’ and ‘OF – OH’ in Fig. 2a, Bonferroni-corrected for time and spatial clustering; see also Extended Data Video 1-4). Although small, there are also signal differences reaching back to the occipital cortex (at 600ms and 1400ms, depending on the contrast). Importantly, the activities in the OFC and ATL emerged earlier than the activities in the medial temporal lobe and occipital cortex, suggesting a top-down prediction of the face category guided by the face-related gaze tracks. Moreover, this network was more active during the following of self-generated gaze tracks than another observer’s gaze tracks, as revealed by the interaction contrast ‘(SF – SH) – (OF – OH)’ (Fig. 2a and Extended Data Video 5). The source reconstruction was dominantly localized in the ventral stream, with the notable exception of dorsal stream areas in frontal cortex, including the frontal eye field (FEF) and supplementary eye field (SEF), in the interaction contrast (Fig. 2b and Extended Data Video 6). The whole-brain source reconstruction was also performed based on the signal difference between Face and House in the Image Session. In contrast to the Gaze Session, here the strongest signal difference was localized in the posterior occipitotemporal areas, beginning already 200 ms post-stimulus onset (Fig. 3 and Extended Data Video 7-8). Dynamic spatiotemporal patterns of the gaze-related representations in the brain In order to provide statistical evidence for the temporal order of signal development observed in the event-related magnetic field analysis, we tested if there was an information flow from the anterior areas (e.g., OFC and ATL) to the posterior areas (e.g., FFA) during gaze following. We performed spatial gradient analysis, which quantified how the MEG signals gradually changed along a specific dimension (e.g., anterior-posterior) in the brain space 24 . Here, the top-down hypothesis of the gaze-track representations predicted information flow from the anterior areas to the posterior areas along the ventral pathway. This can be probed with the gradually decreased activity from the anterior areas to the posterior areas, in particular, how the gradient became face-selective during the gaze following. As an area or neural network with stronger signals would be faster to exceed the neural threshold of maintaining sensory selectivity or perceptual preference 25 , the stronger signal changes in the anterior areas indicated that the face-selective representation of the gaze tracks emerged earlier than the posterior areas. To test our hypothesis, here the spatial gradient was analyzed based on the ERF signal differences (e.g., ‘SF – SH’, ‘OF – OH’) to show how the signal changed at the anterior-posterior dimension. The analysis was performed at each time point during gaze following to show how the gradient pattern became face-selective over time. To provide a complete gradient pattern at the whole-brain level, the analysis was also performed for the dorsal-ventral and left-right dimensions. For each of the three dimensions ( x , y , z , hence left-right, anterior-posterior, dorsal-ventral dimension), we modeled the coordinates with the ERF difference at each time point 24 . The R 2 of the model was calculated to assess the accounted variance. The first-order derivatives of the estimated model were calculated to test if the spatial gradient increased or decreased monotonically along a specific dimension. As shown in Fig. 4a (left), the ERF signal difference between SF and SH showed a significant gradient pattern along the y and z dimensions, cluster-based permutation correction at p < 0.05 (see Fig. 4a for the significant time ranges), whereas the x dimension did not reach significance (no time ranges reached significance). The significant gradient pattern that emerged over time during the gaze following (i.e., significantly higher R 2 than baseline) indicated that the gradient pattern was not due to the general intrinsic brain dynamics but rather to the face-specific gaze following. Along both the y and z dimensions, the estimated model showed a monotonic characteristic, with 97.5% of the derivative values > 0 at the time point with the strongest gradient pattern along the y dimension and 98.9% of the derivative values 0 along the y dimension and 97.7% of the derivative values < 0 along the z dimension (Fig. 4a right, Supplementary Fig. S2). These results indicated that the signal difference between SF and SH, and between OF and OH, decreased along the anterior-to-posterior axis (Fig. 4b upper row), and decreased along the ventral-to-dorsal axis of the brain (Fig. 4b lower row), whereas there was no difference between the left and right hemispheres. Taking the results at the anterior-posterior and the ventral-dorsal dimensions together, these findings provided statistical evidence for the temporal sequence observed in the event-related magnetic field analysis, showing that the representations of face-related gaze tracks dynamically progressed along the ventral pathway, starting from the ventral anterior areas (e.g. OFC and ATL) via medial temporal lobe (MTL) to ventral occipitotemporal cortex. The spatial patterns at the time point where the gradient reached its peak are shown in Fig. 4b (left: ‘SF – SH’, R 2 peaked at 910 ms along the y dimension and at 670 ms along the z dimension; right: ‘OF – OH’, R 2 peaked at 1485 ms along the y dimension and at 650 ms along the z dimension). Importantly, the spatial gradient emerged earlier for ‘SF – SH’ than the spatial gradient for ‘OF – OH’ along the y dimension (i.e., the posterior-to-anterior dimension), mean latency difference = -310 ms, 95% CI = [-460 ms, -165 ms], but not along the z dimension mean latency difference = 20 ms, 95% CI = [-360 ms, 450 ms] (Fig. 4e). Together with the cross-experiment classifications of gaze patterns, these results indicated that self-generated gaze tracks were more sensitive than other-generated gaze tracks to activate the face-selective neural representations. In the Image Session, the ERF signal difference between Face and House also showed a significant gradient pattern along the y and z dimensions, cluster-based permutation at p < 0.05, whereas the x dimension did not reach significance (Fig. 4c). Along both the y and z dimensions, the estimated model showed a monotonic characteristic, with 99.4% of the derivative values < 0 along the y dimension and 82.3% of the derivative values < 0 along the z dimension (Supplementary Fig. S2). The spatial patterns at the time point where the gradient reached its peak are shown in Fig. 4d ( R 2 peaked at 260 ms along the y dimension and at 710 ms along the z dimension). While the gradient pattern along the z dimension was consistent with the Gaze Session (Fig. 4c), with signal difference decreasing from the ventral to the dorsal part of the brain (Fig. 4d, lower row), the gradient pattern along the y dimension was reversed, with signal difference decreased from the posterior to the anterior part of the brain (Fig. 4d, upper row). Importantly, the reversed gradient pattern along the y dimension between the Gaze Session and the Image Session again indicated that the observed gradient pattern was not due to the general intrinsic brain dynamics along the ventral pathway, but rather reflected the feedback-dominant vs. feedforward-dominant processing specific to the current task (i.e., gaze following vs. image processing). The reversed pattern along the anterior-posterior direction between the Gaze Session and the Image Session was further confirmed by the statistical evidence that the spatial gradient along the y dimension showed a negative correlation between the two sessions, r = -0.99, 95%CI = [-0.994, -0.957], p < 0.001 at the peak time point for ‘SF – SH’, and r = -0.98, 95%CI = [-0.991, -0.934], p < 0.001 at the peak time point for ‘OF – OH’ (Fig. 4f, left). By contrast, the spatial gradient along the z dimension showed a positive correlation between the two sessions, r = 0.90, 95%CI = [0.863, 0.951], p < 0.001 at the peak time point for ‘SF – SH’, and r = 0.92, 95%CI = [0.856, 0.953], p < 0.001 at the peak time point for ‘OF – OH’ (Fig. 4f, right). Collectively, the reversed pattern along the anterior-posterior direction between the Gaze Session and the Image Session suggested a combination of feedback and feedforward processing in natural face perception (i.e., when we look at a face using eye movements). Discussion We have shown that face-related gaze tracks were dynamically represented along the ventral pathway, from OFC via ATL, to MTL and ventral occipitotemporal cortex. During the gaze following, there was a gradient pattern along the ventral stream, with face-selective activity progressing from OFC to the occipitotemporal cortex. However, when actual images of faces and houses were presented, the reverse gradient was observed, face-selective activity progressing from the ventral posterior occipitotemporal cortex to the prefrontal cortex. Taken together, our findings show that the ventral pathway represents aspects of the eye-movement program used to explore faces. The fixation sequences may help to form a top-down prediction of face category in OFC and ATL to guide, via the MTL or directly, the perceptual representation of faces in ventral occipitotemporal cortex, particularly under demanding viewing conditions. The brain areas that we found representing categorical face-related gaze tracks, from OFC via ATL to ventral occipitotemporal cortex have previously been found to support face perception, both in human and non-human primates 11–13 . Importantly, we found these areas to represent face-specific gaze patterns in the absence of face or house images, indicating that not only visual features, but also information about category-specific gaze sequences are represented, like fixation locations and their temporal sequence. In addition, the representation of face-related gaze sequences was particularly early and strong when gaze sequences that were followed were generated by the same participant during actual face viewing. This pattern is in line with the stable interindividual differences in gaze patterns that can be found across different viewing conditions 10 . Face-specific activity was observed earliest in OFC and ATL, spreading backwards along the ventral stream. Although we do not know about previous reports of eye movement processing in human orbitofrontal cortex, modulation of orbitofrontal activity by eye movements has recently been reported, particularly during looking at faces 14,15 . OFC and ATL, connected via the uncinate fasciculus, are known to be vital for social interaction, with lesions in this network leading to the behavioral variant of frontotemporal dementia 26,27 . Given the importance of face perception - including perception of facial expressions - for social interaction, it is not astonishing to find representations of face-specific gaze sequences in these areas. The early occurence of face-specific gaze representation in OFC and ATL in the ventral face processing stream may suggest that the processing of face-specific gaze patterns is vital for social interaction. It may thus be worthwhile to investigate if face-specific gaze sequences break down in degenerative diseases affecting OFC and ATL, like frontotemporal dementia. Face-related gaze sequence activation spread further to the medial temporal lobe, which has previously been shown to be vital for the exploration of the environment with eye movements, particularly, but not only, in memory-guided vision 28–30 . Moreover, during face viewing, activation in the hippocampal as well as in the fusiform face area was modulated by the number of fixations 31 and during scene viewing, functional connectivity between the hippocampus and the PPA was enhanced during free viewing (versus forced central fixation) 18 . The MTL is connected to orbitofrontal and temporopolar cortex via the perirhinal cortex 32 . Thus, in the context of face viewing, information from OFC and ATL about the highly structured (T-shaped) fixation pattern may elicit a memory trace of the 'face'-category in the hippocampus. This, however, needs further investigation. If OFC and ATL are first to represent face-specific gaze patterns, how does the information about fixation patterns arrive at these areas in the first place? In normal looking behaviour, this may be answered by our finding that during the presentation of actual face images, faces and houses can be discriminated early and most strongly in occipitotemporal cortex, spreading fast to anterior brain areas. Thus, during free viewing of a face, there will be an interaction of feedforward and feedback signals supporting perception. Our data, in line with previous reports, suggest that information about face-specific gaze sequences may aid face perception particularly in the absence of rich and unambiguous visual information 19,20,22 . A recent MEG study also showed that the ventral prefrontal cortex guides the construction of low-dimensional categorical prediction from the high-dimensional visual information in occipitotemporal cortex 21 . Taken together, following the gaze tracks may activate category-predictive processes in OFC and ATL, sending feedback signals via the MTL or directly to ventral occipitotemporal face patches including the FFA to facilitate face perception. Recently, it has been emphasized that categorical object representation in the brain may be linked to the object's behavioral relevance 33 , in line with models of perception-action coupling 34–36 . Given that eye movements are mostly generated fast and without conscious control 37 , representations of category-specific gaze sequences as part of an object's neural representation may be a natural case of object representation including relevant behavior, at least for object categories with structured gaze sequences, such as for faces. It may seem puzzling that we found gaze-specific activation patterns for faces mainly in areas of the ventral stream, whereas the dorsal stream's importance for eye movement control is well known 38–41 . We also could discriminate face versus house-related gaze following in dorsal brain areas, particularly in the frontal eye fields and the superior frontal cortex, known to support attentional control 42 , as well as in left frontopolar cortex, known to support exploratory attentional resource allocation 43 . Interestingly, this was only the case for self-generated dot following, ruling out that these activation patterns were simply due to differences in basic eye movement parameters like saccade amplitudes. Nevertheless, dorsal stream activation differences for face versus house-related gaze following were much less than in ventral stream areas. This may be due to the nature of our contrasts, which asked for a categorical (‘face – house’) distinction between the associated gaze sequences. This categorical distinction may be more associated with the known capabilities of the ventral stream in object categorization than with the visuomotor control functions associated with the dorsal stream 44 . Conclusion A ventral network of brain areas known to support face processing, reaching from orbitofrontal and anterior temporal cortex via medial temporal to ventral occipitotemporal cortex, was found to represent face-related gaze sequences. During gaze following, activation in this network followed an anterior-to-posterior gradient, indicating feedback from anterior areas to ventral occipitotemporal perceptual areas, possibly using gaze patterns to support face perception. Our findings add an ideomotor perspective to the ventral stream function and the reconsideration of both ventral and dorsal streams in representing eye movements. The face-related eye movement representations in OFC and ATL suggest the role of the frontotemporal network in gaze control during social interaction. Methods Participants The sample size was decided with G-Power 3.0 45 based on a previous study that examined the capabilities of MEG signals in decoding eye movement patterns 46 . Given a correlation coefficient between eye movement and MEG patterns = 0.86, alpha = 0.0001 reported in this study, a sample size of 27 is required to achieve an expected power of 99%. Following this criterion while considering potential exclusion, 32 university students (19 females, 13 males, mean age 20.8 years old) were recruited in the present study. All participants reported normal or corrected to normal vision. Informed written consent was obtained prior to the experiment. One participant was excluded due to his drop-out from the MEG experiment, resulting in 31 participants (19 females, 12 males, mean age 20.6 years old). This study was conducted in accordance with the Declaration of Helsinki, and was approved by the ethics committee of the local university. Design and procedure Each participant went through a behavioral experiment (Figure 1a) and an MEG experiment, with a one-week interval between the two experiments. In the behavioral experiment, we collected the gaze patterns of all participants while they were looking at faces and houses. These gaze patterns were then presented in the Gaze Session of the MEG experiment, and participants had to follow the gaze patterns with eye movements. In the behavioral experiment, stimuli were images of faces and houses presented on a black background of a computer screen. For each participant, the images of faces and houses presented during the experiment were randomly chosen from an image set (20 male faces, 20 female faces, and 40 houses). The size of the face images was fixed at a width of 14.4° × height of 16.7° of visual angle, with an eye-to-mouth distance of 7°. Due to the varying structures, the size of houses was not constant, with a mean width of 19.3° ± 1.0°, and a height of 13.1° ± 2.4°. Participants were required to complete an N-back task while looking at each of the pictures 9 . At the beginning of each trial, a green dot (0.2° of visual angle in diameter) was presented at one of the four corners (15° from the center of the screen) to attract eye fixation. The green dot was presented for a varying interval of 1200-2000 ms. In 20% of the trials, a small black dot (0.05° in diameter) was presented at the center of the green dot for 100 ms. Participants were asked to detect the black dot by pressing the ‘z’ button using the left index finger on a standard keyboard. The onset of this small black dot was randomly chosen from the time point during the presentation of the green dot. After the offset of the green dot, a face or house picture was presented at the center of the screen and remained on the screen for 1500 ms. Participants were instructed to look at the images with free eye movements. There were 14 blocks of trials in the experiment. In the first block, a set of 6 face images (3 males and 3 females) and 6 house images were presented, one per trial, in a random order. In each of the following 13 blocks, 1-3 new images (either face or house) were added into the original 12 images. Participants were asked to memorize the images in the first block, and detect if a new image was presented in the following 13 blocks by pressing the ‘m’ button using the right index finger. In the MEG experiment, stimuli were presented through an LCD projector onto a rear screen located in front of the scanner. There were two sessions in the MEG experiment: a Gaze Session and an Image Session. In the Gaze Session, each participant was asked to follow dots that represented his/her own gaze patterns as well as dots that represented another participant’s gaze patterns. Each trial started with a red dot on a grey background, which remained at the center of the screen for 1400-2000ms. After a blank screen of a jittered interval (450 – 650 ms), the gaze track was presented on the screen, in the form of a sequence of green dots. The sequence of green dots represented the gaze pattern for a specific picture obtained from the behavioral experiment, and each dot represented a fixation of the gaze pattern. Given that the gaze patterns were collected during the 1500ms-time range of the picture presentation, the dot sequence lasted approximately 1500 ms on the screen. In 20% of the trials, a small black dot (0.05° in diameter) was presented at the center of the central red dot or the moving green dot (with equal probabilities). This small black dot was presented for 100 ms, and participants were asked to detect the black dot by pressing the button using the right index finger. According to our design, four categories of gaze tracks were presented: face-related gaze tracks from the current observer (self-face, SF), house-related gaze tracks from the current observer (self-house, SH), face-related gaze tracks from another participant (other-face, OF), and house-related gaze tracks from another participant (other-house, OH). There were 10 blocks of gaze-tracks, with 40 trials (10 trials per condition) in each block. Trials from the 4 conditions were mixed and presented in a random order. At the end of each block, a feedback screen was presented to inform the participants’ performance in detecting the small black dots. In the Image Session, participants were asked to view successively presented images in a one-back task. Images were grouped into 10 blocks of faces and 10 blocks of houses. The two block types were presented in a random order. Each block started with a central fixation (a green cross) at a varying interval of 1000-1600 ms. Then images of the same category (face or house) were successively presented (each lasted for 1000 ms), with a jittered interval of 300-600 ms between each two images. The central fixation was presented throughout the block, and participants were required to maintain their eyes on the central fixation. In each block, 10-11 images were presented with one or two images that were repeated immediately after their first presentation. Participants were asked to detect the immediate repetition of the image by button press. Apart from the immediate repetition, there were no other repetitions of images in each block. In total, each participant viewed 100-110 images of each category. There was a ~30s break between each two blocks. For both the behavioral experiment and the MEG experiment, eye-movement data were recorded during the experiment with an EyeLink 1000 plus system (SR-Research, Canada), at an online sampling rate of 1000 Hz. A standard procedure of nine-point calibration and validation was performed at the beginning of the experiment, with a maximum error of 1.0° as the threshold. A drift check was performed at the beginning of each block, and the calibration and validation were performed if the error of the drift check exceeded the threshold (i.e., > 1.0°). MEG data acquisition and preprocessing Neuromagnetic signals were recorded using a whole-head MEG system, with 204 planar gradiometers and 102 magnetometers (Eleka Neuromag TRIUX) in a magnetically shielded room. Four head position indication (HPI) coils were placed in each participant’s head to estimate head position during recording, with two coils in left and right mastoids and two on the forehead. The raw MEG signals were online sampled at 1000Hz and were band-pass filtered between 0.1 and 330 Hz. The structural MRI of each participant was obtained using a 3T Siemens Prisma MR scanner. The MRI scanning was conducted on a different day after the MEG experiment. Head shapes were quantified using the Probe Position Identification system (Polhemus), and three anatomical landmarks (nasion, left and right pre-auricular points) were used to co-register the MEG data with MRI coordinates. Max-filter was used to reduce external noise and compensate for head movements (temporal signal space separation method, tSSS 47 ). The offline pre-processing analysis of MEG data was performed using Brainstorm 48 . The continuous MEG data was first down-sampled to 200 Hz. Then the MEG data was band-pass filtered (0.1Hz to 60Hz, zero phase shift FIR filter) and notch filtered at 50 Hz. Independent component analysis (ICA) was used to detect and discard artifacts related to eye blinks, head movements and heat beats. The data were then epoched with the time interval of -500 to 1500 ms relative to the onset of the first fixation in the Gaze Session and with the time interval of -200 to 1000 ms relative to the onset of the image in the Image Session. Analysis of eye-movement data In the behavioral experiment, eye-movement data were extracted from the 1.5-s image presentation. Data were preprocessed using the cili module, a python-based tool for detecting and correcting eye blinks. Eye blinks were firstly removed, and fixations were identified based on the velocity threshold of 30 °/s and the acceleration threshold of 8000°/s 2 . Trials without any valid fixation events, and trials with fixation localized beyond the region of the picture were also excluded. To prepare the gaze tracks in the MEG experiment, following the previous study 9 , a fixation was identified as a gaze event if its duration was longer than or equal to 100 ms, while identified as a non-gaze event if its duration was shorter than 100 ms. This non-gaze event was represented by a blank screen in the Gaze Session of the MEG experiment. Then, trials with less than two gazes were excluded. The gaze coordinates were proportionally transformed and co-registered with the screen resolution in the MEG scanner. In the Gaze Session of the MEG experiment, the online fixation events were also identified and analyzed. Multivariate classifications were performed on the gaze features to show the distinct patterns between categories. The classification analysis was performed using the scikit-learn package (http://github.com/scikit-learn). Three features were included: the x , y coordinates, and the duration of each gaze. The fixation data was parsed in the way that 80% of the data was included as the training set and 20% of the data as the test set. A linear support vector (SVM) classifier was trained and cross-validated based on the saccadic features of the two categories (Face vs. House). The classification was performed for each participant, rendering both individual-level prediction accuracies and the group mean of the accuracies. Permutation-based testing was conducted to assess the statistical significance. For each participant, the classifier was trained with randomly shuffled labels of the two categories, and a permuted accuracy was calculated. This procedure was repeated 100 times, rendering a set of 100 chance accuracies for each participant. For group-level statistical testing, one chance accuracy was selected from each participant and the individual chance accuracies were averaged into a group chance accuracy. This procedure was repeated 10 5 times, resulting in a set of 10 5 group chance accuracies. Significance testing was performed by calculating the probability of the unpermuted group mean accuracy across participants in the distribution of the group chance accuracies (one-tailed). The classification was performed both for the fixation data in the behavioral experiment (Face vs. House) and the fixation data in the Gaze Session of the MEG experiment (SF vs. SH, OF vs. OH). Note that the SF vs. SH and OF vs. OH classifications in the Gaze Session were not strictly specific to the Face vs. House distinction because no face or house images were presented. To show the specificity, cross-experiment classifications were performed where the fixation patterns in the behavioral experiment were used to train the classifier (Face vs. House), which was then used to predict the fixation categories in the Gaze Session (SF vs. SH, OF vs. OH). Importantly, to assess the sensitivity of the distinct fixation patterns, the cross-experiment classification was performed by varying the number of the fixations (i.e., the first fixation, the 1-2 fixations, the 1-3 fixations, and the 1-4 fixations). Multiple comparisons were corrected with Bonferroni methods. Representational distance 3 was calculated to assess if the fixation data had a consistent structure specific to the visual category in the Gaze Session. Specifically, the representational distance was quantified by the Euclidean distance between the fixations within each category, assuming that a lower distance indicates a higher fixation structure 23 . We also calculated the representational distance between the categories as the control. The fixation pattern for a specific category was identified as structural if the representational distance within that category was significantly lower than the between-category representational distance. Event-related magnetic field (ERF) analysis of MEG data After the pre-processing, the epoched data were averaged over the trials for each condition and each participant. Individual T1-weighted MRIs were segmented with the FreeSurfer software package 49 (http://surfer.nmr.mgh.harvard.edu) and then imported to the Brainstorm (https://neuroimage.usc.edu/brainstorm) for further source-level analysis. The white-gray matter boundary segmented by the FreeSurfer was used as a source space for activity estimation in the cortex. After co-registration between the individual anatomy and MEG sensors, the cortical currents were estimated using a distributed model consisting of 15002 current dipoles from the averaged epochs (evoked activities) using a linear inverse estimator (minimum norm current estimation). The density map was standardized using a Z-score transformation with respect to a noise matrix which was calculated with a 2-minute empty-room recording of the MEG signal. The dipole orientation was constrained to the orthogonality of the white-gray matter boundary of the individual MRIs. The difference in the estimated cortical current maps was calculated between the following conditions: ‘SF – SH’, ‘OF – OH’, ‘(SF – SH) – (OF – OH)’. Then the source maps were filtered with a low-pass filter (30Hz), standardized through a z-score baseline normalization (-450 to 0 ms relative to the gaze onset as the baseline, with the first 50 ms of the baseline period being excluded to avoid the edge effect resulted from the low-pass filter), and rectified to retain only absolute values. The source maps were then projected on a standard brain (ICBM152) and spatially smoothed (Full Width at Half Maximum, FWHM=3mm) before group statistical analysis. A two-tailed one-sample Chi 2 test was used for group statistical analysis for each time point and each vertex with the null hypothesis that the difference in variances of the cortical activities between the two conditions was equal to zero 50 . Bonferroni correction was used to solve the multiple comparison problems. The significance threshold was set at p < 0.05 after corrections. To show the brain areas that were involved in the Image Session, the whole-brain source reconstruction was also performed by comparing the ERF signals during face viewing and the ERF signals during house viewing (‘Face – House’). Modeling the gradient of MEG signal in source space To quantify the spatial patterns of MEG signals during the gaze-track following, a five-order polynomial function was used to approximate the data along each of the three spatial dimensions ( x , y and z coordinates for the data in the source space) 24 . For each time point, we employed a polynomial function p(v) = p 0 + p 1 v + p 2 v 2 + p 3 v 3 + p 4 v 4 + p 5 v 5 ( polyfit , MATLAB 2022a) to estimate the coordinates along each spatial dimension with the MEG signal difference between conditions (e.g., ‘SF – SH’, ‘OF – OH’). The amplitude of MEG signal was normalized to z-scores across vertexes to avoid an ill-conditioned Vandermonde matrix in model fitting. For each spatial dimension, the model fitted the signal difference in Z-scored amplitudes of each vertex v to its spatial coordinate (MNI coordinates) across vertices. The quality of the model was quantified by the adjusted R 2 , which determined the proportion of variance explained by the model. R 2 was adjusted by the number of coefficients. To assess the dynamic spatial gradient of the MEG signal difference, a Jackknife method was used to fit the model and calculate R 2 for each of the 3 dimensions. Specifically, one of the participants was excluded and the source-reconstructed MEG signals of the remaining participants were averaged to fit the model. This procedure was iterated across participants. A one-sample t test (one-tail) was used to test if R 2 at each time point was higher than the baseline, the time interval of -500 to 0 ms relative to the stimulus onset. Cluster-based permutation was used to resolve the multi-comparison problem across time points. We also calculated the first-order derivatives of the estimated model to test if the spatial gradient had a monotonic increasing or decreasing pattern along a specific dimension. The calculation was performed on the model at the time point with peak R 2 , and the evaluation of the derivatives was based on the signal range between the minimum and the maximum value of the MEG amplitude. The spatial gradient was identified as monotonically increasing given the derivative values > 0 and monotonically decreasing given the derivative values < 0. To test if the spatial gradient of ‘SF – SH’ emerged earlier than the spatial gradient of ‘OF – OH’, cross correlation ( xcorr , MTALAB 2022a, ‘unbiased’, maxlag = 200) was performed on the two R 2 time courses to calculate the latency difference. The latency difference was defined as the temporal lag with which the R 2 time courses showed maximum correlation between the two time courses across participants. The Bootstrapping method (iteration number = 1000) was used to estimate the 95% confidence interval of latency difference. The same analysis was also performed on the MEG signal difference between Face and House in the Image Session to show the spatial gradient. The analysis was performed on the 0-1000 ms interval during the image presentation (0 denotes the image onset), with the -200-0 ms interval as the baseline. To assess the similarity/dissimilarity of the spatial gradient between the Gaze Session and the Image Session, we performed correlation analyses between the two sessions. Specifically, we performed the model fitting with the group average of source-reconstructed MEG signal difference between conditions (‘SF – SH’ and ‘OF – OH’ for the Gaze Session and ‘Face – House’ for the Image Session). A Bootstrapping method (iteration number = 1000) was used to estimate the variance of R 2 time courses that were calculated between the model and the group average of the MEG data. At the peak time point of R 2 time courses ( y and z , respectively), we projected the fitted function p (v) in the three-dimensional space with the polynomial function. The predicted coordinates of the function p (v) were sorted according to the amplitude of the MEG signal. Then we calculated the Pearson coefficients of the sorted coordinates between the two sessions. Declarations Acknowledgments We thank Dr. Jiayu Zhan and Dr. Liyu Cao for their suggestions on the design of the MEG experiment. This study was supported by the National Natural Science Foundation of China (32271086), and a Mercator Fellowship of the Deutsche Forschungsgemeinschaft (DFG, 450600965) to LW, and a DFG grant (PO548/18-1) to SP. Author contributions Conceptualization, L.W., S.P., Z.S.; Methodology, Z.S., L.W.; Investigation, Z.S., L.W.; Formal Analysis, Z.S., L.W.; Visualization, Z.S.; Writing – Original Draft, S.P., L.W., Z.S.; Writing – Review & Editing, S.P., L.W., Z.S., X.Z.; Supervision, L.W., X.Z.; Funding Acquisition, L.W., X.Z., S.P. Competing interests The authors declare no competing interests. Data and code availability Data and codes have been deposited at OSF, accession code osf.io/vwfxc/?view_only=55feffd4ae034a968da54048b65927f8. References Desimone, R., Albright, T. D., Gross, C. G. & Bruce, C. Stimulus-selective properties of inferior temporal neurons in the macaque. J Neurosci 4 , 2051–2062 (1984). Kiani, R., Esteky, H., Mirpour, K. & Tanaka, K. Object category structure in response patterns of neuronal population in monkey inferior temporal cortex. J. Neurophysiol. 97 , 4296–4309 (2007). Kriegeskorte, N. et al. Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey. Neuron 60 , 1126–1141 (2008). Majaj, N. J., Hong, H., Solomon, E. A. & DiCarlo, J. J. Simple learned weighted sums of inferior temporal neuronal firing rates accurately predict human core object recognition performance. J. Neurosci. 35 , 13402–13418 (2015). Ungerleider, L. G. & Mishkin, M. Two cortical visual systems. in Analysis of visual behavior (MIT press, 1982). Kravitz, D. J., Saleem, K. S., Baker, C. I., Ungerleider, L. G. & Mishkin, M. The ventral visual pathway: An expanded neural framework for the processing of object quality. Trends Cogn. Sci. 17 , 26–49 (2013). Kanwisher, N., McDermott, J. & Chun, M. M. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception. J. Neurosci. 17 , 4302–4311 (1997). Epstein, R. & Kanwisher, N. A cortical representation of the local visual environment. Nature 392 , 598–601 (1998). Wang, L., Baumgartner, F., Kaule, F. R., Hanke, M. & Pollmann, S. Individual face- and house-related eye movement patterns distinctively activate FFA and PPA. Nat. Commun. 10 , 1–16 (2019). Peterson, M. F. & Eckstein, M. P. Individual differences in eye movements during face identification reflect observer-specific optimal points of fixation. Physiol. Sci. 27 , 1216–1225 (2013). Landi, S. M. & Freiwald, W. A. Two areas for familiar face recognition in the primate brain. Science. 357 , 591–595 (2017). Tsao, D. Y., Moeller, S. & Freiwald, W. A. Comparing face patch systems in macaques and humans. Proc. Natl. Acad. Sci. U. S. A. 105 , 19514–19519 (2008). Tsao, D. Y., Schweers, N., Moeller, S. & Freiwald, W. A. Patches of face-selective cortex in the macaque frontal lobe. Nat. Neurosci. 11 , 877–879 (2008). Dal Monte, O. et al. Widespread implementations of interactive social gaze neurons in the primate prefrontal-amygdala networks. Neuron 110 , 2183-2197.e7 (2022). Fan, S., Dal Monte, O., Nair, A. R., Fagan, N. A. & Chang, S. W. C. Closed-loop microstimulations of the orbitofrontal cortex during real-life gaze interaction enhance dynamic social attention. Neuron 112 , 2631-2644.e6 (2024). Voss, J. L. et al. Spontaneous revisitation during visual exploration as a link among strategic behavior, learning, and the hippocampus. Proc. Natl. Acad. Sci. U. S. A. 108 , E402–E409 (2011). Ryan, J. D., Shen, K. & Liu, Z. X. The intersection between the oculomotor and hippocampal memory systems: empirical developments and clinical implications. Ann. N. Y. Acad. Sci. 1464 , 115–141 (2020). Liu, Z.-X., Rosenbaum, R. S. & Ryan, J. D. Restricting Visual Exploration Directly Impedes Neural Activity, Functional Connectivity, and Memory. Cereb. Cortex Commun. 1 , (2020). Summerfield, C. et al. Predictive Codes for Forthcoming Perception in the Frontal Cortex. Science. 314 , 1311–1314 (2006). Bar, M. et al. Top-down facilitation of visual recognition. Proc. Natl. Acad. Sci. U. S. A. 103 , 449–454 (2006). Duan, Y., Zhan, J., Gross, J., Ince, R. A. A. & Schyns, P. G. Pre-frontal cortex guides dimension-reducing transformations in the occipito-ventral pathway for categorization behaviors. Curr. Biol. 34 , 3392-3404.e5 (2024). Freedman, D. J., Riesenhuber, M., Poggio, T. & Miller, E. K. Categorical representation of visual stimuli in the primate prefrontal cortex. Science. 291 , 312–316 (2001). Wang, Z., Meghanathan, R. N., Pollmann, S. & Wang, L. Common structure of saccades and microsaccades in visual perception. J. Vis. 24 , 1–13 (2024). Zalta, A., Large, E. W., Schön, D. & Morillon, B. Neural dynamics of predictive timing and motor engagement in music listening. Sci. Adv. 10 , eadi2525 (2024). Wang, X.-J. Probabilistic Decision Making by Slow Reverberation in Cortical Circuits. Neuron 36 , 955–968 (2002). Neary, D., Snowden, J. S., Northen, B. & Goulding, P. Dementia of frontal lobe type. J. Neurol. Neurosurg. Psychiatry 353–361 (1988). Rouse, M. A., Binney, R. J., Patterson, K., Rowe, J. B. & Lambon Ralph, M. A. A neuroanatomical and cognitive model of impaired social behaviour in frontotemporal dementia. Brain 147 , 1953–1966 (2024). Hannula, D. E., Ryan, J. D., Tranel, D. & Cohen, N. J. Rapid onset relational memory effects are evident in eye movement behavior, but not in hippocampal amnesia. J. Cogn. Neurosci. 19 , 1690–1705 (2007). Ryals, A. J., Wang, J. X., Polnaszek, K. L. & Voss, J. L. Hippocampal contribution to implicit configuration memory expressed via eye movements during scene exploration. Hippocampus 25 , 1028–1041 (2015). Pollmann, S. & Schneider, W. X. Working memory and active sampling of the environment: Medial temporal contributions. in Handbook of Clinical Neurology 187 , (Elsevier B.V., 2022). Liu, Z. X., Shen, K., Olsen, R. K. & Ryan, J. D. Visual sampling predicts hippocampal activity. J. Neurosci. 37 , 599–609 (2017). Ranganath, C. & Ritchey, M. Two cortical systems for memory-guided behaviour. Nat. Rev. Neurosci. 13 , 713–726 (2012). Contier, O., Baker, C. I. & Hebart, M. N. Distributed representations of behaviour-derived object dimensions in the human visual system. Nat. Hum. Behav. 8 , 2179–2193 (2024). Prinz, W. A Common Coding Approach to Perception and Action BT - Relationships Between Perception and Action: Current Approaches (Springer Berlin Heidelberg, 1990). Olivers, C. N. L. & Roelfsema, P. R. Attention for action in visual working memory. Cortex 131 , 179–194 (2020). Van Ede, F. & Nobre, A. C. Turning Attention Inside Out: How Working Memory Serves Behavior. Annu. Rev. Psychol. 74 , 137–165 (2023). Land, M. & Tatler, B. Looking and Acting: Vision and eye movements in natural behaviour (Oxford University Press, 2009). Paus, T. Location and function of the human frontal eye-field: A selective review. Neuropsychologia 34 , 475–483 (1996). Andersen, R. A., Brotchie, P. R. & Mazzoni, P. Evidence for the lateral intraparietal area as the parietal eye field. Curr. Opin. Neurobiol. 2 , 840–846 (1992). Bruce, C. J. & Goldberg, M. E. Physiology of the frontal eye fields. Trends Neurosci. 7 , 436–441 (1984). Coiner, B. et al. Functional neuroanatomy of the human eye movement network: a review and atlas. Brain Struct. Funct. 224 , 2603–2617 (2019). Hopfinger, J. B., Buonocore, M. H. & Mangun, G. R. The neural mechanisms of top-down attentional control. Nat. Neurosci. 3 , 284–291 (2000). Pollmann, S. Frontopolar Resource Allocation in Human and Nonhuman Primates. Trends Cogn. Sci. 20 , 84–86 (2016). Gallivan, J. P. & Goodale, M. A. The dorsal “action” pathway. in Handbook of Clinical Neurology 151 , (2018). Faul, F., Erdfelder, E., Lang, A. G. & Buchner, A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods 39 , 175–191 (2007). Quax, S. C., Dijkstra, N., van Staveren, M. J., Bosch, S. E. & van Gerven, M. A. J. Eye movements explain decodability during perception and cued attention in MEG. Neuroimage 195 , 444–453 (2019). Taulu, S. & Simola, J. Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. Phys. Med. Biol. 51 , 1759–1768 (2006). Tadel, F., Baillet, S., Mosher, J. C., Pantazis, D. & Leahy, R. M. Brainstorm: A user-friendly application for MEG/EEG analysis. Comput. Intell. Neurosci. 2011 , (2011). Fischl, B. FreeSurfer. Neuroimage 62 , 774–781 (2012). Sandberg, K. et al. Distinct MEG correlates of conscious experience, perceptual reversals and stabilization during binocular rivalry. Neuroimage 100 , 161–175 (2014). Additional Declarations There is NO Competing Interest. Supplementary Files Video1GazeSFvsSHventral.avi Extended Data Video 1 Video2GazeSFvsSHdorsal.avi Extended Data Video 2 Video3GazeOFvsOHventral.avi Extended Data Video 3 Video4GazeOFvsOHdrosal.avi Extended Data Video 4 Video5GazeInteractionventral.avi Extended Data Video 5 Video6GazeInteractiondorsal.avi Extended Data Video 6 Video7ImageFvsHventral.avi Extended Data Video 7 Video8ImageFvsHdorsal.avi Extended Data Video 8 SupplementaryInformation.docx Supplementary Information Cite Share Download PDF Status: Published Journal Publication published 24 Nov, 2025 Read the published version in Communications Biology → Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-5835383","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":408845797,"identity":"aeaf02fe-5e4e-4c8e-8f98-e3b14a4f9cdf","order_by":0,"name":"Lihui Wang","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA7ElEQVRIiWNgGAWjYBACPmYeBoYEMJP5AIMEmJGAXwsbQgtbAoNEAjFaGHhgTB4DqGpCWth5j0k8YDgsb86/5vMHyx+HGfjZcwwYfu7A5zC+NKBzDhvunPF2m4REwmEGyZ43Boy9Z/D6xQyo5TbjhhtntzGAtBjcyDFgZmwjrMV+w40zjz+AtNgTqyVxw/keBrDDDCQIazG2SDD4n7zhBpuZhERaOo/EmWcFB3vxaOHnP2N480dFmu2G84cff5awsZbjb0/e+OAnHi0QAIwRUDQyA2MfHE0HCGmA2neAgfEDcUpHwSgYBaNghAEAiuxJdqocG88AAAAASUVORK5CYII=","orcid":"https://orcid.org/0000-0003-1535-0671","institution":"Shanghai Jiao Tong University","correspondingAuthor":true,"prefix":"","firstName":"Lihui","middleName":"","lastName":"Wang","suffix":""},{"id":408845798,"identity":"e9d949ee-d948-41e5-a76a-974734b55af2","order_by":1,"name":"Zhongbin Su","email":"","orcid":"","institution":"Shanghai Jiao Tong University","correspondingAuthor":false,"prefix":"","firstName":"Zhongbin","middleName":"","lastName":"Su","suffix":""},{"id":408845799,"identity":"70263099-90c5-4fcc-9074-dde0e062ca5f","order_by":2,"name":"Xiaolin Zhou","email":"","orcid":"","institution":"East China Normal University","correspondingAuthor":false,"prefix":"","firstName":"Xiaolin","middleName":"","lastName":"Zhou","suffix":""},{"id":408845800,"identity":"c9a6c379-7cd2-4b43-b22d-994989c3afab","order_by":3,"name":"Stefan Pollmann","email":"","orcid":"https://orcid.org/0000-0001-5840-5658","institution":"Otto-von-Guericke-University Magdeburg","correspondingAuthor":false,"prefix":"","firstName":"Stefan","middleName":"","lastName":"Pollmann","suffix":""}],"badges":[],"createdAt":"2025-01-15 14:45:26","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-5835383/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-5835383/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1038/s42003-025-09039-y","type":"published","date":"2025-11-24T05:00:00+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":81024160,"identity":"3d91d68c-ea0c-4ba6-9f90-71a81e9a2964","added_by":"auto","created_at":"2025-04-21 10:12:18","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":1380879,"visible":true,"origin":"","legend":"\u003cp\u003eExperimental design and fixation decoding. (\u003cstrong\u003ea\u003c/strong\u003e) An example trial sequence in the behavioral experiment. (\u003cstrong\u003eb\u003c/strong\u003e) An example trial sequence in the Gaze Session of the MEG experiment. (\u003cstrong\u003ec\u003c/strong\u003e) An example block of the stimuli sequence in the Image Session of the MEG experiment. (\u003cstrong\u003ed\u003c/strong\u003e) An example face (blurred for privacy protection) and house image used in the experiment (upper left panel), and the fixation patterns (lower left panel) collected from one example participant during the view of faces (in blue star) and houses (in green cross). These fixation patterns were used to train a classifier in discriminating face- and house-related fixation patterns. The trained classifier was then used to classify the fixation patterns collected during the following of gaze tracks from the current observer (SF vs. SH, upper right panel) and the fixation patterns during the following of gaze tracks from another participant (OF vs. OH, lower right panel, i.e., cross-experiment classification). (\u003cstrong\u003ee\u003c/strong\u003e) The prediction accuracies of the cross-experiment classification are shown as a function of the comparison (SF vs. SH and OF vs. OH) and the number of fixations included in the classifications. The shaded areas indicate accuracies below the 95% percentile of the chance accuracies obtained from the permutation-based classification. **\u003cem\u003ep\u003c/em\u003e \u0026lt; 0.01, *\u003cem\u003ep\u003c/em\u003e \u0026lt; 0.05 for between self and other (Bonferroni-corrected).\u003c/p\u003e","description":"","filename":"Figure1.png","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/2bd479dbac239fc99f6d1919.png"},{"id":81024415,"identity":"701016eb-2316-40c9-9404-56d1dc01bafd","added_by":"auto","created_at":"2025-04-21 10:20:19","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":2411912,"visible":true,"origin":"","legend":"\u003cp\u003eThe ventral (\u003cstrong\u003ea\u003c/strong\u003e) and dorsal (\u003cstrong\u003eb\u003c/strong\u003e) view of the whole-brain source reconstruction based on the ERF signals. Upper row: the brain areas revealed by the contrast ‘Self-Face (SF) – Self-House (SH)’. Middle row: the brain areas revealed by the contrast ‘Other-Face (OF) – Self-House (OH)’. Lower row: the brain areas revealed by the interaction contrast ‘(SF – SH) – (OF – OH)’. The zero time points indicate the onsets of the gaze tracks.\u003c/p\u003e","description":"","filename":"Figure2.png","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/3442829c52013baf3b686e4e.png"},{"id":81024414,"identity":"e3b767a1-a604-4d67-8685-da78d7329ed1","added_by":"auto","created_at":"2025-04-21 10:20:18","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":1499440,"visible":true,"origin":"","legend":"\u003cp\u003eThe ventral (upper row) and dorsal (lower row) view of the whole-brain source reconstruction based on the ERF signal difference between Face and House in the Image Session. The zero time point indicates the onset of the image.\u003c/p\u003e","description":"","filename":"Figure3.png","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/c914ee751d09ee3fba08c42b.png"},{"id":81024165,"identity":"27e7e138-3f75-41a2-9833-c57ee3e7fb3b","added_by":"auto","created_at":"2025-04-21 10:12:18","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":1635097,"visible":true,"origin":"","legend":"\u003cp\u003eThe spatial gradient of the MEG signal at each dimension of the 3-D brain space. The \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2 \u003c/sup\u003eof the spatial gradient model are shown as a function of the spatial dimensions (\u003cem\u003ex\u003c/em\u003e, \u003cem\u003ey\u003c/em\u003e, \u003cem\u003ez\u003c/em\u003e) and time in the Gaze Session (\u003cstrong\u003ea\u003c/strong\u003e) and the Image Session (\u003cstrong\u003ec\u003c/strong\u003e). The horizontal lines at the bottom of each pallet indicate the time ranges where the \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e were significantly higher than the baseline (multiple comparisons corrected with cluster-based permutation at \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.05). The small triangles indicate the peak of the \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2 \u0026nbsp;\u003c/sup\u003ealong a specific dimension. The spatial gradient patterns (in terms of \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2 \u003c/sup\u003evalues) at the peak time points are shown in the 3D space (\u003cstrong\u003eb\u003c/strong\u003e: Gaze Session, \u003cstrong\u003ed\u003c/strong\u003e: Image Session). (\u003cstrong\u003ee\u003c/strong\u003e): The counts that the spatial gradient pattern for ‘SF – SH’ emerged earlier than the spatial gradient pattern for ‘OF – OH’ along the \u003cem\u003ey\u003c/em\u003e and the \u003cem\u003ez\u003c/em\u003e dimensions. The latency difference between the two \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2 \u003c/sup\u003etime courses was estimated with a cross-correlation method and was tested using a bootstrapping method (see Methods). Dashed lines indicate the 95% confidence interval. (\u003cstrong\u003ef\u003c/strong\u003e): The predicted coordinates sorted by the ERF amplitudes in the Image Session are shown as a function of the predicted coordinates in the Gaze Session along the \u003cem\u003ey\u003c/em\u003e (left column) and the \u003cem\u003ez\u003c/em\u003e (right column) dimensions. Upper row for ‘SF – SH’ and lower row for ‘OF – OH’.\u003c/p\u003e\n\u003cp\u003e\u0026nbsp;\u003c/p\u003e","description":"","filename":"Figure4.png","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/8d6f2d694e67b3a4c52151c7.png"},{"id":96699747,"identity":"b260241d-5be7-4671-a860-a91c3a64cc78","added_by":"auto","created_at":"2025-11-25 08:13:02","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":8126449,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/9cfeca6b-cc5a-4590-9166-d6912c42b00d.pdf"},{"id":81024419,"identity":"507b2d8b-d2c1-4683-899a-7a568788fd6b","added_by":"auto","created_at":"2025-04-21 10:20:19","extension":"avi","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":6123490,"visible":true,"origin":"","legend":"Extended Data Video 1","description":"","filename":"Video1GazeSFvsSHventral.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/519e8309839f1624d0dd9229.avi"},{"id":81025094,"identity":"78c9cbd5-af41-4280-81a6-8418813150bd","added_by":"auto","created_at":"2025-04-21 10:28:19","extension":"avi","order_by":2,"title":"","display":"","copyAsset":false,"role":"supplement","size":5975540,"visible":true,"origin":"","legend":"Extended Data Video 2","description":"","filename":"Video2GazeSFvsSHdorsal.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/230590fb8a52fcd8bdd37aa0.avi"},{"id":81024425,"identity":"bea8558c-361f-4205-a6a8-17ce06a4c265","added_by":"auto","created_at":"2025-04-21 10:20:19","extension":"avi","order_by":3,"title":"","display":"","copyAsset":false,"role":"supplement","size":5920852,"visible":true,"origin":"","legend":"Extended Data Video 3","description":"","filename":"Video3GazeOFvsOHventral.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/0a4607c72f9f28b55c3eac68.avi"},{"id":81024417,"identity":"550f5503-35c7-4f6a-9629-b4fae906e315","added_by":"auto","created_at":"2025-04-21 10:20:19","extension":"avi","order_by":4,"title":"","display":"","copyAsset":false,"role":"supplement","size":5933852,"visible":true,"origin":"","legend":"Extended Data Video 4","description":"","filename":"Video4GazeOFvsOHdrosal.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/1031227f4e45e46c84ebdb62.avi"},{"id":81024177,"identity":"da356041-f2a3-4b77-b77a-91346e83953b","added_by":"auto","created_at":"2025-04-21 10:12:19","extension":"avi","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":5901676,"visible":true,"origin":"","legend":"Extended Data Video 5","description":"","filename":"Video5GazeInteractionventral.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/1c27db3e1548141cbc0eb0d3.avi"},{"id":81025095,"identity":"1f4af44e-d27a-4577-9ce1-392d4b3a5bfb","added_by":"auto","created_at":"2025-04-21 10:28:19","extension":"avi","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":5936410,"visible":true,"origin":"","legend":"Extended Data Video 6","description":"","filename":"Video6GazeInteractiondorsal.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/15291fd6faf58c9ad0a8968f.avi"},{"id":81025099,"identity":"708420f3-cdd9-4cb7-a4a3-2a5f335e389e","added_by":"auto","created_at":"2025-04-21 10:28:19","extension":"avi","order_by":7,"title":"","display":"","copyAsset":false,"role":"supplement","size":4034528,"visible":true,"origin":"","legend":"Extended Data Video 7","description":"","filename":"Video7ImageFvsHventral.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/ae7bc2a0444109d6c4cd9646.avi"},{"id":81024193,"identity":"db0af1ff-8c23-4833-b422-d58dc6ddbb0c","added_by":"auto","created_at":"2025-04-21 10:12:19","extension":"avi","order_by":8,"title":"","display":"","copyAsset":false,"role":"supplement","size":4448044,"visible":true,"origin":"","legend":"\u003cp\u003eExtended Data Video 8\u003c/p\u003e","description":"","filename":"Video8ImageFvsHdorsal.avi","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/cb02efbe2abed8b135a57630.avi"},{"id":81024187,"identity":"4a98e0c5-9d2d-4ff4-b6ee-947581c4db16","added_by":"auto","created_at":"2025-04-21 10:12:19","extension":"docx","order_by":9,"title":"","display":"","copyAsset":false,"role":"supplement","size":707639,"visible":true,"origin":"","legend":"\u003cp\u003eSupplementary Information\u003c/p\u003e","description":"","filename":"SupplementaryInformation.docx","url":"https://assets-eu.researchsquare.com/files/rs-5835383/v1/c4f9396d000fa9a1c671a7bd.docx"}],"financialInterests":"There is \u003cb\u003eNO\u003c/b\u003e Competing Interest.","formattedTitle":"Dynamic face-related eye movement representations in the human ventral pathway","fulltext":[{"header":"Introduction","content":"\u003cp\u003eVentral occipitotemporal cortex is well known for its role in high-level object perception\u003csup\u003e1\u0026ndash;6\u003c/sup\u003e. In particular, the fusiform face area (FFA) is activated by face images\u003csup\u003e7\u003c/sup\u003e and the parahippocampal place area (PPA) by house images\u003csup\u003e8\u003c/sup\u003e. In a recent study, however, it was shown that FFA and PPA exhibited distinct neural activation patterns to face- and house-related gaze tracks, elicited in the absence of face or house image perception\u003csup\u003e9\u003c/sup\u003e. In this study, participants followed a sequence of dots on a uniform background with eye movements, where the dot sequence replayed gaze tracks previously recorded during face or house viewing. The face- and house-related gaze tracks could be decoded by the activation patterns in the FFA and PPA, thus indicating category-specific representations of gaze tracks in areas that were known to be activated by the respective image categories. Furthermore, the category-selective activation patterns were more sensitive to self-generated gaze tracks than gaze tracks generated by other observers\u003csup\u003e9\u003c/sup\u003e, in line with known individual differences in looking at faces\u003csup\u003e10\u003c/sup\u003e.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eHere, we asked what the function of these gaze-track representations in a high-level perceptual area such as the FFA might be. Multiple areas along the ventral pathway are dedicated to face processing. In addition to the already mentioned FFA, face-responsive patches along the rostrocaudal extent of the temporal lobes as well as in the orbitofrontal cortex have been found in both human and non-human primates\u003csup\u003e11\u0026ndash;13\u003c/sup\u003e. Notably, all of these studies investigated the processing of face images, without recourse to eye movements. Recently, however, modulations of neural activity by eye movements have been reported in the orbitofrontal cortex, particularly involving face viewing\u003csup\u003e14,15\u003c/sup\u003e. Moreover, neuronal activity in the medial temporal lobes can be modulated by eye movements, and lesions in this area may lead to changes in eye movement patterns during active sampling of the environment\u003csup\u003e16,17\u003c/sup\u003e. Medial temporal structures also interact with PPA during scene exploration\u003csup\u003e18\u003c/sup\u003e. Taken together with the evidence of gaze-sequence representation in the FFA and PPA\u003csup\u003e9\u003c/sup\u003e, these findings may lead to the hypothesis that the ventral stream, from orbitofrontal cortex via the uncinate fascicle to anterior temporal cortex and, via medial temporal cortex or directly, further to ventral occipitotemporal cortex, might be involved in representing face-specific gaze tracks.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eHowever, how might face-specific gaze track representations be processed along the ventral stream? A top-down hypothesis is that eye movement sequences, e.g., during face viewing, initially activate prefrontal cortex, creating a categorical prediction (in this example of the \u0026apos;face\u0026apos; category) which is fed back via the ventral stream to guide recognition in posterior perceptual areas\u003csup\u003e19\u0026ndash;21\u003c/sup\u003e. To test this hypothesis, we investigated the spatiotemporal gradient of the MEG signal changes during face-related (vs. house-related) gaze track following. We expected dynamic signal changes specific to face-related gaze tracks to occur along the ventral stream, from orbitofrontal via anterior temporal and medial temporal cortex to ventral occipitotemporal cortex. Considering the known interindividual differences in the eye movement patterns of faces\u003csup\u003e10\u003c/sup\u003e, we also expected that following self-generated gaze tracks would optimally stimulate face-specific neural representations\u003csup\u003e9\u003c/sup\u003e, leading to earlier and stronger MEG signal changes than other-generated gaze tracks. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTop-down processing in the prefrontal cortex has been particularly observed when there is a lack of rich and unambiguous visual information\u003csup\u003e20,22\u003c/sup\u003e, as is the case in our dot-following task. Specifically, the categorical prediction in the prefrontal cortex was activated by ambiguous visual information in early visual areas\u003csup\u003e20\u003c/sup\u003e. To test this constraint, we presented actual images of faces and houses in the MEG scanner as a control condition. In contrast to the dot-following task, here, we expected a dominance of feedforward processing along the ventral pathway.\u003c/p\u003e"},{"header":"Results","content":"\u003cp\u003eThe study consisted of a behavioral experiment and an MEG experiment, with a one-week interval between the two experiments. In the behavioral experiment (Fig. 1a), gaze tracks of all participants were recorded while they were looking at images of faces and houses (see Table S1 in Supplementary Information for gaze parameters). In the MEG experiment, participants followed dot sequences of their own (self-face, SF; self-house, SH) or another participant\u0026rsquo;s gaze tracks (other-face, OF; other-house, OH) with eye movements (Gaze Session, Fig. 1b). After the Gaze Session, participants took part in an Image Session where they viewed images of faces and houses while maintaining the eyes on a central fixation point (Fig. 1c). The behavioral performance is presented in \u003cem\u003eBehavioral results\u003c/em\u003e of Supplementary Information.\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eDistinct patterns between face- and house-related gaze tracks\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eWe first analyzed if the face- and house-related fixation sequences obtained in the behavioral experiment showed distinct patterns. A machine-learning classification analysis over the spatiotemporal parameters of the fixations (\u003cem\u003ex\u003c/em\u003e, \u003cem\u003ey\u003c/em\u003e coordinates, and fixation duration) showed a high prediction accuracy in discriminating the two categories of gaze sequences, 70.5 \u0026plusmn; 11.4% (M \u0026plusmn; SD), with above-chance significance (\u003cem\u003ep\u003c/em\u003e \u0026lt; 10\u003csup\u003e5\u003c/sup\u003e; permutation-based significance testing). The same analysis was also performed on the eye movements collected in the MEG-Gaze Session, yielding a high prediction accuracy in discriminating between SF and SH, 66.3 \u0026plusmn; 10.2%, \u003cem\u003ep\u003c/em\u003e \u0026lt; 10\u003csup\u003e5\u003c/sup\u003e, and between OF and OH, 66.5 \u0026plusmn; 9.7%, \u003cem\u003ep\u003c/em\u003e \u0026lt; 10\u003csup\u003e5\u003c/sup\u003e.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eWe also performed cross-experiment classifications where the classifier was trained with the gaze patterns from the behavioral experiment and was used to predict the gaze patterns in the Gaze Session of the MEG experiment. The above-chance cross-experiment prediction accuracies confirmed that the distinct patterns of the online eye movement in the Gaze Session were related to the face vs. house categories in the behavioral experiment. Moreover, the cross-experiment prediction accuracies were higher for self-generated gaze tracks than for other-generated gaze tracks (Fig. 1e), indicating that participants followed their own gaze tracks better (see \u003cem\u003eCross-experiment classification of gaze patterns\u003c/em\u003e in Supplementary Information for the statistics).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eNeural face-related gaze pattern representations\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eTo reveal the face-related gaze representations, we compared the MEG signals of face-related gaze tracks with the MEG signals of house-related gaze tracks. Here the house-related gaze tracks were taken as a control for face-related gaze tracks because there were eye movements and visual stimulation but a lack of a structural pattern\u003csup\u003e23\u003c/sup\u003e (for statistical evidence, see \u003cem\u003eThe structural pattern of face-related gaze trac\u003c/em\u003eks in Supplementary Information). For each condition (SF, SH, OF, OH) in the Gaze Session, the ERF signal from 0-1500 ms relative to the onset of the gaze track was calculated. The difference in the estimated cortical current maps was calculated between the following conditions: \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;, and \u0026lsquo;(SF \u0026ndash; SH) \u0026ndash; (OF \u0026ndash; OH)\u0026rsquo;. These results would reveal the temporal development of the brain networks involved in the face-related gaze tracks, respectively areas that were sensitive to self-generated face-gaze tracks. The face-related gaze tracks elicited stronger ERF signals than the house-related gaze tracks in the orbitofrontal cortex (OFC) and the ventral anterior temporal lobe (ATL) extending to the medial temporal lobe (\u0026lsquo;SF \u0026ndash; SH\u0026rsquo; and \u0026lsquo;OF \u0026ndash; OH\u0026rsquo; in Fig. 2a, Bonferroni-corrected for time and spatial clustering; see also Extended Data Video 1-4). Although small, there are also signal differences reaching back to the occipital cortex (at 600ms and 1400ms, depending on the contrast). Importantly, the activities in the OFC and ATL emerged earlier than the activities in the medial temporal lobe and occipital cortex, suggesting a top-down prediction of the face category guided by the face-related gaze tracks.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eMoreover, this network was more active during the following of self-generated gaze tracks than another observer\u0026rsquo;s gaze tracks, as revealed by the interaction contrast \u0026lsquo;(SF \u0026ndash; SH) \u0026ndash; (OF \u0026ndash; OH)\u0026rsquo; (Fig. 2a and Extended Data Video 5). The source reconstruction was dominantly localized in the ventral stream, with the notable exception of dorsal stream areas in frontal cortex, including the frontal eye field (FEF) and supplementary eye field (SEF), in the interaction contrast (Fig. 2b and Extended Data Video 6).\u003c/p\u003e\n\u003cp\u003eThe whole-brain source reconstruction was also performed based on the signal difference between Face and House in the Image Session. In contrast to the Gaze Session, here the strongest signal difference was localized in the posterior occipitotemporal areas, beginning already 200 ms post-stimulus onset (Fig. 3 and Extended Data Video 7-8).\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eDynamic spatiotemporal patterns of the gaze-related representations in the brain\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eIn order to provide statistical evidence for the temporal order of signal development observed in the event-related magnetic field analysis, we tested if there was an information flow from the anterior areas (e.g., OFC and ATL) to the posterior areas (e.g., FFA) during gaze following. We performed spatial gradient analysis, which quantified how the MEG signals gradually changed along a specific dimension (e.g., anterior-posterior) in the brain space\u003csup\u003e24\u003c/sup\u003e. Here, the top-down hypothesis of the gaze-track representations predicted information flow from the anterior areas to the posterior areas along the ventral pathway.\u0026nbsp;This can be probed with the gradually decreased activity from the anterior areas to the posterior areas, in particular, how the gradient became face-selective during the gaze following. As an area or neural network with stronger signals would be faster to exceed the neural threshold of maintaining sensory selectivity or perceptual preference\u003csup\u003e25\u003c/sup\u003e, the stronger signal changes in the anterior areas indicated that the face-selective representation of the gaze tracks emerged earlier than the posterior areas.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTo test our hypothesis, here the spatial gradient was analyzed based on the ERF signal differences (e.g., \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;) to show how the signal changed at the anterior-posterior dimension. The analysis was performed at each time point during gaze following to show how the gradient pattern became face-selective over time. To provide a complete gradient pattern at the whole-brain level, the analysis was also performed for the dorsal-ventral and left-right dimensions. For each of the three dimensions (\u003cem\u003ex\u003c/em\u003e, \u003cem\u003ey\u003c/em\u003e, \u003cem\u003ez\u003c/em\u003e, hence left-right, anterior-posterior, dorsal-ventral dimension), we modeled the coordinates with the ERF difference at each time point\u003csup\u003e24\u003c/sup\u003e. The \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e of the model was calculated to assess the accounted variance. The first-order derivatives of the estimated model were calculated to test if the spatial gradient increased or decreased monotonically along a specific dimension. As shown in Fig. 4a (left), the ERF signal difference between SF and SH showed a significant gradient pattern along the \u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e dimensions, cluster-based permutation correction at \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.05 (see Fig. 4a for the significant time ranges), whereas the \u003cem\u003ex\u0026nbsp;\u003c/em\u003edimension did not reach significance (no time ranges reached significance). The significant gradient pattern that emerged over time during the gaze following (i.e., significantly higher \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u0026nbsp;\u003c/sup\u003ethan baseline) indicated that the gradient pattern was not due to the general intrinsic brain dynamics but rather to the face-specific gaze following. Along both the \u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e dimensions, the estimated model showed a monotonic characteristic, with 97.5% of the derivative values \u0026gt; 0 at the time point with the strongest gradient pattern along the \u003cem\u003ey\u003c/em\u003e dimension and 98.9% of the derivative values \u0026lt; 0 along the z dimension (Supplementary Fig. S2). The signal difference between OF and OH showed a similar pattern, with 98.7% of the derivative values \u0026gt; 0 along the \u003cem\u003ey\u003c/em\u003e dimension and 97.7% of the derivative values \u0026lt; 0 along the \u003cem\u003ez\u003c/em\u003e dimension (Fig. 4a right, Supplementary Fig. S2). These results indicated that the signal difference between SF and SH, and between OF and OH, decreased along the anterior-to-posterior axis (Fig. 4b upper row), and decreased along the ventral-to-dorsal axis of the brain (Fig. 4b lower row), whereas there was no difference between the left and right hemispheres. Taking the results at the anterior-posterior and the ventral-dorsal dimensions together, these findings provided statistical evidence for the temporal sequence observed in the event-related magnetic field analysis, showing that the representations of face-related gaze tracks dynamically progressed along the ventral pathway, starting from the ventral anterior areas (e.g. OFC and ATL) via medial temporal lobe (MTL) to ventral occipitotemporal cortex.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe spatial patterns at the time point where the gradient reached its peak are shown in Fig. 4b (left: \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e peaked at 910 ms along the \u003cem\u003ey\u003c/em\u003e dimension and at 670 ms along the \u003cem\u003ez\u003c/em\u003e dimension; right: \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;, \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u0026nbsp;\u003c/sup\u003epeaked at 1485 ms along the \u003cem\u003ey\u003c/em\u003e dimension and at 650 ms along the \u003cem\u003ez\u003c/em\u003e dimension). Importantly, the spatial gradient emerged earlier for \u0026lsquo;SF \u0026ndash; SH\u0026rsquo; than the spatial gradient for \u0026lsquo;OF \u0026ndash; OH\u0026rsquo; along the \u003cem\u003ey\u003c/em\u003e dimension (i.e., the posterior-to-anterior dimension), mean latency difference = -310 ms, 95% CI = [-460 ms, -165 ms], but not along the \u003cem\u003ez\u003c/em\u003e dimension mean latency difference = 20 ms, 95% CI = [-360 ms, 450 ms] (Fig. 4e). Together with the cross-experiment classifications of gaze patterns, these results indicated that self-generated gaze tracks were more sensitive than other-generated gaze tracks to activate the face-selective neural representations.\u003c/p\u003e\n\u003cp\u003eIn the Image Session, the ERF signal difference between Face and House also showed a significant gradient pattern along the \u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e dimensions, cluster-based permutation at \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.05, whereas the \u003cem\u003ex\u003c/em\u003e dimension did not reach significance (Fig. 4c). Along both the \u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e dimensions, the estimated model showed a monotonic characteristic, with 99.4% of the derivative values \u0026lt; 0 along the \u003cem\u003ey\u003c/em\u003e dimension and 82.3% of the derivative values \u0026lt; 0 along the \u003cem\u003ez\u003c/em\u003e dimension (Supplementary Fig. S2). The spatial patterns at the time point where the gradient reached its peak are shown in Fig. 4d (\u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e peaked at 260 ms along the \u003cem\u003ey\u003c/em\u003e dimension and at 710 ms along the \u003cem\u003ez\u003c/em\u003e dimension). While the gradient pattern along the \u003cem\u003ez\u003c/em\u003e dimension was consistent with the Gaze Session (Fig. 4c), with signal difference decreasing from the ventral to the dorsal part of the brain (Fig. 4d, lower row), the gradient pattern along the \u003cem\u003ey\u003c/em\u003e dimension was reversed, with signal difference decreased from the posterior to the anterior part of the brain (Fig. 4d, upper row). Importantly, the reversed gradient pattern along the \u003cem\u003ey\u003c/em\u003e dimension between the Gaze Session and the Image Session again indicated that the observed gradient pattern was not due to the general intrinsic brain dynamics along the ventral pathway, but rather reflected the feedback-dominant vs. feedforward-dominant processing specific to the current task (i.e., gaze following vs. image processing).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe reversed pattern along the anterior-posterior direction between the Gaze Session and the Image Session was further confirmed by the statistical evidence that the spatial gradient along the \u003cem\u003ey\u003c/em\u003e dimension showed a negative correlation between the two sessions, \u003cem\u003er\u003c/em\u003e = -0.99, 95%CI = [-0.994, -0.957], \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.001 at the peak time point for \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, and \u003cem\u003er\u003c/em\u003e = -0.98, 95%CI = [-0.991, -0.934], \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.001 at the peak time point for \u0026lsquo;OF \u0026ndash; OH\u0026rsquo; (Fig. 4f, left). By contrast, the spatial gradient along the \u003cem\u003ez\u003c/em\u003e dimension showed a positive correlation between the two sessions, \u003cem\u003er\u003c/em\u003e = 0.90, 95%CI = [0.863, 0.951], \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.001 at the peak time point for \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, and \u003cem\u003er\u003c/em\u003e = 0.92, 95%CI = [0.856, 0.953], \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.001 at the peak time point for \u0026lsquo;OF \u0026ndash; OH\u0026rsquo; (Fig. 4f, right). Collectively, the reversed pattern along the anterior-posterior direction between the Gaze Session and the Image Session suggested a combination of feedback and feedforward processing in natural face perception (i.e., when we look at a face using eye movements).\u0026nbsp;\u003c/p\u003e"},{"header":"Discussion","content":"\u003cp\u003eWe have shown that face-related gaze tracks were dynamically represented along the ventral pathway, from OFC via ATL, to MTL and ventral occipitotemporal cortex. During the gaze following, there was a gradient pattern along the ventral stream, with face-selective activity progressing from OFC to the occipitotemporal cortex. However, when actual images of faces and houses were presented, the reverse gradient was observed, face-selective activity progressing from the ventral posterior occipitotemporal cortex to the prefrontal cortex. Taken together, our findings show that the ventral pathway represents aspects of the eye-movement program used to explore faces. The fixation sequences may help to form a top-down prediction of face category in OFC and ATL to guide, via the MTL or directly, the perceptual representation of faces in ventral occipitotemporal cortex, particularly under demanding viewing conditions.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe brain areas that we found representing categorical face-related gaze tracks, from OFC via ATL to ventral occipitotemporal cortex have previously been found to support face perception, both in human and non-human primates\u003csup\u003e11\u0026ndash;13\u003c/sup\u003e. Importantly, we found these areas to represent face-specific gaze patterns in the absence of face or house images, indicating that not only visual features, but also information about category-specific gaze sequences are represented, like fixation locations and their temporal sequence. In addition, the representation of face-related gaze sequences was particularly early and strong when gaze sequences that were followed were generated by the same participant during actual face viewing. This pattern is in line with the stable interindividual differences in gaze patterns that can be found across different viewing conditions\u003csup\u003e10\u003c/sup\u003e. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eFace-specific activity was observed earliest in OFC and ATL, spreading backwards along the ventral stream. Although we do not know about previous reports of eye movement processing in human orbitofrontal cortex, modulation of orbitofrontal activity by eye movements has recently been reported, particularly during looking at faces\u003csup\u003e14,15\u003c/sup\u003e. OFC and ATL, connected via the uncinate fasciculus, are known to be vital for social interaction, with lesions in this network leading to the behavioral variant of frontotemporal dementia\u003csup\u003e26,27\u003c/sup\u003e. Given the importance of face perception - including perception of facial expressions - for social interaction, it is not astonishing to find representations of face-specific gaze sequences in these areas. The early occurence of face-specific gaze representation in OFC and ATL in the ventral face processing stream may suggest that the processing of face-specific gaze patterns is vital for social interaction. It may thus be worthwhile to investigate if face-specific gaze sequences break down in degenerative diseases affecting OFC and ATL, like frontotemporal dementia. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eFace-related gaze sequence activation spread further to the medial temporal lobe, which has previously been shown to be vital for the exploration of the environment with eye movements, particularly, but not only, in memory-guided vision\u003csup\u003e28\u0026ndash;30\u003c/sup\u003e. Moreover, during face viewing, activation in the hippocampal as well as in the fusiform face area was modulated by the number of fixations\u003csup\u003e31\u003c/sup\u003e and during scene viewing, functional connectivity between the hippocampus and the PPA was enhanced during free viewing (versus forced central fixation)\u003csup\u003e18\u003c/sup\u003e. The MTL is connected to orbitofrontal and temporopolar cortex via the perirhinal cortex\u003csup\u003e32\u003c/sup\u003e. Thus, in the context of face viewing, information from OFC and ATL about the highly structured (T-shaped) fixation pattern may elicit a memory trace of the \u0026apos;face\u0026apos;-category in the hippocampus. This, however, needs further investigation. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIf OFC and ATL are first to represent face-specific gaze patterns, how does the information about fixation patterns arrive at these areas in the first place? In normal looking behaviour, this may be answered by our finding that during the presentation of actual face images, faces and houses can be discriminated early and most strongly in occipitotemporal cortex, spreading fast to anterior brain areas. Thus, during free viewing of a face, there will be an interaction of feedforward and feedback signals supporting perception. Our data, in line with previous reports, suggest that information about face-specific gaze sequences may aid face perception particularly in the absence of rich and unambiguous visual information\u003csup\u003e19,20,22\u003c/sup\u003e. A recent MEG study also showed that the ventral prefrontal cortex guides the construction of low-dimensional categorical prediction from the high-dimensional visual information in occipitotemporal cortex\u003csup\u003e21\u003c/sup\u003e. Taken together, following the gaze tracks may activate category-predictive processes in OFC and ATL, sending feedback signals via the MTL or directly to ventral occipitotemporal face patches including the FFA to facilitate face perception.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eRecently, it has been emphasized that categorical object representation in the brain may be linked to the object\u0026apos;s behavioral relevance\u003csup\u003e33\u003c/sup\u003e, in line with models of perception-action coupling\u003csup\u003e34\u0026ndash;36\u003c/sup\u003e. Given that eye movements are mostly generated fast and without conscious control\u003csup\u003e37\u003c/sup\u003e, representations of category-specific gaze sequences as part of an object\u0026apos;s neural representation may be a natural case of object representation including relevant behavior, at least for object categories with structured gaze sequences, such as for faces. \u003cem\u003e\u0026nbsp;\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eIt may seem puzzling that we found gaze-specific activation patterns for faces mainly in areas of the ventral stream, whereas the dorsal stream\u0026apos;s importance for eye movement control is well known\u003csup\u003e38\u0026ndash;41\u003c/sup\u003e. We also could discriminate face versus house-related gaze following in dorsal brain areas, particularly in the frontal eye fields and the superior frontal cortex, known to support attentional control\u003csup\u003e42\u003c/sup\u003e, as well as in left frontopolar cortex, known to support exploratory attentional resource allocation\u003csup\u003e43\u003c/sup\u003e. Interestingly, this was only the case for self-generated dot following, ruling out that these activation patterns were simply due to differences in basic eye movement parameters like saccade amplitudes. Nevertheless, dorsal stream activation differences for face versus house-related gaze following were much less than in ventral stream areas. This may be due to the nature of our contrasts, which asked for a categorical (\u0026lsquo;face \u0026ndash; house\u0026rsquo;) distinction between the associated gaze sequences. This categorical distinction may be more associated with the known capabilities of the ventral stream in object categorization than with the visuomotor control functions associated with the dorsal stream\u003csup\u003e44\u003c/sup\u003e.\u0026nbsp;\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eA ventral network of brain areas known to support face processing, reaching from orbitofrontal and anterior temporal cortex via medial temporal to ventral occipitotemporal cortex, was found to represent face-related gaze sequences. During gaze following, activation in this network followed an anterior-to-posterior gradient, indicating feedback from anterior areas to ventral occipitotemporal perceptual areas, possibly using gaze patterns to support face perception. Our findings add an ideomotor perspective to the ventral stream function and the reconsideration of both ventral and dorsal streams in representing eye movements. The face-related eye movement representations in OFC and ATL suggest the role of the frontotemporal network in gaze control during social interaction.\u003c/p\u003e"},{"header":"Methods","content":"\u003cp\u003e\u003cem\u003eParticipants\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eThe sample size was decided with G-Power 3.0\u003csup\u003e45\u003c/sup\u003e based on a previous study that examined the capabilities of MEG signals in decoding eye movement patterns\u003csup\u003e46\u003c/sup\u003e. Given a correlation coefficient between eye movement and MEG patterns = 0.86, alpha = 0.0001 reported in this study, a sample size of 27 is required to achieve an expected power of 99%. Following this criterion while considering potential exclusion, 32 university students (19 females, 13 males, mean age 20.8 years old) were recruited in the present study. All participants reported normal or corrected to normal vision. Informed written consent was obtained prior to the experiment. One participant was excluded due to his drop-out from the MEG experiment, resulting in 31 participants (19 females, 12 males, mean age 20.6 years old). This study was conducted in accordance with the Declaration of Helsinki, and was approved by the ethics committee of the local university.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eDesign and procedure\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eEach participant went through a behavioral experiment (Figure 1a) and an MEG experiment, with a one-week interval between the two experiments. In the behavioral experiment, we collected the gaze patterns of all participants while they were looking at faces and houses. These gaze patterns were then presented in the Gaze Session of the MEG experiment, and participants had to follow the gaze patterns with eye movements.\u003c/p\u003e\n\u003cp\u003eIn the behavioral experiment, stimuli were images of faces and houses presented on a black background of a computer screen. For each participant, the images of faces and houses presented during the experiment were randomly chosen from an image set (20 male faces, 20 female faces, and 40 houses). The size of the face images was fixed at a width of 14.4\u0026deg; \u0026times; height of 16.7\u0026deg; of visual angle, with an eye-to-mouth distance of 7\u0026deg;. Due to the varying structures, the size of houses was not constant, with a mean width of 19.3\u0026deg; \u0026plusmn;\u0026nbsp;1.0\u0026deg;, and a height of 13.1\u0026deg;\u0026nbsp;\u0026plusmn;\u0026nbsp;2.4\u0026deg;. Participants\u0026nbsp;were required to complete an N-back task while looking at each of the pictures\u003csup\u003e9\u003c/sup\u003e. At the beginning of each trial, a green dot (0.2\u0026deg; of visual angle in diameter) was presented at one of the four corners (15\u0026deg; from the center of the screen) to attract eye fixation. The green dot was presented for a varying interval of 1200-2000 ms. In 20% of the trials, a small black dot (0.05\u0026deg; in diameter) was presented at the center of the green dot for 100 ms. Participants were asked to detect the black dot by pressing the \u0026lsquo;z\u0026rsquo; button using the left index finger on a standard keyboard. The onset of this small black dot was randomly chosen from the time point during the presentation of the green dot. After the offset of the green dot, a face or house picture was presented at the center of the screen and remained on the screen for 1500 ms. Participants were instructed to look at the images with free eye movements. There were 14 blocks of trials in the experiment. In the first block, a set of 6 face images (3 males and 3 females) and 6 house images were presented, one per trial, in a random order. In each of the following 13 blocks, 1-3 new images (either face or house) were added into the original 12 images. Participants were asked to memorize the images in the first block, and detect if a new image was presented in the following 13 blocks by pressing the \u0026lsquo;m\u0026rsquo; button using the right index finger.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn the MEG experiment, stimuli were presented through an LCD projector onto a rear screen located in front of the scanner. There were two sessions in the MEG experiment: a Gaze Session and an Image Session. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn the Gaze Session, each participant was asked to follow dots that represented his/her own gaze patterns as well as dots that represented another participant\u0026rsquo;s gaze patterns. Each trial started with a red dot on a grey background, which remained at the center of the screen for 1400-2000ms. After a blank screen of a jittered interval (450 \u0026ndash; 650 ms), the gaze track was presented on the screen, in the form of a sequence of green dots. The sequence of green dots represented the gaze pattern for a specific picture obtained from the behavioral experiment, and each dot represented a fixation of the gaze pattern. Given that the gaze patterns were collected during the 1500ms-time range of the picture presentation, the dot sequence lasted approximately 1500 ms on the screen. In 20% of the trials, a small black dot (0.05\u0026deg; in diameter) was presented at the center of the central red dot or the moving green dot (with equal probabilities). This small black dot was presented for 100 ms, and participants were asked to detect the black dot by pressing the button using the right index finger. According to our design, four categories of gaze tracks were presented: face-related gaze tracks from the current observer (self-face, SF), house-related gaze tracks from the current observer (self-house, SH), face-related gaze tracks from another participant (other-face, OF), and house-related gaze tracks from another participant (other-house, OH). There were 10 blocks of gaze-tracks, with 40 trials (10 trials per condition) in each block. Trials from the 4 conditions were mixed and presented in a random order. At the end of each block, a feedback screen was presented to inform the participants\u0026rsquo; performance in detecting the small black dots.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn the Image Session, participants were asked to view successively presented images in a one-back task. Images were grouped into 10 blocks of faces and 10 blocks of houses. The two block types were presented in a random order. Each block started with a central fixation (a green cross) at a varying interval of 1000-1600 ms. Then images of the same category (face or house) were successively presented (each lasted for 1000 ms), with a jittered interval of 300-600 ms between each two images. The central fixation was presented throughout the block, and participants were required to maintain their eyes on the central fixation. In each block, 10-11 images were presented with one or two images that were repeated immediately after their first presentation. Participants were asked to detect the immediate repetition of the image by button press. Apart from the immediate repetition, there were no other repetitions of images in each block. In total, each participant viewed 100-110 images of each category. There was a ~30s break between each two blocks.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eFor both the behavioral experiment and the MEG experiment, eye-movement data were recorded during the experiment with an EyeLink 1000 plus system (SR-Research, Canada), at an online sampling rate of 1000 Hz. A standard procedure of nine-point calibration and validation was performed at the beginning of the experiment, with a maximum error of 1.0\u0026deg; as the threshold. A drift check was performed at the beginning of each block, and the calibration and validation were performed if the error of the drift check exceeded the threshold (i.e., \u0026gt; 1.0\u0026deg;). \u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eMEG data acquisition and preprocessing\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eNeuromagnetic signals were recorded using a whole-head MEG system, with 204 planar gradiometers and 102 magnetometers (Eleka Neuromag TRIUX) in a magnetically shielded room. Four head position indication (HPI) coils were placed in each participant\u0026rsquo;s head to estimate head position during recording, with two coils in left and right mastoids and two on the forehead. The raw MEG signals were online sampled at 1000Hz and were band-pass filtered between 0.1 and 330 Hz. The structural MRI of each participant was obtained using a 3T Siemens Prisma MR scanner. The MRI scanning was conducted on a different day after the MEG experiment. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eHead shapes were quantified using the Probe Position Identification system (Polhemus), and three anatomical landmarks (nasion, left and right pre-auricular points) were used to co-register the MEG data with MRI coordinates. Max-filter was used to reduce external noise and compensate for head movements (temporal signal space separation method, tSSS\u003csup\u003e47\u003c/sup\u003e). The offline pre-processing analysis of MEG data was performed using Brainstorm\u003csup\u003e48\u003c/sup\u003e . The continuous MEG data was first down-sampled to 200 Hz. Then the MEG data was band-pass filtered (0.1Hz to 60Hz, zero phase shift FIR filter) and notch filtered at 50 Hz. Independent component analysis (ICA) was used to detect and discard artifacts related to eye blinks, head movements and heat beats. The data were then epoched with the time interval of -500 to 1500 ms relative to the onset of the first fixation in the Gaze Session and with the time interval of -200 to 1000 ms relative to the onset of the image in the Image Session. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eAnalysis of eye-movement data\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eIn the behavioral experiment, eye-movement data were extracted from the 1.5-s image presentation. Data were preprocessed using the \u003cem\u003ecili\u003c/em\u003e module, a python-based tool for detecting and correcting eye blinks. Eye blinks were firstly removed, and fixations were identified based on the velocity threshold of 30 \u0026deg;/s and the acceleration threshold of 8000\u0026deg;/s\u003csup\u003e2\u003c/sup\u003e. Trials without any valid fixation events, and trials with fixation localized beyond the region of the picture were also excluded. To prepare the gaze tracks in the MEG experiment, following the previous study\u003csup\u003e9\u003c/sup\u003e, a fixation was identified as a gaze event if its duration was longer than or equal to 100 ms, while identified as a non-gaze event if its duration was shorter than 100 ms. This non-gaze event was represented by a blank screen in the Gaze Session of the MEG experiment. Then, trials with less than two gazes were excluded. The gaze coordinates were proportionally transformed and co-registered with the screen resolution in the MEG scanner. In the Gaze Session of the MEG experiment, the online fixation events were also identified and analyzed.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eMultivariate classifications were performed on the gaze features to show the distinct patterns between categories. The classification analysis was performed using the scikit-learn package (http://github.com/scikit-learn). Three features were included: the \u003cem\u003ex\u003c/em\u003e, \u003cem\u003ey\u003c/em\u003e coordinates, and the duration of each gaze. The fixation data was parsed in the way that 80% of the data was included as the training set and 20% of the data as the test set. A linear support vector (SVM) classifier was trained and cross-validated based on the saccadic features of the two categories (Face vs. House). The classification was performed for each participant, rendering both individual-level prediction accuracies and the group mean of the accuracies. Permutation-based testing was conducted to assess the statistical significance. For each participant, the classifier was trained with randomly shuffled labels of the two categories, and a permuted accuracy was calculated. This procedure was repeated 100 times, rendering a set of 100 chance accuracies for each participant. For group-level statistical testing, one chance accuracy was selected from each participant and the individual chance accuracies were averaged into a group chance accuracy. This procedure was repeated 10\u003csup\u003e5\u0026nbsp;\u003c/sup\u003etimes, resulting in a set of 10\u003csup\u003e5\u0026nbsp;\u003c/sup\u003egroup chance accuracies. Significance testing was performed by calculating the probability of the unpermuted group mean accuracy across participants in the distribution of the group chance accuracies (one-tailed). The classification was performed both for the fixation data in the behavioral experiment (Face vs. House) and the fixation data in the Gaze Session of the MEG experiment (SF vs. SH, OF vs. OH).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eNote that the SF vs. SH and OF vs. OH classifications in the Gaze Session were not strictly specific to the Face vs. House distinction because no face or house images were presented. To show the specificity, cross-experiment classifications were performed where the fixation patterns in the behavioral experiment were used to train the classifier (Face vs. House), which was then used to predict the fixation categories in the Gaze Session (SF vs. SH, OF vs. OH). Importantly, to assess the sensitivity of the distinct fixation patterns, the cross-experiment classification was performed by varying the number of the fixations (i.e., the first fixation, the 1-2 fixations, the 1-3 fixations, and the 1-4 fixations). Multiple comparisons were corrected with Bonferroni methods.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eRepresentational distance\u003csup\u003e3\u003c/sup\u003e was calculated to assess if the fixation data had a consistent structure specific to the visual category in the Gaze Session. Specifically, the representational distance was quantified by the Euclidean distance between the fixations within each category, assuming that a lower distance indicates a higher fixation structure\u003csup\u003e23\u003c/sup\u003e. We also calculated the representational distance between the categories as the control. The fixation pattern for a specific category was identified as structural if the representational distance within that category was significantly lower than the between-category representational distance.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eEvent-related magnetic field (ERF) analysis of MEG data\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eAfter the pre-processing, the epoched data were averaged over the trials for each condition and each participant. Individual T1-weighted MRIs were segmented with the FreeSurfer software package\u003csup\u003e49\u003c/sup\u003e (http://surfer.nmr.mgh.harvard.edu) and then imported to the Brainstorm (https://neuroimage.usc.edu/brainstorm) for further source-level analysis. The white-gray matter boundary segmented by the FreeSurfer was used as a source space for activity estimation in the cortex. After co-registration between the individual anatomy and MEG sensors, the cortical currents were estimated using a distributed model consisting of 15002 current dipoles from the averaged epochs (evoked activities) using a linear inverse estimator (minimum norm current estimation). The density map was standardized using a Z-score transformation with respect to a noise matrix which was calculated with a 2-minute empty-room recording of the MEG signal. The dipole orientation was constrained to the orthogonality of the white-gray matter boundary of the individual MRIs. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe difference in the estimated cortical current maps was calculated between the following conditions: \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;, \u0026lsquo;(SF \u0026ndash; SH) \u0026ndash; (OF \u0026ndash; OH)\u0026rsquo;. Then the source maps were filtered with a low-pass filter (30Hz), standardized through a z-score baseline normalization (-450 to 0 ms relative to the gaze onset as the baseline, with the first 50 ms of the baseline period being excluded to avoid the edge effect resulted from the low-pass filter), and rectified to retain only absolute values. The source maps were then projected on a standard brain (ICBM152) and spatially smoothed (Full Width at Half Maximum, FWHM=3mm) before group statistical analysis. A two-tailed one-sample Chi\u003csup\u003e2\u003c/sup\u003e test was used for group statistical analysis for each time point and each vertex with the null hypothesis that the difference in variances of the cortical activities between the two conditions was equal to zero\u003csup\u003e50\u003c/sup\u003e. Bonferroni correction was used to solve the multiple comparison problems. The significance threshold was set at \u003cem\u003ep\u003c/em\u003e \u0026lt; 0.05 after corrections.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTo show the brain areas that were involved in the Image Session, the whole-brain source reconstruction was also performed by comparing the ERF signals during face viewing and the ERF signals during house viewing (\u0026lsquo;Face \u0026ndash; House\u0026rsquo;). \u003cem\u003e\u0026nbsp;\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003e\u003cem\u003eModeling the gradient of MEG signal in source space\u003c/em\u003e\u003c/p\u003e\n\u003cp\u003eTo quantify the spatial patterns of MEG signals during the gaze-track following, a five-order polynomial function was used to approximate the data along each of the three spatial dimensions (\u003cem\u003ex\u003c/em\u003e, \u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e coordinates for the data in the source space)\u003csup\u003e24\u003c/sup\u003e. For each time point, we employed a polynomial function \u003cem\u003ep(v) = p\u003csub\u003e0\u003c/sub\u003e + p\u003csub\u003e1\u003c/sub\u003ev + p\u003csub\u003e2\u003c/sub\u003ev\u003csup\u003e2\u0026nbsp;\u003c/sup\u003e+ p\u003csub\u003e3\u003c/sub\u003ev\u003csup\u003e3\u0026nbsp;\u003c/sup\u003e+ p\u003csub\u003e4\u003c/sub\u003ev\u003csup\u003e4\u0026nbsp;\u003c/sup\u003e+ p\u003csub\u003e5\u003c/sub\u003ev\u003csup\u003e5\u003c/sup\u003e\u003c/em\u003e (\u003cem\u003epolyfit\u003c/em\u003e, MATLAB 2022a) to estimate the coordinates along each spatial dimension with the MEG signal difference between conditions (e.g., \u0026lsquo;SF \u0026ndash; SH\u0026rsquo;, \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;). The amplitude of MEG signal was normalized to z-scores across vertexes to avoid an ill-conditioned Vandermonde matrix in model fitting. For each spatial dimension, the model fitted the signal difference in Z-scored amplitudes of each vertex \u003cem\u003ev\u003c/em\u003e to its spatial coordinate (MNI coordinates) across vertices. The quality of the model was quantified by the adjusted \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e, which determined the proportion of variance explained by the model. \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e was adjusted by the number of coefficients. To assess the dynamic spatial gradient of the MEG signal difference, a Jackknife method was used to fit the model and calculate \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u0026nbsp;\u003c/sup\u003efor each of the 3 dimensions. Specifically, one of the participants was excluded and the source-reconstructed MEG signals of the remaining participants were averaged to fit the model. This procedure was iterated across participants. A one-sample \u003cem\u003et\u003c/em\u003e test (one-tail) was used to test if \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e at each time point was higher than the baseline, the time interval of -500 to 0 ms relative to the stimulus onset. Cluster-based permutation was used to resolve the multi-comparison problem across time points. We also calculated the first-order derivatives of the estimated model to test if the spatial gradient had a monotonic increasing or decreasing pattern along a specific dimension. The calculation was performed on the model at the time point with peak \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e, and the evaluation of the derivatives was based on the signal range between the minimum and the maximum value of the MEG amplitude. The spatial gradient was identified as monotonically increasing given the derivative values \u0026gt; 0 and monotonically decreasing given the derivative values \u0026lt; 0. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTo test if the spatial gradient of \u0026lsquo;SF \u0026ndash; SH\u0026rsquo; emerged earlier than the spatial gradient of \u0026lsquo;OF \u0026ndash; OH\u0026rsquo;, cross correlation (\u003cem\u003excorr\u003c/em\u003e, MTALAB 2022a, \u0026lsquo;unbiased\u0026rsquo;, maxlag = 200) was performed on the two \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e time courses to calculate the latency difference. The latency difference was defined as the temporal lag with which the \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e time courses showed maximum correlation between the two time courses across participants. The Bootstrapping method (iteration number = 1000) was used to estimate the 95% confidence interval of latency difference. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eThe same analysis was also performed on the MEG signal difference between Face and House in the Image Session to show the spatial gradient. The analysis was performed on the 0-1000 ms interval during the image presentation (0 denotes the image onset), with the -200-0 ms interval as the baseline.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eTo assess the similarity/dissimilarity of the spatial gradient between the Gaze Session and the Image Session, we performed correlation analyses between the two sessions. Specifically, we performed the model fitting with the group average of source-reconstructed MEG signal difference between conditions (\u0026lsquo;SF \u0026ndash; SH\u0026rsquo; and \u0026lsquo;OF \u0026ndash; OH\u0026rsquo; for the Gaze Session and \u0026lsquo;Face \u0026ndash; House\u0026rsquo; for the Image Session). A Bootstrapping method (iteration number = 1000) was used to estimate the variance of \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e time courses that were calculated between the model and the group average of the MEG data. At the peak time point of \u003cem\u003eR\u003c/em\u003e\u003csup\u003e2\u003c/sup\u003e time courses (\u003cem\u003ey\u003c/em\u003e and \u003cem\u003ez\u003c/em\u003e, respectively), we projected the fitted function \u003cem\u003ep\u003c/em\u003e(v) in the three-dimensional space with the polynomial function. The predicted coordinates of the function \u003cem\u003ep\u003c/em\u003e(v) were sorted according to the amplitude of the MEG signal. Then we calculated the \u003cem\u003ePearson\u003c/em\u003e coefficients of the sorted coordinates between the two sessions.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAcknowledgments\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe thank Dr. Jiayu Zhan and Dr. Liyu Cao for their suggestions on the design of the MEG experiment. This study was supported by the National Natural Science Foundation of China (32271086), and a Mercator Fellowship of the Deutsche Forschungsgemeinschaft (DFG, 450600965) to LW, and a DFG grant (PO548/18-1) to SP. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthor contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConceptualization, L.W., S.P., Z.S.; Methodology, Z.S., L.W.; Investigation, Z.S., L.W.; Formal Analysis, Z.S., L.W.; Visualization, Z.S.; Writing \u0026ndash; Original Draft, S.P., L.W., Z.S.; Writing \u0026ndash; Review \u0026amp; Editing, S.P., L.W., Z.S., X.Z.; Supervision, L.W., X.Z.; Funding Acquisition, L.W., X.Z., S.P.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCompeting interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors declare no competing interests.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eData and code availability\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eData and codes have been deposited at OSF, accession code osf.io/vwfxc/?view_only=55feffd4ae034a968da54048b65927f8.\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eDesimone, R., Albright, T. D., Gross, C. G. \u0026amp; Bruce, C. Stimulus-selective properties of inferior temporal neurons in the macaque. \u003cem\u003eJ Neurosci\u003c/em\u003e \u003cstrong\u003e4\u003c/strong\u003e, 2051\u0026ndash;2062 (1984).\u003c/li\u003e\n\u003cli\u003eKiani, R., Esteky, H., Mirpour, K. \u0026amp; Tanaka, K. Object category structure in response patterns of neuronal population in monkey inferior temporal cortex. \u003cem\u003eJ. Neurophysiol.\u003c/em\u003e \u003cstrong\u003e97\u003c/strong\u003e, 4296\u0026ndash;4309 (2007).\u003c/li\u003e\n\u003cli\u003eKriegeskorte, N. \u003cem\u003eet al.\u003c/em\u003e Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey. \u003cem\u003eNeuron\u003c/em\u003e \u003cstrong\u003e60\u003c/strong\u003e, 1126\u0026ndash;1141 (2008).\u003c/li\u003e\n\u003cli\u003eMajaj, N. J., Hong, H., Solomon, E. A. \u0026amp; DiCarlo, J. J. Simple learned weighted sums of inferior temporal neuronal firing rates accurately predict human core object recognition performance. \u003cem\u003eJ. Neurosci.\u003c/em\u003e \u003cstrong\u003e35\u003c/strong\u003e, 13402\u0026ndash;13418 (2015).\u003c/li\u003e\n\u003cli\u003eUngerleider, L. G. \u0026amp; Mishkin, M. Two cortical visual systems. in Analysis of visual behavior (MIT press, 1982).\u003c/li\u003e\n\u003cli\u003eKravitz, D. J., Saleem, K. S., Baker, C. I., Ungerleider, L. G. \u0026amp; Mishkin, M. The ventral visual pathway: An expanded neural framework for the processing of object quality. \u003cem\u003eTrends Cogn. Sci.\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, 26\u0026ndash;49 (2013).\u003c/li\u003e\n\u003cli\u003eKanwisher, N., McDermott, J. \u0026amp; Chun, M. M. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception. \u003cem\u003eJ. Neurosci.\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, 4302\u0026ndash;4311 (1997).\u003c/li\u003e\n\u003cli\u003eEpstein, R. \u0026amp; Kanwisher, N. A cortical representation of the local visual environment. \u003cem\u003eNature\u003c/em\u003e \u003cstrong\u003e392\u003c/strong\u003e, 598\u0026ndash;601 (1998).\u003c/li\u003e\n\u003cli\u003eWang, L., Baumgartner, F., Kaule, F. R., Hanke, M. \u0026amp; Pollmann, S. Individual face- and house-related eye movement patterns distinctively activate FFA and PPA. \u003cem\u003eNat. Commun.\u003c/em\u003e \u003cstrong\u003e10\u003c/strong\u003e, 1\u0026ndash;16 (2019).\u003c/li\u003e\n\u003cli\u003ePeterson, M. F. \u0026amp; Eckstein, M. P. Individual differences in eye movements during face identification reflect observer-specific optimal points of fixation. \u003cem\u003ePhysiol. Sci.\u003c/em\u003e \u003cstrong\u003e27\u003c/strong\u003e, 1216\u0026ndash;1225 (2013).\u003c/li\u003e\n\u003cli\u003eLandi, S. M. \u0026amp; Freiwald, W. A. Two areas for familiar face recognition in the primate brain. \u003cem\u003eScience.\u003c/em\u003e \u003cstrong\u003e357\u003c/strong\u003e, 591\u0026ndash;595 (2017).\u003c/li\u003e\n\u003cli\u003eTsao, D. Y., Moeller, S. \u0026amp; Freiwald, W. A. Comparing face patch systems in macaques and humans. \u003cem\u003eProc. Natl. Acad. Sci. U. S. A.\u003c/em\u003e \u003cstrong\u003e105\u003c/strong\u003e, 19514\u0026ndash;19519 (2008).\u003c/li\u003e\n\u003cli\u003eTsao, D. Y., Schweers, N., Moeller, S. \u0026amp; Freiwald, W. A. Patches of face-selective cortex in the macaque frontal lobe. \u003cem\u003eNat. Neurosci.\u003c/em\u003e \u003cstrong\u003e11\u003c/strong\u003e, 877\u0026ndash;879 (2008).\u003c/li\u003e\n\u003cli\u003eDal Monte, O. \u003cem\u003eet al.\u003c/em\u003e Widespread implementations of interactive social gaze neurons in the primate prefrontal-amygdala networks. \u003cem\u003eNeuron\u003c/em\u003e \u003cstrong\u003e110\u003c/strong\u003e, 2183-2197.e7 (2022).\u003c/li\u003e\n\u003cli\u003eFan, S., Dal Monte, O., Nair, A. R., Fagan, N. A. \u0026amp; Chang, S. W. C. Closed-loop microstimulations of the orbitofrontal cortex during real-life gaze interaction enhance dynamic social attention. \u003cem\u003eNeuron\u003c/em\u003e \u003cstrong\u003e112\u003c/strong\u003e, 2631-2644.e6 (2024).\u003c/li\u003e\n\u003cli\u003eVoss, J. L. \u003cem\u003eet al.\u003c/em\u003e Spontaneous revisitation during visual exploration as a link among strategic behavior, learning, and the hippocampus. \u003cem\u003eProc. Natl. Acad. Sci. U. S. A.\u003c/em\u003e \u003cstrong\u003e108\u003c/strong\u003e, E402\u0026ndash;E409 (2011).\u003c/li\u003e\n\u003cli\u003eRyan, J. D., Shen, K. \u0026amp; Liu, Z. X. The intersection between the oculomotor and hippocampal memory systems: empirical developments and clinical implications. \u003cem\u003eAnn. N. Y. Acad. Sci.\u003c/em\u003e \u003cstrong\u003e1464\u003c/strong\u003e, 115\u0026ndash;141 (2020).\u003c/li\u003e\n\u003cli\u003eLiu, Z.-X., Rosenbaum, R. S. \u0026amp; Ryan, J. D. Restricting Visual Exploration Directly Impedes Neural Activity, Functional Connectivity, and Memory. \u003cem\u003eCereb. Cortex Commun.\u003c/em\u003e \u003cstrong\u003e1\u003c/strong\u003e, (2020).\u003c/li\u003e\n\u003cli\u003eSummerfield, C. \u003cem\u003eet al.\u003c/em\u003e Predictive Codes for Forthcoming Perception in the Frontal Cortex. \u003cem\u003eScience.\u003c/em\u003e \u003cstrong\u003e314\u003c/strong\u003e, 1311\u0026ndash;1314 (2006).\u003c/li\u003e\n\u003cli\u003eBar, M. \u003cem\u003eet al.\u003c/em\u003e Top-down facilitation of visual recognition. \u003cem\u003eProc. Natl. Acad. Sci. U. S. A.\u003c/em\u003e \u003cstrong\u003e103\u003c/strong\u003e, 449\u0026ndash;454 (2006).\u003c/li\u003e\n\u003cli\u003eDuan, Y., Zhan, J., Gross, J., Ince, R. A. A. \u0026amp; Schyns, P. G. Pre-frontal cortex guides dimension-reducing transformations in the occipito-ventral pathway for categorization behaviors. \u003cem\u003eCurr. Biol.\u003c/em\u003e \u003cstrong\u003e34\u003c/strong\u003e, 3392-3404.e5 (2024).\u003c/li\u003e\n\u003cli\u003eFreedman, D. J., Riesenhuber, M., Poggio, T. \u0026amp; Miller, E. K. Categorical representation of visual stimuli in the primate prefrontal cortex. \u003cem\u003eScience.\u003c/em\u003e \u003cstrong\u003e291\u003c/strong\u003e, 312\u0026ndash;316 (2001).\u003c/li\u003e\n\u003cli\u003eWang, Z., Meghanathan, R. N., Pollmann, S. \u0026amp; Wang, L. Common structure of saccades and microsaccades in visual perception. \u003cem\u003eJ. Vis.\u003c/em\u003e \u003cstrong\u003e24\u003c/strong\u003e, 1\u0026ndash;13 (2024).\u003c/li\u003e\n\u003cli\u003eZalta, A., Large, E. W., Sch\u0026ouml;n, D. \u0026amp; Morillon, B. Neural dynamics of predictive timing and motor engagement in music listening. \u003cem\u003eSci. Adv.\u003c/em\u003e \u003cstrong\u003e10\u003c/strong\u003e, eadi2525 (2024).\u003c/li\u003e\n\u003cli\u003eWang, X.-J. Probabilistic Decision Making by Slow Reverberation in Cortical Circuits. \u003cem\u003eNeuron\u003c/em\u003e \u003cstrong\u003e36\u003c/strong\u003e, 955\u0026ndash;968 (2002).\u003c/li\u003e\n\u003cli\u003eNeary, D., Snowden, J. S., Northen, B. \u0026amp; Goulding, P. Dementia of frontal lobe type. \u003cem\u003eJ. Neurol. Neurosurg. Psychiatry\u003c/em\u003e 353\u0026ndash;361 (1988).\u003c/li\u003e\n\u003cli\u003eRouse, M. A., Binney, R. J., Patterson, K., Rowe, J. B. \u0026amp; Lambon Ralph, M. A. A neuroanatomical and cognitive model of impaired social behaviour in frontotemporal dementia. \u003cem\u003eBrain\u003c/em\u003e \u003cstrong\u003e147\u003c/strong\u003e, 1953\u0026ndash;1966 (2024).\u003c/li\u003e\n\u003cli\u003eHannula, D. E., Ryan, J. D., Tranel, D. \u0026amp; Cohen, N. J. Rapid onset relational memory effects are evident in eye movement behavior, but not in hippocampal amnesia. \u003cem\u003eJ. Cogn. Neurosci.\u003c/em\u003e \u003cstrong\u003e19\u003c/strong\u003e, 1690\u0026ndash;1705 (2007).\u003c/li\u003e\n\u003cli\u003eRyals, A. J., Wang, J. X., Polnaszek, K. L. \u0026amp; Voss, J. L. Hippocampal contribution to implicit configuration memory expressed via eye movements during scene exploration. \u003cem\u003eHippocampus\u003c/em\u003e \u003cstrong\u003e25\u003c/strong\u003e, 1028\u0026ndash;1041 (2015).\u003c/li\u003e\n\u003cli\u003ePollmann, S. \u0026amp; Schneider, W. X. Working memory and active sampling of the environment: Medial temporal contributions. in \u003cem\u003eHandbook of Clinical Neurology\u003c/em\u003e \u003cstrong\u003e187\u003c/strong\u003e, (Elsevier B.V., 2022).\u003c/li\u003e\n\u003cli\u003eLiu, Z. X., Shen, K., Olsen, R. K. \u0026amp; Ryan, J. D. Visual sampling predicts hippocampal activity. \u003cem\u003eJ. Neurosci.\u003c/em\u003e \u003cstrong\u003e37\u003c/strong\u003e, 599\u0026ndash;609 (2017).\u003c/li\u003e\n\u003cli\u003eRanganath, C. \u0026amp; Ritchey, M. Two cortical systems for memory-guided behaviour. \u003cem\u003eNat. Rev. Neurosci.\u003c/em\u003e \u003cstrong\u003e13\u003c/strong\u003e, 713\u0026ndash;726 (2012).\u003c/li\u003e\n\u003cli\u003eContier, O., Baker, C. I. \u0026amp; Hebart, M. N. Distributed representations of behaviour-derived object dimensions in the human visual system. \u003cem\u003eNat. Hum. Behav.\u003c/em\u003e \u003cstrong\u003e8\u003c/strong\u003e, 2179\u0026ndash;2193 (2024).\u003c/li\u003e\n\u003cli\u003ePrinz, W. A Common Coding Approach to Perception and Action BT - Relationships Between Perception and Action: Current Approaches (Springer Berlin Heidelberg, 1990).\u003c/li\u003e\n\u003cli\u003eOlivers, C. N. L. \u0026amp; Roelfsema, P. R. Attention for action in visual working memory. \u003cem\u003eCortex\u003c/em\u003e \u003cstrong\u003e131\u003c/strong\u003e, 179\u0026ndash;194 (2020).\u003c/li\u003e\n\u003cli\u003eVan Ede, F. \u0026amp; Nobre, A. C. Turning Attention Inside Out: How Working Memory Serves Behavior. \u003cem\u003eAnnu. Rev. Psychol.\u003c/em\u003e \u003cstrong\u003e74\u003c/strong\u003e, 137\u0026ndash;165 (2023).\u003c/li\u003e\n\u003cli\u003eLand, M. \u0026amp; Tatler, B. Looking and Acting: Vision and eye movements in natural behaviour (Oxford University Press, 2009).\u003c/li\u003e\n\u003cli\u003ePaus, T. Location and function of the human frontal eye-field: A selective review. \u003cem\u003eNeuropsychologia\u003c/em\u003e \u003cstrong\u003e34\u003c/strong\u003e, 475\u0026ndash;483 (1996).\u003c/li\u003e\n\u003cli\u003eAndersen, R. A., Brotchie, P. R. \u0026amp; Mazzoni, P. Evidence for the lateral intraparietal area as the parietal eye field. \u003cem\u003eCurr. Opin. Neurobiol.\u003c/em\u003e \u003cstrong\u003e2\u003c/strong\u003e, 840\u0026ndash;846 (1992).\u003c/li\u003e\n\u003cli\u003eBruce, C. J. \u0026amp; Goldberg, M. E. Physiology of the frontal eye fields. \u003cem\u003eTrends Neurosci.\u003c/em\u003e \u003cstrong\u003e7\u003c/strong\u003e, 436\u0026ndash;441 (1984).\u003c/li\u003e\n\u003cli\u003eCoiner, B. \u003cem\u003eet al.\u003c/em\u003e Functional neuroanatomy of the human eye movement network: a review and atlas. \u003cem\u003eBrain Struct. Funct.\u003c/em\u003e \u003cstrong\u003e224\u003c/strong\u003e, 2603\u0026ndash;2617 (2019).\u003c/li\u003e\n\u003cli\u003eHopfinger, J. B., Buonocore, M. H. \u0026amp; Mangun, G. R. The neural mechanisms of top-down attentional control. \u003cem\u003eNat. Neurosci.\u003c/em\u003e \u003cstrong\u003e3\u003c/strong\u003e, 284\u0026ndash;291 (2000).\u003c/li\u003e\n\u003cli\u003ePollmann, S. Frontopolar Resource Allocation in Human and Nonhuman Primates. \u003cem\u003eTrends Cogn. Sci.\u003c/em\u003e \u003cstrong\u003e20\u003c/strong\u003e, 84\u0026ndash;86 (2016).\u003c/li\u003e\n\u003cli\u003eGallivan, J. P. \u0026amp; Goodale, M. A. The dorsal \u0026ldquo;action\u0026rdquo; pathway. in \u003cem\u003eHandbook of Clinical Neurology\u003c/em\u003e \u003cstrong\u003e151\u003c/strong\u003e, (2018).\u003c/li\u003e\n\u003cli\u003eFaul, F., Erdfelder, E., Lang, A. G. \u0026amp; Buchner, A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. \u003cem\u003eBehav. Res. Methods\u003c/em\u003e \u003cstrong\u003e39\u003c/strong\u003e, 175\u0026ndash;191 (2007).\u003c/li\u003e\n\u003cli\u003eQuax, S. C., Dijkstra, N., van Staveren, M. J., Bosch, S. E. \u0026amp; van Gerven, M. A. J. Eye movements explain decodability during perception and cued attention in MEG. \u003cem\u003eNeuroimage\u003c/em\u003e \u003cstrong\u003e195\u003c/strong\u003e, 444\u0026ndash;453 (2019).\u003c/li\u003e\n\u003cli\u003eTaulu, S. \u0026amp; Simola, J. Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. \u003cem\u003ePhys. Med. Biol.\u003c/em\u003e \u003cstrong\u003e51\u003c/strong\u003e, 1759\u0026ndash;1768 (2006).\u003c/li\u003e\n\u003cli\u003eTadel, F., Baillet, S., Mosher, J. C., Pantazis, D. \u0026amp; Leahy, R. M. Brainstorm: A user-friendly application for MEG/EEG analysis. \u003cem\u003eComput. Intell. Neurosci.\u003c/em\u003e \u003cstrong\u003e2011\u003c/strong\u003e, (2011).\u003c/li\u003e\n\u003cli\u003eFischl, B. FreeSurfer. \u003cem\u003eNeuroimage\u003c/em\u003e \u003cstrong\u003e62\u003c/strong\u003e, 774\u0026ndash;781 (2012).\u003c/li\u003e\n\u003cli\u003eSandberg, K. \u003cem\u003eet al.\u003c/em\u003e Distinct MEG correlates of conscious experience, perceptual reversals and stabilization during binocular rivalry. \u003cem\u003eNeuroimage\u003c/em\u003e \u003cstrong\u003e100\u003c/strong\u003e, 161\u0026ndash;175 (2014).\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"nature-portfolio","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"","title":"Nature Portfolio","twitterHandle":"","acdcEnabled":false,"dfaEnabled":false,"editorialSystem":"ejp","reportingPortfolio":"","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"eye movements, human ventral pathway, object representation, face, magnetoencephalography (MEG)","lastPublishedDoi":"10.21203/rs.3.rs-5835383/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-5835383/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"Multiple brain areas along the ventral pathway have been known to represent face images. \r\nHere, in a magnetoencephalography (MEG) experiment, we show dynamic representations of face-related eye movements in the ventral pathway in the absence of image perception. Participants followed a dot presented on a uniform background, the movement of which represented gaze tracks acquired previously during their free-viewing of face and house pictures. We found a dominant role of the ventral stream in representing face-related gaze tracks, starting from the orbitofrontal cortex (OFC) and anterior temporal lobe (ATL), and extending to the medial temporal and ventral occipitotemporal cortex. Our findings show that the ventral pathway represents the gaze tracks used to explore faces, by which top-down prediction of face category in OFC and ATL may guide, via the medial temporal cortex or directly, face perception in the ventral occipitotemporal cortex.","manuscriptTitle":"Dynamic face-related eye movement representations in the human ventral pathway","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-04-21 10:12:13","doi":"10.21203/rs.3.rs-5835383/v1","editorialEvents":[],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"communications-biology","isNatureJournal":true,"hasQc":false,"allowDirectSubmit":false,"externalIdentity":"commsbio","sideBox":"Learn more about [Communications Biology](http://www.nature.com/commsbio/)","snPcode":"","submissionUrl":"","title":"Communications Biology","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"ejp","reportingPortfolio":"Communications Series","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"d55108c6-12d1-4da4-af7c-3b43f6467a9e","owner":[],"postedDate":"April 21st, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[{"id":43597590,"name":"Biological sciences/Neuroscience/Cognitive neuroscience"},{"id":43597591,"name":"Biological sciences/Neuroscience/Visual system/Object vision"}],"tags":[],"updatedAt":"2025-11-25T08:12:51+00:00","versionOfRecord":{"articleIdentity":"rs-5835383","link":"https://doi.org/10.1038/s42003-025-09039-y","journal":{"identity":"communications-biology","isVorOnly":false,"title":"Communications Biology"},"publishedOn":"2025-11-24 05:00:00","publishedOnDateReadable":"November 24th, 2025"},"versionCreatedAt":"2025-04-21 10:12:13","video":"","vorDoi":"10.1038/s42003-025-09039-y","vorDoiUrl":"https://doi.org/10.1038/s42003-025-09039-y","workflowStages":[]},"version":"v1","identity":"rs-5835383","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-5835383","identity":"rs-5835383","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00