Optimizing cyclopean stimuli for the evaluation of stereo vision by steady-state visual evoked potentials
János Radó, Eszter Mikó–Baráth, Péter Hegyi, Vanda A. Nemes, Gábor Jandó, Péter Buzás

TL;DR
This study optimized visual stimuli to measure stereo vision using brainwave responses, finding that a specific stimulus frequency worked best with minimal electrodes.
Contribution
The study introduces optimized cyclopean stimuli for reliable and efficient ssVEP-based assessment of stereopsis.
Findings
DRDC at 1.875 cps evoked significant ssVEP responses in 93% of participants using occipital electrodes.
Monocular control measurements confirmed stereo-specificity of the responses.
Optimized DRDC stimuli support clinical assessments for non-verbal or pediatric populations.
Abstract
This study aimed to optimize dynamic random dot correlogram (DRDC) and stereogram (DRDS) stimuli to evoke steady-state visual evoked potentials (ssVEP) on multiple EEG channels for the objective assessment of stereopsis. EEG recordings were conducted on 22 healthy adults (mean age: 30.2 ± 5.8 years) while viewing cyclopean and control stimuli. DRDC and DRDS were presented at three temporal frequencies (0.9375, 1.875, and 3.75 cycles per second, cps) using anaglyphic channel separation. The ssVEP responses were analyzed using T2circ statistical test to determine the most effective stimulus for eliciting significant cortical activity. DRDC at 1.875 cps evoked significant ssVEP responses in 93% of participants on at least one occipital electrode (O1, Oz, O2) and in 100% when including parietal-occipital electrodes. DRDS at similar frequencies also produced robust responses but required…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
|---|---|---|---|---|---|---|---|---|---|---|
| Stimulus | No. cases | Median No. epochs | Harmonic | Median of | IQR of | Median of T2circ values | IQR of T2circ values | No. significant cases | No. significant cases. for f1 or f2 | Effectiveness |
| DRDC 0.9375 cps | 21 | 30 | f1 | 0.00063 | 0.00164 | 8.78 | 6.87 | 15 | 15 | 71% |
| f2 | 0.00765 | 0.00316 | 5.36 | 0.50 | 3 | |||||
| DRDC 1.875 cps | 22 | 30 | f1 | 0.00020 | 0.00132 | 9.94 | 8.98 | 21 | 22 | 100% |
| f2 | 0.00099 | 0.00348 | 7.90 | 4.21 | 11 | |||||
| DRDC 3.75 cps | 22 | 29 | f1 | 8.0714 × 10–6 | 0.00133 | 14.97 | 15.41 | 18 | 19 | 86% |
| f2 | 0.00036 | 0.00197 | 10.27 | 9.28 | 14 | |||||
| DRDS 0.9375 cps | 21 | 34 | f1 | 0.00422 | 0.00495 | 6.30 | 1.36 | 2 | 11 | 52% |
| f2 | 0.00033 | 0.00185 | 9.16 | 4.74 | 10 | |||||
| DRDS 1.875 cps | 22 | 29 | f1 | 0.00241 | 0.00216 | 6.86 | 1.013 | 2 | 18 | 82% |
| f2 | 0.00004 | 0.00112 | 12.11 | 11.402 | 18 | |||||
| DRDS 3.75 cps | 22 | 33 | f1 | 0.00208 | 0.00470 | 6.97 | 2.603 | 5 | 21 | 95% |
| f2 | 0.00018 | 0.00199 | 10.39 | 14.69 | 20 |
| Stimulus | No. rec. binocular_ monocular | AUC | 95% CI | Criterion | Sensitivity at criterion (%) | Specificity at criterion (%) | Effectiveness (Rado et al.) |
|---|---|---|---|---|---|---|---|
| DRDC 0.9375 cps | 21_14 | 0.952 | 0.822–0.996 | 0.0225 | 100 | 85.7 | 71% |
| DRDC 1.875 cps | 22_19 | 0.962 | 0.850–0.997 | 0.0056 | 94.7 | 100 | 100% |
| DRDC 3.75 cps | 22_13 | 0.86 | 0.701–0.954 | 0.027 | 69.2 | 95.5 | 86% |
| DRDS 0.9375 cps | 21_14 | 0.823 | 0.657–0.931 | 0.0183 | 92.9 | 61.9 | 52% |
| DRDS 1.875 cps | 22_17 | 0.909 | 0.773–0.977 | 0.0019 | 100 | 72.7 | 82% |
| DRDS 3.75 cps | 22_10 | 0.991 | 0.874–1.000 | 0.0100 | 100 | 95.5 | 95% |
- —Ministry for Innovation and Technology in Hungary
- —University of Pécs
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual perception and processing mechanisms · EEG and Brain-Computer Interfaces · Neural dynamics and brain function
Introduction
The goal of the present study was to develop a protocol for visually evoked potentials (VEPs) that can accurately determine the presence of stereopsis in healthy adult participants with normal binocular vision. To achieve this, we used stimuli that selectively activate the neural mechanisms underlying binocular depth perception, also called stereopsis. Based on the scientific literature of several decades [1–3], the most specific stimuli for this purpose are cyclopean pattern, such as dynamic random dot stereograms (DRDS) and correlograms (DRDC). These eliminate monocular form cues and rely on binocular information, using a dynamic field of random dots. DRDS contain true binocular disparity, resulting in a clear and localized sense of depth. DRDC, in contrast, provide depth cues through interocular correlation of dot positions without binocular disparity, leading to a more diffuse depth percept. Despite this difference, both stimuli are widely used in stereopsis research due to their ability to selectively engage disparity-sensitive neural mechanisms [2].
When presented dichoptically (e.g., via anaglyph red-green glasses), these patterns evoke a three-dimensional percept in observers with intact stereopsis, and appear as random flickering dots under monocular viewing. Proper channel separation is therefore essential for effective stimulus delivery. Measuring behavioral or electrophysiological responses to DRDS and DRDC have become the standard when objective evaluation of stereoscopic perception is required and verbal responses are not available or not reliable, such as in animals [4, 5], infants or children [6–8]. A missing response to such stimuli indicates diminished or missing stereopsis that can be due to amblyopia or the presence of amblyogenic conditions [9–11]. Stereopsis can be impaired by abnormal visual experiences due to incoherent retinal images, often caused by monocular blur from refractive errors, strabismus, or unilateral eye disease. Persistent binocular vision disturbances in childhood may lead to amblyopia. Beyond optical and oculomotor factors, stereopsis deficits may also result from neural processing impairments within the visual pathways or cortex. While the lack of response to random dot stereo stimuli primarily detects impaired or absent stereopsis, it may also indicate broader visual developmental abnormalities [12]. An objective electrophysiological marker for stereo vision disturbances would therefore be especially useful for the evaluation of pediatric patients and could be integrated with other VEP protocols.
Various visual depth percepts, such as spatial surfaces or temporal changes in depth can be generated by using DRDS or DRDC [2, 13]. The wide range of possible depth illusions raises the question of which stimuli are most suitable for eliciting objectively measurable electrophysiological responses. It is important to consider that the perceptual salience of the depth encoded features is not necessarily a good indicator of the amplitude or reliability of the evoked electrical responses. Even if a stereogram is clearly visible, it may not evoke proper VEP or trigger enough neuronal activity to surpass statistical thresholds. The reliability of the VEP response can be further compromised when the examinee is moving or inattentive, which is sometimes the case with children.
Circular T-squared (T^2^circ) statistics [14] and other bivariate methods [15] are beneficial in the evaluation of steady-state VEPs as they provide a robust approach for detecting and analyzing periodic signals in the presence of noise and naturally occurring background EEG activity. These statistical tests help in distinguishing significant VEP responses and improve the accuracy of diagnosing visual pathway integrity [8, 16].
DRDC and DRDS are still a large family of possible stimuli that must be specified by a range of parameters defining the random dot carrier and the spatial and temporal characteristics of the disparate features responsible for the stereoscopic effect. There is no consensus in scientific literature on the most effective set of parameters that evokes a VEP response. Based on the broader scientific literature (see Table 3) [2, 6, 9, 17, 18], our previous works [7, 19–21], and pilot measurements preceding the current study (see Methods and SI Table 1), two specific stimulus types, incorporating periodic temporal changes at three temporal frequencies, were selected for further investigation.
Here, we sought to identify the stimulus that produces a statistically significant and reproducible response in the largest number of participants with intact stereopsis.
- Which stimulus type, DRDS or DRDC, elicits the most robust cortical response in VEP measurements?
- Which temporal frequency (1–4 cycles per second) is optimal for eliciting a reliable response?
- Can stereopsis be consistently and objectively identified through electrophysiological recordings in individuals with psychophysically confirmed stereopsis?
Methods
Participants
In our cross-sectional study, we examined 22 healthy volunteers aged 22–42 years (mean age: 30.2 ± 5.8 years, SD, 68% female) with normal or corrected to normal visual acuity and no history of eye disease and surgery. The study was approved by the Regional Research Ethics Committee at the University of Pécs (registration number: 5638-PTE), and all participants provided both verbal and written informed consent after receiving detailed information about the study.
Visual acuity was assessed at the stimulus viewing distance of 100 cm using a custom-designed acuity chart. The participants' visual acuity was − 0.09 ± 0.07 (SD) logMAR. When needed, participants wore their best-corrected prescription throughout the whole testing process. Each participant underwent the TNO stereo vision test (Lameris Ootech bv, Belgium, 6716 WC Ede, Da Vincilaan 7) following the manufacturer’s instructions, and all demonstrated a stereoacuity of at least 120" (seconds of arc viewing angle). Prior to VEP measurements, we ensured that all participants could perceive the depth-encoded features in the cyclopean stimuli. An additional inclusion criterion was a normal ssVEP response to luminance-defined checkerboard (see Results).
Pilot measurements
In a pilot study, we compared ssVEPs evoked by various DRDC and DRDS stimuli systematically to measure cyclopean vision in healthy adults (Radó et al., International Neuroscience Conference, Pécs 2024, p. 211. P6.25 and Radó et al., ECVP 2024; E-Abstract p. 112, P268). Altogether, 23 participants and 31 measurement sessions were included. This pilot study was an independent measurement series, separate from the main study. We optimized cyclopean stimuli by systematically refining the spatial pattern, spatial and temporal frequency, dot size and refresh rates to identify those that produced the strongest and most reliable ssVEP responses (SI Table 1). When analyzing the 3 occipital electrodes only, the most effective stimuli featured 3.75 cps (cycles per second) DRDC with full-field or checkerboard reversal appearance, a 3.5' (minutes of arc) dot size and 60 Hz refresh rate.
Stimuli and experimental setup
Stimuli were generated using MATLAB (The MathWorks, Inc., USA) with Psychophysics Toolbox version 3 (PTB-3) extensions [22, 23]. The stimuli were displayed on a CRT monitor (Samsung Syncmaster 957 MB S) with a dot size of 3.5’, and a dot refresh rate of 60 Hz synchronized to the frame refresh of the monitor. The viewing distance was 100 cm, covering a visual field of 1194’ × 818’. A yellow circular fixation point (41’ diameter) was presented at the center of the screen. Measurements were conducted in a darkened room.
The experiment included 1) high-contrast, non-stereo, steady-state checkerboard reversal stimuli and 2) cyclopean stimuli as follows:
First, non-stereo checkerboard reversal with check sizes of 60’ and 15’ were used to evaluate the quality of ssVEP response from each participant. These stimuli were presented according to the ISCEV standard protocol [24], except for the temporal frequency, which was set to 3.75 cps (7.5 reversals per second, rps). This frequency was chosen to match the reversal rate of the stereo stimuli and ensure steady-state EEG responses. The average luminance was 40.5 cd/m^2^, and the Michelson contrast was 99.1%.
Second, two types of cyclopean stimuli were used: dynamic random dot correlograms (DRDC) [2] and dynamic random dot stereograms (DRDS) [13]. In DRDC stimuli, correlated and anticorrelated states alternated, where the correlated state was characterized by equal contrast of the dots in the two eyes (correlation coefficient + 1), whereas the anticorrelated state they had opposite contrast (correlation coefficient − 1). As these stimuli occupied the entire screen and apart from the random dots, no other spatial pattern could be discerned (Fig. 1 Panel A), we call it a full-field stimulus. To minimize interocular crosstalk, dot colors were calibrated according to prior research [25]. The average luminance was 2.17 cd/m^2^, and the Michelson contrast was 19.3% when viewed through anaglyphic glasses. We measured the luminance of stimuli by using an ILT1700 photometer (International Light Technologies, Peabody, MA) equipped with a SED033 detector, Y2 filter and R input optic.Fig. 1. Representative example of averaged multi-channel VEP responses evoked by a dynamic random dot correlogram (DRDC). A Illustration of the different phases (each displayed for 266.7 ms) of the full-field DRDC stimulus alternating at 1.875 cps frequency. The first and third images illustrate the correlated phase featuring yellow and black dots that appear as identical patterns for both eyes when viewed through anaglyph glasses. The second and fourth images correspond to the anticorrelated phase, because they are composed of red and green dots resulting in the perception of opposite contrast patterns by the two eyes. B Averaged VEP responses for the 10 EEG channels recorded in participant S215. Solid lines represent significant responses based on T^2^circ statistics, and dotted lines indicate nonsignificant responses
For DRDS stimuli, disparity-defined checkerboards were presented using a single pair of reversing disparities (± 7’) with a check size of 168’ (Fig. 2). The red dots were set to maximum achievable luminance, while the green dots were adjusted to match this level, resulting in an average luminance of 1.63 cd/m^2^ and a Michelson contrast of 91.7% through the anaglyphic glasses.Fig. 2. Representative example of averaged multi-channel VEP responses evoked by a dynamic random dot stereogram (DRDS) checkerboard stimulus. A Illustration of the different phases of the DRDS stimulus alternating at 1.875 cps frequency. Each frame contains red, green, yellow and black dots in equal proportions. In the above illustration, the bright and dark dots are arranged to create the illusion of a 6 × 6 checkerboard pattern, with squares appearing either in front of or behind the plane of fixation. The disparity of the squares alternated its sign from phase to phase every 266.7 ms, creating the illusion of depth reversal. B Averaged VEP responses for the 10 EEG channels recorded in participant S227. Solid lines represent significant responses based on T^2^circ statistics, and dotted lines indicate nonsignificant responses
Dot size was 3.5’ both for DRDC and DRDC. Anaglyphic channel separation and red-green filter glasses belonging to the TNO stereo test were used. Both stimuli were presented at three temporal frequencies: 0.9375, 1.875, and 3.75 cps. For instance, in a 1.875 cps DRDC stimulus, the cycle duration was 532 ms, comprising 266 ms each of the correlated and anticorrelated states.
Study design
Each participant underwent a single measurement session, during which VEPs were recorded in response to the following stimuli:
- Monocular, luminance-determined checkerboards (60’ and 15’).
- DRDC and DRDS stimuli at temporal frequencies 0.9375, 1.875, and 3.75 cps (Table 1) under binocular, anaglyphic viewing conditions. (Figs. 1 and 2 Panel A and videos in the Electronic Supplementary material).
- Control measurements were performed for each of the six cyclopean stimuli under the following conditions: monocular anaglyphic viewing with the right eye (green filter), monocular anaglyphic viewing with the left eye (red filter), and binocular viewing without anaglyphic glasses. In the latter case, channel separation did not occur, indicating that the stimulus was not stereoscopic; instead, both eyes received the same stimulus simultaneously. These controls were compared by T^2^circ test to actual measurements to verify the dependency of responses on intact stereo vision.
Each cyclopean VEP recording lasted a minimum of 90 s, while recordings with luminance-determined checkerboards lasted 30 s. For each participants, we ensured that the order of stimulus presentation was randomized within the 45-min session, and the total number of right-eye, left-eye, and no-glasses control measurements were balanced throughout the entire study. Rest periods were provided as needed or when attention declined, as indicated by the presence of alpha waves or increased blinking in the EEG, monitored in real time by trained research staff through continuous visual inspection of the EEG recordings. Alpha activity was identified primarily over posterior scalp regions, while increased blink frequency was noted via frontal EEG channels. The duration and frequency of rest periods were not pre-specified but were tailored to individual needs.
Data acquisition and analysis
EEG recordings were performed using the BrainVision Recorder software (Brain Products GmbH, Germany) with an actiCHamp EEG system (Brain Products GmbH) equipped with 64 channels and actiCAP slim electrodes. Electrode placement followed the 10–20 system and included the ground electrode (Cz), reference electrode (FCz), and 10 measurement electrodes (O1, Oz, O2, P7, P3, Pz, P4, P8, Fp1, Fp2) relevant to this experiment. Data were recorded at a sampling frequency of 10 kHz and filtered with a low cutoff at 0.05 Hz and a high cutoff at 1250 Hz.
Statistical analysis was performed using BrainVision Analyzer 2 (Brain Products GmbH, Germany) in conjunction with a custom MATLAB script, which can be found in the Supplementary Material (tcirc2_matlab_script.txt). To analyze evoked responses, the data were resampled to 960 Hz. This ensured an exact integer number of samples per cycle at the given stimulus frequency (0.9375 cps and its integer multiples), which is crucial for the subsequent fast Fourier transformation (FFT) to prevent spectral leakage during analysis [26].
During preprocessing, we applied a frequency filter with a high-pass filter set at 0.05 Hz and a low-pass filter set at 45 Hz cutoff. We excluded recording artefacts caused by eye movements or other sources, as follows. An automatic artefact rejection protocol was performed, whereby events where amplitude changes within a 200-ms EEG segment exceeded 100 μV were rejected along with the preceding 400 ms and following 200 ms segments. This effectively removed most of the blink artefacts. Further disturbances caused by potential muscle movements or other sources were removed manually by two of the authors (RJ and MBE).
For VEPs evoked by non-stereo checkerboard reversal stimuli, peak latencies were extracted on the Oz electrode using the automatic peak detection algorithm of BrainVision Analyzer 2 (main positive peak between 70 and 120 ms), and the results were verified through visual inspection. The inclusion of non-stereo checkerboard data served two purposes: First, to electrophysiologically confirm that participants had normal vision and typical EEG/VEP responses. Second, to assess signal quality in real time using a stimulus known to evoke a robust and easily identifiable response, even in the absence of statistical analysis.
For the analysis of ssVEPs evoked by cyclopean stimuli, the EEG data were segmented into non-overlapping epochs of 533 ms for the high-contrast checkerboard stimuli and 2133 ms for the stereo stimuli. Datasets containing fewer than 15 valid epochs following artifact removal were excluded from further analysis both in the case of the stereo and the control measurements. This number of epochs was expected to ensure moderate to high statistical power of the subsequent T^2^circ statistical test [14].
To determine the presence of stimulus-related responses at each channel, we began by performing FFT on each epoch and extracting the Fourier components at the fundamental of the stimulus and its second harmonic. Throughout this paper, the fundamental frequency of the stimulus in cycles per second will be referred to as the 1 st harmonic and denoted as f1, whereas twice the fundamental frequency will be called the 2nd harmonic and denoted as f2.
The Fourier components obtained for each channel and response frequency were represented as sets of complex numbers, which can also be interpreted as two-dimensional vectors. The T^2^circ statistical test [14] was then performed with a significance threshold of p < 0.01. The T^2^circ test evaluates whether the average of a set of two-dimensional vectors differs significantly from the null vector. The null hypothesis for the T^2^circ test states that the population mean vector in the complex plane is zero, i.e., that there is no consistent phase-locked response across epochs. The two-sample version of the T^2^circ test was employed to compare vectors derived from a cyclopean ssVEP with those obtained from a control measurement using the same stimulus (Fig. 5).
Figure 3 presents examples of T^2^circ test results for a specific recording across various channels, illustrating the typical spatial distribution of Fourier component vectors associated with epochs significant or non-significant responses.Fig. 3. Representative T^2^circ plots for a full-field DRDC stimulus (1.875 cps reversal rate, Participant S215) based on the same recording as that illustrated in Fig. 1 The blue dots represent the complex-valued Fourier coefficients for each 2133 ms epoch taken at the fundamental frequency of the stimulus, with their coordinates in the complex plane given by the real (x-coordinate) and imaginary (y-coordinate) parts (in relative units). The data points can also be interpreted as the tips of vectors whose lengths and angles represent the amplitude and phase of the response, respectively. A channel is considered to have a significant response if the 99% confidence interval (red circle) of the mean vector (red line) does not encompass the origin. Channels with significant responses are highlighted with a green background, while those without significant responses are shown with a red background
T^2^circ values (as calculated by Eq. 1) are used to express signal-to-noise ratio of the complex-valued Fourier component vectors \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{z}}_{\mathrm{j}}$$\end{document} at the frequency of interest:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{T}}_{{{\mathrm{circ}}}}^{2} = { }\left( {{\text{M }} - { }1} \right){ }\frac{{\left| {\overline{z}} \right|^{2} }}{{\mathop \sum \nolimits_{j} { }\left| {{\mathrm{z}}_{{\mathrm{j}}} - { }\overline{z}} \right|^{2} }}$$\end{document}where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathrm{M}$$\end{document} is the number of independent estimates of the Fourier component, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\overline{z }$$\end{document} is the empirical mean of the Fourier component estimates. The two-sample version of the T^2^circ test was used to compare signals evoked by cyclopean and control stimuli [14].
Summary statistics for real valued data are reported as arithmetic means ± standard deviation unless stated otherwise.
The main outcome measure, stimulus effectiveness, was defined as the proportion of participants who exhibited a significant ssVEP response—at either the fundamental (f1) or second harmonic (f2) frequency—on any of the eight selected posterior channels. To further assess stimulus performance, we performed a receiver operating characteristic (ROC) analysis for each of the six conditions. Using the lowest T^2^circ p value across the eight channels as a continuous predictor to discriminate between binocular (negative case) and monocular viewing (positive case) within subjects, we calculated the area under the curve (AUC), along with the optimal classification threshold, sensitivity, and specificity. To verify that our sample size was adequate for ROC analysis, we performed power analysis (MedCalc v23.2.6), assuming an expected AUC of 0.8, α = 0.05, β = 0.20 (80% power), a null hypothesis AUC of 0.5, and a 2:1 ratio of negative to positive cases. This analysis yielded a minimum requirement of 10 positive and 20 negative cases, which our dataset exceeded.
Results
Each of the 22 examined participants fulfilled the following inclusion criteria: 1) successful completion of the TNO stereo test at the 120″ threshold indicating good to excellent stereo vision. 2) The average peak time of the binocularly recorded non-cyclopean checkerboard VEP was 99.7 ms (± 7.5 SD) and 104.6 ms (± 6.3 SD) for large and small check width, respectively. For monocular recordings (left and right), the average peak times on the Oz electrode were 102.3 ms (± 7.5 SD) and 104.7 ms (± 6.2 SD), respectively. The peak times of all participants were within the 90% confidence intervals reported by Thompson et colleagues [27] for the respective check sizes.
Derivation of the effectiveness measure for ssVEP responses evoked by cyclopean stimuli
The primary tool in assessing ssVEP responses was the T^2^circ test.
Table 1 summarizes the statistics for the six cyclopean stimuli under scrutiny. The p-values of the significant responses were well below the 0.01 threshold, confirming clear responses in most cases for all stimuli examined. The T^2^circ value (Eq. 1) represents signal-to-noise ratio of the responses at the frequency in question; a higher T^2^circ value indicates greater deviation from the null hypothesis, suggesting a stimulus-related response.
The availability of multiple recording channels and the full frequency spectrum of each recording provide us with several options for assessing stereo vision. To consider the frequency component,
Table 1 shows the number of significant recordings for both the fundamental frequency of the stimulus (f1) and the second harmonic frequency (f2). Consistent with the cyclic alternation of correlated and anticorrelated phases of the DRDC stimulus, f1 responses dominated the recordings, although a considerable number of recordings showed significant f2 responses. In certain participants, there were in fact, more channels showing significant f2 than f1 responses (see f1 or f2 labels in Fig. 4). Conversely, the two phases of the DRDS checkerboard stimulus were equal in terms of all spatially averaged stimulus parameters, and only local reversals of disparity could be perceived. Accordingly, the second harmonic frequency dominated the responses, although in certain cases, the f1 response was also significant (Table 1) or even dominant (see f1 or f2 labels in SI Figs. S3, S4, S5). Therefore, we decided to take both f1 and f2 frequency components into consideration, when assessing the effectiveness of stimuli.Fig. 4. Summary of the participants’ responses evoked by the 1.875 cps DRDC stimulus. Each recording is represented by dots in a head model. The area of the green dots is proportional to the T^2^circ value (see Materials and Methods) for the given recording at specific electrode positions, if the test result for that channel was significant. The dot representing the highest T^2^circ value of the participant has the same normalized area on each head model. The labels (f1 or f2) indicate the harmonic at which more channels showed significance for the given recording. Red dots show electrode positions that were not significant in the given recording. While red x marks indicate electrodes that delivered noisy, uncertain signals as judged by visual inspection. Labels Sxxx are participant identifiers. The results for other test stimuli can be found in Figs. S1, S2, S3, S4 and S5 of the SI
To understand the variability of responses by channel location, one may find the representation of Fig. 4 helpful. The figure illustrates the significance of responses at f1 and f2 frequencies, as well as the relative T^2^circ values for all participants, with the channels arranged on head models. For the 1.875 cps DRDC illustrated there, significant responses in individual participants were predominantly observed in the O1, Oz, and O2 channels. However, responses were also frequently significant on one or more of the P7, P3, Pz, P4, and P8 channels. Fp1 and Fp2 channels were not considered here, as they only served for the detection of gross eye movements.
Similar head model images for the remaining five stimuli are provided in SI Figs. S1, S2, S3, S4 and S5. They not only show similar variability in the dominant channel location but also illustrate cases where significant stereo responses appeared on the parietal channels but not on the occipital ones.
The upshot of these observations is that the dominant frequency and dominant location of stereo responses varies by individual as well as by the exact set of stimulus parameters. To address the resulting challenge of comparing the suitability of stereo stimuli for the assessment of stereo vision, we derived the metric of stimulus effectiveness based on the ability to elicit a significant response on any of channels O1, Oz, O2, P7, P3, Pz, P4 and P8 at the fundamental or first harmonic frequencies. Accordingly, we calculated the ratio of participants showing significant responses on any of those eight channels at f1 or f2 over the total number of participants (Table 1, column 11).
Comparing the effectiveness and response patterns of cyclopean stimuli
Based on the metric derived above, the most effective stimulus was the 1.875 cps DRDC, which evoked a significant response in 100% of participants. The second most effective stimulus was the 3.75 cps DRDS, with a 95% effectiveness in our stereo normal participants. In contrast, the slower, 0.93 cps stimuli performed less well, with effectiveness values of 71% for DRDC and 52% for DRDS.
To model the results expected when using a standard clinical VEP setup, which typically uses only three to five electrodes, we also assessed stimulus effectiveness using only three occipital electrodes (see SI Table 2). The 1.875 cps DRDC stimulus remained highly effective, eliciting significant responses in all cases. However, the effectiveness of the 1.875 cps and 3.75 cps DRDS stimuli declined considerably in this setting, achieving approx. 70% effectiveness. Stimuli under 1 cps are generally less likely (62% and 24% for DRDC and DRDS, respectively) to evoke a VEP that can be detected by T^2^circ test, so we do not recommend low frequencies in this setting.
Regarding the number [1–8] and the distribution of the significant responses over the different channels, a stimulus-specific pattern could be observed. For the 1.875 cps DRDC stimulus (Fig. 4), all 22 participants showed a significant response at Oz, O1 and O2, including 7 who showed significance in each of the 8 electrodes, although with typically lower T^2^circ values at parietal electrodes. Every participant exhibited a significant response on at least one of the occipital electrodes. It can generally be stated that DRDS stimuli (Figs. 4, S3, S4) often elicited significant responses in the parietal electrode row, or only there, indicating a parietal involvement. For example, in the case of the most effective 3.75 cps DRDS (SI Fig. S5), out of the 21 individuals who gave significant responses, there were 6 who had significant responses in the parietal channels only. Finally, some of our participants exhibited slightly lateralized responses, sometimes to the point that the central locations Oz and Pz were non-significant (Figs. 4, S1, S2, S3, S4, S5).
As hinted above already, the frequency composition of the responses was also fundamentally influenced by the stimulus type. For DRDC, f1 responses were characteristic (Fig. 1), while additional f2 responses were only present in approx. 50% of the cases (Table 1). In the case of DRDSs, almost only f2 responses occurred (Fig. 2, Table 1). Significant f1-responses were rarely present, and there were only two participants who gave exclusively f1-resonse (Figs. S3, S5).
Control measurements
For each measurement with cyclopean stimuli, control measurements were designed to judge the possibilities of false negative or false positive results. Based on the combinations of stimulus type (DRDC or DRDS) and the control condition (either monocular viewing or viewing without goggles), four kinds of controls were possible. Each control recording was evaluated by T^2^circ tests for both the f1 and f2 frequencies. Therefore, the total number of cases reported below is always twice the number of channels analyzed.
- Viewing DRDC stimuli without glasses (Fig. 1) produces a perceptible exchange of color and potentially mean luminance, which might cause a significant ssVEP. This is important to be kept in mind, because a similar situation would occur if the observer’s visual field is not completely covered by the filter goggles, potentially resulting in a false positive judgement of their stereo vision. Indeed, among the 320 T^2^circ tests, 173 (54%) showed a significant response in these control measurements.
- When stimulus colors are well-calibrated, viewing DRDC stimuli monocularly (through either the red or green filter) should be perceived as random dynamic noise without any change between the correlated or anticorrelated phases, and thus, no significant VEP response is expected. A similar situation would occur if an amblyopic person viewed the stimuli, or if one recorded from a channel without stereo response, even if an individual has intact stereo vision. Among 720 T^2^circ tests with this kind of control, only 5 yielded a significant (false positive) response (0.69%). However, in four of these five cases, ssVEP responses could be differentiated from the binocular responses by using the 2-sample T^2^circ test [14]. In only one case (0.14%) the test was unable to distinguish the binocular response from the control. A representative false positive case is seen on channel O1 in Fig. 5.
- Viewing DRDS stimuli without glasses results in no depth reversal, nor should a perceptible change in color or mean luminance occur (Fig. 2); therefore again, no significant ssVEP response was expected. Out of 368 T^2^circ tests, only 14 showed a significant response in the control measurement. In 9 cases (2.45%), the two-sample test could not differentiate between the response to the stereo recording and the control.
- Similarly, viewing DRDS stimuli monocularly (through either the red or green filter) results in no perceptible change in depth. Out of 704 T^2^circ tests, 9 (1.28%) showed a significant response in the control measurement. In a single case (0.14%), the two-sample test failed to differentiate between the stereo and control responses. Fig. 5. Representative T^2^circ plots for the f1 frequency response to a full-field DRDC stimulus (3.75 cps reversal rate, Participant S206) and their monocular control (through the left eye). Fourier components of the epochs, mean vectors and confidence intervals are plotted in the same way as in Fig. 3, except that blue and red colors represent the binocular and control measurements, respectively. Every channel except Fp1 and Fp2 gave a significant binocular response (the blue confidence intervals do not encompass the origin). On channel O1, also the control measurement resulted in a spurious significant response (the origin is outside the red confidence interval), but the two-sample T^2^circ test proved a significantly different response to the binocular stimulus. For each channel, green background signifies that the latter test proved significant difference from the control, whereas red background indicates that the two measurements did not differ significantly. Sometimes, small but significant binocular responses such as those on P7, Pz and P8, could not be differentiated from the control by the two-sample test
We performed receiver operating characteristic (ROC) analysis to evaluate the sensitivity–specificity trade-offs associated with our ultimate goal of detecting stereopsis (Table 2). DRDC 1.875 cps and DRDS 3.75 cps yielded the highest AUCs (0.962 and 0.991, respectively), with optimal thresholds aligning closely with our predefined 0.01 cutoff for p-values. While DRDC stimuli generally achieved high sensitivity, they also showed increased false positive rates in the monocular control condition (up to 54%), especially at lower frequencies. In contrast, DRDS stimuli, particularly at 3.75 cps, demonstrated excellent specificity (95.5%) and more robust performance under control conditions. These findings highlight different trade-offs between sensitivity and specificity across stimulus types.
Discussion
Previous electrophysiological studies utilizing random dot correlograms and stereograms in humans
Random dot correlograms and stereograms (DRDC and DRDS) isolate the stereoscopic visual mechanism: they only evoke a response if stereopsis is functional. Their effectiveness does, however, depend on many parameters.
There are only a few studies that systematically examine the applicability of DRDC and DRDS stimuli in humans. In Table 3, we summarize key test parameters from notable publications. Béla Julesz, who first described the application of DRDCs in human VEP studies [2], reported that DRDCs offer several advantages over DRDSs, including simpler stimulus generation and a greater absolute magnitude of evoked responses. Additionally, DRDC functions effectively at higher stimulus frequencies (12 rps). They also highlighted its insensitivity to head tilt, the necessity of a large stimulus size, and the differences in response periodicity between DRDCs and DRDSs. Table 3. Stimulus parameters utilized in previous DRDC and/or DRDS VEP studies reported in the literatureAuthor, YearJulesz et al. [2]Petrig et al. [6]Bergua et al. [18]Skrandies [17]Markó et al. [19]Participant healthy/impaired stereo vision4/017/022/2220/016/0Stimulus (DRDC, DRDS)RDC/RDSDRDC/DRDSDRDSDRDSDRDCChannel separationAnaglyphicAnaglyphicShutterAnaglyphicAnaglyphicGogglesKodak Wratten red and green filtersKodak Wratten colour filters (Nos. 55 and 26)Electric liquid crystal shutter glassesRed/green gogglesR26 low-pass and YG09 band-pass gelatineStimulus patternDRDC full-field correlation reversal, DRDS checkerboard depth onset-offsetDRDC full-field correlation reversal, DRDS checkerboard depth onset-offsetCheckerboard depth onset-offsetCheckerboard depth alternationFull-field correlation reversalField size (°)38.6 × 30.282 × 6828.4 × 21.917.1 × 13.440 × 30Viewing distance [mm]2500100075015001000Check size [arcmin]330 × 330280 × 280132 × 16490Not applicableDot size [arcmin]13.8 × 6.920 × 181.38 × 1.843.5′7.5Dot density [%]1001000.4No data100Stimulus rate [cps]1 and 32.5 cps141.875Duration of phases [ms]500/500 and 166.7/166.7200/200500/500125/125266.7/266.7Monitor typeAdvent VideoBeam 1000 A projection systemAdvent VideoBeam 1000 A projection systemtwo Eizo F77 CRT monitorsHitachi CRT monitorSamsung SyncMaster 957 MB CRT monitorMonitor refresh rate (Hz)60 interlaced501204860Mean luminance [cd/m^2^]No data8.255.5202,89 ± 0,18Contrast [%]Maximum black and white82%No dataNo data5.5–80 in 10 stepsDisparity [arcmin]27.6’ far40’ far9, 13, 22, 35, 407, 10.5, 14, 17.5, 21, 24.5’Not applicable No. active electrodes11171Active electrode position2 cm above the inion2 cm above the inion5 cm above inionOz, O1, O2, P7, P8, PO7, PO8OzReference electrode positionHalfway between vertex and nasion30 percent nasion-inion distanceRight earFzFzGnd electrode positionForeheadVertexLeft earPzCzEpoch duration [ms]1000800100010002133Bandpass filter [Hz]0.1 and 1000.5 and 400.5 and 70 Hz0.1 and 70 Hz0.5 and 250 HzPresentation duration (s)12 sessions * 75 s40No data30070Minimum No. of epochs755050No data10Arcmin minutes of arc, cd/m^2^ candela per square meter, CRT cathode ray tube; DRDC dynamic random dot correlograms, DRDS dynamic random dot stereograms, cps cycles per secundum, Hz Hertz, ms millisecond, No. number of, VEP visual evoked potentials
In their study on infants, Petrig et al. [6] applied DRDC full-field correlation reversal and DRDS checkerboard depth onset-offset stimuli, reporting that both evoked responses with comparable amplitudes in early development. Braddick [28] found that the mean age for the first detectable response was correlated between the two tests, with no significant difference (correlogram: 70 days; stereogram: 75 days). This aligns closely with the age at which the first signs of stereopsis are observed in psychophysical tests [8].
Skrandies' seven-channel study demonstrated responses to DRDS checkerboard reversal at five temporo-occipital channels, specifically at the first harmonic frequency. Psychophysically estimated stereo thresholds were lower than electrophysiologically determined thresholds. Furthermore, they showed that strabismic—but not stereo blind—patients exhibited higher-than-normal thresholds as determined by DRDS-VEP [11]. Additionally, they found the largest response amplitude approximately 4.25 cm (± 1.6 cm) from the inion [3].
Bergua et al. observed no significant differences in VEP peak times in response to DRDS onset-offset stimuli with disparities between 9.0′ and 40′ in healthy individuals. However, in patients with glaucoma, peak times were significantly delayed [18].
In two previous studies, we examined the effects of luminance and contrast changes on DRDC responses. We found that response amplitudes remained stable across a broad range of contrast (5.5–80%) and luminance (0.06–5.5 cd/m^2^). However, significant reductions in response amplitude were observed with decreases in luminance, but none with decreasing contrast [19, 20]. These findings support the magnocellular origin of DRDC responses.
Regarding the specific stimulus parameters, the studies listed in Table 3 are in broad agreement, but they typically derive from exploration that is not systematically described in the papers. Together with our pilot studies (see SI Table 1), we explored a large range of various parameters with the specific aim of optimizing stereo VEPs. Our current data suggests that the most likely stimulus to evoke an ssVEP response is the full-field (spatially homogeneous) DRDC where dot correlation is modulated at around 1.875 cps (Table 1). Our DRDS stimulus at 3.75 cps produced only slightly fewer significant responses, but correlograms were in general, superior to stereograms. DRDC stimuli were also superior in that a stable response was obtained when using only the three occipital electrodes (SI Table 2). As discussed below, the location of EEG channels also turned out to be highly relevant to optimizing the electrophysiological detection of stereoscopic function.
The source of correlogram and stereogram responses
Our findings are intriguing because the spatio-temporal patterns encoded in DRDCs are perceptually less salient than disparity defined patterns of the DRDSs [2]. Despite the higher subjective salience of stereograms, they produced less reliable ssVEP responses than correlograms. These differences likely reflect distinct neural pathways. It has been suggested that the processing of stereo disparity [29, 30] as well as the processing of DRDCs [19] rely on the activity of the magnocellular visual pathway, which dominates the activity of the dorsal stream of extrastriate visual cortex. Work by Cummings et al. [5] has pointed out differences between cortical regions in the processing of disparity by employing anti-correlated stereograms. In low-level visual areas V1 and V2, disparity tuned neurons give responses to both correlated and anti-correlated stereograms, although their disparity tuning is opposite. Binocular neurons of the dorsal stream [31] also respond to anticorrelated stereograms. However, this is in contrast with conscious experience, because disparity in anti-correlated stereograms cannot be perceived. The perception of observers is better reflected in the activity of extrastriate areas of the ventral stream, which are less responsive or not responsive at all to anti-correlated stereograms. Thus, the stronger DRDC than DRDS responses in the VEP may be explained by the presence of large neuronal populations modulated by changes in binocular correlation in occipital and dorsal visual cortical regions.
Our data also point to differences in spatial localization. Even though stereo stimuli primarily evoked responses at O1, Oz, and O2 electrode locations, we regularly found significant stereo responses on parietal channels P7, P3, P4, P8, and Pz. In the case of DRDS, these channels were more prominent, or in some participants, the only source of significant stereo response (SI Figs. S3, S4, S5). To objectively assess stereopsis by VEP, it seems therefore prudent to record activity from parietal as well as occipital channels.
In our adult participants, the DRDC stimulus typically elicited a fundamental frequency (f1) response, whereas the DRDS stimulus evoked a predominantly second harmonic (f2) response. Often, significant responses on both harmonics occurred. This difference is well explained by the nature of the stimuli: a full-field DRDC involves the alternation of correlated and anticorrelated dot patterns, where the two phases drive opposite responses [2, 20]. In contrast, the two phases of a checkerboard reversal stimulus (whether luminance or disparity defined), are in terms of spatial average, equal. The pronounced f2 response is driven by the local changes, which occur twice during the cycle, and the two reversals are equal in terms of the net direction of change, which is 50% near-to-far and 50% far-to-near every time.
In summary, EEG responses to cyclopean stimuli vary depending on stimulus attributes, spatial location, and response frequency, and these effects may also differ between individuals. Our current focus was to describe this complexity and to provide a method for identifying cyclopean responses in the face of such variability. Nevertheless, further analysis using more complex statistical models that account for the relationships and interactions among these variables may help uncover the underlying principles governing the spatial and spectral distribution of cyclopean VEPs in the future.
Considerations for standardizing stereopsis assessment by ssVEP
VEPs have been used routinely in the diagnosis of visual pathway disturbances in humans [24]. Recently, a standard protocol employing periodic stimuli and steady-state potentials has been devised to estimate visual acuity [16]. Our findings provide a foundation for optimizing stereopsis assessments in both research and clinical applications. A key challenge to address is the individual variability in localization and frequency composition of the evoked response, which exist even in a population with intact stereopsis. We suggest that standard VEP procedures need to be amended in two ways for this purpose. First, it is necessary to consider activity from occipital as well as parietal channels. Second, both fundamental and second harmonic responses must be considered.
We used CRT monitors for stimulus display primarily because they do not exhibit luminance transients during changes in pixel intensity, and they maintain stable color and luminance across a wide range of viewing angles. These characteristics comply with current ISCEV requirements for stimulus devices [24] and were instrumental in ensuring that, after proper calibration, our DRDC and DRDS stimuli were free of monocular cues [25]. We chose the anaglyph technique for achieving dichoptic separation because of its simplicity and low cost, acknowledging that perfect channel separation is more challenging to achieve than with Wheatstone stereoscopy. In our setup, this limitation was addressed by rigorous calibration of the monitor’s colors based on empirical photometric characterization of our CRT + filters system, applying the same methodological principles described by Radó et al. [25]. While Radó et al. used an LED display with red–green glasses, their calibration approach is not display-specific and can be readily adapted to any filter-based three-dimensional display once the spectral and luminance characteristics of the actual setup are measured. Following calibration, both DRDC and DRDS stimuli were tested for residual monocular cues. Our control measurements confirmed that DRDC stimuli, although more effective at eliciting responses than DRDS, are also more sensitive to imperfect calibration, which may produce false positive VEP responses [25]. These occasional monocular false positives differed statistically (in amplitude and phase) from genuine binocular responses (Fig. 5). Anaglyphic DRDCs can also evoke significant binocular ssVEP responses when the goggles are removed (Figs. 1, 5, also see [2]), or when parts of the stimulus remain visible around the goggles. These findings highlight the need for stringent calibration and physical stimulus occlusion when using DRDCs. By contrast, the DRDS stimuli we used were less susceptible to spurious monocular responses caused by imperfect color calibration or removal of the filter goggles. However, detecting a stereo response to DRDSs likely requires recordings from at least six electrodes, as these stimuli predominantly activate parietal cortical areas.
Control measurements without goggles, as well as monocular control conditions, are essential for verifying the reliability of the stimulation and recording setup at the outset. Such controls should also be repeated regularly to ensure that calibration remains uncompromised over time, for example, due to aging of the stimulus screen or degradation of the filter glasses.
Our ROC analysis underscores the importance of balancing sensitivity and specificity when selecting stereoscopic stimuli for clinical or research use. DRDC stimuli -though highly sensitive- were more susceptible to false positives under monocular viewing, likely due to luminance artefacts or incomplete channel separation. DRDS stimuli, especially at 3.75 cps, offered greater specificity and robustness, making them more suitable for clinical environments where calibration is limited. Based on these findings, a two-stage testing approach could be considered: initial screening with DRDS, followed by DRDC confirmation under more controlled conditions.
Attentional fatigue is a potential factor that can both deteriorate and add further complexity to ssVEP recordings [32]. Although rest periods were provided based on signs of attentional decline in the current study, their duration and frequency were not formally monitored or correlated with signal quality. Therefore, residual differences in attention or engagement may still have contributed to inter-individual variability in ssVEP responses. Recent theoretical studies [33, 34] have proposed an analytical framework with the potential to identify systematic departures from the circular symmetry of ssVEP Fourier components [35], which may arise due to fluctuating attention during recording. This approach could be incorporated into future protocols for monitoring participants' attention, complementing conventional EEG-based fatigue markers such as increased alpha-band power.
The development of stereopsis is an experience dependent process [7, 36, 37] extending over years in early childhood [6, 8, 28]. Age-related changes in ssVEP responses have been found in our previous studies [7, 21] showing that DRDC stimuli in young children elicit dominantly f2 harmonic responses with an age-dependent phase change, reflecting latency decreases. After the age of 6 months, an increasing f1 component appears in the responses. Further research is required to optimize stimuli, recording configuration and analysis methods beyond the relatively homogeneous, healthy participant population of the present study.
The main limitation of the study is that, due to the use of multiple electrode sites and the analysis of different harmonic components across six distinct stimuli, it was not possible to define a single output variable for a predefined statistical test. Consequently, a custom metric (effectiveness) was developed to compare stimulus performance, and the specific statistical power could not be calculated. However, we believe that in future translational studies, ROC analysis may serve as an appropriate statistical approach to distinguish individuals with impaired versus intact binocular vision. The current sample size, which exceeded the minimum required to detect an AUC of 0.8 with 80% statistical power, may provide a valid reference for future investigations involving clinical populations with reduced stereopsis, where comparisons against a control group will remain essential.
Supplementary Information
Below is the link to the electronic supplementary material.Supplementary file1 (DOCX 425 KB)Supplementary file2 (AVI 4260 KB)Supplementary file3 (AVI 14774 KB)Supplementary file1 (TXT 7 KB)
