Stimuli are perceived as lasting longer when there is something bright on the screen
Hakan Karsilar, Sebastiaan Mathôt, Hedderik van Rijn

TL;DR
Bright stimuli and backgrounds make people perceive time as passing more slowly, while pupil size does not affect time perception.
Contribution
The novel finding is that background brightness affects perceived duration, suggesting low-level visual input influences time perception.
Findings
Increased brightness of a stimulus or background lengthens perceived duration.
Pupil size does not correlate with perceived time, contradicting arousal-based assumptions.
Abstract
Perceived time often diverges from physical time. This discrepancy is important given the crucial role of time perception in numerous cognitive processes. A critical question concerning the non-veridicality of timing is whether and how different physical attributes (e.g., size, speed, and numerosity) influence perceived duration. The present study deals specifically with how perceived time depends on stimulus brightness, both of a to-be-timed stimulus and the background on which this stimulus is presented. The results of two experiments show that increased brightness lengthens perceived duration, and, surprisingly, that this is the case both for the stimulus and the background. The finding that stimulus brightness affects time perception is a much needed replication of classic studies; however, the finding that background brightness similarly affects time perception is novel, and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —"Interval Timing in the Real World: A functional, computational and neuroscience approach" financed by the Dutch Research Council (NWO)
- —Innovational Research Incentives Scheme VIDI from the Dutch Research Council (NWO)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Music Perception · Multisensory perception and integration · Visual perception and processing mechanisms
Introduction
Our ability to perceive time underlies many different cognitive processes, such as attention, working memory, language, and decision-making (Allman et al., 2014; Buhusi & Meck, 2005; Wittmann, 2013). The capacity to use time-based information has been deemed a “sense” in its own right. Yet time perception is fundamentally unlike other senses, such as vision or hearing, as it lacks a dedicated sensory organ (Ivry & Schlerf, 2008; Lewis & Miall, 2003). The question is therefore: If time perception is indeed a unique sense, then what exactly is being sensed, and what are the rules that govern this sensory modality? (Basgol et al., 2021; Grondin, 2010; Merchant et al., 2013)
A significant amount of research into the nature of time perception has focused on instances where it diverges from veridicality (i.e., when perceived time differs from real time; Grondin, 2010; Matthews & Meck, 2016; Pariyadath & Eagleman, 2007; Ulrich et al., 2006). Humans often overestimate durations of larger objects, faster movements, higher frequencies, louder sounds, higher pitches, and larger quantities (Berglund et al., 1969; Brown, 1995; Droit-Volet, 2013; Eagleman & Pariyadath, 2009; Herbst et al., 2013; Karsilar et al., 2016, 2019; Karsilar et al., 2018; Matthews & Meck, 2016; Ono & Kawahara, 2007; Penton-Voak et al., 1996; Rammsayer & Lima, 1991; Rammsayer & Verner, 2016; Thomas & Cantor, 1976; Xuan et al., 2007). Neurophysiological research has corroborated these discoveries by demonstrating that the brain’s processing of time shares common pathways in the parietal and frontal regions with processing of quantity and size (Bueti & Walsh, 2009; Cai & Connell, 2015; Cantlon et al., 2009; Casasanto & Boroditsky, 2008; Walsh, 2003; Wittmann et al. 2010). Collectively, these studies suggest that perception of time is mostly indirect, derived from physical and statistical features in the environment.
It is challenging to explain the connections between time perception and nontemporal stimulus properties, such as brightness, which is the focus of the present study. However, information-processing models of timing have been notably effective in addressing this issue (Creelman, 1962; Gibbon et al., 1984; Treisman, 1963). Such models hypothesize an internal clock that produces time-representative pulses, where the integration of these pulses code for an interval (Church, 1984; Gibbon, 1977; Malapani & Fairhurst, 2002; Treisman et al., 1990; van Rijn et al., 2014; Wearden, 1999; Wearden, 2003). This clock can run faster or slower depending on the properties of sensory input as well as internal processes, such as arousal (Allan, 1979; Wearden, 2008), attention (Lejeune & Wearden, 2009), and other nontemporal task demands (Buhusi & Meck, 2005; Fortin & Rousseau, 1987, 1998; Wearden, 1991; Zakay & Block, 1997; see also Coull et al., 2011; Meck, 2006). For instance, heightened arousal in response to an intense stimulus, such as a scary photo, is thought to accelerate the production of time-representative pulses, thus resulting in a longer perceived duration (Angrilli et al., 1997; Droit-Volet & Meck, 2007; Gil & Droit-Volet, 2012; Mella et al., 2011; Nather & Bueno, 2011; Sackett et al., 2010; Smith et al., 2011). Conversely, shifting focus away from a task’s temporal aspects is thought to decelerate the production of pulses, thus resulting in a shorter perceived duration (Block & Gruber, 2014; Coull et al., 2004; Fortin, 2003; Tse et al., 2004; Zakay, 1998; Zakay & Block, 1996, 1997). It is important to note that the “internal clock” model of timing has also been criticized for its high degree of flexibility, to the point of being unfalsifiable (Staddon & Higa, 1999). In fact, it is difficult to create a scenario where an effect of nontemporal stimulus properties on temporal behavior would not be explained by a combination of arousal and attention-related processes, both of which are general concepts encompassing a wide range of psychological phenomena. Additionally, the idea of an internal clock which accumulates “time-representative pulses” akin to a stop-watch lacks biological plausibility (Salet et al., 2022; see also Bi & Zhou, 2020; Goel & Buonomano, 2014; Raphan et al., 2019 for alternative modeling approaches). Consequently, although the internal-clock model provides a convenient framework through which stimulus properties can be said to affect subjective time, it is more of an abstract account of how timing takes place in natural environments than a realistic one.
Goldstone and Goldfarb (1964a, b) were among the first to demonstrate the interaction between time and other physical dimensions. The researchers presented two consecutive durations using a red-light-emitting diode and found that the participants tended to perceive a duration as longer when the comparison stimulus was brighter than the standard (see also Goldstone et al., 1978). Two decades later, Brigner (1986) replicated these findings by presenting visual stimuli with different light intensities and durations, this time including flickering and nonflickering stimuli. Brigner’s findings revealed that increasing the light intensity caused participants to perceive time intervals as longer. It is important to note that these pioneering studies exploring the relationship between brightness and time perception have, surprisingly, not been replicated or expanded upon using contemporary methodologies. Particularly, it remains unclear whether it is the brightness of the timed stimulus that causes the overestimation of time, or whether it is the overall brightness of visual input—which includes both the timed stimulus and its visual environment—that matters (Matthews, 2011). This distinction is critical, because it has crucial implications for the role of low-level vision in time perception.
Following the theoretical framework of information-processing theory of timing (and see Salet et al., 2022), the clock pulse rate is not only determined by low-level stimulus features, such as brightness, but also by higher-level cognitive and affective processes. Notably, increased arousal is widely thought to accelerate the pulse rate, thus resulting in longer perceived durations (Gil & Droit-Volet, 2012; Lake et al., 2016; Mella et al., 2011; Nather et al., 2011; Van Volkinburg & Balsam, 2014). A key finding in support of this was reported by Suzuki et al. (2016), who measured pupil size in primates, which is known to be an index of arousal (Beatty & Lucero-Wagoner, 2000; Bradley et al., 2008; Mathôt, 2018; Partala & Surakka, 2003). The primates were trained to perform a saccadic eye movement after a specific interval had elapsed. Crucially, the authors found that primates made faster saccades when their pupils were large, suggesting that high (pupil-linked) arousal indeed resulted in longer perceived durations. However, a recent study by Warda et al. (2022) arrived at a different conclusion. Here, the authors measured pupil size while human participants performed a temporal bisection task in which participants reported whether a probe stimulus was presented for a long or a short duration (relative to two reference durations). In contrast to the study by Suzuki et al. (2016), the authors did not find that large pupils were associated with longer perceived durations, which in this paradigm would translate into a response bias towards long reports; rather, they found that large pupils were associated with increased response precision, presumably because pupil dilation is associated with increased motivation to perform the task well. Taken together, there is some evidence suggesting that arousal as indexed by pupil size affects time perception (see also Akdoğan et al., 2016; Kinzuka et al., 2021), but the evidence is mixed and thus warrants further scrutiny.
Our study aims to fill the gap in understanding how perceived duration is affected by both low-level visual features (notably brightness) and arousal. Previous studies have suggested that increased brightness and increased arousal both result in longer perceived duration; however, evidence is mixed and the specific conditions under which these effects emerge remain to be tested. The inconsistent findings regarding arousal’s impact on time perception as indexed by pupil size also call for further empirical investigation. By testing these influences separately, we aim to contribute to a more nuanced understanding of time perception, challenging and potentially refining existing theoretical models. To that end, we employed a temporal bisection task in two complementary experiments, varying both the brightness of the stimulus (Experiment 1) and the background against which it was timed (Experiment 2). Our results showed that the effect of brightness on time perception is such that increased brightness results in longer perceived durations, and that, crucially, this is the case for both the brightness of the to-be-timed stimulus and the background on which this stimulus is presented. Additionally, in both experiments we tracked spontaneous fluctuations of the pupil size as a measure of arousal. We expected larger pupils to be associated with longer perceived durations. However, against our expectation, our results suggest that spontaneous pupil size is unrelated to perceived time.
Experiment 1
Methods
Participants
Thirty participants (25 women; Mage = 22.9 years) were included in Experiment 1. Participants received a compensation of €8 for their contribution. Participants provided written informed consent prior to the study. All participants had normal or corrected-to-normal vision. No participants were excluded from the analyses. In the absence of an expected effect size, sample size was based on previous similar studies on temporal perception.
Stimuli and apparatus
Stimuli were presented on a 27-in. LCD monitor (1,920 × 1,080 pixels; 60 Hz) using OpenSesame 3.3 (Mathôt & March, 2022; Mathôt et al., 2012) with the PsychoPy (Peirce, 2007) framework for stimulus presentation and PyGaze (Dalmaijer et al., 2014) for eye tracking. An EyeLink 1000 (SR Research) recorded the right eye at 1000 Hz. The flat-screen monitor was placed approximately 60 cm from a chin rest stabilized on the edge of the supporting desk. Participants provided their responses using a wired keyboard while their heads were stabilized on the chin rest for the duration of the experimental session. The to-be-timed stimuli consisted of a filled circle with various levels of brightness (discussed below) briefly presented in the center of the screen (subtending a visual angle of 2.39°), which occluded a white central fixation dot (0.48°). The central area of stimulus presentation was fully black (9.90°), transitioning as a gradient to a gray circular background towards the edge of the screen (≈24°) in order to minimize contrast effects (Karsilar et a., 2024; see Fig. 1B).Fig. 1A Timeline of a single trial in experiment 1. B Sample screen depicting the background gradient in a trial with the annulus surrounding the fixation dot during stimulus presentation. C Four levels of stimulus brightness (top to bottom; low, medium, high, maximum/white)
Procedure
Each session consisted of a variable-brightness block, followed by a shorter constant-brightness block. The presentation duration and the brightness of the timed stimuli were randomly varied in the variable-brightness block, whereas stimulus brightness was fixed in the constant-brightness block. An entire session with the two blocks lasted approximately 50 min. Both blocks started with a 9-point eye-tracking calibration with white targets. Each trial in the session was initiated with a drift correction (single-point recalibration) procedure. Participants were told that the purpose of the eye tracker was to ensure that they always focused on the central fixation dot throughout the experiment.
Temporal bisection procedure
Training
The temporal bisection task is a form of a two-alternative forced-choice task that entails categorizing experienced durations as “short” or “long” based on their subjective similarity to previously learned short and long reference durations (Penney & Cheng, 2018; Wearden & Ferrara, 1995). As such, the task started with the presentation of the two reference durations (200.4 and 801.6 ms) in the form of the aforementioned circle with a constant brightness of 15.8 cd/m^2^, followed by a training session. During training, the two reference durations were presented randomly with equal likelihood. Participants’ pressed the “F” and “J” keys with their index fingers to report a short and a long duration, respectively. Following common recommendations in the literature, participants were instructed not to count or use any other chronometric methods, such as tapping, throughout the experiment. Participants were instructed to only attend to the amount of time that the circle stayed on the screen and decide which of the previously presented durations it matched. The training phase was terminated upon 20 correct responses in the variable-brightness block, and 10 correct responses in the constant-brightness block. Given the high discriminability between the two reference durations, no feedback was given during training. The majority of the participants required fewer than four correction trials.
Test
The test phase was identical to the training phase, except for the presentation of four durations (317.3, 434.2, 567.8, 684.7 ms) in addition to the two reference durations (200.4 and 801.6 ms). Participants were explicitly provided with this information and were asked to decide which of the two reference durations they learned during the training phase was closest to the one being presented on each trial. A typical trial consisted of a response-to-stimulus (RSI) interval between 1,000 ms to 2,000 ms, followed by the probe presentation and a response screen with no explicit deadline. Each response was followed by a fixed 200 ms interval (Fig. 1A). The RSI was randomized in or to preclude the participants from establishing a rhythmic response pattern in the session. No feedback was given. All participants were observed to readily follow the instructions of the test phase. During the test phase of the variable-brightness block, the brightness of the timed stimulus was also manipulated trial-to-trial (four levels; low, medium, high, maximum/white; 5.5, 15.8, 35.6, and 66 cd/m^2^, respectively; see Fig. 1C). We selected the four brightness values such that they would be logarithmically spaced between 5.5 and 66 cd/m^2^ to align with the Weber–Fechner law, which models perception as proportional to the logarithm of stimulus intensity. This approach ensures perceptually meaningful spacing while maintaining a straightforward model for grayscale representation. In order to achieve a comprehensive mapping of duration discriminability and to control for the effect of ensemble statistics and potential carryover effects, we employed a De Bruijn sequence in designing the temporal bisection task. This generated sequence ensured that each of the six probe durations were preceded and followed by every other duration in the set an equal number of times. The length of the shortest De Bruijn sequence of a set of n elements is n^2^. Thus, in order to ensure that each of the four stimulus brightness levels were presented an equal number of times, a single De Bruijn sequence of six probe durations required 144 trials. In order to achieve a better fit to data, each probe duration was presented 12 times by doubling the amount of trials, resulting in a test block of 288 trials in total, which also fit well with the targeted session duration of 50–60 min.
Data processing and analysis
Following preprocessing in Python, data were analyzed using generalized linear mixed-effects models. Analyses were conducted using the “glmer” function in the lme4 package (Bates et al., 2015) in R (R Core Team, 2020). To facilitate interpretation and improve numerical stability, the probe duration variable was mean-centered (but not z-scored) before all analyses. Bayesian and frequentist statistical analyses were performed using JASP software (JASP Team, 2023). To control for response artifacts, trials with reaction times shorter than 100 ms or longer than 5,000 ms were excluded.
Pupil data processing
Preprocessing of EyeLink 1000 data was performed using the eyelinkparser package (Mathôt & Vilotijević, 2023) in Python. Pupil size, originally recorded in arbitrary units, was converted to mm based on a conversion formula previously established for our setup. Missing or invalid data were interpolated if possible and removed otherwise. Invalid trials were defined based on standard pupillometry preprocessing as follows: (1) trials with extreme baseline pupil sizes (z-score > 2 or < − 2) were excluded; (2) trials where more than 50% of the pupil data were missing due to blinks, tracking loss, or artifacts were removed; and (3) trials with extreme pupil-size deviations (more than ± 3 SD from the mean) or unnatural jumps were also excluded (see Mathôt & Vilotijević, 2023).
Choice of analytical approach
Traditional psychophysical approaches to time perception often rely on fitting psychometric functions (e.g., sigmoid or Weibull curves) to response proportions and extracting the point of subjective equality (PSE; Bausenhart et al., 2018). While widely used, these methods are limited by their reliance on predefined functional forms and parametric response distributions, which can introduce biases in PSE estimation (Grondin, 2024). Additionally, aggregating data across trials leads to a loss of trial-level variability and inadequately accounts for both intra- and interindividual differences, making it difficult to handle hierarchical data structures effectively (García-Pérez, 2014a, b; Vidotto et al., 2019; Wang & Reynaud, 2023). Consequently, generalized linear mixed models (GLMM) have been increasingly recommended in time perception research for their ability to improve estimation accuracy and generalizability (García-Pérez, 2014a, b; Stone, 2014). GLMMs offer multiple advantages (V. A. Brown, 2021): They retain all trial-level data, enhancing statistical power and sensitivity; provide a suitable framework for binomial dependent variables, addressing the nonnormality inherent in timing tasks distinguishing “long” versus “short” responses (Wang & Reynaud, 2023); and are robust to missing and unbalanced data (e.g., excluded trials due to badly recorded pupils), reducing biases from trial exclusions common in psychophysical studies (Vidotto et al., 2019). Moreover, GLMMs allow for the direct estimation of experimental effects on temporal decision-making without requiring an intermediate curve-fitting step (García-Pérez, 2014a, b; Wang & Reynaud, 2023). Their hierarchical structure enables the modeling of both within- and between-subject variability, offering a comprehensive analysis of perceptual decision-making. Recent methodological work further supports the use of GLMM in psychophysical research, particularly when trial-level response variability carries meaningful information, such as that observed in the temporal bisection task (García-Pérez, 2014a, b; Wang & Reynaud, 2023).
To address these issues, we employed a GLMM approach to analyzing our data, enabling trial-level analysis while incorporating subject-specific random effects. We fit a logistic regression model predicting the probability of a “long” response as a function of probe duration and other experimental predictors. The model, using a binomial family with a “logit” link function, was optimized via maximum likelihood estimation with the Laplace approximation and the bobyqa optimizer. By including both random intercepts and slopes, GLMMs capture interindividual variability and model the influence of experimental factors, such as brightness or pupil size, on time perception more effectively than traditional methods (Grondin, 2008).
In practice, we compared two variants of this model: a “simple” model with only a random intercept per participant, and a more “complex” one that added random slopes for both probe duration and each predictor in order to provide a more comprehensive picture of the data. The simpler “random-intercepts-only” model accounts for individual differences in overall response bias (i.e., the baseline tendency to respond “long”) but assumes that the effect of probe duration and the predictor (e.g., brightness) is the same for every participant. The more complex model additionally tests whether participants differ in how strongly these factors influence their responses. While such complexity can more accurately capture true interindividual variability, it also carries the risk of overparameterization or convergence problems if the dataset is not sufficiently large or if effects do not meaningfully vary across individuals. Therefore, we include both models in our analyses, and we explicitly report any convergence warnings or boundary fits as they arise.
Lastly, to validate the robustness of our GLMM results, we extracted summary parameters for each model, which were then analyzed by an analysis of variance (ANOVA; both Bayesian and frequentist). The first parameter of interest, PSE, represents the temporal discrimination threshold and is determined statistically as the duration at which a participant is equally likely to give both a “short” and a “long” response. PSEs were computed by fitting psychometric curves to each participant’s data using parameters obtained from the GLMM and identifying the probe duration at which the predicted probability of a “long” response equaled 0.5 for each participant. A lower PSE for a given condition therefore represents a longer perceived duration compared with a higher one, and vice versa. Second, in order to detect any potential difference in stimulus discriminability as a function of a predictor (i.e., brightness or pupil size), we also calculated Weber Ratios (WR). The WR, computed as the ratio of the difference limen (half the distance between the first and the third quartiles: ([p(long) = 0.75 – p(long) = 0.25]/2) to the PSE, provides a measure of perceptual sensitivity to temporal distortions across conditions. A higher WR implies lower temporal sensitivity and precision, ceteris paribus. However, given the methodological advantages of GLMM described above, we consider it the more appropriate analytical approach for elucidating the effect of brightness and pupil size on time perception as employed in this study.
Results
Bright stimuli are perceived as longer
To assess the influence of stimulus brightness on perceived durations, we evaluated a GLMM with fixed effects for probe duration and stimulus brightness, along with a random intercept for individual subjects. The low stimulus brightness condition was set as the reference category. Fixed effects in this intercept-only, or “simple,” model demonstrated a highly significant effect of probe duration (β = 12.604, SE = 0.261, p < 0.001; Fig. 2). With regard to the effect of stimulus brightness, medium brightness (β = 0.277, SE = 0.095), high brightness (β = 0.323, SE = 0.095), and maximum (white) brightness (β = 0.262, SE = 0.096) were all significantly associated with higher log-odds of a long response compared with the low brightness (all p < 0.01). The effect for the intercept was nonsignificant (p > 0.05). To verify the robustness of our findings from the GLMM, we conducted additional repeated ANOVAs on PSE values obtained via the method described above. The results qualitatively replicated our GLMM findings, confirming a highly significant effect of brightness on PSEs, F(1.98, 59.25) = 146,334.34, p < 0.001, ω^2^ₚ = 0.014, BF_10_ = 1.0 (see Fig. 2A).Fig. 2. Average responses for individual observers (dots) and model-predicted probabilities (solid lines) from a generalized linear mixed-effects model with a logistic link function, plotted as a function of probe duration for each level of (A) stimulus brightness and (B) pupil bin in Experiment 1. PSE values are given as insets
As mentioned above, we also tested a more complex model which included random slopes for all fixed effects. The results were similar to the simpler model; fixed effects in the complex model also showed a significant effect for the probe duration (β = 13.417, SE = 0.615, p < 0.001). Medium brightness (β = 0.288, SE = 0.122), high brightness (β = 0.337, SE = 0.147), and maximum brightness (β = 0.265, SE = 0.137) also remained significant predictors (all p values < 0.05). The effect for the intercept was nonsignificant (p > 0.05). Both models were validated by obtaining an identical pattern of results through split-half testing (results not shown). Although the more complex model provided a more accurate fit to data (Akaike information criterion [AIC] = 5797.9, Bayesian information criterion [BIC] = 5924.7) compared with the simpler model (AIC = 5867.9, BIC = 5910.1), the significance and directionality of the fixed effects remained consistent between both models.
Taken together, the results show that bright stimuli are perceived as longer, where qualitatively the biggest difference exists between the lowest level of brightness and all other levels of brightness.
No clear correlation between arousal-related pupil-size changes and perceived duration
To assess the relationship between pupil size and perceived duration, a similar pair of simple and complex GLMMs were tested, with pupil bin as the fixed effect, replacing stimulus brightness in the models detailed above. Here, “pupil bin” was determined as described below to reflect endogenous, or spontaneous, fluctuations in pupil size, which are assumed to reflect fluctuations in arousal. In order to be readily comparable to the model explaining the effect of four stimulus brightness levels above, average pupil sizes during the 900–1,000 ms window of the RSI interval on each trial were binned into four roughly equal-sized categories (small, medium-small, medium-large, large).
Variable-brightness block
The binning operation in the variable-brightness block was performed separately for each level of stimulus brightness in order to avoid conflating the effect of brightness on baseline pupil size with spontaneous fluctuations in pupil size. Crucially, since higher-luminance stimuli can induce a direct (and almost immediate) pupil constriction and thus potentially shift the overall level of pupil size, we wanted to cleanly separate “spontaneous” or endogenous fluctuations in pupil size from any brightness-driven pupil responses. To do so, for each level of stimulus brightness (e.g., low, medium, high, maximum), we created an internal distribution of baseline pupil sizes across trials in that brightness level alone and then binned those baseline pupil sizes into four equally sized categories (small, medium-small, medium-large, large). This approach ensures that a “large” pupil within a low-luminance context is categorized relative to other trials in the low-luminance condition, rather than relative to a high-luminance condition. By separately binning within each brightness level, we avoided conflating any systematic brightness effect on pupil diameter with the spontaneous, trial-to-trial variation in arousal. Once the four bins were assigned within each brightness condition, we pooled all trials—now each labeled with a pupil bin—into one dataset for the main logistic regression. This way, the pupil bin factor reflected true endogenous variability rather than luminance-induced variability. Pupil bins from the four brightness levels were then entered into the models. As with the models with stimulus brightness described above, a simple and a complex model were fitted to data, with the latter allowing for random slopes for the effect of pupil bin and probe duration. The small pupil bin condition was set as the reference category. In the simple model (AIC = 5877.8, BIC = 5920.0), the fixed effects revealed a nonsignificant effect for the intercept (p > 0.05), whereas probe duration demonstrated a significant effect on choice likelihood (β = 12.587, SE = 0.260, p < 0.001). Regarding the effect of pupil size, the Large pupil bin was found to have a significant effect when compared to the reference category (β = 0.195, SE = 0.096, p = 0.0418). However, the other two pupil bins (medium-small and medium-large) did not have a significantly different effect than the reference category (both p values > 0.05). The complex model (AIC = 5877.8, BIC = 5920.0) also yielded a nonsignificant result for the intercept (p > 0.05) and a significant effect for probe duration (β = 13.310, SE = 0.607, p < 0.001). Concerning the effect of pupil size, identical with the simple model, the large pupil bin showed a marginally significant effect compared to the reference category (β = 0.20102, SE = 0.10460, p = 0.0546), while the effects of the other two larger pupil bins were not significant (both p values > 0.05). Notably, the complex model exhibited a boundary fit, which suggests that the model was overparameterized, implying that the additional complexity in this model may not have been warranted by the pupil size data.
Constant-brightness block
As with the variable-brightness block, pupil sizes were categorized into four approximately equal-sized bins (small, medium-small, medium-large, large) for the constant-brightness block. The two model structures (simple and complex) with the small pupil bin condition as the reference were consistent with those reported previously. For the simple model (AIC = 1457.1, BIC = 1491.0), fixed effects revealed that, both the intercept (β = 0.571, SE = 0.238, p = 0.0164), and the probe duration were significant (β = 13.239, SE = 0.562, p < 0.001). With regard to the effect of pupil size, compared to the Small pupil bin (reference category), none of the larger three bins were associated with higher probability of a long response (all p values > 0.05). For the complex model (AIC = 1440.3, BIC = 1542.0), fixed effects were largely consistent with the simple model. The intercept (β = 0.867, SE = 0.274, p = 0.002), and the probe duration remained significant (β = 14.907, SE = 1.104, p < 0.001). As with the variable-brightness block, the complex model in the constant-brightness block also exhibited a boundary (singular) fit.
Taken together, except for a marginally significant difference in one of several contrasts (large vs. small pupils in the variable-brightness block), the results do not show a convincing relationship between pupil size and perceived duration.
No effects of stimulus brightness and pupil size on duration discriminability
Lastly, we tested the effect of stimulus brightness and pupil size on duration discriminability. One-way ANOVAs showed no effect of stimulus brightness (p = 0.826; BF_10_ = 0.063) on WR. Similarly, there was no effect of pupil bin in the variable-brightness block (p = 0.221; BF_10_ = 0.241) or the constant-brightness block (p = 0.634; BF_10_ = 0.086).
Overall, results suggest no effect of stimulus brightness or pupil bin on stimulus discriminability in Experiment 1.
Experiment 2
Experiment 1 demonstrated a robust effect of stimulus brightness on perceived durations, with no clear relationship between pupil size and the likelihood of a “long” response, suggesting that arousal did not, in this context, affect perceived duration. Experiment 2 aimed to test whether this effect is specific to the brightness of the to-be-timed stimulus, or whether it generalizes to the brightness on which the stimulus is presented. As such, the brightness manipulation of the timed stimuli in Experiment 1 was replaced by a brightness manipulation of the background on which the stimuli were presented. All data preprocessing stages, and modeling procedures were identical to Experiment 1.
Methods
Participants
Thirty participants (23 women; Mage = 20.95 years) were included in Experiment 2. Participants received course credit for their contribution. All participants provided written informed consent prior to the study. All had normal or corrected-to-normal vision. No participants were excluded from the analyses. Sample size was matched to Experiment 1.
Stimuli and apparatus
The apparatus was identical to Experiment 1. The to-be-timed stimulus in Experiment 2 consisted of an annulus briefly presented in the center of the screen (Rinner = 1.62°, Router = 2.39°), which surrounded a white central fixation dot (Fig. 3B). As with Experiment 1, the central area of the stimulus presentation was fully black, transitioning as a gradient to a gray background towards the edge of the screen.Fig. 3A Timeline of a single trial in Experiment 2. B Sample screen depicting the background gradient in a trial with the annulus surrounding the fixation dot during stimulus presentation. C Three levels of background brightness (top to bottom; low, medium, high brightness)
Procedure
The procedure was identical to Experiment 1, except that we now manipulated the brightness of the background, rather than that of the probe (Fig. 3C). Crucially, the color of the timed stimulus was white, and was constant throughout the session (Fig. 3A). A session with the two blocks lasted approximately 45 min. All remaining details pertaining to the eye tracking procedure were identical to Experiment 1.
Temporal bisection procedure
The parameters and the task demands of training and test phases of the temporal bisection procedure were identical to those in Experiment 1. The annulus detailed above was used as the training and the test stimulus. As with Experiment 1, the majority of the participants required less than four correction trials during training. The brightness of the background during the test phase was manipulated trial-to-trial by varying values in the red–green–blue color space (three levels; low, medium, high; 5.5, 15.8, 35.6 cd/m^2^). A white background was not added due to the expected intensity of the effect. As with Experiment 1, we employed a De Bruijn sequence in designing the temporal bisection task, resulting in a test block of 216 trials in total.
Results
Stimuli on a bright background are perceived as longer
The analysis procedure was identical to that of Experiment 1. The fixed effects in the simple model displayed a non-significant trend for the intercept (p > 0.05), whereas the probe duration showed a significant effect (β = 12.737, SE = 0.302, p < 0.001; Fig. 4). Compared with the low brightness condition, both medium brightness (β = 0.327, SE = 0.096) and high brightness (β = 0.557, SE = 0.096) increased the likelihood of a prolonged response (both p values < 0.001). In the complex model, similarly the intercept was nonsignificant (p > 0.05), but the probe duration’s influence was evident (β = 13.627, SE = 0.676, p < 0.001). The associations of medium (β = 0.318, SE = 0.130) and high brightness (β = 0.560, SE = 0.111) remained robust when juxtaposed with the low brightness baseline (both p values < 0.05). Comparative analysis based on AIC and BIC values showed the complex model to be a more precise representation of the data (AIC = 4334.1, BIC = 4421.9) than the simple model (AIC = 4391.9, BIC = 4425.7), although the significance and general direction of the fixed effects were consistent across both modeling approaches. To verify the robustness of our findings from the GLMM, we conducted additional repeated ANOVAs on PSE values. As with Experiment 1, the results qualitatively replicated our GLMM findings, confirming a highly significant effect of brightness on PSEs, F(2, 58) = 704,046.34, p < 0.001, ω^2^ₚ = 0.077, BF_10_ = 1.0 (see Fig. 4A).Fig. 4. Average observed responses for individual observers (dots) and model-predicted probabilities (solid lines) from a generalized linear mixed-effects model with a logistic link function, plotted as a function of probe duration for each level of (A) background brightness and (B) pupil bin in Experiment 2. PSE values are given as insets
Taken together, the results show that stimuli are perceived as longer when they are presented on a bright background.
No clear correlation between arousal-related pupil-size changes and perceived duration
To test the relationship between spontaneous pupil size and perceived duration, we conducted the same analysis as we did for Experiment 1, except that we created three pupil bins (instead of four) to match the number of different brightness levels in Experiment 2.
Variable-brightness block
For the simple model (AIC = 4423.4, BIC = 4457.2), the fixed effects highlighted a nonsignificant effect for the intercept (p > 0.05), whereas the probe duration was a significant predictor (β = 12.658, SE = 0.299, p < 0.001). In relation to pupil size effects, as contrasted with the Small pupil bin reference, neither of the medium or large pupil bins manifested significant capability in predicting the likelihood of a long response (both p values > 0.05). With the complex model (AIC = 4370.7, BIC = 4458.5) the results provided a similar narrative. The intercept in the complex model was not statistically significant (p > 0.05), while the probe duration retained its significant predictive power (β = 13.489, SE = 0.674, p < 0.001). Mirroring the findings from the simple model, pupil size did not offer any significant predictive power when set against the reference category (both p values > 0.05). Notably, the complex model indicated a boundary (singular) fit, suggestive of an overparameterized model, underscoring the possibility that the complexity in this model might not be adequately supported by the pupil size data.
Constant-brightness block
The fixed effects of the simple model revealed the intercept to be marginally significant (β = 0.3532, SE = 0.2472, p = 0.153). The probe duration, as in Experiment 1, remained significant (β = 13.2684, SE = 0.5557, p < 0.001). Concerning the effect of pupil size, compared to the reference category (small bin), the likelihood of a long response was not significantly different for the medium or the large pupil bin (both p values < 0.05). The complex model failed to converge.
Taken together, the results do not show any relationship between pupil size and perceived duration.
No effects of background brightness and pupil size on duration discriminability
Like Experiment 1, we compared WR values (ratio of DL to PSE) for different brightness levels and pupil bins in Experiment 2. One-way ANOVAs showed no effect of background brightness (p = 0.424; BF_10_ = 0.198) or an effect of pupil bin in the variable-brightness block (p = 0.33; BF_10_ = 0.242) or the constant-brightness block (p = 0.191; BF_10_ = 0.385) on WR. Overall, identical to those in Experiment 1, results in Experiment 2 suggest no effect of background brightness or pupil bin on stimulus discriminability.
Discussion
Although people use duration information effectively to guide behavior, subjective time perception often diverges from veridical time (Eagleman, 2008; Zakay & Block, 1997). Research shows that both cognitive states and physical event characteristics impact duration perception (de Hevia et al., 2014; Kanai et al., 2006; Kaneko & Murakami, 2009; Lake et al., 2016; Vidaud-Laperrière et al., 2022; Walsh, 2003). In this study, we show that increased brightness results in longer perceived durations, and surprisingly, that this happens regardless of whether the brightness of a to-be-timed stimulus itself or its background is manipulated. We further show that arousal as indexed by pupil size does not notably affect perceived duration.
We reported two experiments aimed to clarify the specifics of the mechanism through which brightness influences perceived time. The first experiment demonstrated a direct effect of stimulus brightness on perceived durations, while the second experiment, which kept stimulus brightness constant and varied background brightness, showed similar results. These findings challenge the assumption that brightness affects time perception solely through the stimulus properties, instead suggesting a broader influence of environmental luminance. Specifically, the results suggest that the pulse rate of a hypothetical internal clock may be affected by brightness very generally and irrespective of the source of the brightness. In other words, the results suggest that the internal clock may be driven, among other things, by low-level visual input.
Within the context of an information-processing model, temporal integration is thought to be accelerated in response to increased stimulus intensity, which would be captured by an increased tendency to report a duration as “long” (Allan & Gibbon, 1991; Wearden, 1991). This tendency was observed in both of our experiments. These results can be explained by an accelerated internal clock mediated by increased stimulus intensity (i.e., higher brightness; Ivry & Schlerf, 2008; Penney et al., 2000; Treisman, 1963). This shift, evident in our experiments, suggests a stimulus-driven acceleration of the internal timing mechanism. In some ways the effect of stimulus intensity is analogous to the effect of arousal (if any, see below), in the sense that stimulus intensity affects the intensity of low-level visual processing, whereas arousal reflects the intensity of higher-level cognitive processes. However, it is important to note that shifts in PSEs are likely a culmination of several cognitive processes working in concert—including that of temporal attention—therefore attributing these results solely to increased stimulus intensity would likely fall short of capturing the underlying complexity of the observed effect (Angrilli et al., 1997; Droit-Volet & Meck, 2007; Gil & Droit-Volet, 2012; Mella et al., 2011; Smith et al., 2011; Wearden et al., 1998). Furthermore, recent findings suggest that the extent to which stimulus features influence time perception depends on the task used to measure the effect. Specifically, Petrizzo et al. (2023) demonstrated that numerosity biases duration judgments only in discrimination tasks (e.g., two-alternative forced choice), but not in equality judgments or reproduction tasks. Given that our study employed a temporal bisection task, which similarly requires categorical judgments, it is possible that decision-related processes contributed to the observed effect of brightness on time perception. Future research could explore whether these effects persist in noncategorical timing tasks (e.g., duration reproduction or equality judgments) to disentangle purely perceptual influences from those arising from top-down decisional factors such as heuristic-based decision-making strategies and biases.
It is not straightforward to explain our results through an effect of attention mediated by changes in brightness (Block & Gruber, 2014; Coull et al., 2004; Zakay & Block, 1997). Research indicates that attending to nontemporal task demands competes with focusing temporal elements (Brown, 1997; Zakay & Block, 1997). Since all attention lapses, whether intentional or not, can decrease the probability of registering time-representative pulses, this competition can lead to shorter perceived durations (Casini & Macar, 1997; Fortin & Masse, 2000; Grondin, 2010; Zakay & Block, 1997). Possibly, in Experiment 1, participants attended more to the stimulus if it was brighter, which could indirectly lead to increased perceived duration of bright stimuli; in this interpretation, the effect of brightness of perceived duration would be mediated by temporal attention. However, in Experiment 2, participants likely attended less to the stimulus if it was presented on a brighter background, which in this interpretation should lead to reduced perceived duration—the opposite of what we found. Thus, our findings likely reflect effects of brightness rather than attentional changes.
In both experiments we also monitored spontaneous pupil size to explore its potential impact on time perception for a given level of stimulus or background brightness. The motivation for this inquiry was threefold. Firstly, we aimed to utilize the established link between arousal and pupillary response, given arousal’s presumed effect on extending perceived time durations. Secondly, viewed through the lens of the “coding efficiency model” of temporal perception—a theory proposing that our brains optimize the processing of temporal information to enhance efficiency (Eagleman & Pariyadath, 2009)—higher neural activation on the retina would lead to spontaneously larger pupils being associated with longer perceived durations—due to higher light influx (Matthews & Meck, 2016). Lastly, research on the effect of neuromodulators such as dopamine and serotonin have postulated a common pathway for pupillary dynamics and temporal information processing, further bolstering a potential correlation between the two (Aston-Jones & Cohen, 2005; Berridge & Waterhouse, 2003; Buhusi & Meck, 2005; Coull et al., 2011; Joshi et al., 2016; Meck, 2006). Surprisingly, neither experiment found a significant correlation between pupil size and time perception, a finding that diverges from existing, albeit limited, literature. This suggests that, at least in the present paradigm, perceived duration is not affected by arousal.
Nevertheless, there are several methodological considerations that may have influenced our findings regarding the null effect of pupil size on duration perception. First, we measured naturally occurring (as opposed to experimentally induced) pupil-size variations, which might generate only modest changes in arousal insufficient to impact timing performance (Joshi & Gold, 2020). Second, the pronounced effect of brightness may have overshadowed subtle pupil-driven effects, particularly given that pupil diameter is itself partially governed by luminance levels (Binda et al., 2013; Mathôt & Van der Stigchel, 2015). Third, despite our trial-level analyses and relatively large number of trials per participant, we cannot rule out that a more targeted manipulation of arousal (e.g., emotional stimuli, reward) could reveal a clearer pupil–time link (Akdoğan et al., 2016; Einhauser et al., 2010; Goldinger & Papesh, 2012; Naber et al., 2013; Rohenkohl et al., 2012; Suzuki et al., 2016; Van Rijn et al., 2012). Additionally, while our generalized linear mixed models were robust and accounted for various sources of variability, it is possible that alternative analytical approaches or more sensitive measures of arousal might detect effects that our current analyses did not. Nonetheless, our paradigms successfully captured large, systematic shifts in timing judgments due to brightness, suggesting that our design was sufficiently sensitive to detect moderate effect sizes. Therefore, while we cannot definitively rule out a role for arousal in time perception, our data suggest that any effect of spontaneous pupil-size fluctuations on duration judgments, at least under the present conditions, is minimal. Lastly, one limitation of our study is that we assumed brightness perception follows the Weber–Fechner law rather than measuring subjective brightness levels. Future studies could improve upon this by having participants rate brightness levels beforehand and then using those ratings to calibrate stimuli (see Hurvich & Jameson, 1966; Palmer, 1999; Sulutvedt et al., 2021 for related work on brightness perception). This would better account for individual differences and enhance sensitivity to potential pupil-linked effects on time perception.
In conclusion, the combined behavioral and pupillometric findings presented here help illuminate the intricate interplay between sensory input and temporal cognition (Laeng et al., 2012; Matthews & Meck, 2016). More specifically, the findings that increased brightness, be it of the stimulus or its background, lengthens perceived duration, while spontaneous pupil size shows no significant correlation with time perception, challenge models that underpin widely held assumptions regarding a directional relationship between “arousal” and time perception. Our research thus puts forward a more nuanced view where low-level visual properties significantly influence our perception of time, independent of arousal-linked mechanisms traditionally associated with pupil size. Specifically, the finding that the brightness of the background affects time perception in more or less the same way as the brightness of the timed stimulus underscores the idea that our cognitive evaluation of time is holistic and integrative, relying not merely on the properties of the timed stimuli but on a composite of sensory experiences (Calvert et al., 2004; Driver & Noesselt, 2008; Matthews, 2011; Stein & Meredith, 1993; Stein et al., 2009; van Wassenhove, 2009; van Wassenhove et al., 2008). As time perception remains a central dimension of our conscious experience, unraveling its multifaceted nature will be instrumental in painting a more complete picture of human cognition.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bates, D., Maechler, M., Bolker, B., Walker, S., Christensen, R. H. B., Singmann, H., ... & Bolker, M. B. (2015). Package ‘lme 4’. Convergence, 12(1), 2.
- 2Brown, V. A. (2021). An introduction to linear mixed-effects modeling in R. Advances in Methods and Practices in Psychological Science, 4(1). 10.1177/2515245920960351
- 3Calvert, G. A., Spence, C., & Stein, B. E. (Eds.). (2004). The handbook of multisensory processes. MIT Press.
- 4Einhauser, W., Koch, C., & Carter, O. (2010). Pupil dilation betrays the timing of decisions. Frontiers in Human Neuroscience, 4, Article 946.10.3389/fnhum.2010.00018 PMC 283163320204145 · doi ↗ · pubmed ↗
- 5García-Pérez, M. A. (2014 b). Does time ever fly or slow down? The difficult interpretation of psychophysical data on time perception. Frontiers in Human Neuroscience, 8, Article 415.10.3389/fnhum.2014.00415 PMC 405126424959133 · doi ↗ · pubmed ↗
- 6Goel, A., & Buonomano, D. V. (2014). Timing as an intrinsic property of neural networks: Evidence from in vivo and in vitro experiments. Philosophical transactions of the Royal Society B: Biological sciences, 369(1637), Article 20120460.10.1098/rstb.2012.0460 PMC 389598524446494 · doi ↗ · pubmed ↗
- 7Grondin, S. (Ed.). (2008). Methods for studying psychological time. Psychology of time. Springer. [Google Books]
- 8Herbst, S. K., Javadi, A. H., van der Meer, E., & Busch, N. A. (2013). How long depends on how fast—Perceived flicker dilates subjective duration. PLOS ONE, 8(10), Article e 76074.10.1371/journal.pone.0076074 PMC 380676024194829 · doi ↗ · pubmed ↗
