Cognate Effects on Bilingual Lexical–Semantic Processing in Children: Insights from ERPs
Chih Yeh, Kathrin Wicinski, Caroline F. Rowland, Sergio Miguel Pereira Soares

TL;DR
This study explores how shared words (cognates) help bilingual children process language, using brain activity measurements.
Contribution
It provides direct neural evidence of cognate facilitation effects in bilingual children and examines interactions with language experience factors.
Findings
Cognate facilitation effects were observed only in the non-dominant language (German).
Higher proficiency and age reduced cognate facilitation effects, while higher exposure amplified them.
Findings suggest cross-linguistic activation benefits younger children with lower proficiency or higher exposure.
Abstract
This study investigates whether and, if so, how cognates facilitate lexical–semantic processing during early bilingual development. Additionally, we examine the interaction between the cognate facilitation effect (CFE) and bilingual experience factors, such as language proficiency, exposure, and age. We investigated language backgrounds and recorded event-related potentials during a semantic priming task in Dutch–German bilingual children. Most participants were Dutch-dominant, characterized by higher exposure and proficiency in Dutch. We compared the N400 response to target words preceded by semantically related cognate versus non-cognate primes. We found a reduced N400 effect (indexing cognate facilitation) only in the non-dominant language (nDL; German). Individual difference analyses further revealed that higher proficiency of nDL and increasing age attenuated the CFE. In contrast,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1- —Max Planck Society
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurobiology of Language and Bilingualism · Language Development and Disorders · Categorization, perception, and language
1. Introduction
Cognates are translation equivalents across two languages that share meaning and overlap in phonological and/or orthographic form. Previous studies have suggested that cognates can benefit lexical development and processing in bilingual children. For example, multiple studies report that translation equivalents with form overlap tend to be learned more easily and earlier in children’s vocabularies (Bosch & Ramon-Casas, 2014; Bosma et al., 2019; Garcia-Castro et al., 2025; Goriot et al., 2021; Lindgren & Bohnacker, 2020; Mitchell et al., 2024). Experiments using picture naming, word recognition, lexical decision, and sentence-reading tasks find shorter reaction times and/or higher accuracy for cognates vs. non-cognates (Bosma & Nota, 2020; Duñabeitia et al., 2016; Iniesta et al., 2024; Koutamanis et al., 2024a; Poarch & van Hell, 2012; Schröter & Schroeder, 2016). These findings align with the cognate facilitation effect (CFE). Previous studies suggested that CFE may arise due to the co-activation of overlapping semantic and phonological/orthographical representations within an integrated bilingual lexicon. Such co-activation often leads to faster lexical–semantic access, the process of retrieving words’ meanings and forms from memory (see the Bilingual Interactive Activation Plus model, BIA+; Dijkstra & van Heuven, 2002).
1.1. The Role of Age and Bilingual Experience
Some studies also addressed how cognates interact with bilingual language development by comparing CFE across age groups. Specifically, these studies examine whether the processing advantage for cognates (e.g., recognizing rose faster because of the Dutch roos) changes as children mature. However, results have been inconsistent. For instance, two studies report that sensitivity to cognates increases with age. Bosma et al. (2019) and Goriot et al. (2021) found that in younger children (ages 5–8 and ages 4–12), the CFE tends to emerge or increase with age, which they attribute to growing linguistic awareness and greater sensitivity to cross-linguistic phonological regularities. In contrast, Duñabeitia et al. (2016) reported that in older children and adolescents (ages 8–15), the CFE decreases with age. They suggest that increased reading experience and maturing language control mechanisms may strengthen cross-language inhibition and thus reduce CFE over time.
Several of the studies reviewed above have also investigated the role of individual bilingual experiences on the cognate status. Overall, they found asymmetric CFE in the two languages, where language exposure and proficiency appeared to modulate the effect (Bosma et al., 2019; Bosma & Nota, 2020; Duñabeitia et al., 2016; Goriot et al., 2021; Koutamanis et al., 2024a). These studies typically categorize participants’ two languages as dominant language (DL) and non-dominant language (nDL) based on exposure and/or objective proficiency measures. Several developmental studies report CFEs when targets are in the nDL (Bosma et al., 2019; Bosma & Nota, 2020; Koutamanis et al., 2024a). These findings align with the BIA+ model, which predicts faster activation of words in the DL and a substantial influence of DL activation on translation equivalents in the weaker language. Bosma and Nota (2020) found shorter reading times for cognates only in the children’s nDL in a sentence-reading task. Koutamanis et al. (2024b) also observed a CFE in Dutch-dominant children in a lexical decision task with Dutch–Greek bilinguals. These results indicate that both age and bilingual experiences may shape early bilingual lexical–semantic development, and they underscore the need to measure and model bilingual experience factors, such as exposure, proficiency, and age, when investigating CFE in semantic processing.
1.2. Cognate Effects in Semantic Processing
Despite robust evidence for CFE at the level of lexical access in both lexical processing and early bilingual development, far less is known about how this effect extends beyond word recognition or lexical access to influence bilingual semantic processing during development. Only one behavioral study so far has directly compared the semantic processing of cognates and non-cognates, and few have systematically examined cognate status. Floccia et al. (2020) found no cognate advantage in an intermodal preferential-looking task, but their broad cognate definition and omission of language dominance measures may have obscured effects. Evidence from adult studies indicates that cognates can speed recognition and enhance downstream semantic prediction (Ito et al., 2025), and faster prime processing increases priming magnitude (Hutchison et al., 2008). These findings posit a plausible mechanism in which cross-language overlap accelerates lexical access to the prime. Such speeded processing, in turn, facilitates prediction and semantic integration, which is important during early bilingual development.
Other work, instead of comparing cognates and non-cognates, examined the role of language distance in semantic priming. These results provide an indirect lens on CFEs: semantic processing in closely related language pairs is likely to benefit more from cognates than processing in more distantly related pairs. Studies of closely related language pairs (24-month Spanish–English, De Anda & Friend, 2020; 30-month French–English, Jardak & Byers-Heinlein, 2019) report priming in both languages with no interaction with dominance. Similarly, an EEG study reported N400 priming in bilingual toddlers (24–30 months) and found more pronounced priming effects in French–Spanish than in French–English toddlers (Rämä et al., 2024). The researchers reported a higher proportion of the cognate words in the French–Spanish condition (42.5%) than French–English (27.5%), although they did not explicitly control that cognate status. Such differences may partly account for the stronger priming effect in the French–Spanish condition. By contrast, Singh (2014) found no cross-language priming when the prime was in the nDL for 30-month English–Mandarin bilinguals, suggesting that bilingual children may not have benefited from their DL because the large language distance between the DL and nDL limits cross-language support. These patterns are consistent with higher cognate frequency and greater lexical overlap in related languages (Schepens et al., 2013), which would enhance cross-language activation and support semantic processing in the nDL. Together, these language distance findings provide converging, indirect evidence, implying that cognates may facilitate semantic processing by accelerating prime recognition and strengthening cross-language links.
1.3. The Present Study
The preceding review of behavioral and neural data tentatively suggests that cognates influence semantic processing; however, a systematic examination of CFE on semantic processing and development is currently lacking. To address this gap, the present study utilizes an auditory semantic priming paradigm alongside electroencephalogram (EEG) recordings in Dutch–German bilinguals (a closely related language pair; Schepens et al., 2013). We targeted the 2-to-6-year age range to capture the developmental trajectory of the CFE during the most dynamic period of lexical acquisition (DeAnda et al., 2016). Covering this broad window allows us to examine how sensitivity to cognates evolves from early word learning through the preschool years, providing a more granular view of how bilingual experience impacts CFE and semantic processing over time. Here, we use the N400, a well-established event-related potential (ERP) index of lexical–semantic retrieval (Delogu et al., 2019), as the primary neural signature of processing. The N400 is typically larger (more negative amplitude) for semantically incongruent than for congruent word pairs and is interpreted as reflecting increased semantic integration demands (Kutas & Federmeier, 2011). In priming paradigms, reduced N400 amplitude (i.e., less negative) for primed targets is taken as evidence of facilitated semantic access and activation of the semantic network. These features provide an online, response-independent index of cognitive processing and allow us to capture neural effects comparable to previous behavioral findings (e.g., Bice & Kroll, 2015).
With this in mind, the present study aims to (1) provide a neural-level test of CFE and (2) systematically investigate the moderating role of bilingual experience variables (specifically, language exposure, proficiency, and age). Our primary hypothesis is that cognate primes will facilitate the semantic processing of target words. We tested this by comparing the N400 responses of Dutch–German bilingual children listening to word pairs containing cognate primes (e.g., Baum/boom; “tree” in German and Dutch) versus non-cognate primes (e.g., Teller/bord; “plate” in German and Dutch). Given that a reduction in N400 amplitude indexes easier semantic integration, we specifically predicted smaller N400s for trials involving cognate primes. Furthermore, based on prior work, we predicted a stronger CFE in the children’s non-dominant language. Given the mixed developmental findings, we made no directional prediction for the effect of age.
2. Materials and Methods
2.1. Participants
Forty typically developing Dutch–German bilingual children aged between 2 and 6 years (mean = 3.99, SD = 1.07, range = 2.08–5.75, 15 females) participated in the study. We recruited these participants through the Baby and Child Research Centre in Nijmegen, The Netherlands. During the recruitment phase, we ensured that participants met specific inclusion criteria. Specifically, we verified that all children had normal hearing and vision, no diagnosed developmental disorders, and no significant exposure to languages other than Dutch or German. Furthermore, we confirmed that every participant received at least 10% of their language input in each of the two languages (Hoff et al., 2012; Hoff & Ribot, 2017). Geographically, most families resided in the Dutch–German border region; specifically, 37 families lived in the Netherlands, while three families lived in neighboring Germany. Caregivers reported a predominantly high socioeconomic status (SES) background. Regarding household income, 90% of families reported earning more than €3000 per month (with 75% exceeding €3800). Similarly, parental education levels were high, with 79.5% of caregivers holding a university degree. The study received approval from the Ethical Board of Social Sciences at Radboud University in Nijmegen, The Netherlands. Prior to the first experimental session, we provided caregivers with detailed information regarding the study procedures and obtained their written informed consent. Families received either €50 or €40 plus a children’s book as compensation.
2.2. Language Background Assessment
2.2.1. Cross-Linguistic Lexical Task (CLT)
We assessed children’s receptive vocabulary using the Cross-linguistic Lexical Task (CLT; Haman et al., 2015), part of the LITMUS test battery (https://www.bi-sli.org/crosslinguistic-lexical-task, accessed on 11 February 2026). The CLT was developed for bilingual populations between 3 and 7 years of age and is available in multiple languages, making it suitable for cross-linguistic comparisons (analyses across different language systems) and within-child comparisons of lexical knowledge (the breadth and depth of a child’s vocabulary). We selected the CLT because it is specifically designed for bilingual populations; the test items were carefully selected based on Age of Acquisition (AoA) and complexity to ensure cross-linguistic equivalence and minimize cultural bias. In this study, we administered the comprehension component of the Dutch (CLT-NL; Van Wonderen & Unsworth, 2021) and the German version (CLT-DE; Rinker & Gagarina, 2017). The comprehension task follows a four-picture choice format: after viewing four images, the child hears an audio prompt and selects the picture that matches the target noun or verb. Each domain (nouns, verbs) includes 32 test items. Accuracy in the CLT reflects the child’s receptive vocabulary size and provides a reliable estimate of lexical knowledge in each language, independent of production abilities. The task’s standardized design across languages enables direct comparison of lexical development across two languages. In this study, the CLT was administered in Dutch and German, respectively, following the two EEG recording sessions (see Section 2.3.2 Procedure).
2.2.2. Quantifying Bilingual EXperience (Q-BEx)
We collected children’s bilingual experience using the Q-BEx parental questionnaire (De Cat et al., 2023), which quantifies a bilingual child’s language background and experience. We chose this tool because it captures the dynamic nature of bilingualism by estimating cumulative exposure rather than current input alone, providing a more reliable metric of the nature of the linguistic input that shapes lexical development over time. Participants’ caregivers completed the questionnaire, which included modules on background information (e.g., age, family background), risk factors (e.g., language developmental delays), language exposure and use, the richness of linguistic experience, and language mixing. For the present study, we focused solely on the language exposure and use module, using the resulting cumulative estimates as an index of language exposure. To calculate cumulative exposure, caregivers first identified significant time points where a child’s language exposure changed. These points defined distinct time periods, and for each period, the caregiver reported the percentage of exposure to each language. Then, we calculated the total cumulative exposure for each language by multiplying the duration of each period (in months) by the percentage of exposure to that language and summing these values across all periods. The percentage of cumulative language exposure was then determined by dividing this total cumulative exposure score for a language by the child’s current age in months. In the current study, the Q-BEx was completed online by the caregiver(s) before the experiment and took approximately 15 to 30 min to finish.
2.3. Semantic Priming Paradigm
2.3.1. Materials
We have pre-registered this study (Open-Ended Registration; Wicinski et al., 2025) on OSF (Open Science Framework). The materials were a subset from a larger study investigating lexical–semantic development in Dutch–German bilingual children using within- and across-language semantic priming with noun and verb pairs1 (for more details, see footnote 1 and the preregistration, Yeh et al., 2025). The current study focuses specifically on the CFE on semantic processing, given the N400’s sensitivity to various semantic factors (e.g., word frequency, concreteness, and semantic anomalies; see Kutas & Federmeier, 2011, for a review). To ensure that any reduction in the N400 response is solely due to cognate status, here, we only compare semantic-related noun pairs within each language.
The stimuli comprised 124 nouns in Dutch and German, arranged into 62 prime-target word pairs (32 Dutch and 31 German). To ensure that the words were familiar to the children, we selected the words from the Dutch and German McArthur Communicative Development Inventory (CDI-WS) (Szagun et al., 2023; Zink & Lejaegere, 2003), with a few additional German words sourced from a Goethe Institute A1-level word list (Goethe-Institut, 2004). To ensure uniform acoustic quality and native pronunciation, stimuli were recorded in a soundproof booth by two female speakers per language (Dutch and German). All audio files were subsequently processed using the pydub library in Python (version 3.8) to normalize intensity (RMS amplitude) (Robert, 2018).
Cognate status was determined by four native Dutch–German bilingual adults. Based on their ratings, there were 16 German cognates in the Dutch condition and 19 Dutch cognates in the German condition. We also calculated the normalized Levenshtein Distance (NLD), a measure of phonetic similarity for determining the degree of phonological overlap between the translation equivalents (Dijkstra et al., 2010; Schepens et al., 2013; Tokowicz et al., 2002), and included word frequency (SUBTLEX; Brysbaert et al., 2011; Keuleers et al., 2010) and word length as control variables. Details are presented in Table 1. These measures confirmed the rating results: cognate conditions had significantly higher phonetic similarity than non-cognate conditions (Dutch: t(30) = 6.88, p < 001; German: t(29) = 7.74, p < 001). Crucially, we found no significant differences between the cognate and non-cognate word pairs for word length (Primes: Dutch t(30) = −1.01, p = 0.32; German t(29) = −1.43, p = 0.16; Targets: Dutch t(30) = −0.47, p = 0.64; German t(29) = 0.73, p = 0.47), word frequency (Primes: Dutch t(29) = 1.91, p = 0.07; German t(28) = 0.36, p = 0.72; Targets: Dutch t(28) = 0.89, p = 0.38; German t(28) = 0.15, p = 0.90), or semantic relatedness between the pairs (Dutch t(30)= −0.35, p = 0.73; German t(29) = 0.74, p = 0.47). Thus, our two conditions varied primarily in their phonetic overlap. A detailed overview of the stimuli and their characteristics is provided in Appendix A Table A1.
2.3.2. Procedure
The study was conducted across two sessions. Each of the two visits comprised an EEG recording and a CLT session. The order of languages of CLT (Dutch vs. German) was counterbalanced across the two visits. Additionally, within each CLT assessment, the administration order of the subsections (nouns vs. verbs) was randomized and counterbalanced. During the EEG recording, the participant and their parents were seated in a soundproof, electrically shielded booth. Stimuli were presented via two speakers at 65 dB. In each trial, a cognate or non-cognate prime was presented within a carrier phrase within the language (examples for Dutch: cognate prime: “een sok”, a sock; non-cognate prime: “een winkel”, a store), followed by a semantically related target word (e.g., “schoen”, shoe; “bakker”, bakery). The interstimulus interval (ISI) between the prime phrase and target word was 500 ms; the intertrial interval (ITI) was 2200 ms.
To keep the participants engaged and calm, silent cartoon videos unrelated to the stimuli were shown on a screen; quiet fidget toys and coloring materials were also provided during the experiment. Importantly, because participants process the auditory stimuli passively, developmental EEG studies often prioritize the acquisition of high-quality neural data over enforcing attentional focus (Hervé et al., 2022). To prevent inadvertent cueing by the parent, parents wore headphones delivering masking audio (music and speech). After each EEG recording, participants completed the CLT with an experimenter (see Section 2.2.1).
2.3.3. EEG Recording and Analysis
The electroencephalogram (EEG) was continuously recorded using 32 active Ag/AgCl electrodes (actiCAP) (FP1, FP2, F7, F3, Fz, F4, F8, FT9, FC5, FC1, FC2, FC6, FT10, T7, C3, Cz, C4, T8, TP9, CP5, CP1, CP2, CP6, TP10, P7, P3, Pz, P4, P8, O1, Oz, O2) from the 10–20 system (BrainAmp DC amplifier, and Brain Vision Recorder Brain Products GmbH, Gilching, Germany). All electrodes were online referenced to FCz. The O1 and O2 electrodes were placed on the mastoids as offline reference electrodes (“TP9L”, “TP10L”). This different configuration provided offline re-referencing without compromising relevant data acquisition, as occipital channels are not critical for the analyses conducted in this paradigm. The two frontal electrodes above the eyes (Fp1, Fp2) were used to capture vertical eye movements, and the two electrodes at the outer canthi of the eyes (FT9, FT10) were used to capture horizontal eye movements. Electrode impedance levels were generally kept below 25 kΩ. Data were recorded at a sampling rate of 500 Hz, with an online time cut-off of 10 s and a high cut-off of 1000 Hz.
For the ERP analysis, continuous data were pre-processed in MATLAB (R2024a) using EEGLAB (Delorme & Makeig, 2004) and ERPLAB (Lopez-Calderon & Luck, 2014). The continuous EEG was band-pass-filtered offline from 0.1 to 30 Hz. Epochs were created time-locked to the target word onset and spanned 1200 ms (−200 to 1000 ms relative to stimulus onset). Baseline correction was performed on the −200 to 0 ms pre-stimulus window. Automatic artefact detection removed bad channels and epochs with high-amplitude artefacts (±150 μV for EEG channels; ±250 μV for EOG channels). After removal of large artefacts, data were decomposed using independent component analysis (ICA); components reflecting eye movements were identified by component morphology and visual inspection and removed. On average, 1.63 eye movement components were removed per participant. A second round of manual artefact rejection was then performed to remove residual noise not captured by ICA. Missing channels were interpolated for 14 participants using spherical-spline interpolation (mean number of repaired channels = 1.7, range = 1–5). Artefact-free data were re-referenced to the average of the mastoid electrodes (TP9L, TP10L). Only participants with at least eight artefact-free trials per condition were included in the final analysis. Overall, 20.84% of trials were rejected on average. Eleven participants were rejected.
As proposed in the preregistration, we firstly performed cluster-based permutation analyses (CBPA; Maris & Oostenveld, 2007) in Fieldtrip (Oostenveld et al., 2011) to assess differences between the cognate and non-cognate. In the expected N400 time window (250–550 ms), the test revealed no significant differences in either language. However, an exploratory open-window search (0–1000 ms) identified a significant negative cluster in the German condition (p = 0.02) from 726 to 998 ms over fronto-central electrodes (21 channels). Within this cluster, non-cognates displayed more negative amplitudes than cognates. While we initially expected the cognate effect to manifest as a reduced N400 (Kutas & Federmeier, 2011), this later time window aligns with distinct neural signatures observed in bilingual children’s non-dominant language. For instance, Sirri and Rämä (2019) found a late anterior negativity in French–Spanish bilingual children (aged 2–4), which they attributed to less efficient semantic processing. Similarly, Conboy and Mills (2006) linked late negativity to increased attentional demands in toddler word processing. Thus, the late negativity observed here may reflect delayed semantic integration in the non-dominant language, with the reduced amplitude for cognate targets indicating a reduction in these processing demands.
Despite this exploratory finding, our primary hypothesis specifically concerned the earlier N400 component. Cluster-based tests, while robust for exploratory searches, can lack sensitivity for specific effects if the signal is spatially broad or temporally fleeting (Sassenhagen & Draschkow, 2019). Therefore, to rigorously test our a priori N400 predictions, we proceeded to a region-of-interest (ROI) analysis on the mean amplitudes within the 250–550 ms window.
While our preregistration specified a Bayesian framework, we opted for a frequentist linear mixed-effects (LME) model using the lme4 package (Bates et al., 2015) in R. This adjustment was made to facilitate standard significance testing and ensure comparability with the existing frequentist literature in this domain. The final model included fixed effects for cognate status (levels: cognate, non-cognate; reference: cognate), language (levels: Dutch, German; reference: German), and topographic distribution (levels: frontal–central, posterior; reference: frontal–central), along with their full interactions. To account for individual variability in processing, the model incorporated random intercepts for subjects and random slopes for cognate status by subject, which allowed the effect of cognate status to vary across individual participants.
2.3.4. Individual Differences Analysis
The effects of language exposure, proficiency, and age on the cognate facilitation effect (CFE) were analyzed using linear mixed-effects models. For each participant, N400 mean amplitudes were computed for every condition at each channel. Language exposure was operationalized as the percentage of German exposure reported in the Q-BEx questionnaire, since Dutch and German together accounted for 100% of cumulative exposure; thus, higher German exposure values corresponded to lower Dutch exposure. Proficiency was indexed by overall accuracy in Dutch or German, depending on the language dominance (which would be determined by the amount of language exposure). The initial model specification included fixed effects of cognate status, age, and percentage of German exposure, as well as interactions between cognate status and each of these predictors. Random intercepts for participants were also included. To address potential collinearity, we first assessed intercorrelations among predictors. Based on these results, highly correlated predictors were included in separate models to ensure stability.
3. Results
This study incorporates both behavioral and neural measures, each with different sample sizes, in order to maximize participant inclusion. For the ERP analyses, we report data from 29 participants (10 female; M = 4.16 years, SD = 1.07). Eleven participants were excluded from this dataset due to insufficient number of trials remaining after artifact removal (e.g., blinks, muscle noise and head movements). For the subsequent individual-level analyses, which combine ERP and behavioral data, further exclusions were necessary: two participants due to incomplete Q-BEx questionnaires (final N = 27) and one due to missing CLT task data (final N = 28). As a result, the analyses in Section 3.1 and Section 3.3 rely on different sample sizes. We included 27 participants in the language exposure and age model, but 26 participants in the language exposure and proficiency model. This variation reflects the availability of data for each specific predictor.
In summary, the results confirmed that the majority of participants were Dutch-dominant based on language background measures. Neural data indicated a CFE, indexed by reduced N400 effect, only in the non-dominant language (German). Finally, linear mixed-effects modeling revealed that the effect was modulated by individual differences in age, language proficiency, and cumulative exposure, respectively. We detail these findings in the sections below.
3.1. Language Background
Descriptive results for the participant’s cumulative language exposure, as measured by the Q-BEx, and lexical proficiency in Dutch and German, assessed using the LITMUS Cross-linguistic Lexical Task (CLT), are presented in Table 2. Overall, both language background measures reveal a clear pattern of Dutch, which aligns with the demographic context that the majority of families reside in the Netherlands, where Dutch is the societal language. Consequently, German represents the non-dominant language for our sample on average. Based on the Q-BEx questionnaire (N = 27), twenty-two participants received more cumulative exposure to Dutch than to German, three were predominantly exposed to German, and two had balanced exposure. The CLT score (N = 28) also demonstrates an overall better proficiency in Dutch. Specifically, the accuracy for nouns is better than verbs regardless of language. No significant difference was observed in participants’ noun proficiency across languages (t(24) = 0.16, p = 0.88), whereas verb proficiency was significantly higher in Dutch (t(24) = 5.52, p < 0.001).
To examine the relationships between the behavioral measures, we conducted Pearson’s correlations including age, cumulative exposure to both languages, and CLT scores in both languages. The results shown in Table 3 revealed strong correlations between age and language proficiency (i.e., all the CLT scores).
3.2. ERPs
A linear mixed-effects model was used to investigate whether the N400 mean amplitude measured within the 250 to 550 ms time window differed between cognates and non-cognates in both Dutch and German. The mixed-effects model showed a marginal main effect of cognate status (model coefficient for non-cognate − cognate = −1.86, p = 0.048), which corresponded to cognates eliciting on average an effect that was 1.86 µV more positive than non-cognates (i.e., a smaller N400 response for cognates). In addition, the model revealed several interactions affecting N400 amplitude. The cognate × language interaction (β = 1.23, p = 0.048) indicated that the reduction in N400 for cognates versus non-cognates was reduced in Dutch. The cognate × topography interaction (β = 2.05, p < 0.001) showed that the N400 difference between conditions was smaller at posterior electrodes than at frontal–central electrodes. Language also interacted with topography (β = 1.71, p = 0.004), reflecting language-dependent differences in anterior–posterior amplitude. The three-way interaction between cognate status, language, and topography did not reach the significance threshold (β = −1.65, p = 0.053). However, the result suggested a trend where the reduction in the cognate effect in Dutch (compared to German) was stronger over posterior electrodes. Overall, cognates elicited a smaller N400 than non-cognates (≈1.9 µV), but this N400 difference is substantially modulated by both topographic distribution and language. Figure 1 displays the EEG waveforms. Table 4 details the fixed effects for the main ERP analysis of the semantic priming experiment. We report the subsequent analyses regarding how language proficiency and exposure modulate these effects in the ‘Individual Differences’ section (Section 3.3; see Table 5).
Post hoc planned pairwise comparisons on estimated marginal means (EMMs; emmeans; Lenth et al., 2025) with FDR correction revealed a significant cognate facilitation effect at frontal–central sites in German: cognate–non-cognate = 1.86 µV, SE = 0.94, 95% CI [0.02, 3.70], p = 0.0475 (FDR-adjusted), consistent with the cognate facilitation effect (smaller N400 for cognates), but not at frontal–central sites in Dutch (cognate–non-cognate = 0.63 µV, SE = 0.94, 95% CI [−1.21, 2.48], p = 0.50) or at posterior sites in either language (German: −0.19 µV, 95% CI [−2.00, 1.63], p = 0.84; Dutch: 0.23 µV, 95% CI [−1.58, 2.05], p = 0.80).
3.3. Individual Differences
The LME and post hoc analyses reported in Section 3.2 revealed significant differences in the German condition (frontal–central region) but not the Dutch condition. As such, here we only focus on the German conditions to examine how bilingual experience modulates the N400 cognate facilitation effect (CFE) in a non-dominant language. We tested whether the cognate effect was influenced by German proficiency, age, and cumulative exposure to German. Because age and proficiency were highly collinear (see Table 2), we fitted two separate models. In both models, we included interactions between cognate status and the individual predictors (e.g., cognate × age and cognate × exposure). However, we excluded the three-way interaction (e.g., cognate × age × exposure) and the interaction between predictors (age × exposure) after preliminary model checks revealed that including these terms introduced severe multicollinearity (correlations of fixed effects > 0.80), which compromised the stability of the estimates. Therefore, the final models were specified as follows: Model 1 included cognate status, German proficiency, cumulative exposure and their interaction while Model 2 included cognate status, age, cumulative exposure and their interaction.
The mixed-effects models revealed that cognate status was a robust predictor of N400 mean amplitude across both specifications (see Table 5). In Model 1, cognate words elicited significantly more negative amplitudes compared to non-cognates (β = −17.79, 95% CI [−21.77, −13.81]), and this effect was moderated by both German CLT accuracy and cumulative exposure to German. Specifically, higher CLT accuracy attenuated the cognate effect (β = 23.06, 95% CI [18.22, 27.91]), whereas greater cumulative exposure amplified it (β = −6.30, 95% CI [−10.33, −2.26]). In addition, German CLT accuracy alone was negatively associated with N400 amplitude (β = −18.62, 95% CI [−26.49, −10.74]), and cumulative exposure showed a modest positive association (β = 7.23, 95% CI [0.67, 13.79]).
Model 2, which included age as a predictor, confirmed the main effect of cognate status (β = −4.82, 95% CI [−8.03, −1.61]), while also revealing a significant negative effect of age (β = −1.75, 95% CI [−2.82, −0.69]). Age further interacted with cognate status, such that the cognate effect decreased with increasing age (β = 1.20, 95% CI [0.51, 1.89]). Cumulative exposure to German and its interaction with cognate status were not significant in this model.
Across both models, the cognate effect on N400 amplitude was robust, confirming that cognate words elicit enhanced neural responses compared to non-cognates in the non-dominant language. However, the factors moderating this effect differed depending on whether proficiency or age was considered as part of the model. In Model 1, higher German proficiency reduced the cognate facilitation effect, suggesting that more proficient bilinguals rely less on cross-linguistic form overlap. In contrast, greater cumulative exposure enhanced the effect, indicating that continued experience with the non-dominant language amplified sensitivity to cognate status. In Model 2, age emerged as a significant predictor, both directly influencing N400 amplitude and moderating the cognate effect, such that older participants showed a weaker facilitation effect.
4. Discussion
The current study directly investigated how the CFE influences online semantic processing in the developing brain, and the interaction between the CFE and individual bilingual experience variables. The results from the language background assessment and questionnaire indicated that the majority of our participants were Dutch-dominant, with German as their non-dominant language. The semantic priming experiment revealed a crucial finding: the CFE, indexed by a reduction in N400 amplitude in the cognate condition, was present in the nDL but absent in the DL. This suggests that bilingual speakers utilize cognate representations in the DL to support semantic processing in their nDL. Further analyses using linear mixed-effects (LME) models demonstrated a joint impact of age, language proficiency and cumulative language exposure on the N400 effect. Specifically, participants with lower nDL accuracy and higher nDL exposure showed a greater reliance on cognates during nDL processing. Finally, we discovered an age effect, which aligned with the proficiency findings, indicating that younger participants exhibited a stronger CFE during nDL processing.
This main finding confirmed that there was a CFE in semantic processing, but also raised a question regarding the underlying mechanism: how does a cognate prime, possessing a phonologically and semantically overlapping translation equivalent in the DL (e.g., Dutch oor), facilitate online semantic priming exclusively within the nDL (e.g., German Ohr–Mund)? The relationship between CFE and semantic processing is not as straightforward as in studies of word recognition and naming, where facilitation is typically observed in explicit measures (for example, shorter naming reaction times for cognates than for non-cognates). Semantic priming, in contrast, indexes more implicit and automatic activation. Below, we propose our interpretations of how cognates might influence semantic processing and development.
Cognate words are fundamentally acquired earlier and more easily than non-cognates in bilingual children’s vocabularies (Bosch & Ramon-Casas, 2014; Bosma et al., 2019; Garcia-Castro et al., 2025; Goriot et al., 2021; Lindgren & Bohnacker, 2020; Mitchell et al., 2024). That means bilingual children might be more familiar with cognate words than non-cognates. Garcia-Castro et al. (2025) suggested that the CFE on word acquisition is the result of enhanced cross-language co-activation: when a bilingual child is exposed to a word form, they activate the corresponding lexical representations of both languages. Since cognates receive stronger activation from their translation equivalent than non-cognates, children accumulate learning instances at a faster rate. As a result, the lexical representations of cognate words consolidate at earlier ages than those of non-cognate words.
Therefore, it is plausible that cognates facilitate semantic processing by accelerating recognition of the prime, which in turn enhances activation of the weaker lexical–semantic links in the nDL. This compensatory mechanism is relevant because lexical–semantic activation is typically slower in the nDL of bilinguals. For instance, Rämä et al. (2018) found that semantic priming effects in bilingual infants emerged only at longer Stimulus Onset Asynchronies (SOAs). Given that faster prime recognition has been shown to enhance semantic priming (Hutchison et al., 2008) and that previous studies indicate cognates accelerate word recognition (Duñabeitia et al., 2016; Koutamanis et al., 2024a, 2024b; Schröter & Schroeder, 2016), cognates may partially compensate for slower nDL activation. This supports more efficient semantic processing, as reflected in the reduced N400 amplitude for the cognate primes observed in our study.
Building on this interpretation, we speculate that in unbalanced bilingual children (biased to one of their native languages), cognate words with stronger lexical representations of their DL may serve as a bridge or scaffolding for that of their non-dominant languages’ development (Costa et al., 2005; Iniesta et al., 2024). This implies that as children accumulate concept and lexical representation mappings in their DL, the cognate link (the phonological overlap between translation equivalents) can strengthen the connections between the nDL representations and the shared concept. As a result, unbalanced bilingual children could build their nDL via their DL. This necessity for scaffolding is consistent with the “Weaker Links” Hypothesis (Gollan et al., 2008), which posits that the reduced exposure and use of the nDL lead to less frequent activation and, consequently, weaker lexical–semantic links within that language (Sirri & Rämä, 2019).
Our results revealing both a language proficiency-related and an age-related decrease in the CFE support this idea. These findings suggest that cognates not only facilitate online processing, but may also help unbalanced bilingual children establish stronger lexical representations of their nDL. The age-related decline in the CFE has also been reported in reading development, where increased reading experience and the maturation of language control mechanisms reduce cognate benefits (Duñabeitia et al., 2016). The developmental BIA model (BIA-d) (Grainger et al., 2010) accounts for this effect by proposing that form-based co-activation of translation equivalents decreases as proficiency increases, due to more effective inhibitory control mechanisms suppressing cross-linguistic interference. Duñabeitia et al. (2016) extend this view by emphasizing the role of developing attentional and control processes in attenuating the CFE during reading development. By extension, the consolidation of the lexical semantic system and improved language control mechanisms likely contribute to the age-related reduction in lexical semantic cognate facilitation observed in the present study.
Finally, we need to acknowledge several limitations of the present study. First, as we noted in Materials and Methods, our stimulus set was drawn from a subset of items from a larger study that was not originally designed to compare semantic priming effects between cognate and non-cognate primes. As a result, the present experiment included a relatively low number of trials, which precluded strict control over the cognate status of the target words. This matters because a greater proportion of trials with non-cognate primes preceding cognate targets may have attenuated the observed cognate effect by facilitating processing in the non-cognate condition. Indeed, across both languages, the proportion of cognate targets was higher in the non-cognate prime condition (Dutch: 69% vs. 50%; German: 75% vs. 55%).
Next, even though our final sample size (N = 27) is typical for developmental EEG research, it remains relatively modest given the complexity of the linear mixed-effects models we employed. The inclusion of multiple predictors and interaction terms increases the risk of overfitting and limits the statistical power to detect subtle effects. The modest nature of the N400 evidence (p = 0.048) reflects this issue; we detected this effect only in the hypothesis-driven ROI analysis and not in the more conservative cluster-based permutation test. This discrepancy highlights the trade-off between the sensitivity of hypothesis-driven ROI approaches and the strict error control of cluster-based permutation analyses, and thus these effects should be interpreted with caution.
Despite the limitations regarding statistical power and sample size, these findings offer a theoretical basis for future research. If the CFE serves as a scaffolding mechanism for the non-dominant language, as our data tentatively suggest, this dynamic could eventually inform strategies in bilingual education. Consequently, future research should expand on these findings to better understand the developmental trajectory of the CFE. Beyond the necessity for larger, high-powered replications, longitudinal designs are essential to map how the reliance on cognates shifts over time within the same individuals. Such studies could determine whether the age-related reduction in CFE we observed represents a true developmental shift in lexical organization or simply a byproduct of increasing proficiency. Furthermore, to disentangle the role of phonological overlap from semantic overlap, future work should examine these effects in bilinguals acquiring language pairs with varying degrees of language distance (e.g., English–Dutch versus English–Spanish). Comparing these populations would clarify whether the scaffolding mechanism relies strictly on form similarity or if conceptual links alone are sufficient to drive cross-language activation in the developing brain.
In summary, our results provide tentative evidence for a CFE, suggesting that bilingual children growing up with two closely related languages can benefit from cognates during lexical–semantic processing in their non-dominant language. Consistent with the patterns observed in asymmetric activation, the magnitude of the CFE appears to be influenced by developmental stage, language proficiency, and cumulative exposure. These findings underscore the importance of considering both linguistic and developmental context in future studies of bilingual lexical–semantic processing.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bates D. Mächler M. Bolker B. Walker S. Fitting linear mixed-effects models using lme 4Journal of Statistical Software 20156714810.18637/jss.v 067.i 01 · doi ↗
- 2Bice K. Kroll J. F. Native language change during early stages of second language learning Neuroreport 2015261696697110.1097/WNR.000000000000045326351964 PMC 4589520 · doi ↗ · pubmed ↗
- 3Bosch L. Ramon-Casas M. First translation equivalents in bilingual toddlers’ expressive vocabulary: Does form similarity matter?International Journal of Behavioral Development 201438431732210.1177/0165025414532559 · doi ↗
- 4Bosma E. Blom E. Hoekstra E. Versloot A. A longitudinal study on the gradual cognate facilitation effect in bilingual children’s Frisian receptive vocabulary International Journal of Bilingual Education and Bilingualism 201922437138510.1080/13670050.2016.1254152 · doi ↗
- 5Bosma E. Nota N. Cognate facilitation in Frisian–Dutch bilingual children’s sentence reading: An eye-tracking study Journal of Experimental Child Psychology 202018910469910.1016/j.jecp.2019.10469931568884 · doi ↗ · pubmed ↗
- 6Brysbaert M. Buchmeier M. Conrad M. Jacobs A. M. Bölte J. Böhl A. The word frequency effect: A review of recent developments and implications for the choice of frequency estimates in German Experimental Psychology 201158541242410.1027/1618-3169/a 00012321768069 · doi ↗ · pubmed ↗
- 7Conboy B. T. Mills D. L. Two languages, one developing brain: Event-related potentials to words in bilingual toddlers Developmental Science 200691 F 1F 1210.1111/j.1467-7687.2005.00453.x 16445386 · doi ↗ · pubmed ↗
- 8Costa A. Santesteban M. Caño A. On the facilitatory effects of cognate words in bilingual speech production Brain and Language 20059419410310.1016/j.bandl.2004.12.00215896387 · doi ↗ · pubmed ↗
