Which language matters? Number comparison in bilinguals | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article Which language matters? Number comparison in bilinguals Silke M Göbel, Veniamin Shiron, Miriam Tucker, Angela De Bruin This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7745359/v1 This work is licensed under a CC BY 4.0 License Status: Under Review Version 1 posted 14 You are reading this latest preprint version Abstract Participants’ language background can show subtle effects on number comparison even with Arabic digits. Number pairs can be decade-unit compatible, i.e. the decision about number size is the same for unit and decade (e.g. 24 and 57, 2 < 5 and 4 < 7), or decade-unit incompatible (e.g. 29 and 57, 2 7). Incompatible pairs often lead to longer response times. This compatibility effect is larger in languages with number word inversion (e.g. German). Our aim was to investigate the influence of language on the compatibility effect in bilinguals. In Experiment 1, we compared the compatibility effect in German monolinguals and German-English bilinguals. We found a significant compatibility effect, but no significant differences between the mono- and bilinguals. In Experiment 2, we manipulated language activation within participants by asking German-English bilinguals to describe scenes either in English or German before completing number comparison trials. There was a significant cost of switching languages, showing that participants did activate German and English. However, the size of the compatibility effect was not significantly influenced by participants activating German or English. Overall, we did not find an influence of a bilingual’s language on number comparison with Arabic digits. [199] Humanities/Language and linguistics Social science/Language and linguistics Biological sciences/Neuroscience Biological sciences/Psychology Social science/Psychology Unit-decade compatibility effect number word inversion language switching L2 bilingualism Figures Figure 1 Figure 2 Figure 3 Introduction Numbers are ubiquitous in modern life. We encounter exact numbers in different formats, most frequently as spoken or written number words (‘twenty-four’) or Arabic digits (24). While number words are language-specific 1 , Arabic digits are often seen as an abstract, pure and non-linguistic number format. However, even basic number processing of Arabic digits is influenced by the language background of the person processing them 2 , 3 . Strong support for an influence of language background on number processing comes from developmental research 4 – 9 . For example, whether a language has a clear singular/plural distinction (one frog, two frog s , three frog s , four frog s ...) might influence the acquisition of the meaning of the number word ‘one’ 10 . English-speaking children typically learn the meaning of ‘one’ around 24 to 26 months 10 , 11 . But in languages with less consistent or without obligatory associations of number words with singular/plural nouns, such as Japanese, children learn the meaning of the number word ‘one’ several months later 10 , 12 , 13 than in English. Moving on to larger number words, number word construction is very regular in some languages, such as Japanese and Mandarin. Once children have learnt the number words one to ten in these languages, they can combine these ten elements to create all number words up to 99. Several studies investigated the acquisition of teen numbers (11–19) in different languages and reported a delay in number word acquisition for numbers larger than ten for languages for which number words are less regular 14 – 17 . One’s language acquired from birth does not only influence number word acquisition and numerical and arithmetic development 4 , 18 – 21 , it also affects number processing of Arabic digits in adults 2 , 22 , 23 . One effect that has often been used to investigate language influences on number processing in adults is the unit-decade compatibility effect 24 . This effect is observed in multi-digit number comparison, when participants are asked to choose, for example, the numerically larger number out of two-digit number pairs in Arabic format. A number pair is considered unit-decade compatible when comparing tens and units leads to similar response biases (e.g., 42_57: 4 < 5 and 2 < 7). In turn, a number pair is unit-decade incompatible when the respective comparisons lead to opposite response biases (e.g., 47_62: 4 2). Compatible number pairs are usually responded to faster and with fewer errors than incompatible number pairs 24 – 26 . Several studies have shown that the size of the compatibility effect is influenced by which language people speak 2 , 27 , 28 . Comparing for example the compatibility effect in a German- and an English-speaking sample 27 , a more pronounced effect was found in the German-speaking group. The authors explain this finding by differences in the verbal number word systems. While the English number word system is fairly consistent for numbers larger than 20, German two-digit number words are inverted with respect to the Arabic digit notation (42 is spoken as ‘zweiundvierzig’, literally translated ‘two-and-forty’). That means that in German – in contrast to the Arabic notation - the first named digit for double-digit numbers is the unit digit. Clearly, mapping between number words and Arabic digits is harder in languages with number word inversion 23 . Although the difference between groups was small, a larger compatibility effect in German than English speakers 27 demonstrates that even adults activate number words during processing of numbers in Arabic format. Most of the previous research in this area has focused on different language groups. However, over half of the world’s population is bi- or multilingual 29 and many of these bilinguals need to alternate between different number-word systems in their everyday communications. The two languages of a bilingual influence each other and compete with each other for selection (e.g., 30 ), but knowledge about how a bilingual’s languages influence number processing in adults is still limited. For exact arithmetic, language of instruction/training seems to matter for bilinguals 31 . For example, bilinguals trained in exact arithmetic were faster at testing when they were tested in the language they were originally trained in than when tested in their other language 32 (see also 33 ). In terms of more general language effects (beyond the language of training and testing), a meta-analysis 34 found that bilinguals were faster and more accurate in number naming and arithmetic in their first (L1) than second (L2) language. Schiltz et al. (2024) 3 in a recent review of the influence of multilingualism on numerical and mathematical competencies also conclude that performance in basic numerical skills is better in L1 (and/or in the language of mathematics learning) than in L2. These findings are in line with a prominent model of number processing, the Triple Code Model 35 . This model proposes the existence of three independent but interconnected numerical codes: a language-dependent verbal code for spoken and written number words, an analog magnitude code representing numerical magnitude approximately and a visual code for numerical symbols such as Arabic digits. The verbal code is proposed to be essential for retrieving arithmetic facts, and the language-independent analog magnitude code for number comparison and estimation. When numbers are presented visually as Arabic digits, they are first processed by the visual code. Then there are two processing routes, a direct asemantic route between the visual and the approximate code for number comparison and an indirect route which goes from the visual code via the verbal code to activate the approximate code. The influence of language during number comparison of Arabic digits along with other findings 27 , 28 suggests that both routes are activated in parallel when processing Arabic digits. Lachelin et al. (2024) 36 have recently suggested a bilingual Triple code model (BTCM). In this model, bilinguals have two language-specific codes for number processing (see Fig. 1 ), a verbal code in L1 and a verbal code in L2, which are connected and can influence each other. The presence of two language-specific nodes raises the question which language (L1 or L2) bilinguals activate most during number processing of Arabic digits. To our knowledge, only two previous studies investigated the compatibility effect with Arabic digits in bilinguals. Van Rinsveld et al. (2016) 37 investigated the compatibility effect in German or French monolinguals and German-French bilinguals and found that the German-French bilinguals exhibited the same pattern of results as the German monolinguals. However, their bilingual group was quite different from the two monolingual groups. The bilingual participants’ L1 was Luxembourgish (which developed from a dialectal variant of German) and they also spoke German and French, which are official primary school languages for numeracy and literacy in Luxembourg. German monolinguals were University students at the University of Graz in Austria and French monolinguals were from Brussels in Belgium. Macizo et al. (2011) 38 tested whether the level of L2 proficiency affected the size of the compatibility effect. Two groups of German(L1)-English(L2) bilinguals performed a double-digit number comparison task with Arabic digits, L1 and L2 number words. Participants were equated in terms of their proficiency in German (L1), but were either more or less proficient in English (L2; based on self reports only). The size of the compatibility effect for Arabic digits was not affected by L2 proficiency, while proficiency affected the size of the compatibility effect with number words. While both studies provide evidence for little influence of L2 on the compatibility effect, this could be due to the specific language background of their participants. Macizo et al. focused on proficiency assessed through self reports, which are not always accurate reflections of objective proficiency 39 and the German monolingual and bilingual groups in Van Rinsveld et al.’s (2016) 37 study had some significant differences in terms of first language, language of instruction, and language environment (e.g., early exposure to German in the bilingual group growing up in a very multilingual environment). Current Study The aim of our study was to investigate the effect of a bilingual’s second language (compared to functional monolinguals, Experiment 1; compared to the bilingual’s first language, Experiment 2) on the compatibility effect. We examined this by comparing German-functional monolinguals and German-English bilinguals in Experiment 1 and by manipulating language activation within German-English bilinguals in Experiment 2. Larger compatibility effects have been reported in participants with an L1 with number word inversion. Across the two experiments, we therefore examined the effect on the compatibility effect of an L2 without number word inversion (English) relative to an L1 with number word inversion (German). In both studies we expected to find an overall significant unit-decade compatibility effect (significantly longer response times for incompatible than compatible number pairs). For Experiment 1, based on previous research showing a larger compatibility effect in German- than English-speaking participants 2 , 27 , 28 , we predicted a reduced compatibility effect in our German-English bilingual speakers compared to the German monolinguals. For Experiment 2, we expected a larger compatibility effect for number comparison trials following activation of German (elicited through a preceding scene description trial) than following activation of English.[1] Experiment 1 Methods Participants Experiment 1 was completed by 31 German functional monolinguals and 30 German-English bilinguals. While we will refer to the German “functional monolingual” group as “monolingual”, these participants did have some L2-English knowledge. However, they were functional monolinguals in the sense of predominantly or only using their L1-German in their daily lives. The two groups did not differ significantly in terms of age (monolingual M age = 29.1 years old, SD = 6.3; bilingual M age = 32.0 years old, SD = 6.8; t (59) = 1.732, p = .089). The monolingual group included fewer female participants than the bilingual group (monolingual 13 female, 18 male; 23 right-handed; bilingual 21 female, 8 male, 1 transgender; 25 right-handed). Participant groups did not differ significantly in terms of mean number of years of education (monolingual M = 14.8 years, SD = 4.5; bilingual M = 16.5, SD = 3.7; t (59) = 1.593, p = .116); but bilinguals overall had a higher level of completed education than the monolingual group (monolingual: 17 secondary school/A-levels/vocational training; 10 undergraduate degree; 3 postgraduate degree; 1 doctorate; bilingual: 9 secondary school/A-levels; 6 undergraduate degree; 9 postgraduate degree; 6 doctorate; Χ 2 (3) = 10.019, p = .018). All participants provided informed consent at the start of the study and ethics approval was granted by the Ethics Committee in the Department of Psychology at the University of York. The study was pre-registered and was conducted following the Declaration of Helsinki. Before completing the main study, participants completed a pre-screening. This was completed by 71 potential monolingual and 62 potential bilingual participants. We only invited participants to take part in the main study if they met the following background criteria. All participants had to have no (self-reported) neurological, language, or reading difficulties, or vision or hearing impairments. All participants had to have acquired German from birth, with no other languages acquired from birth and spoken fluently (apart from English in the bilingual group). All participants also had to score at least 60% on the German LexTALE, a short lexical decision task included to measure vocabulary knowledge. In this task, participants had to indicate if letter strings formed an existing word or not 40 . LexTALE scores were calculated following the recommended scoring 40 : ((number of words correct/total words * 100) + (number of nonwords correct/total nonwords *100)) / 2. For the functional monolinguals, our additional language requirements were as follows: no English acquisition from birth and weekly use of English being reported as “sometimes” or lower. All included participants also reported having learnt numbers during childhood in German and not having used numbers in school and/or university, or in their current lives, in a language other than German. All monolingual participants were born in Germany or Austria and were currently living there. Given that English is taught in schools in Germany and Austria, we could not recruit people without any knowledge of English. We therefore focused on “functional monolinguals” who do not use English more than sometimes in their daily lives. These participants did have some L2-English knowledge (see Table 1). The majority of participants reported a self-rated proficiency below 8/10 for English speaking and understanding. Although our pre-registered cut-off was < 8/10, we also included three participants who self-rated their English understanding (and in one case English speaking) as 8/10. This was done to not reduce our sample size further. Of the 71 monolingual participants who completed the pre-screening, 37 were invited for and also completed the main study. Of those, one participant was excluded afterwards as they reported speaking another language well and at home from childhood, one participant because they reported frequent use of English, and four participants because they completed the pre-screening in the same session as the main task but did not actually meet the requirements due to using English too frequently and having a self-rated English proficiency of 9/10. This left us with 31 monolingual participants in the analysis. For the bilinguals, our pre-registered inclusion criteria were as follows: did not use another language than German or English at home or school during childhood; self-rated English proficiency at least 8/10 for speaking and understanding; uses English at least “frequently” on a weekly basis; current language use score on the Language and Social Background Questionnaire (LSBQ) 41 of 2.5 or higher (1 = all German use; 5 = all English use); and no indication that another language than German or English was used exclusively in any of the contexts assessed. To examine this, we used a modified version of the LSBQ, asking about participants’ language use in different contexts, activities, and with various conversation partners. All participants also scored at least 60% on the LexTALE in both English and German. We had an additional pre-registered point stating we would exclude participants who reported their language use during childhood to be above 2.5 on the LSBQ (where higher scores reflect more English use). Two participants scored slightly above this but were included because this only reflected some English education as a teenager. As can be seen in Table 1, bilingual participants had a high proficiency in and use of both languages. They also reported occasional to regular language switching (on a scale from 1 = never to 10 = very frequently) on a daily basis ( M = 6.3, SD = 2.7), within a conversation ( M = 4.8, SD = 2.4), and within a sentence ( M = 4.3, SD = 2.7). In terms of proficiency (see Table 1), participants in the bilingual group showed significantly higher self-rated English proficiency than the monolingual group (all p s .15), with the exception of self-rated German speaking proficiency being slightly lower in the bilingual group ( p = .031). All bilingual participants learnt numbers in German during childhood and, where relevant, used German and/or English for numbers at university and when using numbers in their daily lives, with no use of other languages. In the final questionnaire, we only asked participants to choose one language that they predominantly used for number tasks (e.g., subtractions) and remembering numbers. We did not examine how frequently participants encountered numbers in English or in German and could therefore not assess our pre-registered point of encountering English numbers at least 25% of the time. While all participants were born in Germany, they were currently living in an English-dominant country (e.g., UK or US), making it very likely that they had regular exposure to numbers in English. Participants were also asked questions about their experience with numbers and mathematics, as reported in Table 2. The two participant groups were comparable in terms of the number of hours they had received mathematics teaching in school, the number of people studying mathematics at university level, and the number of people working with numbers in their jobs. Both groups also learnt mathematics in German in school and were taught numbers in German as their first language. The two groups differed, as intended, in their current use of English for talking about mathematics in university, their main language for using numbers for daily activities, and remembering number sequences. These tasks were all done in German in the monolingual group while within the bilingual group, approximately half of the participants reported English to be their main language for number related activities. Our pre-registered plan was to also include a comparison group of German-Dutch bilinguals. However, recruitment was very difficult and only ten participants started the study and did not always meet our inclusion criteria. We therefore decided to only include German functional monolinguals and German-English bilinguals in this experiment, which still allowed us to answer our main research question regarding the influence of English in bilinguals. Our sample sizes for these two groups were slightly lower than the initial pre-registered aim of 40 participants per group. However, in that pre-registration, we also included an optional stopping rule of 30 participants if recruitment turned out difficult, which we ended up having to follow. Our power analysis was based on previous research comparing the compatibility effect between German- and English-speaking adults showing a partial eta squared effect size of .11 28 . Using G*Power, we conducted a power analysis with a slightly larger effect size (.15-.20). This resulted in 95% power with 34 participants per group for an effect size of .15, with the initially planned three-group comparison. While our sample size was slightly lower than planned, we included a large number of trials per condition (120 compatible and 120 incompatible trials per participant), which increases power. Design The number-comparison task had two independent variables: Type (compatible, e.g., 46 − 21, or incompatible, e.g., 81 − 32; within-subject) and Group (monolingual or bilingual; between-subject). Additional filler items (20%, within-decade comparisons, e.g., 68 − 61) were included in the task but not in the analysis. Our dependent variable was reaction times (RTs). Procedure Participants were recruited through Prolific.co and tested using Gorilla.sc 42 . All instructions throughout the study were presented in German. Most participants (all bilinguals and all apart from 8 monolinguals) first included a pre-screening, which included our questionnaire and the LexTALE(s). Only participants meeting the inclusion criteria (see “Participants”) were invited to also take part in the main study. For a few monolinguals, we combined the pre-screening and main study to facilitate recruitment. In the main study, participants were asked to complete a two-digit number magnitude comparison task. The design of our number comparison task closely followed the design of the number comparison task used in study 1 of Bahnmueller et al. (2019) 28 . In each trial, two numbers were presented simultaneously and above each other in the center of the screen. The stimulus set consisted of 300 two-digit Arabic number pairs between 21 and 98, with 120 unit-decade compatible (e.g., 43_58), 120 unit-decade incompatible (e.g., 48_53; taken from 24 ), and 60 within-decade filler pairs (e.g., 51_56, 20% of all trials). Overall distance, decade distance, unit distance, and problem size (i.e., the sum of the two to-be-compared numbers) were matched between the unit-decade compatible and unit-decade incompatible stimulus groups. Participants had to choose the larger of two numbers presented. They were instructed to respond as quickly and accurately as possible with the right index finger (pressing the U key) in case the upper number was larger, and with the left index finger (pressing the N key) when the bottom number was larger. For half of the pairs, the upper number was larger. In the other half, the lower number was larger. Participants had to press the same button not more than twice in a row. Stimuli were presented in white (font: Courier New, bold; font size: 24) against a black background until a response was given. Preceding each trial, a fixation cross was presented in the middle of the screen for 500 ms. Trials were separated by an interstimulus interval of 500 ms. If no response was given, the next trial was started automatically after three seconds. The experiment started with 12 practice trials with feedback, followed by five experimental blocks of 60 number pairs each without feedback. Data analysis The task data are available at https://osf.io/gjnwt/. The RT data were analysed using ANOVAs and linear mixed-effects (LME) analyses (using R version 4.4.1, lme4 version 1.1–36 and lmerTest version 3.1-3). Accuracy was high ( M = 97.2%, SD = 2.4 on compatible and incompatible trials combined; all participants scored 90% or higher) and therefore not analysed further. Prior to RT analysis, we removed inaccurate responses and RTs > 3 SD above or below the mean per participant and condition (0.8% of trials) 43 . No RTs were below 200ms. Visual inspection of the RT data revealed a non-normal distribution and RTs were therefore log-transformed. The analysis only included compatible and incompatible trials, fillers (within-decade comparisons) were not included. The LME analysis included Type (compatible coded as -0.5 and incompatible as 0.5) and Group (monolingual coded as -0.5 and bilingual coded as 0.5) as fixed effects. Participant and item (number pair shown on the screen) were included as random effects, with the corresponding within-participant and within-item slopes. The full model converged. We also ran the corresponding ANOVA, including a Bayesian ANOVA to quantify evidence for the null versus the alternative hypothesis. Here, we compared a null model including main effects of Type (compatibility effect) and Group to a model including an interaction between Type and Group to assess evidence for/against a difference in compatibility effect between bilinguals and monolinguals. The Bayes Factor (BF) is reported in the form BF 01 (evidence for the null over the alternative hypothesis), with values above 1 supporting the null hypothesis. Note that while we pre-registered the use of z-scored RTs in the ANOVA, we used log-transformations instead for consistency with the LME analysis. Z-scoring of RTs was furthermore not needed given that no overall RT differences were observed between the two groups. Results and discussion The LME analysis showed a significant effect of Type (β = 0.027, SE = 0.013, t = 2.088, p = .038), showing that there was a compatibility effect with faster responses to compatible number pairs ( M = 748ms, SD = 125) than to incompatible pairs ( M = 767ms, SD = 123), see Fig. 2. There was no main effect of Group (β = 0.045, SE = 0.041, t = 1.094, p = .279), suggesting bilinguals and monolinguals responded at comparable speed overall (Monolingual M = 741ms, SD = 121; Bilingual M = 774ms, SD = 126). The number compatibility effect was numerically a little smaller for bilinguals ( M = 14ms, SD = 18) than for monolinguals ( M = 24ms, SD = 32) but this did not reach significance (β = -0.014, SE = 0.008, t = -1.843, p = .070). The ANOVA showed the same patterns, with a significant main effect of Type ( F (1,59) = 49.220, p < .001, ηp² = 0.455), no significant effect of Group ( F (1,59) = 1.215, p = .275, ηp² = 0.020), and no significant interaction between Group and Type ( F (1,59) = 3.390, p = .070, ηp² = 0.054). The Bayesian analysis provided no support for the presence of a compatibility-effect difference between the two groups (BF 01 = 1.72, error% = 2.57). All instructions were provided in German, thereby creating a German context throughout the study for all participants. Within the bilingual group, however, some participants reported predominantly using English for daily-life number processing (for tasks such as subtractions or for remembering numbers), while others reported using German, or did not report a consistent preference across different types of number activities. An exploratory analysis comparing 12 participants reporting daily-life number processing in English to 10 participants reporting doing such processing in German showed a slightly smaller number compatibility-effect cost in the task for participants more likely to use English to process numbers ( M cost = 8ms, SD = 21) than those more likely to use German ( M cost = 20ms, SD = 12; t (20) = 1.449, p = .163; d = 0.621). Experiment 1 showed no significant difference between the bilingual and monolingual groups. We attempted to make our functional monolingual group as monolingual as possible, by only including participants who predominantly use German in their daily lives, including for number-related tasks. In contrast, our bilingual participants were living in an English-dominant society and reported frequent use of English for daily-life interactions. However, if any potential effects of language use are small, they might be less likely to arise in these group comparisons, also considering the relatively small size of the number-compatibility effect. The groups tested differed in some ways, including their highest level of education completed. This could have influenced number processing and could have masked any language-related effects, if they exist. Furthermore, our monolingual participants were functional monolinguals who only used their L1, but still had some proficiency in their L2-English. This was inevitable given the presence of English in Germany and Austria, and the compulsory English classes children receive at school. While it is unlikely these participants use English for number processing (given that they learnt to do this in German and only rarely use English in general), we cannot rule out some influence of L2-English processing in our task. Finally, group comparisons can be influenced by individual differences within the bilingual group. Indeed, exploratory analyses suggested bilingual participants varied in their daily-life use of English or German for number processing, which may impact which language they used during the task. Furthermore, even though all task instructions were provided in German only, with this being an online study, we could not control the language environment people completed the study in. Therefore, it is likely that some bilinguals completed the task in an English environment and some in a more German environment, thereby further increasing the difficulty of seeing any group-level effects. Experiment 2 In Experiment 2, we therefore examined the potential influence of language by manipulating language activation within participants , rather than relying on group comparisons. This avoided the need for monolingual participants and issues associated with potential group differences beyond people’s language background. By manipulating language within the task itself we also aimed to minimise the influence of unknown individual differences in language environment while doing the study. We therefore asked participants to describe a simple scene in English or in German (each language used for half of the trials, in response to a language cue) before each number trial within the task. The use of English or German while describing the scene was anticipated to increase the activation of that language immediately prior to making the number-compatibility decision. Following Grosjean’s Language Mode (1985) 44 , language activation falls on a continuum from predominant activation of one language to activation of multiple languages, depending on the context, conversation partners, and general environment. This shows that language is flexible and can change quickly in response to task context and demands. For example, a large number of language switching studies have used designs similar to ours, asking participants to name pictures or describe scenes in response to a language cue. These studies typically show switching costs, with participants requiring more time to name a picture when the previous picture required use of another language, compared to when the same language can be used on two consecutive pictures (e.g., 45 ; see 46 , for a review). These costs are usually interpreted as (at least partly) reflecting ongoing competition between languages, with switch trials requiring bilinguals to rapidly re-balance the activation levels of each language. Further research has also shown that short exposure to a language context (either linguistic or non-linguistic), manipulated on a trial-by-trial basis, can indeed influence activation of the corresponding language. For example, in a simple picture naming task, de Bruin and Martin (2022) 47 showed Basque-Spanish participants linguistic (short sentences) and non-linguistic (flags associated with a language/culture) cues. Participants could name each picture in their language of choice and were free to ignore the “cue”. Nevertheless, participants were more likely to name the picture in the language matching the preceding cue, and were faster naming a picture when following the language indicated by that context, showing fast re-balancing of language activation and use. Similarly, trial-by-trial manipulations of language context have also been shown to influence other cognitive processes. For example, interleaving a flanker task with words presented in one or two languages showed an influence of this briefly presented language context on interference suppression (e.g., 48 49 ). This suggests brief exposure to language contexts varying in the activation levels of one or two languages can influence performance on other cognitive tasks. Indeed, preceding language context has also been shown to influence arithmetic problem solving. Van Rinsveld et al. (2016) 50 presented multilinguals with arithmetic problems presented without context, or with each number trial preceded by a semantic judgement task that had to be completed in German or French (depending on the language of the session). Complex additions were completed faster in a language context than without context, when the task was completed in French (the participants’ less dominant language). This suggests that manipulating language activation levels throughout the task can indeed influence number processing. In Experiment 2, we therefore expected a larger compatibility effect for number comparison trials following activation of German in the preceding scene description than following activation of English. Methods Participants Experiment 2 was completed by 39 German-English bilinguals ( M age = 29.4 years old, SD = 6.8; 21 female, 18 male; 35 right-handed). The mean number of years of education was 17.0 ( SD = 3.5), with 8 participants completing secondary school/A-levels or lower; 17 an undergraduate degree; 11 a postgraduate degree; and 3 a doctorate. All participants provided informed consent at the start of the study and ethics approval was granted by the Ethics Committee in the Department of Psychology at the University of York. The study was pre-registered and was conducted following the Declaration of Helsinki. Before completing the main study, participants completed a pre-screening. This was completed by 118 potential participants. We only invited participants to take part in the main study if they met the criteria described below. Firstly, invited participants had no neurological, language, or reading difficulties, vision or hearing impairments, or colour blindness. One participant was invited despite responding “yes” to one of these questions and was excluded from analysis (thereby leading to a final sample size of 39 rather than 40). All included participants acquired German from birth and reported not being fluent in any other languages apart from German or English, or the use of any other languages during childhood or current life (i.e., none of the participants reported a language other than German or English in any of the LSBQ questions). In our pre-registration, we pre-registered strict inclusion criteria regarding the use of English, including the requirement for participants to be living in an English-speaking country, for them to not select an option lower than “frequently” for weekly English use, or to have an average score below 2.5 across the LSBQ. However, while these requirements were necessary for Experiment 1, they were deemed less essential for Experiment 2, where language was manipulated within participants through the experimental manipulation. We therefore also included participants who used English less frequently than pre-registered. Similar to Experiment 1, we also included two participants with slightly more frequent English use during childhood, as this reflected English use as a teenager. As shown in Table 3, participants had a high proficiency in both languages, as measured through both self-reports and the LexTALE vocabulary scores. All participants, as pre-registered, self-reported their German proficiency as at least 8/10 and their English proficiency as at least 6/10, and had a LexTALE score above 60% in both languages. Participants predominantly used German during their childhood, but currently used English more regularly (all participants reported at least using English sometimes). Participants on average also reported occasional to regular language switching (on a scale from 1 = never to 7 = very frequently; note that this scale differs from Experiment 1) on a daily basis ( M = 4.7, SD = 1.8), within a conversation ( M = 3.4, SD = 1.7), and within a sentence ( M = 3.0, SD = 1.9). All participants learnt mathematics in German during elementary school and (with the exception of one, who reported using English) during secondary school, and all first learnt numbers in German. Where participants discussed mathematics related topics at University, 26 did so in German and 5 in English. At elementary school, most participants reported studying maths for 2–3 hours (16) or 3 + hours (19; 4 participants reported 1–2 hours). In secondary school, most participants (29) reported 3 + hours of maths education (9 2–3 hours; 1 1–2 hours). Twenty-one participants reported studying additional subjects related to mathematics (e.g., statistics) at school; 19 reported studying a maths-related degree at university; and 24 reported using numbers in their jobs. In terms of current use, 35 participants reported using German for tasks such as subtractions (3 reported English and 1 both); 37 participants reported using German for remembering long strings of digits such as phone numbers (2 reported using English). Apart from one person residing in the UK, all participants were living in Germany or Austria. No participants reported learning mathematics, being taught in, or working with numbers in a language other than English or German. Design The design was similar to Experiment 1, with one change: Language was manipulated within participants (the scene before each number trial was described in German or in English), rather than manipulated at the group level between participants. Procedure and Materials Experiment 2 was run in the same way as Experiment 1, with participants completing a pre-screening questionnaire and the LexTALE before being invited to complete the main task. The main task was the same number magnitude comparison task as in Experiment 1. However, this time, participants were asked to describe a simple scene in German or in English prior to each number magnitude comparison. Half of the scenes had to be described in German and half in English. Participants completed 48 compatible number comparisons (24 preceded by each language); 48 incompatible comparisons (24 preceded by each language); and 24 filler trials (12 per language). Half of the scene descriptions were switch trials (different language had to be used than in the previous scene) and half were non-switch trials (same language had to be used as in the previous scene). These were distributed equally across the compatible and incompatible number trials. Each scene was named five times by each participant, twice preceding a compatible and twice preceding an incompatible trial, and once preceding a filler trial. Two lists were used to counterbalance the number of times each scene was described in German or English (two or three times per language) across participants. Each number pair was presented twice in the main task. With the exception of four pairs, each number pair was preceded once by English and once by a German scene description. For the scene description task, we selected 25 scenes from previous papers 51,52 . Scenes were selected that could be easily described in one sentence, in German or English. Due to the lexical similarities between the two languages, cognates (words overlapping in form and meaning) could not be avoided. Prior to starting the task, participants saw an example of a scene and an example of a number comparison trial. Participants were instructed to describe the scenes in a simple way, using a subject-verb-object order to ensure consistency between languages. They were then asked to describe three practice scenes. Following this, they completed three practice number comparison trials, during which they received feedback on their accuracy. Finally, participants completed three practice trials during which they alternated between the scene description and the number comparison trials, always starting with the scene description as in the main task. The main task consisted of six blocks, separated by breaks. The first trial after a break was always a filler trial. On each trial, the language cue (German or British flag) was first shown in the centre of the screen for 500ms. This was followed by the presentation of the scene. Participants were given five seconds to describe the scene, after which the trial automatically continued. After showing a blank (black) screen for 200ms, the number comparison was presented. This stayed on the screen until a response was given, or for 3000ms if no response was given. Data analysis The task data are available at https://osf.io/gjnwt/. As first checks, we examined accuracy in the number task and in the scene description task. Accuracy in the main number task was high (above 91% for all participants, M = 97.0%, SD = 2.4; across compatible and incompatible trials). Similarly, accuracy in the scene description task across all number trials was high in both L1-German ( M = 96.6%, SD = 3.5; all participants over 86% correct) and L2-English ( M = 97.3%, SD = 3.3; all participants over 86% correct; recordings were not audible and therefore not scored for three participants, who were still included in the main analysis). These accuracy scores show that participants indeed paid attention to the task and activated the intended language on each trial. We also computed participants’ naming onset times for each scene (i.e., the delay between the start of scene presentation and the start of naming, using CheckVocal 53 ). We then computed the naming onset time difference between switch trials (trial requiring naming in a different language as the previous trial) and non-switch trials (trial requiring naming in the same language as the previous trial). We did this for English and German separately (e.g., the English switch cost would be the naming time difference between a switch from German to English, and naming two trials in a row in English). As expected, a switch cost was observed in both languages ( M English switch cost = 43ms, SD = 150ms, M German switch cost = 41ms, SD = 116; including trials included in the main number analysis), suggesting the languages were activated as intended during the scene-description trials and participants had to re-balance language activation during the scene descriptions when switching to a new language. The main-task analyses were conducted similarly to Experiment 1. 0.5% of correct trials were removed as RT outliers. The RT analysis only included correctly answered number trials that were preceded by a correctly described scene. Type (compatible = -0.5; incompatible = 0.5) and Language (German = -0.5; English = 0.5) were included as fixed effects. Participant, scene picture name, and number item (number pair shown on screen) were included as random effects. Slopes were included for Language and Type for participant and scene picture-name random effects. However, given that a few items did not occur across both languages, no random slopes were included for the number item random effect. The initial model did not converge. We therefore first removed correlations between slopes and intercepts and then (following the pre-registration), the scene picture-name slope explaining the lowest amount of variance. The final model included all random-effect intercepts, all slopes for the participant random effect, and the trial type + trial type x language slope for the picture-naming random effect. Results and discussion The LME analysis showed that the Type effect did not reach significance (β = 0.047, SE = 0.030, t = 1.551, p = .128), although the numerical pattern went in the expected direction with somewhat faster responses to compatible number pairs ( M = 766ms, SD = 126) than to incompatible pairs ( M = 799ms, SD = 126), see Fig. 3. There was no main effect of Language (β = 0.012, SE = 0.008, t = 1.511, p = .139), suggesting naming the scene in L1-German or L2-English did not influence RTs on the number-comparison task (after L1-German naming: M = 777ms, SD = 125; after L2-English naming: M = 788ms, SD = 129). The number compatibility effect was numerically a little smaller for after naming in the L1-German ( M = 26ms, SD = 48) than after L2-English naming ( M = 41ms, SD = 46) but this did not reach significance (β = 0.017, SE = 0.014, t = 1.287, p = .211). If anything, the observed pattern went in the opposite direction than expected (i.e., a slightly smaller compatibility effect after German use, than after English use). The ANOVA showed the same patterns, but the main effect of Trial Type did reach significance ( F (1,38) = 47.710, p < .001, ηp² = 0.557). There was no significant effect of Language ( F (1,38) = 2.555, p = .118, ηp² = 0.063), and no significant interaction between Language and Type ( F (1,38) = 1.978, p = .168, ηp² = 0.049). The Bayesian analysis provided some (but weak) evidence for the null hypothesis (i.e., it did not support the presence of a language effect on the number-compatibility effect, BF 01 = 2.45, error% = 1.03). One further exploratory analysis was conducted (using LME analyses). If language activation takes time to build up or strengthens with time, we might see a larger effect of naming language on the number task when the same language has been used for at least two scenes. We therefore ran an additional analysis including language trial type (switch/non-switch, with non-switch trials reflecting that the scene was named in the same language as the previous scene). No significant effects of or interactions with language trial type were found in terms of number-task RTs (all p s > .25). This suggests any influence of language on the number task did not increase when the same language was used multiple times in a row and potentially resulted in stronger activation of that language. General discussion In this paper we were interested in the effect a bilingual’s two languages can have on their processing of (non-linguistic) Arabic digits. We tested L1-German speakers, because the compatibility effect in number comparison of Arabic double-digit numbers is significantly larger in participants with a language background that has inverted number words (e.g. German) than in participants whose L1 has (mostly) non-inverted number words (e.g. English). For the same reason, we chose bilingual participants with English as L2. In Experiment 1, both German-monolinguals and German-English bilinguals showed a significant compatibility effect with longer response times to unit-decade incompatible pairs of Arabic digits. While the compatibility effect was numerically smaller in the bilingual group, overall, contrary to our prediction, there was no significant difference in the size of the compatibility effect between the two groups (with some support from the Bayesian analysis for the null). Between-group comparisons can be hampered by group differences beyond the main factor of interest (here L2). Thus, Experiment 2 employed a within-participant manipulation to avoid potential group differences. The same number comparison task was used with German-English bilinguals, but now participants described scenes in either English or German before each number-comparison trial. While participants, as intended, activated the required language on each trial (and showed a switching cost reflecting re-balancing that language activation), there was no significant difference (with some support from the Bayesian analysis) in the size of the compatibility effect when participants had just activated German versus when they had just activated English. Taken together, our findings suggest that L2-English use in German-English bilinguals has little influence on number comparison with Arabic digits. Previous evidence showed that the size of the compatibility effect depends on the L1 spoken 27 , 28 . This strongly suggests that number comparison of double-digit Arabic numbers often activates the indirect processing path, with the verbal code used depending on the languages spoken. The Bilingual Triple Code Model (BTCM 36 ) proposed that in bilinguals, a second indirect path exists in the form of a verbal L2 code. Our results provide no direct evidence for the activation of a second indirect, verbal L2 code, path between the visual and approximate number codes. There are various reasons that can explain these findings. First, findings from number word priming in bilinguals (e.g. 36,54 ) suggest the verbal L2 code pathway might be weaker (or even absent) than the proposed L1 code pathway in the BTCM. For example, Lachelin et al. (2024) 36 showed the priming distance effect in proficient German-French bilinguals was weaker in French (L2) than in German (L1), thus emphasizing a stronger reliance on the verbal L1 code even when L2 number words were used as primes. Their bilingual participants grew up speaking Luxembourgish at home (Home language), learnt German (L1 in their study) early and had numeracy instruction in German at school from grade 1–6 and then in French from grade 7 onwards. Lachelin et al. therefore suggested that the earlier L1 than L2 age of acquisition, as well as the closer language proximity between L1-German and their Home language might have resulted in a weaker activation of the verbal L2 code, compared to the L1. Other studies also report language of schooling as a strong predictor of Arabic digit naming (e.g. 55 ). In line with this, our bilingual participants all had an earlier age of acquisition for German (L1) than English (L2), acquired number words first in German (L1) and had most of their schooling in German (L1). It is therefore possible that for these bilinguals, the verbal L1 code was so strongly established from a young age that all number processing continues to go through the verbal L1 code (rather than, or to a much lower extent, the L2 code). Further research is therefore needed to better understand the potential role of the language of schooling, in bilinguals living in bilingual societies as well as in bilinguals who grew up in a different language environment than the one they are currently living in (as was the case for many of our bilinguals). Further research will also need to further study the role of current daily-life language use in relation to number processing. While our exploratory analyses in Experiment 1 did not show a significant relationship between the compatibility effect and the participant’s daily-life language use for number processing (potentially due to the small sample size and the lack of a detailed questionnaire about current language use for number processing), the numerical pattern suggested further research is warranted to better understand a possible effect of daily-life language use for number processing on the activation of an indirect route through the L1 or L2. Participants in Experiment 1 might have differed in the extent to which they activated the indirect route and possibly the verbal L2 code. We therefore manipulated L1 or L2 activation directly in Experiment 2. Despite the language manipulation “working” as intended (i.e., high accuracy and switching costs), there was no evidence for the activation of the second indirect path between the visual and approximate number codes via the verbal L2 code during number comparison, even when participants had just described a visual scene in L2. This argues further for a preference of L1 for number processing in our bilinguals, which might have been so strong that this was not overruled by a temporary boost of L2. A second explanation, specifically concerning Experiment 2, is that perhaps a longer L2 boost is needed for the indirect path through the verbal L2 code to be used. In our Experiment, language activation was manipulated on a trial-by-trial level. While this has successfully been shown to influence language-related mechanisms in previous literature (e.g. 47 ), such short-lived activation change might not be sufficient to influence other cognitive processes. Indeed, Van Rinsveld and colleagues (2016) 50 showed an impact of language context on arithmetic tasks but manipulated language through longer contextual blocks. While exploratory analyses suggested additional time of using the L2 (through multiple consecutive scene descriptions in that language) did not increase its influence on the number processing task, it remains possible that such L2 effects only arise after longer (e.g., at least 5–10 minutes) blocks of L2 use. Finally, a third potential explanation for our findings could be that participants in Experiment 1 did not activate number words at all while performing the number comparison task with Arabic digits. While this would be in contrast to findings from several other studies 27 , 28 , Macizo et al. (2011) 38 compared the compatibility effect between Spanish-English bilinguals who learnt number words first in Spanish and German-English bilingual who learnt number words first in German and found no significant differences in the size of their compatibility effect for Arabic digits. If their bilinguals had activated their indirect path via the verbal L1 code, the size of the compatibility effect should have differed between these two groups. Therefore, a potential interpretation of our findings remains that participants did not use number word activation during the task. Our findings are in contrast to findings on the influence of spatial reference frames on number processing. In a previous study Moeller et al. (2015) 2 have shown that both number word inversion as well as habitual reading/writing direction (left-to-right vs right-to-left) influence the compatibility effect in symbolic number comparison. Interestingly, and in contrast to our findings, when spatial reference frames were changed in bilingual participants it influenced their number processing, e.g. 56,57 .This is in line with other findings 58 – 61 and highlights a clear difference between the influence of spatial versus linguistic context on symbolic number processing. While spatial reference frames are constructed temporarily and their influence on number representations seem to be flexible and context-dependent in bilinguals, the linguistic influence of L1, be it because of the earlier onset of acquisition or because it was the language of (mathematical) instruction, seems to have a more permanent dominant effect on symbolic number processing in bilinguals. Future studies need to test this conclusion more robustly by, for example, manipulating the extent of bilinguals’ current use of number words in L2. Declarations Funding declaration This work was supported by the Experimental Psychology Society through an EPS undergraduate research bursary for Veniamin Shiron. This research was partially supported by the Research council of Norway (project number 331640) through its centre of excellence scheme. Acknowledgements We would like to thank Katharina Sautter for help with data collection and Silvia Gennari for sharing the images we used in Experiment 2. Author contributions Silke M Göbel: Conceptualization; Funding acquisition; Methodology; Project administration; Supervision; Roles/Writing - original draft, review & editing. Veniamin Shiron: Funding acquisition; Project administration; Investigation; Data Curation; Roles/Writing - review & editing; Miriam Tucker: Roles/Writing - original draft, review & editing; Angela De Bruin: Conceptualization; Funding acquisition; Methodology; Formal analysis; Project administration; Supervision; Roles/Writing - original draft, review & editing; Data availability statement The main task data are available at https://osf.io/gjnwt/. Competing interests The author(s) declare no competing interests. References Chrisomalis, S. Reckonings: Numerals, cognition, and history. (2020). Moeller, K., Shaki, S., Göbel, S. M. & Nuerk, H.-C. Language influences number processing--a quadrilingual study. Cognition 136 , 150–155 (2015). Schiltz, C., Lachelin, R., Hilger, V. & Marinova, M. Thinking about numbers in different tongues: An overview of the influences of multilingualism on numerical and mathematical competencies. Psychol. Res. 88 , 2416–2431 (2024). Steiner, A. F. et al. Language Effects in Early Development of Number Writing and Reading. Journal of Numerical Cognition 7 , 368–387 (2021). Zuber, J., Pixner, S., Moeller, K. & Nuerk, H.-C. On the language specificity of basic number processing: transcoding in a language with inversion and its relation to working memory capacity. J. Exp. Child Psychol. 102 , 60–77 (2009). Pixner, S. et al. One language, two number-word systems and many problems: numerical cognition in the Czech language. Res. Dev. Disabil. 32 , 2683–2689 (2011). Marchand, E., Wade, S., Sullivan, J. & Barner, D. Language-specific numerical estimation in bilingual children. (2019) doi:10.31234/osf.io/afdg8. Schneider, R. M. et al. Do children use language structure to discover the recursive rules of counting? Cogn. Psychol. 117 , 101263 (2020). Slusser, E., Ribner, A. & Shusterman, A. Language counts: Early language mediates the relationship between parent education and children’s math ability. Dev. Sci. 22 , e12773 (2019). Le Corre, M., Li, P., Huang, B. H., Jia, G. & Carey, S. Numerical morphology supports early number word learning: Evidence from a comparison of young Mandarin and English learners. Cogn. Psychol. 88 , 162–186 (2016). Sarnecka, B. W. & Lee, M. D. Levels of number knowledge during early childhood. J. Exp. Child Psychol. 103 , 325–337 (2009). Barner, D., Libenson, A., Cheung, P. & Takasaki, M. Cross-linguistic relations between quantifiers and numerals in language acquisition: evidence from Japanese. J. Exp. Child Psychol. 103 , 421–440 (2009). Sarnecka, B. W., Kamenskaya, V. G., Yamana, Y., Ogura, T. & Yudovina, Y. B. From grammatical number to exact numbers: early meanings of ‘one’, ‘two’, and ‘three’ in English, Russian, and Japanese. Cogn. Psychol. 55 , 136–168 (2007). Aunio, P., Aubrey, C., Godfrey, R., Pan, Y. & Liu, Y. Children’s early numeracy in England, Finland and People’s Republic of China. Int. J. Early Years Educ. 16 , 203–221 (2008). Cankaya, O., LeFevre, J.-A. & Dunbar, K. The role of number naming systems and numeracy experiences in children’s rote counting: Evidence from Turkish and Canadian children. Learn. Individ. Differ. 32 , 238–245 (2014). Lefevre, J.-A., Clarke, T. & Stringer, A. P. Influences of Language and Parental Involvement on the Development of Counting Skills: Comparisons of French- and English-speaking Canadian Children. Early Child Dev. Care 172 , 283–300 (2002). Miller, K. F., Smith, C. M., Zhu, J. & Zhang, H. Preschool Origins of Cross-National Differences in Mathematical Competence: The Role of Number-Naming Systems. Psychol. Sci. 6 , 56–60 (1995). Göbel, S. M., Moeller, K., Pixner, S., Kaufmann, L. & Nuerk, H.-C. Language affects symbolic arithmetic in children: the case of number word inversion. J. Exp. Child Psychol. 119 , 17–25 (2014). Lê, M.-L. T. & Noël, M.-P. Transparent number-naming system gives only limited advantage for preschooler’s numerical development: Comparisons of Vietnamese and French-speaking children. PLoS One 15 , e0243472 (2020). Mark, W. & Dowker, A. Linguistic influence on mathematical development is specific rather than pervasive: revisiting the Chinese Number Advantage in Chinese and English children. Front. Psychol. 6 , 203 (2015). Clayton, F. J. et al. Two-digit number writing and arithmetic in Year 1 children: Does number word inversion matter? Cogn. Dev. 56 , 100967 (2020). Bahnmueller, J., Huber, S., Nuerk, H.-C., Göbel, S. M. & Moeller, K. Processing multi-digit numbers: a translingual eye-tracking study. Psychol. Res. 80 , 422–433 (2016). Steiner, A. F. et al. Twenty-four or four-and-twenty: Language modulates cross-modal matching for multidigit numbers in children and adults. J. Exp. Child Psychol. 202 , 104970 (2021). Nuerk, H. C., Weger, U. & Willmes, K. Decade breaks in the mental number line? Putting the tens and units back in different bins. Cognition 82 , B25–33 (2001). Macizo, P. & Herrera, A. Two-digit number comparison: Decade-unit and unit-decade produce the same compatibility effect with number words. Can. J. Exp. Psychol. 64 , 17–24 (2010). Macizo, P., Herrera, A., Paolieri, D. & Román, P. Is there cross-language modulation when bilinguals process number words? Appl. Psycholinguist. 31 , 651–669 (2010). Nuerk, H., Weger, U. & Willmes, K. Language effects in magnitude comparison: Small, but not irrelevant. Brain Lang. 92 , 262–277 (2005). Bahnmueller, J., Maier, C. A., Göbel, S. M. & Moeller, K. Direct evidence for linguistic influences in two-digit number processing. J. Exp. Psychol. Learn. Mem. Cogn. 45 , 1142–1150 (2019). Grosjean, F. Bilingual . (Harvard University Press, London, England, 2010). doi:10.4159/9780674056459. Spivey, M. J. & Marian, V. Cross talk between native and second languages: Partial activation of an irrelevant lexicon. Psychol. Sci. 10 , 281–284 (1999). Venkatraman, V., Siong, S. C., Chee, M. W. L. & Ansari, D. Effect of language switching on arithmetic: a bilingual FMRI study. J. Cogn. Neurosci. 18 , 64–74 (2006). Spelke, E. S. & Tsivkin, S. Language and number: a bilingual training study. Cognition 78 , 45–88 (2001). Salillas, E. & Wicha, N. Y. Y. Early learning shapes the memory networks for arithmetic: evidence from brain potentials in bilinguals. Psychol. Sci. 23 , 745–755 (2012). Garcia, O., Faghihi, N., Raola, A. R. & Vaid, J. Factors influencing bilinguals’ speed and accuracy of number judgments across languages: A meta-analytic review. J. Mem. Lang. 118 , 104211 (2021). Dehaene, S. Varieties of numerical abilities. Cognition 44 , 1–42 (1992). Lachelin, R., Marinova, M., Reynvoet, B. & Schiltz, C. Weaker semantic priming effects with number words in the second language of math learning. J. Exp. Psychol. Gen. 153 , 706–719 (2024). Van Rinsveld, A., Schiltz, C., Landerl, K., Brunner, M. & Ugen, S. Speaking two languages with different number naming systems: What implications for magnitude judgments in bilinguals at different stages of language acquisition? Cogn. Process. 17 , 225–241 (2016). Macizo, P., Herrera, A., Román, P. & Martín, M. C. The processing of two-digit numbers in bilinguals. Br. J. Psychol. 102 , 464–477 (2011). Tomoschuk, B., Ferreira, V. S. & Gollan, T. H. When a seven is not a seven: Self-ratings of bilingual language proficiency differ between and within language populations. Biling. (Camb. Engl.) 22 , 516–536 (2019). Lemhöfer, K. & Broersma, M. Introducing LexTALE: a quick and valid Lexical Test for Advanced Learners of English. Behav. Res. Methods 44 , 325–343 (2012). Anderson, J. A. E., Mak, L., Keyvani Chahi, A. & Bialystok, E. The language and social background questionnaire: Assessing degree of bilingualism in a diverse population. Behav. Res. Methods 50 , 250–263 (2018). Anwyl-Irvine, A. L., Massonnié, J., Flitton, A., Kirkham, N. & Evershed, J. K. Gorilla in our midst: An online behavioral experiment builder. Behav. Res. Methods 52 , 388–407 (2020). Grange, J. A. trimr: An implementation of common response time trimming methods. R package version 1.0.1. https://cran.r-project.org/web/packages/trimr/index.html (2015). Grosjean, F. The bilingual as a competent but specific speaker‐hearer. J. Multiling. Multicult. Dev. 6 , 467–477 (1985). Meuter, R. F. I. & Allport, A. Bilingual language switching in naming: Asymmetrical costs of language selection. J. Mem. Lang. 40 , 25–40 (1999). Declerck, M. & Philipp, A. M. A review of control processes and their locus in language switching. Psychon. Bull. Rev. 22 , 1630–1645 (2015). de Bruin, A. & Martin, C. D. Perro or txakur? Bilingual language choice during production is influenced by personal preferences and external primes. Cognition 222 , 104995 (2022). Wu, Y. J. & Thierry, G. Fast modulation of executive function by language context in bilinguals. J. Neurosci. 33 , 13533–13537 (2013). Jiao, L., Liu, C., de Bruin, A. & Chen, B. Effects of language context on executive control in unbalanced bilinguals: An ERPs study. Psychophysiology 57 , e13653 (2020). Van Rinsveld, A., Schiltz, C., Brunner, M., Landerl, K. & Ugen, S. Solving arithmetic problems in first and second language: Does the language context matter? Learning and Instruction 42 , 72–82 (2016). Gennari, S. P., Mirković, J. & Macdonald, M. C. Animacy and competition in relative clause production: a cross-linguistic investigation. Cogn. Psychol. 65 , 141–176 (2012). Humphreys, G. F., Mirković, J. & Gennari, S. P. Similarity-based competition in relative clause production and comprehension. J. Mem. Lang. 89 , 200–221 (2016). Protopapas, A. CheckVocal: a program to facilitate checking the accuracy and response time of vocal responses from DMDX. Behav. Res. Methods 39 , 859–862 (2007). Duyck, W., Depestel, I., Fias, W. & Reynvoet, B. Cross-lingual numerical distance priming with second-language number words in native- to third-language number word translation. Q. J. Exp. Psychol. (Hove) 61 , 1281–1290 (2008). Chincotta, D. & Underwood, G. Bilingual memory span advantage for Arabic numerals over digit words. Br. J. Psychol. 88 , 295–310 (1997). Shaki, S. & Fischer, M. H. Reading space into numbers--a cross-linguistic comparison of the SNARC effect. Cognition 108 , 590–599 (2008). Fischer, M. H., Shaki, S. & Cruise, A. It takes just one word to quash a SNARC. Exp. Psychol. 56 , 361–366 (2009). Bächtold, D., Baumüller, M. & Brugger, P. Stimulus-response compatibility in representational space. Neuropsychologia 36 , 731–735 (1998). Fischer, M. H., Mills, R. A. & Shaki, S. How to cook a SNARC: Number placement in text rapidly changes spatial–numerical associations. Brain Cogn. 72 , 333–336 (2010/4). Göbel, S. M. Up or down? Reading direction influences vertical counting direction in the horizontal plane - a cross-cultural comparison. Front. Psychol. 6 , 228 (2015). Göbel, S. M., McCrink, K., Fischer, M. H. & Shaki, S. Observation of directional storybook reading influences young children’s counting direction. J. Exp. Child Psychol. 166 , 49–66 (2018). Footnotes Please note, that, due to an oversight, this hypothesis was stated incorrectly in the opposite direction in our pre-registration. Tables Tables are available in the Supplementary Files section. Additional Declarations No competing interests reported. Supplementary Files Tables.docx Cite Share Download PDF Status: Under Review Version 1 posted Editorial decision: Revision requested 17 Feb, 2026 Reviews received at journal 17 Feb, 2026 Reviews received at journal 16 Feb, 2026 Reviews received at journal 03 Feb, 2026 Reviewers agreed at journal 26 Jan, 2026 Reviewers agreed at journal 24 Jan, 2026 Reviewers agreed at journal 21 Jan, 2026 Reviewers agreed at journal 10 Jan, 2026 Reviewers agreed at journal 10 Nov, 2025 Reviewers agreed at journal 20 Oct, 2025 Reviewers invited by journal 09 Oct, 2025 Editor assigned by journal 09 Oct, 2025 Submission checks completed at journal 07 Oct, 2025 First submitted to journal 06 Oct, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7745359","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Article","associatedPublications":[],"authors":[{"id":532348018,"identity":"2d12d429-c596-4046-b7ac-661869138098","order_by":0,"name":"Silke M Göbel","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA/0lEQVRIie2RsQrCMBCGLxR0KbimCPUVIgVB0HdpETpVHARxUBECcRFcBX0IfQIrgbo5OzjoG2QSBwfPWsQl1VEw35C7IR/3HwdgMPwilsWfDQUSq7Sm+DkKeSmwnX+lwJsi7W+UyoRwpmDolso8lk3R6Dhj66yICLUKk4QHc9h5zjLxZVuE9QUUPEpEpFdwF8yTBKtDxFCRzAWoARF9fTCOyg2VzaGjZD1VipdcBTBYC2AQrGiEPSplsB9TcoKhUp2y2KPHhG2n+5A53O5Sf69fvzKTkl77I7e04OfTtddgdDdZK9Vr6YNls56VFPCxIP+QGaOs3j5/NRgMhv/jDpSQUZqtRpRjAAAAAElFTkSuQmCC","orcid":"","institution":"University of York","correspondingAuthor":true,"prefix":"","firstName":"Silke","middleName":"M","lastName":"Göbel","suffix":""},{"id":532348019,"identity":"3498ae07-6720-4c9b-8cbf-20de39294b51","order_by":1,"name":"Veniamin Shiron","email":"","orcid":"","institution":"University of York","correspondingAuthor":false,"prefix":"","firstName":"Veniamin","middleName":"","lastName":"Shiron","suffix":""},{"id":532348020,"identity":"8839d341-1300-4cf2-bab3-bccb2eeea629","order_by":2,"name":"Miriam Tucker","email":"","orcid":"","institution":"University of York","correspondingAuthor":false,"prefix":"","firstName":"Miriam","middleName":"","lastName":"Tucker","suffix":""},{"id":532348021,"identity":"97d468cb-0a56-464f-ace6-b4ce64041d51","order_by":3,"name":"Angela De Bruin","email":"","orcid":"","institution":"University of York","correspondingAuthor":false,"prefix":"","firstName":"Angela","middleName":"","lastName":"De Bruin","suffix":""}],"badges":[],"createdAt":"2025-09-29 21:23:14","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7745359/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7745359/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":94155449,"identity":"0ab5e0d9-6617-4139-9222-83b5b42941bd","added_by":"auto","created_at":"2025-10-23 02:46:11","extension":"docx","order_by":0,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":512314,"visible":true,"origin":"","legend":"","description":"","filename":"BilingualnumbercomparisonTR1.docx","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/28f8375a00a7d2705867fd4e.docx"},{"id":94155450,"identity":"87d4c20c-de3c-432e-ada2-24444909c2ed","added_by":"auto","created_at":"2025-10-23 02:46:11","extension":"jpg","order_by":1,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":165661,"visible":true,"origin":"","legend":"","description":"","filename":"Figure1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/80e1424ce567aea4433a137d.jpg"},{"id":94155454,"identity":"b5006e35-710b-49f8-a105-8baa3a13aa41","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"jpg","order_by":2,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":614106,"visible":true,"origin":"","legend":"","description":"","filename":"Figure2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/cfab03dbea0a6e49fe1c31ae.jpg"},{"id":94155459,"identity":"b5ab3f0f-b002-42ef-ba17-f2ee654835a9","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"jpg","order_by":3,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":605571,"visible":true,"origin":"","legend":"","description":"","filename":"Figure3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/8b3410ca49c4ac423ce9f874.jpg"},{"id":94155451,"identity":"beb0133e-7440-4a4b-a99b-5a83d10d7054","added_by":"auto","created_at":"2025-10-23 02:46:11","extension":"json","order_by":4,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":6082,"visible":true,"origin":"","legend":"","description":"","filename":"aab1ab8484cc4c2ba9a240e3ca653234.json","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/b084407a08605fbcc94c8daf.json"},{"id":94155455,"identity":"ae04e840-76e0-4d1d-b3b4-e71e2ecba6e8","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"xml","order_by":5,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":188544,"visible":true,"origin":"","legend":"","description":"","filename":"aab1ab8484cc4c2ba9a240e3ca6532341enriched.xml","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/1e46ebbd16be0030ec8ae903.xml"},{"id":94157078,"identity":"576c7bdf-0c84-4617-9338-5073e9ac75c6","added_by":"auto","created_at":"2025-10-23 03:02:12","extension":"jpg","order_by":6,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":165661,"visible":true,"origin":"","legend":"","description":"","filename":"Figure1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/467e68836256b1c37b5674d2.jpg"},{"id":94157476,"identity":"81bd9934-bd8b-4659-9c21-7ccc7a93f458","added_by":"auto","created_at":"2025-10-23 03:10:12","extension":"jpg","order_by":7,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":614106,"visible":true,"origin":"","legend":"","description":"","filename":"Figure2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/57c8e3375e20e46c255c49be.jpg"},{"id":94155464,"identity":"80fb3b8a-8e6c-408a-995a-4c7b12547a33","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"jpg","order_by":8,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":605571,"visible":true,"origin":"","legend":"","description":"","filename":"Figure3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/97ec10223533099b8384b176.jpg"},{"id":94156712,"identity":"775a7b7f-4d4f-44de-82d7-9cdc1e5efed1","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"jpeg","order_by":9,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":60614,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage1.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/4d5c54e6195ed49c601d6416.jpeg"},{"id":94156716,"identity":"752f43f0-5987-4b7e-97e1-74d18967548e","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"jpeg","order_by":10,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":276418,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage2.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/a68bca0776e0365c37150369.jpeg"},{"id":94155469,"identity":"93ea3385-4116-4a43-90e5-9575581c6e03","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"png","order_by":11,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":189244,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/65bc22695d515ed540ccccd6.png"},{"id":94155463,"identity":"f1f93fa0-bfcd-4ff6-8cc7-748631bba717","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"png","order_by":12,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":105276,"visible":true,"origin":"","legend":"","description":"","filename":"OnlineFigure1.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/3666d3ef95790fbe2b075376.png"},{"id":94156713,"identity":"f3f7803c-161f-4bd3-8d31-a3eb2f684a42","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"png","order_by":13,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":87554,"visible":true,"origin":"","legend":"","description":"","filename":"OnlineFigure2.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/4da85d7494c63ae8b189dd37.png"},{"id":94156720,"identity":"3fe93c51-c463-4fc7-babb-fbc7b587b31e","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"png","order_by":14,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":84532,"visible":true,"origin":"","legend":"","description":"","filename":"OnlineFigure3.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/0b121e4bdc9e3610fe363b1d.png"},{"id":94155470,"identity":"01e1b1dc-aed5-49a1-94ab-dc750c3cebb3","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"png","order_by":15,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":37927,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/406a9a61fca1f6db6cf28447.png"},{"id":94156718,"identity":"63176fb1-5147-4680-8617-502067e75f4a","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"png","order_by":16,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":48260,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/ced54922d5e51c29d0f76a6a.png"},{"id":94155466,"identity":"48c68e92-de58-48ee-9794-fa4afd69dab4","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"png","order_by":17,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":37940,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/15ea326800b0be51a99485fc.png"},{"id":94157080,"identity":"5cf3f368-7c17-4651-aa28-da147cf5c371","added_by":"auto","created_at":"2025-10-23 03:02:12","extension":"xml","order_by":18,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":179893,"visible":true,"origin":"","legend":"","description":"","filename":"aab1ab8484cc4c2ba9a240e3ca6532341structuring.xml","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/2de034fd2ce93620511abb74.xml"},{"id":94156719,"identity":"ff93a7af-6a55-4588-a2f6-df735aa7971d","added_by":"auto","created_at":"2025-10-23 02:54:12","extension":"html","order_by":19,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":205124,"visible":true,"origin":"","legend":"","description":"","filename":"earlyproof.html","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/914e27056015fa692eb8abe4.html"},{"id":94155446,"identity":"7e0b17be-7451-443a-8a4b-9870a9e523ab","added_by":"auto","created_at":"2025-10-23 02:46:11","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":134952,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eBilingual Triple Code Model (BTCM) adapted from Lachelin et al. (2024)\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/4483a709e8cf5198f4e7238e.jpg"},{"id":94155452,"identity":"a31ed58e-d62a-4156-b0a7-095a429a49df","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":614106,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eMean RTs by Type (the condition being Compatible or Incompatible numbers) and Group (bilingual and monolingual). The centre of the triangle depicts the group mean, the black horizontal line the median. Black dots represent individual participants.\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/68aec764d1d1759992d98058.jpg"},{"id":94155456,"identity":"86cb2910-2ca1-4bfa-924b-0727ae43fad9","added_by":"auto","created_at":"2025-10-23 02:46:12","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":605571,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eMean RTs by Type (the condition being Compatible or Incompatible numbers) and Language (number trial preceded by English or German scene description). The centre of the triangle depicts the group mean, the black horizontal line the median. Black dots represent individual participants.\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Figure3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/432511875adf7b1cd7b78363.jpg"},{"id":94157691,"identity":"bb3698cf-e5c5-4e67-b88e-bec432e1958e","added_by":"auto","created_at":"2025-10-23 03:18:12","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2115608,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/f109a3a0-bead-457a-ace2-6f747bf40fdc.pdf"},{"id":94155448,"identity":"39d52d8c-2786-4c99-b574-1f9518076e97","added_by":"auto","created_at":"2025-10-23 02:46:11","extension":"docx","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":24048,"visible":true,"origin":"","legend":"","description":"","filename":"Tables.docx","url":"https://assets-eu.researchsquare.com/files/rs-7745359/v1/dbb4c115e0f461fd3d4dfe66.docx"}],"financialInterests":"No competing interests reported.","formattedTitle":"Which language matters? Number comparison in bilinguals","fulltext":[{"header":"Introduction","content":"\u003cp\u003eNumbers are ubiquitous in modern life. We encounter exact numbers in different formats, most frequently as spoken or written number words (\u0026lsquo;twenty-four\u0026rsquo;) or Arabic digits (24). While number words are language-specific \u003csup\u003e\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e\u003c/sup\u003e, Arabic digits are often seen as an abstract, pure and non-linguistic number format. However, even basic number processing of Arabic digits is influenced by the language background of the person processing them \u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e,\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e\u003cp\u003eStrong support for an influence of language background on number processing comes from developmental research \u003csup\u003e\u003cspan additionalcitationids=\"CR5 CR6 CR7 CR8\" citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e\u003c/sup\u003e. For example, whether a language has a clear singular/plural distinction (one frog, two frog\u003cb\u003es\u003c/b\u003e, three frog\u003cb\u003es\u003c/b\u003e, four frog\u003cb\u003es\u003c/b\u003e...) might influence the acquisition of the meaning of the number word \u0026lsquo;one\u0026rsquo; \u003csup\u003e10\u003c/sup\u003e. English-speaking children typically learn the meaning of \u0026lsquo;one\u0026rsquo; around 24 to 26 months \u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e,\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e\u003c/sup\u003e. But in languages with less consistent or without obligatory associations of number words with singular/plural nouns, such as Japanese, children learn the meaning of the number word \u0026lsquo;one\u0026rsquo; several months later \u003csup\u003e\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e,\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e,\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e\u003c/sup\u003e than in English.\u003c/p\u003e\u003cp\u003eMoving on to larger number words, number word construction is very regular in some languages, such as Japanese and Mandarin. Once children have learnt the number words one to ten in these languages, they can combine these ten elements to create all number words up to 99. Several studies investigated the acquisition of teen numbers (11\u0026ndash;19) in different languages and reported a delay in number word acquisition for numbers larger than ten for languages for which number words are less regular \u003csup\u003e\u003cspan additionalcitationids=\"CR15 CR16\" citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e\u003cp\u003eOne\u0026rsquo;s language acquired from birth does not only influence number word acquisition and numerical and arithmetic development \u003csup\u003e\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e,\u003cspan additionalcitationids=\"CR19 CR20\" citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e\u003c/sup\u003e, it also affects number processing of Arabic digits in adults \u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e,\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e,\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e\u003c/sup\u003e. One effect that has often been used to investigate language influences on number processing in adults is the unit-decade compatibility effect \u003csup\u003e\u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e\u003c/sup\u003e. This effect is observed in multi-digit number comparison, when participants are asked to choose, for example, the numerically larger number out of two-digit number pairs in Arabic format. A number pair is considered \u003cem\u003eunit-decade compatible\u003c/em\u003e when comparing tens and units leads to similar response biases (e.g., 42_57: 4\u0026thinsp;\u0026lt;\u0026thinsp;5 and 2\u0026thinsp;\u0026lt;\u0026thinsp;7). In turn, a number pair is \u003cem\u003eunit-decade incompatible\u003c/em\u003e when the respective comparisons lead to opposite response biases (e.g., 47_62: 4\u0026thinsp;\u0026lt;\u0026thinsp;6 but 7\u0026thinsp;\u0026gt;\u0026thinsp;2). Compatible number pairs are usually responded to faster and with fewer errors than incompatible number pairs \u003csup\u003e\u003cspan additionalcitationids=\"CR25\" citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e\u003c/sup\u003e.\u003c/p\u003e\u003cp\u003eSeveral studies have shown that the size of the compatibility effect is influenced by which language people speak \u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e,\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e,\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e. Comparing for example the compatibility effect in a German- and an English-speaking sample \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e\u003c/sup\u003e, a more pronounced effect was found in the German-speaking group. The authors explain this finding by differences in the verbal number word systems. While the English number word system is fairly consistent for numbers larger than 20, German two-digit number words are inverted with respect to the Arabic digit notation (42 is spoken as \u0026lsquo;zweiundvierzig\u0026rsquo;, literally translated \u0026lsquo;two-and-forty\u0026rsquo;). That means that in German \u0026ndash; in contrast to the Arabic notation - the first named digit for double-digit numbers is the unit digit. Clearly, mapping between number words and Arabic digits is harder in languages with number word inversion \u003csup\u003e\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e\u003c/sup\u003e. Although the difference between groups was small, a larger compatibility effect in German than English speakers \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e\u003c/sup\u003e demonstrates that even adults activate number words during processing of numbers in Arabic format.\u003c/p\u003e\u003cp\u003eMost of the previous research in this area has focused on different language groups. However, over half of the world\u0026rsquo;s population is bi- or multilingual \u003csup\u003e\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e\u003c/sup\u003e and many of these bilinguals need to alternate between different number-word systems in their everyday communications. The two languages of a bilingual influence each other and compete with each other for selection (e.g., \u003csup\u003e30\u003c/sup\u003e), but knowledge about how a bilingual\u0026rsquo;s languages influence number processing in adults is still limited.\u003c/p\u003e\u003cp\u003eFor exact arithmetic, language of instruction/training seems to matter for bilinguals \u003csup\u003e\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e\u003c/sup\u003e. For example, bilinguals trained in exact arithmetic were faster at testing when they were tested in the language they were originally trained in than when tested in their other language \u003csup\u003e\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e\u003c/sup\u003e (see also \u003csup\u003e\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e\u003c/sup\u003e). In terms of more general language effects (beyond the language of training and testing), a meta-analysis \u003csup\u003e\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e\u003c/sup\u003e found that bilinguals were faster and more accurate in number naming and arithmetic in their first (L1) than second (L2) language. Schiltz et al. (2024)\u003csup\u003e\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e\u003c/sup\u003e in a recent review of the influence of multilingualism on numerical and mathematical competencies also conclude that performance in basic numerical skills is better in L1 (and/or in the language of mathematics learning) than in L2.\u003c/p\u003e\u003cp\u003eThese findings are in line with a prominent model of number processing, the Triple Code Model \u003csup\u003e\u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e35\u003c/span\u003e\u003c/sup\u003e. This model proposes the existence of three independent but interconnected numerical codes: a language-dependent verbal code for spoken and written number words, an analog magnitude code representing numerical magnitude approximately and a visual code for numerical symbols such as Arabic digits. The verbal code is proposed to be essential for retrieving arithmetic facts, and the language-independent analog magnitude code for number comparison and estimation. When numbers are presented visually as Arabic digits, they are first processed by the visual code. Then there are two processing routes, a direct asemantic route between the visual and the approximate code for number comparison and an indirect route which goes from the visual code via the verbal code to activate the approximate code. The influence of language during number comparison of Arabic digits along with other findings \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e,\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e suggests that both routes are activated in parallel when processing Arabic digits.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003eLachelin et al. (2024) \u003csup\u003e\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u003c/sup\u003e have recently suggested a bilingual Triple code model (BTCM). In this model, bilinguals have two language-specific codes for number processing (see Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e), a verbal code in L1 and a verbal code in L2, which are connected and can influence each other. The presence of two language-specific nodes raises the question which language (L1 or L2) bilinguals activate most during number processing of Arabic digits.\u003c/p\u003e\u003cp\u003eTo our knowledge, only two previous studies investigated the compatibility effect with Arabic digits in bilinguals. Van Rinsveld et al. (2016) \u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e investigated the compatibility effect in German or French monolinguals and German-French bilinguals and found that the German-French bilinguals exhibited the same pattern of results as the German monolinguals. However, their bilingual group was quite different from the two monolingual groups. The bilingual participants\u0026rsquo; L1 was Luxembourgish (which developed from a dialectal variant of German) and they also spoke German and French, which are official primary school languages for numeracy and literacy in Luxembourg. German monolinguals were University students at the University of Graz in Austria and French monolinguals were from Brussels in Belgium.\u003c/p\u003e\u003cp\u003eMacizo et al. (2011) \u003csup\u003e\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e\u003c/sup\u003e tested whether the level of L2 proficiency affected the size of the compatibility effect. Two groups of German(L1)-English(L2) bilinguals performed a double-digit number comparison task with Arabic digits, L1 and L2 number words. Participants were equated in terms of their proficiency in German (L1), but were either more or less proficient in English (L2; based on self reports only). The size of the compatibility effect for Arabic digits was not affected by L2 proficiency, while proficiency affected the size of the compatibility effect with number words. While both studies provide evidence for little influence of L2 on the compatibility effect, this could be due to the specific language background of their participants. Macizo et al. focused on proficiency assessed through self reports, which are not always accurate reflections of objective proficiency \u003csup\u003e\u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e39\u003c/span\u003e\u003c/sup\u003e and the German monolingual and bilingual groups in Van Rinsveld et al.\u0026rsquo;s (2016) \u003csup\u003e\u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e\u003c/sup\u003e study had some significant differences in terms of first language, language of instruction, and language environment (e.g., early exposure to German in the bilingual group growing up in a very multilingual environment).\u003c/p\u003e\n\u003ch3\u003eCurrent Study\u003c/h3\u003e\n\u003cp\u003eThe aim of our study was to investigate the effect of a bilingual’s second language (compared to functional monolinguals, Experiment 1; compared to the bilingual’s first language, Experiment 2) on the compatibility effect.\u003c/p\u003e\u003cp\u003eWe examined this by comparing German-functional monolinguals and German-English bilinguals in Experiment 1 and by manipulating language activation within German-English bilinguals in Experiment 2. Larger compatibility effects have been reported in participants with an L1 with number word inversion. Across the two experiments, we therefore examined the effect on the compatibility effect of an L2 without number word inversion (English) relative to an L1 with number word inversion (German).\u003c/p\u003e\u003cp\u003eIn both studies we expected to find an overall significant unit-decade compatibility effect (significantly longer response times for incompatible than compatible number pairs). For Experiment 1, based on previous research showing a larger compatibility effect in German- than English-speaking participants \u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e,\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e,\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e, we predicted a reduced compatibility effect in our German-English bilingual speakers compared to the German monolinguals. For Experiment 2, we expected a larger compatibility effect for number comparison trials following activation of German (elicited through a preceding scene description trial) than following activation of English.[1]\u003ca class=\"FNLink\" href=\"#Fn1\" id=\"#FNLinkFn1\"\u003e\u003c/a\u003e\u003c/p\u003e"},{"header":"Experiment 1","content":"\u003ch2\u003eMethods\u003c/h2\u003e\n\u003ch2\u003eParticipants\u003c/h2\u003e\n\u003cp\u003eExperiment 1 was completed by 31 German functional monolinguals and 30 German-English bilinguals. While we will refer to the German “functional monolingual” group as “monolingual”, these participants did have some L2-English knowledge. However, they were functional monolinguals in the sense of predominantly or only using their L1-German in their daily lives. The two groups did not differ significantly in terms of age (monolingual \u003cem\u003eM age\u003c/em\u003e = 29.1 years old, \u003cem\u003eSD\u003c/em\u003e = 6.3; bilingual \u003cem\u003eM age\u003c/em\u003e = 32.0 years old, \u003cem\u003eSD\u003c/em\u003e = 6.8; \u003cem\u003et\u003c/em\u003e(59) = 1.732, \u003cem\u003ep\u003c/em\u003e = .089). The monolingual group included fewer female participants than the bilingual group (monolingual 13 female, 18 male; 23 right-handed; bilingual 21 female, 8 male, 1 transgender; 25 right-handed). Participant groups did not differ significantly in terms of mean number of years of education (monolingual \u003cem\u003eM\u003c/em\u003e = 14.8 years, \u003cem\u003eSD\u003c/em\u003e = 4.5; bilingual \u003cem\u003eM\u003c/em\u003e = 16.5, \u003cem\u003eSD\u003c/em\u003e = 3.7; \u003cem\u003et\u003c/em\u003e(59) = 1.593, \u003cem\u003ep\u003c/em\u003e = .116); but bilinguals overall had a higher level of completed education than the monolingual group (monolingual: 17 secondary school/A-levels/vocational training; 10 undergraduate degree; 3 postgraduate degree; 1 doctorate; bilingual: 9 secondary school/A-levels; 6 undergraduate degree; 9 postgraduate degree; 6 doctorate; Χ\u003csup\u003e2\u003c/sup\u003e(3) = 10.019, \u003cem\u003ep\u003c/em\u003e = .018). All participants provided informed consent at the start of the study and ethics approval was granted by the Ethics Committee in the Department of Psychology at the University of York. The study was pre-registered and was conducted following the Declaration of Helsinki.\u003c/p\u003e\n\u003cp\u003eBefore completing the main study, participants completed a pre-screening. This was completed by 71 potential monolingual and 62 potential bilingual participants. We only invited participants to take part in the main study if they met the following background criteria. All participants had to have no (self-reported) neurological, language, or reading difficulties, or vision or hearing impairments. All participants had to have acquired German from birth, with no other languages acquired from birth and spoken fluently (apart from English in the bilingual group). All participants also had to score at least 60% on the German LexTALE, a short lexical decision task included to measure vocabulary knowledge. In this task, participants had to indicate if letter strings formed an existing word or not \u003csup\u003e40\u003c/sup\u003e. LexTALE scores were calculated following the recommended scoring\u003csup\u003e40\u003c/sup\u003e: ((number of words correct/total words * 100) + (number of nonwords correct/total nonwords *100)) / 2.\u003c/p\u003e\n\u003cp\u003eFor the functional monolinguals, our additional language requirements were as follows: no English acquisition from birth and weekly use of English being reported as “sometimes” or lower. All included participants also reported having learnt numbers during childhood in German and not having used numbers in school and/or university, or in their current lives, in a language other than German. All monolingual participants were born in Germany or Austria and were currently living there. Given that English is taught in schools in Germany and Austria, we could not recruit people without any knowledge of English. We therefore focused on “functional monolinguals” who do not use English more than sometimes in their daily lives. These participants did have some L2-English knowledge (see Table\u0026nbsp;1). The majority of participants reported a self-rated proficiency below 8/10 for English speaking and understanding. Although our pre-registered cut-off was \u0026lt; 8/10, we also included three participants who self-rated their English understanding (and in one case English speaking) as 8/10. This was done to not reduce our sample size further. Of the 71 monolingual participants who completed the pre-screening, 37 were invited for and also completed the main study. Of those, one participant was excluded afterwards as they reported speaking another language well and at home from childhood, one participant because they reported frequent use of English, and four participants because they completed the pre-screening in the same session as the main task but did not actually meet the requirements due to using English too frequently and having a self-rated English proficiency of 9/10. This left us with 31 monolingual participants in the analysis.\u003c/p\u003e\n\u003cp\u003eFor the bilinguals, our pre-registered inclusion criteria were as follows: did not use another language than German or English at home or school during childhood; self-rated English proficiency at least 8/10 for speaking and understanding; uses English at least “frequently” on a weekly basis; current language use score on the Language and Social Background Questionnaire (LSBQ)\u003csup\u003e41\u003c/sup\u003e of 2.5 or higher (1 = all German use; 5 = all English use); and no indication that another language than German or English was used exclusively in any of the contexts assessed. To examine this, we used a modified version of the LSBQ, asking about participants’ language use in different contexts, activities, and with various conversation partners. All participants also scored at least 60% on the LexTALE in both English and German. We had an additional pre-registered point stating we would exclude participants who reported their language use during childhood to be above 2.5 on the LSBQ (where higher scores reflect more English use). Two participants scored slightly above this but were included because this only reflected some English education as a teenager. As can be seen in Table\u0026nbsp;1, bilingual participants had a high proficiency in and use of both languages. They also reported occasional to regular language switching (on a scale from 1 = never to 10 = very frequently) on a daily basis (\u003cem\u003eM\u003c/em\u003e = 6.3, \u003cem\u003eSD\u003c/em\u003e = 2.7), within a conversation (\u003cem\u003eM\u003c/em\u003e = 4.8, \u003cem\u003eSD\u003c/em\u003e = 2.4), and within a sentence (\u003cem\u003eM\u003c/em\u003e = 4.3, \u003cem\u003eSD\u003c/em\u003e = 2.7).\u003c/p\u003e\n\u003cp\u003eIn terms of proficiency (see Table 1), participants in the bilingual group showed significantly higher self-rated English proficiency than the monolingual group (all \u003cem\u003ep\u003c/em\u003es \u0026lt; .05). German proficiency scores showed no group differences in the self ratings or LexTALE (\u003cem\u003ep\u003c/em\u003es \u0026gt; .15), with the exception of self-rated German speaking proficiency being slightly lower in the bilingual group (\u003cem\u003ep\u003c/em\u003e = .031).\u003c/p\u003e\n\u003cp\u003eAll bilingual participants learnt numbers in German during childhood and, where relevant, used German and/or English for numbers at university and when using numbers in their daily lives, with no use of other languages. In the final questionnaire, we only asked participants to choose one language that they predominantly used for number tasks (e.g., subtractions) and remembering numbers. We did not examine how frequently participants encountered numbers in English or in German and could therefore not assess our pre-registered point of encountering English numbers at least 25% of the time. While all participants were born in Germany, they were currently living in an English-dominant country (e.g., UK or US), making it very likely that they had regular exposure to numbers in English.\u003c/p\u003e\n\u003cp\u003eParticipants were also asked questions about their experience with numbers and mathematics, as reported in Table 2. The two participant groups were comparable in terms of the number of hours they had received mathematics teaching in school, the number of people studying mathematics at university level, and the number of people working with numbers in their jobs. Both groups also learnt mathematics in German in school and were taught numbers in German as their first language. The two groups differed, as intended, in their current use of English for talking about mathematics in university, their main language for using numbers for daily activities, and remembering number sequences. These tasks were all done in German in the monolingual group while within the bilingual group, approximately half of the participants reported English to be their main language for number related activities.\u003c/p\u003e\n\u003cp\u003eOur pre-registered plan was to also include a comparison group of German-Dutch bilinguals. However, recruitment was very difficult and only ten participants started the study and did not always meet our inclusion criteria. We therefore decided to only include German functional monolinguals and German-English bilinguals in this experiment, which still allowed us to answer our main research question regarding the influence of English in bilinguals. Our sample sizes for these two groups were slightly lower than the initial pre-registered aim of 40 participants per group. However, in that pre-registration, we also included an optional stopping rule of 30 participants if recruitment turned out difficult, which we ended up having to follow. Our power analysis was based on previous research comparing the compatibility effect between German- and English-speaking adults showing a partial eta squared effect size of .11 \u003csup\u003e28\u003c/sup\u003e. Using G*Power, we conducted a power analysis with a slightly larger effect size (.15-.20). This resulted in 95% power with 34 participants per group for an effect size of .15, with the initially planned three-group comparison. While our sample size was slightly lower than planned, we included a large number of trials per condition (120 compatible and 120 incompatible trials per participant), which increases power.\u003c/p\u003e\n\u003ch3\u003eDesign\u003c/h3\u003e\n\u003cp\u003eThe number-comparison task had two independent variables: Type (compatible, e.g., 46 − 21, or incompatible, e.g., 81 − 32; within-subject) and Group (monolingual or bilingual; between-subject). Additional filler items (20%, within-decade comparisons, e.g., 68 − 61) were included in the task but not in the analysis. Our dependent variable was reaction times (RTs).\u003c/p\u003e\n\u003ch3\u003eProcedure\u003c/h3\u003e\n\u003cp\u003eParticipants were recruited through Prolific.co and tested using Gorilla.sc \u003csup\u003e42\u003c/sup\u003e. All instructions throughout the study were presented in German. Most participants (all bilinguals and all apart from 8 monolinguals) first included a pre-screening, which included our questionnaire and the LexTALE(s). Only participants meeting the inclusion criteria (see “Participants”) were invited to also take part in the main study. For a few monolinguals, we combined the pre-screening and main study to facilitate recruitment.\u003c/p\u003e\n\u003cp\u003eIn the main study, participants were asked to complete a two-digit number\u003c/p\u003e\n\u003cp\u003emagnitude comparison task. The design of our number comparison task closely followed the design of the number comparison task used in study 1 of Bahnmueller et al. (2019)\u003csup\u003e28\u003c/sup\u003e. In each trial, two numbers were presented simultaneously and above each other in the center of the screen. The stimulus set consisted of 300 two-digit Arabic number pairs between 21 and 98, with 120 unit-decade compatible (e.g., 43_58), 120 unit-decade incompatible (e.g., 48_53; taken from\u003csup\u003e24\u003c/sup\u003e), and 60 within-decade filler pairs (e.g., 51_56, 20% of all trials). Overall distance, decade distance, unit distance, and problem size (i.e., the sum of the two to-be-compared numbers) were matched between the unit-decade compatible and unit-decade incompatible stimulus groups.\u003c/p\u003e\n\u003cp\u003eParticipants had to choose the larger of two numbers presented. They were instructed to respond as quickly and accurately as possible with the right index finger (pressing the U key) in case the upper number was larger, and with the left index finger (pressing the N key) when the bottom number was larger. For half of the pairs, the upper number was larger. In the other half, the lower number was larger. Participants had to press the same button not more than twice in a row. Stimuli were presented in white (font: Courier New, bold; font size: 24) against a black background until a response was given. Preceding each trial, a fixation cross was presented in the middle of the screen for 500 ms. Trials were separated by an interstimulus interval of 500 ms. If no response was given, the next trial was started automatically after three seconds.\u003c/p\u003e\n\u003cp\u003eThe experiment started with 12 practice trials with feedback, followed by five experimental blocks of 60 number pairs each without feedback.\u003c/p\u003e\n\u003cdiv id=\"Sec7\"\u003e\n \u003ch2\u003eData analysis\u003c/h2\u003e\n \u003cp\u003eThe task data are available at https://osf.io/gjnwt/.\u003c/p\u003e\n \u003cp\u003eThe RT data were analysed using ANOVAs and linear mixed-effects (LME) analyses (using R version 4.4.1, lme4 version 1.1–36 and lmerTest version 3.1-3). Accuracy was high (\u003cem\u003eM\u003c/em\u003e = 97.2%, \u003cem\u003eSD\u003c/em\u003e = 2.4 on compatible and incompatible trials combined; all participants scored 90% or higher) and therefore not analysed further. Prior to RT analysis, we removed inaccurate responses and RTs \u0026gt; 3 \u003cem\u003eSD\u003c/em\u003e above or below the mean per participant and condition (0.8% of trials)\u003csup\u003e43\u003c/sup\u003e. No RTs were below 200ms. Visual inspection of the RT data revealed a non-normal distribution and RTs were therefore log-transformed. The analysis only included compatible and incompatible trials, fillers (within-decade comparisons) were not included. The LME analysis included Type (compatible coded as -0.5 and incompatible as 0.5) and Group (monolingual coded as -0.5 and bilingual coded as 0.5) as fixed effects. Participant and item (number pair shown on the screen) were included as random effects, with the corresponding within-participant and within-item slopes. The full model converged. We also ran the corresponding ANOVA, including a Bayesian ANOVA to quantify evidence for the null versus the alternative hypothesis. Here, we compared a null model including main effects of Type (compatibility effect) and Group to a model including an interaction between Type and Group to assess evidence for/against a difference in compatibility effect between bilinguals and monolinguals. The Bayes Factor (BF) is reported in the form BF\u003csub\u003e01\u003c/sub\u003e (evidence for the null over the alternative hypothesis), with values above 1 supporting the null hypothesis. Note that while we pre-registered the use of z-scored RTs in the ANOVA, we used log-transformations instead for consistency with the LME analysis. Z-scoring of RTs was furthermore not needed given that no overall RT differences were observed between the two groups.\u003c/p\u003e\n\u003c/div\u003e\n\u003ch3\u003eResults and discussion\u003c/h3\u003e\n\u003cp\u003eThe LME analysis showed a significant effect of Type (β = 0.027, \u003cem\u003eSE\u003c/em\u003e = 0.013, \u003cem\u003et\u003c/em\u003e = 2.088, \u003cem\u003ep\u003c/em\u003e = .038), showing that there was a compatibility effect with faster responses to compatible number pairs (\u003cem\u003eM\u003c/em\u003e = 748ms, \u003cem\u003eSD\u003c/em\u003e = 125) than to incompatible pairs (\u003cem\u003eM\u003c/em\u003e = 767ms, \u003cem\u003eSD\u003c/em\u003e = 123), see Fig.\u0026nbsp;2. There was no main effect of Group (β = 0.045, \u003cem\u003eSE\u003c/em\u003e = 0.041, \u003cem\u003et\u003c/em\u003e = 1.094, \u003cem\u003ep\u003c/em\u003e = .279), suggesting bilinguals and monolinguals responded at comparable speed overall (Monolingual \u003cem\u003eM\u003c/em\u003e = 741ms, \u003cem\u003eSD\u003c/em\u003e = 121; Bilingual \u003cem\u003eM\u003c/em\u003e = 774ms, SD = 126). The number compatibility effect was numerically a little smaller for bilinguals (\u003cem\u003eM\u003c/em\u003e = 14ms, \u003cem\u003eSD\u003c/em\u003e = 18) than for monolinguals (\u003cem\u003eM\u003c/em\u003e = 24ms, \u003cem\u003eSD\u003c/em\u003e = 32) but this did not reach significance (β = -0.014, \u003cem\u003eSE\u003c/em\u003e = 0.008, \u003cem\u003et\u003c/em\u003e = -1.843, \u003cem\u003ep\u003c/em\u003e = .070).\u003c/p\u003e\n\u003cp\u003eThe ANOVA showed the same patterns, with a significant main effect of Type (\u003cem\u003eF\u003c/em\u003e(1,59) = 49.220, \u003cem\u003ep\u003c/em\u003e \u0026lt; .001, ηp² = 0.455), no significant effect of Group (\u003cem\u003eF\u003c/em\u003e(1,59) = 1.215, \u003cem\u003ep\u003c/em\u003e = .275, ηp² = 0.020), and no significant interaction between Group and Type (\u003cem\u003eF\u003c/em\u003e(1,59) = 3.390, \u003cem\u003ep\u003c/em\u003e = .070, ηp² = 0.054). The Bayesian analysis provided no support for the presence of a compatibility-effect difference between the two groups (BF\u003csub\u003e01\u003c/sub\u003e = 1.72, error% = 2.57).\u003c/p\u003e\n\u003cp\u003eAll instructions were provided in German, thereby creating a German context throughout the study for all participants. Within the bilingual group, however, some participants reported predominantly using English for daily-life number processing (for tasks such as subtractions or for remembering numbers), while others reported using German, or did not report a consistent preference across different types of number activities. An exploratory analysis comparing 12 participants reporting daily-life number processing in English to 10 participants reporting doing such processing in German showed a slightly smaller number compatibility-effect cost in the task for participants more likely to use English to process numbers (\u003cem\u003eM\u003c/em\u003e cost = 8ms, \u003cem\u003eSD\u003c/em\u003e = 21) than those more likely to use German (\u003cem\u003eM\u003c/em\u003e cost = 20ms, \u003cem\u003eSD\u003c/em\u003e = 12; \u003cem\u003et\u003c/em\u003e(20) = 1.449, \u003cem\u003ep\u003c/em\u003e = .163; \u003cem\u003ed\u003c/em\u003e = 0.621).\u003c/p\u003e\n\u003cp\u003eExperiment 1 showed no significant difference between the bilingual and monolingual groups. We attempted to make our functional monolingual group as monolingual as possible, by only including participants who predominantly use German in their daily lives, including for number-related tasks. In contrast, our bilingual participants were living in an English-dominant society and reported frequent use of English for daily-life interactions. However, if any potential effects of language use are small, they might be less likely to arise in these group comparisons, also considering the relatively small size of the number-compatibility effect. The groups tested differed in some ways, including their highest level of education completed. This could have influenced number processing and could have masked any language-related effects, if they exist. Furthermore, our monolingual participants were functional monolinguals who only used their L1, but still had some proficiency in their L2-English. This was inevitable given the presence of English in Germany and Austria, and the compulsory English classes children receive at school. While it is unlikely these participants use English for number processing (given that they learnt to do this in German and only rarely use English in general), we cannot rule out some influence of L2-English processing in our task. Finally, group comparisons can be influenced by individual differences within the bilingual group. Indeed, exploratory analyses suggested bilingual participants varied in their daily-life use of English or German for number processing, which may impact which language they used during the task. Furthermore, even though all task instructions were provided in German only, with this being an online study, we could not control the language environment people completed the study in. Therefore, it is likely that some bilinguals completed the task in an English environment and some in a more German environment, thereby further increasing the difficulty of seeing any group-level effects.\u003c/p\u003e"},{"header":"Experiment 2","content":"\u003cp\u003eIn Experiment 2, we therefore examined the potential influence of language by manipulating language activation \u003cem\u003ewithin participants\u003c/em\u003e, rather than relying on group comparisons. This avoided the need for monolingual participants and issues associated with potential group differences beyond people’s language background. By manipulating language within the task itself we also aimed to minimise the influence of unknown individual differences in language environment while doing the study. We therefore asked participants to describe a simple scene in English or in German (each language used for half of the trials, in response to a language cue) before each number trial within the task. The use of English or German while describing the scene was anticipated to increase the activation of that language immediately prior to making the number-compatibility decision.\u003c/p\u003e\n\u003cp\u003eFollowing Grosjean’s Language Mode (1985)\u003csup\u003e44\u003c/sup\u003e, language activation falls on a continuum from predominant activation of one language to activation of multiple languages, depending on the context, conversation partners, and general environment. This shows that language is flexible and can change quickly in response to task context and demands. For example, a large number of language switching studies have used designs similar to ours, asking participants to name pictures or describe scenes in response to a language cue. These studies typically show switching costs, with participants requiring more time to name a picture when the previous picture required use of another language, compared to when the same language can be used on two consecutive pictures (e.g., \u003csup\u003e45\u003c/sup\u003e; see \u003csup\u003e46\u003c/sup\u003e, for a review). These costs are usually interpreted as (at least partly) reflecting ongoing competition between languages, with switch trials requiring bilinguals to rapidly re-balance the activation levels of each language.\u003c/p\u003e\n\u003cp\u003eFurther research has also shown that short exposure to a language context (either linguistic or non-linguistic), manipulated on a trial-by-trial basis, can indeed influence activation of the corresponding language. For example, in a simple picture naming task, de Bruin and Martin (2022)\u003csup\u003e47\u003c/sup\u003e showed Basque-Spanish participants linguistic (short sentences) and non-linguistic (flags associated with a language/culture) cues. Participants could name each picture in their language of choice and were free to ignore the “cue”. Nevertheless, participants were more likely to name the picture in the language matching the preceding cue, and were faster naming a picture when following the language indicated by that context, showing fast re-balancing of language activation and use. Similarly, trial-by-trial manipulations of language context have also been shown to influence other cognitive processes. For example, interleaving a flanker task with words presented in one or two languages showed an influence of this briefly presented language context on interference suppression (e.g.,\u003csup\u003e48 49\u003c/sup\u003e). This suggests brief exposure to language contexts varying in the activation levels of one or two languages can influence performance on other cognitive tasks.\u003c/p\u003e\n\u003cp\u003eIndeed, preceding language context has also been shown to influence arithmetic problem solving. Van Rinsveld et al. (2016)\u003csup\u003e50\u003c/sup\u003e presented multilinguals with arithmetic problems presented without context, or with each number trial preceded by a semantic judgement task that had to be completed in German or French (depending on the language of the session). Complex additions were completed faster in a language context than without context, when the task was completed in French (the participants’ less dominant language). This suggests that manipulating language activation levels throughout the task can indeed influence number processing. In Experiment 2, we therefore expected a larger compatibility effect for number comparison trials following activation of German in the preceding scene description than following activation of English.\u003c/p\u003e\n\u003ch3\u003eMethods\u003c/h3\u003e\n\u003cdiv id=\"Sec10\"\u003e\n \u003ch2\u003eParticipants\u003c/h2\u003e\n \u003cp\u003eExperiment 2 was completed by 39 German-English bilinguals (\u003cem\u003eM age\u003c/em\u003e = 29.4 years old, \u003cem\u003eSD\u003c/em\u003e = 6.8; 21 female, 18 male; 35 right-handed). The mean number of years of education was 17.0 (\u003cem\u003eSD\u003c/em\u003e = 3.5), with 8 participants completing secondary school/A-levels or lower; 17 an undergraduate degree; 11 a postgraduate degree; and 3 a doctorate. All participants provided informed consent at the start of the study and ethics approval was granted by the Ethics Committee in the Department of Psychology at the University of York. The study was pre-registered and was conducted following the Declaration of Helsinki.\u003c/p\u003e\n \u003cp\u003eBefore completing the main study, participants completed a pre-screening. This was completed by 118 potential participants. We only invited participants to take part in the main study if they met the criteria described below. Firstly, invited participants had no neurological, language, or reading difficulties, vision or hearing impairments, or colour blindness. One participant was invited despite responding “yes” to one of these questions and was excluded from analysis (thereby leading to a final sample size of 39 rather than 40). All included participants acquired German from birth and reported not being fluent in any other languages apart from German or English, or the use of any other languages during childhood or current life (i.e., none of the participants reported a language other than German or English in any of the LSBQ questions).\u003c/p\u003e\n \u003cp\u003eIn our pre-registration, we pre-registered strict inclusion criteria regarding the use of English, including the requirement for participants to be living in an English-speaking country, for them to not select an option lower than “frequently” for weekly English use, or to have an average score below 2.5 across the LSBQ. However, while these requirements were necessary for Experiment 1, they were deemed less essential for Experiment 2, where language was manipulated within participants through the experimental manipulation. We therefore also included participants who used English less frequently than pre-registered. Similar to Experiment 1, we also included two participants with slightly more frequent English use during childhood, as this reflected English use as a teenager. As shown in Table 3, participants had a high proficiency in both languages, as measured through both self-reports and the LexTALE vocabulary scores. All participants, as pre-registered, self-reported their German proficiency as at least 8/10 and their English proficiency as at least 6/10, and had a LexTALE score above 60% in both languages. Participants predominantly used German during their childhood, but currently used English more regularly (all participants reported at least using English sometimes). Participants on average also reported occasional to regular language switching (on a scale from 1 = never to 7 = very frequently; note that this scale differs from Experiment 1) on a daily basis (\u003cem\u003eM\u003c/em\u003e = 4.7, \u003cem\u003eSD\u003c/em\u003e = 1.8), within a conversation (\u003cem\u003eM\u003c/em\u003e = 3.4, \u003cem\u003eSD\u003c/em\u003e = 1.7), and within a sentence (\u003cem\u003eM\u003c/em\u003e = 3.0, \u003cem\u003eSD\u003c/em\u003e = 1.9).\u003c/p\u003e\n \u003cp\u003eAll participants learnt mathematics in German during elementary school and (with the exception of one, who reported using English) during secondary school, and all first learnt numbers in German. Where participants discussed mathematics related topics at University, 26 did so in German and 5 in English. At elementary school, most participants reported studying maths for 2–3 hours (16) or 3 + hours (19; 4 participants reported 1–2 hours). In secondary school, most participants (29) reported 3 + hours of maths education (9 2–3 hours; 1 1–2 hours). Twenty-one participants reported studying additional subjects related to mathematics (e.g., statistics) at school; 19 reported studying a maths-related degree at university; and 24 reported using numbers in their jobs. In terms of current use, 35 participants reported using German for tasks such as subtractions (3 reported English and 1 both); 37 participants reported using German for remembering long strings of digits such as phone numbers (2 reported using English). Apart from one person residing in the UK, all participants were living in Germany or Austria. No participants reported learning mathematics, being taught in, or working with numbers in a language other than English or German.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec11\"\u003e\n \u003ch2\u003eDesign\u003c/h2\u003e\n \u003cp\u003eThe design was similar to Experiment 1, with one change: Language was manipulated within participants (the scene before each number trial was described in German or in English), rather than manipulated at the group level between participants.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec12\"\u003e\n \u003ch2\u003eProcedure and Materials\u003c/h2\u003e\n \u003cp\u003eExperiment 2 was run in the same way as Experiment 1, with participants completing a pre-screening questionnaire and the LexTALE before being invited to complete the main task. The main task was the same number magnitude comparison task as in Experiment 1. However, this time, participants were asked to describe a simple scene in German or in English prior to each number magnitude comparison. Half of the scenes had to be described in German and half in English. Participants completed 48 compatible number comparisons (24 preceded by each language); 48 incompatible comparisons (24 preceded by each language); and 24 filler trials (12 per language). Half of the scene descriptions were switch trials (different language had to be used than in the previous scene) and half were non-switch trials (same language had to be used as in the previous scene). These were distributed equally across the compatible and incompatible number trials. Each scene was named five times by each participant, twice preceding a compatible and twice preceding an incompatible trial, and once preceding a filler trial. Two lists were used to counterbalance the number of times each scene was described in German or English (two or three times per language) across participants. Each number pair was presented twice in the main task. With the exception of four pairs, each number pair was preceded once by English and once by a German scene description.\u003c/p\u003e\n \u003cp\u003eFor the scene description task, we selected 25 scenes from previous papers \u003csup\u003e51,52\u003c/sup\u003e. Scenes were selected that could be easily described in one sentence, in German or English. Due to the lexical similarities between the two languages, cognates (words overlapping in form and meaning) could not be avoided.\u003c/p\u003e\n \u003cp\u003ePrior to starting the task, participants saw an example of a scene and an example of a number comparison trial. Participants were instructed to describe the scenes in a simple way, using a subject-verb-object order to ensure consistency between languages. They were then asked to describe three practice scenes. Following this, they completed three practice number comparison trials, during which they received feedback on their accuracy. Finally, participants completed three practice trials during which they alternated between the scene description and the number comparison trials, always starting with the scene description as in the main task. The main task consisted of six blocks, separated by breaks. The first trial after a break was always a filler trial.\u003c/p\u003e\n \u003cp\u003eOn each trial, the language cue (German or British flag) was first shown in the centre of the screen for 500ms. This was followed by the presentation of the scene. Participants were given five seconds to describe the scene, after which the trial automatically continued. After showing a blank (black) screen for 200ms, the number comparison was presented. This stayed on the screen until a response was given, or for 3000ms if no response was given.\u003c/p\u003e\n\u003c/div\u003e\n\u003cdiv id=\"Sec13\"\u003e\n \u003ch2\u003eData analysis\u003c/h2\u003e\n \u003cp\u003eThe task data are available at https://osf.io/gjnwt/.\u003c/p\u003e\n \u003cp\u003eAs first checks, we examined accuracy in the number task and in the scene description task. Accuracy in the main number task was high (above 91% for all participants, \u003cem\u003eM\u003c/em\u003e = 97.0%, \u003cem\u003eSD\u003c/em\u003e = 2.4; across compatible and incompatible trials). Similarly, accuracy in the scene description task across all number trials was high in both L1-German (\u003cem\u003eM\u003c/em\u003e = 96.6%, \u003cem\u003eSD\u003c/em\u003e = 3.5; all participants over 86% correct) and L2-English (\u003cem\u003eM\u003c/em\u003e = 97.3%, \u003cem\u003eSD\u003c/em\u003e = 3.3; all participants over 86% correct; recordings were not audible and therefore not scored for three participants, who were still included in the main analysis). These accuracy scores show that participants indeed paid attention to the task and activated the intended language on each trial. We also computed participants’ naming onset times for each scene (i.e., the delay between the start of scene presentation and the start of naming, using CheckVocal\u003csup\u003e53\u003c/sup\u003e). We then computed the naming onset time difference between switch trials (trial requiring naming in a different language as the previous trial) and non-switch trials (trial requiring naming in the same language as the previous trial). We did this for English and German separately (e.g., the English switch cost would be the naming time difference between a switch from German to English, and naming two trials in a row in English). As expected, a switch cost was observed in both languages (\u003cem\u003eM English switch cost\u003c/em\u003e = 43ms, \u003cem\u003eSD\u003c/em\u003e = 150ms, \u003cem\u003eM German switch cost\u003c/em\u003e = 41ms, \u003cem\u003eSD\u003c/em\u003e = 116; including trials included in the main number analysis), suggesting the languages were activated as intended during the scene-description trials and participants had to re-balance language activation during the scene descriptions when switching to a new language.\u003c/p\u003e\n \u003cp\u003eThe main-task analyses were conducted similarly to Experiment 1. 0.5% of correct trials were removed as RT outliers. The RT analysis only included correctly answered number trials that were preceded by a correctly described scene. Type (compatible = -0.5; incompatible = 0.5) and Language (German = -0.5; English = 0.5) were included as fixed effects. Participant, scene picture name, and number item (number pair shown on screen) were included as random effects. Slopes were included for Language and Type for participant and scene picture-name random effects. However, given that a few items did not occur across both languages, no random slopes were included for the number item random effect. The initial model did not converge. We therefore first removed correlations between slopes and intercepts and then (following the pre-registration), the scene picture-name slope explaining the lowest amount of variance. The final model included all random-effect intercepts, all slopes for the participant random effect, and the trial type + trial type x language slope for the picture-naming random effect.\u003c/p\u003e\n\u003c/div\u003e\n\u003ch3\u003eResults and discussion\u003c/h3\u003e\n\u003cp\u003eThe LME analysis showed that the Type effect did not reach significance (β = 0.047, \u003cem\u003eSE\u003c/em\u003e = 0.030, \u003cem\u003et\u003c/em\u003e = 1.551, \u003cem\u003ep\u003c/em\u003e = .128), although the numerical pattern went in the expected direction with somewhat faster responses to compatible number pairs (\u003cem\u003eM\u003c/em\u003e = 766ms, \u003cem\u003eSD\u003c/em\u003e = 126) than to incompatible pairs (\u003cem\u003eM\u003c/em\u003e = 799ms, \u003cem\u003eSD\u003c/em\u003e = 126), see Fig. 3. There was no main effect of Language (β = 0.012, \u003cem\u003eSE\u003c/em\u003e = 0.008, \u003cem\u003et\u003c/em\u003e = 1.511, \u003cem\u003ep\u003c/em\u003e = .139), suggesting naming the scene in L1-German or L2-English did not influence RTs on the number-comparison task (after L1-German naming: \u003cem\u003eM\u003c/em\u003e = 777ms, \u003cem\u003eSD\u003c/em\u003e = 125; after L2-English naming: \u003cem\u003eM\u003c/em\u003e = 788ms, \u003cem\u003eSD\u003c/em\u003e = 129). The number compatibility effect was numerically a little smaller for after naming in the L1-German (\u003cem\u003eM\u003c/em\u003e = 26ms, \u003cem\u003eSD\u003c/em\u003e = 48) than after L2-English naming (\u003cem\u003eM\u003c/em\u003e = 41ms, \u003cem\u003eSD\u003c/em\u003e = 46) but this did not reach significance (β = 0.017, \u003cem\u003eSE\u003c/em\u003e = 0.014, \u003cem\u003et\u003c/em\u003e = 1.287, \u003cem\u003ep\u003c/em\u003e = .211). If anything, the observed pattern went in the opposite direction than expected (i.e., a slightly smaller compatibility effect after German use, than after English use).\u003c/p\u003e\n\u003cp\u003eThe ANOVA showed the same patterns, but the main effect of Trial Type did reach significance (\u003cem\u003eF\u003c/em\u003e(1,38) = 47.710, \u003cem\u003ep\u003c/em\u003e \u0026lt; .001, ηp² = 0.557). There was no significant effect of Language (\u003cem\u003eF\u003c/em\u003e(1,38) = 2.555, \u003cem\u003ep\u003c/em\u003e = .118, ηp² = 0.063), and no significant interaction between Language and Type (\u003cem\u003eF\u003c/em\u003e(1,38) = 1.978, \u003cem\u003ep\u003c/em\u003e = .168, ηp² = 0.049). The Bayesian analysis provided some (but weak) evidence for the null hypothesis (i.e., it did not support the presence of a language effect on the number-compatibility effect, BF\u003csub\u003e01\u003c/sub\u003e = 2.45, error% = 1.03).\u003c/p\u003e\n\u003cp\u003eOne further exploratory analysis was conducted (using LME analyses). If language activation takes time to build up or strengthens with time, we might see a larger effect of naming language on the number task when the same language has been used for at least two scenes. We therefore ran an additional analysis including language trial type (switch/non-switch, with non-switch trials reflecting that the scene was named in the same language as the previous scene). No significant effects of or interactions with language trial type were found in terms of number-task RTs (all \u003cem\u003ep\u003c/em\u003es \u0026gt; .25). This suggests any influence of language on the number task did not increase when the same language was used multiple times in a row and potentially resulted in stronger activation of that language.\u003c/p\u003e\n"},{"header":"General discussion","content":"\u003cp\u003eIn this paper we were interested in the effect a bilingual’s two languages can have on their processing of (non-linguistic) Arabic digits. We tested L1-German speakers, because the compatibility effect in number comparison of Arabic double-digit numbers is significantly larger in participants with a language background that has inverted number words (e.g. German) than in participants whose L1 has (mostly) non-inverted number words (e.g. English). For the same reason, we chose bilingual participants with English as L2.\u003c/p\u003e\u003cp\u003eIn Experiment 1, both German-monolinguals and German-English bilinguals showed a significant compatibility effect with longer response times to unit-decade incompatible pairs of Arabic digits. While the compatibility effect was numerically smaller in the bilingual group, overall, contrary to our prediction, there was no significant difference in the size of the compatibility effect between the two groups (with some support from the Bayesian analysis for the null).\u003c/p\u003e\u003cp\u003eBetween-group comparisons can be hampered by group differences beyond the main factor of interest (here L2). Thus, Experiment 2 employed a within-participant manipulation to avoid potential group differences. The same number comparison task was used with German-English bilinguals, but now participants described scenes in either English or German before each number-comparison trial. While participants, as intended, activated the required language on each trial (and showed a switching cost reflecting re-balancing that language activation), there was no significant difference (with some support from the Bayesian analysis) in the size of the compatibility effect when participants had just activated German versus when they had just activated English. Taken together, our findings suggest that L2-English use in German-English bilinguals has little influence on number comparison with Arabic digits.\u003c/p\u003e\u003cp\u003ePrevious evidence showed that the size of the compatibility effect depends on the L1 spoken \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e,\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e. This strongly suggests that number comparison of double-digit Arabic numbers often activates the indirect processing path, with the verbal code used depending on the languages spoken. The Bilingual Triple Code Model (BTCM\u003csup\u003e\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u003c/sup\u003e) proposed that in bilinguals, a second indirect path exists in the form of a verbal L2 code. Our results provide no direct evidence for the activation of a second indirect, verbal L2 code, path between the visual and approximate number codes. There are various reasons that can explain these findings.\u003c/p\u003e\u003cp\u003eFirst, findings from number word priming in bilinguals (e.g. \u003csup\u003e36,54\u003c/sup\u003e ) suggest the verbal L2 code pathway might be weaker (or even absent) than the proposed L1 code pathway in the BTCM. For example, Lachelin et al. (2024)\u003csup\u003e\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e\u003c/sup\u003e showed the priming distance effect in proficient German-French bilinguals was weaker in French (L2) than in German (L1), thus emphasizing a stronger reliance on the verbal L1 code even when L2 number words were used as primes. Their bilingual participants grew up speaking Luxembourgish at home (Home language), learnt German (L1 in their study) early and had numeracy instruction in German at school from grade 1–6 and then in French from grade 7 onwards. Lachelin et al. therefore suggested that the earlier L1 than L2 age of acquisition, as well as the closer language proximity between L1-German and their Home language might have resulted in a weaker activation of the verbal L2 code, compared to the L1. Other studies also report language of schooling as a strong predictor of Arabic digit naming (e.g.\u003csup\u003e55\u003c/sup\u003e).\u003c/p\u003e\u003cp\u003eIn line with this, our bilingual participants all had an earlier age of acquisition for German (L1) than English (L2), acquired number words first in German (L1) and had most of their schooling in German (L1). It is therefore possible that for these bilinguals, the verbal L1 code was so strongly established from a young age that all number processing continues to go through the verbal L1 code (rather than, or to a much lower extent, the L2 code). Further research is therefore needed to better understand the potential role of the language of schooling, in bilinguals living in bilingual societies as well as in bilinguals who grew up in a different language environment than the one they are currently living in (as was the case for many of our bilinguals). Further research will also need to further study the role of current daily-life language use in relation to number processing. While our exploratory analyses in Experiment 1 did not show a significant relationship between the compatibility effect and the participant’s daily-life language use for number processing (potentially due to the small sample size and the lack of a detailed questionnaire about current language use for number processing), the numerical pattern suggested further research is warranted to better understand a possible effect of daily-life language use for number processing on the activation of an indirect route through the L1 or L2.\u003c/p\u003e\u003cp\u003e Participants in Experiment 1 might have differed in the extent to which they activated the indirect route and possibly the verbal L2 code. We therefore manipulated L1 or L2 activation directly in Experiment 2. Despite the language manipulation “working” as intended (i.e., high accuracy and switching costs), there was no evidence for the activation of the second indirect path between the visual and approximate number codes via the verbal L2 code during number comparison, even when participants had just described a visual scene in L2. This argues further for a preference of L1 for number processing in our bilinguals, which might have been so strong that this was not overruled by a temporary boost of L2.\u003c/p\u003e\u003cp\u003e A second explanation, specifically concerning Experiment 2, is that perhaps a longer L2 boost is needed for the indirect path through the verbal L2 code to be used. In our Experiment, language activation was manipulated on a trial-by-trial level. While this has successfully been shown to influence language-related mechanisms in previous literature (e.g.\u003csup\u003e47\u003c/sup\u003e), such short-lived activation change might not be sufficient to influence other cognitive processes. Indeed, Van Rinsveld and colleagues (2016)\u003csup\u003e\u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e50\u003c/span\u003e\u003c/sup\u003e showed an impact of language context on arithmetic tasks but manipulated language through longer contextual blocks. While exploratory analyses suggested additional time of using the L2 (through multiple consecutive scene descriptions in that language) did not increase its influence on the number processing task, it remains possible that such L2 effects only arise after longer (e.g., at least 5–10 minutes) blocks of L2 use.\u003c/p\u003e\u003cp\u003e Finally, a third potential explanation for our findings could be that participants in Experiment 1 did not activate number words at all while performing the number comparison task with Arabic digits. While this would be in contrast to findings from several other studies \u003csup\u003e\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e,\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e\u003c/sup\u003e, Macizo et al. (2011) \u003csup\u003e\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e\u003c/sup\u003e compared the compatibility effect between Spanish-English bilinguals who learnt number words first in Spanish and German-English bilingual who learnt number words first in German and found no significant differences in the size of their compatibility effect for Arabic digits. If their bilinguals had activated their indirect path via the verbal L1 code, the size of the compatibility effect should have differed between these two groups. Therefore, a potential interpretation of our findings remains that participants did not use number word activation during the task.\u003c/p\u003e\u003cp\u003eOur findings are in contrast to findings on the influence of spatial reference frames on number processing. In a previous study Moeller et al. (2015)\u003csup\u003e\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e\u003c/sup\u003e have shown that both number word inversion as well as habitual reading/writing direction (left-to-right vs right-to-left) influence the compatibility effect in symbolic number comparison. Interestingly, and in contrast to our findings, when spatial reference frames were changed in bilingual participants it influenced their number processing, e.g. \u003csup\u003e56,57\u003c/sup\u003e.This is in line with other findings \u003csup\u003e\u003cspan additionalcitationids=\"CR59 CR60\" citationid=\"CR58\" class=\"CitationRef\"\u003e58\u003c/span\u003e–\u003cspan citationid=\"CR61\" class=\"CitationRef\"\u003e61\u003c/span\u003e\u003c/sup\u003e and highlights a clear difference between the influence of spatial versus linguistic context on symbolic number processing. While spatial reference frames are constructed temporarily and their influence on number representations seem to be flexible and context-dependent in bilinguals, the linguistic influence of L1, be it because of the earlier onset of acquisition or because it was the language of (mathematical) instruction, seems to have a more permanent dominant effect on symbolic number processing in bilinguals. Future studies need to test this conclusion more robustly by, for example, manipulating the extent of bilinguals’ current use of number words in L2.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eFunding declaration\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis work was supported by the Experimental Psychology Society through an EPS undergraduate research bursary for Veniamin Shiron. This research was partially supported by the Research council of Norway (project number 331640) through its centre of excellence scheme.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003cstrong\u003eAcknowledgements\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eWe would like to thank Katharina Sautter for help with data collection and Silvia Gennari for sharing the images we used in Experiment 2.\u003c/p\u003e\n\u003cp\u003e\u0026nbsp;\u003cstrong\u003eAuthor contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eSilke M G\u0026ouml;bel:\u003csup\u003e\u0026nbsp;\u003c/sup\u003eConceptualization; Funding acquisition; Methodology; Project administration; Supervision; Roles/Writing - original draft, review \u0026amp; editing. Veniamin Shiron: Funding acquisition; Project administration; Investigation; Data Curation; Roles/Writing - review \u0026amp; editing; Miriam Tucker: Roles/Writing - original draft, review \u0026amp; editing; Angela De Bruin: Conceptualization; Funding acquisition; Methodology; Formal analysis; Project administration; Supervision; Roles/Writing - original draft, review \u0026amp; editing;\u003c/p\u003e\n\u003cp\u003e\u0026nbsp;\u003cstrong\u003eData availability statement\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe main task data are available at https://osf.io/gjnwt/.\u003c/p\u003e\n\u003cp\u003e\u0026nbsp;\u003cstrong\u003eCompeting interests\u003cbr\u003e\u003c/strong\u003eThe author(s) declare no competing interests.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eChrisomalis, S. Reckonings: Numerals, cognition, and history. (2020).\u003c/li\u003e\n\u003cli\u003eMoeller, K., Shaki, S., G\u0026ouml;bel, S. M. \u0026amp; Nuerk, H.-C. Language influences number processing--a quadrilingual study. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e136\u003c/strong\u003e, 150\u0026ndash;155 (2015).\u003c/li\u003e\n\u003cli\u003eSchiltz, C., Lachelin, R., Hilger, V. \u0026amp; Marinova, M. Thinking about numbers in different tongues: An overview of the influences of multilingualism on numerical and mathematical competencies. \u003cem\u003ePsychol. Res.\u003c/em\u003e \u003cstrong\u003e88\u003c/strong\u003e, 2416\u0026ndash;2431 (2024).\u003c/li\u003e\n\u003cli\u003eSteiner, A. F. \u003cem\u003eet al.\u003c/em\u003e Language Effects in Early Development of Number Writing and Reading. \u003cem\u003eJournal of Numerical Cognition\u003c/em\u003e \u003cstrong\u003e7\u003c/strong\u003e, 368\u0026ndash;387 (2021).\u003c/li\u003e\n\u003cli\u003eZuber, J., Pixner, S., Moeller, K. \u0026amp; Nuerk, H.-C. On the language specificity of basic number processing: transcoding in a language with inversion and its relation to working memory capacity. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e102\u003c/strong\u003e, 60\u0026ndash;77 (2009).\u003c/li\u003e\n\u003cli\u003ePixner, S. \u003cem\u003eet al.\u003c/em\u003e One language, two number-word systems and many problems: numerical cognition in the Czech language. \u003cem\u003eRes. Dev. Disabil.\u003c/em\u003e \u003cstrong\u003e32\u003c/strong\u003e, 2683\u0026ndash;2689 (2011).\u003c/li\u003e\n\u003cli\u003eMarchand, E., Wade, S., Sullivan, J. \u0026amp; Barner, D. Language-specific numerical estimation in bilingual children. (2019) doi:10.31234/osf.io/afdg8.\u003c/li\u003e\n\u003cli\u003eSchneider, R. M. \u003cem\u003eet al.\u003c/em\u003e Do children use language structure to discover the recursive rules of counting? \u003cem\u003eCogn. Psychol.\u003c/em\u003e \u003cstrong\u003e117\u003c/strong\u003e, 101263 (2020).\u003c/li\u003e\n\u003cli\u003eSlusser, E., Ribner, A. \u0026amp; Shusterman, A. Language counts: Early language mediates the relationship between parent education and children\u0026rsquo;s math ability. \u003cem\u003eDev. Sci.\u003c/em\u003e \u003cstrong\u003e22\u003c/strong\u003e, e12773 (2019).\u003c/li\u003e\n\u003cli\u003eLe Corre, M., Li, P., Huang, B. H., Jia, G. \u0026amp; Carey, S. Numerical morphology supports early number word learning: Evidence from a comparison of young Mandarin and English learners. \u003cem\u003eCogn. Psychol.\u003c/em\u003e \u003cstrong\u003e88\u003c/strong\u003e, 162\u0026ndash;186 (2016).\u003c/li\u003e\n\u003cli\u003eSarnecka, B. W. \u0026amp; Lee, M. D. Levels of number knowledge during early childhood. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e103\u003c/strong\u003e, 325\u0026ndash;337 (2009).\u003c/li\u003e\n\u003cli\u003eBarner, D., Libenson, A., Cheung, P. \u0026amp; Takasaki, M. Cross-linguistic relations between quantifiers and numerals in language acquisition: evidence from Japanese. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e103\u003c/strong\u003e, 421\u0026ndash;440 (2009).\u003c/li\u003e\n\u003cli\u003eSarnecka, B. W., Kamenskaya, V. G., Yamana, Y., Ogura, T. \u0026amp; Yudovina, Y. B. From grammatical number to exact numbers: early meanings of \u0026lsquo;one\u0026rsquo;, \u0026lsquo;two\u0026rsquo;, and \u0026lsquo;three\u0026rsquo; in English, Russian, and Japanese. \u003cem\u003eCogn. Psychol.\u003c/em\u003e \u003cstrong\u003e55\u003c/strong\u003e, 136\u0026ndash;168 (2007).\u003c/li\u003e\n\u003cli\u003eAunio, P., Aubrey, C., Godfrey, R., Pan, Y. \u0026amp; Liu, Y. Children\u0026rsquo;s early numeracy in England, Finland and People\u0026rsquo;s Republic of China. \u003cem\u003eInt. J. Early Years Educ.\u003c/em\u003e \u003cstrong\u003e16\u003c/strong\u003e, 203\u0026ndash;221 (2008).\u003c/li\u003e\n\u003cli\u003eCankaya, O., LeFevre, J.-A. \u0026amp; Dunbar, K. The role of number naming systems and numeracy experiences in children\u0026rsquo;s rote counting: Evidence from Turkish and Canadian children. \u003cem\u003eLearn. Individ. Differ.\u003c/em\u003e \u003cstrong\u003e32\u003c/strong\u003e, 238\u0026ndash;245 (2014).\u003c/li\u003e\n\u003cli\u003eLefevre, J.-A., Clarke, T. \u0026amp; Stringer, A. P. Influences of Language and Parental Involvement on the Development of Counting Skills: Comparisons of French- and English-speaking Canadian Children. \u003cem\u003eEarly Child Dev. Care\u003c/em\u003e \u003cstrong\u003e172\u003c/strong\u003e, 283\u0026ndash;300 (2002).\u003c/li\u003e\n\u003cli\u003eMiller, K. F., Smith, C. M., Zhu, J. \u0026amp; Zhang, H. Preschool Origins of Cross-National Differences in Mathematical Competence: The Role of Number-Naming Systems. \u003cem\u003ePsychol. Sci.\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, 56\u0026ndash;60 (1995).\u003c/li\u003e\n\u003cli\u003eG\u0026ouml;bel, S. M., Moeller, K., Pixner, S., Kaufmann, L. \u0026amp; Nuerk, H.-C. Language affects symbolic arithmetic in children: the case of number word inversion. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e119\u003c/strong\u003e, 17\u0026ndash;25 (2014).\u003c/li\u003e\n\u003cli\u003eL\u0026ecirc;, M.-L. T. \u0026amp; No\u0026euml;l, M.-P. Transparent number-naming system gives only limited advantage for preschooler\u0026rsquo;s numerical development: Comparisons of Vietnamese and French-speaking children. \u003cem\u003ePLoS One\u003c/em\u003e \u003cstrong\u003e15\u003c/strong\u003e, e0243472 (2020).\u003c/li\u003e\n\u003cli\u003eMark, W. \u0026amp; Dowker, A. Linguistic influence on mathematical development is specific rather than pervasive: revisiting the Chinese Number Advantage in Chinese and English children. \u003cem\u003eFront. Psychol.\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, 203 (2015).\u003c/li\u003e\n\u003cli\u003eClayton, F. J. \u003cem\u003eet al.\u003c/em\u003e Two-digit number writing and arithmetic in Year 1 children: Does number word inversion matter? \u003cem\u003eCogn. Dev.\u003c/em\u003e \u003cstrong\u003e56\u003c/strong\u003e, 100967 (2020).\u003c/li\u003e\n\u003cli\u003eBahnmueller, J., Huber, S., Nuerk, H.-C., G\u0026ouml;bel, S. M. \u0026amp; Moeller, K. Processing multi-digit numbers: a translingual eye-tracking study. \u003cem\u003ePsychol. Res.\u003c/em\u003e \u003cstrong\u003e80\u003c/strong\u003e, 422\u0026ndash;433 (2016).\u003c/li\u003e\n\u003cli\u003eSteiner, A. F. \u003cem\u003eet al.\u003c/em\u003e Twenty-four or four-and-twenty: Language modulates cross-modal matching for multidigit numbers in children and adults. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e202\u003c/strong\u003e, 104970 (2021).\u003c/li\u003e\n\u003cli\u003eNuerk, H. C., Weger, U. \u0026amp; Willmes, K. Decade breaks in the mental number line? Putting the tens and units back in different bins. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e82\u003c/strong\u003e, B25\u0026ndash;33 (2001).\u003c/li\u003e\n\u003cli\u003eMacizo, P. \u0026amp; Herrera, A. Two-digit number comparison: Decade-unit and unit-decade produce the same compatibility effect with number words. \u003cem\u003eCan. J. Exp. Psychol.\u003c/em\u003e \u003cstrong\u003e64\u003c/strong\u003e, 17\u0026ndash;24 (2010).\u003c/li\u003e\n\u003cli\u003eMacizo, P., Herrera, A., Paolieri, D. \u0026amp; Rom\u0026aacute;n, P. Is there cross-language modulation when bilinguals process number words? \u003cem\u003eAppl. Psycholinguist.\u003c/em\u003e \u003cstrong\u003e31\u003c/strong\u003e, 651\u0026ndash;669 (2010).\u003c/li\u003e\n\u003cli\u003eNuerk, H., Weger, U. \u0026amp; Willmes, K. Language effects in magnitude comparison: Small, but not irrelevant. \u003cem\u003eBrain Lang.\u003c/em\u003e \u003cstrong\u003e92\u003c/strong\u003e, 262\u0026ndash;277 (2005).\u003c/li\u003e\n\u003cli\u003eBahnmueller, J., Maier, C. A., G\u0026ouml;bel, S. M. \u0026amp; Moeller, K. Direct evidence for linguistic influences in two-digit number processing. \u003cem\u003eJ. Exp. Psychol. Learn. Mem. Cogn.\u003c/em\u003e \u003cstrong\u003e45\u003c/strong\u003e, 1142\u0026ndash;1150 (2019).\u003c/li\u003e\n\u003cli\u003eGrosjean, F. \u003cem\u003eBilingual\u003c/em\u003e. (Harvard University Press, London, England, 2010). doi:10.4159/9780674056459.\u003c/li\u003e\n\u003cli\u003eSpivey, M. J. \u0026amp; Marian, V. Cross talk between native and second languages: Partial activation of an irrelevant lexicon. \u003cem\u003ePsychol. Sci.\u003c/em\u003e \u003cstrong\u003e10\u003c/strong\u003e, 281\u0026ndash;284 (1999).\u003c/li\u003e\n\u003cli\u003eVenkatraman, V., Siong, S. C., Chee, M. W. L. \u0026amp; Ansari, D. Effect of language switching on arithmetic: a bilingual FMRI study. \u003cem\u003eJ. Cogn. Neurosci.\u003c/em\u003e \u003cstrong\u003e18\u003c/strong\u003e, 64\u0026ndash;74 (2006).\u003c/li\u003e\n\u003cli\u003eSpelke, E. S. \u0026amp; Tsivkin, S. Language and number: a bilingual training study. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e78\u003c/strong\u003e, 45\u0026ndash;88 (2001).\u003c/li\u003e\n\u003cli\u003eSalillas, E. \u0026amp; Wicha, N. Y. Y. Early learning shapes the memory networks for arithmetic: evidence from brain potentials in bilinguals. \u003cem\u003ePsychol. Sci.\u003c/em\u003e \u003cstrong\u003e23\u003c/strong\u003e, 745\u0026ndash;755 (2012).\u003c/li\u003e\n\u003cli\u003eGarcia, O., Faghihi, N., Raola, A. R. \u0026amp; Vaid, J. Factors influencing bilinguals\u0026rsquo; speed and accuracy of number judgments across languages: A meta-analytic review. \u003cem\u003eJ. Mem. Lang.\u003c/em\u003e \u003cstrong\u003e118\u003c/strong\u003e, 104211 (2021).\u003c/li\u003e\n\u003cli\u003eDehaene, S. Varieties of numerical abilities. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e44\u003c/strong\u003e, 1\u0026ndash;42 (1992).\u003c/li\u003e\n\u003cli\u003eLachelin, R., Marinova, M., Reynvoet, B. \u0026amp; Schiltz, C. Weaker semantic priming effects with number words in the second language of math learning. \u003cem\u003eJ. Exp. Psychol. Gen.\u003c/em\u003e \u003cstrong\u003e153\u003c/strong\u003e, 706\u0026ndash;719 (2024).\u003c/li\u003e\n\u003cli\u003eVan Rinsveld, A., Schiltz, C., Landerl, K., Brunner, M. \u0026amp; Ugen, S. Speaking two languages with different number naming systems: What implications for magnitude judgments in bilinguals at different stages of language acquisition? \u003cem\u003eCogn. Process.\u003c/em\u003e \u003cstrong\u003e17\u003c/strong\u003e, 225\u0026ndash;241 (2016).\u003c/li\u003e\n\u003cli\u003eMacizo, P., Herrera, A., Rom\u0026aacute;n, P. \u0026amp; Mart\u0026iacute;n, M. C. The processing of two-digit numbers in bilinguals. \u003cem\u003eBr. J. Psychol.\u003c/em\u003e \u003cstrong\u003e102\u003c/strong\u003e, 464\u0026ndash;477 (2011).\u003c/li\u003e\n\u003cli\u003eTomoschuk, B., Ferreira, V. S. \u0026amp; Gollan, T. H. When a seven is not a seven: Self-ratings of bilingual language proficiency differ between and within language populations. \u003cem\u003eBiling. (Camb. Engl.)\u003c/em\u003e \u003cstrong\u003e22\u003c/strong\u003e, 516\u0026ndash;536 (2019).\u003c/li\u003e\n\u003cli\u003eLemh\u0026ouml;fer, K. \u0026amp; Broersma, M. Introducing LexTALE: a quick and valid Lexical Test for Advanced Learners of English. \u003cem\u003eBehav. Res. Methods\u003c/em\u003e \u003cstrong\u003e44\u003c/strong\u003e, 325\u0026ndash;343 (2012).\u003c/li\u003e\n\u003cli\u003eAnderson, J. A. E., Mak, L., Keyvani Chahi, A. \u0026amp; Bialystok, E. The language and social background questionnaire: Assessing degree of bilingualism in a diverse population. \u003cem\u003eBehav. Res. Methods\u003c/em\u003e \u003cstrong\u003e50\u003c/strong\u003e, 250\u0026ndash;263 (2018).\u003c/li\u003e\n\u003cli\u003eAnwyl-Irvine, A. L., Massonni\u0026eacute;, J., Flitton, A., Kirkham, N. \u0026amp; Evershed, J. K. Gorilla in our midst: An online behavioral experiment builder. \u003cem\u003eBehav. Res. Methods\u003c/em\u003e \u003cstrong\u003e52\u003c/strong\u003e, 388\u0026ndash;407 (2020).\u003c/li\u003e\n\u003cli\u003eGrange, J. A. trimr: An implementation of common response time trimming methods. R package version 1.0.1. \u003cem\u003ehttps://cran.r-project.org/web/packages/trimr/index.html\u003c/em\u003e (2015).\u003c/li\u003e\n\u003cli\u003eGrosjean, F. The bilingual as a competent but specific speaker‐hearer. \u003cem\u003eJ. Multiling. Multicult. Dev.\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, 467\u0026ndash;477 (1985).\u003c/li\u003e\n\u003cli\u003eMeuter, R. F. I. \u0026amp; Allport, A. Bilingual language switching in naming: Asymmetrical costs of language selection. \u003cem\u003eJ. Mem. Lang.\u003c/em\u003e \u003cstrong\u003e40\u003c/strong\u003e, 25\u0026ndash;40 (1999).\u003c/li\u003e\n\u003cli\u003eDeclerck, M. \u0026amp; Philipp, A. M. A review of control processes and their locus in language switching. \u003cem\u003ePsychon. Bull. Rev.\u003c/em\u003e \u003cstrong\u003e22\u003c/strong\u003e, 1630\u0026ndash;1645 (2015).\u003c/li\u003e\n\u003cli\u003ede Bruin, A. \u0026amp; Martin, C. D. Perro or txakur? Bilingual language choice during production is influenced by personal preferences and external primes. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e222\u003c/strong\u003e, 104995 (2022).\u003c/li\u003e\n\u003cli\u003eWu, Y. J. \u0026amp; Thierry, G. Fast modulation of executive function by language context in bilinguals. \u003cem\u003eJ. Neurosci.\u003c/em\u003e \u003cstrong\u003e33\u003c/strong\u003e, 13533\u0026ndash;13537 (2013).\u003c/li\u003e\n\u003cli\u003eJiao, L., Liu, C., de Bruin, A. \u0026amp; Chen, B. Effects of language context on executive control in unbalanced bilinguals: An ERPs study. \u003cem\u003ePsychophysiology\u003c/em\u003e \u003cstrong\u003e57\u003c/strong\u003e, e13653 (2020).\u003c/li\u003e\n\u003cli\u003eVan Rinsveld, A., Schiltz, C., Brunner, M., Landerl, K. \u0026amp; Ugen, S. Solving arithmetic problems in first and second language: Does the language context matter? \u003cem\u003eLearning and Instruction\u003c/em\u003e \u003cstrong\u003e42\u003c/strong\u003e, 72\u0026ndash;82 (2016).\u003c/li\u003e\n\u003cli\u003eGennari, S. P., Mirković, J. \u0026amp; Macdonald, M. C. Animacy and competition in relative clause production: a cross-linguistic investigation. \u003cem\u003eCogn. Psychol.\u003c/em\u003e \u003cstrong\u003e65\u003c/strong\u003e, 141\u0026ndash;176 (2012).\u003c/li\u003e\n\u003cli\u003eHumphreys, G. F., Mirković, J. \u0026amp; Gennari, S. P. Similarity-based competition in relative clause production and comprehension. \u003cem\u003eJ. Mem. Lang.\u003c/em\u003e \u003cstrong\u003e89\u003c/strong\u003e, 200\u0026ndash;221 (2016).\u003c/li\u003e\n\u003cli\u003eProtopapas, A. CheckVocal: a program to facilitate checking the accuracy and response time of vocal responses from DMDX. \u003cem\u003eBehav. Res. Methods\u003c/em\u003e \u003cstrong\u003e39\u003c/strong\u003e, 859\u0026ndash;862 (2007).\u003c/li\u003e\n\u003cli\u003eDuyck, W., Depestel, I., Fias, W. \u0026amp; Reynvoet, B. Cross-lingual numerical distance priming with second-language number words in native- to third-language number word translation. \u003cem\u003eQ. J. Exp. Psychol. (Hove)\u003c/em\u003e \u003cstrong\u003e61\u003c/strong\u003e, 1281\u0026ndash;1290 (2008).\u003c/li\u003e\n\u003cli\u003eChincotta, D. \u0026amp; Underwood, G. Bilingual memory span advantage for Arabic numerals over digit words. \u003cem\u003eBr. J. Psychol.\u003c/em\u003e \u003cstrong\u003e88\u003c/strong\u003e, 295\u0026ndash;310 (1997).\u003c/li\u003e\n\u003cli\u003eShaki, S. \u0026amp; Fischer, M. H. Reading space into numbers--a cross-linguistic comparison of the SNARC effect. \u003cem\u003eCognition\u003c/em\u003e \u003cstrong\u003e108\u003c/strong\u003e, 590\u0026ndash;599 (2008).\u003c/li\u003e\n\u003cli\u003eFischer, M. H., Shaki, S. \u0026amp; Cruise, A. It takes just one word to quash a SNARC. \u003cem\u003eExp. Psychol.\u003c/em\u003e \u003cstrong\u003e56\u003c/strong\u003e, 361\u0026ndash;366 (2009).\u003c/li\u003e\n\u003cli\u003eB\u0026auml;chtold, D., Baum\u0026uuml;ller, M. \u0026amp; Brugger, P. Stimulus-response compatibility in representational space. \u003cem\u003eNeuropsychologia\u003c/em\u003e \u003cstrong\u003e36\u003c/strong\u003e, 731\u0026ndash;735 (1998).\u003c/li\u003e\n\u003cli\u003eFischer, M. H., Mills, R. A. \u0026amp; Shaki, S. How to cook a SNARC: Number placement in text rapidly changes spatial\u0026ndash;numerical associations. \u003cem\u003eBrain Cogn.\u003c/em\u003e \u003cstrong\u003e72\u003c/strong\u003e, 333\u0026ndash;336 (2010/4).\u003c/li\u003e\n\u003cli\u003eG\u0026ouml;bel, S. M. Up or down? Reading direction influences vertical counting direction in the horizontal plane - a cross-cultural comparison. \u003cem\u003eFront. Psychol.\u003c/em\u003e \u003cstrong\u003e6\u003c/strong\u003e, 228 (2015).\u003c/li\u003e\n\u003cli\u003eG\u0026ouml;bel, S. M., McCrink, K., Fischer, M. H. \u0026amp; Shaki, S. Observation of directional storybook reading influences young children\u0026rsquo;s counting direction. \u003cem\u003eJ. Exp. Child Psychol.\u003c/em\u003e \u003cstrong\u003e166\u003c/strong\u003e, 49\u0026ndash;66 (2018).\u003c/li\u003e\n\u003c/ol\u003e"},{"header":"Footnotes","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003e Please note, that, due to an oversight, this hypothesis was stated incorrectly in the opposite direction in our pre-registration.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"},{"header":"Tables","content":"\u003cp\u003eTables are available in the Supplementary Files section.\u003c/p\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Unit-decade compatibility effect, number word inversion, language switching, L2, bilingualism","lastPublishedDoi":"10.21203/rs.3.rs-7745359/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7745359/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eParticipants\u0026rsquo; language background can show subtle effects on number comparison even with Arabic digits. Number pairs can be decade-unit compatible, i.e. the decision about number size is the same for unit and decade (e.g. 24 and 57, 2\u0026thinsp;\u0026lt;\u0026thinsp;5 and 4\u0026thinsp;\u0026lt;\u0026thinsp;7), or decade-unit incompatible (e.g. 29 and 57, 2\u0026thinsp;\u0026lt;\u0026thinsp;5 but 9\u0026thinsp;\u0026gt;\u0026thinsp;7). Incompatible pairs often lead to longer response times. This compatibility effect is larger in languages with number word inversion (e.g. German). Our aim was to investigate the influence of language on the compatibility effect in bilinguals. In Experiment 1, we compared the compatibility effect in German monolinguals and German-English bilinguals. We found a significant compatibility effect, but no significant differences between the mono- and bilinguals. In Experiment 2, we manipulated language activation within participants by asking German-English bilinguals to describe scenes either in English or German before completing number comparison trials. There was a significant cost of switching languages, showing that participants did activate German and English. However, the size of the compatibility effect was not significantly influenced by participants activating German or English. Overall, we did not find an influence of a bilingual\u0026rsquo;s language on number comparison with Arabic digits. [199]\u003c/p\u003e","manuscriptTitle":"Which language matters? Number comparison in bilinguals","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-10-23 02:46:07","doi":"10.21203/rs.3.rs-7745359/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2026-02-18T04:08:29+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-17T20:41:06+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-16T17:48:55+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2026-02-03T10:19:31+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"303914076736437124170432889708335489959","date":"2026-01-26T17:31:42+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"24057517844837617642438310265890261733","date":"2026-01-24T20:26:25+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"65824401049655995345744662037482445029","date":"2026-01-21T11:05:32+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"41782564502268538138575069370776767859","date":"2026-01-10T10:21:05+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"303914076736437124170432889708335489959","date":"2025-11-10T22:11:01+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"181664850622810289865236193112136983608","date":"2025-10-20T13:36:51+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-10-09T08:25:27+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-10-09T05:36:11+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-10-07T07:14:47+00:00","index":"","fulltext":""},{"type":"submitted","content":"Scientific Reports","date":"2025-10-06T21:05:22+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"scientific-reports","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"scirep","sideBox":"Learn more about [Scientific Reports](http://www.nature.com/srep/)","snPcode":"","submissionUrl":"","title":"Scientific Reports","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"stoa","reportingPortfolio":"Scientific Reports","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"3bb343b9-5dbb-4a88-a3a3-b10712d8ba81","owner":[],"postedDate":"October 23rd, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"under-review","subjectAreas":[{"id":56586286,"name":"Humanities/Language and linguistics"},{"id":56586287,"name":"Social science/Language and linguistics"},{"id":56586288,"name":"Biological sciences/Neuroscience"},{"id":56586289,"name":"Biological sciences/Psychology"},{"id":56586290,"name":"Social science/Psychology"}],"tags":[],"updatedAt":"2026-05-04T10:54:26+00:00","versionOfRecord":[],"versionCreatedAt":"2025-10-23 02:46:07","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7745359","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7745359","identity":"rs-7745359","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.