The Dynamics of Collaboration: A Scientometric Study of Team Size and Text Complexity in Scientific Abstracts from Bangladesh

preprint OA: closed
Full text JSON View at publisher
Full text 127,638 characters · extracted from preprint-html · click to expand
The Dynamics of Collaboration: A Scientometric Study of Team Size and Text Complexity in Scientific Abstracts from Bangladesh | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article The Dynamics of Collaboration: A Scientometric Study of Team Size and Text Complexity in Scientific Abstracts from Bangladesh Ravikumar S., Mezbah-ul-Islam Muhammad, Chandrappa , This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8251341/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract The geography of scientific collaboration has become increasingly internationalized, transforming where and how research is constructed and communicated. While collaboration has been shown to impact scientific production and impact, much less is known about how it interacts with the linguistic characteristics of scientific communication. This study analyses the extent to which team size has an impact on the linguistic features of scientific abstracts from scholars with Bangladesh affiliations. Using the full population of 25,883 WoS publications (1972–2023), I assess the differences in three relevant linguistic characteristics - readability, lexical diversity, and abstract length - based on collaboration type (solo, small groups (2 authors), medium groups (3–5), large groups (6–10) and very-large groups (> 10 authors)). Readability is assessed through the Flesch Reading Ease, Flesch-Kincaid Grade Level and Gunning FOG Index while linguistic diversity is assessed through the Type-Token Ratio (TTR). The findings reveal three primary conclusions. First, lexical diversity is negatively impacted by team size in which greater linguistic inclusivity was found within collaborative endeavour’s as team size increased. Second, readability behaves according to a curvilinear pattern whereby 6–10 author teams produce the easiest-to-read abstracts and > 10 author teams produce more concise, complex works. Third, abstract length is positively impacted by team size due to the broader conceptual and methodological scope of collaborative efforts. Mean differences for readability relative to team size are statistically significant (ANOVA; p < 0.001) as a negative trend of reading ease from 2019–2024 reveals a decrease for all group sizes. This study contributes to the emerging field of Scientometrics - by connecting structures of collaboration to assessable linguistic features in practical applied scientific output for communication purposes. Practical applications are made for research evaluation, journal editors and writing specialists who teach novice researchers in burgeoning research fields how to produce such collaborative findings effectively while best anticipating expectations. Ultimately, these results indicate that researchers should welcome access to linguistic features in evaluation of collaborative efforts as communication does not come without cost when collaborative scientific writing occurs. team size scientific collaboration readability lexical diversity text analysis scientometrics collaborative writing research team dynamics Bangladesh Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Introduction Scientific collaboration is an increasingly central component of twenty-first century knowledge construction. Cross-disciplinary researchers work alongside one another to engage at a level of complementary expertise, access to resources, and methodological ability that is necessary for increasingly complicated scientific inquiries. Supported by increasingly more international opportunities for publication and research output, findings increasingly boast publication trends, co-authorship tendencies, increased team size and extensive networks to paper publication. Thus, thus far, collaboration has come to influence productivity, and consequently, knowledge transmission; however, collaboration increasingly connects with how scientific knowledge comes to be written, presented, and subsequently disseminated. At the same time, an increased scholarly focus exists relative to the language characteristics of scientific communication. Recent interdisciplinary findings note that the readability of scientific papers has increased over time relative to lexicon, organization of text, discipline specific jargon and standards of scientific communications established over twenty-first century developments. Text complexity - readability, lexical diversity and structural complexity - provides insight into how discoverable works of scientific communications relative to their complexities are or are not accessible to average audiences for comprehension, citation and application purposes. Yet despite such growing interest across academia and publication, the role of collaboration in this process remains relatively unexplored. Many studies assess team science relative to productivity, impact considerations. Still, few consider how team size (the level or absence of collaboration on authorship) - the number of authors - can impact conciseness of argument, depth of diction and informational value across science-related abstracts. Bangladesh represents a timely and relevant space for investigating this further. As a developing country, Bangladesh boasts rapid growth relative to its system of research within the last two decades - growing publication output, growing levels of in-country co-authorship, and growing interdisciplinary connections characterize the developmental and academic/publishing realities of researchers and students across disciplines. Public and private universities support this output while public/government laboratories and specialty research institutes support interdisciplinary connections in the biomedical sciences, engineering, agriculture, public health, and social sciences. However, thus far, no study has ventured to connect how the shifting nature of collaboration within Bangladesh has shifted over time to impact authorial writing and literacy characteristics in the specific features of scientifically published writing. Abstracts represent a potent space for assessing such change; as relatively shorter pieces to longer works representing content-based papers, abstracts wield methodological appeal via necessity and relevant framing which subsequently inform their own structure and complexity. Abstracts can be simple or complex depending on where they are sent or to what audience they are geared linguistically. Thus, word choice, length and ensuing complexity can come from a collaborative effort of team dynamics which leverage negotiation amongst diverse voices or from limited dynamics. Thus far, research exists relative to scientific impact and team size (Larivière et al., 2015); for example, while those teams of larger size accrue the most citations over time, the smaller sized teams present novel concepts or disruptive findings (Wu, Wang & Evans, 2019 ). Comparative studies note readability - Plavén-Sigray et al., ( 2017 ) - is decreasing over time generally. Yet little effort has been exerted to assess how these two advancements are connected; does team size impact readability? Does the team size on a spectrum operate linearly when it comes to language features? This project aims to fill that void between established contributions to confirm or deny observations related to collaborative writing across developed systems where they join established literature in scientometrics relative to effective team science practices and more generally applicable contributions relevant to scholarly communication. A large corpus study is employed with 25,883 abstracts indexed by Web of Science from 1972–2023 with authors from institutions within Bangladesh which explores length, readability and lexical diversity across different collaboration size tiers, specifically one author. One author collaboration exists. Review of Literature Collaboration Dynamics and Global Scientific Output Collaborative science has shifted considerably over decades in response to levels of specialization, greater facility ability, and international competition. The team-oriented nature of research functions as the primary means of knowledge production for any given field and channels the configuration of outputs, patterns of impact and communication. Internationally, large-scale studies indicate an increasingly progressive, global trend toward expanded collaboration. Gazni, Sugimoto and Didegah ( 2011 ) cite significant increases in small and medium collaborative teams in a sample of over 14 million records in the Web of Science while Adams et al. ( 2005 ) find a 50% increase in collaborative team size in U.S. universities from 1981–1999. Such findings help establish the notion that science exists in a highly complexly structured, interdisciplinary setting today where institutions facilitate this collaborative means of research. This is especially true in emerging research countries like Bangladesh where collaboration patterns, institutional capacity and age of research and communication patterns develop in a new, rapidly growing setting. Growth of Team Science and Evolution of Collaboration Structures Such cumulative findings support unit size and collaborative component due to technological, institutional, national and disciplinary factors over the past twenty years. For example, Gazni et al. ( 2011 ) note that various increases were noted in small- and medium-size groups as technological advancements allow information to be sent to larger ranges without major costs reducing for necessary or Adams et al. ( 2005 ) suggests that although collaboration is high (even within the U.S.), levels which boast significant increased over the course of the twenty-year study boast highly productive increases of papers within the U.S. According to Gazni and Didegah ( 2011 ), Harvard University boasts 88% of papers produced with collaborative efforts and papers produced across institutions boast extremely high levels of citations as well. Furthermore, Franceschet and Costantini ( 2010 ) relative to 20 different disciplines studied in Italy reveal that collaboration, team size, potential for citation and potential for peer-review increase in a positive fashion, as do all variables. Yet there's a variable - discipline. For example, Abt ( 2007 ) asserts that international collaboration via publication is field dependent - international collaboration accounted for 13% of published papers in surgery in 2007 while 55% of papers in astronomy - pointing to some geopolitical interdisciplinary epistemology at play and Didegah & Gazni (2014) assess national collaboration within niche fields while international collaboration is an asset for international visibility and eventual citation potential. Overall, these cumulative findings suggest that unit size and collaborative component is based on technological, institutional, national and disciplinary factors - all pertinent to the specific research field in Bangladesh's developing research. Collaboration Networks and Their Influence on Impact and Innovation But network structure is just as important as network size. Liu et al. (2024) note that the more "small-world characteristics" (high clustering and low path length) teams have, the more they influence publications. Furthermore, levels of national collaboration reliance vary. For instance, the Scientific Dependence Index shows that the smaller systems (Cuba, for instance) rely more heavily upon international collaboration while larger systems boast more balanced output - although this, too, is subject to geopolitical and historical developments. Likewise, in Europe, Zitt et al. (2020) found that long-term political, linguistic and historical connections help understand how collaboration occurs on a bilateral basis. Thus, the fact that Bangladesh is trending more reliant on international networks over time suggests levels of potentially impacted research novelty and language use in scientific publications. But collaboration isn't always helpful. Mongeon and Larivière ( 2015 ) found that collaborators of researchers responsible for retracted publications experience drastic declines in future publication rates and citation impact which indicates caution when deciding to co-author with someone else. Ultimately, while many authors today contribute to discoveries, small author teams have breakthroughs uniquely tied to them. Winnink et al. ( 2015 ) found via the development of the ubiquitin-mediated proteolytic system that concentrated collaboration results in accelerated conceptizations over time. Text Complexity, Readability, and Scientific Communication Research on linguistic characteristics of scientific literature contributes to an additional layer of exploring scientific communication. According to Dressler et al. ( 2025 ), readability is relative, based upon the alignment of linguistic characteristics with textual density and cognitive burden. The denser the language used, the more cognitive effort required, and the lower the potential for comprehension. Findings relative to scientific writing implicate decline in readability, regardless of discipline. For example, in the findings relative to plain language summaries, it was determined by Lang et al. ( 2025 ) that many plain language summaries fell above recommended levels of readability and discipline-specific terms frequently connected to interdisciplinary efforts and large research teams. Plavén-Sigray et al. ( 2017 ) report a similar decline in scientific readability from 1881 to 2015 across 12 dominant fields of inquiry. Furthermore, linguistic findings support that collaborative writing reduces lexical variety and encourages convergence into homogeneity of word choice (Lu et al., 2019 ; Song et al., 2023 ). Therefore, this data supports that collaborative writing may have an impact on the density of linguistic characteristics. Research Context and Gaps in Bangladesh However, with Bangladeshi research on the rise, the following is true for the literature: Many bibliometric works have not accounted for collaborative information across institutions of Bangladesh. No corroborating empirical research exists for the reader effect of group size and text complexity (readability, lexical diversity). No scientifically standardized national scope scientometric study exists that links collaboration and linguistic variables on a sample of scientific abstracts. Few studies exist that assess the ability of authors to shift their writing strategies in accordance with collaborative means. Thus, as research comes out of Bangladesh at a faster pace - higher collaborative rates, more international collaborations and institutions of varying strengths - now is the time to explore the nuances of abstract construction. Synthesis and Conceptual Framework These three themes emerge from the above literature review: Team size and collaborative configuration affect scientific visibility and citations and impact measures, novelty (Franceschet & Costantini, 2010 ; Gazni & Didegah, 2011 ; Liu et al., 2024). Collaboration patterns suggest research capacity exists based on previous engagement with certain countries and field tendencies (Abt, 2007 ; Didegah & Gazni, 2014; Ronda-Pupo, 2025). Lexical complexity suggests cognitive exertion and communicative success, which comes from team size and collaborative writing effort (Dressler et al., 2025 ; Lang et al., 2025 ). Therefore, these three themes of literature support the idea that team size will correlate with abstract clarity, lexical diversity and length, especially within an underdeveloped research context such as Bangladesh. Collaboration Structures and Citation Impact The findings of the research are overwhelmingly positive. Externally, research from external sources (Gazni & Didegah, 2011 ) and research from select universities across Italy (Franceschet & Costantini, 2010 ) indicates that where collaboration is concerned - from group size to citation usefulness - view it as a favorable, correlational study. Internally, the field research (Abt, 2007 ; Xu et al., 2021 ) indicates that increased collaboration in the personal lives of many people only improves levels of collaboration in research. However, where collaboration and impact are concerned, little research exists in comparison to team science and linguistic or written pertinence to one paper. For example, impact relative to abstract familiarity or lexical diversity has not been researched as it pertains to Bangladesh for no relative research exists to date. Therefore, this research is necessary. Objectives of the Study This study aims to: Investigate collaborative tendencies in the linguistic construction of Bangladeshi scientific abstracts. Examine levels of linguistic complexity (readability, lexical variety, syntactic complexity and semantic complexity) in Bangladeshi scientific abstracts. Establish whether team size is an influencing factor for shifts in textual complexity. Discuss socio-academic drivers of collaboration within Bangladesh. Methods Data Source and Pre-processing The dataset used for the study was created from the WoS (Web of Science) Core Collection. It encompasses all publications that have at least one author affiliated with a Bangladeshi institution. The first step of data cleaning was performed in R (v4.3.1). Records without abstracts, empty abstracts and abstracts below 50 characters in length were removed from the study - this ensures that the papers are sufficient for a large enough language study. The study's final analytical dataset consisted of 25,883 scientific abstracts after this stage. The study estimated team size by assessing the number of authors present in the WoS field designated for authors. For example, authors separated by semicolon, semi-colon, were each counted for contribution; those without any data were counted as one author. Thus, the publications were classified into a type of collaboration category: Solo (1 author), Small team (2 authors), Medium team (3–5 authors), Large team (6–10 authors), Very large team (> 10 authors). These categories were treated as ordered factors in further analysis. Linguistic Feature Extraction Languaging was conducted via the quanteda package (Benoit et al., 2018 ). A corpus object was created from the authenticated abstracts with each abstract being its own document. Length and Structural Metrics The following textual variables were calculated per document. Total number of words: total number of character strings excluding whitespace Total number of sentences: estimated via terminal punctuation (., !, ?) They were standards of equivalence approximations for extent and density of form. Readability Measures Readability was assessed using textstat_readability(). Three widely used indices were computed Flesch Reading Ease (FRE): scores from 0–100, where higher scores indicate greater readability. Flesch–Kincaid Grade Level (FKGL): estimates the U.S. school grade required to comprehend the text. Gunning FOG Index: estimates years of formal education needed for first-pass understanding. Lexical Diversity The Type-Token Ratio (TTR) was calculated to represent lexical diversity. The same tokenization process occurred where the abstracts' punctuation, numbers, and special characters were removed, and the case was transformed into lowercase. Then, textstat_lexdiv() was applied to measure this variable $$\:\text{T}\text{T}\text{R}=\frac{\text{U}\text{n}\text{i}\text{q}\text{u}\text{e}\:\text{w}\text{o}\text{r}\text{d}\:\text{t}\text{y}\text{p}\text{e}\text{s}}{\text{T}\text{o}\text{t}\text{a}\text{l}\:\text{w}\text{o}\text{r}\text{d}\:\text{t}\text{o}\text{k}\text{e}\text{n}\text{s}}$$ Statistical Analysis and Visualization All tests were run in R. Descriptive statistics (mean, sd) for all linguistic variables were acquired across all five team-size groups using the dplyr package. Correlation Matrix To assess linear relationships between team size and abstract length (tokens) versus readability measures and lexical diversity, a Pearson correlation matrix was constructed. Group Comparisons A one-way ANOVA was performed to compare the means of linguistic features across collaboration groups (where FRE was the dependent variable, and team size category was the grouping factor). This assessed whether different levels of collaboration impact readability. Visualizations All visualizations were constructed with ggplot2, which subsequently had a publication-ready theme applied. ggplot2 created the following output A bar chart visualizing team sizes of papers with frequency A violin + box plot showing the distributions of the readability/length and lexical variables A heat map denoting correlations between all linguistic variables Line plots per year (2019–2024) Visualizations helped interpret any collaboration-linguistic relationships. Results Team Size Distribution An investigation into the 25,883 works reveals a shift towards more collaborative authorship. According to Fig. 1 , medium (3–5 authors) and large (6–10 authors) collaborative efforts dominate as 10,165 (39%) and 9,869 (38%) papers were published by medium and large collaborative efforts, respectively. Only 500 of the 25,883 works (approximately 2%) represent solo efforts. This reflects, to a degree, the collaborative efforts of publication from Bangladesh as over time, more and more works are published by teams rather than individuals. Team Size and Abstract Characteristics Interestingly, a specific relationship emerged between team size and abstract structure. Not only did grand means of an abstract length greater for team size, but the means of word counts per level of team size occurred more frequently, too. The grand mean was 198.62 words for 1 author and 266.99 for extremely large teams (> 10). This means that the larger the team, the more extensive and more likely and encompassing abstract sent is as more people contribute more information to include. In contrast, lexical diversity decreased by team size on the basis of Type-Token Ratio (TTR). The average TTR score of the smaller (1.54), (1) and the extremely large teams (> 10) (0.56). This means that there are less lexically diverse options made in collaboration abstractions than there are in those made by one author. Table 1 Summary of Abstract Metrics by Team Size Category Team Size Category N Papers Avg. Word Count Avg. Flesch Score Avg. F-K Grade Avg. TTR Solo (1) 500 198.62 19.98 16.74 0.59 Small team (2) 1773 206.28 21.02 16.47 0.58 Medium team (3–5) 10165 218.91 21.1 16.51 0.58 Large team (6–10) 9869 237.03 21.58 16.52 0.57 Very large team (> 10) 3576 266.99 20.45 16.82 0.56 Note . F-K Grade = Flesch-Kincaid Grade Level; TTR = Type-Token Ratio. Two trends are particularly evident: Abstract length increases with team size (∆ ≈ +68 words between solo teams and very large teams) Lexical diversity decreases as TTR scores decline. These two conclusions indicate that the greater the collaborative effort, the greater the likelihood of a longer - but lexically unimpressive - abstract. Team Size and Readability Readability was related to team size, non-linearly. A one-way ANOVA established differences in FRE across the five types of collaboration, F(4, 25,878) = 13.55, p < .001. Ultimately, the most readable abstracts came from large collaborations (6–10 authors) with a mean FRE of 21.58. However, it's essential to realize that FRE decreases slightly for very large collaborations (> 10 authors) meaning that this trend approaches linguistic complexity at excessively collaborative stages. The violin plots below in Fig. 3 show how FREs lie within each team, revealing much spread within groups but consistently low values in the absolute sense - typical of scientific prose where anything lower than 30 is considered technical, graduate student language. Correlations between Linguistic Features Correlation tests provided additional statistically significant observations between team size, word count, readability, and lexical diversity (see Fig. 4 ). For example, a moderately strong positive correlation existed between team size and word count (r = .42), indicating that the larger the team, the more words were used to construct the abstract. In addition, as anticipated, a slightly negative correlation existed between team size and lexical diversity (TTR) (r = − .18), since the more authorship dispersed in works, the smaller the vocabulary used, a phenomenon observed in collaborative works and lexical convergence in scientometric research (Mongeon & Larivière, 2015 ; Winnink et al. 2015 ). Metrics for readability told a different story. A correlation matrix found a correlation between FRE and word count (r = .21) and a correlation with team size (r = .14). This FRE finding in other words means that perhaps slightly longer papers create clarity of exposition. Furthermore, FRE also implies that teams larger than ten are conducive to a better understanding of average reading level materials with more authors. Similarly, FKGL and Gunning FOG Index are positively correlated with team size (r = .14 and r = .17 respectively). Thus, the more authors on a team the more complicated the writing is and required higher levels of reading comprehension. TTR is lexical diversity and this was negatively correlated with both the FOG Index (r = − .22) and FKGL (r = − .19) meaning papers with less lexically diverse vocabularies are also more complicated and conceptually dense. This makes sense in linguistic terms and with cognitive load theory (Mongeon & Larivière, 2015 ). Ultimately, these findings suggest a nuanced complex of collaborative and scientific writing. Temporal Trends in Readability (2019–2024) In order to measure whether the linguistic variables changed over time, a temporal assessment of the trend was performed for 2019–2024. The trend of FRE and TTR per team size is in Fig. 5 . The major findings from this assessment are as follows: Decrease in Readability FRE had an overall decrease over the five-year span (average ΔFRE = -1.8). This means increasingly dense scientific abstracts over the course of time years are less readable. This not only complements findings abroad that show decreases over time (Mongeon & Larivière, 2015 ; Winnink et al. 2015 ), but suggests that researchers are subject to these pressures internationally. Decrease in Lexical Variety : TTR showed a nominal decrease over this time frame (ΔTTR ≈ -0.02), meaning there is a standardization of vocabulary that likely means a homogeneity of writing style as well. Team Size Distribution Consistency : Regardless of annual increases and decreases in total published articles, the percent makeup of single author, small, medium, large and very large teams remained relatively proportionate and consistent with other years. The increase of technicality and specialization to an increasingly less readable extent resonates with researchers internationally as international findings show a similar level of formality in interdisciplinary and increasingly specialized scientific communications. Therefore, the temporal findings suggest researchers similarly find themselves pressured over time by increasingly substantive events. Discussion This interdisciplinary study offers a comprehensive scientometric-linguistic analysis of the collaborative relationships that shape the linguistic characteristics of scientific abstracts from Bangladeshi authors. Drawing on the results of a dataset of 25,883 WoS publications, this study finds major characteristics where team size is appropriately matched to standardized type/token ratio (TTR), readability and length of abstract. First, the relationship between team size and length of abstract is positively correlated. This implies that where authors are collaborating with more other authors, their abstracts become increasingly succinct presentations. Such an increase would be due to the ability for these collaborative contributions to find a balance between diverse approaches and larger designs necessitating increased understanding. Thus, with a multi-authored approach, a greater number of ideas can be posited, meaning that abstract length mirrors the complexity and intellectual comprehension expected of such analyses. However, as team size increases, TTR decreases. This suggests that where authors are working together in a multi-author context, there is a similar lexicon used. This may be due to a consensus effect that teams who collaborated arrived at less specialized language that would be accessible in efforts to appeal to the diverse writing styles and discipline approaches. The results are aligned with findings of collaborative writing where groups of authors writing from drafts have less variation in lexical choices as authors work through a common voice. While this voice may be clearer, it is also homogenized to such an extent that it foregoes stylistic differences for eloquent economy that is more pronounced in larger teams. The relationship between team size and readability is not necessarily linear. Teams of 6–10 authors have the highest readability - ironically compared to the extensive abilities of team dynamics - where it seems a happy median is found between enough persons for internal peer review and expanded editorial management yet not so many that there's a breakup in how to convey information. Excessive teams (> 10 authors) present with lower readability. This indicates that collaborative writing was not managed as effectively along with continued variations in writing and shifts in abstract purpose. This aligns with theoretical explorations of collaborative communication that suggests the larger the team, the less clear it becomes. Second, the negative correlation to readability relative to time from 2019–2024 aligns with global trends. Researchers like Plavén-Sigray et al. ( 2017 ) found that from global access trends, general scientific writing has become less readable relative to increased specialization, author competitiveness and advanced focus on jargon use within the discipline. That Bangladesh finds itself amidst similar trends, however, suggests that as applied linguistic pressures penetrate modern science, technical density and complex methodologies and international publication patterns have permeated developed research fields just as much. Ultimately, clear trends emerge whereby collaboration affects scientific writing in various directions. While larger teams do create longer abstracts, they also rely on standardized vocabulary; medium-sized teams create clearer works relative to small sized or excessively sized teams - while global literacy finds itself increasingly downward trending in interdisciplinary endeavours suggests that the collaborative approaches are unlikely to be clear to greater public audiences. These results are important for Bangladeshi scholars - a burgeoning collaborative research field of academic investigators - for subsequent endeavors. As more and more collaborative and interdisciplinary creations are made, an intention on the results regarding authorship tendencies and editorial group considerations for appropriate dissertation and article development will be of great use without extended lengths of abstracts. Furthermore, a more cohesive effort in communication will bolster worldwide recognition of articles by Bangladeshi scholars. Conclusion This research confirms that research team size is a significant predictor of Bangladeshi authors' language features of abstracts. With the presumption of larger collaborative teams being the most common collaborative authorship approach to what's currently out there, it seems that excessively sized contributions lend to longer - and thus less readable - abstracts. Abstracts with more like 5–10, however, have the potential for greater conceptual cohesion although it's determined that abstract language is more similarly aligned. Readability, however, is even more nuanced - not only is it not linear with the size of the team, but a substantial readable average can be attributed to a specific size (6–10). Average sized contributions with these authors produced better figures in legibility and syllable count than single written and those coalesced through excessive collaboration (11+). At the same time, a relative reduction in readability across the board suggests an international phenomenon as specialized subfield researchers grow more specialized, developing more intricate methodologies and discipline-specific lexicon; this subfield position makes sense within a relatively new national research context like Bangladesh as it suggests needing help to create clear scientific writing practices and understandable practices through workshops, editorial support, and institutional development. Thus, these findings would be generalizable through scientific writing workshops and optimal collaborative authorship conclusions that would be supported by national policies surrounding research. Early-career researchers would benefit from scientific writing workshops that promote appropriate collaborative styles, giving all researchers from Bangladesh awareness to the pros and cons of authorship on collaborative papers/literature. Nationally, best practice promotion of publication through clear communication would make scientifically minded pieces from Bangladesh come across internationally under a better (or at least preferred, clearer) light. Future studies would seek to determine intentions from collaborative writers as to this phenomenon, longitudinal cross-sections of years with full manuscript assessments for within-team comparisons over the same years accessed per team size, interdisciplinary differences for national comparisons or international variances on abstract findings. All would explore collaborative connections between scientific communication in broader contexts. Declarations Funding: The authors received no financial support for the research, authorship, and/or publication of this article. Conflicts of Interest / Competing Interests: The authors declare that there are no conflicts of interest. Ethics Approval: Not applicable. No human participants, animals, or sensitive data were involved. Consent to Participate: Not applicable. Consent for Publication: Not applicable. Availability of Data and Materials: The dataset derived from Web of Science is available from the corresponding author upon reasonable request. Code Availability: R scripts used for text mining, readability analysis, and visualization are available from the authors upon request. Authors’ Contributions (CRediT): S. Ravikumar: Conceptualization; Methodology; Data Curation; Formal Analysis; Visualization; Writing – Original Draft. M. Mezbah-ul-Islam: Supervision; Validation; Writing – Review & Editing; Interpretation. Chandrappa: Writing – review & editing; Resources; Validation , Investigation; Formal analysis; Writing – Original Draft. References Abt, H. A. (2007). The frequencies of multinational papers in various sciences. Scientometrics , 72 (1), 105–115. https://doi.org/10.1007/s11192-007-1710-5 Adams, J. D., Black, G. C., Clemmons, J. R., & Stephan, P. E. (2005). Scientific teams and institutional collaborations: Evidence from U.S. universities, 1981–1999. Research Policy , 34 (3), 259–285. https://doi.org/10.1016/j.respol.2005.01.014 Angulo-Tuesta, A., & Santos, L. M. P. (2015). Evaluation of the impact of maternal and neonatal morbidity and mortality research funded by the Ministry of Health in Brazil. Research Evaluation , 24 (1), 1–11. https://doi.org/10.1093/reseval/rvv022 Ante, L. (2022). The relationship between readability and scientific impact: Evidence from emerging technology discourses. Journal of Informetrics , 16 , 101252. https://doi.org/10.1016/j.joi.2022.101252 Bao, T., Zhao, Y., Mao, J., & Zhang, C. (2025). Examining linguistic shifts in academic writing before and after the launch of ChatGPT: A study on preprint papers. Scientometrics. https://doi.org/10.1007/s11192-025-05341-y Benoit, K., Watanabe, K., Wang, H., Nulty, P., Obeng, A., Müller, S., & Matsuo, A. (2018). quanteda: An R package for the quantitative analysis of textual data. Journal of Open Source Software , 3 (30), 774. https://doi.org/10.21105/joss.00774 Boyack, K. W., van Eck, N. J., Colavizza, G., & Waltman, L. (2018). Characterizing in-text citations in scientific articles: A large-scale analysis. Journal of Informetrics , 12 (1), 59–73. https://doi.org/10.1016/j.joi.2017.11.005 Bu, Y., Ding, Y., Liang, X., & Murray, D. (2018). Understanding persistent scientific collaboration. Journal of the Association for Information Science and Technology , 69 (4), 480–493. https://doi.org/10.1002/asi.23966 Chen, C. (2012). Predictive effects of structural variation on citation counts. Journal of the Association for Information Science and Technology , 63 (3), 431–449. https://doi.org/10.1002/asi.21694 Deng, H., Wu, C., Ge, B., & Wu, H. (2025). Quantification and identification of authorial writing style through higher-order text network modeling. Journal of Informetrics. https://doi.org/10.1016/j.joi.2024.101603 Dong, S., Mao, J., Ke, Q., & Pei, L. (2024). Decoding the writing styles of disciplines: A large-scale quantitative analysis. Information Processing & Management , 61 , 103718. https://doi.org/10.1016/j.ipm.2024.103718 Dressler, W., et al. (2025). Textual complexity and readability in scientific communication: A cognitive-linguistic perspective. Journal of Technical Writing and Communication , 55 (3), 413–439. Franceschet, M., & Costantini, A. (2010). The effect of scholar collaboration on impact and quality of academic papers. Scientometrics , 87 (2), 295–306. https://doi.org/10.1007/s11192-010-0310-4 Gazni, A., Sugimoto, C. R., & Didegah, F. (2011). Mapping world scientific collaboration: Authors, institutions, and countries. Journal of the Association for Information Science and Technology , 63 (2), 323–335. https://doi.org/10.1002/asi.21688 Gazni, A., & Didegah, F. (2011). Investigating different types of research collaboration and citation impact: A case study of Harvard University publications. Scientometrics , 87 (2), 251–265. https://doi.org/10.1007/s11192-010-0318-9 Gui, W., Wang, H., Sun, X., & Wang, N. (2022). Linguistic complexity in scientific writing: A large-scale diachronic study from 1821 to 1920. Scientometrics , 127 (8), 4327–4348. https://doi.org/10.1007/s11192-022-04550-z Hasegawa, Y., et al. (2024). A cell competition system with one gene expression from a single-copy gene in one cell. PLOS ONE , 19 , e302451. https://doi.org/10.1371/journal.pone.0302451 Hayati, Z., & Didegah, F. (2010). International scientific collaboration of Iran with other countries: A scientometric study. Journal of Information Science , 36 (4), 482–491. https://doi.org/10.1177/0165551509351430 Hirsch, J. E. (2010). An index to quantify an individual’s scientific research output that takes into account the effect of multiple coauthorship. Scientometrics , 85 (3), 741–754. https://doi.org/10.1007/s11192-010-0193-9 Hoffman, E., Li, Y., Coutts, R. T., & Maris, S. (2024). Leveraging AI for educational content consistency: Metadata generation for MOOC development. In IEEE DEMOcon 2024 . https://doi.org/10.1109/DEMOCON63027.2024.10747877 Lang, S., Walter, T., & Rodrigues, R. (2025). Readability and jargon in scientific plain-language summaries. PLOS ONE , 20 , e0302451. Lancho-Barrantes, B. S., Guerrero-Bote, V. P., Chinchilla-Rodríguez, Z., & de Moya-Anegón, F. (2012). Citation flows in the zones of influence of scientific collaborations. Journal of the Association for Information Science and Technology , 63 (3), 481–489. https://doi.org/10.1002/asi.21682 Larivière, V., Sugimoto, C. R., & Gingras, Y. (2014). Team size matters: Collaboration and scientific impact since 1900. Journal of the Association for Information Science and Technology , 65 (4), 1323–1332. https://doi.org/10.1002/asi.23266 Lu, C., Bu, Y., Wang, J., Ding, Y., Torvik, V., Schnaars, M., & Zhang, C. (2019). Examining scientific writing styles from the perspective of linguistic complexity. Journal of the Association for Information Science and Technology , 70 (5), 462–475. https://doi.org/10.1002/asi.24126 Lu, C., Bu, Y., Dong, X., Wang, J., Ding, Y., Larivière, V., Sugimoto, C. R., Paul, L., & Zhang, C. (2019). Analyzing linguistic complexity and scientific impact. Journal of Informetrics , 13 , 100961. https://doi.org/10.1016/j.joi.2019.07.004 Lu, C., Zhang, C., Xiao, C., & Ding, Y. (2022). Contributorship in scientific collaborations: Contribution-based byline orders. Information Processing & Management , 59 , 102944. https://doi.org/10.1016/j.ipm.2022.102944 Lu, C., Zhang, Y., Ahn, Y.-Y., Ding, Y., Zhang, C., & Ma, D. (2020). Co-contributorship network and division of labor in individual scientific collaborations. Journal of the Association for Information Science and Technology , 71 (1), 78–92. https://doi.org/10.1002/asi.24321 Mayukh Bagchi. (2025). Toward generative AI–driven metadata modeling: A human–LLM collaborative approach. Library Trends . https://doi.org/10.1353/lib.2025.a961196 Mongeon, P., & Larivière, V. (2015). The career impact of research fraud: A scientometric analysis. Journal of the Association for Information Science and Technology , 66 (2), 205–217. https://doi.org/10.1002/asi.23202 Pan, X., Yan, E., Wang, Q., & Hua, W. (2015). Assessing the impact of software on science: A bootstrapped learning of software entities in full-text papers. Journal of Informetrics , 9 (2), 247–257. https://doi.org/10.1016/j.joi.2015.07.012 Pinfield, S., Salter, J., Bath, P. A., Hubbard, B., Millington, P., Anders, J. H. S., & Hussain, A. (2014). Open-access repositories worldwide, 2005–2012. Journal of the Association for Information Science and Technology , 65 ( 2 ), 240–256. https://doi.org/10.1002/asi.23131 Plavén-Sigray, P., Matheson, G. J., Schiffler, B. C., & Thompson, W. H. (2017). The readability of scientific texts is decreasing over time. eLife , 6 , e27725. https://doi.org/10.7554/eLife.27725 Porwal, P., & Devare, M. (2024). Scientific impact analysis: Unraveling the link between linguistic properties and citations. Journal of Informetrics , 18 , 101526. https://doi.org/10.1016/j.joi.2024.101526 Romanchuk, R., Vysotska, V., Andrunyk, V., Chyrun, L., Chyrun, S., & Brodyak, O. (2023). Intellectual analysis system for Ukrainian-language artistic works. In Proceedings of CSIT 2023 . https://doi.org/10.1109/CSIT61576.2023.10324012 Song, N., Chen, K., & Zhao, Y. (2023). Understanding writing styles of scientific papers in IS–LS: Evidence from abstracts. Journal of Informetrics , 17 , 101377. https://doi.org/10.1016/j.joi.2023.101377 Stephen, D. (2025). Distinguishing articles in questionable and non-questionable psychology journals. Journal of Informetrics , 19 , 101640. https://doi.org/10.1016/j.joi.2025.101640 Sun, Z., He, D., & Li, Y. (2024). How the readability of manuscripts influences peer review: Biomedical evidence. Journal of Informetrics , 18 , 101547. https://doi.org/10.1016/j.joi.2024.101547 Winnink, J., Tijssen, R., & Van Raan, A. F. J. (2015). Small team breakthroughs: Collaboration constellations in high-impact science. Research Evaluation , 24 (3), 352–366. https://doi.org/10.1093/reseval/rvv026 Wu, L., Wang, D., & Evans, J. A. (2019). Large teams develop and small teams disrupt science and technology. Nature , 566 (7744), 378–382. https://doi.org/10.1038/s41586-019-0941-9 Xu, S., Li, L., An, X., Hao, L., & Yang, G. (2021). Detecting commonality and specialty between scientific publications and patents. Scientometrics , 126 (3), 1789–1806. https://doi.org/10.1007/s11192-021-04085-9 Additional Declarations No competing interests reported. Supplementary Files Appendix.png Appendix: Additional Figure Comprehensive panel showing the relationship between team size and (A) Length, (B) Readability, (C) Lexical Diversity, and (D) Sentence Complex Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8251341","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":610710715,"identity":"d4eadfb5-69d1-4deb-9bbc-b3abcd7fa8e3","order_by":0,"name":"Ravikumar S.","email":"","orcid":"","institution":"Central University of Tamil Nadu","correspondingAuthor":false,"prefix":"","firstName":"Ravikumar","middleName":"","lastName":"S.","suffix":""},{"id":610710716,"identity":"4066a3b9-0962-49fc-beca-946a24963ac2","order_by":1,"name":"Mezbah-ul-Islam Muhammad","email":"","orcid":"","institution":"University of Dhaka","correspondingAuthor":false,"prefix":"","firstName":"Mezbah-ul-Islam","middleName":"","lastName":"Muhammad","suffix":""},{"id":610710717,"identity":"97e8ad70-98d3-4693-88a6-49c67c33964c","order_by":2,"name":"Chandrappa ,","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA1klEQVRIiWNgGAWjYDACCSDmAdPMB0CUDCla2BJAFA8pWngMGKBs/IB/dvOzB2/3HJaTnHbm86sbNRY8DOyHj27Aa8mdY+aGc54dNpaWzt1mnXMM6DCetLQb+LQYSCSYSfMcOJw4D6jFOIcNqEWCx4yAlvRvIC3186Rznhnn/CNKSw7YlgRp6Rzmx7ltRGiRuJFTJjnnQLrhzNlpZsy5fRI8bIT8wj8jfZvEmwPW8hK3kx9/zvlWJ8fPfvgYXi3IgE0CTBKrHASYP5CiehSMglEwCkYOAAAO60PQ9WtXsAAAAABJRU5ErkJggg==","orcid":"","institution":"Indian Maritime University","correspondingAuthor":true,"prefix":"","firstName":"","middleName":"","lastName":"Chandrappa","suffix":""}],"badges":[],"createdAt":"2025-12-01 14:23:08","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-8251341/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-8251341/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":105416949,"identity":"17ffbce7-7113-4cdd-bd2a-78975b1bcd6a","added_by":"auto","created_at":"2026-03-25 19:19:16","extension":"jpg","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":27029,"visible":true,"origin":"","legend":"\u003cp\u003eFrequency of publications by author team size category.\u003c/p\u003e","description":"","filename":"Picture1.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/c3c7302b690c7b47a27bc037.jpg"},{"id":105416945,"identity":"29f2da5c-2c80-4048-a38b-f80fce2109ca","added_by":"auto","created_at":"2026-03-25 19:19:16","extension":"jpg","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":44710,"visible":true,"origin":"","legend":"\u003cp\u003eMean values for readability and lexical diversity metrics by team size.\u003c/p\u003e","description":"","filename":"Picture2.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/cc134ac74dd011277b6e0be5.jpg"},{"id":105416948,"identity":"6a3e91c1-df9a-4cab-8ae3-ed960244d58c","added_by":"auto","created_at":"2026-03-25 19:19:16","extension":"jpg","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":28786,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cem\u003eViolin plots of Flesch Reading Ease across team-size categories (diamond = mean).\u003c/em\u003e\u003c/p\u003e","description":"","filename":"Picture3.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/29396acc267419107c0e3d98.jpg"},{"id":105566741,"identity":"f4e909da-7aab-40e3-b86c-60b7754e6d36","added_by":"auto","created_at":"2026-03-27 12:57:10","extension":"jpg","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":52309,"visible":true,"origin":"","legend":"\u003cp\u003eCorrelation matrix of authorship and linguistic features.\u003c/p\u003e","description":"","filename":"Picture4.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/4b44272038cd3387eb23ab96.jpg"},{"id":105565953,"identity":"bd5939ac-4e49-45f1-9c07-c503e61ee97f","added_by":"auto","created_at":"2026-03-27 12:54:52","extension":"jpg","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":28985,"visible":true,"origin":"","legend":"\u003cp\u003eEvolution of abstract readability for small vs. large collaboration teams from 2019 to 2024.\u003c/p\u003e","description":"","filename":"Picture5.jpg","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/7bee3057cb8a0d4e5ce4ef34.jpg"},{"id":105569836,"identity":"5eb4c4af-7101-4347-9058-70c537c028eb","added_by":"auto","created_at":"2026-03-27 13:13:39","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1159680,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/0dba1dba-99de-4339-b9b9-e3d47f48a4ef.pdf"},{"id":105565932,"identity":"0613c85d-6e30-464c-864c-92868f3864f9","added_by":"auto","created_at":"2026-03-27 12:54:47","extension":"png","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":96323,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eAppendix: Additional Figure\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eComprehensive panel showing the relationship between team size and (A) Length, (B) Readability, (C) Lexical Diversity, and (D) Sentence Complex\u003c/p\u003e","description":"","filename":"Appendix.png","url":"https://assets-eu.researchsquare.com/files/rs-8251341/v1/71198fda1337c1838c385653.png"}],"financialInterests":"No competing interests reported.","formattedTitle":"The Dynamics of Collaboration: A Scientometric Study of Team Size and Text Complexity in Scientific Abstracts from Bangladesh","fulltext":[{"header":"Introduction","content":"\u003cp\u003eScientific collaboration is an increasingly central component of twenty-first century knowledge construction. Cross-disciplinary researchers work alongside one another to engage at a level of complementary expertise, access to resources, and methodological ability that is necessary for increasingly complicated scientific inquiries. Supported by increasingly more international opportunities for publication and research output, findings increasingly boast publication trends, co-authorship tendencies, increased team size and extensive networks to paper publication. Thus, thus far, collaboration has come to influence productivity, and consequently, knowledge transmission; however, collaboration increasingly connects with how scientific knowledge comes to be written, presented, and subsequently disseminated.\u003c/p\u003e \u003cp\u003eAt the same time, an increased scholarly focus exists relative to the language characteristics of scientific communication. Recent interdisciplinary findings note that the readability of scientific papers has increased over time relative to lexicon, organization of text, discipline specific jargon and standards of scientific communications established over twenty-first century developments. Text complexity - readability, lexical diversity and structural complexity - provides insight into how discoverable works of scientific communications relative to their complexities are or are not accessible to average audiences for comprehension, citation and application purposes. Yet despite such growing interest across academia and publication, the role of collaboration in this process remains relatively unexplored. Many studies assess team science relative to productivity, impact considerations. Still, few consider how team size (the level or absence of collaboration on authorship) - the number of authors - can impact conciseness of argument, depth of diction and informational value across science-related abstracts.\u003c/p\u003e \u003cp\u003eBangladesh represents a timely and relevant space for investigating this further. As a developing country, Bangladesh boasts rapid growth relative to its system of research within the last two decades - growing publication output, growing levels of in-country co-authorship, and growing interdisciplinary connections characterize the developmental and academic/publishing realities of researchers and students across disciplines. Public and private universities support this output while public/government laboratories and specialty research institutes support interdisciplinary connections in the biomedical sciences, engineering, agriculture, public health, and social sciences. However, thus far, no study has ventured to connect how the shifting nature of collaboration within Bangladesh has shifted over time to impact authorial writing and literacy characteristics in the specific features of scientifically published writing.\u003c/p\u003e \u003cp\u003eAbstracts represent a potent space for assessing such change; as relatively shorter pieces to longer works representing content-based papers, abstracts wield methodological appeal via necessity and relevant framing which subsequently inform their own structure and complexity. Abstracts can be simple or complex depending on where they are sent or to what audience they are geared linguistically. Thus, word choice, length and ensuing complexity can come from a collaborative effort of team dynamics which leverage negotiation amongst diverse voices or from limited dynamics.\u003c/p\u003e \u003cp\u003eThus far, research exists relative to scientific impact and team size (Larivi\u0026egrave;re et al., 2015); for example, while those teams of larger size accrue the most citations over time, the smaller sized teams present novel concepts or disruptive findings (Wu, Wang \u0026amp; Evans, \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e2019\u003c/span\u003e). Comparative studies note readability - Plav\u0026eacute;n-Sigray et al., (\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) - is decreasing over time generally. Yet little effort has been exerted to assess how these two advancements are connected; does team size impact readability? Does the team size on a spectrum operate linearly when it comes to language features?\u003c/p\u003e \u003cp\u003eThis project aims to fill that void between established contributions to confirm or deny observations related to collaborative writing across developed systems where they join established literature in scientometrics relative to effective team science practices and more generally applicable contributions relevant to scholarly communication. A large corpus study is employed with 25,883 abstracts indexed by Web of Science from 1972\u0026ndash;2023 with authors from institutions within Bangladesh which explores length, readability and lexical diversity across different collaboration size tiers, specifically one author. One author collaboration exists.\u003c/p\u003e"},{"header":"Review of Literature","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eCollaboration Dynamics and Global Scientific Output\u003c/h2\u003e \u003cp\u003eCollaborative science has shifted considerably over decades in response to levels of specialization, greater facility ability, and international competition. The team-oriented nature of research functions as the primary means of knowledge production for any given field and channels the configuration of outputs, patterns of impact and communication. Internationally, large-scale studies indicate an increasingly progressive, global trend toward expanded collaboration. Gazni, Sugimoto and Didegah (\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2011\u003c/span\u003e) cite significant increases in small and medium collaborative teams in a sample of over 14\u0026nbsp;million records in the Web of Science while Adams et al. (\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2005\u003c/span\u003e) find a 50% increase in collaborative team size in U.S. universities from 1981\u0026ndash;1999. Such findings help establish the notion that science exists in a highly complexly structured, interdisciplinary setting today where institutions facilitate this collaborative means of research.\u003c/p\u003e \u003cp\u003eThis is especially true in emerging research countries like Bangladesh where collaboration patterns, institutional capacity and age of research and communication patterns develop in a new, rapidly growing setting.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eGrowth of Team Science and Evolution of Collaboration Structures\u003c/h3\u003e\n\u003cp\u003eSuch cumulative findings support unit size and collaborative component due to technological, institutional, national and disciplinary factors over the past twenty years. For example, Gazni et al. (\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e2011\u003c/span\u003e) note that various increases were noted in small- and medium-size groups as technological advancements allow information to be sent to larger ranges without major costs reducing for necessary or Adams et al. (\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2005\u003c/span\u003e) suggests that although collaboration is high (even within the U.S.), levels which boast significant increased over the course of the twenty-year study boast highly productive increases of papers within the U.S.\u003c/p\u003e \u003cp\u003eAccording to Gazni and Didegah (\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2011\u003c/span\u003e), Harvard University boasts 88% of papers produced with collaborative efforts and papers produced across institutions boast extremely high levels of citations as well. Furthermore, Franceschet and Costantini (\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2010\u003c/span\u003e) relative to 20 different disciplines studied in Italy reveal that collaboration, team size, potential for citation and potential for peer-review increase in a positive fashion, as do all variables.\u003c/p\u003e \u003cp\u003eYet there's a variable - discipline. For example, Abt (\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2007\u003c/span\u003e) asserts that international collaboration via publication is field dependent - international collaboration accounted for 13% of published papers in surgery in 2007 while 55% of papers in astronomy - pointing to some geopolitical interdisciplinary epistemology at play and Didegah \u0026amp; Gazni (2014) assess national collaboration within niche fields while international collaboration is an asset for international visibility and eventual citation potential.\u003c/p\u003e \u003cp\u003eOverall, these cumulative findings suggest that unit size and collaborative component is based on technological, institutional, national and disciplinary factors - all pertinent to the specific research field in Bangladesh's developing research.\u003c/p\u003e\n\u003ch3\u003eCollaboration Networks and Their Influence on Impact and Innovation\u003c/h3\u003e\n\u003cp\u003eBut network structure is just as important as network size. Liu et al. (2024) note that the more \"small-world characteristics\" (high clustering and low path length) teams have, the more they influence publications. Furthermore, levels of national collaboration reliance vary. For instance, the Scientific Dependence Index shows that the smaller systems (Cuba, for instance) rely more heavily upon international collaboration while larger systems boast more balanced output - although this, too, is subject to geopolitical and historical developments.\u003c/p\u003e \u003cp\u003eLikewise, in Europe, Zitt et al. (2020) found that long-term political, linguistic and historical connections help understand how collaboration occurs on a bilateral basis. Thus, the fact that Bangladesh is trending more reliant on international networks over time suggests levels of potentially impacted research novelty and language use in scientific publications.\u003c/p\u003e \u003cp\u003eBut collaboration isn't always helpful. Mongeon and Larivi\u0026egrave;re (\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2015\u003c/span\u003e) found that collaborators of researchers responsible for retracted publications experience drastic declines in future publication rates and citation impact which indicates caution when deciding to co-author with someone else.\u003c/p\u003e \u003cp\u003eUltimately, while many authors today contribute to discoveries, small author teams have breakthroughs uniquely tied to them. Winnink et al. (\u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2015\u003c/span\u003e) found via the development of the ubiquitin-mediated proteolytic system that concentrated collaboration results in accelerated conceptizations over time.\u003c/p\u003e\n\u003ch3\u003eText Complexity, Readability, and Scientific Communication\u003c/h3\u003e\n\u003cp\u003eResearch on linguistic characteristics of scientific literature contributes to an additional layer of exploring scientific communication. According to Dressler et al. (\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2025\u003c/span\u003e), readability is relative, based upon the alignment of linguistic characteristics with textual density and cognitive burden. The denser the language used, the more cognitive effort required, and the lower the potential for comprehension.\u003c/p\u003e \u003cp\u003eFindings relative to scientific writing implicate decline in readability, regardless of discipline. For example, in the findings relative to plain language summaries, it was determined by Lang et al. (\u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2025\u003c/span\u003e) that many plain language summaries fell above recommended levels of readability and discipline-specific terms frequently connected to interdisciplinary efforts and large research teams.\u003c/p\u003e \u003cp\u003ePlav\u0026eacute;n-Sigray et al. (\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) report a similar decline in scientific readability from 1881 to 2015 across 12 dominant fields of inquiry. Furthermore, linguistic findings support that collaborative writing reduces lexical variety and encourages convergence into homogeneity of word choice (Lu et al., \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Song et al., \u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e2023\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eTherefore, this data supports that collaborative writing may have an impact on the density of linguistic characteristics.\u003c/p\u003e\n\u003ch3\u003eResearch Context and Gaps in Bangladesh\u003c/h3\u003e\n\u003cp\u003eHowever, with Bangladeshi research on the rise, the following is true for the literature:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eMany bibliometric works have not accounted for collaborative information across institutions of Bangladesh.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eNo corroborating empirical research exists for the reader effect of group size and text complexity (readability, lexical diversity).\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eNo scientifically standardized national scope scientometric study exists that links collaboration and linguistic variables on a sample of scientific abstracts.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eFew studies exist that assess the ability of authors to shift their writing strategies in accordance with collaborative means.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003eThus, as research comes out of Bangladesh at a faster pace - higher collaborative rates, more international collaborations and institutions of varying strengths - now is the time to explore the nuances of abstract construction.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eSynthesis and Conceptual Framework\u003c/h2\u003e \u003cp\u003eThese three themes emerge from the above literature review:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eTeam size and collaborative configuration affect scientific visibility and citations and impact measures, novelty (Franceschet \u0026amp; Costantini, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2010\u003c/span\u003e; Gazni \u0026amp; Didegah, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2011\u003c/span\u003e; Liu et al., 2024).\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eCollaboration patterns suggest research capacity exists based on previous engagement with certain countries and field tendencies (Abt, \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2007\u003c/span\u003e; Didegah \u0026amp; Gazni, 2014; Ronda-Pupo, 2025).\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eLexical complexity suggests cognitive exertion and communicative success, which comes from team size and collaborative writing effort (Dressler et al., \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2025\u003c/span\u003e; Lang et al., \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e2025\u003c/span\u003e).\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003eTherefore, these three themes of literature support the idea that team size will correlate with abstract clarity, lexical diversity and length, especially within an underdeveloped research context such as Bangladesh.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eCollaboration Structures and Citation Impact\u003c/h3\u003e\n\u003cp\u003eThe findings of the research are overwhelmingly positive. Externally, research from external sources (Gazni \u0026amp; Didegah, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2011\u003c/span\u003e) and research from select universities across Italy (Franceschet \u0026amp; Costantini, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2010\u003c/span\u003e) indicates that where collaboration is concerned - from group size to citation usefulness - view it as a favorable, correlational study. Internally, the field research (Abt, \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2007\u003c/span\u003e; Xu et al., \u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e2021\u003c/span\u003e) indicates that increased collaboration in the personal lives of many people only improves levels of collaboration in research.\u003c/p\u003e \u003cp\u003eHowever, where collaboration and impact are concerned, little research exists in comparison to team science and linguistic or written pertinence to one paper. For example, impact relative to abstract familiarity or lexical diversity has not been researched as it pertains to Bangladesh for no relative research exists to date.\u003c/p\u003e \u003cp\u003eTherefore, this research is necessary.\u003c/p\u003e\n\u003ch3\u003eObjectives of the Study\u003c/h3\u003e\n\u003cp\u003eThis study aims to:\u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eInvestigate collaborative tendencies in the linguistic construction of Bangladeshi scientific abstracts.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eExamine levels of linguistic complexity (readability, lexical variety, syntactic complexity and semantic complexity) in Bangladeshi scientific abstracts.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eEstablish whether team size is an influencing factor for shifts in textual complexity.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eDiscuss socio-academic drivers of collaboration within Bangladesh.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e "},{"header":"Methods","content":"\u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003cdiv id=\"Sec12\" class=\"Section3\"\u003e \u003ch2\u003eData Source and Pre-processing\u003c/h2\u003e \u003cp\u003eThe dataset used for the study was created from the WoS (Web of Science) Core Collection. It encompasses all publications that have at least one author affiliated with a Bangladeshi institution. The first step of data cleaning was performed in R (v4.3.1). Records without abstracts, empty abstracts and abstracts below 50 characters in length were removed from the study - this ensures that the papers are sufficient for a large enough language study. The study's final analytical dataset consisted of 25,883 scientific abstracts after this stage.\u003c/p\u003e \u003cp\u003eThe study estimated team size by assessing the number of authors present in the WoS field designated for authors. For example, authors separated by semicolon, semi-colon, were each counted for contribution; those without any data were counted as one author. Thus, the publications were classified into a type of collaboration category: \u003cem\u003eSolo\u003c/em\u003e (1 author), \u003cem\u003eSmall team\u003c/em\u003e (2 authors), \u003cem\u003eMedium team\u003c/em\u003e (3\u0026ndash;5 authors), \u003cem\u003eLarge team\u003c/em\u003e (6\u0026ndash;10 authors), \u003cem\u003eVery large team\u003c/em\u003e (\u0026gt;\u0026thinsp;10 authors). These categories were treated as ordered factors in further analysis.\u003c/p\u003e \u003c/div\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003eLinguistic Feature Extraction\u003c/h2\u003e \u003cp\u003eLanguaging was conducted via the quanteda package (Benoit et al., \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2018\u003c/span\u003e). A corpus object was created from the authenticated abstracts with each abstract being its own document.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eLength and Structural Metrics\u003c/strong\u003e \u003cp\u003eThe following textual variables were calculated per document.\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eTotal number of words: total number of character strings excluding whitespace\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eTotal number of sentences: estimated via terminal punctuation (., !, ?)\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eThey were standards of equivalence approximations for extent and density of form.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eReadability Measures\u003c/strong\u003e \u003cp\u003eReadability was assessed using textstat_readability(). Three widely used indices were computed\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003col\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eFlesch Reading Ease (FRE): scores from 0\u0026ndash;100, where higher scores indicate greater readability.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eFlesch\u0026ndash;Kincaid Grade Level (FKGL): estimates the U.S. school grade required to comprehend the text.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003cspan\u003e \u003cli\u003e \u003cp\u003eGunning FOG Index: estimates years of formal education needed for first-pass understanding.\u003c/p\u003e \u003c/li\u003e \u003c/span\u003e \u003c/ol\u003e \u003c/p\u003e \u003cp\u003e \u003cstrong\u003eLexical Diversity\u003c/strong\u003e \u003cp\u003eThe Type-Token Ratio (TTR) was calculated to represent lexical diversity. The same tokenization process occurred where the abstracts' punctuation, numbers, and special characters were removed, and the case was transformed into lowercase. Then, textstat_lexdiv() was applied to measure this variable\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cdiv id=\"Equa\" class=\"Equation\"\u003e \u003cdiv format=\"TEX\" class=\"mathdisplay\" id=\"FileID_Equa\" name=\"EquationSource\"\u003e\n$$\\:\\text{T}\\text{T}\\text{R}=\\frac{\\text{U}\\text{n}\\text{i}\\text{q}\\text{u}\\text{e}\\:\\text{w}\\text{o}\\text{r}\\text{d}\\:\\text{t}\\text{y}\\text{p}\\text{e}\\text{s}}{\\text{T}\\text{o}\\text{t}\\text{a}\\text{l}\\:\\text{w}\\text{o}\\text{r}\\text{d}\\:\\text{t}\\text{o}\\text{k}\\text{e}\\text{n}\\text{s}}$$\u003c/div\u003e \u003c/div\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec14\" class=\"Section2\"\u003e \u003ch2\u003eStatistical Analysis and Visualization\u003c/h2\u003e \u003cp\u003eAll tests were run in R. Descriptive statistics (mean, sd) for all linguistic variables were acquired across all five team-size groups using the dplyr package.\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eCorrelation Matrix\u003c/strong\u003e \u003cp\u003eTo assess linear relationships between team size and abstract length (tokens) versus readability measures and lexical diversity, a Pearson correlation matrix was constructed.\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cstrong\u003eGroup Comparisons\u003c/strong\u003e \u003cp\u003eA one-way ANOVA was performed to compare the means of linguistic features across collaboration groups (where FRE was the dependent variable, and team size category was the grouping factor). This assessed whether different levels of collaboration impact readability.\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cstrong\u003eVisualizations\u003c/strong\u003e \u003cp\u003eAll visualizations were constructed with ggplot2, which subsequently had a publication-ready theme applied. ggplot2 created the following output\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eA bar chart visualizing team sizes of papers with frequency\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eA violin\u0026thinsp;+\u0026thinsp;box plot showing the distributions of the readability/length and lexical variables\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eA heat map denoting correlations between all linguistic variables\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eLine plots per year (2019\u0026ndash;2024)\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eVisualizations helped interpret any collaboration-linguistic relationships.\u003c/p\u003e \u003c/div\u003e"},{"header":"Results","content":"\u003cdiv id=\"Sec16\" class=\"Section2\"\u003e \u003ch2\u003eTeam Size Distribution\u003c/h2\u003e \u003cp\u003eAn investigation into the 25,883 works reveals a shift towards more collaborative authorship. According to Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, medium (3\u0026ndash;5 authors) and large (6\u0026ndash;10 authors) collaborative efforts dominate as 10,165 (39%) and 9,869 (38%) papers were published by medium and large collaborative efforts, respectively. Only 500 of the 25,883 works (approximately 2%) represent solo efforts.\u003c/p\u003e \u003cp\u003eThis reflects, to a degree, the collaborative efforts of publication from Bangladesh as over time, more and more works are published by teams rather than individuals.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec17\" class=\"Section2\"\u003e \u003ch2\u003eTeam Size and Abstract Characteristics\u003c/h2\u003e \u003cp\u003eInterestingly, a specific relationship emerged between team size and abstract structure. Not only did grand means of an abstract length greater for team size, but the means of word counts per level of team size occurred more frequently, too. The grand mean was 198.62 words for 1 author and 266.99 for extremely large teams (\u0026gt;\u0026thinsp;10). This means that the larger the team, the more extensive and more likely and encompassing abstract sent is as more people contribute more information to include.\u003c/p\u003e \u003cp\u003eIn contrast, lexical diversity decreased by team size on the basis of Type-Token Ratio (TTR). The average TTR score of the smaller (1.54), (1) and the extremely large teams (\u0026gt;\u0026thinsp;10) (0.56). This means that there are less lexically diverse options made in collaboration abstractions than there are in those made by one author.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eSummary of Abstract Metrics by Team Size Category\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"6\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c6\" colnum=\"6\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eTeam Size Category\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eN Papers\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eAvg. Word Count\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eAvg. Flesch Score\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eAvg. F-K Grade\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c6\"\u003e \u003cp\u003eAvg. TTR\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSolo (1)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e500\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e198.62\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e19.98\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e16.74\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.59\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eSmall team (2)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e1773\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e206.28\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e21.02\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e16.47\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.58\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eMedium team (3\u0026ndash;5)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e10165\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e218.91\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e21.1\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e16.51\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.58\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eLarge team (6\u0026ndash;10)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e9869\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e237.03\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e21.58\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e16.52\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.57\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e \u003cp\u003eVery large team (\u0026gt;\u0026thinsp;10)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c2\"\u003e \u003cp\u003e3576\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e266.99\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e20.45\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e16.82\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c6\"\u003e \u003cp\u003e0.56\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003ctfoot\u003e \u003ctr\u003e\u003ctd colspan=\"6\"\u003e\u003cem\u003eNote\u003c/em\u003e. F-K Grade\u0026thinsp;=\u0026thinsp;Flesch-Kincaid Grade Level; TTR\u0026thinsp;=\u0026thinsp;Type-Token Ratio.\u003c/td\u003e\u003c/tr\u003e \u003ctr\u003e\u003ctd colspan=\"6\"\u003eTwo trends are particularly evident:\u003c/td\u003e\u003c/tr\u003e \u003c/tfoot\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eAbstract length increases with team size (∆ \u0026asymp; +68 words between solo teams and very large teams)\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eLexical diversity decreases as TTR scores decline.\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eThese two conclusions indicate that the greater the collaborative effort, the greater the likelihood of a longer - but lexically unimpressive - abstract.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec18\" class=\"Section2\"\u003e \u003ch2\u003eTeam Size and Readability\u003c/h2\u003e \u003cp\u003eReadability was related to team size, non-linearly. A one-way ANOVA established differences in FRE across the five types of collaboration, F(4, 25,878)\u0026thinsp;=\u0026thinsp;13.55, p\u0026thinsp;\u0026lt;\u0026thinsp;.001.\u003c/p\u003e \u003cp\u003eUltimately, the most readable abstracts came from large collaborations (6\u0026ndash;10 authors) with a mean FRE of 21.58. However, it's essential to realize that FRE decreases slightly for very large collaborations (\u0026gt;\u0026thinsp;10 authors) meaning that this trend approaches linguistic complexity at excessively collaborative stages.\u003c/p\u003e \u003cp\u003eThe violin plots below in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e show how FREs lie within each team, revealing much spread within groups but consistently low values in the absolute sense - typical of scientific prose where anything lower than 30 is considered technical, graduate student language.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec19\" class=\"Section2\"\u003e \u003ch2\u003eCorrelations between Linguistic Features\u003c/h2\u003e \u003cp\u003eCorrelation tests provided additional statistically significant observations between team size, word count, readability, and lexical diversity (see Fig.\u0026nbsp;\u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e). For example, a moderately strong positive correlation existed between team size and word count (r\u0026thinsp;=\u0026thinsp;.42), indicating that the larger the team, the more words were used to construct the abstract. In addition, as anticipated, a slightly negative correlation existed between team size and lexical diversity (TTR) (r\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;.18), since the more authorship dispersed in works, the smaller the vocabulary used, a phenomenon observed in collaborative works and lexical convergence in scientometric research (Mongeon \u0026amp; Larivi\u0026egrave;re, \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Winnink et al. \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2015\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eMetrics for readability told a different story. A correlation matrix found a correlation between FRE and word count (r\u0026thinsp;=\u0026thinsp;.21) and a correlation with team size (r\u0026thinsp;=\u0026thinsp;.14). This FRE finding in other words means that perhaps slightly longer papers create clarity of exposition. Furthermore, FRE also implies that teams larger than ten are conducive to a better understanding of average reading level materials with more authors. Similarly, FKGL and Gunning FOG Index are positively correlated with team size (r\u0026thinsp;=\u0026thinsp;.14 and r\u0026thinsp;=\u0026thinsp;.17 respectively). Thus, the more authors on a team the more complicated the writing is and required higher levels of reading comprehension. TTR is lexical diversity and this was negatively correlated with both the FOG Index (r\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;.22) and FKGL (r\u0026thinsp;=\u0026thinsp;\u0026minus;\u0026thinsp;.19) meaning papers with less lexically diverse vocabularies are also more complicated and conceptually dense. This makes sense in linguistic terms and with cognitive load theory (Mongeon \u0026amp; Larivi\u0026egrave;re, \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2015\u003c/span\u003e). Ultimately, these findings suggest a nuanced complex of collaborative and scientific writing.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec20\" class=\"Section2\"\u003e \u003ch2\u003eTemporal Trends in Readability (2019\u0026ndash;2024)\u003c/h2\u003e \u003cp\u003eIn order to measure whether the linguistic variables changed over time, a temporal assessment of the trend was performed for 2019\u0026ndash;2024. The trend of FRE and TTR per team size is in Fig.\u0026nbsp;\u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e.\u003c/p\u003e \u003cp\u003eThe major findings from this assessment are as follows:\u003c/p\u003e \u003cp\u003e \u003cstrong\u003eDecrease in Readability\u003c/strong\u003e \u003cp\u003eFRE had an overall decrease over the five-year span (average ΔFRE = -1.8). This means increasingly dense scientific abstracts over the course of time years are less readable. This not only complements findings abroad that show decreases over time (Mongeon \u0026amp; Larivi\u0026egrave;re, \u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Winnink et al. \u003cspan citationid=\"CR38\" class=\"CitationRef\"\u003e2015\u003c/span\u003e), but suggests that researchers are subject to these pressures internationally.\u003c/p\u003e \u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003e \u003cem\u003eDecrease in Lexical Variety\u003c/em\u003e: TTR showed a nominal decrease over this time frame (ΔTTR \u0026asymp; -0.02), meaning there is a standardization of vocabulary that likely means a homogeneity of writing style as well.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003e \u003cem\u003eTeam Size Distribution Consistency\u003c/em\u003e: Regardless of annual increases and decreases in total published articles, the percent makeup of single author, small, medium, large and very large teams remained relatively proportionate and consistent with other years.\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003cp\u003eThe increase of technicality and specialization to an increasingly less readable extent resonates with researchers internationally as international findings show a similar level of formality in interdisciplinary and increasingly specialized scientific communications. Therefore, the temporal findings suggest researchers similarly find themselves pressured over time by increasingly substantive events.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eThis interdisciplinary study offers a comprehensive scientometric-linguistic analysis of the collaborative relationships that shape the linguistic characteristics of scientific abstracts from Bangladeshi authors. Drawing on the results of a dataset of 25,883 WoS publications, this study finds major characteristics where team size is appropriately matched to standardized type/token ratio (TTR), readability and length of abstract.\u003c/p\u003e \u003cp\u003eFirst, the relationship between team size and length of abstract is positively correlated. This implies that where authors are collaborating with more other authors, their abstracts become increasingly succinct presentations. Such an increase would be due to the ability for these collaborative contributions to find a balance between diverse approaches and larger designs necessitating increased understanding. Thus, with a multi-authored approach, a greater number of ideas can be posited, meaning that abstract length mirrors the complexity and intellectual comprehension expected of such analyses.\u003c/p\u003e \u003cp\u003eHowever, as team size increases, TTR decreases. This suggests that where authors are working together in a multi-author context, there is a similar lexicon used. This may be due to a consensus effect that teams who collaborated arrived at less specialized language that would be accessible in efforts to appeal to the diverse writing styles and discipline approaches. The results are aligned with findings of collaborative writing where groups of authors writing from drafts have less variation in lexical choices as authors work through a common voice. While this voice may be clearer, it is also homogenized to such an extent that it foregoes stylistic differences for eloquent economy that is more pronounced in larger teams.\u003c/p\u003e \u003cp\u003eThe relationship between team size and readability is not necessarily linear. Teams of 6\u0026ndash;10 authors have the highest readability - ironically compared to the extensive abilities of team dynamics - where it seems a happy median is found between enough persons for internal peer review and expanded editorial management yet not so many that there's a breakup in how to convey information. Excessive teams (\u0026gt;\u0026thinsp;10 authors) present with lower readability. This indicates that collaborative writing was not managed as effectively along with continued variations in writing and shifts in abstract purpose. This aligns with theoretical explorations of collaborative communication that suggests the larger the team, the less clear it becomes.\u003c/p\u003e \u003cp\u003eSecond, the negative correlation to readability relative to time from 2019\u0026ndash;2024 aligns with global trends. Researchers like Plav\u0026eacute;n-Sigray et al. (\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) found that from global access trends, general scientific writing has become less readable relative to increased specialization, author competitiveness and advanced focus on jargon use within the discipline. That Bangladesh finds itself amidst similar trends, however, suggests that as applied linguistic pressures penetrate modern science, technical density and complex methodologies and international publication patterns have permeated developed research fields just as much.\u003c/p\u003e \u003cp\u003eUltimately, clear trends emerge whereby collaboration affects scientific writing in various directions. While larger teams do create longer abstracts, they also rely on standardized vocabulary; medium-sized teams create clearer works relative to small sized or excessively sized teams - while global literacy finds itself increasingly downward trending in interdisciplinary endeavours suggests that the collaborative approaches are unlikely to be clear to greater public audiences.\u003c/p\u003e \u003cp\u003eThese results are important for Bangladeshi scholars - a burgeoning collaborative research field of academic investigators - for subsequent endeavors. As more and more collaborative and interdisciplinary creations are made, an intention on the results regarding authorship tendencies and editorial group considerations for appropriate dissertation and article development will be of great use without extended lengths of abstracts. Furthermore, a more cohesive effort in communication will bolster worldwide recognition of articles by Bangladeshi scholars.\u003c/p\u003e"},{"header":"Conclusion","content":"\u003cp\u003eThis research confirms that research team size is a significant predictor of Bangladeshi authors' language features of abstracts. With the presumption of larger collaborative teams being the most common collaborative authorship approach to what's currently out there, it seems that excessively sized contributions lend to longer - and thus less readable - abstracts. Abstracts with more like 5\u0026ndash;10, however, have the potential for greater conceptual cohesion although it's determined that abstract language is more similarly aligned. Readability, however, is even more nuanced - not only is it not linear with the size of the team, but a substantial readable average can be attributed to a specific size (6\u0026ndash;10). Average sized contributions with these authors produced better figures in legibility and syllable count than single written and those coalesced through excessive collaboration (11+).\u003c/p\u003e \u003cp\u003eAt the same time, a relative reduction in readability across the board suggests an international phenomenon as specialized subfield researchers grow more specialized, developing more intricate methodologies and discipline-specific lexicon; this subfield position makes sense within a relatively new national research context like Bangladesh as it suggests needing help to create clear scientific writing practices and understandable practices through workshops, editorial support, and institutional development.\u003c/p\u003e \u003cp\u003eThus, these findings would be generalizable through scientific writing workshops and optimal collaborative authorship conclusions that would be supported by national policies surrounding research. Early-career researchers would benefit from scientific writing workshops that promote appropriate collaborative styles, giving all researchers from Bangladesh awareness to the pros and cons of authorship on collaborative papers/literature. Nationally, best practice promotion of publication through clear communication would make scientifically minded pieces from Bangladesh come across internationally under a better (or at least preferred, clearer) light.\u003c/p\u003e \u003cp\u003eFuture studies would seek to determine intentions from collaborative writers as to this phenomenon, longitudinal cross-sections of years with full manuscript assessments for within-team comparisons over the same years accessed per team size, interdisciplinary differences for national comparisons or international variances on abstract findings. All would explore collaborative connections between scientific communication in broader contexts.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eFunding:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;The authors received no financial support for the research, authorship, and/or publication of this article.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflicts of Interest / Competing Interests:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;The authors declare that there are no conflicts of interest.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthics Approval:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;Not applicable. No human participants, animals, or sensitive data were involved.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent to Participate:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;Not applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent for Publication:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;Not applicable.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of Data and Materials:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;The dataset derived from Web of Science is available from the corresponding author upon reasonable request.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eCode Availability:\u0026nbsp;\u003c/strong\u003e\u003cbr\u003e\u0026nbsp;R scripts used for text mining, readability analysis, and visualization are available from the authors upon request.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026rsquo; Contributions (CRediT):\u003c/strong\u003e\u003c/p\u003e\n\u003cul\u003e\n \u003cli\u003e\u003cstrong\u003eS. Ravikumar:\u003c/strong\u003e Conceptualization; Methodology; Data Curation; Formal Analysis; Visualization; Writing \u0026ndash; Original Draft.\u003c/li\u003e\n \u003cli\u003e\u003cstrong\u003eM. Mezbah-ul-Islam:\u003c/strong\u003e Supervision; Validation; Writing \u0026ndash; Review \u0026amp; Editing; Interpretation.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003e\u003cstrong\u003eChandrappa:\u0026nbsp;\u003c/strong\u003eWriting \u0026ndash; review \u0026amp; editing;\u003cstrong\u003e\u0026nbsp;Resources;\u0026nbsp;\u003c/strong\u003eValidation\u003cstrong\u003e,\u0026nbsp;\u003c/strong\u003eInvestigation; Formal analysis; Writing \u0026ndash; Original Draft.\u003c/li\u003e\n\u003c/ul\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eAbt, H. A. (2007). The frequencies of multinational papers in various sciences. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e72\u003c/em\u003e(1), 105\u0026ndash;115. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-007-1710-5\u003c/span\u003e\u003cspan address=\"10.1007/s11192-007-1710-5\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAdams, J. D., Black, G. C., Clemmons, J. R., \u0026amp; Stephan, P. E. (2005). Scientific teams and institutional collaborations: Evidence from U.S. universities, 1981\u0026ndash;1999. \u003cem\u003eResearch Policy\u003c/em\u003e, \u003cem\u003e34\u003c/em\u003e(3), 259\u0026ndash;285. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.respol.2005.01.014\u003c/span\u003e\u003cspan address=\"10.1016/j.respol.2005.01.014\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAngulo-Tuesta, A., \u0026amp; Santos, L. M. P. (2015). Evaluation of the impact of maternal and neonatal morbidity and mortality research funded by the Ministry of Health in Brazil. \u003cem\u003eResearch Evaluation\u003c/em\u003e, \u003cem\u003e24\u003c/em\u003e(1), 1\u0026ndash;11. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/reseval/rvv022\u003c/span\u003e\u003cspan address=\"10.1093/reseval/rvv022\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eAnte, L. (2022). The relationship between readability and scientific impact: Evidence from emerging technology discourses. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e16\u003c/em\u003e, 101252. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2022.101252\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2022.101252\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBao, T., Zhao, Y., Mao, J., \u0026amp; Zhang, C. (2025). Examining linguistic shifts in academic writing before and after the launch of ChatGPT: A study on preprint papers. \u003cem\u003eScientometrics.\u003c/em\u003e \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-025-05341-y\u003c/span\u003e\u003cspan address=\"10.1007/s11192-025-05341-y\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBenoit, K., Watanabe, K., Wang, H., Nulty, P., Obeng, A., M\u0026uuml;ller, S., \u0026amp; Matsuo, A. (2018). quanteda: An R package for the quantitative analysis of textual data. \u003cem\u003eJournal of Open Source Software\u003c/em\u003e, \u003cem\u003e3\u003c/em\u003e(30), 774. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.21105/joss.00774\u003c/span\u003e\u003cspan address=\"10.21105/joss.00774\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBoyack, K. W., van Eck, N. J., Colavizza, G., \u0026amp; Waltman, L. (2018). Characterizing in-text citations in scientific articles: A large-scale analysis. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e12\u003c/em\u003e(1), 59\u0026ndash;73. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2017.11.005\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2017.11.005\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBu, Y., Ding, Y., Liang, X., \u0026amp; Murray, D. (2018). Understanding persistent scientific collaboration. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e69\u003c/em\u003e(4), 480\u0026ndash;493. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.23966\u003c/span\u003e\u003cspan address=\"10.1002/asi.23966\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eChen, C. (2012). Predictive effects of structural variation on citation counts. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e63\u003c/em\u003e(3), 431\u0026ndash;449. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.21694\u003c/span\u003e\u003cspan address=\"10.1002/asi.21694\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDeng, H., Wu, C., Ge, B., \u0026amp; Wu, H. (2025). Quantification and identification of authorial writing style through higher-order text network modeling. \u003cem\u003eJournal of Informetrics.\u003c/em\u003e \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2024.101603\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2024.101603\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDong, S., Mao, J., Ke, Q., \u0026amp; Pei, L. (2024). Decoding the writing styles of disciplines: A large-scale quantitative analysis. \u003cem\u003eInformation Processing \u0026amp; Management\u003c/em\u003e, \u003cem\u003e61\u003c/em\u003e, 103718. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ipm.2024.103718\u003c/span\u003e\u003cspan address=\"10.1016/j.ipm.2024.103718\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDressler, W., et al. (2025). Textual complexity and readability in scientific communication: A cognitive-linguistic perspective. \u003cem\u003eJournal of Technical Writing and Communication\u003c/em\u003e, \u003cem\u003e55\u003c/em\u003e(3), 413\u0026ndash;439.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFranceschet, M., \u0026amp; Costantini, A. (2010). The effect of scholar collaboration on impact and quality of academic papers. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e87\u003c/em\u003e(2), 295\u0026ndash;306. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-010-0310-4\u003c/span\u003e\u003cspan address=\"10.1007/s11192-010-0310-4\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGazni, A., Sugimoto, C. R., \u0026amp; Didegah, F. (2011). Mapping world scientific collaboration: Authors, institutions, and countries. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e63\u003c/em\u003e(2), 323\u0026ndash;335. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.21688\u003c/span\u003e\u003cspan address=\"10.1002/asi.21688\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGazni, A., \u0026amp; Didegah, F. (2011). Investigating different types of research collaboration and citation impact: A case study of Harvard University publications. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e87\u003c/em\u003e(2), 251\u0026ndash;265. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-010-0318-9\u003c/span\u003e\u003cspan address=\"10.1007/s11192-010-0318-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGui, W., Wang, H., Sun, X., \u0026amp; Wang, N. (2022). Linguistic complexity in scientific writing: A large-scale diachronic study from 1821 to 1920. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e127\u003c/em\u003e(8), 4327\u0026ndash;4348. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-022-04550-z\u003c/span\u003e\u003cspan address=\"10.1007/s11192-022-04550-z\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHasegawa, Y., et al. (2024). A cell competition system with one gene expression from a single-copy gene in one cell. \u003cem\u003ePLOS ONE\u003c/em\u003e, \u003cem\u003e19\u003c/em\u003e, e302451. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1371/journal.pone.0302451\u003c/span\u003e\u003cspan address=\"10.1371/journal.pone.0302451\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHayati, Z., \u0026amp; Didegah, F. (2010). International scientific collaboration of Iran with other countries: A scientometric study. \u003cem\u003eJournal of Information Science\u003c/em\u003e, \u003cem\u003e36\u003c/em\u003e(4), 482\u0026ndash;491. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1177/0165551509351430\u003c/span\u003e\u003cspan address=\"10.1177/0165551509351430\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHirsch, J. E. (2010). An index to quantify an individual\u0026rsquo;s scientific research output that takes into account the effect of multiple coauthorship. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e85\u003c/em\u003e(3), 741\u0026ndash;754. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-010-0193-9\u003c/span\u003e\u003cspan address=\"10.1007/s11192-010-0193-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHoffman, E., Li, Y., Coutts, R. T., \u0026amp; Maris, S. (2024). Leveraging AI for educational content consistency: Metadata generation for MOOC development. \u003cem\u003eIn IEEE DEMOcon 2024\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1109/DEMOCON63027.2024.10747877\u003c/span\u003e\u003cspan address=\"10.1109/DEMOCON63027.2024.10747877\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLang, S., Walter, T., \u0026amp; Rodrigues, R. (2025). Readability and jargon in scientific plain-language summaries. \u003cem\u003ePLOS ONE\u003c/em\u003e, \u003cem\u003e20\u003c/em\u003e, e0302451.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLancho-Barrantes, B. S., Guerrero-Bote, V. P., Chinchilla-Rodr\u0026iacute;guez, Z., \u0026amp; de Moya-Aneg\u0026oacute;n, F. (2012). Citation flows in the zones of influence of scientific collaborations. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e63\u003c/em\u003e(3), 481\u0026ndash;489. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.21682\u003c/span\u003e\u003cspan address=\"10.1002/asi.21682\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLarivi\u0026egrave;re, V., Sugimoto, C. R., \u0026amp; Gingras, Y. (2014). Team size matters: Collaboration and scientific impact since 1900. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e65\u003c/em\u003e(4), 1323\u0026ndash;1332. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.23266\u003c/span\u003e\u003cspan address=\"10.1002/asi.23266\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu, C., Bu, Y., Wang, J., Ding, Y., Torvik, V., Schnaars, M., \u0026amp; Zhang, C. (2019). Examining scientific writing styles from the perspective of linguistic complexity. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e70\u003c/em\u003e(5), 462\u0026ndash;475. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.24126\u003c/span\u003e\u003cspan address=\"10.1002/asi.24126\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu, C., Bu, Y., Dong, X., Wang, J., Ding, Y., Larivi\u0026egrave;re, V., Sugimoto, C. R., Paul, L., \u0026amp; Zhang, C. (2019). Analyzing linguistic complexity and scientific impact. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e13\u003c/em\u003e, 100961. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2019.07.004\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2019.07.004\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu, C., Zhang, C., Xiao, C., \u0026amp; Ding, Y. (2022). Contributorship in scientific collaborations: Contribution-based byline orders. \u003cem\u003eInformation Processing \u0026amp; Management\u003c/em\u003e, \u003cem\u003e59\u003c/em\u003e, 102944. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.ipm.2022.102944\u003c/span\u003e\u003cspan address=\"10.1016/j.ipm.2022.102944\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLu, C., Zhang, Y., Ahn, Y.-Y., Ding, Y., Zhang, C., \u0026amp; Ma, D. (2020). Co-contributorship network and division of labor in individual scientific collaborations. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e71\u003c/em\u003e(1), 78\u0026ndash;92. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.24321\u003c/span\u003e\u003cspan address=\"10.1002/asi.24321\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMayukh Bagchi. (2025). Toward generative AI\u0026ndash;driven metadata modeling: A human\u0026ndash;LLM collaborative approach. \u003cem\u003eLibrary Trends\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1353/lib.2025.a961196\u003c/span\u003e\u003cspan address=\"10.1353/lib.2025.a961196\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMongeon, P., \u0026amp; Larivi\u0026egrave;re, V. (2015). The career impact of research fraud: A scientometric analysis. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e66\u003c/em\u003e(2), 205\u0026ndash;217. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.23202\u003c/span\u003e\u003cspan address=\"10.1002/asi.23202\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePan, X., Yan, E., Wang, Q., \u0026amp; Hua, W. (2015). Assessing the impact of software on science: A bootstrapped learning of software entities in full-text papers. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e9\u003c/em\u003e(2), 247\u0026ndash;257. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2015.07.012\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2015.07.012\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePinfield, S., Salter, J., Bath, P. A., Hubbard, B., Millington, P., Anders, J. H. S., \u0026amp; Hussain, A. (2014). Open-access repositories worldwide, 2005\u0026ndash;2012. \u003cem\u003eJournal of the Association for Information Science and Technology\u003c/em\u003e, \u003cem\u003e65\u003c/em\u003e(\u003cem\u003e2\u003c/em\u003e), 240\u0026ndash;256. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1002/asi.23131\u003c/span\u003e\u003cspan address=\"10.1002/asi.23131\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePlav\u0026eacute;n-Sigray, P., Matheson, G. J., Schiffler, B. C., \u0026amp; Thompson, W. H. (2017). The readability of scientific texts is decreasing over time. \u003cem\u003eeLife\u003c/em\u003e, \u003cem\u003e6\u003c/em\u003e, e27725. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.7554/eLife.27725\u003c/span\u003e\u003cspan address=\"10.7554/eLife.27725\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003ePorwal, P., \u0026amp; Devare, M. (2024). Scientific impact analysis: Unraveling the link between linguistic properties and citations. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e18\u003c/em\u003e, 101526. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2024.101526\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2024.101526\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRomanchuk, R., Vysotska, V., Andrunyk, V., Chyrun, L., Chyrun, S., \u0026amp; Brodyak, O. (2023). Intellectual analysis system for Ukrainian-language artistic works. \u003cem\u003eIn Proceedings of CSIT 2023\u003c/em\u003e. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1109/CSIT61576.2023.10324012\u003c/span\u003e\u003cspan address=\"10.1109/CSIT61576.2023.10324012\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSong, N., Chen, K., \u0026amp; Zhao, Y. (2023). Understanding writing styles of scientific papers in IS\u0026ndash;LS: Evidence from abstracts. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e17\u003c/em\u003e, 101377. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2023.101377\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2023.101377\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eStephen, D. (2025). Distinguishing articles in questionable and non-questionable psychology journals. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e19\u003c/em\u003e, 101640. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2025.101640\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2025.101640\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSun, Z., He, D., \u0026amp; Li, Y. (2024). How the readability of manuscripts influences peer review: Biomedical evidence. \u003cem\u003eJournal of Informetrics\u003c/em\u003e, \u003cem\u003e18\u003c/em\u003e, 101547. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1016/j.joi.2024.101547\u003c/span\u003e\u003cspan address=\"10.1016/j.joi.2024.101547\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWinnink, J., Tijssen, R., \u0026amp; Van Raan, A. F. J. (2015). Small team breakthroughs: Collaboration constellations in high-impact science. \u003cem\u003eResearch Evaluation\u003c/em\u003e, \u003cem\u003e24\u003c/em\u003e(3), 352\u0026ndash;366. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1093/reseval/rvv026\u003c/span\u003e\u003cspan address=\"10.1093/reseval/rvv026\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWu, L., Wang, D., \u0026amp; Evans, J. A. (2019). Large teams develop and small teams disrupt science and technology. \u003cem\u003eNature\u003c/em\u003e, \u003cem\u003e566\u003c/em\u003e(7744), 378\u0026ndash;382. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1038/s41586-019-0941-9\u003c/span\u003e\u003cspan address=\"10.1038/s41586-019-0941-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eXu, S., Li, L., An, X., Hao, L., \u0026amp; Yang, G. (2021). Detecting commonality and specialty between scientific publications and patents. \u003cem\u003eScientometrics\u003c/em\u003e, \u003cem\u003e126\u003c/em\u003e(3), 1789\u0026ndash;1806. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.1007/s11192-021-04085-9\u003c/span\u003e\u003cspan address=\"10.1007/s11192-021-04085-9\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"team size, scientific collaboration, readability, lexical diversity, text analysis, scientometrics, collaborative writing, research team dynamics, Bangladesh","lastPublishedDoi":"10.21203/rs.3.rs-8251341/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8251341/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThe geography of scientific collaboration has become increasingly internationalized, transforming where and how research is constructed and communicated. While collaboration has been shown to impact scientific production and impact, much less is known about how it interacts with the linguistic characteristics of scientific communication. This study analyses the extent to which team size has an impact on the linguistic features of scientific abstracts from scholars with Bangladesh affiliations. Using the full population of 25,883 WoS publications (1972\u0026ndash;2023), I assess the differences in three relevant linguistic characteristics - readability, lexical diversity, and abstract length - based on collaboration type (solo, small groups (2 authors), medium groups (3\u0026ndash;5), large groups (6\u0026ndash;10) and very-large groups (\u0026gt;\u0026thinsp;10 authors)). Readability is assessed through the Flesch Reading Ease, Flesch-Kincaid Grade Level and Gunning FOG Index while linguistic diversity is assessed through the Type-Token Ratio (TTR). The findings reveal three primary conclusions. First, lexical diversity is negatively impacted by team size in which greater linguistic inclusivity was found within collaborative endeavour\u0026rsquo;s as team size increased. Second, readability behaves according to a curvilinear pattern whereby 6\u0026ndash;10 author teams produce the easiest-to-read abstracts and \u0026gt;\u0026thinsp;10 author teams produce more concise, complex works. Third, abstract length is positively impacted by team size due to the broader conceptual and methodological scope of collaborative efforts. Mean differences for readability relative to team size are statistically significant (ANOVA; \u003cem\u003ep\u003c/em\u003e\u0026thinsp;\u0026lt;\u0026thinsp;0.001) as a negative trend of reading ease from 2019\u0026ndash;2024 reveals a decrease for all group sizes.\u003c/p\u003e \u003cp\u003eThis study contributes to the emerging field of Scientometrics - by connecting structures of collaboration to assessable linguistic features in practical applied scientific output for communication purposes. Practical applications are made for research evaluation, journal editors and writing specialists who teach novice researchers in burgeoning research fields how to produce such collaborative findings effectively while best anticipating expectations. Ultimately, these results indicate that researchers should welcome access to linguistic features in evaluation of collaborative efforts as communication does not come without cost when collaborative scientific writing occurs.\u003c/p\u003e","manuscriptTitle":"The Dynamics of Collaboration: A Scientometric Study of Team Size and Text Complexity in Scientific Abstracts from Bangladesh","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2026-03-25 19:19:06","doi":"10.21203/rs.3.rs-8251341/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"f419a0ba-6f43-43a0-a172-f501895e591b","owner":[],"postedDate":"March 25th, 2026","published":true,"recentEditorialEvents":[{"type":"editorInvitedReview","content":"","date":"2026-05-16T07:28:19+00:00","index":25,"fulltext":""}],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2026-03-25T19:19:06+00:00","versionOfRecord":[],"versionCreatedAt":"2026-03-25 19:19:06","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-8251341","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8251341","identity":"rs-8251341","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00