{"paper_id":"c7a30fed-554f-4ce7-8695-c169b9b2cac7","body_text":"New insights into the structure of cellulose in plant cell walls | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Article New insights into the structure of cellulose in plant cell walls Paul Dupree, Rosalie Cresswell, Parveen Deralia, Yoshihisa Yoshimi, and 2 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4970084/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract The structure of native plant cellulose, despite its abundance and utility in industry, remains elusive. The cellulose structure of several species was studied using 2D solid-state Nuclear Magnetic Resonance (NMR) of 13 C labelled plants. Six major glucose environments were resolved which are common to the cellulose of poplar wood, spruce wood and grasses. The cellulose structure was maintained in isolated holo-cellulose nanofibrils, allowing more detailed characterisation. There are just two glucose environments within the fibril core which have the same NMR 13 C chemical shifts as tunicate cellulose Iβ. The third major glucose site with a carbon 4 shift near 89 ppm, previously assigned to the fibril interior, is one of four surface glucose environments. These advances allowed us to obtain a more accurate measure of the interior to surface ratio for poplar wood fibrils of 0.5, consistent with an 18 chain microfibril structure having 6 core and 12 surface chains. Biological sciences/Plant sciences/Plant molecular biology Biological sciences/Biochemistry/Structural biology/NMR spectroscopy/Solid-state NMR Biological sciences/Structural biology/NMR spectroscopy/Solid-state NMR Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 8 Figure 9 Introduction Cellulose is the most abundant natural polymer in the world. 1 It forms a major component in the cell walls of plants providing much of their mechanical strength. 2 Its properties are important for the pulp and lumber industries and more recently cellulose has become a major contender as a sustainable alternative to fossil fuels for producing materials, films and biofuels. 3 – 5 Despite the large industrial interest in cellulose, its structure in native plant cell walls remains elusive. Cellulose is a β-1-4-linked polymer of D-glucose in long chains of perhaps 10,000 units. 6 These glucan chains have a 2-fold helical structure which allows them to crystallise via both stacking interactions and a network of hydrogen bonding to form long microfibrils in plant cell walls. 7 Estimates of the diameter of these cellulose microfibrils are between 3–5 nm. 8 In recent work, based on biophysical measurement and advances in understanding the biosynthesis, it is believed that a microfibril is formed from 18–24 glucan chains. 9 – 15 The arrangement of these chains in the microfibril into several stacked sheets also remains unknown, but computational studies have suggested some possible habits. 8 , 16 Cellulose Iα and Iβ are well known forms of crystalline cellulose and have been characterised using both diffraction techniques and solid-state NMR. 17 – 21 Cellulose Iα has a P1 triclinic structure and has non-equivalent glucosyl residues in a cellobiose unit within the same chain, whereas cellulose Iβ has a P2 1 monoclinic structure and its non-equivalent glucosyl residues are in alternating sheets of glucan chains, called centre and origin. 17 , 18 It is known that cellulose from bacteria and some algae have predominantly a cellulose Iα structure whilst tunicate cellulose, which consists of large crystals, is predominantly cellulose Iβ. Although it is possible to obtain high resolution X-ray and neutron diffraction patterns for these larger crystalline forms of cellulose, this is not achievable on native plant cellulose. 22 – 24 The very thin plant microfibrils result in low resolution diffraction patterns such that atomic resolution cannot be achieved. 24 – 26 Solid-state NMR has been used for studying cellulose since the 1980s, mainly using 1D experiments. 27 , 28 It is particularly useful as, in contrast to diffraction, it does not require long-range order and samples can be analysed in situ. Nevertheless, the thin nature of the plant microfibrils means that the surface glucose residues contribute to the NMR spectra far more than in large crystals of tunicates or fibrils of bacteria, complicating attempts to fully assign the spectra. Furthermore, plant samples may contain non-cellulosic cell wall components that contribute to spectral peaks in the cellulose region, as well as a so-called amorphous cellulose component. 29 , 30 The cellulose fibrils also interact with other components in the cell wall which is likely to influence the chemical shifts of cellulose glucosyl residues. 31 – 33 Within these limitations, it is widely thought, based on NMR assignments and the fitting of X-ray diffraction patterns, that cellulose of higher plants is a mixture of mainly cellulose Iβ and some cellulose Iα 28 although there are also several studies that suggest plant cellulose could have its own distinct structure. 28 , 34 , 35 In NMR spectra of cellulose, the carbon shifts of C4 and C6 of the different glucose units in cellulose are split into two main regions which, following Dupree et al., 32 are defined as domain 1 glucosyl residues which have C4 1 and C6 1 shifts of ~ 89 ppm and ~ 65 ppm respectively, and domain 2 glucosyl residues which have C4 2 and C6 2 shifts of ~ 84 ppm and ~ 62 ppm. Domain 1 has been thought to reflect crystalline cellulose as its shifts are close to those of the crystalline cellulose Iα and Iβ 29 , 36 , and these residues are normally also considered interior chains of the microfibril. 34 Domain 2 glucose residues are believed to be surface glucan chains of the cellulose microfibril and are also sometimes described as amorphous. 34 , 36 When restricted to 1D NMR it is only possible to resolve residues in these two domains using C4 and C6 shifts, as the C5 shifts overlap with those from C2 and C3. 32 The C4 cellulose region of the 1D NMR spectrum tends to be reasonably clear of contributions from hemicellulose and pectin etc. and so the ratio of the two domains (C4 1 :C4 2 ) has been widely used as a measure of the crystallinity of cellulose and also to estimate the size of the cellulose microfibril. 37 , 38 It has been suggested 39 that the major source of this change in shift between the two domains arises from a change in the conformation of the hydroxymethyl group of the glucose unit from tg in domain 1 to gt or gg in domain 2 and this has since been supported by several studies using DFT or NMR. 16 , 40 , 41 In recent years, developments in 13 C labelling of plant cell wall materials (and dynamic nuclear polarisation (DNP)) has enabled access to a wealth of 2D Magic Angle Spinning (MAS) NMR experiments that have allowed unparalleled insight into the structure of plant cell walls. 42 These studies have made significant strides in understanding hemicellulose structure and interactions with cellulose in both primary (PCW) and secondary cell walls (SCW). 43 , 44 NMR has been used to identify the 2-fold helical structure of xylan in SCW, and the molecular architecture and water-polymer interactions in softwoods. 31 , 44 , 45 Several distinct glucosyl residues, each with differing chemical shifts due to differing environments within primary cell wall cellulose, were described by Wang et al. who found similar glucose environments in a range of different primary cell wall materials (Brachypodium , Zea mays , and Arabidopsis thaliana ). 46 They reported that glucosyl residues of five environments are within the fibril, and two lie on the surface. They suggested several cellulose microfibril models based on the understanding that the domain 1 glucosyl residues ( tg hydroxymethyl conformation) arise solely from sites in the interior chains of the microfibril. 41 , 47 Under this assumption, and using quantitative NMR, they found that cellulose had too much ‘interior’ relative to surface residues for the probable microfibril size of 18–24 chains. Additionally, some of the interior residues were more distant from water than others. As a result, the cellulose microfibrils bundle was suggested to produce more interior glucose residues. 37 In this work we assign the NMR shifts of the main cellulosic glucose environments found in a range of plant secondary cell wall materials and begin to determine their position in the cellulose microfibril. To achieve this, we studied both poplar wood and xylanase-treated holo-cellulose nanofibrils (hCNFs) prepared from poplar wood using minimal processing to maintain the native cellulose structure. 48 By removing hemicellulose, lignin, and any amorphous cellulose present we are left with a clearer spectrum with improved resolution which provides more certainty in the determination of the NMR shifts for each distinct glucose environment. By comparing with the recently corrected NMR shift assignments for cellulose I we conclusively show that plant cellulose is not a mixture of Iα and Iβ, and that the core of the native cellulose microfibril has NMR shifts identical to tunicate cellulose 1β. We also show that the glucose residues in domain 1 and domain 2 cannot be split into crystalline and amorphous nor can they be considered to be exclusively interior and surface. In particular, we find that one of the domain 1 glucosyl residue environments is part of a surface glucan chain. This discovery allows a more realistic estimate of the interior to surface ratio of fibrils to be obtained than the many previous estimations using 13 C NMR. Results Assigning 13 C NMR chemical shifts for the glucosyl residue environments in cellulose within native plant cell walls Throughout our work 31 , 45 , 49 , 50 never-dried native samples of many different plants have been studied using 13 C MAS NMR to understand the structure and interactions of cellulose hemicellulose, lignin and water in plant cell walls. The composition of the secondary cell wall of poplar is relatively simple with only 3 main components cellulose, xylan and lignin. Figure 1 shows the 1D CP MAS NMR spectrum of the neutral carbohydrate region, which is dominated by the cellulose signal, of poplar wood. The carbon 1 (C1) peak is at ~ 105 ppm, C2, C3, C5 overlap in the region between 70–76 ppm and the C4 and C6 carbons are split into two main peaks. The cellulose glucose environments in domain 1 show C4 1 and C6 1 peaks at ~ 89 ppm and ~ 65 ppm respectively and the C4 2 and C6 2 domain 2 peaks are at ~ 84 ppm and ~ 62 ppm respectively. Interestingly the C4 peaks in the 1D CP MAS spectrum are not simple Lorentzian line shapes indicating there are multiple glucose environments within both domains. The C4 2 region especially is clearly split into two peaks at ~ 83.5 and ~ 84.5 ppm. Fully 13 C labelled poplar wood enabled the structure of plant cellulose to be explored using a wealth of 2D NMR experiments. 2D experiments such as CP INADEQUATE and 30 ms CP PDSD were used to resolve several different glucose environments within cellulose. The CP refocussed INADEQUATE is a double quantum experiment that correlates two covalently bonded carbons. The bonded carbons appear at the same DQ shift which is given by the sum of the respective SQ shift of the correlated carbons. Figure 2 shows the neutral carbohydrate region of the CP refocussed INADEQUATE spectrum of poplar wood. The high resolution means that the carbons within each glucosyl residue can be followed through in the 2D spectrum. This is demonstrated in red for the glucose residue in domain 1 named environment ‘c’, and in black for the C4-C6 region for the residue in domain 2 environment ‘j’. Our naming convention and assignments are based on those of Wang et al. 46 for cellulose in primary cell walls since many of the cellulose environments we observe have similar values, however glucose environment j, which is clearly visible for C4, C5 and C6, is newly reported. The C6 region of the spectrum (inset in Fig. 2 ) shows the main glucose environments identified within domain 1 and domain 2. This region also shows a minor domain 2 environment, named k. In total there are three major glucose environments a, b, c in domain 1 and three f, g, j in domain 2. To make a more complete assignment of the NMR shifts of the glucose environments within cellulose, a 30 ms CP PDSD spectrum was analysed alongside the INADEQUATE spectrum. The CP PDSD experiment is a through space correlation experiment which, with a mixing time of 30 ms, shows cross peaks of carbons only if they are within the same glucose ring. SI Fig. 1 shows the 30 ms CP PDSD spectrum highlighting the key regions which are particularly useful for identifying different glucose environments. Two further minor glucose environments in domain 1 labelled d and e (which is very minor) have previously been assigned in Wang et al. 46 . Table 1 lists the 13 C NMR shifts for all the different glucose environments identified in cellulose, including the more minor ones. Table 1 NMR chemical shifts of all glucose environments assigned in cellulose of poplar wood. * The naming convention and assignments are based on those of Wang et al. 46 with additional environments. The minor glucose environments d, e and k are italicised. A comparison to Iα and Iβ cellulose NMR chemical shifts is shown in SI Table 1 . $ All 13 C NMR chemical shifts are in ppm and have an error of ± 0.1 ppm. Domain 1 Glucose Environment C1 1 C2 1 C3 1 C4 1 C5 1 C6 1 a* (Cellulose Iβ origin) 105.8 $ 71.7 74.3 89.0 72.5 65.0 b 105.2 72.6 75.5 89.1 72.6 65.2 c (Cellulose Iβ centre) 104.1 71.8 75.2 88.1 71.2 65.8 d 105.2 72.5 74.9 87.1 72.5 64.7 e 105.0 - 74.7 89.8 71.1 65.3 Domain 2 Glucose Environment C1 2 C2 2 C3 2 C4 2 C5 2 C6 2 f 105.2 72.4 74.4 84.5 75.3 62.4 g 104.9 72.4 75.3 83.5 75.2 61.4 j 105.1 - - 83.3 73.9 61.3 k - - - 83.8 74.3 63.5 Glucose environments in cellulose of cell walls of different plants Having identified all the glucose environments in cellulose of poplar wood we were interested to compare these to cellulose of other plants. Over the years we have used high resolution 2D NMR to study a wide range of different plants including monocots, eudicots, and gymnosperms. 44 , 49 , 50 The cellulose of many of the plants investigated has remarkably similar NMR shifts such that it is possible to identify our assigned glucose environments in a wide range of plants. This is illustrated using a comparison of two different regions of the 30 ms CP PDSD spectra of a eudicot (poplar), a monocot ( Brachypodium ) and a gymnosperm (spruce) in Fig. 3 for the C1-C4 region and SI Fig. 2 for the C4-C6 region. The relative amount of each glucose environment varies between samples, for example, spruce wood appears to have relatively more b versus a or c whereas in poplar wood cellulose these are more similar in quantity. Brachypodium has significantly more of site d which is a minor cellulose site in the poplar wood secondary cell walls. Whilst not visible in Fig. 3 and SI Fig. 2 , the cellulose environment e is a very minor environment in all three plants. There is one additional cellulose environment ‘s’ seen in the spectra of spruce. There are some small differences in the shifts for some of the more minor cellulose environments. For example, we found that environment d is generally broader than sites a, b, and c with its C4 NMR shift varying by ~ 0.3 ppm indicating that there are more variations in the d local environment. Generally, the NMR shifts remain similar between all the plants, indicating the glucose environments are constant in the cellulose fibrils. Xylanase-treated holo-cellulose nanofibrils (hCNFs) maintain the native plant cellulose structure of poplar wood Whilst studying the native plant cellulose in-situ is ideal for ensuring minimal disruption of the cell wall, the NMR spectrum can be crowded since signals from both hemicelluloses and lignin are present which tends to limit the resolution. The glucose environments in the fibrils may also be influenced by interactions of surface residues with hemicellulose or lignin. By removing the lignin during preparation of holo-cellulose nanofibrils (hCNFs) we can maintain the cellulose fibril structure whilst improving the resolution of the NMR spectrum. 48 To remove hemicellulose we treated hCNFs produced from the poplar wood with xylanase. TEM images of these hCNFs show long and thin cellulose fibrils which are loosely bundled, as observed with the 3nm width hCNFs prepared similarly from Arabidopsis see SI Fig. 3 . 48 To investigate whether the xylanased hCNFs have maintained the cellulose environments we compared the NMR spectra of poplar wood with those of the xylanased hCNFs. Their 1D CP MAS spectra are shown in SI Fig. 4 . There is a significant change in the total signal in the C4 region and in the ratio of domain 1 and domain 2 cellulose peaks. This change is due to the removal of both lignin and hemicelluloses as well perhaps as loss of a less ordered cellulose component. The 2D 30 ms CP PDSD comparisons of the C1-C4 region and the C1-C6 region shown in Fig. 4 and SI Fig. 5 respectively show that the domain 1 glucose environments remain almost unchanged by the production of the hCNFs from wood. The domain 2 environments show some slight NMR shift changes, typically < 0.3 ppm, in both the f and g/j environments. As the domain 2 environments are surface chains of the fibril this may be due to the removal of both lignin and the hemicellulose xylan. The domain 2 cellulose region of the 30 ms CP PDSD spectrum shown in SI Fig. 5 exhibits distinctly narrower peaks for the xylanased hCNFs, i.e. a broad contribution of somewhat less ordered environments has been removed. Since there were no other changes, we are convinced that the production of the hCNFs causes relatively minimal disturbance to the different glucose environments in the poplar fibrils. Plant cellulose fibril core environments are identical to tunicate cellulose Iβ The improvement in the resolution of NMR spectra makes analysis of the xylanased hCNFs ideal for determining NMR shifts, quantities, and the relative location of the different glucose sites within the fibrils. There have been many 1D 13 C NMR studies of cellulose Iα and Iβ, but the use of different referencing has led to a spread in the published NMR shifts. 19 , 20 , 51 , 52 Recently Brouwer et.al. have resolved these discrepancies to give a consistent set of 13 C NMR values. 21 Using the same reference as Brouwer et al. 21 we can compare our shifts from the high-resolution spectra of xylanased hCNFs from poplar wood with those of cellulose Iα and Iβ. None of the domain 2 glucose environments are close to those of the NMR shifts of glucose in these cellulose allomorphs (SI Table 1 ). This means we only need to consider the 13 C NMR shifts of domain 1 glucose residues, where there are only 3 major sites a, b and c. Figure 5 shows the C4-C3/C5 region of the CP refocussed INADEQUATE spectrum of the xylanased hCNFs with the Iα and Iβ shift positions marked. It is evident that the shifts of cellulose Iα are different from those of native plant cellulose whereas the C3, C4 and C5 shifts of a and c are very close to those of Iβ. Indeed, all the shifts for sites a and c, apart from C1, are within ~ 0.2 ppm of those of cellulose Iβ (SI Table 1 ). Since the only substantial difference we observe from the Iβ shifts is in C1 where it seems likely that the Kono et al. 19 , 20 assignment was incorrect, as highlighted in SI Fig. 6 and SI Table 1 . This misassignment by Kono et al. 20 presumably arose because a refocussed INADEQUATE spectrum was used for their assignment and both glucose environments in cellulose Iβ have nearly identical C2 NMR shifts making it difficult to distinguish the associated C1 shifts. Using the 30 ms CP PDSD spectrum (SI Fig. 6 ) together with the INADEQUATE spectrum allowed us to determine confidently the C1 shifts for environments a and c. In a later paper Brouwer and Mikolajewski also commented on the possibility that the Kono et al. C1 assignments could be swapped. 53 With this new NMR shift assignment of cellulose Iβ, all the shifts of fibril core glucose environments a and c match closely tunicate cellulose Iβ. We now assign environment a as origin chain, and environment c as centre chain, because the DFT calculations from the cellulose Iβ predict the C1 chemical shift of the residues in the origin chains is ~ 2 ppm higher than the C1 of residues in the centre chains, as seen here for environments a and c respectively. 46 Native plant cellulose is clearly not a mixture of cellulose Iα and Iβ since, despite environment b having similar shifts to one of the Iα glucose environments for both C1 and C4 (SI Table 2), there is no sign of the second Iα glucose environment which would also be visible in similar quantities within domain 1. This identification of glucose environments a and c now shows that the plant cellulose fibrils are cellulose Iβ. Having determined that two of the main domain 1 environments correspond to glucose in the classical cellulose Iβ structure, we were interested to understand the origin of the third, remaining, domain 1 environment, b. Since b is from domain 1, it has been thought also to be interior to the fibril. To investigate this further, a 2D water-edited (w.e.) 30 ms CP PDSD spectrum was acquired to probe the glucose environments that are closest to water. Figure 6 shows the C1-C4 and C1-C6 regions of the w.e. spectrum compared with a standard 30 ms CP PDSD spectrum. As expected for surface glucose residues, the signal for the domain 2 sites f, g and j is enhanced indicating that they are more water accessible than the fibril core sites a and c. SI Fig. 7 shows similar proximity to water of these three major domain 2 glucose environments. Site j may be further from water than f and g, indicating these residues may be located on different surfaces of the fibril. Figure 6 a shows that the w.e. C1-C4 peak for glucose environment b is also significantly enhanced in a similar way to f, g, and j, indicating that b is also closer to water than the core sites a or c. Glucose environment b of domain 1 is therefore, unexpectedly, in a surface chain of the fibril. Interestingly, for the C1-C6 region shown in Fig. 6 b the signal from b is still enhanced in the w.e. experiment over the signal for the core a and c environments, but to a lesser extent than for the domain 2 surface glucose environments. Being in domain 1, environment b likely reflects a glucose residue in a tg conformation. This conformation may be because the C6 is facing toward the interior of the cellulose fibril where the hydroxymethyl group hydrogen bonds with other residues. 16 Hence in the glucose residue of environment b the C6 is further from water than its C4, and further from water than the other surface environments where the C6 has a water-facing gt or gg conformation. Thus, in contrast to the widely accepted view, not all the domain 1 glucose residues are interior of the fibril. A consequence of this finding is that the ratio of domain 1 and domain 2 signals cannot be used to estimate the size of a fibril since when site b is included in domain 1 the amount of interior will be overestimated, and surface underestimated. The ratio of interior vs surface of the fibril can however be estimated from the C1 region of the INADEQUATE spectrum where sites a and c are resolved from b, and interior a + c and surface D2 + b can now be determined by integration (see SI Fig. 8 ). The value of ~ 0.5 in the poplar hCNFs is consistent with an 18-chain fibril with 6 interior chains and 12 surface chains. The relative amount of site b is ~ 0.7 of sites a or c (see SI Fig. 8 ). To investigate the relative proximities of the different glucose environments, two longer (200 ms and 400 ms) mixing time CP PDSD experiments were acquired. As these experiments are probing distances up to ~ 5–8 Å, cross-peaks are between glucose residues in different sites within individual fibrils are additionally observed. Figure 7 shows that at a mixing time of 200 ms the C1-C1 region of the CP PDSD spectrum gives clear cross peaks between a and c only. This shows that the residue environments a and c are closer to each other than either are to residue environment b. This is consistent with our finding that b is not situated with a and c in the core of the cellulose fibril. The first clear cross peaks between residues in the two cellulose domains can be seen in the C4-C4 region of the 400 ms CP PDSD (Fig. 8 a). There are cross peaks from domain 2 glucose environments to the single C4 1 peak corresponding to glucose residue environments a plus b. Given we know that environments a and c are particularly close, and there are no cross peaks to c, the cross peaks from domain 2 observed here are likely to environment b only. This proximity of b and not c to domain 2 glucose residues is also observed in the C6-C6 region of the 400 ms CP PDSD spectrum (Fig. 8 b). There are also cross peaks between the different domain 2 environments of cellulose. The proximities of b to the domain 2 environments provides further confirmation that b is one of the four major surface glucose residue environments. Discussion The structure of native plant cellulose fibrils has been the focus of many studies for decades. Although a wide range of techniques have been used, the structure remains to be fully elucidated. 30 , 54 – 56 The 13 C NMR chemical shift is very sensitive to the local environment such as differing conformation and hydrogen bonding. In principle, NMR therefore allows distinct and recurring glucose residues in the interior and surface of cellulose fibrils to be distinguishable. High resolution is needed to resolve signals from these residues, and was achieved here by studying never dried, hydrated, isolated 13 C labelled cellulose fibril samples with high field MAS NMR. We found plant cellulose is comprised of six major glucose environments, with four of these on the surface and just two in the core of the fibril. Importantly, these two core glucose residue sites have 13 C NMR shifts very similar to those of tunicate cellulose Iβ centre and origin chains and there is no detectable cellulose Iα, resolving longstanding questions about the composition of plant cellulose. The core of the plant cellulose microfibril is comprised of two types of glucose residues, which lie in cellulose Iβ origin and centre chains. The plant fibril core glucose residue NMR shifts are essentially identical to those of tunicate cellulose Iβ. This finding relied in part on the unified and consistent NMR shifts of cellulose that have recently been determined by Brouwer et al. 21 Second, we found that the C1 13 C shift of the two glucose residues of the origin and centre chains of tunicate cellulose Iβ had previously been misassigned, as was also thought possible by Brouwer and Mikolajewski. 53 Our corrected assignment of the C1 shifts is also more consistent with DFT calculations of cellulose Iβ origin and centre chain environments. 57 We therefore propose the plant fibril glucose environments a and c correspond to cellulose Iβ origin and centre chain environments respectively (Table 1 ). Our NMR shifts for the plant cellulose fibril core are slightly smaller (~ 0.2 ppm) than the Brouwer values for tunicate cellulose Iβ. 21 , 53 Due to the thinness of the plant fibrils, all interior chains are next to surface chains, in contrast to the large crystals of tunicate cellulose that were studied by Kono et al. 20 Therefore the small NMR shift difference could result from these fibril core chains having neighbouring chains, similarly stacked as cellulose Iβ, but which are surface and hence they have different water interactions and bonding. 19 – 21 , 53 Glucose residues with shifts in domain 1 (C4 around 89ppm) have long been considered to reside in the crystalline core of plant cellulose. 37 , 47 However, now we have shown that the third major glucose environment in domain 1, site b, is a residue on the surface of the fibril. We previously suggested 31 that there could be a surface component in the domain 1 region and recently this view has been supported by Addison et al. 33 , 58 Indeed, the NMR spectra of cotton of Kirui et al. have relatively less site b than found in the samples analysed here, consistent with a larger microfibril structure with lower surface to interior ratio. 34 Calculations have shown that a change in C6 hydroxymethyl conformation from tg , to gt/gg changes the C4 shift by ~ 5 ppm. 45 Adoption of the tg conformation is likely to occur where the C6 is facing towards the inside of the fibril and the orientation is fixed by hydrogen bonding to other glucosyl residues rather than water. This indicates that the glucose residue in environment b has the tg conformation and is likely to be a residue in a surface glucan chain but facing inwards towards another chain. It is now clearly more appropriate to consider domain 1 glucose residues as reflecting tg C6 hydroxymethyl confirmation rather than present only in crystalline cellulose and only in the core of fibril. The earlier assignment of all types of domain 1 glucose residues as solely interior to the cellulose fibril has led to a number of debates concerning the core structure. 9 , 34 , 37 , 46 , 47 By comparing 1D NMR spectra in the domain 1 region, it was widely thought that the plant cellulose core is a mixture of cellulose Iα and cellulose Iβ. 51 , 54 , 59 , 60 This misinterpretation arose due to the glucose in environment b having similar C1 and C4 shifts as those of one of the glucose residues of cellulose Iα. Furthermore, the 1D spectrum appears significantly different to that of cellulose Iβ because the C4 and C6 signals from core residue a and surface residue b overlap. This means that the ratio of the core cellulose Iβ environments (now known to be a and c) appears not to be the expected 1:1. The mixture of cellulose of Iα and Iβ has been hypothesised to arise as faults in the fibril structure or by alternating sheets of cellulose Iα and Iβ. 29 , 54 , 55 There have also been suggestions that the core structure could be completely distinct from that of cellulose Iβ. 34 , 37 We have now shown the fibril is solely cellulose Iβ, and so cellulose modelling, computational studies and interpretation of diffraction patterns should be based on the Iβ structure. The breadth of studies across plant and material sciences has given rise to terminology to describe cellulose which can be confusing, especially when the same terminology is applied to data acquired by different methods. For example, descriptions of cellulose as crystalline and amorphous have different meanings between NMR studies and diffraction experiments which require order over a much longer distance. 61 Historically the glucose residues observed in the two domains of NMR spectra were assigned as crystalline and amorphous cellulose because the characterised crystalline forms of cellulose have 13 C shifts in the domain 1 (C4 ~ 89 ppm) region. 38 , 62 , 63 Surface chains are more mobile and are sometimes described as paracrystalline or amorphous. 25 In our work the line widths of the signals from the six glucose residue environments within the surface and core of the fibril are very similar (such as in the C6 region of the CP INADEQUATE spectrum of the xylanase treated hCNFs shown in SI Fig. 9 summarised in SI Table 2). NMR chemical shifts are sensitive only to short range structure ( < ~ 5 A), and so the similar linewidths indicate that both domains have similar short-range order. Therefore, the core and surface glucose residues cannot be divided simply into crystalline and amorphous cellulose. On the other hand, a comparison of the 1D CPMAS spectrum of poplar wood versus the xylanase-treated hCNF sample in Fig. SI4 shows that a weak (< 10%) broad signal underlying the domain 2 region, which may arise from some less ordered material, is removed in the production of the hCNF sample. This less ordered component includes lignin and xylan but also could be related to a less crystalline portion of cellulose that is known to be easily hydrolysed. 64 Therefore, whilst there might be a small component of less ordered cellulose giving rise to signal in the domain 2 region this term cannot describe all these glucose residues. The ratio of the C4 1 and C4 2 signals from 1D NMR spectra was initially called the crystallinity index (CI). The CI has been used to describe the crystallinity of cellulose and is regularly calculated alongside crystallinity measures by diffraction techniques. 38 , 62 This measure of crystallinity by NMR and diffraction has usually been significantly different, although the general trends between samples tend to be consistent. 38 The traditional CI as measured by NMR is a misnomer and does not measure crystallinity. It will be influenced by the surface to core ratio, as well as the presence of any amorphous material. The better estimates of glucose residue environment proportions provide new constraints on plant cellulose fibril size and habit. Previous studies have used the ratio of C4 1 and C4 2 from 1D NMR spectra as a measure of microfibril size, which has resulted in variable estimations such as 24 chains in a fibril or in fibril bundling models. 9 , 37 As we now know domain 1 is not solely interior sites, thus the ratio of C4 1 and C4 2 will give an overestimate of the interior of the fibril and an overestimate of the fibril size. The ratio of the two domains could still be utilised for characterisation as it is still loosely related to the volume of the fibril, but care is needed in using this interpretation. In this work, we have provided a more accurate measure of the interior to surface ratio for the microfibril of poplar wood which was found to be ~ 0.5. We also provide new constraints on the proportions of the environments in the fibril, namely a and c origin and centre chains are similar in proportion, and surface environment b is ~ 0.7 of the amount of a or c chain environments (since the number of b is likely an integer, this ratio corresponds to 2/3 or 3/4). Given the further constraints on the fibril from the dimensions from biophysical measurements that limit the size to 18–24 chains, there are very few habits that fulfil these criteria. The core to surface value is consistent, for example, with an 18-chain fibril only in a habit that has 6 interior chains and 12 surface chains, or a 24 chain fibril with 8 interior and 12 surface chains. A schematic diagram of two potential 18 chain habits 234432 and 34443 that have the observed interior to surface ratio is shown in Fig. 9 , and an alternative 333333 habit can be excluded. These habits fulfil our updated estimate of interior to core chain ratio, but it is difficult to see how the 34443 could have similar amounts of a and c origin and centre sheets. Currently therefore, 234432 is the habit currently that best fits the data. The previously proposed concepts of multi-layered fibril environments 60 or bundling 37 to generate additional interior chains are not required to explain the data. Whilst we have measured the relative amount of surface site b compared to core chain sites a and c for isolated fibrils from poplar wood, these values are likely to vary with any changes in the microfibril size, habit, and hemicellulose interactions in different plants. Hence the relative proportions of the six major glucose residue environments will be an important diagnostic tool for discovering cellulose variability, studying cellulose interactions, and for detecting changes during degradation or industrial processing of wood and other biomass. This work now provides a strong basis for understanding cellulose fibril surfaces as we discovered there are only four major types of surface residues. Understanding the origin of these NMR environments will be very instrumental in defining the hydrophobic and hydrophilic surfaces of fibrils. The exciting prospect now arises of a more complete description and understanding of cellulose fibrils surfaces and their interactions in biomaterials and in the plant cell wall. Methods Sample Production and Preparation The poplar stem Populus tremula × tremuloides used in this work was grown from poplar shoot cuttings which were allowed to root for 2 weeks before being transferred into the growth chamber. Here the saplings were grown in a 13 CO 2 atmosphere 31 , 65 for ~ 3 months to provide ~ 97% 13 C labelled material that is predominantly secondary cell wall. Only wood of lower (woody) never-dried stems was used after removing bark and cambium. The growth of spruce and Brachypodium was as described in Terrett et al. and Tryfona et al. respectively. 44 , 49 To prepare holocellulose nanofibrils (hCNFs), the wood was first delignified four times with 3% peracetic acid (PAA; 0.35 g pure PAA/g dry material, pH 4.8) at 85°C for 45 minutes without stirring. Between cycles, used PAA was decanted, one water wash was done, and new 3% PAA was added. Delignified material (holocellulose) was washed with water after the fourth cycle until the conductivity was below 10 S/cm. The holocellulose was blended for two minutes to form a homogenous slurry. Holocellulose dispersion (0.1 wt%) was blended for 30 minutes in a Vitamix A3500i blender to make a polydispersion. hCNFs were recovered as supernatant from the polydispersion by centrifugation at 5000 rpm for 15 minutes. The hCNFs suspensions were digested with xylanases as described previously. 48 The xylanase treated hCNFs were then freeze-dried and rewetted before being packed directly into the NMR rotor. The never dried 13 C enriched poplar wood was debarked and cut into small pieces of ~ 1–2 mm size with a razor blade and then packed into a 3.2 mm MAS zirconia NMR rotor whilst removing excess water. TEM A ThermoFisher Scientific (FEI) TalosF200X G2 microscope operating in the scanning mode at 200 kV was used to obtain the TEM images. Solid-state NMR Solid-state NMR experiments of poplar wood and the xylanase treated hCNFs were acquired on a Bruker 1GHz AVANCE NEO solid-state NMR spectrometer operating at 1 H and 13 C Larmor frequencies of 1000.4 and 251.6 MHz respectively using a 3.2 mm E Free triple resonance MAS probe. All experiments were conducted at an indicated temperature of 10°C and an MAS frequency of 12.5 kHz with a recycle delay of 2 s. The 13 C chemical shifts were determined using the carbonyl peak at 177.8 ppm of L-alanine as an external reference with respect to tetramethylsilane. This referencing was confirmed to correspond with referencing to the adamantane CH 2 peak to 38.48 ppm 66 to ensure direct comparison with Brouwer and Mikolajewski. 21 , 53 The 90° pulse lengths were typically 3.2 µs ( 1 H) and 4.0 µs ( 13 C). Cross polarization from 1 H to 13 C was achieved using ramped (70–100%) 1 H radiofrequency amplitude and a contact time of 1 ms 67 . SPINAL-64 decoupling was applied at a 1 H nutation frequency of 70–80 kHz during acquisition. 68 Sign discrimination in the indirect dimension of the 2D experiments was achieved using the States-TPPI method. For the principal assignments of cellulose environments, a CP refocused INADEQUATE 13 C double-quantum (DQ) 13 C single-quantum (SQ) correlation experiment was used whereby the evolution of DQ coherence for directly bonded carbons within the same glucan ring is correlated with directly observed SQ coherence. 69 , 70 The acquisition time in the indirect dimension was 6.67 ms with a spectral width of 37.5 kHz and 128 acquisitions per t 1 FID. The echo duration, τ, was 2.24 ms giving a total echo time of 8.96 ms. Both intra- and intermolecular contacts were probed using 2D 13 C- 13 C proton driven spin diffusion (PDSD) experiments with mixing times of 30 to 400 ms. 71 The acquisition time in the indirect dimension ( t 1 ) of the CP PDSD experiments was 5.5–8.1 ms. The spectral width in the indirect dimension was 37.5 kHz with at least 64 acquisitions per t 1 FID. The proximity of water to different cellulose environments was probed using a water-edited 13 C- 13 C 30 ms CP PDSD experiment. This is based on the normal CP PDSD experiment, however before the CP contact time there is a 1 H T 2 filter to remove signal arising from directly attached protons followed by a delay to allow for diffusion from the water protons to its near neighbours. 72 The CP parameters were as stated above. The acquisition time in the indirect dimension was 4.8 ms with a spectral width of 37.5 kHz and 576 acquisitions per t 1 FID. The total 1 H T 2 filter was 320 µs followed by a diffusion delay of 2 ms. All 2D spectra were processed with Fourier transformation into 8 K (F 2 ) × 2 K (F 1 ) points with exponential line broadening of 20–50 Hz in F 2 and cubed sine bell processing in F 1 using Bruker Topspin v.3.6. Contour levels are x 1.1 throughout. The minimum contour is chosen to show the desired features. Declarations Data availability Unprocessed NMR data files will be available from http://wrap.warwick.ac.uk Author contributions RC and RD conducted the NMR experiments and analysis. PKD prepared the hCNFs. YY generated models of cellulose fibril habits for conceptual analysis of glucose environments. PD and RD conceived and supervised the work. SJB contributed to experimental method development and interpretation. RC, RD and PD wrote the paper with comments from all authors. Acknowledgements Many thanks to Eva Hellmann from the Sainsbury lab at University of Cambridge for donating the poplar saplings that were used to grow the 13 C material used in this work. Many thanks to Theodora Tryfona for providing the mature leaves of Brachypodium as well as Alberto Echevarria Poza for all the help with growing and harvesting the 13 C labelled plants from the growth chamber, and Trent Franks for technical assistance. We wish to thank Mike Jarvis for giving us the 13 C NMR chemical shifts of his annealed flax and celery and for a draft version of his review 42 .. The UK High-Field Solid-State NMR Facility used in this research was funded by EPSRC and BBSRC (EP/T015063/1), as well as, for the 1 GHz instrument, EP/R029946/1. The work of RC, RD and PD was supported by the UKRI grant underwriting the ERC advanced grant EVOCATE, Function and evolution of plant cell wall architecture for sustainable technologies EP/X027120/1. PKD was supported by EPSRC grant Bio-derived and Bio-inspired Advanced Materials for Sustainable Industries (VALUED), EP/W031019/1. The work of YY generating models of cellulose fibril habits for conceptual analysis of glucose environments was supported as part of The Center for Lignocellulose Structure and Formation, an Energy Frontier Research Center funded by the US Department of Energy, Office of Science, Basic Energy Sciences, under Award number DE-SC0001090. Competing interests. The authors declare no competing interests References Field CB, Behrenfeld MJ, Randerson JT, Falkowski P (1998) Primary Production of the Biosphere: Integrating Terrestrial and Oceanic Components. Sci (1979) 281:237–240 Gibson LJ (2012) The hierarchical structure and mechanics of plant materials. J R Soc Interface 9:2749–2766 Pauly M, Keegstra K (2008) Cell-wall carbohydrates and their modification as a resource for biofuels. Plant J 54:559–568 Foroughi F, Ghomi ER, Dehaghi FM, Borayek R, Ramakrishna S (2021) A review on the life cycle assessment of cellulose: From properties to the potential of making it a low carbon material. Materials 14:714 Iglesias MC, Gomez-Maldonado D, Via BK, Jiang Z, Peresin MS (2020) Pulping processes and their effects on cellulose fibers and nanofibrillated cellulose properties: A review. Prod J 70:10–21 Klemm D, Heublein B, Fink HP, Bohn A, Cellulose (2005) Fascinating biopolymer and sustainable raw material. Angewandte Chemie - Int Ed 44:3358–3393 Parthasarathi R et al (2011) Insights into hydrogen bonding and stacking interactions in cellulose. J Phys Chem A 115:14191–14202 Kubicki JD et al (2018) The Shape of Native Plant Cellulose Microfibrils. Sci Rep 8:13983 Tai HC et al (2023) Wood cellulose microfibrils have a 24-chain core–shell nanostructure in seed plants. Nat Plants 9:1154–1168 Song B, Zhao S, Shen W, Collings C, Ding SY (2020) Direct Measurement of Plant Cellulose Microfibril and Bundles in Native Cell Walls. Front Plant Sci 11:479 Penttilä PA, Paajanen A (2024) Critical comment on the assumptions leading to 24-chain microfibrils in wood. Nat Plants 10:1064–1066 Jarvis MC (2013) Cellulose biosynthesis: Counting the chains. Plant Physiol 163:1485–1486 Purushotham P et al (2016) A single heterologously expressed plant cellulose synthase isoform is sufficient for cellulose microfibril formation in vitro. PNAS 113:11360–11365 Nixon BT et al (2016) Comparative Structural and Computational Analysis Supports Eighteen Cellulose Synthases in the Plant Cellulose Synthesis Complex. Sci Rep 6:28696 Cosgrove DJ et al (2024) How many glucan chains form plant cellulose microfibrils? Biomacromolecules in press Yang H, Kubicki JD (2020) A density functional theory study on the shape of the primary cellulose microfibril in plants: effects of C6 exocyclic group conformation and H-bonding. Cellulose 27:2389–2402 Nishiyama Y, Langan P, Chanzy H (2002) Crystal structure and hydrogen-bonding system in cellulose Iβ from synchrotron X-ray and neutron fiber diffraction. J Am Chem Soc 124:9074–9082 Nishiyama Y, Sugiyama J, Chanzy H, Langan P (2003) Crystal Structure and Hydrogen Bonding System in Cellulose Iα from Synchrotron X-ray and Neutron Fiber Diffraction. J Am Chem Soc 125:14300–14306 Kono H et al (2002) CP/MAS 13 C NMR study of cellulose and cellulose derivatives. 1. Complete assignment of the CP/MAS 13 C NMR spectrum of the native cellulose. J Am Chem Soc 124:7506–7511 Kono H, Erata T, Takai M (2003) Determination of the through-bond carbon-carbon and carbon-proton connectivities of the native celluloses in the solid state. Macromolecules 36:5131–5138 Brouwer DH, Mikolajewski JG (2023) Resolving the discrepancies in reported 13 C solid state NMR chemical shifts for native celluloses. Cellulose 30:4827–4839 Thomas LH et al (2015) Diffraction evidence for the structure of cellulose microfibrils in bamboo, a model for grass and cereal celluloses. BMC Plant Biol 15:158 Newman RH, Hill SJ, Harris PJ (2013) Wide-angle x-ray scattering and solid-state nuclear magnetic resonance data combined to test models for cellulose microfibrils in mung bean cell walls. Plant Physiol 163:1558–1567 Šturcova A, His I, Apperley DC, Sugiyama J, Jarvis MC (2004) Structural details of crystalline cellulose from higher plants. Biomacromolecules 5:1333–1339 Thomas LH, Altaner CM, Jarvis MC (2013) Identifying multiple forms of lateral disorder in cellulose fibres. J Appl Crystallogr 46:972–979 Thomas LH et al (2014) Structure and spacing of cellulose microfibrils in woody cell walls of dicots. Cellulose 21:3887–3895 Deslandes Y, Hamer GK, Marchessault RH (1983) Application of High-Resolution Solid-State NMR with Cross-Polarization/ Magic-Angle Spinning (CP/MAS) Techniques to Cellulose Chemistry. J Macromolecular Sci Part C 23:187–216 Atalla RH, Vanderhart DL (1984) Native Cellulose: A Composite of Two Distinct Crystalline Forms. Science 223:283–285 Newman RH (1998) Evidence for Assignment of 13 C NMR Signals to Cellulose Crystallite Surfaces in Wood, Pulp and Isolated Celluloses. Holzforschung 52:157–159 Newman RH (1999) Estimation of the lateral dimensions of cellulose crystallites using 13 C NMR signal strengths. Solid State Nucl Magn Reson 15:21–29 Simmons TJ et al (2016) Folding of xylan onto cellulose fibrils in plant cell walls revealed by solid-state NMR. Nat Commun 7:13902 Dupree R et al (2015) Probing the molecular architecture of Arabidopsis thaliana secondary cell walls using two- and three-dimensional 13 C solid state nuclear magnetic resonance spectroscopy. Biochemistry 54:2335–2345 Addison B et al (2024) Atomistic, macromolecular model of the Populus secondary cell wall informed by solid-state NMR. Sci Adv 10:7965 Kirui A et al (2019) Atomic resolution of cotton cellulose structure enabled by dynamic nuclear polarization solid-state NMR. Cellulose 26:329–339 Viëtor RJ, Newman RH, Ha MA, Apperley DC, Jarvis MC (2002) Conformational features of crystal-surface cellulose from higher plants. Plant J 30:721–731 Teeyaer RE, Zhbankov RG, Shipina OT, Marchenko GN, Zabelin LV (1986) NMR Spectroscopy of Cellulose and its Derivatives with Magic Angle Spinning of the sample (Review). Zh Prikl Spektrosk 45:535–554 Kirui A et al (2022) Carbohydrate-aromatic interface and molecular architecture of lignocellulose. Nat Commun 13:538 Park S, Baker JO, Himmel ME, Parilla PA, Johnson DK (2010) Cellulose crystallinity index: measurement techniques and their impact on interpreting cellulase performance. Biotechnol Biofuels 3:10 Fernandes AN et al (2011) Nanostructure of cellulose microfibrils in spruce wood. PNAS 108:1195–1203 Phyo P, Wang T, Yang Y, O’Neill H, Hong M (2018) Direct Determination of Hydroxymethyl Conformations of Plant Cell Wall Cellulose Using 1 H Polarization Transfer Solid-State NMR. Biomacromolecules 19, 1485–1497 Yang H et al (2018) Structural factors affecting 13 C NMR chemical shifts of cellulose: a computational study. Cellulose 25:23–36 Jarvis MC (2023) NMR Studies on Cellulose and Plant Cell walls. Plant Cell Walls Research Milestones and Conceptual Insights, vol 1. CRC Wang T, Zabotina O, Hong M (2012) Pectin-cellulose interactions in the arabidopsis primary cell wall from two-dimensional magic-angle-spinning solid-state nuclear magnetic resonance. Biochemistry 51:9846–9856 Terrett OM et al (2019) Molecular architecture of softwood revealed by solid-state NMR. Nat Commun 10:4978 Cresswell R et al (2021) Importance of Water in Maintaining Softwood Secondary Cell Wall Nanostructure. Biomacromolecules 22:4669–4680 Wang T, Yang H, Kubicki JD, Hong M (2016) Cellulose Structural Polymorphism in Plant Primary Cell Walls Investigated by High-Field 2D Solid-State NMR Spectroscopy and Density Functional Theory Calculations. Biomacromolecules 17:2210–2222 Kang X et al (2019) Lignin-polysaccharide interactions in plant secondary cell walls revealed by solid-state NMR. Nat Commun 10:347 Deralia PK et al (2024) Preserving native cellulose structure in isolated holocellulose nanofibrils and controlling their properties by tuning xylan structure for biomaterials. In preparation Tryfona T et al (2024) Altering the substitution and cross-linking of glucuronoarabinoxylans affects cell wall architecture in Brachypodium distachyon. New Phytol 242:524–543 Bourdon M et al (2023) Ectopic callose deposition into woody biomass modulates the nano-architecture of macrofibrils. Nat Plants 9:1530–1546 Yamamoto H, Horii F (1993) CP/MAS 13 C NMR Analysis of the Crystal Transformation Induced for Valonia Cellulose by Annealing at High Temperatures. Macromolecules 26:1313–1317 Larsson T, Hult E-L, Wickholm K, Pettersson E, Iversen T (1999) CPMAS 13 C-NMR spectroscopy applied to structure and interaction studies on cellulose I. Solid State Nucl Magn Reson 15:31–40 Brouwer DH, Mikolajewski JG (2023) Solid-state NMR reveals a structural variation from the Iβ polymorphic form in bacterial cellulose. Cellulose 30:11341–11356 Atalla RH, Vanderhart DL (1999) The role of solid state 13 C NMR spectroscopy in studies of the nature of native celluloses. Solid State Nucl Magn Reson 15:1–19 Driemeier C, Francisco LH (2014) X-ray diffraction from faulted cellulose I constructed with mixed Iα-Iβ stacking. Cellulose 21:3161–3169 Hanley SJ, Revol J-FËO, Godbout L, Gray DG (1997) Atomic force microscopy and transmission electron microscopy of cellulose from Micrasterias denticulata; evidence for a chiral helical microfibril twist. Cellulose 4:209–220 Rehman Z et al (2024) Organic NMR Crystallography: Enabling Progress for Applications to Pharmaceuticals and Plant Cell Walls. Faraday Discuss Accepted Manuscr Xue Y, Li H, Kang X (2024) Molecular unraveling of polysaccharide digestion in wood-feeding termites: A solid-state NMR perspective. Carbohydr Polym 331:121843 Smith BG, Harris PJ, Melton LD, Newman RH (1998) Crystalline Cellulose in Hydrated Primary Cell Walls of Three Monocotyledons and One Dicotyledon. Plant Cell Physiol 39:711–720 Larsson T, Wickholm K, Iversen T (1997) A CP/MAS 13 C NMR Investigation of Molecular Ordering in Celluloses Carbohydrate Research . 302:19–25 Bryce DL (2017) NMR crystallography: Structure and properties of materials from solid-state nuclear magnetic resonance observables. IUCrJ 4:350–359 Park S, Johnson DK, Ishizawa CI, Parilla PA, Davis MF (2009) Measuring the crystallinity index of cellulose by solid state 13 C nuclear magnetic resonance. Cellulose 16:641–647 Wickholm K, Larsson T, Iversen T (1998) Assignment of non-crystalline forms in cellulose I by CP/MAS 13 C NMR spectroscopy. Carbohydr Res 312:123–129 Chang JKW, Duret X, Berberi V, Zahedi-Niaki H, Lavoie JM (2018) Two-step thermochemical cellulose hydrolysis with partial neutralization for glucose production. Front Chem 6:117 Chen WP et al (2011) An automated growth enclosure for metabolic labeling of Arabidopsis thaliana with 13 C-carbon dioxide - an in vivo labeling system for proteomics and metabolomics research. Proteome Sci 9:9 Morcombe CR, Zilm KW (2003) Chemical shift referencing in MAS solid state NMR. J Magn Reson 162:479–486 Gunther M, Xiaoling W (1994) Smith Steven O. Ramped-Amplitude Cross polarization in Magic-angle-spinning NMR. J Magn Reson A 110:219–227 Fung BM, Khitrin AK, Ermolaev K (2000) An Improved Broadband Decoupling Sequence for Liquid Crystals and Solids. J Magn Reson 142:97–101 Fayon F et al (2005) Through-space contributions to two-dimensional double-quantum J correlation NMR spectra of magic-angle-spinning solids. J Chem Phys 122:194313 Lesage A, Bardet M, Emsley L (1999) Through-bond carbon-carbon connectivities in disordered solids by NMR. J Am Chem Soc 121:10987–10993 Manolikas T, Herrmann T, Meier BH (2008) Protein structure determination from 13 C spin-diffusion solid-state NMR spectroscopy. J Am Chem Soc 130:3959–3966 Ader C et al (2009) Structural rearrangements of membrane proteins probed by water-edited solid-state NMR spectroscopy. J Am Chem Soc 131:170–176 Additional Declarations There is NO Competing Interest. Supplementary Files SICellulosepaperCresswelletal.Finaldocx.docx Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {\"props\":{\"pageProps\":{\"initialData\":{\"identity\":\"rs-4970084\",\"acceptedTermsAndConditions\":true,\"allowDirectSubmit\":true,\"archivedVersions\":[],\"articleType\":\"Article\",\"associatedPublications\":[],\"authors\":[{\"id\":346792339,\"identity\":\"05d31ea1-5300-4729-853f-a8cd665fccc9\",\"order_by\":0,\"name\":\"Paul Dupree\",\"email\":\"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAsUlEQVRIiWNgGAWjYFCCxIYDCRVwXgIxWpIbHzw4Q5qW9GbDh22kaDFnT2yTSJxnJ6/bwPzwA2NbGmEtlj0PgVq2JRtuO8BmLMHYlkNYi8ENkC3bDjBuO8BgxsDYVkGsljkH7LcdYP9GtJZmA2A4Ay3iAdlChMOAfml8kHAsOXnbYZ5iiYRzRHjfnD39wcEfNXa22463b/zwoSyZCIfBWcwMREakAWElo2AUjIJRMOIBAJ0TPgEpJxmZAAAAAElFTkSuQmCC\",\"orcid\":\"https://orcid.org/0000-0001-9270-6286\",\"institution\":\"University of Cambridge\",\"correspondingAuthor\":true,\"prefix\":\"\",\"firstName\":\"Paul\",\"middleName\":\"\",\"lastName\":\"Dupree\",\"suffix\":\"\"},{\"id\":346792340,\"identity\":\"16c4b266-7f67-4988-bc51-bdf3cc3299df\",\"order_by\":1,\"name\":\"Rosalie Cresswell\",\"email\":\"\",\"orcid\":\"\",\"institution\":\"University of Warwick\",\"correspondingAuthor\":false,\"prefix\":\"\",\"firstName\":\"Rosalie\",\"middleName\":\"\",\"lastName\":\"Cresswell\",\"suffix\":\"\"},{\"id\":346792341,\"identity\":\"63109626-667c-4961-abff-aa24d5bd1407\",\"order_by\":2,\"name\":\"Parveen Deralia\",\"email\":\"\",\"orcid\":\"https://orcid.org/0000-0002-1014-5524\",\"institution\":\"University of Cambridge\",\"correspondingAuthor\":false,\"prefix\":\"\",\"firstName\":\"Parveen\",\"middleName\":\"\",\"lastName\":\"Deralia\",\"suffix\":\"\"},{\"id\":346792342,\"identity\":\"a1918649-c15c-45cf-bfa4-6df6e7d1aab8\",\"order_by\":3,\"name\":\"Yoshihisa Yoshimi\",\"email\":\"\",\"orcid\":\"\",\"institution\":\"University of Cambridge\",\"correspondingAuthor\":false,\"prefix\":\"\",\"firstName\":\"Yoshihisa\",\"middleName\":\"\",\"lastName\":\"Yoshimi\",\"suffix\":\"\"},{\"id\":346792343,\"identity\":\"ca525497-1dd3-40f6-bf75-187b4b172d41\",\"order_by\":4,\"name\":\"Ray Dupree\",\"email\":\"\",\"orcid\":\"https://orcid.org/0000-0002-3334-0429\",\"institution\":\"University of Warwick\",\"correspondingAuthor\":false,\"prefix\":\"\",\"firstName\":\"Ray\",\"middleName\":\"\",\"lastName\":\"Dupree\",\"suffix\":\"\"},{\"id\":346792344,\"identity\":\"041c3344-21bf-4127-9e2b-c0d9e5a6bb13\",\"order_by\":5,\"name\":\"Steven Brown\",\"email\":\"\",\"orcid\":\"https://orcid.org/0000-0003-2069-8496\",\"institution\":\"University of Warwick\",\"correspondingAuthor\":false,\"prefix\":\"\",\"firstName\":\"Steven\",\"middleName\":\"\",\"lastName\":\"Brown\",\"suffix\":\"\"}],\"badges\":[],\"createdAt\":\"2024-08-24 16:50:07\",\"currentVersionCode\":1,\"declarations\":\"\",\"doi\":\"10.21203/rs.3.rs-4970084/v1\",\"doiUrl\":\"https://doi.org/10.21203/rs.3.rs-4970084/v1\",\"draftVersion\":[],\"editorialEvents\":[],\"editorialNote\":\"\",\"failedWorkflow\":false,\"files\":[{\"id\":63595351,\"identity\":\"c7691682-cedd-46b4-9502-4cfb97cf67a3\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:27\",\"extension\":\"png\",\"order_by\":1,\"title\":\"Figure 1\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":51652,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eNeutral carbohydrate region of a \\u003c/strong\\u003e\\u003csup\\u003e\\u003cstrong\\u003e13\\u003c/strong\\u003e\\u003c/sup\\u003e\\u003cstrong\\u003eC 1D CP MAS NMR spectrum of poplar wood. \\u003c/strong\\u003eThe spectrum shows the assignments of cellulose (blue), xylan (orange), lignin (green) and pectin (purple). The glucose C4 and C6 peaks in cellulose are split into two regions named domain 1 (D1) and domain 2 (D2). The spectrum was recorded at a \\u003csup\\u003e13\\u003c/sup\\u003eC Larmor frequency of 251.4 MHz and a MAS frequency of 12.5 kHz.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image1.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/ac203043fb626de0d0481934.png\"},{\"id\":63595357,\"identity\":\"53739878-ff37-4eac-973f-ea779569931d\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:28\",\"extension\":\"png\",\"order_by\":2,\"title\":\"Figure 2\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":242989,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eA \\u003c/strong\\u003e\\u003csup\\u003e\\u003cstrong\\u003e13\\u003c/strong\\u003e\\u003c/sup\\u003e\\u003cstrong\\u003eC 2D CP refocussed INADEQUATE MAS NMR spectrum of never-dried poplar wood showing multiple glucose environments in both domains 1 and 2\\u003c/strong\\u003e The path of \\u003csup\\u003e13\\u003c/sup\\u003eC carbon chemical shifts for cellulose glucose residue environments c (red) and j (black) are shown as example assignments.\\u0026nbsp; \\u003cstrong\\u003eInset (top left): \\u003c/strong\\u003eThe C6 region with the major glucose environments identified in poplar wood. The spectrum was as recorded in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image2.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/dd0fab07d53e58dcdd95b60e.png\"},{\"id\":63596006,\"identity\":\"f140b9c6-d7f1-4f25-a268-c6b58ae3012d\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:08:27\",\"extension\":\"png\",\"order_by\":3,\"title\":\"Figure 3\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":179102,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eGlucose environments in cellulose are common across different plants\\u003c/strong\\u003e. A comparison of the C1-C4 region of 30 ms \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP\\u003csup\\u003e \\u003c/sup\\u003ePDSD MAS NMR spectra of poplar wood (a eudicot), \\u003cem\\u003eBrachypodium\\u003c/em\\u003e mature leaves (a monocot) and spruce wood (a gymnosperm). Common glucose environments are labelled, noting that, whilst the relative amounts vary, the cellulose environments are common throughout. The spectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image3.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/9e77ba49f8429a1ccd0e978e.png\"},{\"id\":63595349,\"identity\":\"087f9674-f6af-4f83-a17c-60bbd4b2837d\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:27\",\"extension\":\"png\",\"order_by\":4,\"title\":\"Figure 4\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":150665,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eSCW cellulose of poplar wood maintains its structure in the fibrillation process to form hCNFs. \\u003c/strong\\u003eComparison of the cellulose C1-C4 domain 1 region of 30 ms \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP PDSD MAS NMR spectra of poplar wood (green) and xylanase treated hCNFs of poplar wood (blue). Spectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image4.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/2057bbc3c114fd762f23754d.png\"},{\"id\":63595352,\"identity\":\"e49eebd0-9aca-4486-b53a-64a632239122\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:27\",\"extension\":\"png\",\"order_by\":5,\"title\":\"Figure 5\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":222342,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003ePlant secondary cell wall cellulose has a cellulose Iβ component but no cellulose Iα component. \\u003c/strong\\u003eA comparison of the domain 1 C4-C3/C5 region of the \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP refocussed INADEQUATE MAS NMR spectrum of xylanase treated hCNFs of poplar wood with the \\u003csup\\u003e13\\u003c/sup\\u003eC chemical shifts of cellulose Iα and Iβ, as given by Brouwer and Mikolajewski\\u003csup\\u003e21\\u003c/sup\\u003e (see SI Table 1). The filled and empty circles correspond to two distinct \\u003csup\\u003e13\\u003c/sup\\u003eC chemical shifts in the asymmetric unit cells of cellulose Iα and Iβ.\\u003csup\\u003e17,18\\u003c/sup\\u003e The Iβ cellulose positions are within 0.2 ppm in the SQ dimension and 0.4 ppm in the DQ dimension of the native plant cellulose chemical shifts.\\u0026nbsp; The Iα cellulose positions do not match the \\u003csup\\u003e13\\u003c/sup\\u003eC NMR chemical shifts of any glucose environment acquired from the native never dried plant cell wall. Spectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image5.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/d826206b65aaae18ae0772a6.png\"},{\"id\":63595353,\"identity\":\"bf90f62c-2a39-42aa-8f41-2180e1cca865\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:28\",\"extension\":\"png\",\"order_by\":6,\"title\":\"Figure 6\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":131188,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eDomain 1 glucose environments in cellulose are not solely interior chains. \\u003c/strong\\u003eComparison of the standard (blue) and water edited (brown) 30 ms \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP PDSD MAS NMR spectra of xylanase treated hCNFs. a) the C1-C4 region, b) the C1-C6 region. Spectra were normalised to glucose environment a, and spectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image6.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/daf5e4108c771413bf952478.png\"},{\"id\":63595356,\"identity\":\"4403f1e9-e0bb-49d7-835a-6c6190b681e6\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:28\",\"extension\":\"png\",\"order_by\":7,\"title\":\"Figure 7\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":130030,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eDomain 1 glucose environments a and c are in close proximity. \\u003c/strong\\u003eThe C1-C1 region of a 200 ms \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP PDSD MAS NMR spectrum of xylanase treated hCNFs of poplar wood shows cross-peaks between a and c. Spectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image7.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/4e0e0062adc19fc804369f9b.png\"},{\"id\":63595354,\"identity\":\"6819ca99-0479-4f8d-a584-29c4b5debab9\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:28\",\"extension\":\"png\",\"order_by\":8,\"title\":\"Figure 8\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":326602,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eDomain 1 glucose environment b and the domain 2 glucose environments are in close proximity. \\u003c/strong\\u003e400 ms \\u003csup\\u003e13\\u003c/sup\\u003eC 2D CP PDSD MAS NMR spectrum of xylanase treated hCNFs of poplar wood. a) the C4-C4 region,\\u003cstrong\\u003e \\u003c/strong\\u003eb) the C6-C6 region.\\u003cstrong\\u003e \\u003c/strong\\u003eSpectra were recorded as in figure 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image8.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/e6e8f548e9c687a68404e7c7.png\"},{\"id\":63596007,\"identity\":\"182193fd-b96d-469a-95cb-d818eda52e98\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:08:28\",\"extension\":\"png\",\"order_by\":9,\"title\":\"Figure 9\",\"display\":\"\",\"copyAsset\":false,\"role\":\"figure\",\"size\":124216,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cstrong\\u003eSchematic diagram of two potential 18 chain cellulose Iβ\\u003c/strong\\u003e \\u003cstrong\\u003emicrofibrils, with 234432 and 34443 habits\\u003c/strong\\u003e. The fibril core and the surface environments are shown in blue and green respectively. Although both habits fulfil the observation of core:surface ratio 0.5, but only the 234432 habit would have a ratio of a (origin): c (centre) chain of 1.\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"image9.png\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/3705445585fb7e8ca9e27b6a.png\"},{\"id\":65978287,\"identity\":\"56443ad2-c8f9-4768-b589-63bdc395ab93\",\"added_by\":\"auto\",\"created_at\":\"2024-10-05 11:48:19\",\"extension\":\"pdf\",\"order_by\":0,\"title\":\"\",\"display\":\"\",\"copyAsset\":false,\"role\":\"manuscript-pdf\",\"size\":2095160,\"visible\":true,\"origin\":\"\",\"legend\":\"\",\"description\":\"\",\"filename\":\"manuscript.pdf\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/5f39532a-1cc7-492a-8372-97ce541b1df8.pdf\"},{\"id\":63595359,\"identity\":\"1c731ce1-ff8c-446e-b6e4-7bdbf44f0ce0\",\"added_by\":\"auto\",\"created_at\":\"2024-08-30 04:00:28\",\"extension\":\"docx\",\"order_by\":1,\"title\":\"\",\"display\":\"\",\"copyAsset\":false,\"role\":\"supplement\",\"size\":10240896,\"visible\":true,\"origin\":\"\",\"legend\":\"\\u003cp\\u003e\\u003cbr\\u003e\\u003c/p\\u003e\",\"description\":\"\",\"filename\":\"SICellulosepaperCresswelletal.Finaldocx.docx\",\"url\":\"https://assets-eu.researchsquare.com/files/rs-4970084/v1/339e5da203fe80bc20ca8ac7.docx\"}],\"financialInterests\":\"There is \\u003cb\\u003eNO\\u003c/b\\u003e Competing Interest.\",\"formattedTitle\":\"New insights into the structure of cellulose in plant cell walls\",\"fulltext\":[{\"header\":\"Introduction\",\"content\":\"\\u003cp\\u003eCellulose is the most abundant natural polymer in the world.\\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003e It forms a major component in the cell walls of plants providing much of their mechanical strength.\\u003csup\\u003e\\u003cspan citationid=\\\"CR2\\\" class=\\\"CitationRef\\\"\\u003e2\\u003c/span\\u003e\\u003c/sup\\u003e Its properties are important for the pulp and lumber industries and more recently cellulose has become a major contender as a sustainable alternative to fossil fuels for producing materials, films and biofuels.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR4\\\" citationid=\\\"CR3\\\" class=\\\"CitationRef\\\"\\u003e3\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR5\\\" class=\\\"CitationRef\\\"\\u003e5\\u003c/span\\u003e\\u003c/sup\\u003e Despite the large industrial interest in cellulose, its structure in native plant cell walls remains elusive.\\u003c/p\\u003e \\u003cp\\u003eCellulose is a β-1-4-linked polymer of D-glucose in long chains of perhaps 10,000 units.\\u003csup\\u003e\\u003cspan citationid=\\\"CR6\\\" class=\\\"CitationRef\\\"\\u003e6\\u003c/span\\u003e\\u003c/sup\\u003e These glucan chains have a 2-fold helical structure which allows them to crystallise via both stacking interactions and a network of hydrogen bonding to form long microfibrils in plant cell walls.\\u003csup\\u003e\\u003cspan citationid=\\\"CR7\\\" class=\\\"CitationRef\\\"\\u003e7\\u003c/span\\u003e\\u003c/sup\\u003e Estimates of the diameter of these cellulose microfibrils are between 3\\u0026ndash;5 nm.\\u003csup\\u003e\\u003cspan citationid=\\\"CR8\\\" class=\\\"CitationRef\\\"\\u003e8\\u003c/span\\u003e\\u003c/sup\\u003e In recent work, based on biophysical measurement and advances in understanding the biosynthesis, it is believed that a microfibril is formed from 18\\u0026ndash;24 glucan chains.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR10 CR11 CR12 CR13 CR14\\\" citationid=\\\"CR9\\\" class=\\\"CitationRef\\\"\\u003e9\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR15\\\" class=\\\"CitationRef\\\"\\u003e15\\u003c/span\\u003e\\u003c/sup\\u003e The arrangement of these chains in the microfibril into several stacked sheets also remains unknown, but computational studies have suggested some possible habits.\\u003csup\\u003e\\u003cspan citationid=\\\"CR8\\\" class=\\\"CitationRef\\\"\\u003e8\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR16\\\" class=\\\"CitationRef\\\"\\u003e16\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eCellulose Iα and Iβ are well known forms of crystalline cellulose and have been characterised using both diffraction techniques and solid-state NMR.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR18 CR19 CR20\\\" citationid=\\\"CR17\\\" class=\\\"CitationRef\\\"\\u003e17\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e\\u003c/sup\\u003e Cellulose Iα has a P1 triclinic structure and has non-equivalent glucosyl residues in a cellobiose unit within the same chain, whereas cellulose Iβ has a P2\\u003csub\\u003e1\\u003c/sub\\u003e monoclinic structure and its non-equivalent glucosyl residues are in alternating sheets of glucan chains, called centre and origin.\\u003csup\\u003e\\u003cspan citationid=\\\"CR17\\\" class=\\\"CitationRef\\\"\\u003e17\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR18\\\" class=\\\"CitationRef\\\"\\u003e18\\u003c/span\\u003e\\u003c/sup\\u003e It is known that cellulose from bacteria and some algae have predominantly a cellulose Iα structure whilst tunicate cellulose, which consists of large crystals, is predominantly cellulose Iβ. Although it is possible to obtain high resolution X-ray and neutron diffraction patterns for these larger crystalline forms of cellulose, this is not achievable on native plant cellulose.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR23\\\" citationid=\\\"CR22\\\" class=\\\"CitationRef\\\"\\u003e22\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR24\\\" class=\\\"CitationRef\\\"\\u003e24\\u003c/span\\u003e\\u003c/sup\\u003e The very thin plant microfibrils result in low resolution diffraction patterns such that atomic resolution cannot be achieved.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR25\\\" citationid=\\\"CR24\\\" class=\\\"CitationRef\\\"\\u003e24\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR26\\\" class=\\\"CitationRef\\\"\\u003e26\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eSolid-state NMR has been used for studying cellulose since the 1980s, mainly using 1D experiments.\\u003csup\\u003e\\u003cspan citationid=\\\"CR27\\\" class=\\\"CitationRef\\\"\\u003e27\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR28\\\" class=\\\"CitationRef\\\"\\u003e28\\u003c/span\\u003e\\u003c/sup\\u003e It is particularly useful as, in contrast to diffraction, it does not require long-range order and samples can be analysed in situ. Nevertheless, the thin nature of the plant microfibrils means that the surface glucose residues contribute to the NMR spectra far more than in large crystals of tunicates or fibrils of bacteria, complicating attempts to fully assign the spectra. Furthermore, plant samples may contain non-cellulosic cell wall components that contribute to spectral peaks in the cellulose region, as well as a so-called amorphous cellulose component.\\u003csup\\u003e\\u003cspan citationid=\\\"CR29\\\" class=\\\"CitationRef\\\"\\u003e29\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR30\\\" class=\\\"CitationRef\\\"\\u003e30\\u003c/span\\u003e\\u003c/sup\\u003e The cellulose fibrils also interact with other components in the cell wall which is likely to influence the chemical shifts of cellulose glucosyl residues.\\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR32\\\" citationid=\\\"CR31\\\" class=\\\"CitationRef\\\"\\u003e31\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR33\\\" class=\\\"CitationRef\\\"\\u003e33\\u003c/span\\u003e\\u003c/sup\\u003e Within these limitations, it is widely thought, based on NMR assignments and the fitting of X-ray diffraction patterns, that cellulose of higher plants is a mixture of mainly cellulose Iβ and some cellulose Iα\\u003csup\\u003e\\u003cspan citationid=\\\"CR28\\\" class=\\\"CitationRef\\\"\\u003e28\\u003c/span\\u003e\\u003c/sup\\u003e although there are also several studies that suggest plant cellulose could have its own distinct structure.\\u003csup\\u003e\\u003cspan citationid=\\\"CR28\\\" class=\\\"CitationRef\\\"\\u003e28\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR35\\\" class=\\\"CitationRef\\\"\\u003e35\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eIn NMR spectra of cellulose, the carbon shifts of C4 and C6 of the different glucose units in cellulose are split into two main regions which, following Dupree et al.,\\u003csup\\u003e\\u003cspan citationid=\\\"CR32\\\" class=\\\"CitationRef\\\"\\u003e32\\u003c/span\\u003e\\u003c/sup\\u003e are defined as domain 1 glucosyl residues which have C4\\u003csup\\u003e1\\u003c/sup\\u003e and C6\\u003csup\\u003e1\\u003c/sup\\u003e shifts of ~\\u0026thinsp;89 ppm and ~\\u0026thinsp;65 ppm respectively, and domain 2 glucosyl residues which have C4\\u003csup\\u003e2\\u003c/sup\\u003e and C6\\u003csup\\u003e2\\u003c/sup\\u003e shifts of ~\\u0026thinsp;84 ppm and ~\\u0026thinsp;62 ppm. Domain 1 has been thought to reflect crystalline cellulose as its shifts are close to those of the crystalline cellulose Iα and Iβ\\u003csup\\u003e\\u003cspan citationid=\\\"CR29\\\" class=\\\"CitationRef\\\"\\u003e29\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR36\\\" class=\\\"CitationRef\\\"\\u003e36\\u003c/span\\u003e\\u003c/sup\\u003e, and these residues are normally also considered interior chains of the microfibril.\\u003csup\\u003e\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e\\u003c/sup\\u003e Domain 2 glucose residues are believed to be surface glucan chains of the cellulose microfibril and are also sometimes described as amorphous.\\u003csup\\u003e\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR36\\\" class=\\\"CitationRef\\\"\\u003e36\\u003c/span\\u003e\\u003c/sup\\u003e When restricted to 1D NMR it is only possible to resolve residues in these two domains using C4 and C6 shifts, as the C5 shifts overlap with those from C2 and C3.\\u003csup\\u003e\\u003cspan citationid=\\\"CR32\\\" class=\\\"CitationRef\\\"\\u003e32\\u003c/span\\u003e\\u003c/sup\\u003e The C4 cellulose region of the 1D NMR spectrum tends to be reasonably clear of contributions from hemicellulose and pectin etc. and so the ratio of the two domains (C4\\u003csup\\u003e1\\u003c/sup\\u003e:C4\\u003csup\\u003e2\\u003c/sup\\u003e) has been widely used as a measure of the crystallinity of cellulose and also to estimate the size of the cellulose microfibril.\\u003csup\\u003e\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR38\\\" class=\\\"CitationRef\\\"\\u003e38\\u003c/span\\u003e\\u003c/sup\\u003e It has been suggested \\u003csup\\u003e\\u003cspan citationid=\\\"CR39\\\" class=\\\"CitationRef\\\"\\u003e39\\u003c/span\\u003e\\u003c/sup\\u003e that the major source of this change in shift between the two domains arises from a change in the conformation of the hydroxymethyl group of the glucose unit from \\u003cem\\u003etg\\u003c/em\\u003e in domain 1 to \\u003cem\\u003egt\\u003c/em\\u003e or \\u003cem\\u003egg\\u003c/em\\u003e in domain 2 and this has since been supported by several studies using DFT or NMR.\\u003csup\\u003e\\u003cspan citationid=\\\"CR16\\\" class=\\\"CitationRef\\\"\\u003e16\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR40\\\" class=\\\"CitationRef\\\"\\u003e40\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR41\\\" class=\\\"CitationRef\\\"\\u003e41\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eIn recent years, developments in \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC labelling of plant cell wall materials (and dynamic nuclear polarisation (DNP)) has enabled access to a wealth of 2D Magic Angle Spinning (MAS) NMR experiments that have allowed unparalleled insight into the structure of plant cell walls.\\u003csup\\u003e\\u003cspan citationid=\\\"CR42\\\" class=\\\"CitationRef\\\"\\u003e42\\u003c/span\\u003e\\u003c/sup\\u003e These studies have made significant strides in understanding hemicellulose structure and interactions with cellulose in both primary (PCW) and secondary cell walls (SCW).\\u003csup\\u003e\\u003cspan citationid=\\\"CR43\\\" class=\\\"CitationRef\\\"\\u003e43\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR44\\\" class=\\\"CitationRef\\\"\\u003e44\\u003c/span\\u003e\\u003c/sup\\u003e NMR has been used to identify the 2-fold helical structure of xylan in SCW, and the molecular architecture and water-polymer interactions in softwoods.\\u003csup\\u003e\\u003cspan citationid=\\\"CR31\\\" class=\\\"CitationRef\\\"\\u003e31\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR44\\\" class=\\\"CitationRef\\\"\\u003e44\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR45\\\" class=\\\"CitationRef\\\"\\u003e45\\u003c/span\\u003e\\u003c/sup\\u003e Several distinct glucosyl residues, each with differing chemical shifts due to differing environments within primary cell wall cellulose, were described by Wang et al. who found similar glucose environments in a range of different primary cell wall materials \\u003cem\\u003e(Brachypodium\\u003c/em\\u003e, \\u003cem\\u003eZea mays\\u003c/em\\u003e, and \\u003cem\\u003eArabidopsis thaliana\\u003c/em\\u003e).\\u003csup\\u003e\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e\\u003c/sup\\u003e They reported that glucosyl residues of five environments are within the fibril, and two lie on the surface. They suggested several cellulose microfibril models based on the understanding that the domain 1 glucosyl residues (\\u003cem\\u003etg\\u003c/em\\u003e hydroxymethyl conformation) arise solely from sites in the interior chains of the microfibril.\\u003csup\\u003e\\u003cspan citationid=\\\"CR41\\\" class=\\\"CitationRef\\\"\\u003e41\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR47\\\" class=\\\"CitationRef\\\"\\u003e47\\u003c/span\\u003e\\u003c/sup\\u003e Under this assumption, and using quantitative NMR, they found that cellulose had too much \\u0026lsquo;interior\\u0026rsquo; relative to surface residues for the probable microfibril size of 18\\u0026ndash;24 chains. Additionally, some of the interior residues were more distant from water than others. As a result, the cellulose microfibrils bundle was suggested to produce more interior glucose residues.\\u003csup\\u003e\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eIn this work we assign the NMR shifts of the main cellulosic glucose environments found in a range of plant secondary cell wall materials and begin to determine their position in the cellulose microfibril. To achieve this, we studied both poplar wood and xylanase-treated holo-cellulose nanofibrils (hCNFs) prepared from poplar wood using minimal processing to maintain the native cellulose structure.\\u003csup\\u003e\\u003cspan citationid=\\\"CR48\\\" class=\\\"CitationRef\\\"\\u003e48\\u003c/span\\u003e\\u003c/sup\\u003e By removing hemicellulose, lignin, and any amorphous cellulose present we are left with a clearer spectrum with improved resolution which provides more certainty in the determination of the NMR shifts for each distinct glucose environment. By comparing with the recently corrected NMR shift assignments for cellulose I we conclusively show that plant cellulose is not a mixture of Iα and Iβ, and that the core of the native cellulose microfibril has NMR shifts identical to tunicate cellulose 1β. We also show that the glucose residues in domain 1 and domain 2 cannot be split into crystalline and amorphous nor can they be considered to be exclusively interior and surface. In particular, we find that one of the domain 1 glucosyl residue environments is part of a surface glucan chain. This discovery allows a more realistic estimate of the interior to surface ratio of fibrils to be obtained than the many previous estimations using \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR.\\u003c/p\\u003e\"},{\"header\":\"Results\",\"content\":\"\\u003cp\\u003e \\u003cb\\u003eAssigning\\u003c/b\\u003e \\u003csup\\u003e\\u003cb\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/b\\u003e\\u003c/sup\\u003e\\u003cb\\u003eC NMR chemical shifts for the glucosyl residue environments in cellulose within native plant cell walls\\u003c/b\\u003e\\u003c/p\\u003e \\u003cp\\u003eThroughout our work\\u003csup\\u003e\\u003cspan citationid=\\\"CR31\\\" class=\\\"CitationRef\\\"\\u003e31\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR45\\\" class=\\\"CitationRef\\\"\\u003e45\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR49\\\" class=\\\"CitationRef\\\"\\u003e49\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR50\\\" class=\\\"CitationRef\\\"\\u003e50\\u003c/span\\u003e\\u003c/sup\\u003e never-dried native samples of many different plants have been studied using \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC MAS NMR to understand the structure and interactions of cellulose hemicellulose, lignin and water in plant cell walls. The composition of the secondary cell wall of poplar is relatively simple with only 3 main components cellulose, xylan and lignin. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e shows the 1D CP MAS NMR spectrum of the neutral carbohydrate region, which is dominated by the cellulose signal, of poplar wood. The carbon 1 (C1) peak is at ~\\u0026thinsp;105 ppm, C2, C3, C5 overlap in the region between 70\\u0026ndash;76 ppm and the C4 and C6 carbons are split into two main peaks. The cellulose glucose environments in domain 1 show C4\\u003csup\\u003e1\\u003c/sup\\u003e and C6\\u003csup\\u003e1\\u003c/sup\\u003e peaks at ~\\u0026thinsp;89 ppm and ~\\u0026thinsp;65 ppm respectively and the C4\\u003csup\\u003e2\\u003c/sup\\u003e and C6\\u003csup\\u003e2\\u003c/sup\\u003e domain 2 peaks are at ~\\u0026thinsp;84 ppm and ~\\u0026thinsp;62 ppm respectively. Interestingly the C4 peaks in the 1D CP MAS spectrum are not simple Lorentzian line shapes indicating there are multiple glucose environments within both domains. The C4\\u003csup\\u003e2\\u003c/sup\\u003e region especially is clearly split into two peaks at ~\\u0026thinsp;83.5 and ~\\u0026thinsp;84.5 ppm.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eFully \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC labelled poplar wood enabled the structure of plant cellulose to be explored using a wealth of 2D NMR experiments. 2D experiments such as CP INADEQUATE and 30 ms CP PDSD were used to resolve several different glucose environments within cellulose. The CP refocussed INADEQUATE is a double quantum experiment that correlates two covalently bonded carbons. The bonded carbons appear at the same DQ shift which is given by the sum of the respective SQ shift of the correlated carbons. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig2\\\" class=\\\"InternalRef\\\"\\u003e2\\u003c/span\\u003e shows the neutral carbohydrate region of the CP refocussed INADEQUATE spectrum of poplar wood. The high resolution means that the carbons within each glucosyl residue can be followed through in the 2D spectrum. This is demonstrated in red for the glucose residue in domain 1 named environment \\u0026lsquo;c\\u0026rsquo;, and in black for the C4-C6 region for the residue in domain 2 environment \\u0026lsquo;j\\u0026rsquo;. Our naming convention and assignments are based on those of Wang et al. \\u003csup\\u003e\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e\\u003c/sup\\u003e for cellulose in primary cell walls since many of the cellulose environments we observe have similar values, however glucose environment j, which is clearly visible for C4, C5 and C6, is newly reported. The C6 region of the spectrum (inset in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig2\\\" class=\\\"InternalRef\\\"\\u003e2\\u003c/span\\u003e) shows the main glucose environments identified within domain 1 and domain 2. This region also shows a minor domain 2 environment, named k. In total there are three major glucose environments a, b, c in domain 1 and three f, g, j in domain 2.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eTo make a more complete assignment of the NMR shifts of the glucose environments within cellulose, a 30 ms CP PDSD spectrum was analysed alongside the INADEQUATE spectrum. The CP PDSD experiment is a through space correlation experiment which, with a mixing time of 30 ms, shows cross peaks of carbons only if they are within the same glucose ring. SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e shows the 30 ms CP PDSD spectrum highlighting the key regions which are particularly useful for identifying different glucose environments. Two further minor glucose environments in domain 1 labelled d and e (which is very minor) have previously been assigned in Wang et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e\\u003c/sup\\u003e. Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e lists the \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR shifts for all the different glucose environments identified in cellulose, including the more minor ones.\\u003c/p\\u003e \\u003cp\\u003e \\u003cdiv class=\\\"gridtable\\\"\\u003e\\u003ctable float=\\\"Yes\\\" id=\\\"Tab1\\\" border=\\\"1\\\"\\u003e \\u003ccaption language=\\\"En\\\"\\u003e \\u003cdiv class=\\\"CaptionNumber\\\"\\u003eTable 1\\u003c/div\\u003e \\u003cdiv class=\\\"CaptionContent\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eNMR chemical shifts of all glucose environments assigned in cellulose of poplar wood. *\\u003c/b\\u003eThe naming convention and assignments are based on those of Wang et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e\\u003c/sup\\u003e with additional environments. The minor glucose environments d, e and k are italicised. A comparison to Iα and Iβ cellulose NMR chemical shifts is shown in SI Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e. \\u003csup\\u003e$\\u003c/sup\\u003eAll \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR chemical shifts are in ppm and have an error of \\u0026plusmn;\\u0026thinsp;0.1 ppm.\\u003c/p\\u003e \\u003c/div\\u003e \\u003c/caption\\u003e \\u003ccolgroup cols=\\\"7\\\"\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c1\\\" colnum=\\\"1\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c2\\\" colnum=\\\"2\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c3\\\" colnum=\\\"3\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c4\\\" colnum=\\\"4\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c5\\\" colnum=\\\"5\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c6\\\" colnum=\\\"6\\\"\\u003e\\u003c/div\\u003e \\u003cdiv align=\\\"left\\\" class=\\\"colspec\\\" colname=\\\"c7\\\" colnum=\\\"7\\\"\\u003e\\u003c/div\\u003e \\u003cthead\\u003e \\u003ctr\\u003e \\u003cth align=\\\"left\\\" colspan=\\\"7\\\" nameend=\\\"c7\\\" namest=\\\"c1\\\"\\u003e \\u003cp\\u003eDomain 1\\u003c/p\\u003e \\u003c/th\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003eGlucose Environment\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003eC1\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003eC2\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003eC3\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003eC4\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003eC5\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003cth align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003eC6\\u003csup\\u003e1\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/th\\u003e \\u003c/tr\\u003e \\u003c/thead\\u003e \\u003ctbody\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003ea* (Cellulose Iβ origin)\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e105.8\\u003csup\\u003e$\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e71.7\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e74.3\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e89.0\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e72.5\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e65.0\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003eb\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e105.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e72.6\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e75.5\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e89.1\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e72.6\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e65.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003ec (Cellulose Iβ centre)\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e104.1\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e71.8\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e75.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e88.1\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e71.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e65.8\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003ed\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e105.2\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e72.5\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e74.9\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e87.1\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e72.5\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e64.7\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003ee\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e105.0\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e-\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e74.7\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e89.8\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e71.1\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e65.3\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colspan=\\\"7\\\" nameend=\\\"c7\\\" namest=\\\"c1\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eDomain 2\\u003c/b\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eGlucose Environment\\u003c/b\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC1\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC2\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC3\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC4\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC5\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e\\u003cb\\u003eC6\\u003c/b\\u003e\\u003csup\\u003e\\u003cb\\u003e2\\u003c/b\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003ef\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e105.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e72.4\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e74.4\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e84.5\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e75.3\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e62.4\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003eg\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e104.9\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e72.4\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e75.3\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e83.5\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e75.2\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e61.4\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003ej\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e105.1\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e-\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e-\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e83.3\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e73.9\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e61.3\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003ctr\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c1\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003ek\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c2\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e-\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c3\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e-\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c4\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e-\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c5\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e83.8\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c6\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e74.3\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003ctd align=\\\"left\\\" colname=\\\"c7\\\"\\u003e \\u003cp\\u003e\\u003cem\\u003e63.5\\u003c/em\\u003e\\u003c/p\\u003e \\u003c/td\\u003e \\u003c/tr\\u003e \\u003c/tbody\\u003e \\u003c/colgroup\\u003e \\u003c/table\\u003e\\u003c/div\\u003e \\u003c/p\\u003e \\u003cdiv id=\\\"Sec3\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eGlucose environments in cellulose of cell walls of different plants\\u003c/h2\\u003e \\u003cp\\u003eHaving identified all the glucose environments in cellulose of poplar wood we were interested to compare these to cellulose of other plants. Over the years we have used high resolution 2D NMR to study a wide range of different plants including monocots, eudicots, and gymnosperms.\\u003csup\\u003e\\u003cspan citationid=\\\"CR44\\\" class=\\\"CitationRef\\\"\\u003e44\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR49\\\" class=\\\"CitationRef\\\"\\u003e49\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR50\\\" class=\\\"CitationRef\\\"\\u003e50\\u003c/span\\u003e\\u003c/sup\\u003e The cellulose of many of the plants investigated has remarkably similar NMR shifts such that it is possible to identify our assigned glucose environments in a wide range of plants. This is illustrated using a comparison of two different regions of the 30 ms CP PDSD spectra of a eudicot (poplar), a monocot (\\u003cem\\u003eBrachypodium\\u003c/em\\u003e) and a gymnosperm (spruce) in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig3\\\" class=\\\"InternalRef\\\"\\u003e3\\u003c/span\\u003e for the C1-C4 region and SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig2\\\" class=\\\"InternalRef\\\"\\u003e2\\u003c/span\\u003e for the C4-C6 region. The relative amount of each glucose environment varies between samples, for example, spruce wood appears to have relatively more b versus a or c whereas in poplar wood cellulose these are more similar in quantity. \\u003cem\\u003eBrachypodium\\u003c/em\\u003e has significantly more of site d which is a minor cellulose site in the poplar wood secondary cell walls. Whilst not visible in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig3\\\" class=\\\"InternalRef\\\"\\u003e3\\u003c/span\\u003e and SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig2\\\" class=\\\"InternalRef\\\"\\u003e2\\u003c/span\\u003e, the cellulose environment e is a very minor environment in all three plants. There is one additional cellulose environment \\u0026lsquo;s\\u0026rsquo; seen in the spectra of spruce. There are some small differences in the shifts for some of the more minor cellulose environments. For example, we found that environment d is generally broader than sites a, b, and c with its C4 NMR shift varying by ~\\u0026thinsp;0.3 ppm indicating that there are more variations in the d local environment. Generally, the NMR shifts remain similar between all the plants, indicating the glucose environments are constant in the cellulose fibrils.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003c/div\\u003e \\u003cdiv id=\\\"Sec4\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eXylanase-treated holo-cellulose nanofibrils (hCNFs) maintain the native plant cellulose structure of poplar wood\\u003c/h2\\u003e \\u003cp\\u003eWhilst studying the native plant cellulose in-situ is ideal for ensuring minimal disruption of the cell wall, the NMR spectrum can be crowded since signals from both hemicelluloses and lignin are present which tends to limit the resolution. The glucose environments in the fibrils may also be influenced by interactions of surface residues with hemicellulose or lignin. By removing the lignin during preparation of holo-cellulose nanofibrils (hCNFs) we can maintain the cellulose fibril structure whilst improving the resolution of the NMR spectrum.\\u003csup\\u003e\\u003cspan citationid=\\\"CR48\\\" class=\\\"CitationRef\\\"\\u003e48\\u003c/span\\u003e\\u003c/sup\\u003e To remove hemicellulose we treated hCNFs produced from the poplar wood with xylanase. TEM images of these hCNFs show long and thin cellulose fibrils which are loosely bundled, as observed with the 3nm width hCNFs prepared similarly from Arabidopsis see SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig3\\\" class=\\\"InternalRef\\\"\\u003e3\\u003c/span\\u003e.\\u003csup\\u003e\\u003cspan citationid=\\\"CR48\\\" class=\\\"CitationRef\\\"\\u003e48\\u003c/span\\u003e\\u003c/sup\\u003e To investigate whether the xylanased hCNFs have maintained the cellulose environments we compared the NMR spectra of poplar wood with those of the xylanased hCNFs. Their 1D CP MAS spectra are shown in SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig4\\\" class=\\\"InternalRef\\\"\\u003e4\\u003c/span\\u003e. There is a significant change in the total signal in the C4 region and in the ratio of domain 1 and domain 2 cellulose peaks. This change is due to the removal of both lignin and hemicelluloses as well perhaps as loss of a less ordered cellulose component.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eThe 2D 30 ms CP PDSD comparisons of the C1-C4 region and the C1-C6 region shown in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig4\\\" class=\\\"InternalRef\\\"\\u003e4\\u003c/span\\u003e and SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig5\\\" class=\\\"InternalRef\\\"\\u003e5\\u003c/span\\u003e respectively show that the domain 1 glucose environments remain almost unchanged by the production of the hCNFs from wood. The domain 2 environments show some slight NMR shift changes, typically\\u0026thinsp;\\u0026lt;\\u0026thinsp;0.3 ppm, in both the f and g/j environments. As the domain 2 environments are surface chains of the fibril this may be due to the removal of both lignin and the hemicellulose xylan. The domain 2 cellulose region of the 30 ms CP PDSD spectrum shown in SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig5\\\" class=\\\"InternalRef\\\"\\u003e5\\u003c/span\\u003e exhibits distinctly narrower peaks for the xylanased hCNFs, i.e. a broad contribution of somewhat less ordered environments has been removed. Since there were no other changes, we are convinced that the production of the hCNFs causes relatively minimal disturbance to the different glucose environments in the poplar fibrils.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003c/div\\u003e\\n\\u003ch3\\u003ePlant cellulose fibril core environments are identical to tunicate cellulose Iβ\\u003c/h3\\u003e\\n\\u003cp\\u003eThe improvement in the resolution of NMR spectra makes analysis of the xylanased hCNFs ideal for determining NMR shifts, quantities, and the relative location of the different glucose sites within the fibrils. There have been many 1D \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR studies of cellulose Iα and Iβ, but the use of different referencing has led to a spread in the published NMR shifts.\\u003csup\\u003e\\u003cspan citationid=\\\"CR19\\\" class=\\\"CitationRef\\\"\\u003e19\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR20\\\" class=\\\"CitationRef\\\"\\u003e20\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR51\\\" class=\\\"CitationRef\\\"\\u003e51\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR52\\\" class=\\\"CitationRef\\\"\\u003e52\\u003c/span\\u003e\\u003c/sup\\u003e Recently Brouwer et.al. have resolved these discrepancies to give a consistent set of \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR values.\\u003csup\\u003e\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e\\u003c/sup\\u003e Using the same reference as Brouwer et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e\\u003c/sup\\u003e we can compare our shifts from the high-resolution spectra of xylanased hCNFs from poplar wood with those of cellulose Iα and Iβ. None of the domain 2 glucose environments are close to those of the NMR shifts of glucose in these cellulose allomorphs (SI Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e). This means we only need to consider the \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR shifts of domain 1 glucose residues, where there are only 3 major sites a, b and c. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig5\\\" class=\\\"InternalRef\\\"\\u003e5\\u003c/span\\u003e shows the C4-C3/C5 region of the CP refocussed INADEQUATE spectrum of the xylanased hCNFs with the Iα and Iβ shift positions marked. It is evident that the shifts of cellulose Iα are different from those of native plant cellulose whereas the C3, C4 and C5 shifts of a and c are very close to those of Iβ. Indeed, all the shifts for sites a and c, apart from C1, are within ~\\u0026thinsp;0.2 ppm of those of cellulose Iβ (SI Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e). Since the only substantial difference we observe from the Iβ shifts is in C1 where it seems likely that the Kono et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR19\\\" class=\\\"CitationRef\\\"\\u003e19\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR20\\\" class=\\\"CitationRef\\\"\\u003e20\\u003c/span\\u003e\\u003c/sup\\u003e assignment was incorrect, as highlighted in SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig6\\\" class=\\\"InternalRef\\\"\\u003e6\\u003c/span\\u003e and SI Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e. This misassignment by Kono et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR20\\\" class=\\\"CitationRef\\\"\\u003e20\\u003c/span\\u003e\\u003c/sup\\u003e presumably arose because a refocussed INADEQUATE spectrum was used for their assignment and both glucose environments in cellulose Iβ have nearly identical C2 NMR shifts making it difficult to distinguish the associated C1 shifts. Using the 30 ms CP PDSD spectrum (SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig6\\\" class=\\\"InternalRef\\\"\\u003e6\\u003c/span\\u003e) together with the INADEQUATE spectrum allowed us to determine confidently the C1 shifts for environments a and c. In a later paper Brouwer and Mikolajewski also commented on the possibility that the Kono et al. C1 assignments could be swapped.\\u003csup\\u003e\\u003cspan citationid=\\\"CR53\\\" class=\\\"CitationRef\\\"\\u003e53\\u003c/span\\u003e\\u003c/sup\\u003e With this new NMR shift assignment of cellulose Iβ, all the shifts of fibril core glucose environments a and c match closely tunicate cellulose Iβ. We now assign environment a as origin chain, and environment c as centre chain, because the DFT calculations from the cellulose Iβ predict the C1 chemical shift of the residues in the origin chains is ~\\u0026thinsp;2 ppm higher than the C1 of residues in the centre chains, as seen here for environments a and c respectively.\\u003csup\\u003e\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e\\u003c/sup\\u003e Native plant cellulose is clearly not a mixture of cellulose Iα and Iβ since, despite environment b having similar shifts to one of the Iα glucose environments for both C1 and C4 (SI Table\\u0026nbsp;2), there is no sign of the second Iα glucose environment which would also be visible in similar quantities within domain 1. This identification of glucose environments a and c now shows that the plant cellulose fibrils are cellulose Iβ.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eHaving determined that two of the main domain 1 environments correspond to glucose in the classical cellulose Iβ structure, we were interested to understand the origin of the third, remaining, domain 1 environment, b. Since b is from domain 1, it has been thought also to be interior to the fibril. To investigate this further, a 2D water-edited (w.e.) 30 ms CP PDSD spectrum was acquired to probe the glucose environments that are closest to water. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig6\\\" class=\\\"InternalRef\\\"\\u003e6\\u003c/span\\u003e shows the C1-C4 and C1-C6 regions of the w.e. spectrum compared with a standard 30 ms CP PDSD spectrum. As expected for surface glucose residues, the signal for the domain 2 sites f, g and j is enhanced indicating that they are more water accessible than the fibril core sites a and c. SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig7\\\" class=\\\"InternalRef\\\"\\u003e7\\u003c/span\\u003e shows similar proximity to water of these three major domain 2 glucose environments. Site j may be further from water than f and g, indicating these residues may be located on different surfaces of the fibril. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig6\\\" class=\\\"InternalRef\\\"\\u003e6\\u003c/span\\u003ea shows that the w.e. C1-C4 peak for glucose environment b is also significantly enhanced in a similar way to f, g, and j, indicating that b is also closer to water than the core sites a or c. Glucose environment b of domain 1 is therefore, unexpectedly, in a surface chain of the fibril. Interestingly, for the C1-C6 region shown in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig6\\\" class=\\\"InternalRef\\\"\\u003e6\\u003c/span\\u003eb the signal from b is still enhanced in the w.e. experiment over the signal for the core a and c environments, but to a lesser extent than for the domain 2 surface glucose environments. Being in domain 1, environment b likely reflects a glucose residue in a \\u003cem\\u003etg\\u003c/em\\u003e conformation. This conformation may be because the C6 is facing toward the interior of the cellulose fibril where the hydroxymethyl group hydrogen bonds with other residues.\\u003csup\\u003e\\u003cspan citationid=\\\"CR16\\\" class=\\\"CitationRef\\\"\\u003e16\\u003c/span\\u003e\\u003c/sup\\u003e Hence in the glucose residue of environment b the C6 is further from water than its C4, and further from water than the other surface environments where the C6 has a water-facing \\u003cem\\u003egt\\u003c/em\\u003e or \\u003cem\\u003egg\\u003c/em\\u003e conformation. Thus, in contrast to the widely accepted view, not all the domain 1 glucose residues are interior of the fibril. A consequence of this finding is that the ratio of domain 1 and domain 2 signals cannot be used to estimate the size of a fibril since when site b is included in domain 1 the amount of interior will be overestimated, and surface underestimated. The ratio of interior vs surface of the fibril can however be estimated from the C1 region of the INADEQUATE spectrum where sites a and c are resolved from b, and interior a\\u0026thinsp;+\\u0026thinsp;c and surface D2\\u0026thinsp;+\\u0026thinsp;b can now be determined by integration (see SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig8\\\" class=\\\"InternalRef\\\"\\u003e8\\u003c/span\\u003e). The value of ~\\u0026thinsp;0.5 in the poplar hCNFs is consistent with an 18-chain fibril with 6 interior chains and 12 surface chains. The relative amount of site b is ~\\u0026thinsp;0.7 of sites a or c (see SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig8\\\" class=\\\"InternalRef\\\"\\u003e8\\u003c/span\\u003e).\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eTo investigate the relative proximities of the different glucose environments, two longer (200 ms and 400 ms) mixing time CP PDSD experiments were acquired. As these experiments are probing distances up to ~\\u0026thinsp;5\\u0026ndash;8 \\u0026Aring;, cross-peaks are between glucose residues in different sites within individual fibrils are additionally observed. Figure\\u0026nbsp;\\u003cspan refid=\\\"Fig7\\\" class=\\\"InternalRef\\\"\\u003e7\\u003c/span\\u003e shows that at a mixing time of 200 ms the C1-C1 region of the CP PDSD spectrum gives clear cross peaks between a and c only. This shows that the residue environments a and c are closer to each other than either are to residue environment b. This is consistent with our finding that b is not situated with a and c in the core of the cellulose fibril. The first clear cross peaks between residues in the two cellulose domains can be seen in the C4-C4 region of the 400 ms CP PDSD (Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig8\\\" class=\\\"InternalRef\\\"\\u003e8\\u003c/span\\u003ea). There are cross peaks from domain 2 glucose environments to the single C4\\u003csup\\u003e1\\u003c/sup\\u003e peak corresponding to glucose residue environments a plus b. Given we know that environments a and c are particularly close, and there are no cross peaks to c, the cross peaks from domain 2 observed here are likely to environment b only. This proximity of b and not c to domain 2 glucose residues is also observed in the C6-C6 region of the 400 ms CP PDSD spectrum (Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig8\\\" class=\\\"InternalRef\\\"\\u003e8\\u003c/span\\u003eb). There are also cross peaks between the different domain 2 environments of cellulose. The proximities of b to the domain 2 environments provides further confirmation that b is one of the four major surface glucose residue environments.\\u003c/p\\u003e\"},{\"header\":\"Discussion\",\"content\":\"\\u003cp\\u003eThe structure of native plant cellulose fibrils has been the focus of many studies for decades. Although a wide range of techniques have been used, the structure remains to be fully elucidated.\\u003csup\\u003e\\u003cspan citationid=\\\"CR30\\\" class=\\\"CitationRef\\\"\\u003e30\\u003c/span\\u003e,\\u003cspan additionalcitationids=\\\"CR55\\\" citationid=\\\"CR54\\\" class=\\\"CitationRef\\\"\\u003e54\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR56\\\" class=\\\"CitationRef\\\"\\u003e56\\u003c/span\\u003e\\u003c/sup\\u003e The \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR chemical shift is very sensitive to the local environment such as differing conformation and hydrogen bonding. In principle, NMR therefore allows distinct and recurring glucose residues in the interior and surface of cellulose fibrils to be distinguishable. High resolution is needed to resolve signals from these residues, and was achieved here by studying never dried, hydrated, isolated \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC labelled cellulose fibril samples with high field MAS NMR. We found plant cellulose is comprised of six major glucose environments, with four of these on the surface and just two in the core of the fibril. Importantly, these two core glucose residue sites have \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC NMR shifts very similar to those of tunicate cellulose Iβ centre and origin chains and there is no detectable cellulose Iα, resolving longstanding questions about the composition of plant cellulose.\\u003c/p\\u003e \\u003cp\\u003eThe core of the plant cellulose microfibril is comprised of two types of glucose residues, which lie in cellulose Iβ origin and centre chains. The plant fibril core glucose residue NMR shifts are essentially identical to those of tunicate cellulose Iβ. This finding relied in part on the unified and consistent NMR shifts of cellulose that have recently been determined by Brouwer et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e\\u003c/sup\\u003e Second, we found that the C1 \\u003csup\\u003e13\\u003c/sup\\u003eC shift of the two glucose residues of the origin and centre chains of tunicate cellulose Iβ had previously been misassigned, as was also thought possible by Brouwer and Mikolajewski. \\u003csup\\u003e\\u003cspan citationid=\\\"CR53\\\" class=\\\"CitationRef\\\"\\u003e53\\u003c/span\\u003e\\u003c/sup\\u003e Our corrected assignment of the C1 shifts is also more consistent with DFT calculations of cellulose Iβ origin and centre chain environments.\\u003csup\\u003e\\u003cspan citationid=\\\"CR57\\\" class=\\\"CitationRef\\\"\\u003e57\\u003c/span\\u003e\\u003c/sup\\u003e We therefore propose the plant fibril glucose environments a and c correspond to cellulose Iβ origin and centre chain environments respectively (Table\\u0026nbsp;\\u003cspan refid=\\\"Tab1\\\" class=\\\"InternalRef\\\"\\u003e1\\u003c/span\\u003e). Our NMR shifts for the plant cellulose fibril core are slightly smaller (~\\u0026thinsp;0.2 ppm) than the Brouwer values for tunicate cellulose Iβ.\\u003csup\\u003e\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR53\\\" class=\\\"CitationRef\\\"\\u003e53\\u003c/span\\u003e\\u003c/sup\\u003e Due to the thinness of the plant fibrils, all interior chains are next to surface chains, in contrast to the large crystals of tunicate cellulose that were studied by Kono et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR20\\\" class=\\\"CitationRef\\\"\\u003e20\\u003c/span\\u003e\\u003c/sup\\u003e Therefore the small NMR shift difference could result from these fibril core chains having neighbouring chains, similarly stacked as cellulose Iβ, but which are surface and hence they have different water interactions and bonding. \\u003csup\\u003e\\u003cspan additionalcitationids=\\\"CR20\\\" citationid=\\\"CR19\\\" class=\\\"CitationRef\\\"\\u003e19\\u003c/span\\u003e\\u0026ndash;\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR53\\\" class=\\\"CitationRef\\\"\\u003e53\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eGlucose residues with shifts in domain 1 (C4 around 89ppm) have long been considered to reside in the crystalline core of plant cellulose.\\u003csup\\u003e\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR47\\\" class=\\\"CitationRef\\\"\\u003e47\\u003c/span\\u003e\\u003c/sup\\u003e However, now we have shown that the third major glucose environment in domain 1, site b, is a residue on the surface of the fibril. We previously suggested\\u003csup\\u003e\\u003cspan citationid=\\\"CR31\\\" class=\\\"CitationRef\\\"\\u003e31\\u003c/span\\u003e\\u003c/sup\\u003e that there could be a surface component in the domain 1 region and recently this view has been supported by Addison et al.\\u003csup\\u003e\\u003cspan citationid=\\\"CR33\\\" class=\\\"CitationRef\\\"\\u003e33\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR58\\\" class=\\\"CitationRef\\\"\\u003e58\\u003c/span\\u003e\\u003c/sup\\u003e Indeed, the NMR spectra of cotton of Kirui et al. have relatively less site b than found in the samples analysed here, consistent with a larger microfibril structure with lower surface to interior ratio.\\u003csup\\u003e\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e\\u003c/sup\\u003e Calculations have shown that a change in C6 hydroxymethyl conformation from \\u003cem\\u003etg\\u003c/em\\u003e, to \\u003cem\\u003egt/gg\\u003c/em\\u003e changes the C4 shift by ~\\u0026thinsp;5 ppm.\\u003csup\\u003e\\u003cspan citationid=\\\"CR45\\\" class=\\\"CitationRef\\\"\\u003e45\\u003c/span\\u003e\\u003c/sup\\u003e Adoption of the \\u003cem\\u003etg\\u003c/em\\u003e conformation is likely to occur where the C6 is facing towards the inside of the fibril and the orientation is fixed by hydrogen bonding to other glucosyl residues rather than water. This indicates that the glucose residue in environment b has the \\u003cem\\u003etg\\u003c/em\\u003e conformation and is likely to be a residue in a surface glucan chain but facing inwards towards another chain. It is now clearly more appropriate to consider domain 1 glucose residues as reflecting \\u003cem\\u003etg\\u003c/em\\u003e C6 hydroxymethyl confirmation rather than present only in crystalline cellulose and only in the core of fibril.\\u003c/p\\u003e \\u003cp\\u003eThe earlier assignment of all types of domain 1 glucose residues as solely interior to the cellulose fibril has led to a number of debates concerning the core structure.\\u003csup\\u003e\\u003cspan citationid=\\\"CR9\\\" class=\\\"CitationRef\\\"\\u003e9\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR46\\\" class=\\\"CitationRef\\\"\\u003e46\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR47\\\" class=\\\"CitationRef\\\"\\u003e47\\u003c/span\\u003e\\u003c/sup\\u003e By comparing 1D NMR spectra in the domain 1 region, it was widely thought that the plant cellulose core is a mixture of cellulose Iα and cellulose Iβ.\\u003csup\\u003e\\u003cspan citationid=\\\"CR51\\\" class=\\\"CitationRef\\\"\\u003e51\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR54\\\" class=\\\"CitationRef\\\"\\u003e54\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR59\\\" class=\\\"CitationRef\\\"\\u003e59\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR60\\\" class=\\\"CitationRef\\\"\\u003e60\\u003c/span\\u003e\\u003c/sup\\u003e This misinterpretation arose due to the glucose in environment b having similar C1 and C4 shifts as those of one of the glucose residues of cellulose Iα. Furthermore, the 1D spectrum appears significantly different to that of cellulose Iβ because the C4 and C6 signals from core residue a and surface residue b overlap. This means that the ratio of the core cellulose Iβ environments (now known to be a and c) appears not to be the expected 1:1. The mixture of cellulose of Iα and Iβ has been hypothesised to arise as faults in the fibril structure or by alternating sheets of cellulose Iα and Iβ.\\u003csup\\u003e\\u003cspan citationid=\\\"CR29\\\" class=\\\"CitationRef\\\"\\u003e29\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR54\\\" class=\\\"CitationRef\\\"\\u003e54\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR55\\\" class=\\\"CitationRef\\\"\\u003e55\\u003c/span\\u003e\\u003c/sup\\u003e There have also been suggestions that the core structure could be completely distinct from that of cellulose Iβ.\\u003csup\\u003e\\u003cspan citationid=\\\"CR34\\\" class=\\\"CitationRef\\\"\\u003e34\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e\\u003c/sup\\u003e We have now shown the fibril is solely cellulose Iβ, and so cellulose modelling, computational studies and interpretation of diffraction patterns should be based on the Iβ structure.\\u003c/p\\u003e \\u003cp\\u003eThe breadth of studies across plant and material sciences has given rise to terminology to describe cellulose which can be confusing, especially when the same terminology is applied to data acquired by different methods. For example, descriptions of cellulose as crystalline and amorphous have different meanings between NMR studies and diffraction experiments which require order over a much longer distance.\\u003csup\\u003e\\u003cspan citationid=\\\"CR61\\\" class=\\\"CitationRef\\\"\\u003e61\\u003c/span\\u003e\\u003c/sup\\u003e Historically the glucose residues observed in the two domains of NMR spectra were assigned as crystalline and amorphous cellulose because the characterised crystalline forms of cellulose have \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC shifts in the domain 1 (C4\\u0026thinsp;~\\u0026thinsp;89 ppm) region.\\u003csup\\u003e\\u003cspan citationid=\\\"CR38\\\" class=\\\"CitationRef\\\"\\u003e38\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR62\\\" class=\\\"CitationRef\\\"\\u003e62\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR63\\\" class=\\\"CitationRef\\\"\\u003e63\\u003c/span\\u003e\\u003c/sup\\u003e Surface chains are more mobile and are sometimes described as paracrystalline or amorphous.\\u003csup\\u003e\\u003cspan citationid=\\\"CR25\\\" class=\\\"CitationRef\\\"\\u003e25\\u003c/span\\u003e\\u003c/sup\\u003e In our work the line widths of the signals from the six glucose residue environments within the surface and core of the fibril are very similar (such as in the C6 region of the CP INADEQUATE spectrum of the xylanase treated hCNFs shown in SI Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig9\\\" class=\\\"InternalRef\\\"\\u003e9\\u003c/span\\u003e summarised in SI Table\\u0026nbsp;2). NMR chemical shifts are sensitive only to short range structure (\\u0026thinsp;\\u0026lt;\\u0026thinsp;~\\u0026thinsp;5 A), and so the similar linewidths indicate that both domains have similar short-range order. Therefore, the core and surface glucose residues cannot be divided simply into crystalline and amorphous cellulose. On the other hand, a comparison of the 1D CPMAS spectrum of poplar wood versus the xylanase-treated hCNF sample in Fig. SI4 shows that a weak (\\u0026lt;\\u0026thinsp;10%) broad signal underlying the domain 2 region, which may arise from some less ordered material, is removed in the production of the hCNF sample. This less ordered component includes lignin and xylan but also could be related to a less crystalline portion of cellulose that is known to be easily hydrolysed.\\u003csup\\u003e\\u003cspan citationid=\\\"CR64\\\" class=\\\"CitationRef\\\"\\u003e64\\u003c/span\\u003e\\u003c/sup\\u003e Therefore, whilst there might be a small component of less ordered cellulose giving rise to signal in the domain 2 region this term cannot describe all these glucose residues. The ratio of the C4\\u003csup\\u003e1\\u003c/sup\\u003e and C4\\u003csup\\u003e2\\u003c/sup\\u003e signals from 1D NMR spectra was initially called the crystallinity index (CI). The CI has been used to describe the crystallinity of cellulose and is regularly calculated alongside crystallinity measures by diffraction techniques.\\u003csup\\u003e\\u003cspan citationid=\\\"CR38\\\" class=\\\"CitationRef\\\"\\u003e38\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR62\\\" class=\\\"CitationRef\\\"\\u003e62\\u003c/span\\u003e\\u003c/sup\\u003e This measure of crystallinity by NMR and diffraction has usually been significantly different, although the general trends between samples tend to be consistent.\\u003csup\\u003e\\u003cspan citationid=\\\"CR38\\\" class=\\\"CitationRef\\\"\\u003e38\\u003c/span\\u003e\\u003c/sup\\u003e The traditional CI as measured by NMR is a misnomer and does not measure crystallinity. It will be influenced by the surface to core ratio, as well as the presence of any amorphous material.\\u003c/p\\u003e \\u003cp\\u003e \\u003c/p\\u003e \\u003cp\\u003eThe better estimates of glucose residue environment proportions provide new constraints on plant cellulose fibril size and habit. Previous studies have used the ratio of C4\\u003csup\\u003e1\\u003c/sup\\u003e and C4\\u003csup\\u003e2\\u003c/sup\\u003e from 1D NMR spectra as a measure of microfibril size, which has resulted in variable estimations such as 24 chains in a fibril or in fibril bundling models.\\u003csup\\u003e\\u003cspan citationid=\\\"CR9\\\" class=\\\"CitationRef\\\"\\u003e9\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e\\u003c/sup\\u003e As we now know domain 1 is not solely interior sites, thus the ratio of C4\\u003csup\\u003e1\\u003c/sup\\u003e and C4\\u003csup\\u003e2\\u003c/sup\\u003e will give an overestimate of the interior of the fibril and an overestimate of the fibril size. The ratio of the two domains could still be utilised for characterisation as it is still loosely related to the volume of the fibril, but care is needed in using this interpretation. In this work, we have provided a more accurate measure of the interior to surface ratio for the microfibril of poplar wood which was found to be ~\\u0026thinsp;0.5. We also provide new constraints on the proportions of the environments in the fibril, namely a and c origin and centre chains are similar in proportion, and surface environment b is ~\\u0026thinsp;0.7 of the amount of a or c chain environments (since the number of b is likely an integer, this ratio corresponds to 2/3 or 3/4). Given the further constraints on the fibril from the dimensions from biophysical measurements that limit the size to 18\\u0026ndash;24 chains, there are very few habits that fulfil these criteria. The core to surface value is consistent, for example, with an 18-chain fibril only in a habit that has 6 interior chains and 12 surface chains, or a 24 chain fibril with 8 interior and 12 surface chains. A schematic diagram of two potential 18 chain habits 234432 and 34443 that have the observed interior to surface ratio is shown in Fig.\\u0026nbsp;\\u003cspan refid=\\\"Fig9\\\" class=\\\"InternalRef\\\"\\u003e9\\u003c/span\\u003e, and an alternative 333333 habit can be excluded. These habits fulfil our updated estimate of interior to core chain ratio, but it is difficult to see how the 34443 could have similar amounts of a and c origin and centre sheets. Currently therefore, 234432 is the habit currently that best fits the data. The previously proposed concepts of multi-layered fibril environments\\u003csup\\u003e\\u003cspan citationid=\\\"CR60\\\" class=\\\"CitationRef\\\"\\u003e60\\u003c/span\\u003e\\u003c/sup\\u003e or bundling\\u003csup\\u003e\\u003cspan citationid=\\\"CR37\\\" class=\\\"CitationRef\\\"\\u003e37\\u003c/span\\u003e\\u003c/sup\\u003e to generate additional interior chains are not required to explain the data.\\u003c/p\\u003e \\u003cp\\u003eWhilst we have measured the relative amount of surface site b compared to core chain sites a and c for isolated fibrils from poplar wood, these values are likely to vary with any changes in the microfibril size, habit, and hemicellulose interactions in different plants. Hence the relative proportions of the six major glucose residue environments will be an important diagnostic tool for discovering cellulose variability, studying cellulose interactions, and for detecting changes during degradation or industrial processing of wood and other biomass. This work now provides a strong basis for understanding cellulose fibril surfaces as we discovered there are only four major types of surface residues. Understanding the origin of these NMR environments will be very instrumental in defining the hydrophobic and hydrophilic surfaces of fibrils. The exciting prospect now arises of a more complete description and understanding of cellulose fibrils surfaces and their interactions in biomaterials and in the plant cell wall.\\u003c/p\\u003e\"},{\"header\":\"Methods\",\"content\":\"\\u003cdiv id=\\\"Sec8\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eSample Production and Preparation\\u003c/h2\\u003e \\u003cp\\u003eThe poplar stem \\u003cem\\u003ePopulus tremula\\u003c/em\\u003e \\u0026times; \\u003cem\\u003etremuloides\\u003c/em\\u003e used in this work was grown from poplar shoot cuttings which were allowed to root for 2 weeks before being transferred into the growth chamber. Here the saplings were grown in a \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eCO\\u003csub\\u003e2\\u003c/sub\\u003e atmosphere\\u003csup\\u003e\\u003cspan citationid=\\\"CR31\\\" class=\\\"CitationRef\\\"\\u003e31\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR65\\\" class=\\\"CitationRef\\\"\\u003e65\\u003c/span\\u003e\\u003c/sup\\u003e for ~\\u0026thinsp;3 months to provide\\u0026thinsp;~\\u0026thinsp;97% \\u003csup\\u003e13\\u003c/sup\\u003eC labelled material that is predominantly secondary cell wall. Only wood of lower (woody) never-dried stems was used after removing bark and cambium. The growth of spruce and \\u003cem\\u003eBrachypodium\\u003c/em\\u003e was as described in Terrett et al. and Tryfona et al. respectively.\\u003csup\\u003e\\u003cspan citationid=\\\"CR44\\\" class=\\\"CitationRef\\\"\\u003e44\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR49\\\" class=\\\"CitationRef\\\"\\u003e49\\u003c/span\\u003e\\u003c/sup\\u003e\\u003c/p\\u003e \\u003cp\\u003eTo prepare holocellulose nanofibrils (hCNFs), the wood was first delignified four times with 3% peracetic acid (PAA; 0.35 g pure PAA/g dry material, pH 4.8) at 85\\u0026deg;C for 45 minutes without stirring. Between cycles, used PAA was decanted, one water wash was done, and new 3% PAA was added. Delignified material (holocellulose) was washed with water after the fourth cycle until the conductivity was below 10 S/cm. The holocellulose was blended for two minutes to form a homogenous slurry. Holocellulose dispersion (0.1 wt%) was blended for 30 minutes in a Vitamix A3500i blender to make a polydispersion. hCNFs were recovered as supernatant from the polydispersion by centrifugation at 5000 rpm for 15 minutes. The hCNFs suspensions were digested with xylanases as described previously.\\u003csup\\u003e\\u003cspan citationid=\\\"CR48\\\" class=\\\"CitationRef\\\"\\u003e48\\u003c/span\\u003e\\u003c/sup\\u003e The xylanase treated hCNFs were then freeze-dried and rewetted before being packed directly into the NMR rotor.\\u003c/p\\u003e \\u003cp\\u003eThe never dried \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC enriched poplar wood was debarked and cut into small pieces of ~\\u0026thinsp;1\\u0026ndash;2 mm size with a razor blade and then packed into a 3.2 mm MAS zirconia NMR rotor whilst removing excess water.\\u003c/p\\u003e \\u003c/div\\u003e \\u003cdiv id=\\\"Sec9\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eTEM\\u003c/h2\\u003e \\u003cp\\u003eA ThermoFisher Scientific (FEI) TalosF200X G2 microscope operating in the scanning mode at 200 kV was used to obtain the TEM images.\\u003c/p\\u003e \\u003c/div\\u003e \\u003cdiv id=\\\"Sec10\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eSolid-state NMR\\u003c/h2\\u003e \\u003cp\\u003eSolid-state NMR experiments of poplar wood and the xylanase treated hCNFs were acquired on a Bruker 1GHz AVANCE NEO solid-state NMR spectrometer operating at \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH and \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC Larmor frequencies of 1000.4 and 251.6 MHz respectively using a 3.2 mm E\\u003csup\\u003eFree\\u003c/sup\\u003e triple resonance MAS probe. All experiments were conducted at an indicated temperature of 10\\u0026deg;C and an MAS frequency of 12.5 kHz with a recycle delay of 2 s. The \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC chemical shifts were determined using the carbonyl peak at 177.8 ppm of L-alanine as an external reference with respect to tetramethylsilane. This referencing was confirmed to correspond with referencing to the adamantane CH\\u003csub\\u003e2\\u003c/sub\\u003e peak to 38.48 ppm\\u003csup\\u003e\\u003cspan citationid=\\\"CR66\\\" class=\\\"CitationRef\\\"\\u003e66\\u003c/span\\u003e\\u003c/sup\\u003e to ensure direct comparison with Brouwer and Mikolajewski.\\u003csup\\u003e\\u003cspan citationid=\\\"CR21\\\" class=\\\"CitationRef\\\"\\u003e21\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR53\\\" class=\\\"CitationRef\\\"\\u003e53\\u003c/span\\u003e\\u003c/sup\\u003e The 90\\u0026deg; pulse lengths were typically 3.2 \\u0026micro;s (\\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH) and 4.0 \\u0026micro;s (\\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC). Cross polarization from \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH to \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC was achieved using ramped (70\\u0026ndash;100%) \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH radiofrequency amplitude and a contact time of 1 ms\\u003csup\\u003e\\u003cspan citationid=\\\"CR67\\\" class=\\\"CitationRef\\\"\\u003e67\\u003c/span\\u003e\\u003c/sup\\u003e. SPINAL-64 decoupling was applied at a \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH nutation frequency of 70\\u0026ndash;80 kHz during acquisition.\\u003csup\\u003e\\u003cspan citationid=\\\"CR68\\\" class=\\\"CitationRef\\\"\\u003e68\\u003c/span\\u003e\\u003c/sup\\u003e Sign discrimination in the indirect dimension of the 2D experiments was achieved using the States-TPPI method. For the principal assignments of cellulose environments, a CP refocused INADEQUATE \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC double-quantum (DQ) \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC single-quantum (SQ) correlation experiment was used whereby the evolution of DQ coherence for directly bonded carbons within the same glucan ring is correlated with directly observed SQ coherence.\\u003csup\\u003e\\u003cspan citationid=\\\"CR69\\\" class=\\\"CitationRef\\\"\\u003e69\\u003c/span\\u003e,\\u003cspan citationid=\\\"CR70\\\" class=\\\"CitationRef\\\"\\u003e70\\u003c/span\\u003e\\u003c/sup\\u003e The acquisition time in the indirect dimension was 6.67 ms with a spectral width of 37.5 kHz and 128 acquisitions per \\u003cem\\u003et\\u003c/em\\u003e\\u003csub\\u003e1\\u003c/sub\\u003e FID. The echo duration, τ, was 2.24 ms giving a total echo time of 8.96 ms. Both intra- and intermolecular contacts were probed using 2D \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC-\\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC proton driven spin diffusion (PDSD) experiments with mixing times of 30 to 400 ms.\\u003csup\\u003e\\u003cspan citationid=\\\"CR71\\\" class=\\\"CitationRef\\\"\\u003e71\\u003c/span\\u003e\\u003c/sup\\u003e The acquisition time in the indirect dimension (\\u003cem\\u003et\\u003c/em\\u003e\\u003csub\\u003e1\\u003c/sub\\u003e) of the CP PDSD experiments was 5.5\\u0026ndash;8.1 ms. The spectral width in the indirect dimension was 37.5 kHz with at least 64 acquisitions per \\u003cem\\u003et\\u003c/em\\u003e\\u003csub\\u003e1\\u003c/sub\\u003e FID. The proximity of water to different cellulose environments was probed using a water-edited \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC-\\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC 30 ms CP PDSD experiment. This is based on the normal CP PDSD experiment, however before the CP contact time there is a \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH T\\u003csub\\u003e2\\u003c/sub\\u003e filter to remove signal arising from directly attached protons followed by a delay to allow for diffusion from the water protons to its near neighbours.\\u003csup\\u003e\\u003cspan citationid=\\\"CR72\\\" class=\\\"CitationRef\\\"\\u003e72\\u003c/span\\u003e\\u003c/sup\\u003e The CP parameters were as stated above. The acquisition time in the indirect dimension was 4.8 ms with a spectral width of 37.5 kHz and 576 acquisitions per \\u003cem\\u003et\\u003c/em\\u003e\\u003csub\\u003e1\\u003c/sub\\u003e FID. The total \\u003csup\\u003e\\u003cspan citationid=\\\"CR1\\\" class=\\\"CitationRef\\\"\\u003e1\\u003c/span\\u003e\\u003c/sup\\u003eH T\\u003csub\\u003e2\\u003c/sub\\u003e filter was 320 \\u0026micro;s followed by a diffusion delay of 2 ms. All 2D spectra were processed with Fourier transformation into 8 K (F\\u003csub\\u003e2\\u003c/sub\\u003e) \\u0026times; 2 K (F\\u003csub\\u003e1\\u003c/sub\\u003e) points with exponential line broadening of 20\\u0026ndash;50 Hz in F\\u003csub\\u003e2\\u003c/sub\\u003e and cubed sine bell processing in F\\u003csub\\u003e1\\u003c/sub\\u003e using Bruker Topspin v.3.6. Contour levels are x 1.1 throughout. The minimum contour is chosen to show the desired features.\\u003c/p\\u003e \\u003c/div\\u003e \"},{\"header\":\"Declarations\",\"content\":\"\\u003cdiv id=\\\"Sec11\\\" class=\\\"Section2\\\"\\u003e \\u003ch2\\u003eData availability\\u003c/h2\\u003e \\u003cp\\u003eUnprocessed NMR data files will be available from \\u003cspan class=\\\"ExternalRef\\\"\\u003e\\u003cspan class=\\\"RefSource\\\"\\u003ehttp://wrap.warwick.ac.uk\\u003c/span\\u003e\\u003cspan address=\\\"http://wrap.warwick.ac.uk\\\" targettype=\\\"URL\\\" class=\\\"RefTarget\\\"\\u003e\\u003c/span\\u003e\\u003c/span\\u003e\\u003c/p\\u003e \\u003c/div\\u003e\\u003ch2\\u003eAuthor contributions\\u003c/h2\\u003e \\u003cp\\u003eRC and RD conducted the NMR experiments and analysis. PKD prepared the hCNFs. YY generated models of cellulose fibril habits for conceptual analysis of glucose environments. PD and RD conceived and supervised the work. SJB contributed to experimental method development and interpretation. RC, RD and PD wrote the paper with comments from all authors.\\u003c/p\\u003e\\u003ch2\\u003eAcknowledgements\\u003c/h2\\u003e \\u003cp\\u003eMany thanks to Eva Hellmann from the Sainsbury lab at University of Cambridge for donating the poplar saplings that were used to grow the \\u003csup\\u003e13\\u003c/sup\\u003eC material used in this work. Many thanks to Theodora Tryfona for providing the mature leaves of \\u003cem\\u003eBrachypodium\\u003c/em\\u003e as well as Alberto Echevarria Poza for all the help with growing and harvesting the \\u003csup\\u003e13\\u003c/sup\\u003eC labelled plants from the growth chamber, and Trent Franks for technical assistance. We wish to thank Mike Jarvis for giving us the \\u003csup\\u003e13\\u003c/sup\\u003eC NMR chemical shifts of his annealed flax and celery and for a draft version of his review\\u003csup\\u003e42\\u003c/sup\\u003e.. The UK High-Field Solid-State NMR Facility used in this research was funded by EPSRC and BBSRC (EP/T015063/1), as well as, for the 1 GHz instrument, EP/R029946/1. The work of RC, RD and PD was supported by the UKRI grant underwriting the ERC advanced grant EVOCATE, Function and evolution of plant cell wall architecture for sustainable technologies EP/X027120/1. PKD was supported by EPSRC grant Bio-derived and Bio-inspired Advanced Materials for Sustainable Industries (VALUED), EP/W031019/1. The work of YY generating models of cellulose fibril habits for conceptual analysis of glucose environments was supported as part of The Center for Lignocellulose Structure and Formation, an Energy Frontier Research Center funded by the US Department of Energy, Office of Science, Basic Energy Sciences, under Award number DE-SC0001090.\\u003c/p\\u003e\\n\\u003ch2\\u003eCompeting interests.\\u003c/h2\\u003e\\n\\u003cp\\u003eThe authors declare no competing interests\\u003c/p\\u003e\"},{\"header\":\"References\",\"content\":\"\\u003col\\u003e\\u003cli\\u003e\\u003cspan\\u003eField CB, Behrenfeld MJ, Randerson JT, Falkowski P (1998) Primary Production of the Biosphere: Integrating Terrestrial and Oceanic Components. Sci (1979) 281:237\\u0026ndash;240\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eGibson LJ (2012) The hierarchical structure and mechanics of plant materials. J R Soc Interface 9:2749\\u0026ndash;2766\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePauly M, Keegstra K (2008) Cell-wall carbohydrates and their modification as a resource for biofuels. Plant J 54:559\\u0026ndash;568\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eForoughi F, Ghomi ER, Dehaghi FM, Borayek R, Ramakrishna S (2021) A review on the life cycle assessment of cellulose: From properties to the potential of making it a low carbon material. Materials 14:714\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eIglesias MC, Gomez-Maldonado D, Via BK, Jiang Z, Peresin MS (2020) Pulping processes and their effects on cellulose fibers and nanofibrillated cellulose properties: A review. Prod J 70:10\\u0026ndash;21\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKlemm D, Heublein B, Fink HP, Bohn A, Cellulose (2005) Fascinating biopolymer and sustainable raw material. Angewandte Chemie - Int Ed 44:3358\\u0026ndash;3393\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eParthasarathi R et al (2011) Insights into hydrogen bonding and stacking interactions in cellulose. J Phys Chem A 115:14191\\u0026ndash;14202\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKubicki JD et al (2018) The Shape of Native Plant Cellulose Microfibrils. Sci Rep 8:13983\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eTai HC et al (2023) Wood cellulose microfibrils have a 24-chain core\\u0026ndash;shell nanostructure in seed plants. Nat Plants 9:1154\\u0026ndash;1168\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eSong B, Zhao S, Shen W, Collings C, Ding SY (2020) Direct Measurement of Plant Cellulose Microfibril and Bundles in Native Cell Walls. Front Plant Sci 11:479\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePenttil\\u0026auml; PA, Paajanen A (2024) Critical comment on the assumptions leading to 24-chain microfibrils in wood. Nat Plants 10:1064\\u0026ndash;1066\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eJarvis MC (2013) Cellulose biosynthesis: Counting the chains. Plant Physiol 163:1485\\u0026ndash;1486\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePurushotham P et al (2016) A single heterologously expressed plant cellulose synthase isoform is sufficient for cellulose microfibril formation in vitro. PNAS 113:11360\\u0026ndash;11365\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNixon BT et al (2016) Comparative Structural and Computational Analysis Supports Eighteen Cellulose Synthases in the Plant Cellulose Synthesis Complex. Sci Rep 6:28696\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eCosgrove DJ et al (2024) How many glucan chains form plant cellulose microfibrils? \\u003cem\\u003eBiomacromolecules\\u003c/em\\u003e in press\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eYang H, Kubicki JD (2020) A density functional theory study on the shape of the primary cellulose microfibril in plants: effects of C6 exocyclic group conformation and H-bonding. Cellulose 27:2389\\u0026ndash;2402\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNishiyama Y, Langan P, Chanzy H (2002) Crystal structure and hydrogen-bonding system in cellulose Iβ from synchrotron X-ray and neutron fiber diffraction. J Am Chem Soc 124:9074\\u0026ndash;9082\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNishiyama Y, Sugiyama J, Chanzy H, Langan P (2003) Crystal Structure and Hydrogen Bonding System in Cellulose Iα from Synchrotron X-ray and Neutron Fiber Diffraction. J Am Chem Soc 125:14300\\u0026ndash;14306\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKono H et al (2002) CP/MAS \\u003csup\\u003e13\\u003c/sup\\u003eC NMR study of cellulose and cellulose derivatives. 1. Complete assignment of the CP/MAS \\u003csup\\u003e13\\u003c/sup\\u003eC NMR spectrum of the native cellulose. J Am Chem Soc 124:7506\\u0026ndash;7511\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKono H, Erata T, Takai M (2003) Determination of the through-bond carbon-carbon and carbon-proton connectivities of the native celluloses in the solid state. Macromolecules 36:5131\\u0026ndash;5138\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eBrouwer DH, Mikolajewski JG (2023) Resolving the discrepancies in reported \\u003csup\\u003e13\\u003c/sup\\u003eC solid state NMR chemical shifts for native celluloses. Cellulose 30:4827\\u0026ndash;4839\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eThomas LH et al (2015) Diffraction evidence for the structure of cellulose microfibrils in bamboo, a model for grass and cereal celluloses. BMC Plant Biol 15:158\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNewman RH, Hill SJ, Harris PJ (2013) Wide-angle x-ray scattering and solid-state nuclear magnetic resonance data combined to test models for cellulose microfibrils in mung bean cell walls. Plant Physiol 163:1558\\u0026ndash;1567\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eŠturcova A, His I, Apperley DC, Sugiyama J, Jarvis MC (2004) Structural details of crystalline cellulose from higher plants. Biomacromolecules 5:1333\\u0026ndash;1339\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eThomas LH, Altaner CM, Jarvis MC (2013) Identifying multiple forms of lateral disorder in cellulose fibres. J Appl Crystallogr 46:972\\u0026ndash;979\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eThomas LH et al (2014) Structure and spacing of cellulose microfibrils in woody cell walls of dicots. Cellulose 21:3887\\u0026ndash;3895\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eDeslandes Y, Hamer GK, Marchessault RH (1983) Application of High-Resolution Solid-State NMR with Cross-Polarization/ Magic-Angle Spinning (CP/MAS) Techniques to Cellulose Chemistry. J Macromolecular Sci Part C 23:187\\u0026ndash;216\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eAtalla RH, Vanderhart DL (1984) Native Cellulose: A Composite of Two Distinct Crystalline Forms. Science 223:283\\u0026ndash;285\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNewman RH (1998) Evidence for Assignment of 13 C NMR Signals to Cellulose Crystallite Surfaces in Wood, Pulp and Isolated Celluloses. Holzforschung 52:157\\u0026ndash;159\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eNewman RH (1999) Estimation of the lateral dimensions of cellulose crystallites using \\u003csup\\u003e13\\u003c/sup\\u003eC NMR signal strengths. Solid State Nucl Magn Reson 15:21\\u0026ndash;29\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eSimmons TJ et al (2016) Folding of xylan onto cellulose fibrils in plant cell walls revealed by solid-state NMR. Nat Commun 7:13902\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eDupree R et al (2015) Probing the molecular architecture of \\u003cem\\u003eArabidopsis thaliana\\u003c/em\\u003e secondary cell walls using two- and three-dimensional \\u003csup\\u003e13\\u003c/sup\\u003eC solid state nuclear magnetic resonance spectroscopy. Biochemistry 54:2335\\u0026ndash;2345\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eAddison B et al (2024) Atomistic, macromolecular model of the Populus secondary cell wall informed by solid-state NMR. Sci Adv 10:7965\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKirui A et al (2019) Atomic resolution of cotton cellulose structure enabled by dynamic nuclear polarization solid-state NMR. Cellulose 26:329\\u0026ndash;339\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eVi\\u0026euml;tor RJ, Newman RH, Ha MA, Apperley DC, Jarvis MC (2002) Conformational features of crystal-surface cellulose from higher plants. Plant J 30:721\\u0026ndash;731\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eTeeyaer RE, Zhbankov RG, Shipina OT, Marchenko GN, Zabelin LV (1986) NMR Spectroscopy of Cellulose and its Derivatives with Magic Angle Spinning of the sample (Review). Zh Prikl Spektrosk 45:535\\u0026ndash;554\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKirui A et al (2022) Carbohydrate-aromatic interface and molecular architecture of lignocellulose. Nat Commun 13:538\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePark S, Baker JO, Himmel ME, Parilla PA, Johnson DK (2010) Cellulose crystallinity index: measurement techniques and their impact on interpreting cellulase performance. Biotechnol Biofuels 3:10\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eFernandes AN et al (2011) Nanostructure of cellulose microfibrils in spruce wood. PNAS 108:1195\\u0026ndash;1203\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePhyo P, Wang T, Yang Y, O\\u0026rsquo;Neill H, Hong M (2018) Direct Determination of Hydroxymethyl Conformations of Plant Cell Wall Cellulose Using \\u003csup\\u003e1\\u003c/sup\\u003eH Polarization Transfer Solid-State NMR. \\u003cem\\u003eBiomacromolecules\\u003c/em\\u003e 19, 1485\\u0026ndash;1497\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eYang H et al (2018) Structural factors affecting \\u003csup\\u003e13\\u003c/sup\\u003eC NMR chemical shifts of cellulose: a computational study. Cellulose 25:23\\u0026ndash;36\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eJarvis MC (2023) NMR Studies on Cellulose and Plant Cell walls. Plant Cell Walls Research Milestones and Conceptual Insights, vol 1. CRC\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eWang T, Zabotina O, Hong M (2012) Pectin-cellulose interactions in the arabidopsis primary cell wall from two-dimensional magic-angle-spinning solid-state nuclear magnetic resonance. Biochemistry 51:9846\\u0026ndash;9856\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eTerrett OM et al (2019) Molecular architecture of softwood revealed by solid-state NMR. Nat Commun 10:4978\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eCresswell R et al (2021) Importance of Water in Maintaining Softwood Secondary Cell Wall Nanostructure. Biomacromolecules 22:4669\\u0026ndash;4680\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eWang T, Yang H, Kubicki JD, Hong M (2016) Cellulose Structural Polymorphism in Plant Primary Cell Walls Investigated by High-Field 2D Solid-State NMR Spectroscopy and Density Functional Theory Calculations. Biomacromolecules 17:2210\\u0026ndash;2222\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eKang X et al (2019) Lignin-polysaccharide interactions in plant secondary cell walls revealed by solid-state NMR. Nat Commun 10:347\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eDeralia PK et al (2024) Preserving native cellulose structure in isolated holocellulose nanofibrils and controlling their properties by tuning xylan structure for biomaterials. In preparation\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eTryfona T et al (2024) Altering the substitution and cross-linking of glucuronoarabinoxylans affects cell wall architecture in Brachypodium distachyon. New Phytol 242:524\\u0026ndash;543\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eBourdon M et al (2023) Ectopic callose deposition into woody biomass modulates the nano-architecture of macrofibrils. Nat Plants 9:1530\\u0026ndash;1546\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eYamamoto H, Horii F (1993) CP/MAS \\u003csup\\u003e13\\u003c/sup\\u003eC NMR Analysis of the Crystal Transformation Induced for Valonia Cellulose by Annealing at High Temperatures. Macromolecules 26:1313\\u0026ndash;1317\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eLarsson T, Hult E-L, Wickholm K, Pettersson E, Iversen T (1999) CPMAS \\u003csup\\u003e13\\u003c/sup\\u003eC-NMR spectroscopy applied to structure and interaction studies on cellulose I. Solid State Nucl Magn Reson 15:31\\u0026ndash;40\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eBrouwer DH, Mikolajewski JG (2023) Solid-state NMR reveals a structural variation from the Iβ polymorphic form in bacterial cellulose. Cellulose 30:11341\\u0026ndash;11356\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eAtalla RH, Vanderhart DL (1999) The role of solid state \\u003csup\\u003e13\\u003c/sup\\u003eC NMR spectroscopy in studies of the nature of native celluloses. Solid State Nucl Magn Reson 15:1\\u0026ndash;19\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eDriemeier C, Francisco LH (2014) X-ray diffraction from faulted cellulose I constructed with mixed Iα-Iβ stacking. Cellulose 21:3161\\u0026ndash;3169\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eHanley SJ, Revol J-F\\u0026Euml;O, Godbout L, Gray DG (1997) Atomic force microscopy and transmission electron microscopy of cellulose from Micrasterias denticulata; evidence for a chiral helical microfibril twist. Cellulose 4:209\\u0026ndash;220\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eRehman Z et al (2024) Organic NMR Crystallography: Enabling Progress for Applications to Pharmaceuticals and Plant Cell Walls. Faraday Discuss Accepted Manuscr\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eXue Y, Li H, Kang X (2024) Molecular unraveling of polysaccharide digestion in wood-feeding termites: A solid-state NMR perspective. Carbohydr Polym 331:121843\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eSmith BG, Harris PJ, Melton LD, Newman RH (1998) Crystalline Cellulose in Hydrated Primary Cell Walls of Three Monocotyledons and One Dicotyledon. Plant Cell Physiol 39:711\\u0026ndash;720\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eLarsson T, Wickholm K, Iversen T (1997) A CP/MAS \\u003csup\\u003e13\\u003c/sup\\u003eC NMR Investigation of Molecular Ordering in Celluloses \\u003cem\\u003eCarbohydrate Research\\u003c/em\\u003e. 302:19\\u0026ndash;25\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eBryce DL (2017) NMR crystallography: Structure and properties of materials from solid-state nuclear magnetic resonance observables. IUCrJ 4:350\\u0026ndash;359\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003ePark S, Johnson DK, Ishizawa CI, Parilla PA, Davis MF (2009) Measuring the crystallinity index of cellulose by solid state \\u003csup\\u003e13\\u003c/sup\\u003eC nuclear magnetic resonance. Cellulose 16:641\\u0026ndash;647\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eWickholm K, Larsson T, Iversen T (1998) Assignment of non-crystalline forms in cellulose I by CP/MAS 13 C NMR spectroscopy. Carbohydr Res 312:123\\u0026ndash;129\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eChang JKW, Duret X, Berberi V, Zahedi-Niaki H, Lavoie JM (2018) Two-step thermochemical cellulose hydrolysis with partial neutralization for glucose production. Front Chem 6:117\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eChen WP et al (2011) An automated growth enclosure for metabolic labeling of Arabidopsis thaliana with \\u003csup\\u003e13\\u003c/sup\\u003eC-carbon dioxide - an in vivo labeling system for proteomics and metabolomics research. Proteome Sci 9:9\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eMorcombe CR, Zilm KW (2003) Chemical shift referencing in MAS solid state NMR. J Magn Reson 162:479\\u0026ndash;486\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eGunther M, Xiaoling W (1994) Smith Steven O. Ramped-Amplitude Cross polarization in Magic-angle-spinning NMR. J Magn Reson A 110:219\\u0026ndash;227\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eFung BM, Khitrin AK, Ermolaev K (2000) An Improved Broadband Decoupling Sequence for Liquid Crystals and Solids. J Magn Reson 142:97\\u0026ndash;101\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eFayon F et al (2005) Through-space contributions to two-dimensional double-quantum J correlation NMR spectra of magic-angle-spinning solids. J Chem Phys 122:194313\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eLesage A, Bardet M, Emsley L (1999) Through-bond carbon-carbon connectivities in disordered solids by NMR. J Am Chem Soc 121:10987\\u0026ndash;10993\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eManolikas T, Herrmann T, Meier BH (2008) Protein structure determination from \\u003csup\\u003e13\\u003c/sup\\u003eC spin-diffusion solid-state NMR spectroscopy. J Am Chem Soc 130:3959\\u0026ndash;3966\\u003c/span\\u003e\\u003c/li\\u003e \\u003cli\\u003e\\u003cspan\\u003eAder C et al (2009) Structural rearrangements of membrane proteins probed by water-edited solid-state NMR spectroscopy. J Am Chem Soc 131:170\\u0026ndash;176\\u003c/span\\u003e\\u003c/li\\u003e\\u003c/ol\\u003e\"}],\"fulltextSource\":\"\",\"fullText\":\"\",\"funders\":[],\"hasAdminPriorityOnWorkflow\":false,\"hasManuscriptDocX\":true,\"hasOptedInToPreprint\":true,\"hasPassedJournalQc\":\"\",\"hasAnyPriority\":true,\"hideJournal\":true,\"highlight\":\"\",\"institution\":\"\",\"isAcceptedByJournal\":false,\"isAuthorSuppliedPdf\":false,\"isDeskRejected\":\"\",\"isHiddenFromSearch\":false,\"isInQc\":false,\"isInWorkflow\":false,\"isPdf\":false,\"isPdfUpToDate\":true,\"isWithdrawnOrRetracted\":false,\"journal\":{\"display\":true,\"email\":\"info@researchsquare.com\",\"identity\":\"researchsquare\",\"isNatureJournal\":false,\"hasQc\":true,\"allowDirectSubmit\":true,\"externalIdentity\":\"\",\"sideBox\":\"\",\"snPcode\":\"\",\"submissionUrl\":\"/submission\",\"title\":\"Research Square\",\"twitterHandle\":\"researchsquare\",\"acdcEnabled\":true,\"dfaEnabled\":false,\"editorialSystem\":\"\",\"reportingPortfolio\":\"\",\"inReviewEnabled\":false,\"inReviewRevisionsEnabled\":true},\"keywords\":\"\",\"lastPublishedDoi\":\"10.21203/rs.3.rs-4970084/v1\",\"lastPublishedDoiUrl\":\"https://doi.org/10.21203/rs.3.rs-4970084/v1\",\"license\":{\"name\":\"CC BY 4.0\",\"url\":\"https://creativecommons.org/licenses/by/4.0/\"},\"manuscriptAbstract\":\"\\u003cp\\u003eThe structure of native plant cellulose, despite its abundance and utility in industry, remains elusive. The cellulose structure of several species was studied using 2D solid-state Nuclear Magnetic Resonance (NMR) of \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC labelled plants. Six major glucose environments were resolved which are common to the cellulose of poplar wood, spruce wood and grasses. The cellulose structure was maintained in isolated holo-cellulose nanofibrils, allowing more detailed characterisation. There are just two glucose environments within the fibril core which have the same NMR \\u003csup\\u003e\\u003cspan citationid=\\\"CR13\\\" class=\\\"CitationRef\\\"\\u003e13\\u003c/span\\u003e\\u003c/sup\\u003eC chemical shifts as tunicate cellulose Iβ. The third major glucose site with a carbon 4 shift near 89 ppm, previously assigned to the fibril interior, is one of four surface glucose environments. These advances allowed us to obtain a more accurate measure of the interior to surface ratio for poplar wood fibrils of 0.5, consistent with an 18 chain microfibril structure having 6 core and 12 surface chains.\\u003c/p\\u003e\",\"manuscriptTitle\":\"New insights into the structure of cellulose in plant cell walls\",\"msid\":\"\",\"msnumber\":\"\",\"nonDraftVersions\":[{\"code\":1,\"date\":\"2024-08-30 04:00:23\",\"doi\":\"10.21203/rs.3.rs-4970084/v1\",\"editorialEvents\":[{\"type\":\"communityComments\",\"content\":0}],\"status\":\"published\",\"journal\":{\"display\":true,\"email\":\"info@researchsquare.com\",\"identity\":\"researchsquare\",\"isNatureJournal\":false,\"hasQc\":true,\"allowDirectSubmit\":true,\"externalIdentity\":\"\",\"sideBox\":\"\",\"snPcode\":\"\",\"submissionUrl\":\"/submission\",\"title\":\"Research Square\",\"twitterHandle\":\"researchsquare\",\"acdcEnabled\":true,\"dfaEnabled\":false,\"editorialSystem\":\"\",\"reportingPortfolio\":\"\",\"inReviewEnabled\":false,\"inReviewRevisionsEnabled\":true}}],\"origin\":\"\",\"ownerIdentity\":\"85bf5771-a18b-4efa-8671-03d72494f5b1\",\"owner\":[],\"postedDate\":\"August 30th, 2024\",\"published\":true,\"recentEditorialEvents\":[],\"rejectedJournal\":[],\"revision\":\"\",\"amendment\":\"\",\"status\":\"posted\",\"subjectAreas\":[{\"id\":36778080,\"name\":\"Biological sciences/Plant sciences/Plant molecular biology\"},{\"id\":36778081,\"name\":\"Biological sciences/Biochemistry/Structural biology/NMR spectroscopy/Solid-state NMR\"},{\"id\":36778082,\"name\":\"Biological sciences/Structural biology/NMR spectroscopy/Solid-state NMR\"}],\"tags\":[],\"updatedAt\":\"2024-10-05T11:40:11+00:00\",\"versionOfRecord\":[],\"versionCreatedAt\":\"2024-08-30 04:00:23\",\"video\":\"\",\"vorDoi\":\"\",\"vorDoiUrl\":\"\",\"workflowStages\":[]},\"version\":\"v1\",\"identity\":\"rs-4970084\",\"journalConfig\":\"researchsquare\"},\"__N_SSP\":true},\"page\":\"/article/[identity]/[[...version]]\",\"query\":{\"redirect\":\"/article/rs-4970084\",\"identity\":\"rs-4970084\",\"version\":[\"v1\"]},\"buildId\":\"8U1c8b4HqxoKbykW_rLl7\",\"isFallback\":false,\"isExperimentalCompile\":false,\"dynamicIds\":[84888],\"gssp\":true,\"scriptLoader\":[]}","source_license":"CC-BY-4.0","license_restricted":false}