Spectroscopic assessment of flavor-related chemical compounds in fresh tea shoots using deep learning

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 62,381 characters · extracted from oa-pdf · 6 sections · click to expand

Abstract

9 This study employs a deep-learning method, Y-Net, to estimate 10 tea flavor-related chemical 10 compounds (TFCC), including gallic acid, caffeine and eight catechin isomers, using fresh tea 11 shoot reflectance and transmittance. The unique aspect of Y-Net lies in its utilization of dual 12 inputs, reflectance and transmittance, which are seamlessly integrated within the Y-Net 13 architecture. This architecture harnesses the power of a convolutional neural network-based 14 residual network to fuse tea shoot spectra effectively. This strategic combination enhances the 15 capacity of the model to discern intricate patterns in the optical characteristics of fresh tea shoots, 16 providing a comprehensive framework for TFCC estimation. In this study, we destructively 17 sampled tea shoots from tea farms in Alishan (Ali-Mountain) in Central Taiwan within the 18 elevation range of 879–1552 m a.s.l. Tea shoot reflectance and transmittance data (n = 2032) 19 within the optical region (400–2500 nm) were measured using a portable spectroradiometer and 20 pre-processed using an algorithm; corresponding TFCC were qualified using the high-21 performance liquid chromatography analysis. To enhance the robustness and performance of Y-22 Net, we employed data augmentation techniques for model training. We compared the 23 performances of Y-Net and seven other commonly utilized statistical, machine-/deep-learning 24 models (partial least squared regression, Gaussian process, cubist, random forests and three 25 feedforward neural networks) using root-mean-square error (RMSE). Furthermore, we assessed 26 the prediction accuracies of Y-Net and Y-Net using spectra within the visible and near-infrared 27 (VNIR) regions (for higher energy throughput and low-cost instruments) and reflectance only 28 (for airborne and spaceborne remote sensing applications). The results showed that overall Y-Net 29 (mean RMSE ± standard deviation [SD] = 2.51 ± 2.20 mg g-1) outperformed the other statistical, 30 machine- and deep-learning models (≥ 2.59 ± 2.64 mg g-1), demonstrating its superiority in 31 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 3 predicting TFCC. In addition, this original Y-Net also yielded slightly lower mean RMSE (± SD) 32 compared with VNIR (2.76 ± 2.41 mg g-1) and reflectance-only (2.68 ± 2.74 mg g-1) Y-Nets 33 using validation data. This study highlights the feasibility of using spectroscopy and Y-Net to 34 assess minor biochemical components in fresh tea shoots and sheds light on the potential of the 35 proposed approach for effective regional monitoring of tea shoot quality. 36

Keywords

caffeine, Camellia sinensis, catechin, deep learning, gallic acid, machine learning, 37 neural network, oolong tea, reflectance, transmittance, Y-Net 38 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 4 1. Introduction 39 Plant metabolism collectively produces many metabolites, crucial in resisting biotic stress and 40 adapting to abiotic pressure. These metabolites also serve as invaluable resources for human 41 health and survival [1-6]. Tea plants are predominantly cultivated in Asia, producing some of the 42 most popular non-alcoholic beverages in the world [7, 8]. The characteristics of tea are primarily 43 composed of its chemical components (tea flavor-related chemical compounds [TFCC] 44 hereafter), including gallic acid (GA), caffeine (CAF) and eight catechin isomers including 45 gallocatechin (GC), epicatechin gallate (EGC), catechin (C), epicatechin (EC), epigallocatechin 46 gallate (EGCG), gallocatechin gallate (GCG), epicatechin gallate (ECG) and catechin gallate 47 (CG) [8]. These chemicals contribute not only to the flavor of a cup of tea but also to active 48 responders to the environments in fresh tea leaves as tea plants grow [9]. The catechins, GA and 49 CAF mainly contribute to the astringency and bitterness of taste, which is the main body of the 50 tea soup, and the catechins oxidation is the essential chemical reaction that determines the tea 51 characteristics during the manufacturing process [10, 11]. The more we understand the TFCC 52 status in tea leaves, the more effectively we can decide on the management of the tea plants and 53 control the quality of tea production. 54

Methods

have been developed to quantify TFCC, such as gas chromatography or high-55 performance liquid chromatography (HPLC) [12-14]. However, they require pre-treatment, 56 leading to slow data retrieval, and are infeasible for the real-time TFCC assessment. One 57 practical alternative is non-destructive optical (approximately 400–2500 nm) spectroscopy [15, 58 16]. Previous studies employed optical spectroscopic estimation methods for some TFCC in 59 ground tea leaves [17, 18]. Huang et al. [19] and Wang et al. [20] used leaf reflectance to 60 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 5 estimate pigments and four main catechins and CAF in fresh green leaves using partial least 61 squares regression (PLS). However, these linear models may not be suitable for the prediction 62 due to the complex nature of TFCC. Yamashita et al. [6] uses machine learning, including 63 random forests (RF) and cubist, to analyze leaf reflectance and quantify TFCC in fresh tea 64 shoots. However, information may be lost solely relying on leaf reflectance during optimization. 65 Although reflectance provides valuable insights into interactions between reflected photons and 66 tea leaf surface, some TFCC may absorb light at specific wavelengths, and those features may 67 not be able to be delineated by reflectance. 68 Leaf transmittance, another measurable leaf optical attribute, indicates the proportion of the 69 incident energy passing through a substance without being reflected and absorbed; transmittance 70 variation may also be related to TFCC. By combining both leaf reflectance and transmittance, we 71 could optimize TFCC modeling in fresh tea shoots. Due to the potentially complex relationship 72 between green vegetation optics and TFCC (e.g., [6]), we employed deep learning to derive 73 TFCC from fresh tea shoots in this study. To our knowledge, this has yet to be carried out in 74 previous literature. Deep learning is more effective in handling large and high-dimensional 75 datasets than machine learning, which frequently requires manual feature engineering and 76 domain-specific knowledge to extract relevant features. The key strength of deep learning lies in 77 its ability to automatically learn and extract important features, enabling it to recognize complex 78 patterns and relationships in the data. Hence, the main objective of this study is to use an 79 advanced deep-learning algorithm (Y-Net, a convolutional neural network [CNN] based residual 80 network [ResNet] design approach) to unravel the complex relationship between tea shoot leaf 81 reflectance/transmittance and TFCC. 82 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 6 2. Materials and Methods 83 2.1. Tea shoot spectra collection and analysis 84 We conducted the field campaign in the Greater Alishan (or Ali-Mountain) tea plantation region 85 (23.47 N, 120.69 E) located in Central Taiwan within the elevation range of 800–1600 m a.s.l. 86 (Fig. 1). Fresh tea shoots (mainly the Chin-Hsin-Oolong cultivar [Camellia sinensis var. 87 sinensis]) were collected from 15 tea farms with the mean (± standard deviation [SD]) elevation 88 of 1225 ± 178 m a.s.l. ranging from 879 to 1552 m a.s.l (Table 1). The data collection period was 89 from April 12 to June 30, 2022, and May 11–24, 2023, encompassing the spring and summer tea 90 harvesting seasons. For each farm, we randomly collected two airtight polyethylene bags of tea 91 shoots, and each bag contained approximately 50–60 samples (an apical bud and three leaves). 92 Tea shoots were placed in a 0 ºC cooler right after being destructively sampled and were 93 transferred to the laboratory within the same day (mostly within six hours). 94 We randomly selected 30 samples from each bag to measure tea shoot spectra. Tea leaf 95 reflectance and transmittance were assessed using a portable spectroradiometer (FieldSpec 3, 96 Analytical Spectral Devices, Inc., Boulder, Colorado, USA) with a single integrating sphere 97 (ASD RTS-3ZC). The spectral resolutions were 3 nm (between 350 and 1000 nm) and 10 nm 98 (1000–2500 nm), with sampling intervals set at 1.4 nm and 2 nm, respectively. Due to the sizes 99 of the sample ports of the integrating sphere (diameters ≥ 13 mm), we only measured the spectra 100 of the third leaf (the largest one) for each tea shoot. To ensure the freshness of the tea leaf 101 samples for subsequent biochemical analysis, each spectral reading was an average of 10 102 measurements. To assess the reflectance (Rs) and transmittance (Ts) of tea leaves, a non-103 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 7 polarized measurement method was employed by referring to the manufacturer-recommended 104 procedure and [21]: 105 /g1844 /g3046/g3404 /g4666/g3010/g3294/g3267/g2879/g3010/g3279 /g3267/g4667/g3019/g3293 /g4666/g3010/g3278/g3267/g2879/g3010/g3279 /g3267/g4667, (1) 106 /g1846 /g3046/g3404 /g4666/g3010/g3294/g3269/g2879/g3010/g3279 /g3269/g4667/g3019/g3293 /g4666/g3010/g3296/g3269/g2879/g3010/g3279 /g3269/g4667, (2) 107 where Is, Ic, Iu and Id represent spectroradiometer readings taken from the tea leaf sample, 108 calibrated and un-calibrated white references, and stray light (measured using a light trap), 109 respectively. Superscripts R and T denote reflectance and transmittance, respectively, and Rr 110 indicates the reflectance factor of the calibrated white reference. The procedure effectively 111 measures the directional reflectance and transmittance factors [22]; the terms "leaf reflectance" 112 and "leaf transmittance" are used for simplicity. One major issue with spectral data collection in 113 a moist region like Taiwan is that high humility may introduce erroneous signals [23, 24]. Hence, 114 we employed a statistical/mathematical spectral reconstruction approach to retrieve noise-free 115 fresh tea leaf spectra involving spectral database matching, multivariable linear regression, linear 116 parameter multiplication and spectral reversion. The procedure is beyond the scope of this study, 117 and details of the spectral pre-processing and reconstruction were described in [25]. There were 118 1016 pairs (total n = 2032) of reflectance and transmittance data collected in this study. 119 2.2. TFCC analysis 120 Leaf samples (n = 30) were returned to the polyethylene bags (total 36 bags) after leaf spectral 121 measurement, and the samples were then immediately placed in a -20 ºC cold storage unit until 122 TFCC analysis. We note that instead of using the spectrally measured leaf [19], we processed an 123 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 8 entire tea shoot (an apical bud and three leaves) since tea makers use the former one to 124 manufacture dried tea products. These samples were freeze-dried, ground into a fine powder, and 125 sifted through a 4 mm sieve. Then, 0.5 grams of this homogeneous tea powder was immersed in 126 50 ml of boiling deionized water for 20 minutes in a water bath held at 90/i1 . The obtained tea 127 infusions were subsequently filtered via 0.45 μ m syringe filters in preparation for HPLC 128 analysis. The HPLC analysis was set up using a Shimadzu system (PU-2089, AS-2057, UV-2075 129 detector; Shimadzu, Kyoto, Japan) with an equipped C18 column (5 μ m × 4.6 mm × 250 mm; 130 Waters, Milford, MA, USA). Solvents A (0.1 % formic acid aqueous solution) and B (the 131 acetonitrile) were introduced following a specific gradient, which involved a linear increment of 132 solvent B from 1% to 10% over 15 minutes, subsequently escalating from 10% to 20% in the 133 next 14 minutes, then further increasing from 20% to 22% within 6 minutes, and lastly 134 maintaining the 22% concentration for an additional 5 minutes. A flow rate of 1.0 ml per minute 135 was maintained throughout the separation process. Absorbance was monitored at 280 nm for 136 real-time tracking of peak intensities. The TFCC components GA, CAF and eight catechin 137 isomers (GC, EGC, C, EC, EGCG, GCG, ECG and CG) were quantified utilizing a calibration 138 curve for each compound, respectively. The calibration curve was formulated using peak areas 139 from HPLC chromatograms at pre-determined concentrations with commercially available 140 chemical compounds (Sigma-Aldrich, St. Louis, MO, USA). All samples were analyzed in 141 triplicate to ensure consistency and accuracy (n = 108). 142 2.3. Deep learning (Y-Net) 143 2.3.1. Data augmentation 144 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 9 We divided 36 bags of samples into training (70% of the data for developing models) and testing 145 (30% for validation) groups. It is necessary to obtain sizable spectral and TFCC datasets to 146 develop a robust deep-learning model for learning and optimization. However, it is not feasible 147 to produce a large amount of samples of TFCC for deep learning using HPLC due to the nature 148 of the analytical procedure. To mitigate this technical challenge, a simplified approach was 149 adopted using an augmentation process. The rationale for employing the augmentation process 150 stems from the typically distributed correlation between spectral and biochemical characteristics, 151 similar to the principles applied in Gaussian processes [26]. Hence, the augmentation involves 152 using a normal distribution and requires the mean and SD from each bag of TFCC sample to 153 simulate 30 TFCC data to match the sample size of reflectance and transmittance data (illustrated 154 in Fig. 3a). This augmentation technique facilitates expanding the dataset and increases the 155 variation of the available data, thereby enhancing the training process and improving the ability 156 of the model to generalize new data (also see Chen et al. [27] and Wieland et al. [28]). 157 2.3.2. Y-Net structure 158 Y-Net is a specific architecture that utilizes feedforward neural networks (FNN) or CNN-based 159 ResNet approaches. The algorithm can take both reflectance and spectrally corresponding 160 transmittance as input (Fig. 3b). The Y-Net architecture consists of a one-dimensional (1D) 161 convolution block (Fig. 3c), 1D max-pooling, fusion and fully connected layers. The process can 162 be categorized into three steps, including feature extraction, fusion extraction and TFCC 163 prediction: 164 Feature extraction: This step comprises the encoder, which consists of six processes 165 responsible for extracting meaningful features from the input spectral data. The input dimension 166 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 10 was ~×2101×1, indicating one channel and a 1D spectral array with 2101 data points. The 167 symbol ~ represents the number of sample data corresponding to 36 datasets here. The processes 168 numbered 1, 3 and 5 involved convolutional blocks which are part of ResNet processing design, 169 and those with 2, 4 and 6 entailed max-pooling operations (Fig. 3c). The convolutional block 170 operation uses six layers including convolution, batch normalization, activation, and residual 171 layer added to the last layer (Fig. 3d). The kernel size was 1×5, indicating that the convolution 172 operation performed in the spectral windows with a stride of one and using padding to preserve 173 the data dimension. Each convolutional block comprised several kernel filters, and an increased 174 number of kernel filters in each block reduced the spectral dimension. This convolutional block 175 was responsible for feature extraction, mitigating the vanishing gradient problem and preserving 176 information across layers. The max-pooling process occurs between convolutional blocks and 177 was applied to a 1D array using a window size of two. It found the maximum value within each 178 window and applied a stride of two, meaning it moved by two positions at a time. A rectified 179 linear unit function was employed as the activation function in this architecture. It introduced 180 non-linearity by setting negative values to zero and leaving positive values unchanged. The 181 feature extraction step has six convolutional blocks (three reflectance and three transmittance 182 branches). Each corresponding convolutional block (reflectance and transmittance) has a various 183 number of parameters, namely [96, 64, 1296, 64, 96], [2592, 128, 5152, 128, 2592] and [10304, 184 256, 20544, 256, 10304], in which the total parameters are 10744, and these parameters 185 corresponded to the weights, normalization and biases in the filter masks; where each 186 convolutional block stands for [conv, norm, conv, norm, conv]. These parameters were 187 responsible for capturing and learning the patterns within the data. 188 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 11 Fusion extraction: The fusion extraction consisted of three processes numbered 7, 8 and 9 189 (Fig. 3c). The fusion process combined the outputs of the right (reflectance) and left 190 (transmittance) processes. It merged the information from these two sources using convolutional 191 block and max-pooling operation, resulting in a fused single processing operation. In the fusion 192 extraction step, one convolutional block fuses the reflectance and transmittance branches. It 193 consists of five parameters [82048, 512, 82048, 512, 82048], which are [conv, norm, conv, norm, 194 conv], respectively. The total number of unknown parameters in the fusion extraction step was 195 247168. These parameters include normalization, weights and biases associated with the 196 convolutional and max-pooling operations. 197 TFCC prediction: The process involved in transitioning from the convolutional to the 198 dense processes. This transition included flattening and dropping the data, which converted it 199 from a 2D representation (0-axis for spectral and 1-axis for kernel filters) to 1D dense vectors 200 and passed it through fully connected layers. The reshaped vector had a length of 33290 (Fig. 3c, 201 step 10), which was determined by the size of the output from the last max-pooling in the 202 previous step, which has the dimension ~×26×128. The vector was then connected to a hidden 203 layer with 10 neurons (Fig. 3c, step 11), representing the 10 TFCC components of the first 204 dataset. In the TFCC estimation step, there were 33290 unknown parameters with weights and 205 biases. 206 A total of 388202 parameters were utilized for optimizing Y-Net, encompassing both the 207 convolutional block and fully connected layers. Studies employed FNNs to determine the 208 optimal neural network structure and parameters, such as filter sizes, number of filters, layers, 209 neurons and activation functions [29, 30]. However, this process often involves trial and error 210 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 12 and can be time-consuming. Moreover, due to limited in-situ samples and the potential for 211 overfitting, finding the optimal neural structure with the best parameters may not always be 212 feasible. Therefore, this study adopts an alternative strategy by emphasizing the design of the 213 steps within the neural structures rather than exclusively pursuing the optimization of 214 parameters. 215 2.3.3. Y-Net Transfer Learning 216 The training of Y-Net was performed in two stages. In the first stage, we applied data 217 augmentation to 1080 samples from 36 batches. In the second stage, the data was transformed by 218 taking the average of three observed data points, resulting in 36 averaged data points 219 corresponding to the 36 batches. Subsequently, we split the data into training (70% of the data) 220 and testing (the rest of the data) sets. The number of the observed points was less than the 221 augmented points to train Y-Net. During Y-Net training, observed data were less than the 222 augmented ones. Hence, there were fewer actual data points available for training Y-Net. 223 Utilizing the augmented data points made it possible to train Y-Net effectively despite the 224 limited number of observed data points. In the initial stage of the two-stage training process (Fig. 225 3d), the focus was on reaching a local optimum by adjusting the weights, biases and dense 226 parameters from their initial values. The second stage was designed to achieve the global 227 maximum while also reducing the overall training time required to reach the optimal model. In 228 other words, the initial parameters used in the second stage were derived from the pre-trained 229 model obtained during the first stage of training. In the optimization process, we applied the 230 built-in function Adam optimizer. It incorporates adaptive learning rate and moment techniques 231 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 13 during training to reach the optimal model. We utilized the mean squared error (MSE) as the loss 232 function (eq. 3): 233 /g1838/g3404 /g2869 /g3041∑ /g4666 predicted /g3036/g3398 observed /g3036/g4667 /g2870,/g3036/g2880/g3041 /g3036/g2880/g2868 (3) 234 where the predicted /g3036/g3404 /g4668 /g1855/g1855/g3549 /g4669 /g3037/g2880/g2869 /g3011 and observed /g3036/g3404 /g4668 /g1855/g1855 /g4669 /g3037/g2880/g2869 /g3011; j is the index of TFCC J, n is the 235 number of data, and /g1855/g1855 and /g1855/g1855/g3549 are observed and predicted TFCC, respectively. During the 236 training process, a significant number of epochs were executed and the number of epochs was set 237 to 50. Throughout the training, the parameters of the epoch with the minimal loss value were 238 stored. These optimal parameter values and the network structure were implemented to estimate 239 TCFF. 240 2.4. Performance assessment 241 To assess the effectiveness and viability of Y-Net, we conducted a comparative analysis by 242 contrasting the performance of the two-stage training with that of the conventional one-stage 243 training. We employed the two-stage training approach to determine whether the additional 244 training stage would enhance model performance by referring to metrics losses. Furthermore, we 245 also selected an advanced multivariate statistical (PLS) approach [31] and other commonly used 246 machine-/deep-learning methods, including cubist [32] with de-trend pre-processing [6], 247 feedforward neural networks (FNNs) [33], Gaussian process [34] and RF [35]. Since these 248

Methods

have been commonly used in scientific literature, we only provide key references here 249 for expedience. To assess the performances of these methods and Y-Net, we use the root-mean-250 square error (RMSE) as the evaluation metric, as it measures the average deviation between the 251 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 14 predicted and actual observed values. To further evaluate the consistency and stability of the 252 models, we also took the mean and SD into account. 253 2.5. Y-Net spectral analysis 254 It is important to understand the contribution of different tea shoot spectral regions to estimate 255 TFCC in Y-Net. Therefore, we carried out a spectral feature importance analysis. The feature 256 important analysis took the features of each hidden layer in reflectance and transmittance 257 branches. From the first layer of each branch to the fusion layers, we resampled, averaged and 258 normalized those to the scale of 0–1 to assess their importance to TFCC prediction. The feature 259 importance analysis can be categorized into the local and global aspects to identify the pattern. 260 The global pattern constitutes a significant contribution that can be visually identified, whereas 261 the local pattern represents a minor contribution that requires a tool to unveil subtle information. 262 Therefore, the high-pass filter was also adopted using a fast Fourier transform to enhance the 263 minor contribution by capturing the high frequency in the feature importance [36]. In addition, 264 we carried out a spectral subset analysis. We investigated if visible and near-infrared spectral 265 regions (VNIR, 400–1100 nm) would be sufficient to estimate TFCC using Y-Net since they are 266 responsive to photosynthesis/non-photosynthesis pigments (mainly chlorophyll a and b, 267 carotenoids, xanthophylls and anthocyanins) [37] and cell structure (e.g., the mesophyll cells and 268 the fraction of air space) [38, 39] of fresh green leaves. Therefore, VNIR could be much more 269 sensitive than leaf water content dominant shortwave-infrared bands (SWIR, about 1300–2500 270 nm) [40]. Moreover, the spectroscopic measurement may be preferable within VNIR due to the 271 relatively higher energy throughput by referring to Planck's radiation law and more cost-effective 272 for the instruments (silicon/InGaAs for VNIR vs. InSb). Finally, we used tea shoot reflectance of 273 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 15 the entire optical region only (without transmittance) for Y-Net since the physical property may 274 be measured by airborne or spaceborne sensors permitting regional mapping of TFCC. 275 3. Results 276 3.1. Input data assessment 277 To evaluate the performance of Y-Net, it is important to compare the data distribution of the 278 observed data and augmented data (Fig. 4). We found that they have similar characteristics by 279 referring to statistics indicating the central tendency and spread of the data (Table S1). Therefore, 280 the observed and augmented data have similar distributions, which justifies using the latter for 281 further analysis. In addition, to investigate the data distribution of observed and augmented data, 282 a loss value (MSE) comparison for a hundred epochs was conducted in the optimization process 283 (Fig. 5). The optimization process provided two stages of training, with the first stage using 284 augmented data and the second stage using observed data. There was an apparent improvement 285 in the second stage, especially at the initial epoch; the difference in MSE became relatively 286 negligible after 25 epochs. The performance of the two-stage training through this optimization 287 was satisfactory by comparing the observed and predicted training and validation data (Fig. 6 288 and Table S2). Overall, the data distributions of those groups were comparable except for some 289 outliers of C, ECG and CAF. 290 3.2. Model performance comparison 291 Y-Net underwent pre-training and acquired knowledge through a novel two-stage training 292 approach (Fig. 5), utilizing TFCC from observed and augmented data (Fig. 4 and Table S1). 293 Overall, the performance of Y-Net (mean ± SD of RMSE = 2.51 ± 2.20) to model the variation 294 of TFCC was superior to (yielding the lowest mean and SD) selected statistical (PLS) (7.86 ± 295 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 16 7.58) and machine learning (cubist, FNNs, Gaussian, RF) methods (2.59 ± 2.64). The TFCC 296 assessment of Y-Net was relatively consistent by referring to SD demonstrating its effectiveness 297 across various compounds. However, the performance for modeling each TFCC compound 298 differed (Table 2). 299 3.3. Y-Net spectral analysis 300 According to our spectral feature importance analysis, several spectral regions of reflectance and 301 transmittance were pivotal for modeling TFCC; reflectance (55% of the optical spectral bands) 302 played a more critical role than transmittance (45%) by referring to feature important values 303 (Fig. 7). Specifically, after applied the high-pass filter (adopted with fast Fourier transform), 304 spectral bands of reflectance and transmittance located around 400 nm, 700 nm, 1400 nm, 1900 305 nm and 2500 nm were identified to be crucial for indirectly estimating TFCC. The spectral 306 subset analysis depicts that, overall, the Y-Net using both reflectance and transmittance of the 307 entire optical region (400–2500 nm) (mean ± SD of RMSE = 1.29 ± 1.08 for training and 2.51 ± 308 2.20 for validation datasets) (Table 3) performed slightly better and relatively more consistent 309 (with the lowest mean and SD) than the VNIR (2.72 ± 2.49; 2.76 ± 2.41) and the reflectance-310 only (2.07 ± 2.11; 2.68 ± 2.74) Y-Nets. However, the performance of estimating each TFCC 311 compound was different. 312 4. Discussion 313 4.1. Y-Net performance 314 The Y-Net architecture features a dual-branch input, integrating a ResNet-based CNN [41, 42]. 315 The dual input of Y-Net involves data augmentation or enrichment and pre-training through a 316 novel two-stage training approach (Fig. 3a and 3d), including utilizing TFCC obtained from both 317 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 17 augmented and observed data measurements. This unique process markedly improves the 318 performance in predicting TFCC, as evidenced by the initial epoch of the second-stage training 319 involving pre-training using augmented data. Compared to conventional machine-learning 320 methods, the enhancement in performance is demonstrated (see Table 2). The reduction in loss 321 during this phase can be attributed to the utilization of pre-trained parameters from the first-stage 322 training in the second stage. This process serves a dual purpose: Preventing overfitting arising 323 from limited observational data and augmenting the capacity for model generalization. 324 Furthermore, this approach reinforces the adaptability and robustness of the model. It facilitates 325 effective generalization of model performance across diverse datasets, utilizing data 326 augmentation and chemical compounds (TFCC) through feature extraction, fusion and 327 estimation (Fig. 3b). Therefore, Y-Net becomes adept at capturing intricate data patterns with a 328 specific focus on optimizing both local and global aspects during training. The local aspect of Y-329 Net involves capturing details or specific patterns of the input data. In contrast, the global aspect 330 recognizes the overarching trend or general pattern spanning the entire dataset. Additionally, 331 integrating ResNet into the architecture addresses challenges such as vanishing gradients and 332 ensuring information preservation across layers (Fig. 3c and 3d). 333 4.2. Spectral importance to TFCC retrieval using Y-Net 334 To assess the performance of Y-Net, we aim to compare the spectral effectiveness of the optical 335 region (400–2500 nm) using feature important values by calculating the average feature 336 extraction values in each branch, namely reflectance and transmittance. The average importance 337 was derived from each convolutional block in the deep CNN (total of 37 layers). Each layer can 338 yield results for feature importance, referred to as spectral contribution to predict TFCC. In the 339 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 18 hidden layers, higher spectral feature values contribute significantly to the prediction of TFCC, 340 whereas lower values indicate a comparatively less influential role in the predictive process. The 341 analysis of spectral contributions across layers aids in understanding the significance of various 342 features and their impact on accurately estimating TFCC concentrations by Y-Net throughout the 343 optical range, particularly at 400 nm, 700 nm, 1400 nm, 1900 nm and 2500 nm (Fig. 7). To 344 qualitatively identify patterns, encompassing both local and global aspects, and to isolate local 345 patterns within the branch layers of reflectance and transmittance, a high-pass filter was 346 employed in this study [43, 44]. The high-pass filter adopts a fast Fourier transform that can 347 enhance the minor contribution or local aspect by capturing the high frequency in the feature 348 importance [35]. It is acknowledged that global aspects include visually apparent patterns such as 349 trends and curves of the feature importance. Those global patterns are a significant contribution 350 to TFCC prediction. On the other hand, interpreting local aspects may be challenging due to their 351 minor contribution despite utilizing a substantial influence on TFCC prediction within specific 352 wavelength regions. Using the high-pass filter, we can extract local patterns instead of 353 considering them as interference, especially the feature importance in the deep layer networks. 354 The local pattern was exhibited and appeared in the wavelength at around 400, 500, 700, 1150, 355 1400, 1900 and 2500 mm (Fig. 7). These significant patterns are evident at the two ends of the 356 optical region (400 nm, 2500 nm) and at 1900 nm, while lower significances were observed 357 around 1150 nm and 1400 nm. 358 We also compared the spectral effectiveness of the optical region (reflectance and 359 transmittance) with the models ingesting the VNIR range of 400–1100 nm of reflectance and 360 transmittance (due to high signal-to-noise ratio and cost-effective instrument), and reflectance 361 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 19 only (for airborne and spaceborne remote sensing applications) of the entire optical region (Table 362 3). Since the spectral patterns of green vegetation reflectance and transmittance are similar [38] 363 (Fig. 2), they yielded almost identical feature important values (Fig. 7). Thus, the reflectance 364

Method

was adopted as the comparison in this study. The comparison was visually addressed 365 using the feature importance and their high-pass filter, specifically for reflectance and 366 transmittance, in each branch, while the transmittance was not used in the reflectance method. 367 This implies that by employing VNIR in the experiment, we neglect the significant important 368 features and the less significant features in the spectral range of 1101 nm to 2500 nm (Table 3, 369 VNIR), namely that evident in the spectral edges (400 nm, 2500 nm) and at 1900 nm. Therefore, 370 the proposed Y-Net using entire optical region reflectance/transmittance outperformed the other 371 two Y-Nets models. This is because, although they share almost identical important features and 372 local patterns, they complement each other, with specific patterns present in spectral reflectance 373 that may not be found in spectral transmittance. We also note that the performance of the Y-Net 374 was merely slightly better than the VNIR and reflectance-only Y-Nets. The spectral analysis 375 demonstrates that we could re-produce similar outcomes with Y-Net using low-cost instruments. 376 In addition, this study also underscores the potential of using airborne or spaceborne 377 hyperspectral remote sensing for regional monitoring of TFCC. 378 4.3. Uncertainties and limitations 379 While Y-Net demonstrates satisfactory performance (Table 2), uncertainties exist, primarily 380 attributed to the complexity of TFCC. The intricate interactions and dependencies among various 381 TFCC pose challenges in achieving absolute precision. The performance of Y-Net is contingent 382 on the quality and diversity of the training data, such as using additional augmented data (Fig. 383 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 20 3a). Limited representation of specific chemical profiles in the training set may lead to 384 challenges in accurately predicting less-represented compounds. For instance, the range of 385 chemical compound values in augmented data may not be sufficient for effective Y-Net training. 386 Expanding the range of compound values in Y-Net involves optimizing sensitivity and precision. 387 Additionally, the generalization of Y-Net to diverse tea cultivars and physical environments 388 could pose a limitation. The inherent variability in TFCC across different varieties and growth 389 environments could ramify the predictive accuracy. Continuous refinement and expansion of the 390 training dataset with actual data (without data augmentation) are crucial measures that could 391 address some of these limitations over time. This ongoing effort aims to enhance the adaptability 392 and robustness of Y-Net to a broader spectrum of TFCC profiles, ultimately improving its 393 applicability across diverse environmental conditions and TFCC. 394 5. Conclusions 395 In this study, we proposed Y-Net, a deep-learning approach, to estimate TFCC from fresh tea 396 shoots indirectly using their optical reflectance and transmittance. We found that the 397 performance of Y-Net was superior to some commonly employed advanced statistical (PLS) and 398 machine-/deep-learning (cubist, FNNs, Gaussian and RF) methods by referring to RMSE. The 399 dual-input approach of Y-Net, considering both reflectance and transmittance simultaneously, 400 outperformed those selected single-input models, providing a more comprehensive 401 representation of TFCC and improving predictive accuracy. Our spectral analyses showed that 402 some specific bands located at 400 nm, 500 nm, 700 nm, 1150 nm, 1400 nm, 1900 nm and 2500 403 nm were important for retrieving TFCC. In addition, the reduction of the spectral range (VNIR 404 Y-Net) or exclusion of transmittance (reflectance only Y-Net) would also retard the overall 405 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 21 performance. These results emphasize the importance of advanced optimization techniques and 406 leveraging multiple sources of spectral data for precise chemical compound prediction in fresh 407 tea shoots. 408 Acknowledgments 409 We appreciate the field and lab assistance provided by Chi-Ching Huang, Yi-Ching Hung and 410 Zih-Yu Shen. This work was supported by the National Science and Technology Council (NSTC 411 112-2321-B-002-016), National Taiwan University (NTU-AS-112L104303), and the Research 412 Center for Future Earth, the Featured Areas Research Center Program, the Higher Education 413 Sprout Project, Ministry of Education (Taiwan). 414 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 22

References

415 [1] R.A. Dixon, D. Strack, Phytochemistry meets genome analysis, and beyond ......... 416 Phytochemistry, 62 (2003) 815–816. 417 [2] C. Fang, A.R. Fernie, J. Luo, Exploring the diversity of plant metabolism, Trends in Plant 418 Science, 24 (2019) 83–98. 419 [3] A. Rai, K. Saito, M. Yamazaki, Integrated omics analysis of specialized metabolism in 420 medicinal plants, The Plant Journal, 90 (2017) 764–787. 421 [4] J.K. Weng, The evolutionary paths towards complexity: A metabolic perspective, New 422 Phytologist, 201 (2014) 1141–1149. 423 [5] Q. Yang, Y. He, M. Kabahuma, T. Chaya, A. Kelly, E. Borrego, Y. Bian, F. El Kasmi, L. 424 Yang, P. Teixeira, J. Kolkman, R. Nelson, M. Kolomiets, J.L. Dangl, R. Wisser, J. Caplan, X. Li, 425 N. Lauter, P. Balint-Kurti, A gene encoding maize caffeoyl-CoA O-methyltransferase confers 426 quantitative resistance to multiple pathogens, Nature Genetics, 49 (2017) 1364–1372. 427 [6] H. Yamashita, R. Sonobe, Y. Hirono, A. Morita, T. Ikka, Potential of spectroscopic analyses 428 for non-destructive estimation of tea quality-related metabolites in fresh new leaves, Scientific 429 Reports, 2021, pp. 4169. 430 [7] Y. Hilal, U. Engelhardt, Characterisation of white tea - Comparison to green and black tea, 431 Journal of Consumer Protection and Food Safety, 2 (2007) 414–421. 432 [8] S.Y. Pan, Q. Nie, H.C. Tai, X.L. Song, Y.F. Tong, L.J.F. Zhang, X.W. Wu, Z.H. Lin, Y.Y. 433 Zhang, D.Y. Ye, Y. Zhang, X.Y. Wang, P.L. Zhu, Z.S. Chu, Z.L. Yu, C. Liang, Tea and tea 434 drinking: China’s outstanding contributions to the mankind, Chinese Medicine, 17 (2022) 27. 435 [9] S. Ahmed, T.S. Griffin, D. Kraner, M.K. Schaffner, D. Sharma, M. Hazel, A.R. Leitch, C.M. 436 Orians, W. Han, J.R. Stepp, A. Robbat, C. Matyas, C. Long, D. Xue, R.F. Houser, SB Cash, 437 Environmental factors variably impact tea secondary metabolites in the context of climate 438 change, Frontiers in Plant Science, 10 (2019) 939. 439 [10] L. Zhang, Q.Q. Cao, D. Granato, Y.Q. Xu, C.T. Ho, Association between chemistry and 440 taste of tea: A review, Trends in Food Science & Technology, 101 (2020) 139–149. 441 [11] D. Arumai Selvan, D. Mahendiran, R. Senthil Kumar, A. Kalilur Rahiman, Garlic, green tea 442 and turmeric extracts-mediated green synthesis of silver nanoparticles: Phytochemical, 443 antioxidant and in vitro cytotoxicity studies, Journal of Photochemistry and Photobiology B: 444 Biology, 180 (2018) 243–252. 445 [12] D.A. Dias, O.A.H. Jones, D.J. Beale, B.A. Boughton, D. Benheim, K.A. Kouremenos, J.L. 446 Wolfender, D.S. Wishart, Current and future perspectives on the structural identification of small 447 molecules in biological systems, Metabolites, 6 (2016) 46. 448 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 23 [13] AR Fernie, R.N. Trethewey, A.J. Krotzky, L. Willmitzer I N N Ovat I O N, Metabolite 449 profiling: From diagnostics to systems biology, Nature Reviews Molecular Cell Biology, (2004) 450 763–769. 451 [14] J.L. Wolfender, J.M. Nuzillard, J.J.J. Van Der Hooft, J.H. Renault, S. Bertrand, 452 Accelerating metabolite identification in natural product research: Toward an ideal combination 453 of liquid chromatography-high-resolution tandem mass spectrometry and NMR profiling, in 454 silico databases, and chemometrics, American Chemical Society Publications, 91 (2019) 704–455 742. 456 [15] L. Li, J. Huang, Y. Wang, S. Jin, M. Li, Y. Sun, J. Ning, Q. Chen, Z. Zhang, Intelligent 457 evaluation of storage period of green tea based on VNIR hyperspectral imaging combined with 458 chemometric analysis, Infrared Physics & Technology, 110 (2020) 103450. 459 [16] M.N. Merzlyak, T.B. Melo, K. Razi Naqvi, Estimation of leaf transmittance in the near 460 infrared region through reflectance measurements, Journal of Photochemistry and Photobiology 461 B: Biology, 74 (2004) 145–150. 462 [17] M.S. Lee, Y.S. Hwang, J. Lee, M.G. Choung, The characterization of caffeine and nine 463 individual catechins in the leaves of green tea (Camellia sinensis L.) by near-infrared reflectance 464 spectroscopy, Food Chemistry, 158 (2014) 351–357. 465 [18] H. Schulz, U.H. Engelhardt, A. Wegent, H.H. Drews, S. Lapczynski, Application of near-466 infrared reflectance spectroscopy to the simultaneous prediction of alkaloids and phenolic 467 substances in green tea leaves, Journal of Agricultural and Food Chemistry, 47 (1999) 5064–468 5067. 469 [19] Y. Huang, W. Dong, A. Sanaeifar, X. Wang, W. Luo, B. Zhan, X. Liu, R. Li, H. Zhang, X. 470 Li, Development of simple identification models for four main catechins and caffeine in fresh 471 green tea leaf based on visible and near-infrared spectroscopy, Computers and Electronics in 472 Agriculture, 173 (2020) 105388. 473 [20] Y.J. Wang, S.S. Jin, M.H. Li, Y. Liu, L.Q. Li, J.M. Ning, Z.Z. Zhang, Onsite nutritional 474 diagnosis of tea plants using micro near-infrared spectrometer coupled with chemometrics, 475 Computers and Electronics in Agriculture, 175 (2020) 105538. 476 [21] A. Hovi, P. Forsström, M. Mõttus, M. Rautiainen, Evaluation of accuracy and practical 477 applicability of methods for measuring leaf reflectance and transmittance spectra, Remote 478 Sensing, 10 (2018) 25. 479 [22] M.A. Greiner, BD. Duncan, M.P. Dierking, Bidirectional scattering distribution functions of 480 maple and cottonwood leaves, Applied Optics, 46 (2007) 6485–6494. 481 [23] A. Simic, J.M. Chen, S.G. Leblanc, A. Dyk, H. Croft, T. Han, Testing the top-down model 482 inversion method of estimating leaf reflectance used to retrieve vegetation biochemical content 483 within empirical approaches, IEEE Journal of Selected Topics in Applied Earth Observations 484 and Remote Sensing, 7 (2014) 92–104. 485 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 24 [24] H.M. Noda, T. Motohka, K. Murakami, H. Muraoka, K.N. Nasahara, Reflectance and 486 transmittance spectra of leaves and shoots of 22 vascular plant species and reflectance spectra of 487 trunks and branches of 12 tree species in Japan, Ecological Research, 29 (2014) 111. 488 [25] L. Denaro, H. Li, J. Chong, and C. Huang, Reconstructing fresh green leaf spectra in the 489 SWIR-2 region (2001-2500 nm) collected in a humid environment by referring to publicly 490 available green leaf spectral databases, bioRxiv, doi: https://doi.org/10.1101/2024.01.04.574267. 491 [26] S. Van Wittenberghe, J. Verrelst, J.P. Rivera, L. Alonso, J. Moreno, R. Samson, Gaussian 492 processes retrieval of leaf parameters from a multi-species reflectance, absorbance and 493 fluorescence dataset, Journal of Photochemistry and Photobiology B: Biology, 134 (2014) 37–494 48. 495 [27] L. Chen, Y. Wei, Z. Yao, E. Chen, X. Zhang, Data augmentation in prototypical networks 496 for forest tree species classification using airborne hyperspectral images, IEEE Transactions on 497 Geoscience and Remote Sensing, 60 (2022) 4410116. 498 [28] M. Wieland, Y. Li, S. Martinis, Multi-sensor cloud and cloud shadow segmentation with a 499 convolutional neural network, Remote Sensing of Environment, 230 (2019) 111203. 500 [29] V.K. Ojha, A. Abraham, V. Snášel, Metaheuristic design of feedforward neural networks: A 501 review of two decades of research, Engineering Applications of Artificial Intelligence, 60 (2017) 502 97–116. 503 [30] M. Paliwal, U.A. Kumar, Assessing the contribution of variables in feed forward neural 504 network, Applied Soft Computing, 11 (2011) 3690–3696. 505 [31] GLAA dos Santos, A.S. Reis, M.R. Besen, R.H. Furlanetto, M. Rodrigues, L.G.T. Crusiol, 506 K.M. de Oliveira, R. Falcioni, RB de Oliveira, M.A. Batista, M.R. Nanni, Spectral method for 507 macro and micronutrient prediction in soybean leaves using interval partial least squares 508 regression, European Journal of Agronomy, 143 (2023) 126717. 509 [32] P.C. Agyeman, J. Kingsley, N.M. Kebonye, V. Khosravi, L. Borů vka, R. Vašát, Prediction 510 of the concentration of antimony in agricultural soil using data fusion, terrain attributes 511 combined with regression kriging, Environmental Pollution, 316 (2023) 120697. 512 [33] P. Li, Classification of black tea leaf water content based on hyperspectral imaging, in: 513 2023 IEEE International conference on control, electronics and computer technology (ICCECT), 514 Institute of Electrical and Electronics Engineers Inc., 2023, pp. 621–624. 515 [34] W.S. Holmes, M.P.-L. Ooi, S. Abeysekera, Y.C. Kuang, R. Simpkin, M. Caddie, J. Nowak, 516 S. Demidenko, On machine learning methods to estimate cannabidiolic acid content of Cannabis 517 sativa L. from near-infrared hyperspectral imaging, in: 2023 IEEE International instrumentation 518 and measurement technology conference (I2MTC), 2023, pp. 01–06. 519 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 25 [35] S. Ye, H. Weng, L. Xiang, L. Jia, J. Xu, Synchronously predicting tea polyphenol and 520 epigallocatechin gallate in tea leaves using Fourier transform–near-infrared spectroscopy and 521 machine learning, Molecules, 28 (2023) 5379. 522 [36] X. Qiao, W. Huang, Spectral-spatial-frequency transformer network for hyperspectral image 523 classification, 2023 IEEE Sensors Applications Symposium (SAS), Institute of Electrical and 524 Electronics Engineers Inc., 2023. 525 [37] S.L. Ustin, J.A. Gamon, Remote sensing of plant functional types, New Phytologist, 186 526 (2010) 795–816. 527 [38] S. Jacquemoud, S. Ustin, Leaf optical properties in different wavelength domains, 528 Cambridge University Press, 2019. 529 [39] K. Klancnik, K. Vogel-Mikus, M. Kelemen, P. Vavpetic, P. Pelicon, P. Kump, D. Jezersek, 530 A. Gianoncelli, A. Gaberscik, Leaf optical properties are affected by the location and type of 531 deposited biominerals, Journal of Photochemistry and Photobiology B: Biology, 140 (2014) 532 276–285. 533 [40] C.y. Huang, W.R.L. Anderegg, G.P. Asner, Remote sensing of forest die-off in the 534 Anthropocene: From plant ecophysiology to canopy structure, Remote Sensing of Environment, 535 231 (2019) 111233. 536 [41] O. Andrew, A. Apan, D.R. Paudyal, K. Perera, Convolutional neural network-based deep 537 learning approach for automatic flood mapping using NovaSAR-1 and Sentinel-1 data, ISPRS 538 International Journal of Geo-information, 12 (2023) 194. 539 [42] C.P. Lee, K.M. Lim, Y.X. Song, A. Alqahtani, Plant-CNN-ViT: Plant classification with 540 ensemble of convolutional neural networks and vision transformer, Plants, 12 (2023) 2642. 541 [43] S.H. Yoo, G. Huang, K.S. Hong, Physiological noise filtering in functional near-infrared 542 spectroscopy signals using wavelet transform and long-short term memory networks, 543 Bioengineering, 10 (2023) 685. 544 [44] G. Blaney, C. Fernandez, A. Sassaroli, S. Fantini, Dual-slope imaging of cerebral 545 hemodynamics with frequency-domain near-infrared spectroscopy, Neurophotonics, 10 (2023) 546 013508. 547 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 26 Table 1. The sample sites and the corresponding elevations (from low to high) and sampling 548 dates. Long dashes (—) indicate no data (dates) available. 549 Site name Elevation (m) 2022 2023 Date 1 Date 2 Date 1 ZMG 879 04/19 — 05/24 LM 985 05/24 — — LT 1012 05/03 — — BHS 1080 04/22 — — FS 1116 05/24 — 05/16 SL 1212 06/17 — 05/11 CST 1221 04/22 — 05/24 YCL 1226 04/19 — — JP 1271 04/22 — — YL 1307 04/19 06/30 — HTD 1344 04/19 — — MF 1346 04/12 — 05/11 LY 1362 04/12 — — SY 1463 05/03 06/30 05/16 SYU 1552 05/03 — — 550 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 27 Table 2. Performance comparison of selected statistics (partial linear least squared [PLS]) and 551 machine-/deep-learning methods (cubist, three feedforward neural networks [FNN-1, FNN-2 and 552 FNN-3], Gaussian process, random forests [RF] and Y-Net based convolutional neural network) 553 to estimate TFCC using root-mean-square error (RMSE). The underscored numbers indicate the 554 lowest values for each TFCC and the mean and standard deviation (SD). 555 TFCC PLS Cubist FNN-1 FNN-2 FNN-3 Gaussian RF Y-Net GC 15.03 8.33 3.38 3.35 3.31 1.24 3.74 3.23 ECG 9.02 4.33 7.93 7.92 8.57 21.27 5.71 8.03 C 4.34 0.63 0.74 0.73 0.83 1.55 0.57 0.86 EC 2.68 1.29 1.53 1.52 1.59 2.63 1.14 1.49 EGCG 25.28 6.28 8.13 8.05 6.88 16.12 9.41 4.09 GCG 1.63 0.51 0.28 0.28 0.28 3.31 0.48 0.40 ECG 4.52 1.13 1.52 1.5 1.24 2.86 1.68 2.24 CG 0.12 0.3 0.06 0.05 0.05 4.37 0.05 0.36 GA 2.10 0.8 0.95 0.94 0.98 3.60 0.94 1.24 CAF 13.92 2.37 2.79 2.83 3.31 19.31 2.77 3.19 Mean 7.86 2.59 2.73 2.72 2.70 7.63 2.65 2.51 SD 7.58 2.64 2.82 2.81 2.75 7.52 2.80 2.20 556 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 28 Table 3. Y-Net performance comparison (RMSE) using spectra (reflectance and transmittance) 557 from the whole optical (400–2500 nm) and visible-near infrared (400–1100 nm) regions and 558 reflectance of the optical region only to estimate TFCC. The underscored numbers indicate the 559 lowest values for each TFCC, mean and SD, separated by training and validation. 560 TFCC Optical region VNIR Reflectance Training Validation Training Validation Training Validation GC 1.57 3.23 2.88 2.00 2.75 2.98 ECG 2.69 8.03 4.64 5.63 4.16 7.94 C 0.47 0.86 0.96 1.19 0.64 0.70 EC 0.66 1.49 1.43 2.02 0.92 1.70 EGCG 3.57 4.09 6.64 7.90 7.12 7.70 GCG 0.30 0.40 0.39 0.40 0.37 0.34 ECG 0.63 2.24 1.61 2.42 1.12 1.72 CG 0.38 0.36 0.25 0.29 0.38 0.36 GA 0.54 1.24 0.95 0.88 0.74 1.21 CAF 2.03 3.19 7.43 4.87 2.90 2.79 Mean 1.29 2.51 2.72 2.76 2.07 2.68 SD 1.08 2.20 2.49 2.41 2.11 2.74 561 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint 29 Figure legends 562 Figure 1. (a) The study area, the Greater Alishan tea plantations (green pixels) within the 563 elevation range of 800–1600 m a.s.l. (b) located in Central Taiwan (the yellow polygon). We 564 collected tea shoot spectral and tea flavor-related chemical compounds data in tea farms (Table 565 1) illustrated as yellow dots. 566 Figure 2. Pre-processing of tea leaf reflectance ((a) before and (c) after the spectral 567 reconstruction) and transmittance ((b) before and (d) after the spectral reconstruction) data. 568 Figure 3. The workflow of using Y-Net to estimate tea flavor-related chemical compounds 569 (TFCC) with tea shoot reflectance and transmittance. (a) Illustration of augmented from 570 observed data. (b) Y-Net-based ResNet (residual network) structure for TFCC prediction. (c) 571 Convolutional block of ResNet, Y-Net. (d) Two-stage training: Top (the first stage) and bottom 572 (the second stage). 573 Figure 4. The (a) observed (n = 114) and (b) augmented (n = 1135) TFCC including gallic acids 574 (GA), caffeine (CAF) and eight catechin isomers including gallocatechin (GC), epicatechin 575 gallate (EGC), catechin (C), epicatechin (EC), epigallocatechin gallate (EGCG), gallocatechin 576 gallate (GCG), epicatechin gallate (ECG) and catechin gallate (CG). 577 Fig. 5. Epoch comparison for 1st and 2nd stage learning; MSE represents the mean squared error. 578 Fig. 6. Estimation of TFCC using Y-Net with tea shoot spectra: Observed (a) training and (b) 579 validation; predicted (a) training and (b) validation datasets. 580 Figure 7. Feature important values of tea shoot optical spectra in Y-Net. 581 (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted March 8, 2024. ; https://doi.org/10.1101/2024.03.05.583504doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-06-16T06:25:30.133384+00:00