Full text
1,961 characters
· extracted from
oa-doi-fallback
· click to expand
Abstract
The Fabaceae plants (Legumes) are important for the economy and food sovereignty of México. Traits development of agronomic interest, and other biological activities of the Fabaceae plants are tightly related to the gene regulation, like the post-transcriptional gene repression mediated by microRNAs. Several artificial intelligence models have been developed for the miRNA precursor sequence prediction. They were based mainly on Convolutional Neural Network and Multi-Layer Perceptron architectures. Although the numerical encoding of nucleotide sequence and its secondary structure of pre-miRNAs implemented in these neural networks showed good performance, there are other encoding methods that have not been explored. Recently, a geometric construction of viral genome space and the numerical encoding of the archaea, bacteria, fungi and viruses genomes were successfully achieved employing natural vectors with covariance component. Natural vectors have also been used as input data during neural networks training for the classification of viral genomes. In consequence, in this work we mainly assessed the performance of neural networks as regression or classifier models trained with nucleotide sequences and its secondary structure representation encoded by natural vectors with covariance component alone or nested within the three sequences method. Additionally, we tested other characteristics of neural networks, and the results of training neural networks with natural vectors with covariates showed a better performance in predicting intrinsic nucleotide features, such as percentage of guanine and cytosine, pairwise-aligned sequence identity. Also, it showed good accuracy in categorizing miRNA precursor sequences compared with the results obtained from other encoding methods, that are often used in the numerical representation of nucleotide sequences.
Competing Interest Statement
The authors have declared no competing interest.
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.