Detecting Fake News Using Deep Learning Approaches

preprint OA: closed
Full text JSON View at publisher
Full text 99,191 characters · extracted from preprint-html · click to expand
Detecting Fake News Using Deep Learning Approaches | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Detecting Fake News Using Deep Learning Approaches Enas Tariq Khudair, Onsa Lazzez, Mourad Zaied, Tarek M. Hamdani, and 3 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7223076/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Deep learning (DL)-based detection systems for fake news should improve and adapt as fake news grows more complex in order to continue protecting the integrity of information in digital society. In addition to addressing a serious technological challenge, creating strong DL-based fake news detection systems is an important tool to maintain authenticity of information. Because of the difficulty of finding Arabic data, the method offers a basis for understanding the significance and difficulties of collecting data in Arabic. Due to the limited dataset, the suggested system has been translated from English into Arabic, and its performance and the potential of DL in identifying fake news were verified using another dataset that was accessible. Existing studies utilizing DL methods, like recurrent neural networks (RNNs), LSTMs, and convolutional neural networks (CNNs) for detecting fake news are discussed. For comparison, the term Aribert is used. It makes use of cutting-edge DL methods to propose a new method for identifying fake news. The paper offers a thorough framework for automatically detecting as well as classifying any false information on digital platforms by combining deep neural networks (DNNs) with natural language processing (NLP). Contextual understanding, temporal dependency, and content complexity are some of the main challenges with fake news detection that the approach solves. From purposefully created news articles to thinly veiled misleading content, the system displays great performance in detecting various types of misinformation. Furthermore, cutting-edge feature extraction (FE) methods have been used, which take into account metadata as well as textual content, such as propagation patterns and source credibility. When put to comparison with machine learning (ML) methods, the experimental results show notable gains in detection speed and accuracy. Spacey, Fasttext, and two-word embeddings were among the four methods that were employed. The best DNN method used Spacey to obtain a strong performance of 78%, whereas LSTM model performed well with Spacey at 49% and Fasttext at 51%. On accuracy scale, it scored a 99.1% success rate after converting to BERT, whereas AraBERT earned a 99.3% success rate. AraBERT is thus better. This is due to the fact that both AraBERT and BERT are trained on various Arabic articles. Deep learning Fake news detection CNN DNN RNN Hybrid LSTM and AraBERT Misinformation and Social Media Analysis Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 1. Introduction One of the significant and challenging problems is fake news, which faces numerous obstacles. Fake news is presently being studied because of the nature of languages and the challenge of spotting it. DL is a ML method that aims to depict the world as nested hierarchy of concepts that are identified by DL architecture automatically [ 1 ]. Specifically, DL—also referred to as deep structured learning—is a subset of ML technology that learns representations through artificial neural networks (ANNs). Semi-supervised, supervised, and unsupervised learning are the three learning categories. DL approaches that use DNNs have gained popularity as a result of the development of high-performance computer facilities. In particular, DL has more flexibility and functionality because it could manage a large number of functions while working with unstructured data. In essence, DL feature algorithms transfer data through a number of layers, each of which has the ability to progressively extract features before passing them on to the next layer. Low-level features are extracted in the first layer, and features are combined in the next layers in order to provide a comprehensive representation. DL was developed inseparably with time, achieving a blast of information from all over the world and in all kinds of structures. This information, which is known as "large information," comes from various sources, which include online searches, social media, web-based commercial platforms, and online movies. Through fintech applications, such as distributed computing, this enormous amount of information is instantly accessible and could be shared. DL can be defined as an artificial intelligence (AI) function mimicking the way that the human brain can process data and generate patterns for use in decision-making. In AI, it is a subset of ML that includes networks which can learn in unsupervised manner from unstructured data. To be more specific, DL encompasses a variety of networks, including CNN (Convolutional Neural Network), DBN (Deep Belief Network), Recursive Neural Network, RNN (Recurrent Neural Network), and a lot more. In order to achieve this, neural networks (NNs) are extremely helpful in sentence modelling, sentence classification, vector representation, word representation estimation, and future presentation [ 3 ]. A popular area of ML is DL, which draws inspiration from how the biological brain of humans handles information processing and draws conclusions. In order to improve accuracy for complex tasks, DL employs multiple layers of neurons, each of which is coupled to success layer. The use of an expert is reduced because to the development of DL from ML [ 4 ]. To make predictions, learn from data, and maximize performance, DL models are made up of a number of basic building components. Each of these elements—optimization algorithms, layers, activation functions, and loss functions—is essential to the model's capacity for learning from and generalizing from data [ 5 ]. With quickly developing applications spanning atomistic, image-based, textual, and spectral data modalities, DL is one of the materials data science subjects with the quickest rate of growth. It enables automated feature recognition and analysis of unstructured data. In this sense, the use of DL methods in atomistic prediction, has been stimulated by the recent growth of huge materials databases [ 6 ]. For various ML problems during the past 20 years, DL has provided the solution. Dependency on a large amount of labeled data and training costs are its two main drawbacks. Through reusing knowledge from a source data or task in training on a target data or work, transfer learning (TL) in DL, sometimes referred to as Deep Transfer Learning (DTL), aims to lower such cost and dependency. Most DTL approaches that are used are based upon networks or models [ 7 ]. Our contributions can be summarized below : Reading and preparing the dataset and breaking the task down into manageable parts. The first section used Spacy and Fasttext methods to perform preprocessing on the dataset. The second section used transformer models such as BERT and AraBERT. The third section used different deep learning methods, including RNN, CNN, and LSTM. CNN and RNN were combined to get the results and compare them with the methods without merging. The accuracy was adopted for evaluating the performance regarding the proposed system, combining the two methods, the Spacy and the Fasttext. 2. Related work Through utilizing ML-based fake news detection model, the authors in [ 8 ] offered a solution. Prerequisite data for the model must be taken from a variety of news websites. Data extraction, which is then utilized for creating datasets, was accomplished using the web scraping approach. True datasets and false datasets are the two main groups into which the data was categorized. Logistic Regression, Random Forest, KNN, Decision Tree, and Gradient Booster are the classifiers utilized to classify the data. Data was classified as either false or true depending on the result that has been obtained. The user could therefore determine on the webserver if the provided news is fake or not. To detect the fake text, authors of [ 9 ] suggested many classifier types. In order to determine which classifier would be most effective for identifying fake news in Arabic, they next tested each of such classifiers using two distinct Arabic datasets. Fast Text library was used with all of such classifiers to improve text classification by offering effective word representations. The authors of [ 10 ] employed a DL method that incorporated pre-trained word embedding that was trained with the use of four distinct datasets with many architectures, like the Bidirectional LSTM, CNN, and ResNet. For lessening data imbalances between the classes, every data undergoes a data augmentation procedure that utilizes back-translation method. The outcomes demonstrated that, across all evaluated datasets, the Bidirectional LSTM architecture performed better than ResNet and CNN. In [ 11 ], the researchers used many cutting-edge AI methods to assess the created dataset. A total of five ML methods has been specifically employed: KNN, Naïve Bayes, SVM, Logistic Regression, and Decision Trees. GloVe and BERT embeddings, on the other hand, have been employed with two DL methods, LSTM and CNN. F1-score, precision, recall, and accuracy were used for comparing the performance regarding all applied models as well as embeddings. According to the results, the best performance was achieved with LSTM initialized with GloVe embeddings. Additionally, the study examined the misclassified samples through contrasting them with human judgements. A thorough analysis of ML and DL-based methods for the detection of fake news was published in [ 12 ]. For practitioners and researchers interested in creating efficient fake news detection systems with DL and ML methods, their review offered insights and recommendations. Prior to reporting or publishing on a story, news reporters frequently need to confirm its authenticity. Reporters may eliminate fake news and concentrate on covering reliable and accurate information through using fake news detection models. The authors of [ 13 ] built DL models to identify fake news depending on news titles or content and used NLP methods for text analytics. The research's suggested solution seeks to be implemented in real-worlds social media platforms and get rid of the negative experience that users have when they acquire misleading stories from unreliable sources. Before vectorizing the text into N-gram vectors or sequence vectors with the use of terms frequency inverse document frequency (TF-IDF) or one-hot encoding, respectively, NLP methods employed text preprocessing methods, like regular expression, lemmatization, tokenization, and stop words removal. Tensor Flow was therefore selected as the framework to be utilized thanks to its integrated Keras DL libraries, which have a sizable community and enough comments on its GitHub repository to support the construction of DL-NN models. Since the main objective of fake news site is to sway public opinion on particular topics, the authors of [ 14 ] created an accurate and reliable model that can determine whether a news piece is fake or real. 3. The proposed Work The primary function of the proposed system is to detect news written in Arabic, whether it is fake or not (positive or negative). The practical components of the system are explained in this section. The system consists of two main components: collecting a dataset and using Arabic text to analyze the news. In this regard, collecting the dataset is a challenging step. The second component of the system analyzes the Arabic text. Fake news detection aims to identify forgery conveyed in a given text. Here, the system prioritizes texts written in Arabic. To do this task, this section recommends using different methods, such as DL algorithms. The proposed system achieved the goal of AFN detection by employing deep learning methods. As shown in Figure (1), due to the limited availability of Arabic databases, the dataset was translated from English to Arabic and used with other datasets for validating the performance regarding the suggested system. The procedures included reading and preparing the dataset and dividing the task into manageable parts. The first part of the work used four approaches, including the Spacy and FastText methods. The second part of the work used transformed models such as BERT and AraBERT. The third part used various deep learning methods, including DNNs, CNNs, and LSTMs, as well as hybrid CNN-LSTM models. All of these models were created using Fasttext and Spacey embedding methods. The best approach for deep neural networks was the achieved DNN. In the next subsections, each part of the proposed system architecture will explained : 3.1. Arabic Fake News Dataset To develop our classification model, a Kaggle dataset has been used for fake news detection. English was the initial language used to write the dataset. But since the work was conducted on an Arabic fake news detection feature, Python code and the "google-trans" package have been used for translating it into Arabic before exporting it as an Excel file. Our English-to-Arabic translator code was unable to translate several Russian and German characters in the dataset. As a result, such characters were eliminated as the initial preprocessing step, followed by the elimination of any English words that were still difficult to translate into Arabic, including terms and names (e.g., Donald Trump, Jennifer, etc.). Actually, 20,000 instances were first trained using Kaggle dataset. Following preprocessing, however, the the dataset included 10,000 instances, with 20% going to testing and 80% to training, with 5,000 of those being fake and 5,000 being real. Arabic and English numbers have been also eliminated, in addition to superfluous punctuation and pauses. Lastly, a pre-trained model segmentation tool has been used in order to divide each sentence into tokens throughout the training phase. 4. Word Embedding One of the most notable developments in NLP is Word Embedding. By converting words into vector-based numerical representations, such methods seek to improve computer models' comprehension of word relationships and meanings. Word Embedding approaches like Spacy and Fasttext map each word to a point in a high-dimensional space, in which text with comparable meanings are closely related, rather than encoding words as text strings. This shift from textual to numerical representations improves efficiency on a number of tasks, which include text classification, sentiment analysis, and machine translation. By improving models' capacity to comprehend language and interpret meaning, Word Embedding approaches expand the potential applications of AI. 4.1. Fast Text Embedding The Facebook AI Research team created the word embedding technology. Through adding subword information, it is intended to enhance conventional word representation models and produce better embedding, particularly for morphologically rich languages. Fast Text can comprehend and generate embeddings for morphologically complex languages by converting words down into a set of n-grams, which aids in understanding the internal structure regarding words. Conventional word embedding models, like Word 2Vec, rate a fixed vocabulary throughout training, which might cause problems with OOV words. In order to address this issue, Fast Text combines the embeddings of the n-grams that make up OOV words to create an embedding for them. This feature is especially helpful in practical applications where new words—like new slang and technical terms—occur often. 4.2. Spacy Embedding It is a robust and advanced Python NLP library that combines high performance and ease of use, especially for production use. It is appropriate for a variety of NLP and understanding applications due to its extensive feature set, support for contemporary word embeddings, and deep feature learning models. Data scientists and developers that work with language data frequently choose it because it provides a broad range of functions for effective text processing as well as analysis. Tokenization, part-of-speech tagging, named entity recognition (NER), multilingual support, pipeline engineering, dependency analysis, named entity recognition (NER), word vectors and embedding, semantic analysis, text classification, and visualization are some of the key features. 5. The AraBERT Model A pre-trained language model created especially for processing Arabic text, AraBERT can efficiently handle a variety of NLP tasks. With regard to language understanding, its BERT-based architecture makes use of the most recent developments in DL. AraBERT offers a strong framework for creating complex language applications to scholars and practitioners engaged in Arabic NLP. Key elements of AraBERT include its pre-training on a large dataset of Arabic text and its comprehension of the language's semantics, syntax, and common phrases. It is intended to capture the distinctive complexities and characteristics regarding Arabic language, including its dialectal variances, morphology, and script. Similar to original BERT, AraBERT understands a word in the context of all other words in a sentence, making it especially effective for tasks needing accurate comprehension. In comparison to models trained from scratch, AraBERT could be optimized for particular follow-up tasks, such as named entity recognition, question answering, and sentiment analysis, greatly enhancing performance. There are various AraBERT versions, including AraBERTv2 and AraBERTv1, each of which was trained on a different dataset or optimized for a particular set of tasks. Users are able to select the version that best meets their requirements. Token classification, classification tasks, and sequence classification are just a few of the NLP applications that AraBERT could be used for. Developers and researchers can easily utilize AraBERT for both non-commercial and commercial uses because it is usually available in an open-source format. 6. Neural Networks 6.1. Deep Neural Networks (DNNs): Advances in a variety of domains are made possible by DNNs, a versatile and powerful method for AI and ML. Through their capacity to represent complex relations among data, DNNs continue to propel technological and scientific advancements. Multiple hidden layers, an input layer, and an output layer make up DNNs. The ability for learning the complex features and patterns in data is made possible by the existence of several hidden layers. Neurons make up each layer of a DNN. After applying a weighted sum and adding a bias, each neuron processes the input from the preceding layer before passing it via a nonlinear activation function. The network can learn complex mappings between outputs and inputs by being fed such nonlinear activation functions. Data is fed into the input layer during the process of forward propagation, and it moves through each layer until reaching the output layer, in which classifications or predictions are made. DNNs learn by calculating the error between the actual and predicted outputs, a process known as backpropagation. In order to reduce error, the model after that modifies its weights in accordance with the loss gradient with respect to such weights. The degree to which the DNN's predictions and actual values agree is determined by the loss function. The cross-entropy loss for tasks of classification as well as the mean squared error regarding regression tasks are examples of common loss functions. To help the model converge to a minimum loss, the weights are updated during training using a variety of optimization algorithms. Since DNNs frequently identify hierarchical representations of data, they might automatically learn features from raw data without requiring a great deal of manual feature engineering. They could be used with unstructured data, time series, and structured data, among other data types and domains. DNNs are excellent for complex tasks because they could generalize well with enough data. Figure (2) shows the structure of a DNN-based model. It consists of three layers: hidden, output, and input. Keras was used to define a neural network model with a serial API. Utilizing the leaky ReLU activation function, the initial dense layer consists of eight units. The training data's shape serves as the basis for the input shape. The leaky rule comprises eight units in the second thick layer. The input's shape is deduced from the layer before it, the batch normalization layer, which aids in normalizing the inputs. Eight units comprise another dense layer and a Leaky Relu—leakage layer with leakage rate 0.5. Dropout is used as a regularization method. The output layer has two units and uses softmax activation. Particularly, this is a binary classification problem. The softmax function predicts the class that has the highest probability. Consequently, as neural network for binary classification with regularization methods has been created. 6.2. Convolution Neural Networks (CNNs): CNNs are crucial to modern AI applications and have completely transformed the field of computer vision by offering strong instruments for effectively and efficiently evaluating visual data. They are a particular type of DL model made particularly for handling structured network data, especially images. They do exceptionally well in tasks like object detection, segmentation, and image classification. In contrast to typical NNs, which use convolution in place of global matrix multiplication, pooling layers, activation functions, and fully connected layers, such networks have locally applied convolutional layers. Due to their relatively basic network architecture, CNNs are crucial for DL since they require fewer parameters, which increases processing speed efficiency. CNNs could also help with speech and handwriting recognition, image analysis, and natural language understanding because they require little preprocessing. In the presented study, a custom CNN that is desiged for particular sequence tasks is programmed using Keras and Sequence API. 6.3. Long Short-term Memory (LSTM): LSTM modules address the shortcomings of conventional RNNs and are an effective tool for sequential data analysis. They perform exceptionally well in various applications in speech recognition, time series prediction, and NLP due to their capacity to selectively forget and remember information. Long-term information retention is possible thanks to the memory cells found in LSTMs. More specifically, forget gates, input gates, and output gates are the three types of gates that LSTMs use to control information flow. A more complex NN with bidirectional LSTM layers has been constructed using Keras and a serial API. Each layer of the model will be looked at separately: Sequences are returned by the first layer, a 16-unit bidirectional LSTM layer with leaky ReLU activation. The input shape for such layer is determined by the training data dimensions. A batch normalization layer comes after the first layer of LSM. The model also includes an additional 16-unit bidirectional LSTM layer that generates sequences. One more batch normalizing layer comes after this phase. The model's third LSTM layer consists of 32 bidirectional units that return sequences. A batch normalization layer comes after this layer. The fourth layer of the LSTM simply provides the final state; sequences are not returned, despite having 32 units and being bidirectional. This layer is followed by a batch normalization layer. Leaky ReLU activation and a dense layer with 128 units are included in the model. The model contains an additional thick layer with the same properties. Leaky ReLU is activated at 128 units by the third dense layer. A dense layer with 256 units and Leaky ReLU activation is also present. Specifically, the output layer of the model is suitable for binary classification due to its softmax activation and two units. 6.4. Hybrid CNN and LSTM: CNN-LSTM hybrid model effectively processes data with temporal and spatial dimensions by combining the advantages of LSTM networks with CNNs. Tasks where temporal features (like sequences of frames) and spatial features (like frames in a video) are crucial, such as event recognition, video analysis, and time series prediction, benefit greatly from this architecture. The model can better learn complex patterns that span both dimensions because of such combination. 7. Experimental Results This research faced the difficulties of collecting data in Arabic and choosing the best ones for this work. After that, different DL models have been used, with certain changes made, such changing the number of layers and selecting the right design. Through testing, assessment, and comparison of several models, this procedure sought to show how to rate the most successful results. The models' word embeddings varied, and they were tested using a variety of approaches in an effort to outperform the findings of earlier studies. The focus has been directed towards optimization and quick identification of the most efficient and effective methods to achieve outstanding results. The best method was DNN using word embedding for spacy, achieving 78%, while DNN also achieved the highest performance for Fast text, achieving 76%. As for BERT, it achieved a 99.1% success rate, and AraBERT achieved 99.3% according to the accuracy scale. As a result, the AraBERT is the best method from our proposed deep learning, but after using transformer. Table 1 Model Accuracy% Spacy Fast Text DNN 78% 76% CNN 47% 46% LSTM 51% 49% Hybrid CNN + LSTM 54% 53% BERT 99.1% AraBERT 99.3% As it has been mentioned earlier, the process of identifying fake news is not easy. To handle this issue, four DL models have been proposed to detect or identify fake news depending on the collected database. After completing the training process using two types of embedding, namely Spacy and Fast Text, the best method was the deep neural network. However, after resorting to more comprehensive methods such as BERT and AraBERT, it has been found that the best method is AraBERT, which achieved results of 99.3% based on the database on which it was trained. The reason is that this network was trained on a huge amount of data in the Arabic language, which gave it the strength in discrimination. It is currently considered one of the best bidirectional deep learning methods for the Arabic language. 8. Metrics Several metrics have been calculated for the best approach (AraBERT) with a confusion matrix for this approach, as shown in the table below. Table 2 Metrics for the AraBERT Approach Accuracy Precision Racall F1-Score 99.3% 99.44% 99.16% 49.64% Table 3 Confusion Matrix for the AraBERT Approach Real Fake Real 4972 28 Fake 42 4958 These metrics selected the AraBERT, as it is the best method, and it is suitable for the Arabic language. Specifically, it was trained on data in the Arabic language. 9. Conclusion The detection of fake news has drawn more attention recently because of its detrimental impacts and quick propagation. Given the limitation of existing literature, the detection of fake news in Arabic is a potential yet difficult study topic. Because there are no consistent features to differentiate between real and fake news, there was a trouble with the dataset and detecting fake news. The variety of orthography, dialects, rich vocabulary, and lack of Arabic databases are some of the obstacles that fake news detection in Arabic encounters. In order to determine if Arabic news is real or fake, a method for identifying fake news in Arabic was presented in this research. The English dataset was translated into Arabic, though, because there aren't many databases in this subject. The performance regarding the suggested system was then verified using another Arabic dataset that was accessible. Fasttext and Spacy were two of the four methods that have been employed with two words embedding. DNN, which used spacy to achieve 78% performance, was the best method. With a performance of 51% for Spacy and 49% for Fasttext, LSTM model achieved a good performance. On the accuracy scale, however, it scored a 99.1% success rate following converting to BERT, whereas AraBERT earned a 99.3% success rate. AraBERT is thus better. This is due to the fact that both AraBERT and BERT have extensive Arabic article training. We propose that word embedding approaches with semantic support can be used in future work to support DL. Declarations Author contributions Author Contribution: Enas Tariq Khudair: Conceptualization, Methodology, and Writing– original draft.Onsa Lazzez, Mourad Zaied, Tarek M. Hamdani: Supervision, Data curation, Software, and Methodology. Ahmed T. Sadiq: Writing, Review & editing, Visualization, Formal analysis, and Investigation. Habib Chabchoub, Adel M. Alimi: Project administration, Conceptualization, Investigation, Validation, Supervision, and Review & editing. Funding of Work The manuscript did not require funding from any organization. Clinical Trial The manuscript does not contain any clinical trial. Publish Declaration Consent to Publish declaration: applicable Consent to Participate Declaration Consent to Participate declaration: not applicable Ethical Approval This manuscript does not pose any ethical concerns, as it does not utilize data of individuals' people but rather standard data. References Mohammad Mustafa Taye.” Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions”. Published 25April 2023. Divya P, Aiswarya VB Student.” DEEP LEARNING: METHODS AND APPLICATIONS”. 2021 JETIR July 2021, Volume 8, Issue 7 www.jetir.org (ISSN-2349-5162). ADITYA DUBEY, AKHTAR RASOOL.” RECENT ADVANCES AND APPLICATIONS OF DEEP LEARNING METHOD”. Journal of Theoretical and Applied Information Technology 15th July 2022. Vol.100. No 13 © 2022 Little Lion Scientific. Augmented Reality and Ontology-based Approach for Enhancing Arabic Text Recognition and VisualizationSsss Augmented Reality and Ontology-based Approach for Enhancing Arabic Text Recognition and Ibomoiye Domor Mienye , Theo G. Swart.” A Comprehensive Review of Deep Learning: Architectures, Recent Advances, and Applications”. Published: 27 November 2024, Information 2024, 15 (12), 755; https://doi.org/10.3390/info15120755. Kamal Choudhary, Brian DeCost, Chi Chen, Anubhav Jain, Francesca Tavazza, Francesca Tavazza, Cheol Woo Park, Alok Choudhary, Ankit Agrawal, Simon J. L. Billinge, Elizabeth Holm, Shyue Ping Ong, and Chris Wolverton.” Recent advances and applications of deep learning methods in materials science”. Article number: 59 (2022). Mohamm adreza Iman1, Khaled Rasheed and Hamid Reza Arabnia. ” A Review of Deep Transfer Learning and Recent Advancements”. Springer Nature 2021 LATEX template. Prasad Kulkarni, Suyash Karwande, Rhucha Keskar, Prashant Kale, and Sumitra Iyer, " Fake News Detection using Machine Learning”. ITM Web of Conferences 40, 03003 (2021). Mohammed E.Almandouh, Mohammed F. Alrahmawy, Mohamed Eisa, Mohamed Elhoseny& A. S. Tolba. ” Ensemble based high performance deep learning models for fake news detection”. Published: 04 November 2024, Article number: 26591. I. Kadek Sastrawan, I.P.A. Bayupati, Dewa Made Sri Arsa.” Detection of fake news using deep learning CNN–RNN based methods”. Received 3 July 2021; received in revised form 15 September 2021; accepted 6 October 2021 Available online 22 October 2021. Azka Kishwara, Adeel Zafara. " Fake News Detection on Pakistani News Using Machine Learning and Deep Learning". April 15, 2022. Sangita M. Jaybhaye, Vivek Badade, Aryan Dodke, Apoorva Holkar, and Priyanka Lokhande. " Fake News Detection using LSTM based deep learning approach ". ITM Web of Conferences 56, 03005 (2023). Sheng How Kong, Li Mei Tan, Keng Hoon Gan, Nur Hana Samsudin.” Fake News Detection using Deep Learning”. 2020. Kushal Agarwalla, Shubham Nandan, Varun Anil Nair, D. Deva Hema.” Fake News Detection using Machine Learning and Natural Language Processing”. Published: 10 November 2022. International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878 (Online), Volume-7, Issue-6, March 2019. Mohamad Abdolahi, Morteza Zahedi.” A New Method for Sentence Vector Normalization Using Word2vec”. Int. J. Nonlinear Anal. Appl. 10 (2019) No. 2, 87-96 ISSN: 2008-6822 (electronic) https://dx.doi.org/10.22075/ijnaa.2019.4177. Khalid Shaker, Arwa Alqudsi.” Approach for Detecting Arabic Fake News using Deep Learning”. DOI: https://doi.org/10.52866/ijcsm.2024.05.03.049 Received April 2024; Accepted June 2024; Available online August 2024. Hanen Himdi, George Weir, Fatmah Assiri, Hassanin Al‑Barhamtoshy.” Arabic Fake News Detection Based on Textual Analysis”. Arabian Journal for Science and Engineering (2022) 47:10453–10469. Ahmed Redha Mahlous, Ali Al-Laith.” Fake News Detection in Arabic Tweets during the COVID-19 Pandemic”. (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 12, No. 6, 2021. Ali Bou Nassif, Ashraf Elnagar, Omar Elgendy, Yaman Afadar.” Arabic fake news detection based on deep contextualized embedding models”. Neural Computing and Applications (2022) 34:16019–16032. Bohan Li, Yutai Hou, Wanxiang Che.” Data augmentation approaches in natural language processing: A survey”. Available online 24 March 2022 https://doi.org/10.1016/j.aiopen.2022.03.001 Sebastien C. Wong; Adam Gatt; Victor Stamatescu; Mark D. McDonnell. ” Understanding Data Augmentation for Classification: When to Warp?”. Date Added to IEEE Xplore : 26 December 2016 DOI: 10.1109/DICTA.2016.7797091. Sosuke Kobayashi.” Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations”. June 1 - 6, 2018. Xiang Zhang Junbo Zhao Yann LeCun.” Character-level Convolutional Networks for Text Classification”. posted in Feb 2015 as arXiv:1502.01710. Connor Shorten, Taghi M. Khoshgoftaar and Borko Furht.” Text Data Augmentation for Deep Learning”. Shorten et al. J Big Data (2021) 8:101, https://doi.org/10.1186/s40537-021-00492-0. Heng Yang, Ke Li.” Boosting Text Augmentation via Hybrid Instance Filtering Framework”. Findings of the Association for Computational Linguistics: ACL 2023, pages 1652–1669 July 9-14, 2023 ©2023 Association for Computational Linguistics. Additional Declarations No competing interests reported. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7223076","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":519929147,"identity":"4272456e-befc-496f-b8c3-798ea86774db","order_by":0,"name":"Enas Tariq Khudair","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABRklEQVRIie2RMUvDQBSALxTa5XR+JSX5C68EWsVA/objhUC6ZMhUIjhECukiuGao+hdaCp0bAnGJ4thRKWSqEHEUxORihYSAjiL5uHscd+/j3rsjpKHhD6K5WRA+iD4nBNfE4ZvtbB4V55SPEvgVDeRKzDNyBX5UzFwhglejVEFxsk2zZGsIxiI8uwklrTNJtrYDknzKhKcXj/QO1+VeetEAMsU59k07uF+FCqXRUPFjUPob1upfe4R2ywoCG+QlObixMLhYhfolWG3xwAN94bN8QShWldEbL6xQZpki776VznutYiHw9rni5rfQQrkF1m7VK2NgCAbGiR240UihsTkQadYL0udJd/YA1V40f7RMU1T1+Z2xfHXPT6TONExE6qiSPDWCdDdWtcqLcdj+F8oFPOpu8UE1Sj3y1T7110pDQ0PD/+QT5uZw43R84zQAAAAASUVORK5CYII=","orcid":"","institution":"University of Sfax, University of Sfax, National Engineering School of Sfax (ENETCOM)","correspondingAuthor":true,"prefix":"","firstName":"Enas","middleName":"Tariq","lastName":"Khudair","suffix":""},{"id":519929148,"identity":"7d15e97b-929e-4b6d-a2d9-6001004e9486","order_by":1,"name":"Onsa Lazzez","email":"","orcid":"","institution":"University of Sfax, University of Sfax, National Engineering School of Sfax (ENETCOM)","correspondingAuthor":false,"prefix":"","firstName":"Onsa","middleName":"","lastName":"Lazzez","suffix":""},{"id":519929149,"identity":"209b5877-2668-477c-b132-3403bbe05742","order_by":2,"name":"Mourad Zaied","email":"","orcid":"","institution":"University of Gabes","correspondingAuthor":false,"prefix":"","firstName":"Mourad","middleName":"","lastName":"Zaied","suffix":""},{"id":519929150,"identity":"8e1f4dfd-9d1f-498a-b05f-7d2f43f76157","order_by":3,"name":"Tarek M. Hamdani","email":"","orcid":"","institution":"University of Monastir","correspondingAuthor":false,"prefix":"","firstName":"Tarek","middleName":"M.","lastName":"Hamdani","suffix":""},{"id":519929151,"identity":"da1ecbd0-d60b-41a3-9215-dc57a6bcac80","order_by":4,"name":"Ahmed T. Sadiq","email":"","orcid":"","institution":"University of Technology-Iraq","correspondingAuthor":false,"prefix":"","firstName":"Ahmed","middleName":"T.","lastName":"Sadiq","suffix":""},{"id":519929152,"identity":"05e184ab-176d-4656-a10d-95a4dbf7aa06","order_by":5,"name":"Habib Chabchoub","email":"","orcid":"","institution":"Al Ain University","correspondingAuthor":false,"prefix":"","firstName":"Habib","middleName":"","lastName":"Chabchoub","suffix":""},{"id":519929153,"identity":"70873510-cb13-43e4-ba41-45935f03b509","order_by":6,"name":"Adel M. Alimi","email":"","orcid":"","institution":"University of Johannesburg","correspondingAuthor":false,"prefix":"","firstName":"Adel","middleName":"M.","lastName":"Alimi","suffix":""}],"badges":[],"createdAt":"2025-07-26 20:08:13","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7223076/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7223076/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":92200829,"identity":"1cf9fc40-4839-4002-8ff4-babcf408af21","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"docx","order_by":0,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":445169,"visible":true,"origin":"","legend":"","description":"","filename":"DetectFakeNewsusingDeepLearningpaper3.docx","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/54d39d204fd420338a8512c7.docx"},{"id":92201507,"identity":"33c59225-d7c3-47b1-bc63-d795351f9ddb","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"json","order_by":1,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":8551,"visible":true,"origin":"","legend":"","description":"","filename":"b511defa5a5645b1a499364c76581364.json","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/2f4287a2e00e8fcafbd4a058.json"},{"id":92200833,"identity":"e88a1c4d-0cff-4ff7-88ee-dd75552135a8","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"xml","order_by":2,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":76533,"visible":true,"origin":"","legend":"","description":"","filename":"b511defa5a5645b1a499364c765813641enriched.xml","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/2320ec788a8410df83d0a192.xml"},{"id":92201508,"identity":"b2b78427-f861-409f-b803-b3f1f19341c9","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"jpeg","order_by":3,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":41218,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage1.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/350b4fc0380d1a85fcebd3ca.jpeg"},{"id":92200827,"identity":"351ab718-07c6-4887-b9b8-9b29753a60e7","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"jpeg","order_by":4,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":92260,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage2.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/999dbb0944b257572a97a549.jpeg"},{"id":92200825,"identity":"e3c56619-6816-418d-bea4-c9bfba5e78f8","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"jpeg","order_by":5,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":1074,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage3.jpeg","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/1386ccfd813aa684eeae6294.jpeg"},{"id":92200830,"identity":"eed50088-aa44-43b0-bef0-00c4e8d6b2cd","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":6,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":56450,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/5e652f56695ec922a469901f.png"},{"id":92201512,"identity":"74c63159-eed9-444b-b0c3-25a20ddb60b0","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"png","order_by":7,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":119604,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/fbb51217eaa46ec608a67b9f.png"},{"id":92201511,"identity":"367fc9d6-fca9-4450-ab27-4dc677003f3a","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"png","order_by":8,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":112888,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage6.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/4af9859af0a4cf1cfbae48fa.png"},{"id":92201510,"identity":"0197a1b8-a121-4ae7-bd32-d769094a8250","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"png","order_by":9,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":74449,"visible":true,"origin":"","legend":"","description":"","filename":"floatimage7.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/d35e861ac46a44a5753110d2.png"},{"id":92200831,"identity":"cfa2e98b-51bd-4db3-aa75-1720265b0e0d","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":10,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":7459,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage1.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/b8dbc9f0a98847e5abc7d59a.png"},{"id":92200837,"identity":"848ea5d9-cebb-4d22-9537-73666bf7b395","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":11,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":13164,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage2.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/5d2bb5fa44ca8d1ac4ebec0b.png"},{"id":92201513,"identity":"9bf1ad61-74ce-4b0b-b1ab-3fdb279f4d66","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"png","order_by":12,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":935,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage3.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/91543c9a023dce857ad2a22c.png"},{"id":92200836,"identity":"f536d957-62dd-4e0d-b865-fe97d5927531","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":13,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":26720,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage4.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/07d65b018935bd7e1c6dec57.png"},{"id":92200838,"identity":"e06c1502-7e8f-4e53-b738-4596b40441cf","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":14,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":37170,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage5.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/8c485efbeea1455c4cc53bef.png"},{"id":92200834,"identity":"6937440a-c0df-42fe-8dce-5338465c0d3c","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":15,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":39680,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage6.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/cbd88969c288b3a7ed468b76.png"},{"id":92200839,"identity":"74067e41-20fe-468b-a86d-3a5f633e961a","added_by":"auto","created_at":"2025-09-25 17:05:45","extension":"png","order_by":16,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":35242,"visible":true,"origin":"","legend":"","description":"","filename":"Onlinefloatimage7.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/2ba1796f6f007d40f8a437cf.png"},{"id":92200846,"identity":"ef505b13-c21a-4f57-a69f-da4825687da8","added_by":"auto","created_at":"2025-09-25 17:05:45","extension":"xml","order_by":17,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":73907,"visible":true,"origin":"","legend":"","description":"","filename":"b511defa5a5645b1a499364c765813641structuring.xml","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/ca69c7b76423ccfc7d63e324.xml"},{"id":92200842,"identity":"edccebd0-7f2e-4c40-8be9-2269f2d10f84","added_by":"auto","created_at":"2025-09-25 17:05:45","extension":"html","order_by":18,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":86999,"visible":true,"origin":"","legend":"","description":"","filename":"earlyproof.html","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/6bda8c53a0dd6cc520ce55fd.html"},{"id":92200819,"identity":"48a442ce-ed9a-4653-93d6-2865b6cf778c","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":16802,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eBlock diagram of the suggested system\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/c5e150a8da3a4fd9a80efb6d.png"},{"id":92200820,"identity":"c39b4bd9-c02d-4dc6-9d8e-1869c64b82b5","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":76506,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eDNN Architecture\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/7a4607923f2dddf6a647f6b6.png"},{"id":92200823,"identity":"79e8329e-9116-422f-b133-2cf60cde6c3a","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":154037,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCNN Architecture\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/0d091899e0ae252a88fa2904.png"},{"id":92200821,"identity":"bcc5f931-1d33-407f-a280-1bc82ce1e0dc","added_by":"auto","created_at":"2025-09-25 17:05:44","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":185313,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eLSTM Architecture\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/a9f435ed9a02a7b393194a73.png"},{"id":92201509,"identity":"3441140d-bf07-4431-8ff5-2a4cdf40494b","added_by":"auto","created_at":"2025-09-25 17:13:44","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":122492,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCNN and LSTM Architecture\u003c/strong\u003e\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/18d76d06c227b61a62ceab67.png"},{"id":94490021,"identity":"d4556033-7f4d-4125-97cd-be679b4e43a5","added_by":"auto","created_at":"2025-10-27 17:07:01","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":1125749,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7223076/v1/d1954ded-022b-4054-8d91-477d49d287c6.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Detecting Fake News Using Deep Learning Approaches","fulltext":[{"header":"1. Introduction","content":"\u003cp\u003eOne of the significant and challenging problems is fake news, which faces numerous obstacles. Fake news is presently being studied because of the nature of languages and the challenge of spotting it. DL is a ML method that aims to depict the world as nested hierarchy of concepts that are identified by DL architecture automatically [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e]. Specifically, DL—also referred to as deep structured learning—is a subset of ML technology that learns representations through artificial neural networks (ANNs). Semi-supervised, supervised, and unsupervised learning are the three learning categories. DL approaches that use DNNs have gained popularity as a result of the development of high-performance computer facilities. In particular, DL has more flexibility and functionality because it could manage a large number of functions while working with unstructured data. In essence, DL feature algorithms transfer data through a number of layers, each of which has the ability to progressively extract features before passing them on to the next layer. Low-level features are extracted in the first layer, and features are combined in the next layers in order to provide a comprehensive representation. DL was developed inseparably with time, achieving a blast of information from all over the world and in all kinds of structures. This information, which is known as \"large information,\" comes from various sources, which include online searches, social media, web-based commercial platforms, and online movies. Through fintech applications, such as distributed computing, this enormous amount of information is instantly accessible and could be shared.\u003c/p\u003e\u003cp\u003eDL can be defined as an artificial intelligence (AI) function mimicking the way that the human brain can process data and generate patterns for use in decision-making. In AI, it is a subset of ML that includes networks which can learn in unsupervised manner from unstructured data. To be more specific, DL encompasses a variety of networks, including CNN (Convolutional Neural Network), DBN (Deep Belief Network), Recursive Neural Network, RNN (Recurrent Neural Network), and a lot more. In order to achieve this, neural networks (NNs) are extremely helpful in sentence modelling, sentence classification, vector representation, word representation estimation, and future presentation [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eA popular area of ML is DL, which draws inspiration from how the biological brain of humans handles information processing and draws conclusions. In order to improve accuracy for complex tasks, DL employs multiple layers of neurons, each of which is coupled to success layer. The use of an expert is reduced because to the development of DL from ML [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eTo make predictions, learn from data, and maximize performance, DL models are made up of a number of basic building components. Each of these elements—optimization algorithms, layers, activation functions, and loss functions—is essential to the model's capacity for learning from and generalizing from data [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eWith quickly developing applications spanning atomistic, image-based, textual, and spectral data modalities, DL is one of the materials data science subjects with the quickest rate of growth. It enables automated feature recognition and analysis of unstructured data. In this sense, the use of DL methods in atomistic prediction, has been stimulated by the recent growth of huge materials databases [\u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e].\u003c/p\u003e\u003cp\u003eFor various ML problems during the past 20 years, DL has provided the solution. Dependency on a large amount of labeled data and training costs are its two main drawbacks. Through reusing knowledge from a source data or task in training on a target data or work, transfer learning (TL) in DL, sometimes referred to as Deep Transfer Learning (DTL), aims to lower such cost and dependency. Most DTL approaches that are used are based upon networks or models [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e].\u003c/p\u003e\u003cp\u003e\u003cb\u003eOur contributions can be summarized below\u003c/b\u003e:\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003col\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eReading and preparing the dataset and breaking the task down into manageable parts.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eThe first section used Spacy and Fasttext methods to perform preprocessing on the dataset.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eThe second section used transformer models such as BERT and AraBERT.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eThe third section used different deep learning methods, including RNN, CNN, and LSTM.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eCNN and RNN were combined to get the results and compare them with the methods without merging.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003cspan\u003e\u003cli\u003e\u003cp\u003eThe accuracy was adopted for evaluating the performance regarding the proposed system, combining the two methods, the Spacy and the Fasttext.\u003c/p\u003e\u003c/li\u003e\u003c/span\u003e\u003c/ol\u003e"},{"header":"2. Related work","content":"\u003cp\u003eThrough utilizing ML-based fake news detection model, the authors in [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e] offered a solution. Prerequisite data for the model must be taken from a variety of news websites. Data extraction, which is then utilized for creating datasets, was accomplished using the web scraping approach. True datasets and false datasets are the two main groups into which the data was categorized. Logistic Regression, Random Forest, KNN, Decision Tree, and Gradient Booster are the classifiers utilized to classify the data. Data was classified as either false or true depending on the result that has been obtained. The user could therefore determine on the webserver if the provided news is fake or not.\u003c/p\u003e\u003cp\u003eTo detect the fake text, authors of [\u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e] suggested many classifier types. In order to determine which classifier would be most effective for identifying fake news in Arabic, they next tested each of such classifiers using two distinct Arabic datasets. Fast Text library was used with all of such classifiers to improve text classification by offering effective word representations.\u003c/p\u003e\u003cp\u003eThe authors of [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e] employed a DL method that incorporated pre-trained word embedding that was trained with the use of four distinct datasets with many architectures, like the Bidirectional LSTM, CNN, and ResNet. For lessening data imbalances between the classes, every data undergoes a data augmentation procedure that utilizes back-translation method. The outcomes demonstrated that, across all evaluated datasets, the Bidirectional LSTM architecture performed better than ResNet and CNN.\u003c/p\u003e\u003cp\u003eIn [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e], the researchers used many cutting-edge AI methods to assess the created dataset. A total of five ML methods has been specifically employed: KNN, Naïve Bayes, SVM, Logistic Regression, and Decision Trees. GloVe and BERT embeddings, on the other hand, have been employed with two DL methods, LSTM and CNN. F1-score, precision, recall, and accuracy were used for comparing the performance regarding all applied models as well as embeddings. According to the results, the best performance was achieved with LSTM initialized with GloVe embeddings. Additionally, the study examined the misclassified samples through contrasting them with human judgements.\u003c/p\u003e\u003cp\u003eA thorough analysis of ML and DL-based methods for the detection of fake news was published in [\u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. For practitioners and researchers interested in creating efficient fake news detection systems with DL and ML methods, their review offered insights and recommendations. Prior to reporting or publishing on a story, news reporters frequently need to confirm its authenticity. Reporters may eliminate fake news and concentrate on covering reliable and accurate information through using fake news detection models.\u003c/p\u003e\u003cp\u003eThe authors of [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e] built DL models to identify fake news depending on news titles or content and used NLP methods for text analytics. The research's suggested solution seeks to be implemented in real-worlds social media platforms and get rid of the negative experience that users have when they acquire misleading stories from unreliable sources. Before vectorizing the text into N-gram vectors or sequence vectors with the use of terms frequency inverse document frequency (TF-IDF) or one-hot encoding, respectively, NLP methods employed text preprocessing methods, like regular expression, lemmatization, tokenization, and stop words removal. Tensor Flow was therefore selected as the framework to be utilized thanks to its integrated Keras DL libraries, which have a sizable community and enough comments on its GitHub repository to support the construction of DL-NN models.\u003c/p\u003e\u003cp\u003eSince the main objective of fake news site is to sway public opinion on particular topics, the authors of [\u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] created an accurate and reliable model that can determine whether a news piece is fake or real.\u003c/p\u003e"},{"header":"3. The proposed Work","content":"\u003cp\u003eThe primary function of the proposed system is to detect news written in Arabic, whether it is fake or not (positive or negative). The practical components of the system are explained in this section. The system consists of two main components: collecting a dataset and using Arabic text to analyze the news. In this regard, collecting the dataset is a challenging step. The second component of the system analyzes the Arabic text. Fake news detection aims to identify forgery conveyed in a given text. Here, the system prioritizes texts written in Arabic. To do this task, this section recommends using different methods, such as DL algorithms. The proposed system achieved the goal of AFN detection by employing deep learning methods. As shown in Figure (1), due to the limited availability of Arabic databases, the dataset was translated from English to Arabic and used with other datasets for validating the performance regarding the suggested system. The procedures included reading and preparing the dataset and dividing the task into manageable parts. The first part of the work used four approaches, including the Spacy and FastText methods. The second part of the work used transformed models such as BERT and AraBERT. The third part used various deep learning methods, including DNNs, CNNs, and LSTMs, as well as hybrid CNN-LSTM models. All of these models were created using Fasttext and Spacey embedding methods. The best approach for deep neural networks was the achieved DNN.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003cp\u003e\u003cb\u003eIn the next subsections, each part of the proposed system architecture will explained\u003c/b\u003e:\u003c/p\u003e\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e\u003ch2\u003e3.1. Arabic Fake News Dataset\u003c/h2\u003e\u003cp\u003eTo develop our classification model, a Kaggle dataset has been used for fake news detection. English was the initial language used to write the dataset. But since the work was conducted on an Arabic fake news detection feature, Python code and the \"google-trans\" package have been used for translating it into Arabic before exporting it as an Excel file. Our English-to-Arabic translator code was unable to translate several Russian and German characters in the dataset. As a result, such characters were eliminated as the initial preprocessing step, followed by the elimination of any English words that were still difficult to translate into Arabic, including terms and names (e.g., Donald Trump, Jennifer, etc.). Actually, 20,000 instances were first trained using Kaggle dataset. Following preprocessing, however, the the dataset included 10,000 instances, with 20% going to testing and 80% to training, with 5,000 of those being fake and 5,000 being real. Arabic and English numbers have been also eliminated, in addition to superfluous punctuation and pauses. Lastly, a pre-trained model segmentation tool has been used in order to divide each sentence into tokens throughout the training phase.\u003c/p\u003e\u003c/div\u003e"},{"header":"4. Word Embedding","content":"\u003cp\u003eOne of the most notable developments in NLP is Word Embedding. By converting words into vector-based numerical representations, such methods seek to improve computer models' comprehension of word relationships and meanings. Word Embedding approaches like Spacy and Fasttext map each word to a point in a high-dimensional space, in which text with comparable meanings are closely related, rather than encoding words as text strings. This shift from textual to numerical representations improves efficiency on a number of tasks, which include text classification, sentiment analysis, and machine translation. By improving models' capacity to comprehend language and interpret meaning, Word Embedding approaches expand the potential applications of AI.\u003c/p\u003e\u003cdiv id=\"Sec5\" class=\"Section2\"\u003e\u003ch2\u003e4.1. Fast Text Embedding\u003c/h2\u003e\u003cp\u003eThe Facebook AI Research team created the word embedding technology. Through adding subword information, it is intended to enhance conventional word representation models and produce better embedding, particularly for morphologically rich languages. Fast Text can comprehend and generate embeddings for morphologically complex languages by converting words down into a set of n-grams, which aids in understanding the internal structure regarding words. Conventional word embedding models, like Word 2Vec, rate a fixed vocabulary throughout training, which might cause problems with OOV words. In order to address this issue, Fast Text combines the embeddings of the n-grams that make up OOV words to create an embedding for them. This feature is especially helpful in practical applications where new words\u0026mdash;like new slang and technical terms\u0026mdash;occur often.\u003c/p\u003e\u003c/div\u003e\u003cdiv id=\"Sec6\" class=\"Section2\"\u003e\u003ch2\u003e4.2. Spacy Embedding\u003c/h2\u003e\u003cp\u003eIt is a robust and advanced Python NLP library that combines high performance and ease of use, especially for production use. It is appropriate for a variety of NLP and understanding applications due to its extensive feature set, support for contemporary word embeddings, and deep feature learning models. Data scientists and developers that work with language data frequently choose it because it provides a broad range of functions for effective text processing as well as analysis. Tokenization, part-of-speech tagging, named entity recognition (NER), multilingual support, pipeline engineering, dependency analysis, named entity recognition (NER), word vectors and embedding, semantic analysis, text classification, and visualization are some of the key features.\u003c/p\u003e\u003c/div\u003e"},{"header":"5. The AraBERT Model","content":"\u003cp\u003eA pre-trained language model created especially for processing Arabic text, AraBERT can efficiently handle a variety of NLP tasks. With regard to language understanding, its BERT-based architecture makes use of the most recent developments in DL. AraBERT offers a strong framework for creating complex language applications to scholars and practitioners engaged in Arabic NLP. Key elements of AraBERT include its pre-training on a large dataset of Arabic text and its comprehension of the language's semantics, syntax, and common phrases. It is intended to capture the distinctive complexities and characteristics regarding Arabic language, including its dialectal variances, morphology, and script. Similar to original BERT, AraBERT understands a word in the context of all other words in a sentence, making it especially effective for tasks needing accurate comprehension. In comparison to models trained from scratch, AraBERT could be optimized for particular follow-up tasks, such as named entity recognition, question answering, and sentiment analysis, greatly enhancing performance. There are various AraBERT versions, including AraBERTv2 and AraBERTv1, each of which was trained on a different dataset or optimized for a particular set of tasks. Users are able to select the version that best meets their requirements. Token classification, classification tasks, and sequence classification are just a few of the NLP applications that AraBERT could be used for. Developers and researchers can easily utilize AraBERT for both non-commercial and commercial uses because it is usually available in an open-source format.\u003c/p\u003e"},{"header":"6. Neural Networks","content":"\u003cdiv id=\"Sec9\" class=\"Section2\"\u003e\u003ch2\u003e6.1. Deep Neural Networks (DNNs):\u003c/h2\u003e\u003cp\u003eAdvances in a variety of domains are made possible by DNNs, a versatile and powerful method for AI and ML. Through their capacity to represent complex relations among data, DNNs continue to propel technological and scientific advancements. Multiple hidden layers, an input layer, and an output layer make up DNNs. The ability for learning the complex features and patterns in data is made possible by the existence of several hidden layers. Neurons make up each layer of a DNN. After applying a weighted sum and adding a bias, each neuron processes the input from the preceding layer before passing it via a nonlinear activation function. The network can learn complex mappings between outputs and inputs by being fed such nonlinear activation functions. Data is fed into the input layer during the process of forward propagation, and it moves through each layer until reaching the output layer, in which classifications or predictions are made. DNNs learn by calculating the error between the actual and predicted outputs, a process known as backpropagation. In order to reduce error, the model after that modifies its weights in accordance with the loss gradient with respect to such weights. The degree to which the DNN's predictions and actual values agree is determined by the loss function. The cross-entropy loss for tasks of classification as well as the mean squared error regarding regression tasks are examples of common loss functions. To help the model converge to a minimum loss, the weights are updated during training using a variety of optimization algorithms. Since DNNs frequently identify hierarchical representations of data, they might automatically learn features from raw data without requiring a great deal of manual feature engineering. They could be used with unstructured data, time series, and structured data, among other data types and domains. DNNs are excellent for complex tasks because they could generalize well with enough data.\u003c/p\u003e\u003cp\u003eFigure (2) shows the structure of a DNN-based model. It consists of three layers: hidden, output, and input. Keras was used to define a neural network model with a serial API. Utilizing the leaky ReLU activation function, the initial dense layer consists of eight units. The training data's shape serves as the basis for the input shape. The leaky rule comprises eight units in the second thick layer. The input's shape is deduced from the layer before it, the batch normalization layer, which aids in normalizing the inputs. Eight units comprise another dense layer and a Leaky Relu\u0026mdash;leakage layer with leakage rate 0.5. Dropout is used as a regularization method. The output layer has two units and uses softmax activation. Particularly, this is a binary classification problem. The softmax function predicts the class that has the highest probability. Consequently, as neural network for binary classification with regularization methods has been created.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003c/div\u003e\u003cdiv id=\"Sec10\" class=\"Section2\"\u003e\u003ch2\u003e6.2. Convolution Neural Networks (CNNs):\u003c/h2\u003e\u003cp\u003eCNNs are crucial to modern AI applications and have completely transformed the field of computer vision by offering strong instruments for effectively and efficiently evaluating visual data. They are a particular type of DL model made particularly for handling structured network data, especially images. They do exceptionally well in tasks like object detection, segmentation, and image classification. In contrast to typical NNs, which use convolution in place of global matrix multiplication, pooling layers, activation functions, and fully connected layers, such networks have locally applied convolutional layers. Due to their relatively basic network architecture, CNNs are crucial for DL since they require fewer parameters, which increases processing speed efficiency. CNNs could also help with speech and handwriting recognition, image analysis, and natural language understanding because they require little preprocessing. In the presented study, a custom CNN that is desiged for particular sequence tasks is programmed using Keras and Sequence API.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003c/div\u003e\u003cdiv id=\"Sec11\" class=\"Section2\"\u003e\u003ch2\u003e6.3. Long Short-term Memory (LSTM):\u003c/h2\u003e\u003cp\u003eLSTM modules address the shortcomings of conventional RNNs and are an effective tool for sequential data analysis. They perform exceptionally well in various applications in speech recognition, time series prediction, and NLP due to their capacity to selectively forget and remember information. Long-term information retention is possible thanks to the memory cells found in LSTMs. More specifically, forget gates, input gates, and output gates are the three types of gates that LSTMs use to control information flow.\u003c/p\u003e\u003cp\u003eA more complex NN with bidirectional LSTM layers has been constructed using Keras and a serial API. Each layer of the model will be looked at separately: Sequences are returned by the first layer, a 16-unit bidirectional LSTM layer with leaky ReLU activation. The input shape for such layer is determined by the training data dimensions. A batch normalization layer comes after the first layer of LSM. The model also includes an additional 16-unit bidirectional LSTM layer that generates sequences. One more batch normalizing layer comes after this phase. The model's third LSTM layer consists of 32 bidirectional units that return sequences. A batch normalization layer comes after this layer. The fourth layer of the LSTM simply provides the final state; sequences are not returned, despite having 32 units and being bidirectional.\u003c/p\u003e\u003cp\u003eThis layer is followed by a batch normalization layer. Leaky ReLU activation and a dense layer with 128 units are included in the model. The model contains an additional thick layer with the same properties. Leaky ReLU is activated at 128 units by the third dense layer. A dense layer with 256 units and Leaky ReLU activation is also present. Specifically, the output layer of the model is suitable for binary classification due to its softmax activation and two units.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003c/div\u003e\u003cdiv id=\"Sec12\" class=\"Section2\"\u003e\u003ch2\u003e6.4. Hybrid CNN and LSTM:\u003c/h2\u003e\u003cp\u003eCNN-LSTM hybrid model effectively processes data with temporal and spatial dimensions by combining the advantages of LSTM networks with CNNs. Tasks where temporal features (like sequences of frames) and spatial features (like frames in a video) are crucial, such as event recognition, video analysis, and time series prediction, benefit greatly from this architecture. The model can better learn complex patterns that span both dimensions because of such combination.\u003c/p\u003e\u003cp\u003e\u003c/p\u003e\u003c/div\u003e"},{"header":"7. Experimental Results","content":"\u003cp\u003eThis research faced the difficulties of collecting data in Arabic and choosing the best ones for this work. After that, different DL models have been used, with certain changes made, such changing the number of layers and selecting the right design. Through testing, assessment, and comparison of several models, this procedure sought to show how to rate the most successful results. The models' word embeddings varied, and they were tested using a variety of approaches in an effort to outperform the findings of earlier studies. The focus has been directed towards optimization and quick identification of the most efficient and effective methods to achieve outstanding results. The best method was DNN using word embedding for spacy, achieving 78%, while DNN also achieved the highest performance for Fast text, achieving 76%. As for BERT, it achieved a 99.1% success rate, and AraBERT achieved 99.3% according to the accuracy scale. As a result, the AraBERT is the best method from our proposed deep learning, but after using transformer.\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"3\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\" morerows=\"1\" rowspan=\"2\"\u003e\u003cp\u003eModel\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colspan=\"2\" nameend=\"c3\" namest=\"c2\"\u003e\u003cp\u003eAccuracy%\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003eSpacy\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eFast Text\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u003cp\u003eDNN\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003e78%\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003e76%\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eCNN\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e\u003cb\u003e47%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e\u003cb\u003e46%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eLSTM\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e\u003cb\u003e51%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e\u003cb\u003e49%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eHybrid CNN\u0026thinsp;+\u0026thinsp;LSTM\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e\u003cb\u003e54%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e\u003cb\u003e53%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eBERT\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colspan=\"2\" nameend=\"c3\" namest=\"c2\"\u003e\u003cp\u003e\u003cb\u003e99.1%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e\u003cb\u003eAraBERT\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colspan=\"2\" nameend=\"c3\" namest=\"c2\"\u003e\u003cp\u003e\u003cb\u003e99.3%\u003c/b\u003e\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003eAs it has been mentioned earlier, the process of identifying fake news is not easy. To handle this issue, four DL models have been proposed to detect or identify fake news depending on the collected database. After completing the training process using two types of embedding, namely Spacy and Fast Text, the best method was the deep neural network. However, after resorting to more comprehensive methods such as BERT and AraBERT, it has been found that the best method is AraBERT, which achieved results of 99.3% based on the database on which it was trained. The reason is that this network was trained on a huge amount of data in the Arabic language, which gave it the strength in discrimination. It is currently considered one of the best bidirectional deep learning methods for the Arabic language.\u003c/p\u003e"},{"header":"8. Metrics","content":"\u003cp\u003eSeveral metrics have been calculated for the best approach (AraBERT) with a confusion matrix for this approach, as shown in the table below.\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003cp\u003eMetrics for the AraBERT Approach\u003c/p\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"4\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u003cp\u003eAccuracy\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003ePrecision\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eRacall\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c4\"\u003e\u003cp\u003eF1-Score\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003e99.3%\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e99.44%\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e99.16%\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c4\"\u003e\u003cp\u003e49.64%\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003e\u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab3\" border=\"1\"\u003e\u003ccaption language=\"En\"\u003e\u003cdiv class=\"CaptionNumber\"\u003eTable 3\u003c/div\u003e\u003cdiv class=\"CaptionContent\"\u003e\u003cp\u003eConfusion Matrix for the AraBERT Approach\u003c/p\u003e\u003c/div\u003e\u003c/caption\u003e\u003ccolgroup cols=\"3\"\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e\u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e\u003cthead\u003e\u003ctr\u003e\u003cth align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/th\u003e\u003cth align=\"left\" colname=\"c2\"\u003e\u003cp\u003eReal\u003c/p\u003e\u003c/th\u003e\u003cth align=\"left\" colname=\"c3\"\u003e\u003cp\u003eFake\u003c/p\u003e\u003c/th\u003e\u003c/tr\u003e\u003c/thead\u003e\u003ctbody\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eReal\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e4972\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e28\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003ctr\u003e\u003ctd align=\"left\" colname=\"c1\"\u003e\u003cp\u003eFake\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c2\"\u003e\u003cp\u003e42\u003c/p\u003e\u003c/td\u003e\u003ctd align=\"left\" colname=\"c3\"\u003e\u003cp\u003e4958\u003c/p\u003e\u003c/td\u003e\u003c/tr\u003e\u003c/tbody\u003e\u003c/colgroup\u003e\u003c/table\u003e\u003c/div\u003e\u003c/p\u003e\u003cp\u003eThese metrics selected the AraBERT, as it is the best method, and it is suitable for the Arabic language. Specifically, it was trained on data in the Arabic language.\u003c/p\u003e"},{"header":"9. Conclusion","content":"\u003cp\u003eThe detection of fake news has drawn more attention recently because of its detrimental impacts and quick propagation. Given the limitation of existing literature, the detection of fake news in Arabic is a potential yet difficult study topic. Because there are no consistent features to differentiate between real and fake news, there was a trouble with the dataset and detecting fake news. The variety of orthography, dialects, rich vocabulary, and lack of Arabic databases are some of the obstacles that fake news detection in Arabic encounters. In order to determine if Arabic news is real or fake, a method for identifying fake news in Arabic was presented in this research. The English dataset was translated into Arabic, though, because there aren't many databases in this subject. The performance regarding the suggested system was then verified using another Arabic dataset that was accessible. Fasttext and Spacy were two of the four methods that have been employed with two words embedding. DNN, which used spacy to achieve 78% performance, was the best method. With a performance of 51% for Spacy and 49% for Fasttext, LSTM model achieved a good performance. On the accuracy scale, however, it scored a 99.1% success rate following converting to BERT, whereas AraBERT earned a 99.3% success rate. AraBERT is thus better. This is due to the fact that both AraBERT and BERT have extensive Arabic article training.\u003c/p\u003e\u003cp\u003eWe propose that word embedding approaches with semantic support can be used in future work to support DL.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eAuthor contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eAuthor Contribution: Enas Tariq Khudair: Conceptualization, Methodology, and Writing\u0026ndash; original draft.Onsa Lazzez, Mourad Zaied, Tarek M. Hamdani: Supervision, Data curation, Software, and Methodology. Ahmed T. Sadiq: Writing, Review \u0026amp; editing, Visualization, Formal analysis, and Investigation. Habib Chabchoub, Adel M. Alimi: Project administration, Conceptualization, Investigation, Validation, Supervision, and Review \u0026amp; editing.\u003c/p\u003e\n\u003ch3\u003e\u003cstrong\u003eFunding of Work\u003c/strong\u003e\u003c/h3\u003e\n\u003cp\u003eThe manuscript did not require funding from any organization.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eClinical Trial\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe manuscript does not contain any clinical trial.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003ePublish Declaration\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConsent to Publish declaration:\u0026nbsp;applicable\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConsent to Participate Declaration\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eConsent to Participate declaration: not applicable\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eEthical Approval\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis manuscript does not pose any ethical concerns, as it does not utilize data of individuals\u0026apos; people but rather standard data.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eMohammad Mustafa Taye.\u0026rdquo; Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions\u0026rdquo;. Published 25April 2023.\u003c/li\u003e\n\u003cli\u003eDivya P, Aiswarya VB Student.\u0026rdquo; DEEP LEARNING: METHODS AND APPLICATIONS\u0026rdquo;. 2021 JETIR July 2021, Volume 8, Issue 7 www.jetir.org (ISSN-2349-5162).\u003c/li\u003e\n\u003cli\u003eADITYA DUBEY, AKHTAR RASOOL.\u0026rdquo; RECENT ADVANCES AND APPLICATIONS OF DEEP LEARNING METHOD\u0026rdquo;. Journal of Theoretical and Applied Information Technology 15th July 2022. Vol.100. No 13 \u0026copy; 2022 Little Lion Scientific. Augmented Reality and Ontology-based Approach for Enhancing Arabic Text Recognition and VisualizationSsss Augmented Reality and Ontology-based Approach for Enhancing Arabic Text Recognition and\u003c/li\u003e\n\u003cli\u003eIbomoiye Domor Mienye\u003csup\u003e, \u003c/sup\u003eTheo G. Swart.\u0026rdquo; A Comprehensive Review of Deep Learning: Architectures, Recent Advances, and Applications\u0026rdquo;. Published: 27 November 2024, \u003cem\u003eInformation\u003c/em\u003e 2024, \u003cem\u003e15\u003c/em\u003e(12), 755; https://doi.org/10.3390/info15120755.\u003c/li\u003e\n\u003cli\u003eKamal Choudhary, Brian DeCost, Chi Chen, Anubhav Jain, Francesca Tavazza, Francesca Tavazza, Cheol Woo Park, Alok Choudhary, Ankit Agrawal, Simon J. L. Billinge, Elizabeth Holm, Shyue Ping Ong, and Chris Wolverton.\u0026rdquo; Recent advances and applications of deep learning methods in materials science\u0026rdquo;. Article number: 59 (2022).\u003c/li\u003e\n\u003cli\u003eMohamm adreza Iman1, Khaled Rasheed and Hamid Reza Arabnia. \u0026rdquo; A Review of Deep Transfer Learning and Recent Advancements\u0026rdquo;. Springer Nature 2021 LATEX template. \u003c/li\u003e\n\u003cli\u003ePrasad Kulkarni, Suyash Karwande, Rhucha Keskar, Prashant Kale, and Sumitra Iyer, \u0026quot; Fake News Detection using Machine Learning\u0026rdquo;. ITM Web of Conferences 40, 03003 (2021).\u003c/li\u003e\n\u003cli\u003eMohammed E.Almandouh, Mohammed F. Alrahmawy, Mohamed Eisa, Mohamed Elhoseny\u0026amp; A. S. Tolba. \u0026rdquo; Ensemble based high performance deep learning models for fake news detection\u0026rdquo;. Published: 04 November 2024, Article number: 26591.\u003c/li\u003e\n\u003cli\u003eI. Kadek Sastrawan, I.P.A. Bayupati, Dewa Made Sri Arsa.\u0026rdquo; Detection of fake news using deep learning CNN\u0026ndash;RNN based methods\u0026rdquo;. Received 3 July 2021; received in revised form 15 September 2021; accepted 6 October 2021 Available online 22 October 2021.\u003c/li\u003e\n\u003cli\u003eAzka Kishwara, Adeel Zafara. \u0026quot; Fake News Detection on Pakistani News Using Machine Learning and Deep Learning\u0026quot;. April 15, 2022.\u003c/li\u003e\n\u003cli\u003eSangita M. Jaybhaye, Vivek Badade, Aryan Dodke, Apoorva Holkar, and Priyanka Lokhande. \u0026quot; Fake News Detection using LSTM based deep learning approach \u0026quot;. ITM Web of Conferences 56, 03005 (2023).\u003c/li\u003e\n\u003cli\u003eSheng How Kong, Li Mei Tan, Keng Hoon Gan, Nur Hana Samsudin.\u0026rdquo; Fake News Detection using Deep Learning\u0026rdquo;. 2020.\u003c/li\u003e\n\u003cli\u003eKushal Agarwalla, Shubham Nandan, Varun Anil Nair, D. Deva Hema.\u0026rdquo; Fake News Detection using Machine Learning and Natural Language Processing\u0026rdquo;. Published: 10 November 2022. International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878 (Online), Volume-7, Issue-6, March 2019. \u003c/li\u003e\n\u003cli\u003eMohamad Abdolahi, Morteza Zahedi.\u0026rdquo; A New Method for Sentence Vector Normalization Using Word2vec\u0026rdquo;. Int. J. Nonlinear Anal. Appl. 10 (2019) No. 2, 87-96 ISSN: 2008-6822 (electronic) https://dx.doi.org/10.22075/ijnaa.2019.4177.\u003c/li\u003e\n\u003cli\u003eKhalid Shaker, Arwa Alqudsi.\u0026rdquo; Approach for Detecting Arabic Fake News using Deep Learning\u0026rdquo;. DOI: https://doi.org/10.52866/ijcsm.2024.05.03.049 Received April 2024; Accepted June 2024; Available online August 2024.\u003c/li\u003e\n\u003cli\u003eHanen Himdi, George Weir, Fatmah Assiri, Hassanin Al‑Barhamtoshy.\u0026rdquo; Arabic Fake News Detection Based on Textual Analysis\u0026rdquo;. Arabian Journal for Science and Engineering (2022) 47:10453\u0026ndash;10469.\u003c/li\u003e\n\u003cli\u003eAhmed Redha Mahlous, Ali Al-Laith.\u0026rdquo; Fake News Detection in Arabic Tweets during the COVID-19 Pandemic\u0026rdquo;. (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 12, No. 6, 2021.\u003c/li\u003e\n\u003cli\u003eAli Bou Nassif, Ashraf Elnagar, Omar Elgendy, Yaman Afadar.\u0026rdquo; Arabic fake news detection based on deep contextualized embedding models\u0026rdquo;. Neural Computing and Applications (2022) 34:16019\u0026ndash;16032.\u003c/li\u003e\n\u003cli\u003eBohan Li, Yutai Hou, Wanxiang Che.\u0026rdquo; Data augmentation approaches in natural language processing: A survey\u0026rdquo;. Available online 24 March 2022\u003cu\u003e \u003c/u\u003ehttps://doi.org/10.1016/j.aiopen.2022.03.001\u003c/li\u003e\n\u003cli\u003eSebastien C. Wong; Adam Gatt; Victor Stamatescu; Mark D. McDonnell. \u0026rdquo; Understanding Data Augmentation for Classification: When to Warp?\u0026rdquo;. \u003cstrong\u003eDate Added to IEEE \u003cem\u003eXplore\u003c/em\u003e: \u003c/strong\u003e26 December 2016 \u003cstrong\u003eDOI: \u003c/strong\u003e10.1109/DICTA.2016.7797091.\u003c/li\u003e\n\u003cli\u003eSosuke Kobayashi.\u0026rdquo; Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations\u0026rdquo;. June 1 - 6, 2018.\u003c/li\u003e\n\u003cli\u003eXiang Zhang Junbo Zhao Yann LeCun.\u0026rdquo; Character-level Convolutional Networks for Text Classification\u0026rdquo;. posted in Feb 2015 as arXiv:1502.01710.\u003c/li\u003e\n\u003cli\u003eConnor Shorten, Taghi M. Khoshgoftaar and Borko Furht.\u0026rdquo; Text Data Augmentation for Deep Learning\u0026rdquo;. Shorten et al. J Big Data (2021) 8:101, https://doi.org/10.1186/s40537-021-00492-0.\u003c/li\u003e\n\u003cli\u003eHeng Yang, Ke Li.\u0026rdquo; Boosting Text Augmentation via Hybrid Instance Filtering Framework\u0026rdquo;. Findings of the Association for Computational Linguistics: ACL 2023, pages 1652\u0026ndash;1669 July 9-14, 2023 \u0026copy;2023 Association for Computational Linguistics.\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":true,"highlight":"","institution":"","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Deep learning, Fake news detection, CNN, DNN, RNN, Hybrid LSTM and AraBERT, Misinformation, and Social Media Analysis","lastPublishedDoi":"10.21203/rs.3.rs-7223076/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7223076/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eDeep learning (DL)-based detection systems for fake news should improve and adapt as fake news grows more complex in order to continue protecting the integrity of information in digital society. In addition to addressing a serious technological challenge, creating strong DL-based fake news detection systems is an important tool to maintain authenticity of information. Because of the difficulty of finding Arabic data, the method offers a basis for understanding the significance and difficulties of collecting data in Arabic. Due to the limited dataset, the suggested system has been translated from English into Arabic, and its performance and the potential of DL in identifying fake news were verified using another dataset that was accessible. Existing studies utilizing DL methods, like recurrent neural networks (RNNs), LSTMs, and convolutional neural networks (CNNs) for detecting fake news are discussed. For comparison, the term Aribert is used. It makes use of cutting-edge DL methods to propose a new method for identifying fake news. The paper offers a thorough framework for automatically detecting as well as classifying any false information on digital platforms by combining deep neural networks (DNNs) with natural language processing (NLP). Contextual understanding, temporal dependency, and content complexity are some of the main challenges with fake news detection that the approach solves. From purposefully created news articles to thinly veiled misleading content, the system displays great performance in detecting various types of misinformation. Furthermore, cutting-edge feature extraction (FE) methods have been used, which take into account metadata as well as textual content, such as propagation patterns and source credibility. When put to comparison with machine learning (ML) methods, the experimental results show notable gains in detection speed and accuracy. Spacey, Fasttext, and two-word embeddings were among the four methods that were employed. The best DNN method used Spacey to obtain a strong performance of 78%, whereas LSTM model performed well with Spacey at 49% and Fasttext at 51%. On accuracy scale, it scored a 99.1% success rate after converting to BERT, whereas AraBERT earned a 99.3% success rate. AraBERT is thus better. This is due to the fact that both AraBERT and BERT are trained on various Arabic articles.\u003c/p\u003e","manuscriptTitle":"Detecting Fake News Using Deep Learning Approaches","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-09-25 17:05:40","doi":"10.21203/rs.3.rs-7223076/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"b2efe318-898f-4bfe-b994-21307c1ac321","owner":[],"postedDate":"September 25th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2025-10-27T15:27:44+00:00","versionOfRecord":[],"versionCreatedAt":"2025-09-25 17:05:40","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-7223076","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7223076","identity":"rs-7223076","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00