DETECTION MODEL FOR FAKE NEWS ON COVID-19 IN INDONESIA
DOI:
https://doi.org/10.11113/aej.v13.19648Keywords:
Fake news detection, Indonesia, Covid-19, LSTM, RoBERTa, Fake news detection, COVID-19 misinformation, fake news in Indonesian, machine learning, LSTM, RoBERTaAbstract
Today, fake information has become a significant problem, exacerbated by the acceleration of access to information. The spread of fake information has a dangerous impact, especially regarding global health issues, for example COVID-19. People can access various resources to obtain information, including online sites and social media. One of the methods to control the spread of false information is detecting hoaxes. Many methods have been developed to identify hoaxes; most previous studies have focused on developing hoax detection methods using data from a single source in English. The present study is carried out to detect fake news in Indonesian language using multiple data sources, including traditional and social media in the context of COVID-19. The study uses Long Short-Term Memory (LSTM) and the Robustly Optimised Bidirectional Encoder Representations from Transformers Pre-Training Approach (RoBERTa). The LSTM approach is used to develop four different architectures that varied based on: (1) the use of text-only versus the use of both title and text; (2) the number of LSTM and dense layers; and (3) the activation function. The LSTM model with text-only data, a single LSTM layer and two dense layers, outperformed other LSTM architectures, achieving the highest accuracy of 92.17%. The LSTM models require a considerably short training time of 23–27 minutes for 3,847 articles and has a detection time of 3.8–4.1 ms per article. The RoBERTa classifiers outperformed all LSTM models with an accuracy of over 97% and a significantly better training time, with a margin of more than 50% compared to LSTM classifiers, although it had a slightly longer test time. Both LSTM and RoBERTa models outperformed the Naïve Bayes and SVM benchmark methods in terms of accuracy, precision, and recall. Therefore, this study shows that both LSTM and RoBERTa methods are reliable and can be reasonably implemented for real-time fake news detection.
References
Medioni, G., Cohen, I., BreAˆ mond, F., Hongeng, S. and Nevatia, R. 2001. Event Detection and Analysis from Video Streams. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(8): 873-889. DOI: 10.1109/34.946990
Nayoga, B. P., Adipradana, R., Suryadi, R., and Suhartono, D. 2021. Hoax Analyzer for Indonesian News Using Deep Learning Models, Procedia Computer Science, 179: 704-712. DOI: https://doi.org/10.1016/j.procs.2021.01.059
Utami, E., Iskandar, A. F., Hidayat, W., Prasetyo, A. B., Hartanto, A. D. 2021. COVID-19 Hoax Detection Using KNN in Jaccard Space. Indonesian Journal of Computing and Cybernetics Systems, 15(3): 255-264. DOI: https://doi.org/10.22146/ijccs.67392
Aldwairi, M., and Alwahedi, A. 2018. Detecting Fake News in Social Media Networks. Procedia Computer Science, 141: 215-222. DOI: https://doi.org/10.1016/j.procs.2018.10.171
Dhawan, A., Bhalla, M., Arora, D., Kaushal, R., and Kumaraguru, P. 2022. FakeNewsIndia: A Benchmark Dataset of Fake News Incidents in India, Collection Methodology and Impact Assessment in Social Media, Computer Communication, 185: 130-141. DOI: https://doi.org/10.1016/j.comcom.2022.01.003
Bahad, P., Saxena, P., and Kamal, R. 2019. Fake News Detection using Bi-directional LSTM-Recurrent Neural Network, Procedia Computer Science, 165: 74-82. DOI: https://doi.org/10.1016/j.procs.2020.01.072
Yesugade, T., Kokate, S., Patil, S., Varma, R., and Pawar, S. 2021. Fake News Detection using LSTM, International Research Journal of Engineering and Technology, 8(4): 2500-2507.
Lin, N., Fu, S., and Jiang, S. 2020. Fake News Detection in the Urdu Language using CharCNN-RoBERTa. In Forum for Information Retrieval Evaluation 2020
Samadi, M., Mousavian, M., and Momtazi, S. 2021. Deep Contextualized Text Representation and Learning for Fake News Detection, Information Processing and Management. 58(6): 1-13. DOI: https://doi.org/10.1016/j.ipm.2021.102723
Davoudi, M., Moosavi, M. R., and Sadreddini, M. H. 2022. DSS: A Hybrid Deep Model for Fake News Detection using Propagation Tree and Stance Network, Expert Systems with Applications, 198: 1-21. DOI: https://doi.org/10.1016/j.eswa.2022.116635
Deepak S., and Chitturi, B. 2020. Deep Neural Approach to Fake-News Identification, Procedia Computer Science, 167: 2236-2242. DOI: https://doi.org/10.1016/j.procs.2020.03.276
Apriliyanto, A., and Kusumaningrum, R. 2020. Hoax Detection in Indonesia Language using Long Short-Term Memory Model., Sinergi. 24(3): 189-196. DOI: 10.22441/sinergi.2020.3.003
Prasetijo, A. B., Isnanto, R. R., Eridani, D., Soetrisno, Y. A. A., Arfan, M., and Sofwan, A. 2017. Hoax Detection System on Indonesian News Sites Based on Text Classification using SVM and SGD. In 4th International Conference on Information Technology, Computer, and Electrical Engineering. 45-49. IEEE. DOI: 10.1109/ICITACEE.2017.8257673
Kencana, C. W., Setiawan, E. B., and Kurniawan, I. 2020. Hoax Detection on Twitter using Feed-forward and Back-propagation Neural Networks Method, Jurnal Resti, 4(4): 648-654. DOI: 10.29207/resti.v4i4.2038
Mazzeo, V., Rapisarda, A., and Giuffrida, G. 2021. Detection of Fake News on COVID-19 on Web Search Engines, Frontiers in Physics. 9: 1-14. DOI: 10.3389/fphy.2021.685730
Khan, S., Hakak, S., Deepa, N., Prabadevi, B., Dev, K., and Trelova, S. 2022. Detecting COVID-19-Related Fake News Using Feature Extraction, Frontiers in Public Health. 9: 1-9. DOI: 10.3389/fpubh.2021.788074
Bondielli, A., and Marcelloni, F. 2019. A Survey on Fake News and Rumour Detection Techniques, Information Sciences, 497: 38-55. DOI: https://doi.org/10.1016/j.ins.2019.05.035
Asano, E., 2017, How Much Time Do People Spend on Social Media? [Infographic], [Online, accessed July 24th. 2022] URL: https://www.socialmediatoday.com/marketing/how-much-time-do-people-spend-social-media-infographic
Hochreiter, S., and Schmidhuber, J. 1997. Long short-term memory. Neural computation, 9(8): 1735-1780.
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., ... & Stoyanov, V. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
Kumar, S., and Singh, T., D. 2022. Fake News Detection on Hindi News Dataset, Global Transitions Proceedings 2022, 3(1): 289-297. DOI: https://doi.org/10.1007/s10389-021-01658-z.
Abd Elaziz, M., Dahou, A., Orabi, D. A., Alshathri, S., Soliman, E. M., and Ewees, A. A. 2023. A Hybrid Multitask Learning Framework with a Fire Hawk Optimizer for Arabic Fake News Detection. Mathematics, 11(2): 258.
Ma, K., Tang, C., Zhang, W., Cui, B., Ji, K., Chen, Z., & Abraham, A. 2023. DC-CNN: Dual-channel Convolutional Neural Networks with attention-pooling for fake news detection. Applied Intelligence, 53(7): 8354-8369.