DETECTION MODEL FOR FAKE NEWS ON COVID-19 IN INDONESIA

Authors

  • Achmad Pratama Rifai, Department of Mechanical and Industrial Engineering, Universitas Gadjah Mada, Indonesia
  • Yun Prihantina Mulyani, Department of Mechanical and Industrial Engineering, Universitas Gadjah Mada, Indonesia
  • Rian Febrianto, Department of Mechanical and Industrial Engineering, Universitas Gadjah Mada, Indonesia
  • Hilya Mudrika Arini, Department of Mechanical and Industrial Engineering, Universitas Gadjah Mada, Indonesia
  • Titis Wijayanto, Department of Mechanical and Industrial Engineering, Universitas Gadjah Mada, Indonesia
  • Nurul Lathifah, Department of Industrial Engineering, Universitas Indonesia, Indonesia
  • Xiao Liu, School of Information Technology, Deakin University, Australia
  • Jianxin Li, School of Information Technology, Deakin University, Australia
  • Hui Yin, School of Information Technology, Deakin University, Australia
  • Yutao Wu, School of Information Technology, Deakin University, Australia
  • Rami Mohawesh, School of Information Technology, Deakin University, Australia

DOI:

https://doi.org/10.11113/aej.v13.19648

Keywords:

Fake news detection, Indonesia, COVID-19, COVID-19 misinformation, fake news in Indonesian, machine learning, LSTM, RoBERTa

Abstract

Fake information has become a significant problem, exacerbated by ever faster access to information. Its spread is dangerous, especially for global health issues such as COVID-19. People can obtain information from many sources, including online sites and social media. One way to control the spread of false information is to detect hoaxes. Many hoax detection methods have been developed, but most previous studies used data from a single source in English. The present study detects fake news in the Indonesian language using multiple data sources, including traditional and social media, in the context of COVID-19. The study uses Long Short-Term Memory (LSTM) and the Robustly Optimized Bidirectional Encoder Representations from Transformers (BERT) Pretraining Approach (RoBERTa). The LSTM approach is used to develop four architectures that vary in: (1) the use of text only versus both title and text; (2) the number of LSTM and dense layers; and (3) the activation function. The LSTM model with text-only data, a single LSTM layer, and two dense layers outperformed the other LSTM architectures, achieving the highest accuracy of 92.17%. The LSTM models require a relatively short training time of 23–27 minutes for 3,847 articles and have a detection time of 3.8–4.1 ms per article. The RoBERTa classifiers outperformed all LSTM models, with an accuracy of over 97% and a training time more than 50% shorter than that of the LSTM classifiers, although their test time was slightly longer. Both the LSTM and RoBERTa models outperformed the Naïve Bayes and SVM benchmark methods in terms of accuracy, precision, and recall. The study therefore shows that both LSTM and RoBERTa are reliable and can reasonably be implemented for real-time fake news detection.
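The evaluation criteria named in the abstract (accuracy, precision, recall, and per-article detection time) can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function names `binary_metrics` and `articles_per_second` are hypothetical, the toy labels are invented for illustration, and the encoding 1 = fake, 0 = genuine is an assumption.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int],
                   positive: int = 1) -> Dict[str, float]:
    """Compute accuracy, precision, and recall for a binary classifier.

    Assumption: `positive` marks the "fake" class (1 = fake, 0 = genuine).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


def articles_per_second(ms_per_article: float) -> float:
    """Convert a per-article detection time in milliseconds to throughput."""
    return 1000.0 / ms_per_article


# Toy labels, invented for illustration only (not data from the study).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)

# Detection times reported in the abstract for the LSTM models: 3.8-4.1 ms.
for ms in (3.8, 4.1):
    print(f"{ms} ms/article is about {articles_per_second(ms):.0f} articles/s")
```

Under these assumptions, the reported 3.8–4.1 ms per article corresponds to roughly 240 to 260 articles per second on a single worker, which gives a sense of scale for the real-time claim.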

Published

2023-10-24

Section

Articles

How to Cite

DETECTION MODEL FOR FAKE NEWS ON COVID-19 IN INDONESIA. (2023). ASEAN Engineering Journal, 13(4), 119-126. https://doi.org/10.11113/aej.v13.19648