Digital Epidemiology: Using Social Media Data and Machine Learning to Forecast Influenza Outbreaks and Inform Public Health Responses

Benny Novico Zani; Ravi Raj  Pandey; Le Thi Lan  Anh

doi:10.70177/jssut.v3i3.2735

Authors

Benny Novico Zani
bennynovico.phd@gmail.com
Sekolah Tinggi Ilmu Kesehatan Raflesia, Indonesia
Ravi Raj Pandey Pokhara University, Nepal
Le Thi Lan Anh Hanoi Medical University, Viet Nam

Vol. 3 No. 3 (2025)

Articles

Accepted December 26, 2025

Published June 20, 2025

Downloads

PDF

Abstract
How to Cite
Metrics
References
License

Background. Traditional influenza surveillance systems inherently suffer from a critical one-to-two-week reporting lag, severely hindering timely public health interventions and resource allocation.

Purpose. This research aims to develop and validate a hybrid digital epidemiology model using unstructured social media data and advanced Machine Learning (ML) to provide accurate, long-range influenza outbreak forecasts.

Method. The methodology involved quantitative time-series forecasting, training Long Short-Term Memory (LSTM) and XGBoost models on five years of social media data, and benchmarking against official clinical reports.

Results. The optimized LSTM model achieved significantly superior accuracy, recording a Root Mean Square Error (RMSE) of 0.145 for the four-week forecasting horizon, less than half the error of the traditional ARIMA baseline. This high predictive power confirms that social media is a statistically reliable, non-clinical leading indicator.

Conclusion. The study establishes a transparent policy translation framework, linking predicted incidence rates (e.g., exceeding 0.20) directly to required operational responses (e.g., hospital surge activation). This model offers a robust, actionable template for transforming public health surveillance from a reactive system into a proactive intelligence platform for epidemic preparedness.

Zani, B. N., Pandey, R. R. ., & Anh, L. T. L. . (2025). Digital Epidemiology: Using Social Media Data and Machine Learning to Forecast Influenza Outbreaks and Inform Public Health Responses. Journal of Social Science Utilizing Technology, 3(3), 119–130. https://doi.org/10.70177/jssut.v3i3.2735

Download Citation

Aljabali, A. A. A., Obeid, M. A., El-Tanani, M., Mishra, V., Mishra, Y., & Tambuwala, M. M. (2024). Precision epidemiology at the nexus of mathematics and nanotechnology: Unraveling the dance of viral dynamics. Gene, 905, 148174. https://doi.org/10.1016/j.gene.2024.148174

Assudani, P. J., Bhurgy, A. S., Kollem, S., Bhurgy, B. S., Ahmad, Md. O., Kulkarni, M. B., & Bhaiyya, M. (2025). Artificial intelligence and machine learning in infectious disease diagnostics: A comprehensive review of applications, challenges, and future directions. Microchemical Journal, 218, 115802. https://doi.org/10.1016/j.microc.2025.115802

Aswini, R., Saranya, B., Gayathri, K., & Karthikeyan, E. (2025). Revolutionizing infectious disease surveillance: Multi-omics technologies and AI-driven integration. Decoding Infection and Transmission, 3, 100061. https://doi.org/10.1016/j.dcit.2025.100061

Atella, V., & Scandizzo, P. L. (2024). Chapter 9—What did we learn after more than 6 million deaths? Dalam V. Atella & P. L. Scandizzo (Ed.), The Covid-19 Disruption and the Global Health Challenge (hlm. 325–379). Academic Press. https://doi.org/10.1016/B978-0-44-318576-2.00023-8

Beyrer, C., Kamarulzaman, A., Isbell, M., Amon, J., Baral, S., Bassett, M. T., Cepeda, J., Deacon, H., Dean, L., Fan, L., Giacaman, R., Gomes, C., Gruskin, S., Goyal, R., Mon, S. H. H., Jabbour, S., Kazatchkine, M., Kasoka, K., Lyons, C., … Rubenstein, L. (2024). Under threat: The International AIDS Society–Lancet Commission on Health and Human Rights. The Lancet, 403(10434), 1374–1418. https://doi.org/10.1016/S0140-6736(24)00302-7

Fallatah, D. I., & Adekola, H. A. (2024). Digital epidemiology: Harnessing big data for early detection and monitoring of viral outbreaks. Infection Prevention in Practice, 6(3), 100382. https://doi.org/10.1016/j.infpip.2024.100382

Ghavi Hossein-Zadeh, N. (2025). Artificial intelligence in veterinary and animal science: Applications, challenges, and future prospects. Computers and Electronics in Agriculture, 235, 110395. https://doi.org/10.1016/j.compag.2025.110395

Gresham, L., Alemu, W., Divi, N., Alhusseini, N., Awoniyi, O., Bashir, A., Shaikh, A. T., & McNabb, S. J. N. (2024). Chapter 17—Modernizing public health surveillance. Dalam S. J. N. McNabb, A. T. Shaikh, & C. J. Haley (Ed.), Modernizing Global Health Security to Prevent, Detect, and Respond (hlm. 307–327). Academic Press. https://doi.org/10.1016/B978-0-323-90945-7.00002-6

Gruessner, R. W. G., & Benedetti, E. (Ed.). (2024). Chapter 17—Kidney transplantation: Assessment of the Kidney Donor Candidate. Dalam Living Donor Organ Transplantation (Second Edition) (hlm. 255–409). Academic Press. https://doi.org/10.1016/B978-0-443-23571-9.00017-7

Khalaf, W. S., Morgan, R. N., & Elkhatib, W. F. (2025). Clinical microbiology and artificial intelligence: Different applications, challenges, and future prospects. Journal of Microbiological Methods, 232–234, 107125. https://doi.org/10.1016/j.mimet.2025.107125

Li, J.-H., Tseng, Y.-J., Chen, S.-H., & Chen, K.-F. (2025). Artificial Intelligence in Infection Surveillance: Data Integration, Applications and Future Directions. Biomedical Journal, 100929. https://doi.org/10.1016/j.bj.2025.100929

Melo, C. L., Mageste, L. R., Guaraldo, L., Paula, D. P., & Wakimoto, M. D. (2024a). Use of Digital Tools in Arbovirus Surveillance: Scoping Review. Journal of Medical Internet Research, 26. https://doi.org/10.2196/57476

Melo, C. L., Mageste, L. R., Guaraldo, L., Paula, D. P., & Wakimoto, M. D. (2024b). Use of Digital Tools in Arbovirus Surveillance: Scoping Review. Journal of Medical Internet Research, 26. https://doi.org/10.2196/57476

Mohamad, U. H. (2025). Chapter 6—Comparative analysis of AI and nanotech approaches for pandemic prediction. Dalam A. Ahmadian, F. Ghaemi, A. K. Yadav, M. J. Ebadi, & S. Salahshour (Ed.), The Prediction of Future Pandemics (hlm. 69–104). Elsevier. https://doi.org/10.1016/B978-0-443-33871-7.00006-4

Mohammed, A. M., Mohammed, M., Oleiwi, J. K., Adam, T., Betar, B. O., & Gopinath, S. C. B. (2025). Advancing anti-infective drug discovery: The pivotal role of artificial intelligence in overcoming infectious diseases and antimicrobial resistance. In Silico Research in Biomedicine, 1, 100118. https://doi.org/10.1016/j.insi.2025.100118

Monlezun, D. J. (2025). Chapter 5—Quantum AI for public health. Dalam D. J. Monlezun (Ed.), Quantum Health AI (hlm. 125–154). Academic Press. https://doi.org/10.1016/B978-0-443-33353-8.00003-5

Noor, F., Saleem, M. A. U., Rafique, A., Danish, M. A. U., Bano, F., Shehzad, M. S. U., Noor, A., Fatima, I., Kamal, M. A., Muzammil, A., & Rehman, A. (2026). Chapter 13—Computational tools and techniques for disease modeling: Bridging the gap. Dalam S. N. Rai, S. K. Singh, & V. Singh (Ed.), Advancements in Modeling-Based Therapeutics and Technology for Chronic Diseases (hlm. 373–418). Academic Press. https://doi.org/10.1016/B978-0-443-33877-9.00017-3

Nunes, M. C., Thommes, E., Fröhlich, H., Flahault, A., Arino, J., Baguelin, M., Biggerstaff, M., Bizel-Bizellot, G., Borchering, R., Cacciapaglia, G., Cauchemez, S., Barbier--Chebbah, A., Claussen, C., Choirat, C., Cojocaru, M., Commaille-Chapus, C., Hon, C., Kong, J., Lambert, N., … Coudeville, L. (2024). Redefining pandemic preparedness: Multidisciplinary insights from the CERP modelling workshop in infectious diseases, workshop report. Infectious Disease Modelling, 9(2), 501–518. https://doi.org/10.1016/j.idm.2024.02.008

Odone, A., Barbati, C., Amadasi, S., Schultz, T., & Resnik, D. B. (2025). Artificial intelligence and infectious diseases: An evidence-driven conceptual framework for research, public health, and clinical practice. The Lancet Infectious Diseases. https://doi.org/10.1016/S1473-3099(25)00412-8

Oh, S., & Wijaya, J. (2026). Predictive surveillance and diagnosis of COVID-19: An integrative machine learning and wastewater multi-omics approach. Water Research, 289, 124981. https://doi.org/10.1016/j.watres.2025.124981

Raina MacIntyre, C., Lim, S., Gurdasani, D., Miranda, M., Metcalf, D., Quigley, A., Hutchinson, D., Burr, A., & Heslop, D. J. (2024). Early detection of emerging infectious diseases—Implications for vaccine development. Vaccine, 42(7), 1826–1830. https://doi.org/10.1016/j.vaccine.2023.05.069

Santra, D. (2025). Artificial intelligence in urban health epidemic management. Dalam Advances in Computers. Elsevier. https://doi.org/10.1016/bs.adcom.2025.10.001

Shen, Y., Liu, Y., Krafft, T., & Wang, Q. (2025). Progress and challenges in infectious disease surveillance and early warning. Medicine Plus, 2(1), 100071. https://doi.org/10.1016/j.medp.2025.100071

Singh, S., & Singh, S. (2025). Chapter 4—Zoonotic diseases and their implications. Dalam K. B. Pandey, D. J. Newman, & C. Egbuna (Ed.), Drug Discovery and One Health Approach in Combating Infectious Diseases (hlm. 59–75). Elsevier. https://doi.org/10.1016/B978-0-443-27461-9.00007-X

Singhal, N., Vardhan, H., Jain, R., Gupta, P., Pandey, A., Wagri, N. K., & Gaur, A. (2025). Role of artificial intelligence in automating diagnostic procedures in clinical microbiology laboratories. Current Research in Biotechnology, 10, 100351. https://doi.org/10.1016/j.crbiot.2025.100351

Suhag, A., Burgess, R., & Skatova, A. (2025). Shopping Data for Population Health Surveillance: Opportunities, Challenges, and Future Directions. Journal of Medical Internet Research, 27. https://doi.org/10.2196/75720

Thakur, R., Baghel, M., Bhoj, S., Jamwal, S., Chandratre, G. A., Vishaal, M., Badgujar, P. C., Pandey, H. O., & Tarafdar, A. (2024). CHAPTER 8—Digitalization of livestock farms through blockchain, big data, artificial intelligence, and Internet of Things?. Dalam A. Tarafdar, A. Pandey, G. K. Gaur, M. Singh, & H. O. Pandey (Ed.), Engineering Applications in Livestock Production (hlm. 179–206). Academic Press. https://doi.org/10.1016/B978-0-323-98385-3.00012-8

Winkler, A. S., Brux, C. M., Carabin, H., das Neves, C. G., Häsler, B., Zinsstag, J., Fèvre, E. M., Okello, A., Laing, G., Harrison, W. E., Pöntinen, A. K., Huber, A., Ruckert, A., Natterson-Horowitz, B., Abela, B., Aenishaenslin, C., Heymann, D. L., Rødland, E. K., Berthe, F. C. J., … Amuasi, J. H. (2025). The Lancet One Health Commission: Harnessing our interconnectedness for equitable, sustainable, and healthy socioecological systems. The Lancet, 406(10502), 501–570. https://doi.org/10.1016/S0140-6736(25)00627-0

Zhao, A. P., Li, S., Cao, Z., Hu, P. J.-H., Wang, J., Xiang, Y., Xie, D., & Lu, X. (2024). AI for science: Predicting infectious diseases. Journal of Safety Science and Resilience, 5(2), 130–146. https://doi.org/10.1016/j.jnlssr.2024.02.002

	Semua	Sejak 2021
Kutipan	263	263
indeks-h	7	7
indeks-i10	4	4

Digital Epidemiology: Using Social Media Data and Machine Learning to Forecast Influenza Outbreaks and Inform Public Health Responses

Authors

Downloads

People

Journal Policies

Submission

Google Scholar Citation

Dikutip oleh

Article Template

Visitor Counter

Tools

Indexed In

Address

Contact Info:

Digital Epidemiology: Using Social Media Data and Machine Learning to Forecast Influenza Outbreaks and Inform Public Health Responses

Authors

Downloads

Login

People

Journal Policies

Submission

Google Scholar Citation

Dikutip oleh

Article Template

Visitor Counter

Tools

Indexed In