Digital Epidemiology: Using Social Media Data and Machine Learning to Forecast Influenza Outbreaks and Inform Public Health Responses
Downloads
Background. Traditional influenza surveillance systems inherently suffer from a critical one-to-two-week reporting lag, severely hindering timely public health interventions and resource allocation.
Purpose. This research aims to develop and validate a hybrid digital epidemiology model using unstructured social media data and advanced Machine Learning (ML) to provide accurate, long-range influenza outbreak forecasts.
Method. The methodology involved quantitative time-series forecasting, training Long Short-Term Memory (LSTM) and XGBoost models on five years of social media data, and benchmarking against official clinical reports.
Results. The optimized LSTM model achieved significantly superior accuracy, recording a Root Mean Square Error (RMSE) of 0.145 for the four-week forecasting horizon, less than half the error of the traditional ARIMA baseline. This high predictive power confirms that social media is a statistically reliable, non-clinical leading indicator.
Conclusion. The study establishes a transparent policy translation framework, linking predicted incidence rates (e.g., exceeding 0.20) directly to required operational responses (e.g., hospital surge activation). This model offers a robust, actionable template for transforming public health surveillance from a reactive system into a proactive intelligence platform for epidemic preparedness.
Aljabali, A. A. A., Obeid, M. A., El-Tanani, M., Mishra, V., Mishra, Y., & Tambuwala, M. M. (2024). Precision epidemiology at the nexus of mathematics and nanotechnology: Unraveling the dance of viral dynamics. Gene, 905, 148174. https://doi.org/10.1016/j.gene.2024.148174
Assudani, P. J., Bhurgy, A. S., Kollem, S., Bhurgy, B. S., Ahmad, Md. O., Kulkarni, M. B., & Bhaiyya, M. (2025). Artificial intelligence and machine learning in infectious disease diagnostics: A comprehensive review of applications, challenges, and future directions. Microchemical Journal, 218, 115802. https://doi.org/10.1016/j.microc.2025.115802
Aswini, R., Saranya, B., Gayathri, K., & Karthikeyan, E. (2025). Revolutionizing infectious disease surveillance: Multi-omics technologies and AI-driven integration. Decoding Infection and Transmission, 3, 100061. https://doi.org/10.1016/j.dcit.2025.100061
Atella, V., & Scandizzo, P. L. (2024). Chapter 9—What did we learn after more than 6 million deaths? Dalam V. Atella & P. L. Scandizzo (Ed.), The Covid-19 Disruption and the Global Health Challenge (hlm. 325–379). Academic Press. https://doi.org/10.1016/B978-0-44-318576-2.00023-8
Beyrer, C., Kamarulzaman, A., Isbell, M., Amon, J., Baral, S., Bassett, M. T., Cepeda, J., Deacon, H., Dean, L., Fan, L., Giacaman, R., Gomes, C., Gruskin, S., Goyal, R., Mon, S. H. H., Jabbour, S., Kazatchkine, M., Kasoka, K., Lyons, C., … Rubenstein, L. (2024). Under threat: The International AIDS Society–Lancet Commission on Health and Human Rights. The Lancet, 403(10434), 1374–1418. https://doi.org/10.1016/S0140-6736(24)00302-7
Fallatah, D. I., & Adekola, H. A. (2024). Digital epidemiology: Harnessing big data for early detection and monitoring of viral outbreaks. Infection Prevention in Practice, 6(3), 100382. https://doi.org/10.1016/j.infpip.2024.100382
Ghavi Hossein-Zadeh, N. (2025). Artificial intelligence in veterinary and animal science: Applications, challenges, and future prospects. Computers and Electronics in Agriculture, 235, 110395. https://doi.org/10.1016/j.compag.2025.110395
Gresham, L., Alemu, W., Divi, N., Alhusseini, N., Awoniyi, O., Bashir, A., Shaikh, A. T., & McNabb, S. J. N. (2024). Chapter 17—Modernizing public health surveillance. Dalam S. J. N. McNabb, A. T. Shaikh, & C. J. Haley (Ed.), Modernizing Global Health Security to Prevent, Detect, and Respond (hlm. 307–327). Academic Press. https://doi.org/10.1016/B978-0-323-90945-7.00002-6
Gruessner, R. W. G., & Benedetti, E. (Ed.). (2024). Chapter 17—Kidney transplantation: Assessment of the Kidney Donor Candidate. Dalam Living Donor Organ Transplantation (Second Edition) (hlm. 255–409). Academic Press. https://doi.org/10.1016/B978-0-443-23571-9.00017-7
Khalaf, W. S., Morgan, R. N., & Elkhatib, W. F. (2025). Clinical microbiology and artificial intelligence: Different applications, challenges, and future prospects. Journal of Microbiological Methods, 232–234, 107125. https://doi.org/10.1016/j.mimet.2025.107125
Li, J.-H., Tseng, Y.-J., Chen, S.-H., & Chen, K.-F. (2025). Artificial Intelligence in Infection Surveillance: Data Integration, Applications and Future Directions. Biomedical Journal, 100929. https://doi.org/10.1016/j.bj.2025.100929
Melo, C. L., Mageste, L. R., Guaraldo, L., Paula, D. P., & Wakimoto, M. D. (2024a). Use of Digital Tools in Arbovirus Surveillance: Scoping Review. Journal of Medical Internet Research, 26. https://doi.org/10.2196/57476
Melo, C. L., Mageste, L. R., Guaraldo, L., Paula, D. P., & Wakimoto, M. D. (2024b). Use of Digital Tools in Arbovirus Surveillance: Scoping Review. Journal of Medical Internet Research, 26. https://doi.org/10.2196/57476
Mohamad, U. H. (2025). Chapter 6—Comparative analysis of AI and nanotech approaches for pandemic prediction. Dalam A. Ahmadian, F. Ghaemi, A. K. Yadav, M. J. Ebadi, & S. Salahshour (Ed.), The Prediction of Future Pandemics (hlm. 69–104). Elsevier. https://doi.org/10.1016/B978-0-443-33871-7.00006-4
Mohammed, A. M., Mohammed, M., Oleiwi, J. K., Adam, T., Betar, B. O., & Gopinath, S. C. B. (2025). Advancing anti-infective drug discovery: The pivotal role of artificial intelligence in overcoming infectious diseases and antimicrobial resistance. In Silico Research in Biomedicine, 1, 100118. https://doi.org/10.1016/j.insi.2025.100118
Monlezun, D. J. (2025). Chapter 5—Quantum AI for public health. Dalam D. J. Monlezun (Ed.), Quantum Health AI (hlm. 125–154). Academic Press. https://doi.org/10.1016/B978-0-443-33353-8.00003-5
Noor, F., Saleem, M. A. U., Rafique, A., Danish, M. A. U., Bano, F., Shehzad, M. S. U., Noor, A., Fatima, I., Kamal, M. A., Muzammil, A., & Rehman, A. (2026). Chapter 13—Computational tools and techniques for disease modeling: Bridging the gap. Dalam S. N. Rai, S. K. Singh, & V. Singh (Ed.), Advancements in Modeling-Based Therapeutics and Technology for Chronic Diseases (hlm. 373–418). Academic Press. https://doi.org/10.1016/B978-0-443-33877-9.00017-3
Nunes, M. C., Thommes, E., Fröhlich, H., Flahault, A., Arino, J., Baguelin, M., Biggerstaff, M., Bizel-Bizellot, G., Borchering, R., Cacciapaglia, G., Cauchemez, S., Barbier--Chebbah, A., Claussen, C., Choirat, C., Cojocaru, M., Commaille-Chapus, C., Hon, C., Kong, J., Lambert, N., … Coudeville, L. (2024). Redefining pandemic preparedness: Multidisciplinary insights from the CERP modelling workshop in infectious diseases, workshop report. Infectious Disease Modelling, 9(2), 501–518. https://doi.org/10.1016/j.idm.2024.02.008
Odone, A., Barbati, C., Amadasi, S., Schultz, T., & Resnik, D. B. (2025). Artificial intelligence and infectious diseases: An evidence-driven conceptual framework for research, public health, and clinical practice. The Lancet Infectious Diseases. https://doi.org/10.1016/S1473-3099(25)00412-8
Oh, S., & Wijaya, J. (2026). Predictive surveillance and diagnosis of COVID-19: An integrative machine learning and wastewater multi-omics approach. Water Research, 289, 124981. https://doi.org/10.1016/j.watres.2025.124981
Raina MacIntyre, C., Lim, S., Gurdasani, D., Miranda, M., Metcalf, D., Quigley, A., Hutchinson, D., Burr, A., & Heslop, D. J. (2024). Early detection of emerging infectious diseases—Implications for vaccine development. Vaccine, 42(7), 1826–1830. https://doi.org/10.1016/j.vaccine.2023.05.069
Santra, D. (2025). Artificial intelligence in urban health epidemic management. Dalam Advances in Computers. Elsevier. https://doi.org/10.1016/bs.adcom.2025.10.001
Shen, Y., Liu, Y., Krafft, T., & Wang, Q. (2025). Progress and challenges in infectious disease surveillance and early warning. Medicine Plus, 2(1), 100071. https://doi.org/10.1016/j.medp.2025.100071
Singh, S., & Singh, S. (2025). Chapter 4—Zoonotic diseases and their implications. Dalam K. B. Pandey, D. J. Newman, & C. Egbuna (Ed.), Drug Discovery and One Health Approach in Combating Infectious Diseases (hlm. 59–75). Elsevier. https://doi.org/10.1016/B978-0-443-27461-9.00007-X
Singhal, N., Vardhan, H., Jain, R., Gupta, P., Pandey, A., Wagri, N. K., & Gaur, A. (2025). Role of artificial intelligence in automating diagnostic procedures in clinical microbiology laboratories. Current Research in Biotechnology, 10, 100351. https://doi.org/10.1016/j.crbiot.2025.100351
Suhag, A., Burgess, R., & Skatova, A. (2025). Shopping Data for Population Health Surveillance: Opportunities, Challenges, and Future Directions. Journal of Medical Internet Research, 27. https://doi.org/10.2196/75720
Thakur, R., Baghel, M., Bhoj, S., Jamwal, S., Chandratre, G. A., Vishaal, M., Badgujar, P. C., Pandey, H. O., & Tarafdar, A. (2024). CHAPTER 8—Digitalization of livestock farms through blockchain, big data, artificial intelligence, and Internet of Things?. Dalam A. Tarafdar, A. Pandey, G. K. Gaur, M. Singh, & H. O. Pandey (Ed.), Engineering Applications in Livestock Production (hlm. 179–206). Academic Press. https://doi.org/10.1016/B978-0-323-98385-3.00012-8
Winkler, A. S., Brux, C. M., Carabin, H., das Neves, C. G., Häsler, B., Zinsstag, J., Fèvre, E. M., Okello, A., Laing, G., Harrison, W. E., Pöntinen, A. K., Huber, A., Ruckert, A., Natterson-Horowitz, B., Abela, B., Aenishaenslin, C., Heymann, D. L., Rødland, E. K., Berthe, F. C. J., … Amuasi, J. H. (2025). The Lancet One Health Commission: Harnessing our interconnectedness for equitable, sustainable, and healthy socioecological systems. The Lancet, 406(10502), 501–570. https://doi.org/10.1016/S0140-6736(25)00627-0
Zhao, A. P., Li, S., Cao, Z., Hu, P. J.-H., Wang, J., Xiang, Y., Xie, D., & Lu, X. (2024). AI for science: Predicting infectious diseases. Journal of Safety Science and Resilience, 5(2), 130–146. https://doi.org/10.1016/j.jnlssr.2024.02.002
Copyright (c) 2025 Benny Novico Zani, Ravi Raj Pandey, Le Thi Lan Anh

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


















