DIGITAL QUR’AN ANNOTATION THROUGH DEEP LEARNING: TOWARD AN INTELLIGENT TAFSIR ECOSYSTEM
Abstract
Traditional Qur’anic exegesis (tafsir) encompasses vast, complex textual datasets. Current digital platforms provide static access but lack the semantic understanding to intelligently connect, compare, and synthesize diverse interpretative traditions, limiting deep analysis for both scholars and lay users. This study aims to develop and evaluate a deep learning framework for the semantic annotation of Qur’anic verses. We seek to automate the classification of verses based on thematic content, legal rulings (ahkam), and intertextual references to classical tafsir, thereby prototyping an intelligent tafsir ecosystem. A gold-standard annotated corpus was constructed by mapping Qur’anic verses to thematic categories from authoritative tafsir. A custom transformer-based (BERT) model was trained on this corpus to perform multi-label classification and evaluated using F1-score, precision, and recall. The model demonstrated high efficacy, achieving a macro F1-score of 0.92 in thematic annotation and 0.89 in identifying intertextual links. The system accurately predicts and suggests relevant tafsir passages for un-annotated verses, significantly outperforming traditional keyword-based search methods in relevance. This study validates deep learning as a tool for sophisticated digital Qur’an annotation. The framework provides the foundation for an intelligent tafsir ecosystem, capable of offering dynamic, contextual, and deeply networked access to Qur’anic knowledge.
Full text article
References
Abbott, E. E., Apakama, D., Richardson, L. D., Chan, L., & Nadkarni, G. N. (2024). Leveraging Artificial Intelligence and Data Science for Integration of Social Determinants of Health in Emergency Medicine: Scoping Review. JMIR Medical Informatics, 12. https://doi.org/https://doi.org/10.2196/57124
Almashour, M., Aldamen, H. A. K., & Jarrah, M. (2025). Algorithmic feedback and multilingual identity: Translanguaging practices in Jordanian EFL academic writing. Social Sciences & Humanities Open, 12, 102016. https://doi.org/https://doi.org/10.1016/j.ssaho.2025.102016
Alotaibi, B., Almutarie, A., Alotaibi, S., & Alotaibi, M. (2025). Improving Fashion Sentiment Detection on X through Hybrid Transformers and RNNs. Computers, Materials and Continua, 84(3), 4451–4467. https://doi.org/https://doi.org/10.32604/cmc.2025.066050
Alshaya, S. A. (2025). Enhancing Educational Materials: Integrating Emojis and AI Models into Learning Management Systems. Computers, Materials and Continua, 83(2), 3075–3095. https://doi.org/https://doi.org/10.32604/cmc.2025.062360
Bucher, S. F., Gebauer, S., Grieb, J., Körschens, M., Müller, J., Ritz, C. M., Rajendran, R., Weiland, C., Wesche, K., Victor, K., & Römermann, C. (2025). Collectomics in plant biodiversity research ? looking into the past to understand the present and shape the future. Basic and Applied Ecology, 88, 1–8. https://doi.org/https://doi.org/10.1016/j.baae.2025.07.002
Burghardt, M. B. T.-R. M. in S. S. (2025). Digital humanities. Elsevier. https://doi.org/https://doi.org/10.1016/B978-0-443-26629-4.00016-2
Chapinal-Heras, D., & Díaz-Sánchez, C. (2024). A review of AI applications in human sciences research. Digital Applications in Archaeology and Cultural Heritage, 32, e00323. https://doi.org/https://doi.org/10.1016/j.daach.2024.e00323
Chen, X., Xie, H., Zou, D., Cheng, G., Tao, X., & Lee Wang, F. (2025). Perceived MOOC satisfaction: A review mining approach using machine learning and fine-tuned BERTs. Computers and Education: Artificial Intelligence, 8, 100366. https://doi.org/https://doi.org/10.1016/j.caeai.2025.100366
Chu, D., Wan, B., Ni, H., Li, H., Tan, Z., Dai, Y., Wan, Z., Tang, T., & Zhou, S. (2025). GeoSMIE: An event extraction framework for Document-Level spatial morphological information extraction. Expert Systems with Applications, 268, 126378. https://doi.org/https://doi.org/10.1016/j.eswa.2024.126378
Dawson, M., & Lewin, J. (2025). Automated classification and mapping for alluvial geomorphic units: Current approaches and future directions. Earth-Science Reviews, 271, 105292. https://doi.org/https://doi.org/10.1016/j.earscirev.2025.105292
Deshmukh, P. V, & Shahade, A. K. (2026). Elevating human-machine collaboration in NLP for enhanced content creation and decision support. Data & Knowledge Engineering, 161, 102505. https://doi.org/https://doi.org/10.1016/j.datak.2025.102505
Dimililer, K., Erdem, B. D., Kayali, D., & Priscilla Olawale, O. (2024). Chapter 3 - Image preprocessing phase with artificial intelligence methods on medical images. In W. A. Zgallai & D. U. B. T.-A. I. and I. P. in M. I. Ozsahin (Eds.), Developments in Biomedical Engineering and Bioelectronics (pp. 51–82). Academic Press. https://doi.org/https://doi.org/10.1016/B978-0-323-95462-4.00003-0
Dong, W., Wang, W., Han, X., Huang, J., Li, L., & Huang, Y. (2025). Can GPT-4 provide human-level emotion support? Insights from machine learning-based evaluation framework. Computers in Biology and Medicine, 196, 110789. https://doi.org/https://doi.org/10.1016/j.compbiomed.2025.110789
Eli, E., Wang, D., Xu, W., Mamat, H., Aysa, A., & Ubul, K. (2025). A comprehensive review of non-Latin natural scene text detection and recognition techniques. Engineering Applications of Artificial Intelligence, 156, 111107. https://doi.org/https://doi.org/10.1016/j.engappai.2025.111107
Ganga, B., B.T., L., & K.R., V. (2024). Object detection and crowd analysis using deep learning techniques: Comprehensive review and future directions. Neurocomputing, 597, 127932. https://doi.org/https://doi.org/10.1016/j.neucom.2024.127932
Gao, Q., & Huang, J. (2025). Design and implementation of classical literature sentiment analysis system based on ensemble learning and graph neural network. International Journal of Cognitive Computing in Engineering, 6, 603–616. https://doi.org/https://doi.org/10.1016/j.ijcce.2025.05.004
Gupta, A., & Shivers-McNair, A. (2024). “Wayfinding” through the AI wilderness: Mapping rhetorics of ChatGPT prompt writing on X (formerly Twitter) to promote critical AI literacies. Computers and Composition, 74, 102882. https://doi.org/https://doi.org/10.1016/j.compcom.2024.102882
Harisanty, D., Obille, K. L. B., Anna, N. E. V., Purwanti, E., & Retrialisca, F. (2024). Cultural heritage preservation in the digital age, harnessing artificial intelligence for the future: a bibliometric analysis. Digital Library Perspectives, 40(4), 609–630. https://doi.org/https://doi.org/10.1108/DLP-01-2024-0018
Jahanshahi, H., & Zhu, Z. H. (2024). Review of machine learning in robotic grasping control in space application. Acta Astronautica, 220, 37–61. https://doi.org/https://doi.org/10.1016/j.actaastro.2024.04.012
Jia, J. (2025). Mining Educational Value of Visualization of Sentiment Classification in Ancient Chinese Literature: International Journal of Web-Based Learning and Teaching Technologies, 20(1). https://doi.org/https://doi.org/10.4018/IJWLTT.385603
Korte, J. W., Bartsch, S., Beckmann, R., El Baff, R., & Hecking, T. (2025). The Different Artificial Intelligences of Science and Wikipedia. Technology in Society, 83, 103034. https://doi.org/https://doi.org/10.1016/j.techsoc.2025.103034
Liu, T., Deng, L., & Zhou, Y. (2026). Unpacking self-regulation and social interaction in “Study With Me” videos through large-scale analytics. Computers & Education, 241, 105488. https://doi.org/https://doi.org/10.1016/j.compedu.2025.105488
Mahmoudi-Dehaki, M., & Nasr-Esfahani, N. (2025). Automated vs. manual linguistic annotation for assessing pragmatic competence in English classes. Research Methods in Applied Linguistics, 4(3), 100253. https://doi.org/https://doi.org/10.1016/j.rmal.2025.100253
Moreno-Ortiz, A. (2025). Corpus sense: A comprehensive tool for advanced text and discourse exploration. Applied Corpus Linguistics, 5(3), 100145. https://doi.org/https://doi.org/10.1016/j.acorp.2025.100145
Nashir, W. A., Mohsen, A. M., Al-Shargabi, A. A., Nour, M. K., & Al-onazi, B. B. (2025). A complete, multi-layered quranic treebank dataset with hybrid syntactic annotations for classical arabic processing. Data in Brief, 62, 111940. https://doi.org/https://doi.org/10.1016/j.dib.2025.111940
Ong, J. C. L., Seng, B. J. J., Law, J. Z. F., Low, L. L., Kwa, A. L. H., Giacomini, K. M., & Ting, D. S. W. (2024). Artificial intelligence, ChatGPT, and other large language models for social determinants of health: Current state and future directions. Cell Reports Medicine, 5(1), 101356. https://doi.org/https://doi.org/10.1016/j.xcrm.2023.101356
Quiñonez-Baca, L.-C., Ramirez-Alonso, G., Guzman-Pando, A., Camarillo-Cisneros, J., & Lopez-Flores, D. R. (2025). Advances in meta-learning and zero-shot learning for multi-label classification: A review. Digital Signal Processing, 163, 105220. https://doi.org/https://doi.org/10.1016/j.dsp.2025.105220
Sarkar Farshi, G. (2025). MLNA – A Python package and app for MultiLingual Network Analysis. SoftwareX, 30, 102104. https://doi.org/https://doi.org/10.1016/j.softx.2025.102104
Schettino, L., Origlia, A., & Cutugno, F. (2024). Though this be hesitant, yet there is method in ’t: Effects of disfluency patterns in neural speech synthesis for cultural heritage presentations. Computer Speech & Language, 85, 101585. https://doi.org/https://doi.org/10.1016/j.csl.2023.101585
Schuegraf, P., Shan, J., & Bittner, K. (2024). PLANES4LOD2: Reconstruction of LoD-2 building models using a depth attention-based fully convolutional neural network. ISPRS Journal of Photogrammetry and Remote Sensing, 211, 425–437. https://doi.org/https://doi.org/10.1016/j.isprsjprs.2024.04.015
Sharma, R., & Kukreja, V. (2024). Image segmentation, classification and recognition methods for comics: A decade systematic literature review. Engineering Applications of Artificial Intelligence, 131, 107715. https://doi.org/https://doi.org/10.1016/j.engappai.2023.107715
Shen, D. (2025). Exploring Digital Visualization Techniques for Ancient Literary Works in the Digital Humanities. International Journal of Information System Modeling and Design, 16(1). https://doi.org/https://doi.org/10.4018/IJISMD.389171
Singh, P., Agarwal, A., & Kumar, P. (2025). Navigating the 6G landscape: A systematic review of technology integration, implementation challenges, and strategic solutions. Computer Networks, 272, 111663. https://doi.org/https://doi.org/10.1016/j.comnet.2025.111663
Sree, K. D., Geethika, G., Nair, A. R., Gupta, D., & Veena, G. (2025). Named Entity Recognition for Telugu and Kannada on Naamapadam Dataset Using Various Machine Learning and Deep Learning Algorithms. Procedia Computer Science, 258, 3381–3392. https://doi.org/https://doi.org/10.1016/j.procs.2025.04.595
Stacchio, L., Garzarella, S., Cascarano, P., De Filippo, A., Cervellati, E., & Marfia, G. (2024). DanXe: An extended artificial intelligence framework to analyze and promote dance heritage. Digital Applications in Archaeology and Cultural Heritage, 33, e00343. https://doi.org/https://doi.org/10.1016/j.daach.2024.e00343
Tang, X., Wang, L., & Wang, J. (2026). Language model collaboration for relation extraction from classical Chinese historical documents. Information Processing & Management, 63(1), 104286. https://doi.org/https://doi.org/10.1016/j.ipm.2025.104286
Thakre, B., Yadav, U., & Bondre, S. V. (2025). Chapter 8 - Deep reinforcement learning in healthcare and biomedical application (S. Mahajan & A. K. B. T.-I. in B. E. Pandit (eds.); pp. 241–299). Academic Press. https://doi.org/https://doi.org/10.1016/B978-0-443-30146-9.00008-3
Thammastitkul, A. (2025). Optimizing AI-generated image metadata with hybrid color analysis and semantic keyword structuring. Egyptian Informatics Journal, 31, 100775. https://doi.org/https://doi.org/10.1016/j.eij.2025.100775
Vesalainen, A., Tolonen, M., & Ruotsalainen, L. (2024). Document Layout Error Rate (DLER) metric to evaluate image segmentation methods. Machine Learning with Applications, 18, 100606. https://doi.org/https://doi.org/10.1016/j.mlwa.2024.100606
Wang, H., He, M., Zhu, M., & Liu, G. (2025). WCG-VMamba: A multi-modal classification model for corn disease. Computers and Electronics in Agriculture, 230, 109835. https://doi.org/https://doi.org/10.1016/j.compag.2024.109835
Wang, Y., He, Y., Wang, J., Li, K., Sun, L., Yin, J., Zhang, M., & Wang, X. (2025). Enhancing intent understanding for ambiguous prompt: A human–machine co-adaption strategy. Neurocomputing, 646, 130415. https://doi.org/https://doi.org/10.1016/j.neucom.2025.130415
Wang, Z., & Chai, J. (2024). On Metaphor Translation into English Based on Artificial Intelligence. Procedia Computer Science, 247, 1359–1365. https://doi.org/https://doi.org/10.1016/j.procs.2024.10.162
Wu, F., & Tang, W. (2025). An Indoor Design Assistance System Integrating Depth Sensors and Computer Graphics. International Journal of E-Collaboration, 21(1). https://doi.org/https://doi.org/10.4018/IJeC.389057
Xia, K., Hu, Y., Cai, S., Lin, M., Lu, M., Lu, H., Ye, Y., Lin, F., Gao, L., Xia, Q., Tian, R., Lin, W., Xie, L., Tan, D., Lu, Y., Lin, X., Yang, X., Zhong, L., Xu, L., … Xu, H. (2025). GastritisMIL: An interpretable deep learning model for the comprehensive histological assessment of chronic gastritis. Patterns, 6(8), 101286. https://doi.org/https://doi.org/10.1016/j.patter.2025.101286
Xing, Y., Gan, W., Chen, Q., & Yu, P. S. (2025). AI-generated content in landscape architecture: A survey. AI Open. https://doi.org/https://doi.org/10.1016/j.aiopen.2025.10.002
Yang, X., Bahadur Bist, R., Paneru, B., Liu, T., Applegate, T., Ritz, C., Kim, W., Regmi, P., & Chai, L. (2024). Computer Vision-Based cybernetics systems for promoting modern poultry Farming: A critical review. Computers and Electronics in Agriculture, 225, 109339. https://doi.org/https://doi.org/10.1016/j.compag.2024.109339
Yuan, F., Wang, G., Huang, Q., & Li, X. (2025). A newton interpolation network for smoke semantic segmentation. Pattern Recognition, 159, 111119. https://doi.org/https://doi.org/10.1016/j.patcog.2024.111119
Zafar, M. B., Ali, H., & Yasin, T. (2025). Reimagining human creativity and learning in the age of generative AI: A multi-method meta-thematic synthesis. Next Research, 2(4), 100802. https://doi.org/https://doi.org/10.1016/j.nexres.2025.100802
Zeng, X., & Liu, S. (2024). Research on the application of knowledge mapping and knowledge structure construction based on adaptive learning model. Expert Systems with Applications, 249, 123400. https://doi.org/https://doi.org/10.1016/j.eswa.2024.123400
Zhai, Y. (2025). Research on library management paradigm in the AIGC era: Theoretical construction and practical exploration. The Journal of Academic Librarianship, 51(4), 103085. https://doi.org/https://doi.org/10.1016/j.acalib.2025.103085
Zhai, Z., Wang, Z., Xu, L., Zhang, L., Zhang, Y., Yin, J., Zeng, P., Li, C., Sun, T., & Jiang, T. (2025). A systematic review of computer-aided acupoint localization. IScience, 28(11), 113708. https://doi.org/https://doi.org/10.1016/j.isci.2025.113708
Zhang, C., Wang, Y., Zhao, Z., Chen, X., Ye, H., Liu, S., Yang, Y., & Peng, K. (2024). Performance-driven closed-loop optimization and control for smart manufacturing processes in the cloud-edge-device collaborative architecture: A review and new perspectives. Computers in Industry, 162, 104131. https://doi.org/https://doi.org/10.1016/j.compind.2024.104131
Zhang, L., Cao, H., & Yan, Q. (2024). Insights from cross-cultural memes: An empirical study on instagram and Douban. Telematics and Informatics, 94, 102186. https://doi.org/https://doi.org/10.1016/j.tele.2024.102186
Zhang, X., Hong, S., Luo, H., Jiang, Z., & Yuan, F. (2025). A dual-level part distillation network for fine-grained visual categorization. Signal Processing: Image Communication, 138, 117383. https://doi.org/https://doi.org/10.1016/j.image.2025.117383
Zhang, Y., Wang, Y., Sheng, Q. Z., Yao, L., Chen, H., Wang, K., Mahmood, A., Zhang, W. E., Zaib, M., Sagar, S., & Zhao, R. (2025). Deep learning meets bibliometrics: A survey of citation function classification. Journal of Informetrics, 19(1), 101608. https://doi.org/https://doi.org/10.1016/j.joi.2024.101608
Zhao, B., & Pan, D. (2025). Immersive e-learning application in intelligent teaching of English composition based on neural network algorithm. Entertainment Computing, 52, 100710. https://doi.org/https://doi.org/10.1016/j.entcom.2024.100710
Zhao, T., Wang, S., Ouyang, C., Chen, M., Liu, C., Zhang, J., Yu, L., Wang, F., Xie, Y., Li, J., Wang, F., Grunwald, S., Wong, B. M., Zhang, F., Qian, Z., Xu, Y., Yu, C., Han, W., Sun, T., … Wang, L. (2024). Artificial intelligence for geoscience: Progress, challenges, and perspectives. The Innovation, 5(5), 100691. https://doi.org/https://doi.org/10.1016/j.xinn.2024.100691
Zhu, W., & Yuan, F. (2025). Multimodal learning with feature fusion transformer for image captioning. Displays, 90, 103126. https://doi.org/https://doi.org/10.1016/j.displa.2025.103126
Zhu, Y. (2024). A knowledge graph and BiLSTM-CRF-enabled intelligent adaptive learning model and its potential application. Alexandria Engineering Journal, 91, 305–320. https://doi.org/https://doi.org/10.1016/j.aej.2024.02.011
Zinnen, M., Madhu, P., Leemans, I., Bell, P., Hussian, A., Tran, H., Hürriyeto?lu, A., Maier, A., & Christlein, V. (2024). Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset. Expert Systems with Applications, 255, 124576. https://doi.org/https://doi.org/10.1016/j.eswa.2024.124576
Authors
Copyright (c) 2025 Muhammad Taufiq, Javlonbek Khamraev, Giovanni Rossi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.