DIGITAL QUR’AN ANNOTATION THROUGH DEEP LEARNING: TOWARD AN INTELLIGENT TAFSIR ECOSYSTEM

Muhammad Taufiq; Roswati Nurdin; Cut Nurul Fajri Harlita

doi:10.70177/ijonis.v2i5.2572

Muhammad Taufiq ⁽¹⁾, Roswati Nurdin ⁽²⁾, Cut Nurul Fajri Harlita ⁽³⁾

(1) UIN Sjech M. Djamil Djambek Bukittinggi,

Indonesia,

(2) Institut Agama Islam Negeri Parepare,

Indonesia,

(3) Universitas Islam Negeri Ar-Raniry Banda Aceh,

Indonesia

https://doi.org/10.70177/ijonis.v2i5.2572

Issue
Vol. 2 No. 5 (2025)

Submitted
3 April 2025

Published
15 October 2025

Keywords:

Deep Learning, Semantic Annotation, Digital Humanities

PDF

Abstract

Traditional Qur’anic exegesis (tafsir) encompasses vast, complex textual datasets. Current digital platforms provide static access but lack the semantic understanding to intelligently connect, compare, and synthesize diverse interpretative traditions, limiting deep analysis for both scholars and lay users. This study aims to develop and evaluate a deep learning framework for the semantic annotation of Qur’anic verses. We seek to automate the classification of verses based on thematic content, legal rulings (ahkam), and intertextual references to classical tafsir, thereby prototyping an intelligent tafsir ecosystem. A gold-standard annotated corpus was constructed by mapping Qur’anic verses to thematic categories from authoritative tafsir. A custom transformer-based (BERT) model was trained on this corpus to perform multi-label classification and evaluated using F1-score, precision, and recall. The model demonstrated high efficacy, achieving a macro F1-score of 0.92 in thematic annotation and 0.89 in identifying intertextual links. The system accurately predicts and suggests relevant tafsir passages for un-annotated verses, significantly outperforming traditional keyword-based search methods in relevance. This study validates deep learning as a tool for sophisticated digital Qur’an annotation. The framework provides the foundation for an intelligent tafsir ecosystem, capable of offering dynamic, contextual, and deeply networked access to Qur’anic knowledge.

Full text article

Generated from XML file

References

Abbott, E. E., Apakama, D., Richardson, L. D., Chan, L., & Nadkarni, G. N. (2024). Leveraging Artificial Intelligence and Data Science for Integration of Social Determinants of Health in Emergency Medicine: Scoping Review. JMIR Medical Informatics, 12. https://doi.org/https://doi.org/10.2196/57124

Almashour, M., Aldamen, H. A. K., & Jarrah, M. (2025). Algorithmic feedback and multilingual identity: Translanguaging practices in Jordanian EFL academic writing. Social Sciences & Humanities Open, 12, 102016. https://doi.org/https://doi.org/10.1016/j.ssaho.2025.102016

Alotaibi, B., Almutarie, A., Alotaibi, S., & Alotaibi, M. (2025). Improving Fashion Sentiment Detection on X through Hybrid Transformers and RNNs. Computers, Materials and Continua, 84(3), 4451–4467. https://doi.org/https://doi.org/10.32604/cmc.2025.066050

Alshaya, S. A. (2025). Enhancing Educational Materials: Integrating Emojis and AI Models into Learning Management Systems. Computers, Materials and Continua, 83(2), 3075–3095. https://doi.org/https://doi.org/10.32604/cmc.2025.062360

Bucher, S. F., Gebauer, S., Grieb, J., Körschens, M., Müller, J., Ritz, C. M., Rajendran, R., Weiland, C., Wesche, K., Victor, K., & Römermann, C. (2025). Collectomics in plant biodiversity research ? looking into the past to understand the present and shape the future. Basic and Applied Ecology, 88, 1–8. https://doi.org/https://doi.org/10.1016/j.baae.2025.07.002

Burghardt, M. B. T.-R. M. in S. S. (2025). Digital humanities. Elsevier. https://doi.org/https://doi.org/10.1016/B978-0-443-26629-4.00016-2

Chapinal-Heras, D., & Díaz-Sánchez, C. (2024). A review of AI applications in human sciences research. Digital Applications in Archaeology and Cultural Heritage, 32, e00323. https://doi.org/https://doi.org/10.1016/j.daach.2024.e00323

Chen, X., Xie, H., Zou, D., Cheng, G., Tao, X., & Lee Wang, F. (2025). Perceived MOOC satisfaction: A review mining approach using machine learning and fine-tuned BERTs. Computers and Education: Artificial Intelligence, 8, 100366. https://doi.org/https://doi.org/10.1016/j.caeai.2025.100366

Chu, D., Wan, B., Ni, H., Li, H., Tan, Z., Dai, Y., Wan, Z., Tang, T., & Zhou, S. (2025). GeoSMIE: An event extraction framework for Document-Level spatial morphological information extraction. Expert Systems with Applications, 268, 126378. https://doi.org/https://doi.org/10.1016/j.eswa.2024.126378

Dawson, M., & Lewin, J. (2025). Automated classification and mapping for alluvial geomorphic units: Current approaches and future directions. Earth-Science Reviews, 271, 105292. https://doi.org/https://doi.org/10.1016/j.earscirev.2025.105292

Deshmukh, P. V, & Shahade, A. K. (2026). Elevating human-machine collaboration in NLP for enhanced content creation and decision support. Data & Knowledge Engineering, 161, 102505. https://doi.org/https://doi.org/10.1016/j.datak.2025.102505

Dimililer, K., Erdem, B. D., Kayali, D., & Priscilla Olawale, O. (2024). Chapter 3 - Image preprocessing phase with artificial intelligence methods on medical images. In W. A. Zgallai & D. U. B. T.-A. I. and I. P. in M. I. Ozsahin (Eds.), Developments in Biomedical Engineering and Bioelectronics (pp. 51–82). Academic Press. https://doi.org/https://doi.org/10.1016/B978-0-323-95462-4.00003-0

Dong, W., Wang, W., Han, X., Huang, J., Li, L., & Huang, Y. (2025). Can GPT-4 provide human-level emotion support? Insights from machine learning-based evaluation framework. Computers in Biology and Medicine, 196, 110789. https://doi.org/https://doi.org/10.1016/j.compbiomed.2025.110789

Eli, E., Wang, D., Xu, W., Mamat, H., Aysa, A., & Ubul, K. (2025). A comprehensive review of non-Latin natural scene text detection and recognition techniques. Engineering Applications of Artificial Intelligence, 156, 111107. https://doi.org/https://doi.org/10.1016/j.engappai.2025.111107

Ganga, B., B.T., L., & K.R., V. (2024). Object detection and crowd analysis using deep learning techniques: Comprehensive review and future directions. Neurocomputing, 597, 127932. https://doi.org/https://doi.org/10.1016/j.neucom.2024.127932

Gao, Q., & Huang, J. (2025). Design and implementation of classical literature sentiment analysis system based on ensemble learning and graph neural network. International Journal of Cognitive Computing in Engineering, 6, 603–616. https://doi.org/https://doi.org/10.1016/j.ijcce.2025.05.004

Gupta, A., & Shivers-McNair, A. (2024). “Wayfinding” through the AI wilderness: Mapping rhetorics of ChatGPT prompt writing on X (formerly Twitter) to promote critical AI literacies. Computers and Composition, 74, 102882. https://doi.org/https://doi.org/10.1016/j.compcom.2024.102882

Harisanty, D., Obille, K. L. B., Anna, N. E. V., Purwanti, E., & Retrialisca, F. (2024). Cultural heritage preservation in the digital age, harnessing artificial intelligence for the future: a bibliometric analysis. Digital Library Perspectives, 40(4), 609–630. https://doi.org/https://doi.org/10.1108/DLP-01-2024-0018

Jahanshahi, H., & Zhu, Z. H. (2024). Review of machine learning in robotic grasping control in space application. Acta Astronautica, 220, 37–61. https://doi.org/https://doi.org/10.1016/j.actaastro.2024.04.012

Jia, J. (2025). Mining Educational Value of Visualization of Sentiment Classification in Ancient Chinese Literature: International Journal of Web-Based Learning and Teaching Technologies, 20(1). https://doi.org/https://doi.org/10.4018/IJWLTT.385603

Korte, J. W., Bartsch, S., Beckmann, R., El Baff, R., & Hecking, T. (2025). The Different Artificial Intelligences of Science and Wikipedia. Technology in Society, 83, 103034. https://doi.org/https://doi.org/10.1016/j.techsoc.2025.103034

Liu, T., Deng, L., & Zhou, Y. (2026). Unpacking self-regulation and social interaction in “Study With Me” videos through large-scale analytics. Computers & Education, 241, 105488. https://doi.org/https://doi.org/10.1016/j.compedu.2025.105488

Mahmoudi-Dehaki, M., & Nasr-Esfahani, N. (2025). Automated vs. manual linguistic annotation for assessing pragmatic competence in English classes. Research Methods in Applied Linguistics, 4(3), 100253. https://doi.org/https://doi.org/10.1016/j.rmal.2025.100253

Moreno-Ortiz, A. (2025). Corpus sense: A comprehensive tool for advanced text and discourse exploration. Applied Corpus Linguistics, 5(3), 100145. https://doi.org/https://doi.org/10.1016/j.acorp.2025.100145

Nashir, W. A., Mohsen, A. M., Al-Shargabi, A. A., Nour, M. K., & Al-onazi, B. B. (2025). A complete, multi-layered quranic treebank dataset with hybrid syntactic annotations for classical arabic processing. Data in Brief, 62, 111940. https://doi.org/https://doi.org/10.1016/j.dib.2025.111940

Ong, J. C. L., Seng, B. J. J., Law, J. Z. F., Low, L. L., Kwa, A. L. H., Giacomini, K. M., & Ting, D. S. W. (2024). Artificial intelligence, ChatGPT, and other large language models for social determinants of health: Current state and future directions. Cell Reports Medicine, 5(1), 101356. https://doi.org/https://doi.org/10.1016/j.xcrm.2023.101356

Quiñonez-Baca, L.-C., Ramirez-Alonso, G., Guzman-Pando, A., Camarillo-Cisneros, J., & Lopez-Flores, D. R. (2025). Advances in meta-learning and zero-shot learning for multi-label classification: A review. Digital Signal Processing, 163, 105220. https://doi.org/https://doi.org/10.1016/j.dsp.2025.105220

Sarkar Farshi, G. (2025). MLNA – A Python package and app for MultiLingual Network Analysis. SoftwareX, 30, 102104. https://doi.org/https://doi.org/10.1016/j.softx.2025.102104

Schettino, L., Origlia, A., & Cutugno, F. (2024). Though this be hesitant, yet there is method in ’t: Effects of disfluency patterns in neural speech synthesis for cultural heritage presentations. Computer Speech & Language, 85, 101585. https://doi.org/https://doi.org/10.1016/j.csl.2023.101585

Schuegraf, P., Shan, J., & Bittner, K. (2024). PLANES4LOD2: Reconstruction of LoD-2 building models using a depth attention-based fully convolutional neural network. ISPRS Journal of Photogrammetry and Remote Sensing, 211, 425–437. https://doi.org/https://doi.org/10.1016/j.isprsjprs.2024.04.015

Sharma, R., & Kukreja, V. (2024). Image segmentation, classification and recognition methods for comics: A decade systematic literature review. Engineering Applications of Artificial Intelligence, 131, 107715. https://doi.org/https://doi.org/10.1016/j.engappai.2023.107715

Shen, D. (2025). Exploring Digital Visualization Techniques for Ancient Literary Works in the Digital Humanities. International Journal of Information System Modeling and Design, 16(1). https://doi.org/https://doi.org/10.4018/IJISMD.389171

Singh, P., Agarwal, A., & Kumar, P. (2025). Navigating the 6G landscape: A systematic review of technology integration, implementation challenges, and strategic solutions. Computer Networks, 272, 111663. https://doi.org/https://doi.org/10.1016/j.comnet.2025.111663

Sree, K. D., Geethika, G., Nair, A. R., Gupta, D., & Veena, G. (2025). Named Entity Recognition for Telugu and Kannada on Naamapadam Dataset Using Various Machine Learning and Deep Learning Algorithms. Procedia Computer Science, 258, 3381–3392. https://doi.org/https://doi.org/10.1016/j.procs.2025.04.595

Stacchio, L., Garzarella, S., Cascarano, P., De Filippo, A., Cervellati, E., & Marfia, G. (2024). DanXe: An extended artificial intelligence framework to analyze and promote dance heritage. Digital Applications in Archaeology and Cultural Heritage, 33, e00343. https://doi.org/https://doi.org/10.1016/j.daach.2024.e00343

Tang, X., Wang, L., & Wang, J. (2026). Language model collaboration for relation extraction from classical Chinese historical documents. Information Processing & Management, 63(1), 104286. https://doi.org/https://doi.org/10.1016/j.ipm.2025.104286

Thakre, B., Yadav, U., & Bondre, S. V. (2025). Chapter 8 - Deep reinforcement learning in healthcare and biomedical application (S. Mahajan & A. K. B. T.-I. in B. E. Pandit (eds.); pp. 241–299). Academic Press. https://doi.org/https://doi.org/10.1016/B978-0-443-30146-9.00008-3

Thammastitkul, A. (2025). Optimizing AI-generated image metadata with hybrid color analysis and semantic keyword structuring. Egyptian Informatics Journal, 31, 100775. https://doi.org/https://doi.org/10.1016/j.eij.2025.100775

Vesalainen, A., Tolonen, M., & Ruotsalainen, L. (2024). Document Layout Error Rate (DLER) metric to evaluate image segmentation methods. Machine Learning with Applications, 18, 100606. https://doi.org/https://doi.org/10.1016/j.mlwa.2024.100606

Wang, H., He, M., Zhu, M., & Liu, G. (2025). WCG-VMamba: A multi-modal classification model for corn disease. Computers and Electronics in Agriculture, 230, 109835. https://doi.org/https://doi.org/10.1016/j.compag.2024.109835

Wang, Y., He, Y., Wang, J., Li, K., Sun, L., Yin, J., Zhang, M., & Wang, X. (2025). Enhancing intent understanding for ambiguous prompt: A human–machine co-adaption strategy. Neurocomputing, 646, 130415. https://doi.org/https://doi.org/10.1016/j.neucom.2025.130415

Wang, Z., & Chai, J. (2024). On Metaphor Translation into English Based on Artificial Intelligence. Procedia Computer Science, 247, 1359–1365. https://doi.org/https://doi.org/10.1016/j.procs.2024.10.162

Wu, F., & Tang, W. (2025). An Indoor Design Assistance System Integrating Depth Sensors and Computer Graphics. International Journal of E-Collaboration, 21(1). https://doi.org/https://doi.org/10.4018/IJeC.389057

Xia, K., Hu, Y., Cai, S., Lin, M., Lu, M., Lu, H., Ye, Y., Lin, F., Gao, L., Xia, Q., Tian, R., Lin, W., Xie, L., Tan, D., Lu, Y., Lin, X., Yang, X., Zhong, L., Xu, L., … Xu, H. (2025). GastritisMIL: An interpretable deep learning model for the comprehensive histological assessment of chronic gastritis. Patterns, 6(8), 101286. https://doi.org/https://doi.org/10.1016/j.patter.2025.101286

Xing, Y., Gan, W., Chen, Q., & Yu, P. S. (2025). AI-generated content in landscape architecture: A survey. AI Open. https://doi.org/https://doi.org/10.1016/j.aiopen.2025.10.002

Yang, X., Bahadur Bist, R., Paneru, B., Liu, T., Applegate, T., Ritz, C., Kim, W., Regmi, P., & Chai, L. (2024). Computer Vision-Based cybernetics systems for promoting modern poultry Farming: A critical review. Computers and Electronics in Agriculture, 225, 109339. https://doi.org/https://doi.org/10.1016/j.compag.2024.109339

Yuan, F., Wang, G., Huang, Q., & Li, X. (2025). A newton interpolation network for smoke semantic segmentation. Pattern Recognition, 159, 111119. https://doi.org/https://doi.org/10.1016/j.patcog.2024.111119

Zafar, M. B., Ali, H., & Yasin, T. (2025). Reimagining human creativity and learning in the age of generative AI: A multi-method meta-thematic synthesis. Next Research, 2(4), 100802. https://doi.org/https://doi.org/10.1016/j.nexres.2025.100802

Zeng, X., & Liu, S. (2024). Research on the application of knowledge mapping and knowledge structure construction based on adaptive learning model. Expert Systems with Applications, 249, 123400. https://doi.org/https://doi.org/10.1016/j.eswa.2024.123400

Zhai, Y. (2025). Research on library management paradigm in the AIGC era: Theoretical construction and practical exploration. The Journal of Academic Librarianship, 51(4), 103085. https://doi.org/https://doi.org/10.1016/j.acalib.2025.103085

Zhai, Z., Wang, Z., Xu, L., Zhang, L., Zhang, Y., Yin, J., Zeng, P., Li, C., Sun, T., & Jiang, T. (2025). A systematic review of computer-aided acupoint localization. IScience, 28(11), 113708. https://doi.org/https://doi.org/10.1016/j.isci.2025.113708

Zhang, C., Wang, Y., Zhao, Z., Chen, X., Ye, H., Liu, S., Yang, Y., & Peng, K. (2024). Performance-driven closed-loop optimization and control for smart manufacturing processes in the cloud-edge-device collaborative architecture: A review and new perspectives. Computers in Industry, 162, 104131. https://doi.org/https://doi.org/10.1016/j.compind.2024.104131

Zhang, L., Cao, H., & Yan, Q. (2024). Insights from cross-cultural memes: An empirical study on instagram and Douban. Telematics and Informatics, 94, 102186. https://doi.org/https://doi.org/10.1016/j.tele.2024.102186

Zhang, X., Hong, S., Luo, H., Jiang, Z., & Yuan, F. (2025). A dual-level part distillation network for fine-grained visual categorization. Signal Processing: Image Communication, 138, 117383. https://doi.org/https://doi.org/10.1016/j.image.2025.117383

Zhang, Y., Wang, Y., Sheng, Q. Z., Yao, L., Chen, H., Wang, K., Mahmood, A., Zhang, W. E., Zaib, M., Sagar, S., & Zhao, R. (2025). Deep learning meets bibliometrics: A survey of citation function classification. Journal of Informetrics, 19(1), 101608. https://doi.org/https://doi.org/10.1016/j.joi.2024.101608

Zhao, B., & Pan, D. (2025). Immersive e-learning application in intelligent teaching of English composition based on neural network algorithm. Entertainment Computing, 52, 100710. https://doi.org/https://doi.org/10.1016/j.entcom.2024.100710

Zhao, T., Wang, S., Ouyang, C., Chen, M., Liu, C., Zhang, J., Yu, L., Wang, F., Xie, Y., Li, J., Wang, F., Grunwald, S., Wong, B. M., Zhang, F., Qian, Z., Xu, Y., Yu, C., Han, W., Sun, T., … Wang, L. (2024). Artificial intelligence for geoscience: Progress, challenges, and perspectives. The Innovation, 5(5), 100691. https://doi.org/https://doi.org/10.1016/j.xinn.2024.100691

Zhu, W., & Yuan, F. (2025). Multimodal learning with feature fusion transformer for image captioning. Displays, 90, 103126. https://doi.org/https://doi.org/10.1016/j.displa.2025.103126

Zhu, Y. (2024). A knowledge graph and BiLSTM-CRF-enabled intelligent adaptive learning model and its potential application. Alexandria Engineering Journal, 91, 305–320. https://doi.org/https://doi.org/10.1016/j.aej.2024.02.011

Zinnen, M., Madhu, P., Leemans, I., Bell, P., Hussian, A., Tran, H., Hürriyeto?lu, A., Maier, A., & Christlein, V. (2024). Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset. Expert Systems with Applications, 255, 124576. https://doi.org/https://doi.org/10.1016/j.eswa.2024.124576

Authors

Muhammad Taufiq

UIN Sjech M. Djamil Djambek Bukittinggi

muhammadtaufiq@uinbukittinggi.ac.id (Primary Contact)

Roswati Nurdin

Institut Agama Islam Negeri Parepare

Cut Nurul Fajri Harlita

Universitas Islam Negeri Ar-Raniry Banda Aceh

Taufiq, M., Nurdin, R., & Harlita, C. N. F. (2025). DIGITAL QUR’AN ANNOTATION THROUGH DEEP LEARNING: TOWARD AN INTELLIGENT TAFSIR ECOSYSTEM. International Jornal of Noesantara Islamic Studies, 2(5), 226–241. https://doi.org/10.70177/ijonis.v2i5.2572