A Deep Learning–based Framework for Certificate Information Extraction and Authentication

Doan Van Thang; Nguyen Ngoc Dung

doi:10.36548/jiip.2025.3.024

Open Access

https://doi.org/10.36548/jiip.2025.3.024

Vol. 7, No. 3 (2025)

Published: 30 September, 2025

Pages: 1037-1058

Doan Van Thang
Faculty of Information Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City, Viet Nam

Nguyen Ngoc Dung
Faculty of Information Technology, Industrial University of Ho Chi Minh City, Ho Chi Minh City, Viet Nam

How to Cite

Thang, Doan Van, and Nguyen Ngoc Dung. 2025. “A Deep Learning–based Framework for Certificate Information Extraction and Authentication”. Journal of Innovative Image Processing 7 (3): 1037-58. https://doi.org/10.36548/jiip.2025.3.024.

Keywords

Information Extraction

Object Detection

Deep Learning

OCR

Abstract

This paper presents an end-to-end deep learning system for automatically extracting key information from structured certificate documents. The models were trained on 1,784 manually labeled certificate images. Information fields are localized with the YOLOv11 and YOLOv12 object detectors. YOLOv11 achieves precision = 0.987, recall = 0.996, mAP@50 = 0.981, and mAP@50–95 = 0.678, while YOLOv12 achieves precision = 0.992, recall = 0.998, mAP@50 = 0.986, and mAP@50–95 = 0.690. The experimental results show that YOLOv12 is the more accurate detector; the reported inference times are 19.1 ms vs. 224.4 ms per image. To extract and verify certificate information reliably, the paper then proposes a pipeline that integrates object detection, optical character recognition (OCR), and comparison against a database. Future work could use instance segmentation, multimodal learning, and tailored OCR enhancement to further improve the system's performance on other categories of documents.
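The final stage of the pipeline described above, comparing the extracted fields against a database, can be illustrated with a minimal sketch. It is not the authors' implementation: the detection and OCR stages are assumed to have already produced labeled text, and the names `DetectedField`, `normalize`, `verify_certificate`, the field labels, and the in-memory `registry` dictionary are all hypothetical stand-ins chosen for illustration.

```python
import unicodedata
from dataclasses import dataclass


@dataclass
class DetectedField:
    """One field found on a certificate (hypothetical structure)."""
    label: str  # field class predicted by the detector, e.g. "name"
    text: str   # text the OCR engine read inside the detected box


def normalize(value: str) -> str:
    """Fold case, strip combining accents, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", value)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.casefold().split())


def verify_certificate(fields, registry, key_label="certificate_no"):
    """Compare extracted fields with the stored record.

    `registry` is an assumed stand-in for the database: a dict that maps a
    normalized certificate number to the expected field values.
    Returns (is_authentic, mismatched_labels).
    """
    extracted = {f.label: normalize(f.text) for f in fields}
    record = registry.get(extracted.get(key_label, ""))
    if record is None:
        # No record with this certificate number exists.
        return False, [key_label]
    mismatches = [
        label
        for label, expected in record.items()
        if extracted.get(label) != normalize(expected)
    ]
    return not mismatches, mismatches


# Illustrative data only; not taken from the paper's dataset.
registry = {"abc-001": {"name": "Nguyễn Văn A", "issue_date": "2024-06-30"}}
fields = [
    DetectedField("certificate_no", "ABC-001"),
    DetectedField("name", "NGUYEN  VAN A"),
    DetectedField("issue_date", "2024-06-30"),
]
result = verify_certificate(fields, registry)
```

Normalizing both sides before comparison is a design choice made here so that OCR differences in case, diacritics, or spacing do not cause a mismatch; a production system would likely need a tolerance for OCR character errors as well, which this sketch omits.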
