Abstract
Cancer is one of the most serious health challenges in the world and hence pathologists need accurate and clinically reliable diagnostic support. Even though deep learning algorithms have proven to be very effective in analyzing histopathology images, most of the models currently are not interpretable and prone to false negatives. In this paper, a hybrid CNN-Transformer model detect breast cancer automatically and it combines the local and global contextual modelling of features to capture the complex tissue patterns. The standard preprocessing and augmentation approaches are used to make experiments on large-scale histopathology image dataset robust. Accuracy, precision, recall, F1-score and AUC-ROC are used to evaluate the proposed model, but the main focus of the model is on recall and reduction of false negatives. Grad-CAM and transformer attention maps as explainable AI methods are used to visualize regions of diagnostic interest to enhance model transparency and clinical trust. The experimental outcomes prove that the hybrid framework is more efficient in comparison to independent CNN and transformer models which provide a sensitive, reliable and interpretable solution to AI-assisted histopathology based breast cancer detection.
References
- Alom, Md Romzan, Fahmid Al Farid, Muhammad Aminur Rahaman, Anichur Rahman, Tanoy Debnath, Abu Saleh Musa Miah, and Sarina Mansor. "An explainable AI-driven deep neural network for accurate breast cancer detection from histopathological and ultrasound images." Scientific Reports 15, no. 1 (2025): 17531.
- Filiot, Alexandre, Ridouane Ghermi, Antoine Olivier, Paul Jacob, Lucas Fidon, Axel Camara, Alice Mac Kain, Charlie Saillard, and Jean-Baptiste Schiratti. "Scaling self-supervised learning for histopathology with masked image modeling." MedRxiv (2023): 2023-07.
- Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani et al. "An image is worth 16x16 words: Transformers for image recognition at scale." arXiv preprint arXiv:2010.11929 (2020).
- Mir, Aqib Nazir, Danish Raza Rizvi, and Md Rizwan Ahmad. "Enhancing histopathological image analysis: an explainable vision transformer approach with comprehensive interpretation methods and evaluation of explanation quality." Engineering Applications of Artificial Intelligence 149 (2025): 110519.
- Wu, Yawen, Michael Cheng, Shuo Huang, Zongxiang Pei, Yingli Zuo, Jianxin Liu, Kai Yang et al. "Recent advances of deep learning for computational histopathology: principles and applications." Cancers 14, no. 5 (2022): 1199.
- Selvaraju, Ramprasaath R., Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. "Grad-CAM: visual explanations from deep networks via gradient-based localization." International journal of computer vision 128, no. 2 (2020): 336-359.
- Siden, Hagia Sofia, and I. Gusti Ngurah Lanang Wijayakusuma. "Implementation of Convolutional Neural Networks (CNN) for Breast Cancer Detection Using ResNet18 Architecture." Journal of Applied Informatics and Computing 9, no. 4 (2025): 1423-1430.
- An, Jianpeng, Yong Wang, Qing Cai, Gang Zhao, Stephan Dooper, Geert Litjens, and Zhongke Gao. "Transformer-based weakly supervised learning for whole slide lung cancer image classification." IEEE Journal of Biomedical and Health Informatics (2024).
- Wang, Xiyue, Sen Yang, Jun Zhang, Minghui Wang, Jing Zhang, Junzhou Huang, Wei Yang, and Xiao Han. "Transpath: Transformer-based self-supervised learning for histopathological image classification." In International Conference on Medical Image Computing and Computer-Assisted Intervention, Cham: Springer International Publishing, (2021): 186-195.
- Kumar, Tajinder, Manoj Arora, Vikram Verma, Sachin Lalar, and Shashi Bhushan. "Pre-Examination of Breast Cancer Dataset Using Exploratory Data Analysis (EDA) Approach." In 2024 International Conference on Computational Intelligence and Computing Applications (ICCICA), vol. 1, IEEE, (2024): 1-7.
- Anandhi, K., and K. Karuppasamy. "Histopathological Cancer Detection Using Deep Convolutional Neural Networks: A Step Toward Transformer-Based Explainable AI Models."
- Anandhi, K., and K. Karuppasamy. "A Comparative Study of CNN, Vision Transformer, and Hybrid CNN–Transformer Models for Histopathology-Based Cancer Detection."
- Campanella, Gabriele, Matthew G. Hanna, Luke Geneslaw, Allen Miraflor, Vitor Werneck Krauss Silva, Klaus J. Busam, Edi Brogi, Victor E. Reuter, David S. Klimstra, and Thomas J. Fuchs. "Clinical-grade computational pathology using weakly supervised deep learning on whole slide images." Nature medicine 25, no. 8 (2019): 1301-1309.
- Stebbing, Richard V., and J. Alison Noble. "Delineating anatomical boundaries using the boundary fragment model." Medical image analysis 17, no. 8 (2013): 1123-1136.
- He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Deep residual learning for image recognition." In Proceedings of the IEEE conference on computer vision and pattern recognition, (2016): 770-778.
- Kaggle. (2018). ‘Histopathologic cancer detection dataset.’ Kaggle. https://www.kaggle.com/competitions/histopathologic-cancer-detection.
