A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images

Venkata B Hangarage; Gururaj Mukarambi

doi:10.36548/jiip.2025.3.021

A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images

Venkata B Hangarage , Gururaj Mukarambi

Open Access

Volume - 7 • Issue - 3 • september 2025

https://doi.org/10.36548/jiip.2025.3.021

976-990 404 PDF

Abstract

Natural Scene Text Detection and Language Identification is a challenging problem in the field of computer vision, due to autonomous video surveillance and the design of an OCR system for natural scene images. The drawback of an autonomous video surveillance and monolingual OCR system is that it will not work efficiently on natural scene images, where text appears in different orientations, backgrounds, and lighting conditions with multilingual scripts. Hence, we proposed a deep learning model, i.e. fine-tuned YOLOv5, for text detection and language identification in bilingual scene images. For testing the proposed (fine-tuned) model, there is no standard ground truth database in the literature. Therefore, we created our own real-time natural scene dataset from the Kalaburagi and Bidar districts in the state of Karnataka. The proposed (fine-tuned) model involves training YOLOv5 on a real-time dataset, and it works with a genetic approach. It produces the anchor boxes for the objects present in the natural scene image. To test the performance of the fine-tuned YOLOv5 model, we employed evaluation metrics like precision, recall and accuracy. The experimental setup demonstrates robustness of the fine-tuned YOLOv5 model for text detection and language identification. We obtained an optimized precision rate of 86.8%, a recall rate of 83.4%, an F1 score of 85%, and an accuracy of 94.4%. The training of 80% and testing of 20% was carried out in the experiment. A comparative analysis of the fine-tuned YOLOv5 model with existing methods found in the literature is carried out, and observed that the fine-tuned YOLOv5 model shows better performance. The novelty of the paper is that the fine-tuned YOLOv5 model and dataset were constrained with a mixture of low-resolution and complex background images.

Cite this article

Chicago APA MLA Vancouver IEEE Harvard BibTeX

Hangarage, Venkata B, and Gururaj Mukarambi. "A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images." Journal of Innovative Image Processing 7, no. 3 (2025): 976-990. doi: 10.36548/jiip.2025.3.021

Copy Citation

Hangarage, V. B., & Mukarambi, G. (2025). A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images. Journal of Innovative Image Processing, 7(3), 976-990. https://doi.org/10.36548/jiip.2025.3.021

Copy Citation

Hangarage, Venkata B, et al. "A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images." Journal of Innovative Image Processing, vol. 7, no. 3, 2025, pp. 976-990. DOI: 10.36548/jiip.2025.3.021.

Copy Citation

Hangarage VB, Mukarambi G. A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images. Journal of Innovative Image Processing. 2025;7(3):976-990. doi: 10.36548/jiip.2025.3.021

Copy Citation

V. B. Hangarage, and G. Mukarambi, "A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images," Journal of Innovative Image Processing, vol. 7, no. 3, pp. 976-990, Sep. 2025, doi: 10.36548/jiip.2025.3.021.

Copy Citation

Hangarage, V.B. and Mukarambi, G. (2025) 'A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images', Journal of Innovative Image Processing, vol. 7, no. 3, pp. 976-990. Available at: https://doi.org/10.36548/jiip.2025.3.021.

Copy Citation

@article{hangarage2025,
  author    = {Venkata B Hangarage and Gururaj Mukarambi},
  title     = {{A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images}},
  journal   = {Journal of Innovative Image Processing},
  volume    = {7},
  number    = {3},
  pages     = {976-990},
  year      = {2025},
  publisher = {IRO Journals},
  doi       = {10.36548/jiip.2025.3.021},
  url       = {https://doi.org/10.36548/jiip.2025.3.021}
}

Copy Citation

Keywords

YOLOv5 SPPF Deep Learning Computer Vision Image Processing

Category	Fee
Article Access Charge	30 USD
Article Processing Charge	400 USD
Annual Subscription Fee	200 USD

A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images

Venkata B Hangarage

Published

23 September, 2025

e-ISSN: 2582-4252
4 issues per year
DOI: https://doi.org/10.36548/jiip

Indexing
Scopus | GoogleScholar | Crossref | MicrosoftAcademic | ScienceGate | J-Gate

Publisher

Inventive Research Organization

Open Access Journal

A Deep Learning Framework for Kannada-English Text Recognition and Language Identification in Natural Scene Images

Venkata B Hangarage

Published

23 September, 2025

e-ISSN: 2582-4252 4 issues per year DOI: https://doi.org/10.36548/jiip

Indexing Scopus | GoogleScholar | Crossref | MicrosoftAcademic | ScienceGate | J-Gate

Publisher Inventive Research Organization

Open Access Journal

e-ISSN: 2582-4252
4 issues per year
DOI: https://doi.org/10.36548/jiip

Indexing
Scopus | GoogleScholar | Crossref | MicrosoftAcademic | ScienceGate | J-Gate

Publisher

Inventive Research Organization