Comparative Analysis of Machine Learning Algorithms for Early Prediction of Parkinson’s Disorder based on Voice Features
Volume-4 | Issue-4

Automated Waste Sorting with Delta Arm and YOLOv8 Detection
Volume-6 | Issue-3

Detection of Fake Job Advertisements using Machine Learning algorithms
Volume-4 | Issue-3

Smart Fashion: A Review of AI Applications in Virtual Try-On & Fashion Synthesis
Volume-3 | Issue-4

AI-Integrated Proctoring System for Online Exams
Volume-4 | Issue-2

Deep Convolution Neural Network Model for Credit-Card Fraud Detection and Alert
Volume-3 | Issue-2

Using Deep Reinforcement Learning For Robot Arm Control
Volume-4 | Issue-3

Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM
Volume-3 | Issue-3

Blockchain-Enabled Federated Learning on Kubernetes for Air Quality Prediction Applications
Volume-3 | Issue-3

An Overview of Artificial Intelligence Ethics: Issues and Solution for Challenges in Different Fields
Volume-5 | Issue-1

Real Time Anomaly Detection Techniques Using PySpark Frame Work
Volume-2 | Issue-1

Deniable Authentication Encryption for Privacy Protection using Blockchain
Volume-3 | Issue-3

Smart Fashion: A Review of AI Applications in Virtual Try-On & Fashion Synthesis
Volume-3 | Issue-4

Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM
Volume-3 | Issue-3

Audio Tagging Using CNN Based Audio Neural Networks for Massive Data Processing
Volume-3 | Issue-4

Frontiers of AI beyond 2030: Novel Perspectives
Volume-4 | Issue-4

Smart Medical Nursing Care Unit based on Internet of Things for Emergency Healthcare
Volume-3 | Issue-4

Early Stage Detection of Crack in Glasses by Hybrid CNN Transformation Approach
Volume-3 | Issue-4

ARTIFICIAL INTELLIGENCE APPLICATION IN SMART WAREHOUSING ENVIRONMENT FOR AUTOMATED LOGISTICS
Volume-1 | Issue-2

Deep Convolution Neural Network Model for Credit-Card Fraud Detection and Alert
Volume-3 | Issue-2

Home / Archives / Volume-7 / Issue-4 / Article-3

Volume - 7 | Issue - 4 | december 2025

Integrating Automatic Speech Recognition and Emotion Detection: A Conformer-XGBoost Framework for Human-Centered Speech Systems Open Access
Mohan Bikram K C.  , Smita Adhikari , Tara Bahadur Thapa  48
Pages: 343-361
Full Article PDF pdf-white-icon
Cite this article
C., Mohan Bikram K, Smita Adhikari, and Tara Bahadur Thapa. "Integrating Automatic Speech Recognition and Emotion Detection: A Conformer-XGBoost Framework for Human-Centered Speech Systems." Journal of Artificial Intelligence and Capsule Networks 7, no. 4 (2025): 343-361
Published
02 December, 2025
Abstract

Advanced speech technology pushes human-machine interaction to a new frontier. Most of the models address this either as a matter of speech-to-text transcription or emotion detection. By integrating an XGBoost-driven emotion classification component with a Conformer-based speech recognition system, an integrated solution has been developed. It will, therefore, strive to transcribe spoken utterances and estimate the emotional condition of the speaker with as much accuracy as possible to improve context-sensitive interaction. The transcription process combines large, multilingual speech corpora. A Conformer architecture captures both short- and long-range temporal dependencies. In this regard, an error rate of 0.322 words and 0.146 characters was achieved in transcription. For emotion recognition, several emotional speech datasets were collected, and various acoustic features were extracted under noisy conditions. Using an XGBoost model, 86.58% accuracy in emotion detection was attained. These results demonstrate the feasibility of integrating speech transcription with emotion recognition and form a basis for the further development of more human-like, empathic, and adaptive voice systems.

Keywords

Automatic Speech Recognition Speech Emotion Recognition Conformer Architecture XGBoost Classifier Human Computer Interaction Multimodal Speech Processing

×

Currently, subscription is the only source of revenue. The subscription resource covers the operating expenses such as web presence, online version, pre-press preparations, and staff wages.

To access the full PDF, please complete the payment process.

Subscription Details

Category Fee
Article Access Charge
15 USD
Open Access Fee Nil
Annual Subscription Fee
200 USD
After payment,
please send an email to irojournals.contact@gmail.com / journals@iroglobal.com requesting article access.
Subscription form: click here