Journal of Trends in Computer Science and Smart Technology is accepted for inclusion in Scopus.

Volume - 7 | Issue - 3 | September 2025

Cross-Lingual Attention-based Mechanism for Speech Emotion Recognition
Open Access
Tummala Vamsi Aditya, Swarna Kuchibhotla, Devi Venkata Revathi Poduri, Hima Deepthi Vankayalapati
Pages: 331-356
Cite this article
Aditya, Tummala Vamsi, Swarna Kuchibhotla, Devi Venkata Revathi Poduri, and Hima Deepthi Vankayalapati. "Cross-Lingual Attention-based Mechanism for Speech Emotion Recognition." Journal of Trends in Computer Science and Smart Technology 7, no. 3 (2025): 331-356
Published
16 August, 2025
Abstract

Speech emotion recognition (SER) is an emerging area of emotion detection within affective computing. It focuses on emotional speech recordings, that is, spoken words produced during verbal communication. Emotions are inferred from the acoustic properties of speech and modeled with machine learning. We ran a series of experiments on the RAVDESS, TESS, SAVEE, and EMO-DB datasets to test whether a Recurrent Neural Network (RNN) and CLAF-SER, the Cross-Lingual Attention-Based Adversarial Framework for SER, can detect and classify emotions such as sadness, anger, happiness, neutrality, and fear. Features such as MFCC, LPCC, pitch, energy, and chroma were extracted before training the RNN. With the RNN, TESS achieved the highest accuracy of the four datasets. However, CLAF-SER gives the best performance when all datasets are combined.
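To make the feature-extraction step more concrete, the following minimal sketch computes two of the features named above, short-time energy and pitch, frame by frame. It is an illustration only and not the authors' pipeline: the function names, the frame and hop sizes, the autocorrelation pitch estimator, and the synthetic 200 Hz test tone are all assumptions introduced here. MFCC, LPCC, and chroma are omitted because they are normally computed with a dedicated signal-processing library.

```python
import math


def frame_signal(samples, frame_len, hop_len):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop_len)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def autocorr_pitch(frame, sample_rate, fmin=50.0, fmax=500.0):
    """Estimate pitch in Hz as the lag with the largest autocorrelation.

    Only lags corresponding to frequencies between fmin and fmax are searched.
    Returns 0.0 when no lag has a positive autocorrelation.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag, best_val = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        val = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return sample_rate / best_lag if best_lag else 0.0


# Hypothetical input: a synthetic 200 Hz tone stands in for a speech recording.
SAMPLE_RATE = 8000
tone = [0.5 * math.sin(2 * math.pi * 200.0 * n / SAMPLE_RATE) for n in range(2000)]

# Assumed framing: 50 ms frames (400 samples) with a 25 ms hop (200 samples).
frames = frame_signal(tone, frame_len=400, hop_len=200)

# One (energy, pitch) pair per frame; a sequence like this is the kind of
# per-frame input a recurrent model could consume.
features = [(short_time_energy(f), autocorr_pitch(f, SAMPLE_RATE)) for f in frames]

for energy, pitch in features:
    print(f"energy={energy:.4f}  pitch={pitch:.1f} Hz")
```

Each frame yields one small feature vector, so an utterance becomes a sequence of vectors. Real recordings would need additional handling that this sketch leaves out, such as silence and unvoiced frames.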

Keywords

Speech Emotion Recognition (SER); RNN (Recurrent Neural Network); CLAF-SER (Cross-Lingual Attention-based Adversarial Framework for SER); SAVEE (Surrey Audio-Visual Expressed Emotion Database); RAVDESS (Ryerson Audio-Visual Database of Emotional Speech and Song); TESS (Toronto Emotional Speech Set); EMO-DB (Berlin Database of Emotional Speech); MFCC (Mel-Frequency Cepstral Coefficients); LPCC (Linear Prediction-based Cepstral Coefficients); Pitch; Energy; Chroma
