Abstract
Acoustic signal classification issues are addressed in this work using spectral examination, channel extracting the features from the input and machine learning algorithm. This brief article examines the effect of various settings on feature extraction. This feature-level channel combination's accuracy increase is then observed. To categorise things, pattern recognition utilises a variety of classification schemes. "Pattern" refers to the measures that must be categorised with accurate feature extracted. Images and audio signals are among the most common kinds of measurements. The proposed Support Vector Machine (SVM) is used for the necessity of an effective categorization of acoustic signals driven by the continual improvements in multimedia technology. This study uses two machine learning algorithms to enhance audio classification and categorization. The proposed SVM achieves superior performance than the other ML algorithm by spectral features.
References
Lim and Chang, “Enhancing Support Vector Machine-Based Speech/Music Classification using Conditional Maximum a Posteriori Criterion,” Signal Processing, IET, vol. 6, no. 4, pp. 335-340, 2012.
Md. Al Mehedi Hasan and Shamim Ahmad. predSucc-Site: Lysine Succinylation Sites Prediction in Proteins by using Support Vector Machine and Resolving Data Imbalance Issue. International Journal of Computer Applications 182(15):8-13, September 2018.
Gerazov, B.; Ivanovski, Z. Kernel Power Flow Orientation Coefficients for Noise Robust Speech Recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 2015, 23, 407–419.
Venkitaraman, A.; Adiga, A.; Seelamantula, C.S. Auditory-motivated Gammatone wavelet transform. Signal Process. 2014, 94, 608–619.
Gerazov, B.; Ivanovski, Z. Gaussian Power flow Orientation Coefficients for noise-robust speech recognition. In Proceedings of the 22nd European Signal Processing Conference (EUSIPCO), Lisbon, Portugal, 1–5 September 2014; pp. 1467–1471.
Hend Ab. ELLaban, A A Ewees and Elsaeed E AbdElrazek. A Real-Time System for Facial Expression Recognition using Support Vector Machines and k-Nearest Neighbor Classifier. International Journal of Computer Applications 159(8):23-29, February 2017.
Kruspe, D. Zapf, and H. Lukashevich, “Automatic speech/music discrimination for broadcast signals,” in INFORMATIK 2017, M. Eibl and M. Gaedke, Eds. Gesellschaft f ¨ur Informatik, Bonn, 2017, pp. 151–162.
Pikrakis and S. Theodoridis, “Speech-music discrimination: A deep learning perspective,” in 2014 22nd European Signal Processing Conference (EUSIPCO), Sept 2014, pp. 616–620.
K. Khonglah and S. R. M. Prasanna, “Low frequency region of vocal tract information for speech / music classification,” in 2016 IEEE Region 10 Conference (TENCON), Nov 2016, pp. 2593–2597.
Lim and J. h. Chang, “Enhancing support vector machine-based speech/music classification using conditional maximum a posteriori criterion,” IET Signal Processing, vol. 6, no. 4, pp. 335–340, June 2012.
K. Khonglah and S. R. M. Prasanna, “Speech / music classification using vocal tract constriction aspect of speech,” in 2015 Annual IEEE India Conference (INDICON), Dec 2015, pp. 1–6.
Wang, J.C.; Lin, C.H.; Chen, B.W.; Tsai, M.K. Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Trans. Autom. Sci. Eng. 2014, 11, 607–613.
Palo, H.K.; Mohanty, M.N.; Chandra, M. Novel feature extraction technique for child emotion recognition. In Proceedings of the 2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO), Visakhapatnam, India, 24–25 January 2015; pp. 1–5.
Zão, L.; Coelho, R.; Flandrin, P. Speech Enhancement with EMD and Hurst-Based Mode Selection. IEEE/ACM Trans. Audio Speech Lang. Process. 2014, 22, 899–911.
Dranka, E.; Coelho, R.F. Robust Maximum Likelihood Acoustic Energy Based Source Localization in Correlated Noisy Sensing Environments. J. Sel. Top. Signal Process. 2015, 9, 259–267.
Valero, X.; Alías, F. Gammatone Wavelet features for sound classification in surveillance applications. In Proceedings of the 20th European Signal Processing Conference (EUSIPCO), Bucharest, Romania, 27–31 August 2012; pp. 1658–1662.
Chungsoo Lim Mokpo, Yeon-Woo Lee, and Joon-Hyuk Chang, “New Techniques for Improving the practicality of a SVM-Based Speech/Music Classifier,” IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1657-1660, 2012.
Theodorou, T., I. Mporas and N. Fakotakis, 2012. Automatic sound classification of radio broadcast news. Int. J. Signal Process. Image Process. Patt. Recogn., 5: 37-48.
Md. Al Mehedi Hasan and Shamim Ahmad. predSucc-Site: Lysine Succinylation Sites Prediction in Proteins by using Support Vector Machine and Resolving Data Imbalance Issue. International Journal of Computer Applications 182(15):8-13, September 2018.
Poonam Sharma and Anjali Garg. Feature Extraction and Recognition of Hindi Spoken Words using Neural Networks. International Journal of Computer Applications 142(7):12-17, May 2016.
Hend Ab. ELLaban, A A Ewees and Elsaeed E AbdElrazek. A Real-Time System for Facial Expression Recognition using Support Vector Machines and k-Nearest Neighbor Classifier. International Journal of Computer Applications 159(8):23-29, February 2017.
Serwach, M., & Stasiak, B. GA-based parameterization and feature selection for automatic music genre recognition. In Proceedings of 2016 17th International Conference Computational Problems of Electrical Engineering, CPEE 2016.
