A REVIEW ON SPEECH EMOTION FEATURES

Authors

  • Noor Aina Zaidan, Department of Computer Science, Faculty of Computing, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia
  • Md Sah Hj. Salam, Department of Computer Science, Faculty of Computing, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia

DOI:

https://doi.org/10.11113/jt.v75.4988

Keywords:

Emotion, features, prosodic, wavelet, spectral, hybrid

Abstract

Research on incorporating emotion into intelligent machines continues to expand. Human speech conveys a variety of emotional states, and finding reliable speech features for recognizing them remains an open research problem. Which specific features of the speech signal carry emotional information is still uncertain; identifying them is extremely challenging and continues to be explored. Reported emotion recognition rates are inconsistent because they depend on the features used in the experiment and on the database itself. Prosodic, spectral and wavelet features are the most commonly used, and studies compare them to determine which of these features, or which hybrid of them, carries more information about emotion. This paper summarizes and reviews previous work on single and hybrid features based on prosodic, spectral and wavelet features.

Published

2015-07-13

Section

Science and Engineering

How to Cite

A REVIEW ON SPEECH EMOTION FEATURES. (2015). Jurnal Teknologi (Sciences & Engineering), 75(2). https://doi.org/10.11113/jt.v75.4988