Automated Detection of Emotional States from Speech using Hybrid Deep Learning Model

Balaji Venkateswaran; Jyotirmay Mishra; Amit Kumar Ahuja; Rahul Kumar Jain; Sanjeev Kumar; Arshad Rafiq Khan

Authors

Balaji Venkateswaran Research scholar, Department of Computer Science & Engineering, Shri Venkateshwara University, Gajraula, UP, INDIA
Jyotirmay Mishra Research scholar, Department of Computer Science & Engineering, Shri Venkateshwara University, Gajraula, UP, India
Amit Kumar Ahuja Department of Electronics and Communication Engineering, JSS Academy of Technical Education, Sectors 62, Gautam Buddha Nagar, Noida, UP, India
Rahul Kumar Jain Project Lead, Nagarro, Gurgaon, Haryana,India
Sanjeev Kumar Assistant Professor, Department of Computer Science, Maharaja Agrasen Institute of Technology, Rohini Sector -22, New Delhi (India)
Arshad Rafiq Khan Research scholar, Department of Computer Science and Engineering, Shri Venkateshwara University, Gajraula, UP, India

Keywords:

Deep Learning, LSTM, CNN, SVM, RAVDESS

Abstract

Detection of Emotion State (ES) is an essential yet challenging field with applications across various domains such as psychology, speech therapy, and customer service. In this paper, we present a novel approach to SER using hybrid deep learning techniques, specifically focusing on recurrent neural networks. The proposed model is trained on carefully labeled datasets that include diverse speech samples representing different emotional states. By analyzing critical audio features like pitch, rhythm, and prosody, the system aims to improve emotion detection accuracy for unseen speech data. This work seeks to advance SER by enhancing both precision and reliability, while also providing deeper insights into the complex connection between emotions and speech patterns. Our approach utilizes Long Short-Term Memory (LSTM) neural networks, which are adept at capturing temporal dependencies crucial for recognizing emotions in speech. The LSTM model is rigorously trained on a comprehensive dataset covering a wide range of emotional states, and its performance is evaluated through extensive experimentation. The results demonstrate that our method outperforms conventional techniques, underscoring the effectiveness of LSTM in speech emotion tasks. This research contributes significantly to the development of emotion recognition technology, with promising applications in human-computer interaction, mental health monitoring, and sentiment analysis.

Automated Detection of Emotional States from Speech using Hybrid Deep Learning Model

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Most read articles by the same author(s)

Make a Submission

Information

Announcements

Call for Papers

indexing

scimogo

important links

Keywords