论文标题
AHD Convnet用于语音情感分类
AHD ConvNet for Speech Emotion Classification
论文作者
论文摘要
人工智能领域的成就用于计算和制造智能机器的发展,以促进人类和改善用户体验。情绪对人来说是基本的,影响了对应,学习和方向等思维和普通练习。语音情感识别是在这方面感兴趣的领域,在这项工作中,我们提出了一种新颖的MEL频谱学习方法,其中我们的模型使用数据点从普遍的Crema-d数据集中从给定的WAV表格语音注释中学习情感。我们的模型使用对数MEL光谱图作为特征,其中MELS = 64。与解决情感语音识别问题的其他方法相比,训练时间较少。
Accomplishments in the field of artificial intelligence are utilized in the advancement of computing and making of intelligent machines for facilitating mankind and improving user experience. Emotions are rudimentary for people, affecting thinking and ordinary exercises like correspondence, learning and direction. Speech emotion recognition is domain of interest in this regard and in this work, we propose a novel mel spectrogram learning approach in which our model uses the datapoints to learn emotions from the given wav form voice notes in the popular CREMA-D dataset. Our model uses log mel-spectrogram as feature with number of mels = 64. It took less training time compared to other approaches used to address the problem of emotion speech recognition.