Please use this identifier to cite or link to this item:
https://open.uns.ac.rs/handle/123456789/8231
Title: | Temporal discrete cosine transform for speech emotion recognition | Authors: | Popović, Branislav Stanković, Igor Ostrogonac, Stevan |
Issue Date: | 1-Jan-2013 | Journal: | 4th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2013 - Proceedings | Abstract: | Temporal Discrete Cosine Transform (TDCT) features have shown good performance in the speaker verification task, and in this paper we utilize them in speech emotion recognition. Tests were conducted on a Serbian emotional speech database, using Neural Networks (NN) as a classifier and Mel-Frequency Cepstral Coefficients (MFFC) as a reference feature set. Even though MFCC is one of the most employed techniques in emotion recognition, our results show that the TDCT features outperform MFCCs (with the first and second derivation) with any number of hidden nodes in the network, hence proving as an excellent starting feature set for recognizing emotions in South Slavic languages. © 2013 IEEE. | URI: | https://open.uns.ac.rs/handle/123456789/8231 | ISBN: | 9781479915439 | DOI: | 10.1109/CogInfoCom.2013.6719219 |
Appears in Collections: | FTN Publikacije/Publications |
Show full item record
SCOPUSTM
Citations
3
checked on May 10, 2024
Page view(s)
16
Last Week
9
9
Last month
0
0
checked on May 3, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.