Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/8231
Title: Temporal discrete cosine transform for speech emotion recognition
Authors: Popović, Branislav 
Stanković, Igor
Ostrogonac, Stevan
Issue Date: 1-Jan-2013
Journal: 4th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2013 - Proceedings
Abstract: Temporal Discrete Cosine Transform (TDCT) features have shown good performance in the speaker verification task, and in this paper we utilize them in speech emotion recognition. Tests were conducted on a Serbian emotional speech database, using Neural Networks (NN) as a classifier and Mel-Frequency Cepstral Coefficients (MFFC) as a reference feature set. Even though MFCC is one of the most employed techniques in emotion recognition, our results show that the TDCT features outperform MFCCs (with the first and second derivation) with any number of hidden nodes in the network, hence proving as an excellent starting feature set for recognizing emotions in South Slavic languages. © 2013 IEEE.
URI: https://open.uns.ac.rs/handle/123456789/8231
ISBN: 9781479915439
DOI: 10.1109/CogInfoCom.2013.6719219
Appears in Collections:FTN Publikacije/Publications

Show full item record

SCOPUSTM   
Citations

3
checked on May 10, 2024

Page view(s)

16
Last Week
9
Last month
0
checked on May 3, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.