Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/7560
Title: Tree-based phone duration modelling of the Serbian language
Authors: Sovilj-Nikic S.
Delić, Vlado 
Sovilj-Nikic I.
Marković, Marko
Issue Date: 1-Jan-2014
Journal: Elektronika ir Elektrotechnika
Abstract: Considering the importance of segmental duration from a perceptive point of view, the possibility of automatic prediction of natural duration of phones is essential for achieving the naturalness of synthesized speech. In this paper phone duration prediction model for the Serbian language using tree-based machine learning approach is presented. A large speech corpus and a feature set of 21 parameters describing phones and their contexts were used for segmental duration prediction. Phone duration modelling is based on attributes such as the current segment identity, preceding and following segment types, manner of articulation (for consonants) and voicing of neighbouring phones, lexical stress, part-of-speech, word length, the position of the segment in the syllable, the position of the syllable in a word, the position of a word in a phrase, phrase break level, etc. These features have been extracted from the large speech database for the Serbian language. The results obtained for the full phoneme set using regression tree, RMSE (root-mean-squared-error) 14.8914 ms, MAE (mean absolute error) 11.1947 ms and correlation coefficient 0.8796 are comparable with those reported in the literature for Czech, Greek, Lithuanian, Korean, Indian languages Hindi and Telugu, Turkish.
URI: https://open.uns.ac.rs/handle/123456789/7560
ISSN: 13921215
DOI: 10.5755/j01.eee.20.3.4090
Appears in Collections:FTN Publikacije/Publications

Show full item record

SCOPUSTM   
Citations

6
checked on May 3, 2024

Page view(s)

32
Last Week
9
Last month
0
checked on May 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.