Please use this identifier to cite or link to this item:
https://open.uns.ac.rs/handle/123456789/5164
Title: | A phonetic segmentation procedure based on hidden markov models | Authors: | Pakoci, Edvin Popović, Boris Jakovljević, Nikša Pekar, Darko Yassa F. |
Issue Date: | 1-Jan-2016 | Journal: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | Abstract: | © Springer International Publishing Switzerland 2016. In this paper, a novel variant of an automatic phonetic segmentation procedure is presented, especially useful if data is scarce. The procedure uses the Kaldi speech recognition toolkit as its basis, and combines and modifies several existing methods and Kaldi recipes. Both the specifics of model training and test data alignment are explained in detail. Effectiveness of artificial extension of the starting amount of manually labeled material during training is examined as well. Experimental results show the admirable overall correctness of the proposed procedure in the given test environment. Several variants of the procedure are compared, and the usage of speaker-adapted context-dependent triphone models trained without the expanded manually checked data is proven to produce the best results. A few ways to improve the procedure even more, as well as future work, are also discussed. | URI: | https://open.uns.ac.rs/handle/123456789/5164 | ISBN: | 9783319439570 | ISSN: | 3029743 | DOI: | 10.1007/978-3-319-43958-7_7 |
Appears in Collections: | FTN Publikacije/Publications |
Show full item record
SCOPUSTM
Citations
3
checked on May 10, 2024
Page view(s)
20
Last Week
7
7
Last month
0
0
checked on May 10, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.