Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/3624
DC FieldValueLanguage
dc.contributor.authorPakoci, Edvinen
dc.contributor.authorPopović, Borisen
dc.contributor.authorPekar, Darkoen
dc.date.accessioned2019-09-23T10:28:58Z-
dc.date.available2019-09-23T10:28:58Z-
dc.date.issued2017-01-01en
dc.identifier.isbn9783319664286en
dc.identifier.issn3029743en
dc.identifier.urihttps://open.uns.ac.rs/handle/123456789/3624-
dc.description.abstract© Springer International Publishing AG 2017. This paper presents the results obtained using several variants of trigram language models in a large vocabulary continuous speech recognition (LVCSR) system for the Serbian language, based on the deep neural network (DNN) framework implemented within the Kaldi speech recognition toolkit. This training approach allows parallelization using several threads on either multiple GPUs or multiple CPUs, and provides a natural-gradient modification to the stochastic gradient descent (SGD) optimization method. Acoustic models are trained over a fixed number of training epochs with parameter averaging in the end. This paper discusses recognition using different language models trained with Kneser-Ney or Good-Turing smoothing methods, as well as several pruning parameter values. The results on a test set containing more than 120000 words and different utterance types are explored and compared to the referent results with GMM-HMM speaker-adapted models for the same speech database. Online and offline recognition results are compared to each other as well. Finally, the effect of additional discriminative training using a language model prior to the DNN stage is explored.en
dc.relation.ispartofLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en
dc.titleLanguage model optimization for a deep neural network based speech recognition system for Serbianen
dc.typeConference Paperen
dc.identifier.doi10.1007/978-3-319-66429-3_48en
dc.identifier.scopus2-s2.0-85029497238en
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85029497238en
dc.relation.lastpage492en
dc.relation.firstpage483en
dc.relation.volume10458 LNAIen
item.grantfulltextnone-
item.fulltextNo Fulltext-
Appears in Collections:FTN Publikacije/Publications
Show simple item record

SCOPUSTM   
Citations

8
checked on May 10, 2024

Page view(s)

20
Last Week
7
Last month
0
checked on May 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.