Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/11482
DC FieldValueLanguage
dc.contributor.authorOstrogonac S.en
dc.contributor.authorMišković, Dragišaen
dc.contributor.authorSečujski, Milanen
dc.contributor.authorPekar, Darkoen
dc.contributor.authorDelić, Vladoen
dc.date.accessioned2020-03-03T14:44:33Z-
dc.date.available2020-03-03T14:44:33Z-
dc.date.issued2012-12-12en
dc.identifier.isbn9781467347518en
dc.identifier.urihttps://open.uns.ac.rs/handle/123456789/11482-
dc.description.abstractThis paper proposes a method of creating language models for highly inflective non-agglutinative languages. Three types of language models were considered - a common n-gram model, an n-gram model of lemmas and a class n-gram model. The last two types were specially designed for the Serbian language reflecting its unique grammar structure. All the language models were trained on a carefully collected data set incorporating several literary styles and a great variety of domain-specific textual documents in Serbian. Language models of the three types were created for different sets of textual corpora and evaluated by perplexity values they have given on the test data. A log-linear combination of the common, lemma-based and class n-gram models that was also created shows promising results in overcoming the data sparsity problem. However, the evaluation of this combined model in the context of a large vocabulary continuous speech recognition system (LVCSR) is yet to be done in order to establish the improvement in terms of word error rate (WER). © 2012 IEEE.en
dc.relation.ispartof2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics, SISY 2012en
dc.titleA language model for highly inflective non-agglutinative languagesen
dc.typeConference Paperen
dc.identifier.doi10.1109/SISY.2012.6339510en
dc.identifier.scopus2-s2.0-84870657018en
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/84870657018en
dc.relation.lastpage181en
dc.relation.firstpage177en
item.fulltextNo Fulltext-
item.grantfulltextnone-
crisitem.author.deptFakultet tehničkih nauka, Departman za energetiku, elektroniku i telekomunikacije-
crisitem.author.deptFakultet tehničkih nauka, Departman za energetiku, elektroniku i telekomunikacije-
crisitem.author.parentorgFakultet tehničkih nauka-
crisitem.author.parentorgFakultet tehničkih nauka-
Appears in Collections:FTN Publikacije/Publications
Show simple item record

SCOPUSTM   
Citations

5
checked on Sep 9, 2023

Page view(s)

37
Last Week
4
Last month
12
checked on May 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.