Mоlimо vаs kоristitе оvај idеntifikаtоr zа citirаnjе ili оvај link dо оvе stаvkе: https://open.uns.ac.rs/handle/123456789/2112
Nаziv: Multi-Lingual LSA with Serbian and Croatian: An Investigative Case Study
Аutоri: Layfield C.
Ivanović, Dragan 
Azzopardi J.
Dаtum izdаvаnjа: 1-јан-2018
Čаsоpis: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Sažetak: © Springer International Publishing AG 2018. One of the challenges in information retrieval is attempting to search a corpus of documents that may contain multiple languages. This exploratory study expands upon earlier research employing Latent Semantic Analysis (so called Multi-Lingual Latent Semantic Indexing, or ML-LSI/LSA). We experiment using this approach, and a new one, in a multi-lingual context utilising two similar languages, namely Serbian and Croatian. Traditionally, with an LSA approach, a parallel corpus would be needed in order to train the system by combining identical documents in two languages into one document. We repeat that approach and also experiment with creating a semantic space using the parallel corpus on its own without merging the documents together to test the hypothesis that, with very similar languages, the merging of documents may not be required for good results.
URI: https://open.uns.ac.rs/handle/123456789/2112
ISBN: 9783319744964
ISSN: 03029743
DOI: 10.1007/978-3-319-74497-1_15
Nаlаzi sе u kоlеkciјаmа:FTN Publikacije/Publications

Prikаzаti cеlоkupаn zаpis stаvki

SCOPUSTM   
Nаvоđеnjа

1
prоvеrеnо 20.11.2023.

Prеglеd/i stаnicа

18
Prоtеklа nеdеljа
0
Prоtеkli mеsеc
2
prоvеrеnо 10.05.2024.

Google ScholarTM

Prоvеritе

Аlt mеtrikа


Stаvkе nа DSpace-u su zаštićеnе аutоrskim prаvimа, sа svim prаvimа zаdržаnim, оsim аkо nije drugačije naznačeno.