Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/12382
DC Field | Value | Language
dc.contributor.author | Radovanović M. | en
dc.contributor.author | Ivanović, Mirjana | en
dc.date.accessioned | 2020-03-03T14:48:17Z | -
dc.date.available | 2020-03-03T14:48:17Z | -
dc.date.issued | 2006-01-01 | en
dc.identifier.isbn | 3540377360 | en
dc.identifier.issn | 03029743 | en
dc.identifier.uri | https://open.uns.ac.rs/handle/123456789/12382 | -
dc.description.abstract | Motivated by applying Text Categorization to sorting Web search results, this paper describes an extensive experimental study of the impact of bag-of-words document representations on the performance of five major classifiers - Naïve Bayes, SVM, Voted Perceptron, kNN and C4.5. The texts represent short Web-page descriptions from the dmoz Open Directory Web-page ontology. Different transformations of input data: stemming, normalization, logtf and idf, together with dimensionality reduction, are found to have a statistically significant improving or degrading effect on classification performance measured by classical metrics - accuracy, precision, recall, F1 and F2. The emphasis of the study is not on determining the best document representation which corresponds to each classifier, but rather on describing the effects of every individual transformation on classification, together with their mutual relationships. © Springer-Verlag Berlin Heidelberg 2006. | en
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en
dc.title | Document representations for classification of short Web-page descriptions | en
dc.type | Conference Paper | en
dc.identifier.scopus | 2-s2.0-33749415472 | en
dc.identifier.url | https://api.elsevier.com/content/abstract/scopus_id/33749415472 | en
dc.relation.lastpage | 553 | en
dc.relation.firstpage | 544 | en
dc.relation.volume | 4081 LNCS | en
item.fulltext | No Fulltext | -
item.grantfulltext | none | -
crisitem.author.dept | Prirodno-matematički fakultet, Departman za matematiku i informatiku | -
crisitem.author.orcid | 0000-0003-1946-0384 | -
crisitem.author.parentorg | Prirodno-matematički fakultet | -
Appears in Collections: PMF Publikacije/Publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.