Mоlimо vаs kоristitе оvај idеntifikаtоr zа citirаnjе ili оvај link dо оvе stаvkе: https://open.uns.ac.rs/handle/123456789/12382
Nаziv: Document representations for classification of short Web-page descriptions
Аutоri: Radovanović M.
Ivanović, Mirjana 
Dаtum izdаvаnjа: 1-јан-2006
Čаsоpis: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Sažetak: Motivated by applying Text Categorization to sorting Web search results, this paper describes an extensive experimental study of the impact of bag-of-words document representations on the performance of five major classifiers - Naïve Bayes, SVM, Voted Perceptron, kNN and C4.5. The texts represent short Web-page descriptions from the dmoz Open Directory Web-page ontology. Different transformations of input data: stemming, normalization, logtf and idf, together with dimensionality reduction, are found to have a statistically significant improving or degrading effect on classification performance measured by classical metrics - accuracy, precision, recall, F <inf>1</inf> and F<inf>2</inf>. The emphasis of the study is not on determining the best document representation which corresponds to each classifier, but rather on describing the effects of every individual transformation on classification, together with their mutual relationships. © Springer-Verlag Berlin Heidelberg 2006.
URI: https://open.uns.ac.rs/handle/123456789/12382
ISBN: 3540377360
ISSN: 03029743
Nаlаzi sе u kоlеkciјаmа:PMF Publikacije/Publications

Prikаzаti cеlоkupаn zаpis stаvki

Prеglеd/i stаnicа

21
Prоtеklа nеdеljа
7
Prоtеkli mеsеc
0
prоvеrеnо 10.05.2024.

Google ScholarTM

Prоvеritе

Аlt mеtrikа


Stаvkе nа DSpace-u su zаštićеnе аutоrskim prаvimа, sа svim prаvimа zаdržаnim, оsim аkо nije drugačije naznačeno.