Please use this identifier to cite or link to this item: https://open.uns.ac.rs/handle/123456789/15438
DC FieldValueLanguage
dc.contributor.authorKovačević, Aleksandaren
dc.contributor.authorKonjović Z.en
dc.contributor.authorMilosavljević, Brankoen
dc.contributor.authorNenad, G.en
dc.date.accessioned2020-03-03T14:59:56Z-
dc.date.available2020-03-03T14:59:56Z-
dc.date.issued2012-04-01en
dc.identifier.issn08852308en
dc.identifier.urihttps://open.uns.ac.rs/handle/123456789/15438-
dc.description.abstractThe task of reviewing scientific publications and keeping up with the literature in a particular domain is extremely time-consuming. Extraction and exploration of methodological information, in particular, requires systematic understanding of the literature, but in many cases is performed within a limited context of publications that can be manually reviewed by an individual or group. Automated methodology identification could provide an opportunity for systematic retrieval of relevant documents and for exploring developments within a given discipline. In this paper we present a system for the identification of methodology mentions in scientific publications in the area of natural language processing, and in particular in automatic terminology recognition. The system comprises two major layers: the first layer is an automatic identification of methodological sentences; the second layer highlights methodological phrases (segments). Each mention is categorised in four semantic categories: Task, Method, Resource/Feature and Implementation. Extraction and classification of the segments is formalised as a sequence tagging problem and four separate phrase-based Conditional Random Fields are used to accomplish the task. The system has been evaluated on a manually annotated corpus comprising 45 full text articles. The results for the segment level annotation show an F-measure of 53% for identification of Task and Method mentions (with 70% precision), whereas the F-measures for Resource/Feature and Implementation identification were 61% (with 67% precision) and 75% (with 86% precision) respectively. At the document-level, an F-measure of 72% (with 81% precision) for Task mentions, 60% (with 81% precision) for Method mentions, 74% (with 78% precision) for the Resource/Feature and 79% (with 81% precision) for the Implementation categories have been achieved. We provide a detailed analysis of errors and explore the impact that the particular groups of features have on the extraction of methodological segments. © 2011 Elsevier Ltd. All rights reserved.en
dc.relation.ispartofComputer Speech and Languageen
dc.titleMining methodologies from NLP publications: A case study in automatic terminology recognitionen
dc.typeJournal/Magazine Articleen
dc.identifier.doi10.1016/j.csl.2011.09.001en
dc.identifier.scopus2-s2.0-80055032042en
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/80055032042en
dc.relation.lastpage126en
dc.relation.firstpage105en
dc.relation.issue2en
dc.relation.volume26en
item.grantfulltextnone-
item.fulltextNo Fulltext-
crisitem.author.deptDepartman za računarstvo i automatiku-
crisitem.author.deptDepartman za računarstvo i automatiku-
crisitem.author.deptDepartman za hemiju, biohemiju i zaštitu životne sredine-
crisitem.author.orcid0000-0003-4551-9802-
crisitem.author.parentorgFakultet tehničkih nauka-
crisitem.author.parentorgFakultet tehničkih nauka-
crisitem.author.parentorgPrirodno-matematički fakultet-
Appears in Collections:PMF Publikacije/Publications
Show simple item record

SCOPUSTM   
Citations

16
checked on Nov 20, 2023

Page view(s)

64
Last Week
4
Last month
4
checked on Mar 15, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.