Zhang, Q; Cao, YG; Yu, H. 2011. Parsing citations in biomedical articles using conditional random fields. COMPUTERS IN BIOLOGY AND MEDICINE 41 (4): 190-194
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Fri Jun 10 18:43:39 EDT 2011
Zhang, Q; Cao, YG; Yu, H. 2011. Parsing citations in biomedical articles using
conditional random fields. COMPUTERS IN BIOLOGY AND MEDICINE 41 (4): 190-
194.
Author Full Name(s): Zhang, Qing; Cao, Yong-Gang; Yu, Hong
Language: English
Document Type: Article
Author Keywords: Natural language processing; Information extraction; Citation
parsing; Citation indexing; Conditional random fields; Machine learning;
Biomedical text mining
KeyWords Plus: INFORMATION
Abstract: Citations are used ubiquitously in biomedical full-text articles and play
an important role for representing both the rhetorical structure and the
semantic content of the articles. As a result, text mining systems will
significantly benefit from a tool that automatically extracts the content of a
citation. In this study, we applied the supervised machine-learning algorithms
Conditional Random Fields (CRFs) to automatically parse a citation into its fields
(e.g., Author, Title, Journal, and Year). With a subset of html format open-
access PubMed Central articles, we report an overall 97.95% F1-score. The
citation parser can be accessed at:
http://www.cs.uwm.edu/qing/projects/cithit/index.html. (C) 2011 Elsevier Ltd.
All rights reserved.
Addresses: [Zhang, Qing; Cao, Yong-Gang; Yu, Hong] Univ Wisconsin,
Milwaukee, WI 53211 USA
Reprint Address: Yu, H, Univ Wisconsin, Milwaukee, WI 53211 USA.
E-mail Address: qing at uwm.edu; yonggang at uwm.edu; hongyu at uwm.edu
ISSN: 0010-4825
DOI: 10.1016/j.compbiomed.2011.02.005
URL (not open access):
http://www.computersinbiologyandmedicine.com/article/S0010-4825(11)00029-
1/abstract
More information about the SIGMETRICS
mailing list