Zhang, Q; Cao, YG; Yu, H. 2011. Parsing citations in biomedical articles using conditional random fields. COMPUTERS IN BIOLOGY AND MEDICINE 41 (4): 190-194

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Fri Jun 10 18:43:39 EDT 2011


Zhang, Q; Cao, YG; Yu, H. 2011. Parsing citations in biomedical articles using 
conditional random fields. COMPUTERS IN BIOLOGY AND MEDICINE 41 (4): 190-
194.

Author Full Name(s): Zhang, Qing; Cao, Yong-Gang; Yu, Hong
Language: English
Document Type: Article

Author Keywords: Natural language processing; Information extraction; Citation 
parsing; Citation indexing; Conditional random fields; Machine learning; 
Biomedical text mining
KeyWords Plus: INFORMATION

Abstract: Citations are used ubiquitously in biomedical full-text articles and play 
an important role for representing both the rhetorical structure and the 
semantic content of the articles. As a result, text mining systems will 
significantly benefit from a tool that automatically extracts the content of a 
citation. In this study, we applied the supervised machine-learning algorithms 
Conditional Random Fields (CRFs) to automatically parse a citation into its fields 
(e.g., Author, Title, Journal, and Year). With a subset of html format open-
access PubMed Central articles, we report an overall 97.95% F1-score. The 
citation parser can be accessed at: 
http://www.cs.uwm.edu/qing/projects/cithit/index.html. (C) 2011 Elsevier Ltd. 
All rights reserved.

Addresses: [Zhang, Qing; Cao, Yong-Gang; Yu, Hong] Univ Wisconsin, 
Milwaukee, WI 53211 USA
Reprint Address: Yu, H, Univ Wisconsin, Milwaukee, WI 53211 USA.

E-mail Address: qing at uwm.edu; yonggang at uwm.edu; hongyu at uwm.edu
ISSN: 0010-4825
DOI: 10.1016/j.compbiomed.2011.02.005
URL (not open access): 
http://www.computersinbiologyandmedicine.com/article/S0010-4825(11)00029-
1/abstract



More information about the SIGMETRICS mailing list