Now posted with cited references and e-mail address - Bernstam EV, Herskovic JR, Aphinyanaphongs Y, Aliferis CF, Sriram MG, Hersh WR "Using citation data to improve retrieval from MEDLINE"
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Tue Mar 14 14:55:23 EST 2006
Elmer V. Bernstam : E-mail Addresses: elmer.v.bernstam at uth.tmc.edu
Title: Using citation data to improve retrieval from MEDLINE
Author(s): Bernstam EV, Herskovic JR, Aphinyanaphongs Y, Aliferis CF,
Sriram MG, Hersh WR
Source: JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 13 (1):
96-105 JAN-FEB 2006
Document Type: Article
Language: English
Cited References: 34
Times Cited: 0
Abstract:
Objective: To determine whether algorithms developed for the World Wide Web
can be applied to the biomedical literature in order to identify articles
that are important as well as relevant.
Design and Measurements: A direct comparison of eight algorithms: simple
PubMed queries, clinical queries (sensitive and specific versions), vector
cosine comparison, citation count, journal impact factor, PageRank, and
machine learning based on polynomial support vector machines. The objective
was to prioritize important articles, defined as being included in a pre-
existing bibliography of important literature in surgical oncology.
Results: Citation-based algorithms were more effective than noncitation-
based algorithms at identifying important articles. The most effective
strategies were simple citation count and PageRank, which on average
identified over six important articles in the first 100 results compared to
0.85 for the best noncitation-based algorithm (p < 0.001). The authors saw
similar differences between citation-based and noncitation-based algorithms
at 10, 20, 50, 200, 500, and 1,000 results (p < 0.001). Citation lag
affects performance of PageRank more than simple citation count. However,
in spite of citation lag, citation-based algorithms remain more effective
than noncitation-based algorithms.
Conclusion: Algorithms that have proved successful on the World Wide Web
can be applied to biomedical information retrieval. Citation-based
algorithms can help identify important articles within large sets of
relevant results. Further studies are needed to determine whether citation-
based algorithms can effectively meet actual user information needs.
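
As an illustration of the two citation-based strategies the abstract singles out (simple citation count and PageRank), here is a minimal sketch on a toy citation graph. It is not the authors' implementation: the function names, the five-article graph, the set of "important" articles, and the damping factor of 0.85 are all assumptions made for this example. The last function counts how many important articles land in the first k results, the kind of measure the Results paragraph reports.

```python
from collections import defaultdict


def citation_counts(citations):
    """Count incoming citations per article.

    `citations` maps each citing article to the list of articles it cites.
    """
    counts = defaultdict(int)
    for citing, cited_list in citations.items():
        counts[citing] += 0  # make sure uncited articles still appear
        for cited in cited_list:
            counts[cited] += 1
    return dict(counts)


def pagerank(citations, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over the citation graph.

    The damping value is a conventional choice, assumed for this sketch.
    """
    nodes = set(citations)
    for cited_list in citations.values():
        nodes.update(cited_list)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            out = citations.get(node, [])
            if out:
                share = damping * rank[node] / len(out)
                for cited in out:
                    new_rank[cited] += share
            else:
                # An article that cites nothing spreads its rank evenly.
                share = damping * rank[node] / n
                for other in nodes:
                    new_rank[other] += share
        rank = new_rank
    return rank


def important_in_top_k(scores, important, k):
    """Number of `important` articles among the k highest-scoring ones."""
    ranked = sorted(scores, key=lambda a: (-scores[a], a))
    return sum(1 for a in ranked[:k] if a in important)


# Hypothetical toy data: article -> articles it cites.
toy_citations = {"A": ["C"], "B": ["C"], "C": ["D"], "D": [], "E": ["C", "D"]}
toy_important = {"C", "D"}

print(important_in_top_k(citation_counts(toy_citations), toy_important, 2))
print(important_in_top_k(pagerank(toy_citations), toy_important, 2))
```

On a real collection the graph would come from citation data such as the Science Citation Index, and newly published articles would have few incoming citations, which is the citation lag the abstract mentions.
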
Addresses: Bernstam EV (reprint author), Univ Texas, Hlth Sci Ctr, Sch Hlth
Informat Sci, 7000 Fannin St, Suite 600, Houston, TX 77030 USA
Univ Texas, Hlth Sci Ctr, Sch Hlth Informat Sci, Houston, TX 77030 USA
Vanderbilt Univ, Dept Biomed Informat, Nashville, TN USA
Oregon Hlth & Sci Univ, Dept Med Informat & Clin Epidemiol, Portland, OR USA
E-mail Addresses: elmer.v.bernstam at uth.tmc.edu
Publisher: HANLEY & BELFUS INC, 210 S 13TH ST, PHILADELPHIA, PA 19107 USA
IDS Number: 004AI
ISSN: 1067-5027
Cited References:
COMPUTATION RELATED : 2003
*MI HLTH SCI LIB A
RES COMM REP U MICH : 1992
*NLM
NLM RES LISTS BIBL : 2003
*NLM
PUBM CLIN QUER TABL
*THOMS ISI
J CITATION REPORTS : 2003
*THOMS ISI
SCI CIT IND EXP 2004 : 2004
APHINYANAPHONGS Y
Text categorization models for high-quality article retrieval in internal
medicine
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 12 : 207 2005
APHINYANAPHONGS Y
MED SAN FRANC CA : 2004
BACHMANN LM
Identifying diagnostic studies in MEDLINE: Reducing the number needed to
read
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 9 : 653 2002
BAEZAYATES R
MODERN INFORM RETRIE : 1999
BORODIN A
ACM T INTERNET TECHN 5 : 231 2005
BRIN S
WWW7 COMPUTER NETWOR 30 : 107 1998
GARFIELD E
Journal impact factor: a brief review
CANADIAN MEDICAL ASSOCIATION JOURNAL 161 : 979 1999
GARFIELD E
CITATION INDEXING AU 1 : 1977
HAYNES RB
ACP J CLUB 142 : A8 2005
HAYNES RB
Developing optimal search strategies for detecting clinically sound studies
in MEDLINE
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 1 : 447 1994
HERSH WR
How well do physicians use electronic information retrieval systems? A
framework for investigation and systematic review
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION 280 : 1347 1998
HERSH WR
MED INFORMATICS COMP : 539 2000
HERSH WR
SIGIR 94 DUBL IR : 1994
HERSKOVIC JR
AM MED INF ASS FALL : 2005
JOACHIMS T
SIGIR WORKSH MATH FO : 2002
KING DN
The contribution of hospital library information-services to clinical care:
a study in 8 hospitals
BULLETIN OF THE MEDICAL LIBRARY ASSOCIATION 75 : 291 1987
KLEINBERG J
9 ANN ACM SIAM S DIS : 1998
LANCASTER F
INFORM RETRIEVAL TOD : 1993
MANNING CD
FDN STAT NATURAL LAN : 1999
MARSHALL JG
IMPACT INFORM PROVID : 1991
OPTHOF T
Sense and nonsense about the impact factor
CARDIOVASCULAR RESEARCH 33 : 1 1997
PAGE L
PAGERANK CITATION RA : 1998
STEGMANN J
How to evaluate journal impact factors
NATURE 390 : 550 1997
WILKINSON R
INFORMATION RETRIEVA : 257 1996
WILLIAMS H
ZETTAIR SEARCH ENGIN : 2004
WILSON SR
USE CRITICAL INCIDEN : 1989
ZHU M
JOINT STAT M BIOPH S : 2003
ZIPSER J
MEDLINE PUBMED BEYON