Leydesdorff, L On the normalization and visualization of author co-citation data: Salton's cosine versus the Jaccard index JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 59 (1): 77-85 JAN 1 2008

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Tue Apr 15 11:13:01 EDT 2008


E-mail Address: loet at leydesdorff.net 

Author(s): Leydesdorff, L (Leydesdorff, Loet) 

Title: On the normalization and visualization of author co-citation data: 
Salton's cosine versus the Jaccard index 

Source: JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND 
TECHNOLOGY, 59 (1): 77-85 JAN 1 2008 

Language: English 

Document Type: Article 

Keywords Plus: SIMILARITY MEASURES; PEARSONS-R; COLLABORATION; SCIENCE
 
Abstract: The debate about which similarity measure one should use for the 
normalization in the case of Author Co-citation Analysis (ACA) is further 
complicated when one distinguishes between the symmetrical co-citation (or, 
more generally, co-occurrence) matrix and the underlying asymmetrical 
citation (occurrence) matrix. In the Web environment, the approach of 
retrieving original citation data is often not feasible. In that case, one 
should use the Jaccard index, but preferentially after adding the number 
of total citations (i.e., occurrences) on the main diagonal. Unlike 
Salton's cosine and the Pearson correlation, the Jaccard index abstracts 
from the shape of the distributions and focuses only on the intersection 
and the sum of the two sets. Since the correlations in the co-occurrence 
matrix may be spurious, this property of the Jaccard index can be 
considered as an advantage in this case. 
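
The contrast drawn in the abstract can be illustrated with a small sketch. 
It is not taken from the article: the toy data, the function names, and the 
use of the standard set definition of the Jaccard index (intersection 
divided by union, with each author's total occurrences read from the main 
diagonal) are assumptions made for illustration only. The cosine is computed 
from the asymmetrical occurrence matrix, which is the data that the abstract 
says is often not retrievable in the Web environment.

```python
from math import sqrt

# Hypothetical toy data: rows are citing documents, columns are cited authors.
# A 1 means the document cites the author (asymmetrical occurrence matrix).
occ = [
    [1, 1, 0],
    [1, 1, 0],
    [1, 0, 1],
    [0, 1, 1],
    [1, 0, 0],
]


def cooccurrence(occurrence):
    """Symmetrical co-occurrence matrix from a binary occurrence matrix.

    The main diagonal holds each author's total number of occurrences.
    """
    n = len(occurrence[0])
    return [
        [sum(row[i] * row[j] for row in occurrence) for j in range(n)]
        for i in range(n)
    ]


def jaccard_from_cooccurrence(co, i, j):
    """Jaccard index using only the co-occurrence matrix and its diagonal."""
    intersection = co[i][j]
    union = co[i][i] + co[j][j] - intersection
    return intersection / union if union else 0.0


def cosine_from_occurrence(occurrence, i, j):
    """Salton's cosine between two columns of the occurrence matrix."""
    dot = sum(row[i] * row[j] for row in occurrence)
    norm_i = sqrt(sum(row[i] ** 2 for row in occurrence))
    norm_j = sqrt(sum(row[j] ** 2 for row in occurrence))
    return dot / (norm_i * norm_j) if norm_i and norm_j else 0.0


co = cooccurrence(occ)
print(co[0][0], co[1][1], co[0][1])                   # 4 3 2
print(jaccard_from_cooccurrence(co, 0, 1))            # 0.4
print(round(cosine_from_occurrence(occ, 0, 1), 3))    # 0.577
```

In this sketch the Jaccard value depends only on the two totals and the 
overlap, so it can be computed when nothing but co-occurrence counts and 
total occurrences are available.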

Addresses: Amsterdam Sch Commun Res, NL-1012 CX Amsterdam, Netherlands 

Reprint Address: Leydesdorff, L, Amsterdam Sch Commun Res, 
Kloveniersbrugwal 48, NL-1012 CX Amsterdam, Netherlands.
 
Cited Reference Count: 29 

Times Cited: 0 

Publisher: JOHN WILEY & SONS INC 

Publisher Address: 111 RIVER ST, HOBOKEN, NJ 07030 USA 

ISSN: 1532-2882 

29-char Source Abbrev.: J AM SOC INF SCI TECHNOL 

ISO Source Abbrev.: J. Am. Soc. Inf. Sci. Technol. 

Source Item Page Count: 9 

Subject Category: Computer Science, Information Systems; Information 
Science & Library Science 

ISI Document Delivery No.: 252CR 

Cited References: 

AHLGREN P
Author cocitation analysis and Pearson's r 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
55 : 843 DOI 10.1002/asi.20030 2004
 
AHLGREN P
Requirements for a cocitation similarity measure, with special reference 
to Pearson's correlation coefficient 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
54 : 550 DOI 10.1002/asi.10242 2003
 
BENSMAN SJ
Pearson's r and author cocitation analysis: A commentary on the 
controversy 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
55 : 935 DOI 10.1002/asi.20028 2004 

BORGATTI SP
UCINET WINDOWS SOFTW : 2002 

EGGHE L
INTRO INFORMETRICS A : 1990 

GLANZEL W
National characteristics in international scientific co-authorship 
relations 
SCIENTOMETRICS 51 : 69 2001 

HAMERS L
SIMILARITY MEASURES IN SCIENTOMETRIC RESEARCH - THE JACCARD INDEX VERSUS 
SALTON COSINE FORMULA 
INFORMATION PROCESSING & MANAGEMENT 25 : 315 1989 

JACCARD P
B SOC VAUD SCI NAT 37 : 241 1901 

JONES WP
PICTURES OF RELEVANCE - A GEOMETRIC ANALYSIS OF SIMILARITY MEASURES 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE 38 : 420 1987 

KAMADA T
AN ALGORITHM FOR DRAWING GENERAL UNDIRECTED GRAPHS 
INFORMATION PROCESSING LETTERS 31 : 7 1989 

KENNY DA
DYADIC DATA ANAL : 2006 

LEYDESDORFF L
INFORMETRICS 87 : 105 1988 

LEYDESDORFF L
Co-occurrence matrices and their applications in information science: 
Extending ACA to the Web environment 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
57 : 1616 DOI 10.1002/asi.20335 2006 

LEYDESDORFF L
Similarity measures, author cocitation analysis, and information theory 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
56 : 769 DOI 10.1002/asi.20130 2005 

LEYDESDORFF L
J AM SOC INFORM SCI : 2007 

LIPKUS AH
A proof of the triangle inequality for the Tanimoto distance 
JOURNAL OF MATHEMATICAL CHEMISTRY 26 : 263 1999 

LUUKKONEN T
THE MEASUREMENT OF INTERNATIONAL SCIENTIFIC COLLABORATION 
SCIENTOMETRICS 28 : 15 1993 

MICHELET B
THESIS U PARIS 7 PAR : 1988 

SALTON G
INTRO MODERN INFORM : 1983 

SCHNEIDER JW
Matrix comparison, part 1: Motivation and important issues for measuring 
the resemblance between proximity measures or ordination results 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
58 : 1586 DOI 10.1002/asi.20643 2007 

SMALL H
COCITATION IN SCIENTIFIC LITERATURE - NEW MEASURE OF RELATIONSHIP BETWEEN 
2 DOCUMENTS 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE 24 : 265 1973 

TANIMOTO TT
IBM TECHNICAL REPORT : 1957 

VANRIJSBERGEN CJ
THEORETICAL BASIS FOR USE OF CO-OCCURRENCE DATA IN INFORMATION-RETRIEVAL 
JOURNAL OF DOCUMENTATION 33 : 106 1977 

WAGNER C
INT J TECHNOLOGY GLO 1 : 185 2005 

WALTMAN L
Some comments on the question whether co-occurrence data should be 
normalized 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
58 : 1701 DOI 10.1002/asi.20647 2007 

WHITE HD
Author cocitation analysis and Pearson's r - Reply 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
55 : 843 DOI 10.1002/asi.20032 2004 

WHITE HD
Author cocitation analysis and Pearson's r 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 
54 : 1250 DOI 10.1002/asi.10325 2003 

WHITE HD
AUTHOR COCITATION - A LITERATURE MEASURE OF INTELLECTUAL STRUCTURE 
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE 32 : 163 1981 

ZITT M
Shadows of the past in international cooperation: Collaboration profiles 
of the top five producers of science 
SCIENTOMETRICS 47 : 627 2000 
