A mathematical theory of citing, Simkin, MV; Roychowdhury, VP, JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 58 (11). SEP 2007. p.1661-1673

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Thu Dec 6 11:06:41 EST 2007


E-mail Address: simkin at ee.ucla.edu

Author(s): Simkin, MV (Simkin, Mikhail V.); Roychowdhury, VP 
(Roychowdhury, Vwani P.) 

Title: A mathematical theory of citing 

Source: JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND 
TECHNOLOGY, 58 (11): 1661-1673 SEP 2007 

Language: English 
Document Type: Article 

Keywords Plus: NETWORKS; EVOLUTION; MODEL; CRITICALITY; ALLELES; CHANCE

Cited Reference Count: 50 
Times Cited: 0 

Publisher: JOHN WILEY & SONS INC 
Publisher Address: 111 RIVER ST, HOBOKEN, NJ 07030 USA 

ISSN: 1532-2882 

Subject Category: Computer Science, Information Systems; Information 
Science & Library Science 
 
Abstract: Recently we proposed a model in which when a scientist writes a 
manuscript, he picks up several random papers, cites them, and also copies 
a fraction of their references. The model was stimulated by our finding 
that a majority of scientific citations are copied from the lists of 
references used in other papers. It accounted quantitatively for several 
properties of empirically observed distribution of citations; however, 
important features such as power-law distributions of citations to papers 
published during the same year and the fact that the average rate of 
citing decreases with aging of a paper were not accounted for by that 
model. Here, we propose a modified model: When a scientist writes a 
manuscript, he picks up several random recent papers, cites them, and also 
copies some of their references. The difference with the original model is 
the word recent. We solve the model using methods of the theory of 
branching processes, and find that it can explain the aforementioned 
features of citation distribution, which our original model could not 
account for. The model also can explain "sleeping beauties in science;" 
that is, papers that are little cited for a decade or so and 
later "awaken" and get many citations. Although much can be understood 
from purely random models, we find that to obtain a good quantitative 
agreement with empirical citation data, one must introduce Darwinian 
fitness parameter for the papers. 

Addresses: Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90095 
USA 

Reprint Address: Simkin, MV, Univ Calif Los Angeles, Dept Elect Engn, Los 
Angeles, CA 90095 USA. 

E-mail Address: simkin at ee.ucla.edu; vwani at ee.ucla.edu 

Cited References: 
ALSTROM P, 1988, PHYS REV A, V38, P4905.
ASIMOV I, 1958, ONLY TRILLION.
BAK P, 1988, PHYS REV A, V38, P364.
BAK P, 1993, PHYS REV LETT, V71, P4083.
BAK P, 1999, NATURE WORKS SCI SEL.
BARABASI AL, 1999, SCIENCE, V286, P509.
BENTLEY RA, 2004, P ROY SOC LOND B BIO, V271, P1443.
BIANCONI G, 2001, EUROPHYS LETT, V54, P436.
BRODY T, 2005, EARLIER WEB USAGE ST.
BURRELL QL, 2002, SCIENTOMETRICS, V55, P273.
BURRELL QL, 2003, J AM SOC INF SCI TEC, V54, P372.
BURRELL QL, 2005, SCIENTOMETRICS, V65, P381.
CAVALLISFORZA LL, 1981, CULTURAL TRANSMISSIO.
CRONIN B, 1981, J DOC, V37, P16.
DOROGOVTSEV SN, 2002, ADV PHYS, V51, P1079.
EWENS WJ, 1964, GENETICS, V50, P891.
FISHER RA, 1958, GENETICAL THEORY NAT.
GARFIELD E, 2004, CURR CONTENTS, V21, P5.
GLANZEL W, 1994, SCIENTOMETRICS, V30, P49.
GLANZEL W, 2003, SCIENTOMETRICS, V58, P571.
GLANZEL W, 2004, SCIENTIST, V18, P8.
GUNTHER R, 1996, INT J THEOR PHYS, V35, P395.
HAHN MW, 2003, P ROYAL SOC LOND B S.
HARRIS TE, 1963, THEORY BRANCHING PRO.
HERZOG HA, 1963, P ROYAL SOC LOND B S.
KIMURA M, 1964, GENETICS, V49, P725.
KLEENE SC, 1952, INTRO METANMATHEMATI.
KRAPIVSKY PL, 2001, PHYS REV E 2, V63.
LAURITSEN KB, 1996, PHYS REV E, V54, P2483.
LEE DS, 2004, SANDPILE AVALANCHE D.
MOTYLEV VM, 1989, SCIENTOMETRICS, V15, P97.
NAKAMOTO H, 1988, INFORMETRIC 87 88.
OTTER R, 1949, ANN MATH STAT, V20, P206.
POLLMANN T, 2000, SCIENTOMETRICS, V47, P43.
PRICE DJD, 1965, SCIENCE, V149, P510.
PRICE DJD, 1976, J AM SOC INFORM SCI, V27, P292.
RAAN AFJ, 2004, SCIENTOMETRICS, V59, P467.
REDNER S, 1998, EUR PHYS J B, V4, P131.
REDNER S, 2004, CITATION STAT MORE C.
SIMKIN MV, 2003, COMPLEX SYSTEMS, V14, P269.
SIMKIN MV, 2005, ANN IMPROBABKE RES, V1, P24.
SIMKIN MV, 2005, SCIENTOMETRICS, V62, P367.
SIMKIN MV, 2006, J MATH SOCIOL, V30, P33.
SIMON HA, 1955, BIOMETRIKA, V42, P425.
SOKAL A, 1998, FASHIONABLE NONSENSE.
VANDEWALLE N, 1996, PHYSICA D, V90, P262.
VAZQUEZ A, 2001, EUROPHYS LETT, V54, P430.
WATSON HW, 1874, J ANTHR I, V4, P138.
WEISSTEIN EW, ERF MATHWORLD.
WEISSTEIN EW, LAGRANGE EXPANSION M. 

 



More information about the SIGMETRICS mailing list