Burrell, QL (Burrell, Quentin L.) Some comments on "The estimation of lost multi-copy documents: A new type of informetrics theory" by Egghe and Proot JOURNAL OF INFORMETRICS, 2 (1): 101-105 2008
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Wed Apr 2 16:36:51 EDT 2008
E-mail Address: q.burrell at ibs.ac.im
Author(s): Burrell, QL (Burrell, Quentin L.)
Title: Some comments on "The estimation of lost multi-copy documents: A
new type of informetrics theory" by Egghe and Proot
Source: JOURNAL OF INFORMETRICS, 2 (1): 101-105 2008
Language: English
Document Type: Article
Author Keywords: multi-copy documents; truncated Poisson distribution;
maximum likelihood; unseen species problem
Keywords Plus: NUMBER; SAMPLE; POPULATION
Abstract: Egghe and Proot [Egghe, L., & Proot, G. (2007). The estimation
of the number of lost multi-copy documents: A new type of informetrics
theory. Journal of Informetrics] introduce a simple probabilistic model to
estimate the number of lost multi-copy documents based on the numbers of
retrieved ones. We show that their model in practice can essentially be
described by the well-known Poisson approximation to the binomial. This
enables us to adopt a traditional maximum likelihood estimation (MLE)
approach which allows the construction of (approximate) confidence
intervals for the parameters of interest, thereby resolving an open
problem left by the authors. We further show that the general estimation
problem is a variant of a well-known unseen species problem. This work
should be viewed as supplementing that of Egghe and Proot [Egghe, L., &
Proot, G. (2007). The estimation of the number of lost multi-copy
documents: A new type of informetrics theory. Journal of Informetrics]. It
turns out that their results are broadly in line with those produced by
this rather more robust statistical analysis. (C) 2007 Elsevier Ltd. All
rights reserved.
Addresses: Isle Man Int Business Sch, Douglas IM2 1QB, Man, England
Reprint Address: Burrell, QL, Isle Man Int Business Sch, Old Castletown
Rd, Douglas IM2 1QB, Man, England.
E-mail Address: q.burrell at ibs.ac.im
Cited Reference Count: 16
Times Cited: 0
Publisher: ELSEVIER SCIENCE BV
Publisher Address: PO BOX 211, 1000 AE AMSTERDAM, NETHERLANDS
ISSN: 1751-1577
BAIN LJ
INTRO PROBABILITY MA : 1992
BROOKES BC
SAMPLING THEOREM FOR FINITE DISCRETE DISTRIBUTIONS
JOURNAL OF DOCUMENTATION 31 : 26 1975
BURRELL QL
The sample size dependency of statistical measures in informetrics? Some
comments
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY
54 : 1076 DOI 10.1002/asi.10307 2003
BURRELL QL
ON THE GROWTH OF BIBLIOGRAPHIES WITH TIME - AN EXERCISE IN BIBLIOMETRIC
PREDICTION
JOURNAL OF DOCUMENTATION 45 : 302 1989
BURRELL QL
A SIMPLE EMPIRICAL-METHOD FOR PREDICTING LIBRARY CIRCULATIONS
JOURNAL OF DOCUMENTATION 44 : 302 1988
BURRELL WL
INFORMETRICS 89 : 57 1990
CHUNG K
ELEMENTARY PROBABILI : 2003
DEGROOT HM
PROBABILITY STAT : 1986
EFRON B
ESTIMATING NUMBER OF UNSEEN SPECIES - HOW MANY WORDS DID SHAKESPEARE KNOW
BIOMETRIKA 63 : 435 1976
EGGHE L
The estimation of the number of lost multi-copy documents: A new type of
informetrics theory
JOURNAL OF INFORMETRICS 1 : 257 DOI 10.1016/j.joi.2007.02.003 2007
ENGEN S
STOCHASTIC ABUNDANCE : 1978
FELLER W
INTRO PROBABILITY TH : 1968
FISHER RA
The relation between the number of species and the number of individuals
in a random sample of an animal population
JOURNAL OF ANIMAL ECOLOGY 12 : 42 1943
GOOD IJ
THE NUMBER OF NEW SPECIES, AND THE INCREASE IN POPULATION COVERAGE, WHEN A
SAMPLE IS INCREASED
BIOMETRIKA 43 : 45 1956
KENDALL MG
THE BIBLIOGRAPHY OF OPERATIONAL-RESEARCH
OPERATIONAL RESEARCH QUARTERLY 11 : 31 1960
PRESS WH
NUMERICAL RECIPES AR : 1986
More information about the SIGMETRICS
mailing list