Jacso P "Errors of omission and their implications for computing scientometric measures in evaluating the publishing productivity and impact of countries" Online Information Review 33(2):376-385, 2009
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Mon Jul 20 16:40:52 EDT 2009
-----------------------------------------------------------
e-MAIL: jacso at hawaii.edu
TITLE : Errors of omission and their implications for computing
scientometric measures in evaluating the publishing productivity and impact
of countries
Author(s): Jacso P (Jacso, Peter)
Source: ONLINE INFORMATION REVIEW Volume: 33 Issue: 2 Pages: 376-
385 Published: 2009
Times Cited: 0 References: 44 Citation Map
Abstract: Purpose - The purpose of the paper is to explore the extent of
the absence of data elements that are critical from the perspective of
scientometric evaluation of the scientific productivity and impact of
countries in terms of the most common indicators - such as the number of
publications, the number of citations and the impact factor (the ratio of
citations received to papers published), and the effect these may have on
the h-index of countries - in two of the most widely used citation-enhanced
databases.
Design/methodology/approach - The author uses the Scopus database and
Thomson-Reuters' (earlier known as ISI) three citation databases (Science,
Social Sciences and Arts & Humanities), both as implemented on the Dialog
Information Services (Thomson ISI databases) and on the Web of Knowledge
platform, known as Web of Science (WoS). The databases were searched to
discover how many records they have for each year, how many of those have
cited references for each year, and what percentage of the records have
other essential or often used data elements for bibliometric/scientometric
evaluation.
Findings - There is no difference between the databases in the presence of
publication year data all of them include this element for all the records.
The presence of the language field is comparable between the Thomson and
Scopus databases, but it should be noted that a 2 per cent difference for
mega-databases of such size is not entirely negligible. The rate of
presence of the subject category field is better in Scopus, even though it
has far fewer subject categories (27) than the Thomson databases (well over
200). The rate of absence of country identification is the most critical
and disappointing. It is caused primarily by the fact that journals have
not had consistent policies for including the country affiliation of the
authors. The huge 34 percent omission rate of country identification in
Scopus also hurts its impressive author identification feature.
Unfortunately, the country information is not available in more than 12
million records.
Originality/value - Irrespective of the reasons for the very high rate of
omission of country names or codes, it should be realised and prominently
mentioned in any scientometric country reports. The author has never seen
this mentioned in published papers, nor in the manuscripts that he has peer
reviewed. Many can live with the low omission rates of the language,
document type and subject category elements, and many can just avoid using
these filters. The two factors that define the level of distortion in the
assessment and ranking of the research achievements of countries are the
rate of cited reference enhanced records and the rate of presence of
country affiliation data.
Reprint Address: Jacso, P (reprint author), Univ Hawaii, Honolulu, HI 96822
USA
Addresses:
1. Univ Hawaii, Honolulu, HI 96822 USA
Publisher: EMERALD GROUP PUBLISHING LIMITED, HOWARD HOUSE, WAGON LANE,
BINGLEY BD16 1WA, W YORKSHIRE, ENGLAND
IDS Number: 447HU
ISSN: 1468-4527
DOI: 10.1108/14684520910951276
CITED REFERENCES:
1. *SCOP SCOP DET FACTS FIG : 2009
2. *THOMS REUT WEB SCI : 2009
3. BARILAN J Some measures for comparing citation databases
JOURNAL OF INFORMETRICS 1 : 26 DOI 10.1016/j.joi.2006.08.001 2007
4. BARILAN J Which h-index? - A comparison of WoS, Scopus and Google
Scholar SCIENTOMETRICS 74 : 257 DOI 10.1007/s11192-008-0216-y 2008
5. BORNMANN L The state of h index research Is the h index the ideal way
to measure research performance? EMBO REPORTS 10 : 2 DOI
1038/embor.2008.233 2009
6. BORNMANN L Are there better indices for evaluation purposes than the h
index? a comparison of nine different variants of the h index using data
from biomedicine JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE
AND TECHNOLOGY 59 : 830 DOI 10.1002/asi.20806 2008
7. BORNMANN L Convergent validity of bibliometric Google Scholar data in
the field of chemistry-Citation counts for papers that were accepted by
Angewandte Chemie International Edition or rejected but published
elsewhere, using Google Scholar, Science Citation Index, Scopus, and
Chemical Abstracts JOURNAL OF INFORMETRICS 3 : 27
DOI 1016/j.joi.2008.11.001 2009
8. BUTLER L ETHICS SCI ENV POLIT 8 : 83 2008
9. CRONIN B Using the h-index to rank influential information scientists
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 57 :
1275 DOI 10.1002/asi.20354 2006
10. CSAJBOK E Hirsch-index for countries based on essential science
indicators data SCIENTOMETRICS 73 : 91 DOI 10.1007/s11192-007-1859-9 2007
11. DALUZ MP Institutional h-index: The performance of a new metric in
the evaluation of Brazilian Psychiatric Post-graduation Programs
SCIENTOMETRICS 77 : 361 DOI 10.1007/s11192-007-1964-9 2008
12. DEARAUJO AFP Increasing discrepancy between absolute and effective
indexes of research output in a Brazilian academic department
SCIENTOMETRICS 74 : 425 DOI 10.1007/s11192-007-1817-6 2008
13. DEMOYAANEGON F Coverage analysis of Scopus: A journal metric approach
SCIENTOMETRICS 73 : 53 DOI 10.1007/s11192-007-1681-4 2007
14. DESS HM SCOPUS : 2006
15. FINGERMAN S SCOPUS: Profusion and confusion ONLINE 29 : 36 2005
16. FINGERMAN S WEB SCI SCOPUS CURRE : 2006
17. GAVEL Y Web of Science and Scopus: a journal title overlap study
ONLINE INFORMATION REVIEW 32 : 8 DOI 10.1108/14684520810865958 2008
18. GOODMAN CD Fatty acid biosynthesis as a drug target in apicomplexan
parasites CURRENT DRUG TARGETS 8 : 15 2007
19. GORMAN GE "They can't read, but they sure can count" Flawed rules of
the journal rankings game ONLINE INFORMATION REVIEW 32 : 705 DOI
1108/14684520810923872 2008
20. HOOD WW Informetric studies using databases: Opportunities and
challenges SCIENTOMETRICS 58 : 587 2003
21. JACSO P Content evaluation of databases ANNUAL REVIEW OF INFORMATION
SCIENCE AND TECHNOLOGY 32 : 231 1997
22. JACSO P CONTENT EVALUATION T : 2001
23. JACSO P As we may search - Comparison of major features of the Web of
Science, Scopus, and Google Scholar citation-based and citation-enhanced
databases CURRENT SCIENCE 89 : 1537 2005
24. JACSO P SEARCHING FOR SKELETONS IN THE DATABASE CUPBOARD .1. ERRORS
OF OMISSION DATABASE 16 : 38 1993
25. JACSO P Comparison and analysis of the citedness scores in web of
science And Google Scholar DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND
SHARING EXPERIENCES, PROCEEDINGS 3815 : 360 2005
26. JACSO P Testing the calculation of a realistic h-index in Google
Scholar, Scopus, and Web of Science for F. W. Lancaster
LIBRARY TRENDS 56 : 784 2008
27. JACSO P Savvy searching - Google Scholar revisited
ONLINE INFORMATION REVIEW 32 : 102 DOI 10.1108/14684520810866010 2008
28. JACSO P ONLINE INFORM REV 32 : 262 2008
29. JACSO P The pros and cons of computing the h-index using Google
Scholar ONLINE INFORMATION REVIEW 32 : 437 DOI 10.1108/14694520810889718
2008
30. JACSO P The pros and cons of computing the h-index using Scopus
ONLINE INFORMATION REVIEW 32 : 524 DOI 10.1108/14684520810897403 2008
31. JACSO P The pros and cons of computing the h-index using Web of
Science ONLINE INFORMATION REVIEW 32 : 673 DOI 10.1108/14684520810914043
2008
32. JACSO P The dimensions of cited reference enhanced database subsets
ONLINE INFORMATION REVIEW 31 : 694 DOI 10.1108/14684520710832360 2007
33. JACSO P Deflated, inflated and phantom citation counts
ONLINE INFORMATION REVIEW 30 : 297 DOI 10.1108/14684520610675816 2006
34. LEYDESDORFF L Caveats for the use of citation indicators in research
and journal evaluations JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION
SCIENCE AND TECHNOLOGY 59 : 278 DOI 10.1002/asi.20743 2008
35. MEHO LI Impact of data sources on citation counts and rankings of LIS
faculty: Web of science versus scopus and google scholar
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 58 :
2105 DOI 10.1002/asi.20677 2007
36. MEHO LI P 11 INT C INT SOC S : 2007
37. NEUHAUS C Data sources for performing citation analysis: an overview
JOURNAL OF DOCUMENTATION 64 : 193 DOI 10.1108/00220410810858010 2008
38. NEUHAUS C The depth and breadth of Google Scholar: An empirical study
PORTAL-LIBRARIES AND THE ACADEMY 6 : 127 2006
39. NORRIS M Comparing alternatives to the Web of Science for coverage of
the social sciences' literature JOURNAL OF INFORMETRICS 1 : 161 DOI
10.1016/j.joi.2006.12.001 2007
40. ROUSSEAU R The influence of missing publications on the Hirsch index
JOURNAL OF INFORMETRICS 1 : 2 2007
41. VANCLAY JK On the robustness of the h-index JOURNAL OF THE AMERICAN
SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY 58 : 1547 2007
42. VANRAAN AFJ SCIENTOMETRICS 69 : 117 2005
43. WHITE B NZ LIB INFORM MANAGE 50 : 11 2006
44. YANG K P 69 ANN M AM SOC IN : 43 2006
More information about the SIGMETRICS
mailing list