contents of Journal of Inforrmation Science v.32(3) 2006 and Information Processing & Management 42(6) Dec. 2006
Eugene Garfield
eugene.garfield at THOMSON.COM
Fri Jul 28 14:46:11 EDT 2006
The following articles from the Journal of Information Science and the
journal Informqtion Processing and Management are called to your
attention without the usual detailed data extraction from the
WebofScience. A few will be processed separately by readers of the
listserv are encouraged to seek out these papers themselves. The volume
of literature these days is getting harder and harder to keep up with.
Gene Garfield
TITLE: Sample size and informetric model goodness-of-fit
outcomes: a search engine log case study (Article,
English)
AUTHOR: Ajiferuke, I; Wolfram, D; Famoye, F
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.212-222
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): J DOC* rwork; SCIENTOMETR* rwork;
INFORMETRIC* item_title; J INF SCI source_abbrev_20
KEYWORDS: informetric modelling; frequency distributions; internet
usage patterns; goodness-of-fit tests
KEYWORDS+: POISSON-DISTRIBUTION; TESTS; WEB; QUERIES
ABSTRACT: The influence of sample size on informetric
characteristics is examined to determine whether theoretical
mathematical
models can adequately fit large data sets. Two large data sets of
queries
submitted to the Excite search service were sampled for search
characteristics (term frequencies, terms used per query, pages viewed
per
query, queries submitted per session) producing data sets of various
sizes that were fitted to theoretical models to determine how the sample
may influence a model's goodness-of-fit. Although theoretical models
could adequately fit smaller data sets of up to 5000 observations in
some
cases, larger data sets could not be satisfactorily fitted using several
goodness-of-fit techniques. Investigators must take into account that
sample size does influence goodness-of-fit outcomes. The nature of the
data and not the limitations of given goodness-of-fit tests results in
significant outcomes. Such goodness-of-fit tests should be used for
comparative purposes, rather than significance testing.
AUTHOR ADDRESS: I Ajiferuke, Univ Western Ontario, Fac Informat & Media
Studies, London, ON N6A 5B7, Canada
[ ]<-- Enter an X to order article (IDS: 062ML 00001) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Document delivery as a source for bibliometric analyses:
the case of Subito (Article, English)
AUTHOR: Schloegl, C; Gorraiz, J
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.223-237
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): LINE MB rauth; J DOC* rwork; BIBLIOMETR* item_title;
J INF SCI source_abbrev_20
KEYWORDS: document delivery; Subito; bibliometric analysis;
article
orders; journal requests; age of ordered articles;
citation frequency
KEYWORDS+: JOURNAL USE; DEMAND
ABSTRACT: This paper deals with a bibliometric analysis of data
from the document delivery service Subito. After a short introduction,
Subito will be presented briefly. The main part reports on the design
and
the results of the study, which covers the following major topics:
distribution of article orders to journals,
identification of the core journals which contribute to most article
supplies,
subject distribution of the most requested journals,
relation between the most requested (Subito) and the most cited journals
(SCI),
differences in age of ordered and cited articles, and
dependency of journal requests on their subscription rates.
As will be shown, most Subito article orders are covered by a relatively
small number of journals, most of which are from life sciences,
especially medicine. There is only a slight overlap between the most
requested and the most cited journals suggesting that these two
indicators represent different concepts. This is also confirmed by
different obsolescence characteristics. The share of current
publications
is much higher among ordered than among cited articles. Finally, there
was no evidence that articles of journals with higher subscription rates
are ordered more often.
AUTHOR ADDRESS: C Schloegl, Univ Str 15-F3, A-8010 Graz, Austria
[ ]<-- Enter an X to order article (IDS: 062ML 00002) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Automated support specification for efficient mining of
interesting association rules (Article, English)
AUTHOR: Lin, WY; Tseng, MC
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.238-250
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): J INF SCI source_abbrev_20
KEYWORDS: data mining; decision support systems; association
rules;
support specification
ABSTRACT: In recent years, the weakness of the canonical support-
confidence framework for associations mining has been widely studied.
One
of the difficulties in applying association rules mining is the setting
of support constraints. A high-support constraint avoids the
combinatorial explosion in discovering frequent itemsets, but at the
expense of missing interesting patterns of low support. Instead of
seeking a way to set the appropriate support constraints, all current
approaches leave the users in charge of the support setting, which,
however, puts the users in a dilemma. This paper is an effort to answer
this long-standing open question. According to the notion of confidence
and lift measures, we propose an automatic support specification for
efficiently mining high-confidence and positive-lift associations
without
consulting the users. Experimental results show that the proposed method
is not only good at discovering high-confidence and positive-lift
associations, but also effective in reducing spurious frequent itemsets.
AUTHOR ADDRESS: WY Lin, Natl Univ Kaohsiung, Dept Comp Sci & Informat
Engn,
Kaohsiung 811, Taiwan
[ ]<-- Enter an X to order article (IDS: 062ML 00003) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Review - Knowledge reuse in action: the case of CALL
(Review, English)
AUTHOR: Chua, AYK; Lam, W; Majid, S
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.251-260
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): J INF SCI source_abbrev_20
KEYWORDS: knowledge sharing; knowledge reuse; case study;
organizational learning; military; after-action review
KEYWORDS+: MANAGEMENT
ABSTRACT: This paper reviews how the Center for Army Lessons
Learned (CALL), an intelligence unit within the US Army, has entrenched
a
unique knowledge reuse process into its modus operandi. It highlights
several issues related to knowledge reuse, including the collection,
distillation and dissemination of knowledge, the role of subject experts
in the knowledge reuse process and how technology facilitates knowledge
reuse. For practitioners, this paper offers an inspiring exemplar of
knowledge reuse success. For researchers, existing theoretical grounds
for knowledge reuse have been opened for further debate.
AUTHOR ADDRESS: AYK Chua, 31 Nanyang Link,SCI Bldg, Singapore 637718,
Singapore
[ ]<-- Enter an X to order article (IDS: 062ML 00004) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Towards the faster transformation of XML documents
(Article, English)
AUTHOR: Shin, DH; Lee, KH
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.261-276
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): J INF SCI source_abbrev_20
KEYWORDS: transformation; XML document; XSLT script
KEYWORDS+: MAPPINGS
ABSTRACT: XML is so flexible that several different schemas are
often used even in the same application domain. To interchange XML
documents between two parties, it is necessary to transform XML
documents
into ones that conform to the schema of a partner. Since a
transformation
is repeatedly applied to a large volume of XML documents, the
transformation speed is important. This paper proposes a method for
generating XSLT scripts, which support the fast transformation of XML
documents, given one-to-one matching relationships between leaf nodes of
XML schemas. The proposed method consists of two steps: computing
matches
between internal nodes and generating XSLT scripts. Specifically, the
proposed method considers many-to-one matches among cardinality nodes.
The transformation of recursive structures is also supported. The
proposed method generates XSLT scripts with fewer templates that are
proportional to the number of the matches between recursive nodes.
Experimental results show that the proposed method generates XSLT
scripts
that support the faster transformation of XML documents, compared with
previous work.
AUTHOR ADDRESS: KH Lee, Yonsei Univ, Dept Comp Sci, 134 Shinchon Dong,
Seoul 120749, South Korea
[ ]<-- Enter an X to order article (IDS: 062ML 00005) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: How influential is Brooks' law? A longitudinal citation
context analysis of Frederick Brooks' The Mythical
Man-Month (Article,
English)
AUTHOR: McCain, KW; Salvucci, LJ
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.277-295
SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): CRONIN B rauth; GARFIELD E rauth; MORAVCSIK MJ
rauth;
SMALL H rauth; SMALL HG rauth; J DOC* rwork;
J INF SCI rwork; SCIENTOMETR* rwork;
CITATION item_title; CITATION* item_title;
J INF SCI source_abbrev_20;
GARFIELD E SCIENTOMETRICS 7:487 1985
KEYWORDS: software engineering; citation analysis; citation
context
analysis; longitudinal citation analysis; scholarly
communication; diffusion of ideas; concept symbols
KEYWORDS+: BIG SCIENCE; PART II; SOCIOLOGY; ECONOMICS;
CONSTRUCTION;
KNOWLEDGE; PATTERNS; ARTICLE; QUALITY; SYSTEMS
ABSTRACT: Citation context analysis is used to demonstrate the
diversity of concept symbols that a book-length publication can
represent
and the diffusion of influence of these concepts over time and across
scholarly disciplines. A content analysis of 574 citation contexts from
497 journal articles citing an edition of Frederick P. Brooks, Jr's The
Mythical Man-Month (MMM) over the period 1975-1999 showed that MMM
represents a variety of different concepts and is cited in a wide range
of subject areas. Over time, a high level of interest in MMM spread from
software engineering and computer science to management and information
systems, with different areas showing different patterns of focus on
concepts within the work. 'Brooks' Law' (the 'mythical man-month' or
'adding more people to a late project makes it later'), accounted for
less than 30% of the classified citation contexts. The findings
contribute to our understanding of the diffusion of ideas in scholarly
communication, and the diversity that can underlie the creation of a
reference in a scholarly publication.
AUTHOR ADDRESS: KW McCain, Drexel Univ, Coll Informat Sci & Technol,
3141
Chestnut St, Philadelphia, PA 19104 USA
[ ]<-- Enter an X to order article (IDS: 062ML 00006) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Collaborative interaction behaviors in an information
technology problem-solving context: cognitive movements
of the helper and
the helped (vol 31, pg 483, 2005) (Correction, English)
AUTHOR: Kim, SJ; Wang, C
SOURCE: JOURNAL OF INFORMATION SCIENCE 32 (3). 2006. p.296 SAGE
PUBLICATIONS LTD, LONDON
SEARCH TERM(S): J INF SCI rwork; J INF SCI source_abbrev_20;
*CORRECT* doctype
[ ]<-- Enter an X to order article (IDS: 062ML 00007) ISSN: 0165-5515
------------------------------------------------------------------------
--
TITLE: Sovereign inequalities and hierarchy in anarchy:
American
power and international society (Review, English)
AUTHOR: Donnelly, J
SOURCE: EUROPEAN JOURNAL OF INTERNATIONAL RELATIONS 12 (2). JUN
2006. p.139-170 SAGE PUBLICATIONS LTD, LONDON
SEARCH TERM(S): CRONIN B rauth
KEYWORDS: anarchy; empire; hierarchy; inequality;
semi-sovereignty;
sovereignty
KEYWORDS+: COLLECTIVE SECURITY; INFORMAL EMPIRE; CONCERT; STATES;
EUROPE; LOGICS
ABSTRACT: How is unrivalled American power reshaping 21st-century
international society? Is the United States an empire, in fact or in the
making? This article attempts to elaborate the conceptual resources
required to answer such quesfions. I focus on multiple forms of
hierarchy
in anarchy and diverse practices of sovereign inequality - concepts that
most main-diverse stream perspectives ignore, find paradoxical, or even
dismiss as selfconLradictory.' After defining empire and hierarchy in
anarchy, I present a typology of international orders tuned to thinking
about empire and its alternatives. The central section of the article
explores three classes of formal inequalities common during the
Westphalian era - special rights of Great Powers, restricted rights for
outlaws, and a wide range of particular practices of 'semi-sovereignty'.
I then sketch ten historically grounded models of hierarchical
international relations. Two brief applications to contemporary American
power seek to illustrate the value of this conceptual apparatus.
Throughout, my focus is on appreciating the precise nature and
considerable variety of international inequalities. I argue that the
concepts of hierarchy in anarchy and sovereign inequality but not
empire,
are essential for understanding the shape and development of
contemporary
international order.
AUTHOR ADDRESS: J Donnelly, Univ Denver, Denver, CO 80208 USA
[ ]<-- Enter an X to order article (IDS: 062MR 00001) ISSN: 1354-0661
------------------------------------------------------------------------
--
------------------------------------------------------------------------
--
TITLE: Expansion of the field of informetrics: The second
special issue (Editorial Material, English)
AUTHOR: Egghe, L
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1405-1407 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): INFORMETRIC* item_title; EGGHE L
primaryauthor,author;
EDITORIAL doctype
AUTHOR ADDRESS: L Egghe, Univ Hasselt, Campus Diepenbeek,Agoralaan,
B-3590
Diepenbeek, Belgium
[ ]<-- Enter an X to order article (IDS: 062TI 00001) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Measures of international collaboration in scientific
literature: Part I (Article, English)
AUTHOR: Bookstein, A; Moed, H; Yitzahki, M
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1408-1421 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SCIENTOMETR* rwork
ABSTRACT: Research evaluating models of scientific productivity
require coherent metrics that quantify various key relations among
papers
as revealed by patterns of citation. This paper focuses on the various
conceptual problems inherent in measuring the degree to which papers
tend
to cite other papers written by authors of the same nationality. We
suggest that measures can be given a degree of assurance of coherence by
being based on mathematical models describing the citation process. A
number of such models are developed. (c) 2006 Published by Elsevier Ltd.
AUTHOR ADDRESS: A Bookstein, Univ Chicago, 1010 E 59 St, Chicago, IL
60637
USA
[ ]<-- Enter an X to order article (IDS: 062TI 00002) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Measures of international collaboration in scientific
literature: Part II (Article, English)
AUTHOR: Bookstein, A; Moed, H; Yitzahki, M
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1422-1427 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SCIENTOMETR* rwork
ABSTRACT: This paper continues the attempt of Part I to develop a
coherent family of measures of influence between classes of documents,
for example, language or nationality classes, as indicated by citation
choice. In this paper we focus on situations in which there is some
ambiguity as to how to assign items to a class. For simplicity, we
change
our focus from citations to co-authorship patterns, restricting most of
our discussion to papers with two authors. Like the earlier paper, we
propose very simple models of the citation decision, and base our
measures on the parameters that appear in the model. (c) 2006 Published
by Elsevier Ltd.
AUTHOR ADDRESS: A Bookstein, Univ Chicago, 1010 E 59 St, Chicago, IL
60637
USA
[ ]<-- Enter an X to order article (IDS: 062TI 00003) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Systems without low-productive sources (Article,
English)
AUTHOR: Egghe, L; Rousseau, R
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1428-1441 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): PRICE DJD rauth; J DOC* rwork; SCIENTOMETR* rwork;
EGGHE L primaryauthor,author
KEYWORDS+: SIZE DISTRIBUTION; POWER LAWS; PARETO; SCIENCE; CITIES
ABSTRACT: Information production processes (IPPs) without low-
productive sources are studied. A success-breeds-success or preferential
attachment mechanism is established in which, from some point in time
on,
no new sources are created. Such systems are called mature systems. When
time increases in mature systems the expected number of sources with a
low number of items strictly decreases. An adaptation of the Naranan-
Egghe model indicates that IPPs without low-productive sources must have
small alpha exponents (alpha < 2) in their size-frequency power law
descriptions.
A positive reinforcement model explains all the essential properties.
Using this approach it is shown that, when time increases in mature
systems the alpha exponent of the power size-frequency function
decreases, while, moreover, the minimum source size increases. This is
the main result of this article.
Examples related to country and city sizes illustrate the concepts and
results discussed in this article. (c) 2006 Elsevier Ltd. All rights
reserved.
AUTHOR ADDRESS: L Egghe, Univ Hasselt, Campus Diepenbeek,Agorlaan,Gebouw
D,
B-3590 Diepenbeek, Belgium
[ ]<-- Enter an X to order article (IDS: 062TI 00004) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: An interpretation of the effort function through the
mathematical formalism of Exponential Informetric
Process (Article,
English)
AUTHOR: Lafouge, T; Smolczewska, A
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1442-1450 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): J INF SCI rwork; SCIENTOMETR* rwork;
INFORMETRIC* item_title
KEYWORDS+: INFORMATION; DISTRIBUTIONS; LAWS
ABSTRACT: Statistical distributions in the production or
utilization of information are most often studied in the framework of
Lotkaian informetrics. In this article, we show that an Information
Production Process (IPP), traditionally characterized by Lotkaian
distributions, can be fruitfully studied using the effort function, a
concept introduced in an earlier article to define an Exponential
Informetric Process. We thus propose replacing the concept of Lotkaian
distribution by the logarithmic effort function. In particular, we show
that an effort function defines an Exponential Informetric Process if
its
asymptotic behavior is equivalent to the logarithmic function
beta(.)Log(x) with beta > 1, which is the effort function of a Lotkaian
distribution. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: T Lafouge, Univ Lyon 1, Lab Ursidoc, 43 Blvd 11 Novembre
1918, F-69622 Villeurbanne, France
[ ]<-- Enter an X to order article (IDS: 062TI 00005) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Modeling citation behavior in Management Science
journals
(Article, English)
AUTHOR: Mingers, J; Burrell, QL
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1451-1464 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): PRICE DJD rauth; J DOC* rwork; J INF SCI rwork;
SCIENTOMETR* rwork; JOURNALS item_title;
CITATION item_title; CITATION* item_title
KEYWORDS: citations; gamma-Poisson model; negative binomial
distribution; obsolescence; stochastic modeling
KEYWORDS+: CUMULATIVE ADVANTAGE
ABSTRACT: Citation rates are becoming increasingly important in
judging the research quality of journals, institutions and departments,
and individual faculty. This paper looks at the pattern of citations
across different management science journals and over time. A stochastic
model is proposed which views the generating mechanism of citations as a
gamma mixture of Poisson processes generating overall a negative
binomial
distribution. This is tested empirically with a large sample of papers
published in 1990 from six management science journals and found to fit
well. The model is extended to include obsolescence, i.e., that the
citation rate for a paper varies over its cited lifetime. This leads to
the additional citations distribution which shows that future citations
are a linear function of past citations with a time-dependent and
decreasing slope. This is also verified empirically in a way that allows
different obsolescence functions to be fitted to the data. Conclusions
concerning the predictability of future citations, and future research
in
this area are discussed. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: J Mingers, Univ Kent, Kent Business Sch, Canterbury CT2
7NZ, Kent, England
[ ]<-- Enter an X to order article (IDS: 062TI 00006) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: A distribution of impact factors of journals in the area
of software: An empirical study (Article, English)
AUTHOR: Sahoo, BB; Rao, IKR
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1465-1470 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): ARUNACHALAM S rauth; JOURNALS item_title;
IMPACT FACTOR* item_title
KEYWORDS: distribution of impact factors; gamma distribution;
informetric modeling
KEYWORDS+: LIFE SCIENCES RESEARCH; INDIA
ABSTRACT: Data from two bibliographic databases i.e. COMPENDEX and
INSPEC have been collected on the broad subject software and related
topics. The titles of the journals were extracted from the database. The
impact factors of the titles were also collected in order to study the
distribution of impact factors. An attempt has been made to identify a
suitable model to describe a distribution of impact factors. It has been
observed that a Gamma distribution fits the observed data very well with
one of the parameters as mean of the distribution and other parameter as
I. Further it has been observed that the researchers are now publishing
their articles in high impact factor journals. (c) 2006 Elsevier Ltd.
All
rights reserved.
AUTHOR ADDRESS: BB Sahoo, Indian Stat Inst, Documentat Res & Training
Ctr,
8th Mile,Mysore Rd, Bangalore 560059, Karnataka, India
[ ]<-- Enter an X to order article (IDS: 062TI 00007) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Automated Web issue analysis: A nurse prescribing case
study (Article, English)
AUTHOR: Thelwall, M; Thelwall, S; Fairclough, R
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1471-1483 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SMALL H rauth; J DOC* rwork; J INF SCI rwork;
SCIENTOMETR* rwork
KEYWORDS: Web; Automated Web issue analysis; link analysis; nurse
prescribing; medical informatics
KEYWORDS+: WORLD-WIDE-WEB; SCHOLARLY COMMUNICATION; INFORMATICS
EDUCATION; TOPIC IDENTIFICATION; HEALTH INFORMATION;
MEDICAL-EDUCATION; SITE INTERLINKING; QUALITY;
COCITATION;
WEBSITES
ABSTRACT: Web issue analysis, a new automated technique designed
to
rapidly give timely management intelligence about a topic from an
automated large-scale analysis of relevant pages from the Web, is
introduced and demonstrated. The technique includes hyperlink and URL
analysis to identify common direct and indirect sources of Web
information. In addition, text analysis through natural language
processing techniques is used identify relevant common nouns and noun
phrases. A case study approach is taken, applying Web issue analysis to
the topic of nurse prescribing. The results are presented in descriptive
form and a qualitative analysis is used to argue that new information
has
been found. The nurse prescribing results demonstrate interesting new
findings, such as the parochial nature of the topic in the UK, an
apparent absence of similar concepts internationally, at least in the
English-speaking world, and a significant concern with mental health
issues. These demonstrate that automated Web issue analysis is capable
of
quickly delivering new insights into a problem. General limitations are
that the success of Web issue analysis is dependant upon the particular
topic chosen and the ability to find a phrase that accurately captures
the topic and is not used in other contexts, as well as being language-
specific. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: M Thelwall, Wolverhampton Univ, Sch Comp & Informat
Technol, Wulfruna St, Wolverhampton WV1 1SB, W Midlands,
England
[ ]<-- Enter an X to order article (IDS: 062TI 00008) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Binary Pathfinder: An improvement to the Pathfinder
algorithm (Article, English)
AUTHOR: Guerrero-Bote, VP; Zapico-Alonso, F; Espinosa-Calvo, ME;
Crisostomo, RG; de Moya-Anegon, F
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1484-1490 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SCIENTOMETR* rwork
KEYWORDS: PFNETs; social networks; citation analysis; information
visualization
KEYWORDS+: COCITATION; NETWORKS
ABSTRACT: The Pathfinder algorithm is widely used to prune social
networks. The pruning maintains the geodesic distances between nodes. It
has shown itself to be very useful in the analysis of, amongst others,
citations in BIS (bibliometrics, informetrics, and scientometrics). It
has even been proposed for the online display of the search results in
an
information retrieval system. However, its great time and space
complexity limits its use in real-time applications and in networks of
any considerable size.
The present work describes an improved algorithm with considerably
reduced time and space complexity. Its lower execution costs thus
increase its applicability both in real time and to large networks. (c)
2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: VP Guerrero-Bote, Univ Extremadura, Fac Lib & Informat
Sci,
Antiguo Hosp Militar, Plaza Ibn Marwan S-N, Badajoz
06071,
Spain
[ ]<-- Enter an X to order article (IDS: 062TI 00009) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: A comparison of feature selection methods for an
evolving
RSS feed corpus (Article, English)
AUTHOR: Prabowo, R; Thelwall, M
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1491-1512 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): J INF SCI rwork
KEYWORDS: feature selection; chi-square; mutual information;
information gain
ABSTRACT: Previous researchers have attempted to detect
significant
topics in news stories and blogs through the use of word frequency-based
methods applied to RSS feeds. In this paper, the three statistical
feature selection methods: chi(2), Mutual Information (MI) and
Information Gain (I) are proposed as alternative approaches for ranking
term significance in an evolving RSS feed corpus. The extent to which
the
three methods agree with each other on determining the degree of the
significance of a term on a certain date is investigated as well as the
assumption that larger values tend to indicate more significant terms.
An
experimental evaluation was carried out with 39 different levels of data
reduction to evaluate the three methods for differing degrees of
significance. The three methods showed a significant degree of
disagreement for a number of terms assigned an extremely large value.
Hence, the assumption that the larger a value, the higher the degree of
the significance of a term should be treated cautiously. Moreover, MI
and
I show significant disagreement. This suggests that MI is different in
the way it ranks significant terms, as MI does not take the absence of a
term into account, although I does. I, however, has a higher degree of
term reduction than MI and chi(2). This can result in loosing some
significant terms. In summary, chi(2) seems to be the best method to
determine term significance for RSS feeds, as chi(2) identifies both
types of significant behavior. The chi(2) method, however, is far from
perfect as an extremely high value can be assigned to relatively
insignificant terms. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: R Prabowo, Wolverhampton Univ, Sch Comp & Informat
Technol,
Wulfruna St, Wolverhampton WV1 1SB, W Midlands, England
[ ]<-- Enter an X to order article (IDS: 062TI 00010) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Delineating complex scientific fields by an hybrid
lexical-citation method: An application to nanosciences
(Article, English)
AUTHOR: Zitt, M; Bassecoulard, E
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1513-1531 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): GARFIELD E rauth; SMALL H rauth; J INF SCI rwork;
SCIENTOMETR* rwork; CITATION item_title;
CITATION* item_title
KEYWORDS: information retrieval; lexical query; citation network;
nanosciences; scientific area delineation; bibliometrics
KEYWORDS+: INFORMETRIC DISTRIBUTIONS; WORD ANALYSIS; SCIENCE;
COCITATION; NANOTECHNOLOGY; LAWS; SPECIALTIES; PATENTS;
SYSTEMS
ABSTRACT: Relevance of bibliometric indicators on scientific areas
critically depends on the quality of their delineation. Macro-level
studies, often based on a selected list of journals, accept a high
degree
of fuzziness. Micro-level studies rely on sets of individual articles in
order to reduce noise and enhance precision of retrieval. The most usual
information retrieval process is based on lexical queries with various
levels of sophistication. In the experiment on Nanosciences reported
here, this process was used as a first step, to delineate a 'seed' of
literature. It has strong limitations, especially for emerging or
transversal fields. In a second step, the alternative approach of
citation linkages, was used to expand the bibliography starting from
lexical seed. The extension process presented is ruled by three
parameters, two deal with the cited side (threshold on citation score,
and specificity towards the field), one with the citing side (threshold
on the number of relevant references) interplaying in the 'referencing
structure' function (RSF) introduced in a previous work. This type of
combination proves effective for delineating the transversal field of
Nanosciences. Further improvements of the method are discussed. (c) 2006
Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: M Zitt, Observ Sci & Tech, 93 Rue Vaugirard, F-75015
Paris,
France
[ ]<-- Enter an X to order article (IDS: 062TI 00011) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Text mining without document context (Article, English)
AUTHOR: SanJuan, E; Ibekwe-SanJuan, F
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1532-1552 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SCIENTOMETR* rwork
KEYWORDS: multi-word term clustering; lexico-syntactic relations;
text mining; informetrics; cluster evaluation
KEYWORDS+: CLUSTER-ANALYSIS; WORD ANALYSIS; COCITATION; CRITERIA
ABSTRACT: We consider a challenging clustering task: the
clustering
of multi-word terms without document co-occurrence information in order
to form coherent groups of topics. For this task, we developed a
methodology taking as input multi-word terms and lexico-syntactic
relations between them. Our clustering algorithm, named CPCL is
implemented in the Term-Watch system. We compared CPCL to other existing
clustering algorithms, namely hierarchical and partitioning (k-means, k-
medoids). This out-of-context clustering task led us to adapt multi-word
term representation for statistical methods and also to refine an
existing cluster evaluation metric, the editing distance in order to
evaluate the methods. Evaluation was carried out on a list of multi-word
terms from the genomic field which comes with a hand built taxonomy.
Results showed that while k-means and k-medoids obtained good scores on
the editing distance, they were very sensitive to term length. CPCL on
the other hand obtained a better cluster homogeneity score and was less
sensitive to term length. Also, CPCL showed good adaptability for
handling very large and sparse matrices. (c) 2006 Elsevier Ltd. All
rights reserved.
AUTHOR ADDRESS: E SanJuan, Univ Metz, LITA, Ile du Saulcy, F-57045 Metz
1,
France
[ ]<-- Enter an X to order article (IDS: 062TI 00012) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: An ego-centric citation analysis of the works of Michael
O. Rabin based on multiple citation indexes (Article,
English)
AUTHOR: Bar-Ilan, J
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1553-1566 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): CRONIN B rauth; J DOC* rwork; J INF SCI rwork;
SCIENTOMETR* rwork; CITATION item_title;
CITATION ANALYS* item_title; CITATION* item_title
KEYWORDS: ego-centric citation analysis; Web of Science; Citeseer;
Google Scholar; multiple manifestations
KEYWORDS+: SCIENCE; AUTHORS
ABSTRACT: The primary goal of this study was to carry out an ego-
centric citation and reference analysis of the works of the
mathematician
and computer scientist, Michael O. Rabin. Until recently only a single
citation database was available for such research - the ISI Citation
Indexes. In this study we utilized and compared three major sources that
provide citation data: the Web of Science, Google Scholar and Citeseer.
Most cited works, citation identity, citation image makers and coauthors
were identified. The citation image makers acquired through these
sources
differ considerably. Advantages and shortcomings of each of the tools
are
discussed in the context of computer science. A major issue in computer
science is multiple manifestations of a work, i.e., its publication in
several venues (technical reports, proceedings, journals, collections).
The implications of multiple manifestations for citation analysis are
discussed. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: J Bar-Ilan, Bar Ilan Univ, Dept Informat Sci, IL-52900
Ramat Gan, Israel
[ ]<-- Enter an X to order article (IDS: 062TI 00013) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Journal self-citation study for semiconductor
literature:
Synchronous and diachronous approach (Article, English)
AUTHOR: Tsay, MY
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1567-1577 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): GARFIELD E rauth; LIPETZ BA rauth;
MACROBERTS MH rauth; SCIENTOMETR* rwork;
CITATION item_title; CITATION* item_title;
JOURNAL item_title
KEYWORDS: analysis; citation analysis; journal self-citing;
journal
self-cited; self-citation; synchronous vs. diachronous;
semiconductor journals
KEYWORDS+: BIBLIOMETRICS; PRODUCTIVITY
ABSTRACT: The present study investigates the self-citations of the
most productive semiconductor journals by synchronous (self-citing rate)
and diachronous (self-cited rate) approaches. Journal's productivity of
100 most productive semiconductor journals was gathered from INSPEC
database, 1978-1997 through OVID. Data of citation frequency were
obtained from the Science Citation Index (SCI), Journal Citation Reports
(JCR) 2001 CDROM edition by the title-by-title search. The self-citing
and self-cited data were drawn from the Citing Journal Listing and the
Cited Journal Listing of the JCR CDROM version 1990-2001. Self-citing
and
self-cited rates were determined by the method suggested by the JCR.
Eighty-seven journals common to INSPEC and JCR in semiconductor were
selected as the object of this study and were listed for statistical
tests. The results of the present study demonstrate that high
self-citing
journals are usually older than low self-citing journals. In contrast to
the self-citing data, the journal self-cited rate is not closely related
to the publication year but reflects the characteristics of various
journals. Journals with a short time interval of publication are more
possible with high self-citing and self-cited rates. Journals with
higher
self-citing rate tend to be more productive and receive more citation
than journals with lower self-citing rate. The journal self-cited rate
has no association with the number of articles that a journal published
and the citation it received. A journal with a higher self-citing rate
tends to be cited more by itself. The mean self-citing rate is 9.59% and
the mean self-cited rate is 15.03%. There is a significant difference
between self-citing and self-cited rates within the same set of
journals.
(c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: MY Tsay, Natl Chengchi Univ, Grad Inst Lib Informat &
Archival Studies, Wenshan Sect, 64 Sect,2 Chinan Rd,
Taipei
11623, Taiwan
[ ]<-- Enter an X to order article (IDS: 062TI 00014) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Towards all-author co-citation analysis (Article,
English)
AUTHOR: Zhao, DZ
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1578-1591 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SMALL H rauth; SCIENTOMETR* rwork;
CITATION item_title; CITATION ANALYS* item_title;
CITATION* item_title; CO CITATION* item_title
KEYWORDS: author co-citation analysis; scholarly communication;
citation analysis; web publishing
KEYWORDS+: SCIENCE; PUBLICATIONS; WEB
ABSTRACT: The present study examines one of the fundamental
aspects
of author co-citation analysis (ACA): the way co-citation counts are
defined. Co-citation counting provides the data on which all subsequent
statistical analyses and mappings are based, and we compare ACA results
based on two different types of co-citation counting: on the one hand,
the traditional type that only counts the first one among a cited work's
authors, and on the other hand, a simplified approach to all-author co-
citation counting that takes into account the first five authors of a
cited work. Results indicate that the picture produced through this
simplified all-author co-citation counting contains author groups that
are more coherent, and is therefore considerably clearer. However, this
picture represents fewer specialties in the research field being studied
than that produced through the traditional first-author co-citation
counting when the same number of top-ranked authors is selected and
analyzed. Reasons for these effects are discussed. Variations of
counting
more than first authors are compared. (c) 2006 Elsevier Ltd. All rights
reserved.
AUTHOR ADDRESS: DZ Zhao, Univ Alberta, Sch Lib & Informat Studies,
Edmonton, AB T6G 2J4, Canada
[ ]<-- Enter an X to order article (IDS: 062TI 00015) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Connection and stratification in research collaboration:
An analysis of the COLLNET network (Article, English)
AUTHOR: Yin, LC; Kretschmer, H; Hanneman, RA; Liu, ZY
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1599-1613 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): PRICE DJD rauth; SCIENTOMETR* rwork
KEYWORDS: social network analysis; cooperation network;
topological
structure; scale-free
KEYWORDS+: COMPLEX NETWORKS; SOCIAL NETWORKS; WEB; VISIBILITY;
CENTRALITY; AUTHORS
ABSTRACT: Co-authorship among scientists represents a prototype of
a social network. By mapping the graph containing all relevant
publications of members in an international collaboration network:
COLLNET, we infer the structural mechanisms that govern the topology of
this social system. The structure of the network affects the information
available to individuals, and their opportunities to collaborate. The
structure of the network also affects the overall flow of information,
and the nature of the scientific community. We present a number of
measures of both the macro- (whole-network) and micro(actor-centered)
structure of collaboration, and apply these to COLLNET. We find that
this
scientific community displays many aspects of a "small-world," and is
somewhat vulnerable to disruption should major figures become inactive.
We also find inequality in the roles played by individuals in the
network. The inequalities, however, do not create a closed and isolated
"core" or elite. (c) 2006 Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: LC Yin, Dalian Univ Technol, WISE Lab, Dalian 116023,
Peoples R China
[ ]<-- Enter an X to order article (IDS: 062TI 00017) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: Towards mapping library and information science
(Article,
English)
AUTHOR: Janssens, F; Leta, J; Glanzel, W; De Moor, B
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1614-1642 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): MARSHAKOVA IV rauth; J INF SCI rwork;
SCIENTOMETR* rwork
KEYWORDS: full-text analysis; text-based clustering; mapping of
science; library and information science
KEYWORDS+: CO-WORD ANALYSIS; NEURAL-NETWORK RESEARCH; COMBINING
FULL-
TEXT; COCITATION ANALYSIS; SCIENTOMETRICS; RETRIEVAL;
VALIDATION; INDICATORS; ALGORITHM; FIELD
ABSTRACT: In an earlier study by the authors, full-text analysis
and traditional bibliometric methods were combined to map research
papers
published in the journal Scientometrics. The main objective was to
develop appropriate techniques of full-text analysis and to improve the
efficiency of the individual methods in the mapping of science. The
number of papers was, however, rather limited. In the present study, we
extend the quantitative linguistic part of the previous studies to a set
of five journals representing the field of Library and Information
Science (LIS). Almost 1000 articles and notes published in the period
2002-2004 have been selected for this exercise. The optimum solution for
clustering LIS is found for six clusters. The combination of different
mapping techniques, applied to the full text of scientific publications,
results in a characteristic tripod pattern. Besides two clusters in
bibliometrics, one cluster in information retrieval and one containing
general issues, webometrics and patent studies are identified as small
but emerging clusters within LIS. The study is concluded with the
analysis of cluster representations by the selected journals. (c) 2006
Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: F Janssens, Katholieke Univ Leuven, ESAT, SCD, Kasteelpk
Arenberg 10, B-3001 Louvain, Belgium
[ ]<-- Enter an X to order article (IDS: 062TI 00018) ISSN: 0306-4573
------------------------------------------------------------------------
--
TITLE: A note on growth of superconductivity patents with two
new indicators (Article, English)
AUTHOR: Sen, SK; Sharma, HP
SOURCE: INFORMATION PROCESSING & MANAGEMENT 42 (6). DEC 2006.
p.1643-1651 PERGAMON-ELSEVIER SCIENCE LTD, OXFORD
SEARCH TERM(S): SCIENTOMETR* rwork
KEYWORDS: superconductivity patents; scientoinformetrics; patent-
paper ratio; trend indicator; knowledge-technology
orientation level
KEYWORDS+: SYSTEM
ABSTRACT: Patent statistics may be indicative of growth and
evolution of technologies in relation to a scientific field or sub-field
of research. In this paper annual growth of patents in the field of
superconductivity has been studied. The data in five-yearly cumulation
have been set against research papers on superconductivity to show
comparative growth of basic research and technological development. Two
simple indicators have been devised to understand the apparent influence
of basic research on technology and trends of relative growth. (c) 2006
Elsevier Ltd. All rights reserved.
AUTHOR ADDRESS: HP Sharma, Bengal Engn & Sci Univ, Howrah 711103, W
Bengal,
India
[ ]<-- Enter an X to order article (IDS: 062TI 00019) ISSN: 0306-4573
------------------------------------------------------------------------
--
More information about the SIGMETRICS
mailing list