Papers from PNAS of the USA 101 (Suppl) April 6, 2004

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Mon Aug 2 17:23:14 EDT 2004


FULL TEXT AVAILABLE AT : http://www.pnas.org/cgi/content/full/101/suppl_1/5186

Monika Henzinger:   E-mail Address: monika at google.com

Author(s): Henzinger, M; Lawrence, S

Title: Extracting knowledge from the World Wide Web

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5186-5191 Suppl. 1 APR 6 2004
Language: English
Document Type: Article

Abstract: The World Wide Web provides an unprecedented opportunity to
automatically analyze a large sample of interests and activity in the world.
We discuss methods for extracting knowledge from the web by randomly
sampling and analyzing hosts and pages, and by analyzing the link structure
of the web and how links accumulate over time. A variety of interesting and
valuable information can be extracted, such as the distribution of web pages
over domains, the distribution of interest in different areas, communities
related to different topics, the nature of competition in different
categories of sites, and the degree of communication between different
communities or countries.

Addresses: Google Inc, Mountain View, CA 94043 USA

Reprint Address: Henzinger, M, Google Inc, 2400 Bayshore Pkwy, Mountain
View, CA 94043 USA.

Cited References:
ALBERT R, 1999, NATURE, V401, P130.
ALBERT R, 2000, PHYS REV LETT, V85, P5234.
BARABASI AL, 1999, PHYSICA A, V272, P173.
BARABASI AL, 1999, SCIENCE, V286, P509.
BARYOSSEF Z, 2000, P 26 INT C VER LARG, P535.
BHARAT K, 1998, P 21 ANN INT ACM SIG, P104.
BHARAT K, 2001, P 2001 IEEE INT C DA, P51.
BRIN S, 1998, P 7 INT WORLD WID WE, P107.
BRODER A, 1997, 6 INT WORLD WID WEB, P391.
BRODER A, 2000, COMPUT NETW, V33, P309.
CHAKRABARTI S, 1999, P 8 INT WORLD WID WE, P545.
CHAKRABARTI S, 2002, P 11 INT WORLD WID W, P517.
CHUNG F, 1996, SPECTRAL GRAPH THEOR.
COOPER C, 2002, RANDOM STRUCT ALGORH, V22, P311.
DOROGOVTSEV SN, 2000, PHYS REV LETT, V85, P4633.
EIRON N, 2003, P HYP 2003, P85.
FLAKE G, 2002, GRAPH CLUSTERING TEC.
FLAKE GW, 2000, P 6 INT C KNOWL DISC, P150.
FLAKE GW, 2002, COMPUTER, V35, P66.
GAREY MR, 1979, COMPUTERS INTRACTABI.
GARFIELD E, 1979, CITATION INDEXING IT.
GIBSON D, 1998, P ACM C HYP HYP, P225.
HENZINGER MR, 2000, COMPUT NETW, V33, P295.
HUBERMAN BA, 1998, SCIENCE, V280, P95.
JEONG H, 2000, NATURE, V407, P651.
KLEINBERG J, 1998, P ACM SIAM S DISCR A, P668.
KLEINBERG JM, 1999, P 5 INT C COMP COMB, P1.
KUMAR R, 1999, COMPUT NETW, V31, P1481.
KUMAR R, 2000, P 41 ANN S FDN COMP, P57.
LARSON RR, 1996, P 59 ANN M AM SOC IN, P71.
LAWRENCE S, 1999, NATURE, V400, P107.
LEVENE M, 2002, COMPUT NETWORKS, V29, P277.
MACSKASSY S, 1998, P 4 INT C KNOWL DISC, P264.
PARK ST, 2003, IEEE INFOCOM 2003 SA.
PENNOCK DM, 2002, P NATL ACAD SCI USA, V99, P5207.
PIROLLI P, 1996, P ACM C HUM FACT COM, P118.
REDDY PK, 2002, WORKSH WEB AN APR 13, P11.
RUSMEVICHIENTON.P, 2001, AM ASS ART INT FALL, P121.
SIMON HA, 1955, BIOMETRIKA, V42, P425.
WATTS DJ, 1998, NATURE, V393, P440.
WHITE HD, 1989, ANNU REV INFORM SCI, V24, P119.
Times Cited: 1
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 6
ISI Document Delivery No.: 812EO

_______________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5192
E-mail Address: kboyack at sandia.gov

Author(s): Boyack, KW

Title: Mapping knowledge domains: Characterizing PNAS

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5192-5199 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: A review of data mining and analysis techniques that can be used
for the mapping of knowledge domains is given. Literature mapping techniques
can be based on authors, documents, journals, words, and/or indicators. Most
mapping questions are related to research assessment or to the structure and
dynamics of disciplines or networks. Several mapping techniques are
demonstrated on a data set comprising 20 years of papers published in PNAS.
Data from a variety of sources are merged to provide unique indicators of
the domain bounded by PNAS. By using funding source information and citation
counts, it is shown that, on an aggregate basis, papers funded jointly by
the U.S. Public Health Service (which includes the National Institutes of
Health) and non-U.S. government sources outperform papers funded by other
sources, including by the U.S. Public Health Service alone. Grant data from
the National Institute on Aging show that, on average, papers from large
grants are cited more than those from small grants, with performance
increasing with grant amount. A map of the highest performing papers over
the 20-year period was generated by using citation analysis. Changes and
trends in the subjects of highest impact within the PNAS domain are
described. Interactions between topics over the most recent 5-year period
are also detailed.
Addresses: Sandia Natl Labs, Computat Comp Informat & Math Ctr, Albuquerque,
NM 87185 USA

Reprint Address: Boyack, KW, Sandia Natl Labs, Computat Comp Informat & Math
Ctr, POB 5800, Albuquerque, NM 87185 USA.

Cited References:
*NAT SCI BOARD, 2002, SCI ENG IND 2002.
BASSECOULARD E, 1999, SCIENTOMETRICS, V44, P323.
BATAGELJ V, 1998, CONNECTIONS, V21, P47.
BORNER K, 2003, ANNU REV INFORM SCI, V37, P179.
BOYACK KW, 2002, J AM SOC INF SCI TEC, V53, P764.
BOYACK KW, 2003, J AM SOC INF SCI TEC, V54, P447.
BUTLER L, 2001, RES EVALUAT, V10, P59.
CALLON M, 1983, SOC SCI INFORM, V22, P191.
CARD S, 1999, READINGS INFORMATION.
CARPENTER MP, 1973, J AM SOC INFORM SCI, V24, P425.
CHEN C, 2003, J AM SOC INF SCI TEC, V54, P453.
CHEN C, 2003, MAPPING SCI FRONTIER.
DAVIDSON GS, 1998, J INTELL INF SYST, V11, P259.
DAVIDSON GS, 2001, 7 IEEE S INF VIS INF, P23.
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391.
EROSHEVA E, 2004, P NATL ACAD SCI U S1, V101, P5220.
FRAME JD, 1976, FED PROC, V35, P2529.
GARFIELD E, 1955, SCIENCE, V122, P108.
GARFIELD E, 1970, NATURE, V227, P669.
GODIN B, 2003, RES POLICY, V32, P679.
GRIFFITHS TL, 2004, P NATL ACAD SCI U S1, V101, P5228.
HOOD WW, 2001, J AM SOC INF SCI TEC, V52, P1242.
INGWERSEN P, 1997, J AM SOC INFORM SCI, V48, P205.
IRVINE J, 1984, FORESIGHT SCI PICKIN.
KESSLER MM, 1963, AM DOC, V14, P10.
KIM SK, 2001, SCIENCE, V293, P2087.
KING J, 1987, J INFORM SCI, V13, P261.
LANDAUER TK, 2004, P NATL ACAD SCI U S1, V101, P5214.
LEWISON G, 1995, 5 INT C INT SOC SCI, P255.
LEWISON G, 1998, GUT, V43, P288.
LEWISON G, 1998, SCIENTOMETRICS, V41, P17.
LEYDESDORFF L, 1997, J AM SOC INFORM SCI, V48, P418.
LIN X, 1997, J AM SOC INFORM SCI, V48, P40.
MARTIN BR, 1983, RES POLICY, V12, P61.
MCALLISTER PR, 1983, J AM SOC INFORM SCI, V34, P123.
MORRIS SA, 2003, J AM SOC INF SCI TEC, V54, P413.
NARIN F, 1996, SCIENTOMETRICS, V36, P293.
NEWMAN MEJ, 2001, P NATL ACAD SCI USA, V98, P404.
NOYONS ECM, 1999, J AM SOC INFORM SCI, V50, P115.
PRICE DJD, 1963, LITTLE SCI BIG SCI.
PRICE DJD, 1965, SCIENCE, V149, P510.
RINIA EJ, 2002, SCIENTOMETRICS, V54, P347.
SALTON G, 1975, COMMUN ACM, V18, P613.
SCHEFFE H, 1953, BIOMETRIKA, V40, P87.
SEGLEN PO, 1997, ALLERGY, V52, P1050.
SEGLEN PO, 1997, BRIT MED J, V314, P498.
SMALL H, 1997, SCIENTOMETRICS, V38, P275.
URATA H, 1990, SCIENTOMETRICS, V18, P309.
WHITE HD, 1998, J AM SOC INFORM SCI, V49, P327.
WISE JA, 1999, J AM SOC INFORM SCI, V50, P1224.
Times Cited: 1
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 8
ISI Document Delivery No.: 812EO

___________________________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5200

E-mail Address: mejn at umich.edu

Author(s): Newman, MEJ

Title: Coauthorship networks and patterns of scientific collaboration

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5200-5205 Suppl. 1 APR 6 2004
Language: English
Document Type: Article

Abstract: By using data from three bibliographic databases in biology,
physics, and mathematics, respectively, networks are constructed in which
the nodes are scientists, and two scientists are connected if they have
coauthored a paper. We use these networks to answer a broad variety of
questions about collaboration patterns, such as the numbers of papers
authors write, how many people they write them with, what the typical
distance between scientists is through the network, and how patterns of
collaboration vary between subjects and over time. We also summarize a
number of recent results by other authors on coauthorship patterns.
Addresses: Univ Michigan, Ctr Study Complex Syst, Ann Arbor, MI 48109 USA;
Univ Michigan, Dept Phys, Ann Arbor, MI 48109 USA
Reprint Address: Newman, MEJ, Univ Michigan, Ctr Study Complex Syst, Ann
Arbor, MI 48109 USA.

Cited References:
BARABASI AL, 1999, SCIENCE, V286, P509.
BARABASI AL, 2002, PHYSICA A, V311, P590.
BATAGELJ V, 2000, SOC NETWORKS, V22, P173.
BORDENS M, 2000, WEB KNOWLEDGE FESTSC.
CORMEN TH, 2001, INTRO ALGORITHMS.
CRANE D, 1972, INVISIBLE COLL DIFFU.
DECASTRO R, 1999, MATH INTELL, V21, P51.
DING Y, 1999, INT INF LIBR REV, V30, P367.
EGGHE L, 1990, INTRO INFORMETRICS.
FENNER T, 2002, CONDMAT0209463.
FREEMAN LC, 1977, SOCIOMETRY, V40, P35.
GIRVAN M, 2002, P NATL ACAD SCI USA, V99, P7821.
GOH KI, 2002, P NATL ACAD SCI USA, V99, P12583.
GOH KI, 2003, PHYS REV E, V67.
GROSSMAN JW, 1995, C NUMERANTIUM, V108, P129.
GROSSMAN JW, 2002, C NUMERANTIUM, V158, P202.
HOLME P, 2002, PHYS REV E 2, V65.
KAUTZ H, 1997, COMMUN ACM, V40, P63.
KRETSCHMER H, 1994, SCIENTOMETRICS, V30, P363.
LOTKA AJ, 1926, J WASHINGTON ACADEMY, V16, P317.
MELIN G, 1996, SCIENTOMETRICS, V36, P363.
MILGRAM S, 1967, PSYCHOL TODAY, V2, P60.
NEWMAN MEJ, 2001, P NATL ACAD SCI USA, V98, P404.
NEWMAN MEJ, 2001, PHYS REV E STAT PHYS, V64.
NEWMAN MEJ, 2001, PHYS REV E 2, V64.
NEWMAN MEJ, 2001, PHYS REV E 2, V64.
NEWMAN MEJ, 2002, PHYS REV LETT, V89.
PAO ML, 1986, J AM SOC INFORM SCI, V37, P26.
PERSSON O, 1995, SCIENTOMETRICS, V33, P351.
POOL I, 1978, SOC NETWORKS, V1, P1.
PRICE DJD, 1965, SCIENCE, V149, P510.
PRICE DJD, 1976, J AM SOC INFORM SCI, V27, P292.
SHOCKLEY W, 1957, P IRE, V45, P279.
TRAVERS J, 1969, SOCIOMETRY, V32, P425.
VOOS H, 1974, J AM SOC INFORM SCI, V25, P270.
WATTS DJ, 1998, NATURE, V393, P440.

Times Cited: 1

Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424

Source Item Page Count: 6

ISI Document Delivery No.: 812EO

_______________________________________________


FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5249

E-mail Address: selman at cs.cornell.edu

Author(s): Hopcroft, J; Khan, O; Kulis, B; Selman, B

Title: Tracking evolving communities in large linked networks

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5249-5253 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: We are interested in tracking changes in large-scale data by
periodically creating an agglomerative clustering and examining the
evolution of clusters (communities) over time. We examine a large real-world
data set: the NEC CiteSeer database, a linked network of >250,000 papers.
Tracking changes over time requires a clustering algorithm that produces
clusters stable under small perturbations of the input data. However, small
perturbations of the CiteSeer data lead to significant changes to most of
the clusters. One reason for this is that the order in which papers within
communities are combined is somewhat arbitrary. However, certain subsets of
papers, called natural communities, correspond to real structure in the
CiteSeer database and thus appear in any clustering. By identifying the
subset of clusters that remain stable under multiple clustering runs, we get
the set of natural communities that we can track over time. We demonstrate
that such natural communities allow us to identify emerging communities and
track temporal changes in the underlying structure of our network data.
Addresses: Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA; Google Inc,
Mountain View, CA 94043 USA; Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
Reprint Address: Selman, B, Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA.

Cited References:
ADLER RJ, 1998, PRACTICAL GUIDE HEAV.
AGGARWAL CC, 2001, LECT NOTES COMPUTER, P420.
BARABASI AL, 2002, LINKED NEW SCI NETWO.
BORNER K, 2004, P NATL ACAD SCI U S1, V101, P5266.
BRIN S, 1998, COMPUT NETWORKS ISDN, V30, P107.
COHEN W, 2000, P ASS COMP MACH SPEC, V6, P255.
DUDA RO, 1973, PATTERN CLASSIFICATI.
ERDOS P, 1960, PUBL MATH I HUNG, V5, P17.
FLAKE GW, 2000, P ASS COMP MACH SPEC, V6, P255.
GIBSON D, 1998, P HYP 1998 C, V9, P225.
GILES CL, 1998, P INT C DIG LIB, V3, P89.
HOPCROFT J, 2003, P ASS COMP MACH SPEC, V9, P541.
JAIN AK, 1998, ALGORITHMS CLUSTERIN.
KESSLER MM, 1963, AM DOC, V14, P10.
KLEINBERG JM, 1999, J ACM, V46, P604.
NG AY, 2001, P ASS COMP MACH SPEC, V24, P258.
NG AY, 2001, P INT JOINT C ART IN, V17, P903.
PASULA H, 2003, ADV NEURAL INFORMATI, V15, P1401.
POPESCUL A, 2000, ADV DIGITAL LIB ADL, P173.
SALTON G, 1989, AUTOMATIC TEXT PROCE.
SMALL H, 1973, J AM SOC INFORM SCI, V24, P265.
SMALL H, 1974, SCI STUD, V4, P17.
WATTS D, 2003, 6 DEGREES SCI CONNEC.
WATTS DJ, 1998, NATURE, V393, P440.
Times Cited: 1
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 5
ISI Document Delivery No.: 812EO



_______________________________________



FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5261

E-mail Address: fil at indiana.edu
Author(s): Menczer, F

Title: Evolution of document networks

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5261-5265 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: How does a network of documents grow without centralized control?
This question is becoming crucial as we try to explain the emergent
scale-free topology of the World Wide Web and use link analysis to identify
important information resources. Existing models of growing information
networks have focused on the structure of links but neglected the content of
nodes. Here I show that the current models fail to reproduce a critical
characteristic of information networks, namely the distribution of textual
similarity among linked documents. I propose a more realistic model that
generates links by using both popularity and content. This model yields
remarkably accurate predictions of both degree and similarity distributions
in networks of web pages and scientific literature.
Addresses: Indiana Univ, Sch Informat, Bloomington, IN 47408 USA
Reprint Address: Menczer, F, Indiana Univ, Sch Informat, Bloomington, IN
47408 USA.

Cited References:
ADAMIC LA, 2000, SCIENCE, V287, P2115.
ALBERT R, 1999, NATURE, V401, P130.
ALDOUS D, 2003, ARXIVCONDMAT0304701.
BARABASI AL, 1999, SCIENCE, V286, P509.
BELEW R, 2000, FINDING OUT COGNITIV.
BORNER K, 2003, ANNU REV INFORM SCI, V37, P179.
BORNER K, 2004, P NATL ACAD SCI U S1, V101, P5266.
BRIN S, 1998, COMPUT NETWORKS ISDN, V30, P107.
BRODER A, 2000, COMPUT NETW, V33, P309.
COOPER C, 2001, LECT NOTES COMPUTER, V2161, P500.
DOROGOVTSEV S, 2003, EVOLUTION NETWORKS B.
DOROGOVTSEV SN, 2000, PHYS REV LETT, V85, P4633.
FABRIKANT A, 2002, LECT NOTES COMPUT SC, V2380, P110.
GANESAN P, 2003, ACM T INFORM SYST, V21, P64.
GIRVAN M, 2002, P NATL ACAD SCI USA, V99, P7821.
HENZINGER M, 2004, P NATL ACAD SCI U S1, V101, P5186.
HOPCROFT J, 2004, P NATL ACAD SCI U S1, V101, P5249.
HUBERMAN BA, 1999, NATURE, V401, P131.
KLEINBERG J, 1999, LECT NOTES COMPUTER, V1627, P1.
KLEINBERG J, 2001, SCIENCE, V294, P1849.
KLEINBERG JM, 1999, J ACM, V46, P604.
KUMAR S, 2000, P 41 ANN IEEE S FDN, P57.
LANDAUER TK, 2004, P NATL ACAD SCI U S1, V101, P5214.
MENCZER F, 2002, P NATL ACAD SCI USA, V99, P14014.
MENDELZON A, 2000, IEEE DATA ENG B, V23, P9.
NEWMAN MEJ, 2004, P NATL ACAD SCI U S1, V101, P5200.
PENNOCK DM, 2002, P NATL ACAD SCI USA, V99, P5207.
PRICE DJD, 1965, SCIENCE, V149, P510.
SALTON G, 1983, INTRO MODERN INFORMA.
VAZQUEZ A, 2003, PHYS REV E 2, V67.

Times Cited: 2
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 5
ISI Document Delivery No.: 812EO



__________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5266

E-mail Address: katy at indiana.edu

Author(s): Borner, K; Maru, JT; Goldstone, RL

Title: The simultaneous evolution of author and paper networks

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5266-5273 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: There has been a long history of research into the structure and
evolution of mankind's scientific endeavor. However, recent progress in
applying the tools of science to understand science itself has been
unprecedented because only recently has there been access to high-volume and
high-quality data sets of scientific output (e.g., publications, patents,
grants) and computers and algorithms capable of handling this enormous
stream of data. This article reviews major work on models that aim to
capture and recreate the structure and dynamics of scientific evolution. We
then introduce a general process model that simultaneously grows coauthor
and paper citation networks. The statistical and dynamic properties of the
networks generated by this model are validated against a 20-year data set of
articles published in PNAS. Systematic deviations from a power law
distribution of citations to papers are well fit by a model that
incorporates a partitioning of authors and papers into topics, a bias for
authors to cite recent papers, and a tendency for authors to cite papers
cited by papers that they have read. In this TARL model (for topics, aging,
and recursive linking), the number of topics is linearly related to the
clustering coefficient of the simulated paper citation network.

Addresses: Indiana Univ, Sch Lib & Informat Sci, Bloomington, IN 47405 USA;
Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA; Indiana Univ, Dept
Psychol, Bloomington, IN 47405 USA

Reprint Address: Borner, K, Indiana Univ, Sch Lib & Informat Sci,
Bloomington, IN 47405 USA.


Cited References:
ADAMS J, 1996, P NATL ACAD SCI USA, V93, P12664.
ALBERT R, 2000, NATURE, V406, P378.
ALBERT R, 2002, REV MOD PHYS, V74, P47.
AMARAL LAN, 2000, P NATL ACAD SCI USA, V97, P11149.
BANKS DL, 1996, J MATH SOCIOL, V21, P173.
BARABASI AL, 1999, SCIENCE, V286, P509.
BARABASI AL, 2000, PHYSICA A, V281, P69.
BARABASI AL, 2002, PHYSICA A, V311, P590.
BATAGELJ V, 1998, CONNECTIONS, V21, P47.
BORNER K, 2003, ANNU REV INFORM SCI, V37, P179.
DOROGOVTSEV SN, 2002, ADV PHYS, V51, P1079.
ERDOS P, 1960, HUNGARIAN ACAD SCI, V5, P17.
GARFIELD E, 1964, USE CITATION DATA WR.
GARFIELD E, 1989, INNOVATION CROSSROAD.
GILBERT N, 2003, SOCIOLOGICAL RES ONL, V2.
KLEINBERG J, 1999, LECT NOTES COMPUTER, V1627, P26.
MENCZER F, 2004, P NATL ACAD SCI U S1, V101, P5261.
MORRIS SA, 2003, J AM SOC INF SCI TEC, V54, P413.
NEWMAN MEJ, 2001, P NATL ACAD SCI USA, V98, P404.
NEWMAN MEJ, 2001, PHYS REV E 2, V64.
NEWMAN MEJ, 2001, PHYS REV E 2, V64.
PRICE DJD, 1976, J AM SOC INFORM SCI, V27, P292.
REDNER S, 1998, EUR PHYS J B, V4, P131.
SNIJDERS TAB, 2001, SOCIOL METHODOL, P361.
VANRAAN AFJ, 1997, SCIENTOMETRICS, V38, P205.
VANRAAN AFJ, 2000, SCIENTOMETRICS, V47, P347.
WATTS DJ, 1998, NATURE, V393, P440.
WATTS DJ, 1999, SMALL WORLDS DYNAMIC.
WHITE HD, 1989, ANNU REV INFORM SCI, V24, P119.
WILLINGER W, 2002, P NATL ACAD SCI U S1, V99, P2573.

Times Cited: 2

Publisher: NATL ACAD SCIENCES

Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 8
ISI Document Delivery No.: 812EO


________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5291

E-mail Address: samorri at okstate.edu

Author(s): Morris, SA; Yen, GG

Title: Crossmaps: Visualization of overlapping relationships in collections
of journal papers

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5291-5296 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: A crossmapping technique is introduced for visualizing multiple
and overlapping relations among entity types in collections of journal
articles. Groups of entities from two entity types are crossplotted to show
correspondence of relations. For example, author collaboration groups are
plotted on the x axis against groups of papers (research fronts) on the y
axis. At the intersection of each pair of author group/research front pairs
a circular symbol is plotted whose size is proportional to the number of
times that authors in the group appear as authors in papers in the research
front. Entity groups are found by agglomerative hierarchical clustering
using conventional similarity measures. Crossmaps comprise a simple
technique that is particularly suited to showing overlap in relations among
entity groups. Particularly useful crossmaps are: research fronts against
base reference clusters, research fronts against author collaboration
groups, and research fronts against term co-occurrence clusters. When
exploring the knowledge domain of a collection of journal papers, it is
useful to have several crossmaps of different entity pairs, complemented by
research front timelines and base reference cluster timelines.

Addresses: Oklahoma State Univ, Stillwater, OK 74078 USA

Reprint Address: Morris, SA, Oklahoma State Univ, 202 Engn S, Stillwater, OK
74078 USA.

Cited References:
BORNER K, 2002, ANN REV INFORMATION, V37, P179.
BRADFORD SC, 1934, ENGINEERING-LONDON, V137, P85.
CALLON M, 1991, SCIENTOMETRICS, V22, P155.
CHEN PPS, 1976, ACM T DATABASE SYST, V1, P9.
CRANE D, 1972, INVISIBLE COLL DIFFU.
GARFIELD E, 1979, CITATION INDEXING IT.
KESSLER MM, 1963, AM DOC, V14, P10.
KRETSCHMER H, 1997, SCIENTOMETRICS, V40, P579.
KUHN TS, 1970, STRUCTURE SCI REVOLU.
LOTKA AJ, 1926, J WASHINGTON ACADEMY, V16, P317.
MORRIS SA, 2003, J AM SOC INF SCI TEC, V54, P413.
MOTHE J, 2003, J AM SOC INF SCI TEC, V54, P650.
NARANAN S, 1971, J DOC, V27, P83.
PERSSON O, 1994, J AM SOC INFORM SCI, V45, P31.
SALTON G, 1989, AUTOMATIC TEXT PROCE.
SHNEIDERMAN B, 2000, 5 ACM C DIG LIB ASS, P57.
SMALL H, 1973, J AM SOC INFORM SCI, V24, P265.
SMALL H, 1997, SCIENTOMETRICS, V38, P275.
SMALL HG, 1978, SOC STUD SCI, V8, P327.
SUBRAMANYAM K, 1983, J INFORM SCI, V6, P33.
WHITE HD, 1981, J AM SOC INFORM SCI, V32, P163.
WHITE HD, 1989, ANNU REV INFORM SCI, V24, P119.
WHITE HD, 1998, J AM SOC INFORM SCI, V49, P327.
ZIEGLER E, 2002, P IEEE 6 INT C INF V, P361.
ZIPF GK, 1949, HUMAN BEHAV PRINCIPL.

Times Cited: 0

Publisher: NATL ACAD SCIENCES

Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA

ISSN: 0027-8424
Source Item Page Count: 6
ISI Document Delivery No.: 812EO

______________________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5297

E-mail Address: whitehd at drexel.edu

Author(s): White, HD; Lin, X; Buzydlowski, JW; Chen, CM

Title: User-controlled mapping of significant literatures

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5297-5302 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: We apply a version of our web-based literature-mapping system to
PNAS for 1971-2002, as indexed by the National Library of Medicine and the
Institute for Scientific Information. Given a single input term from a user,
a medical subject heading, a cocited author, or a cocited journal, PNASLINK
rapidly displays views in which that term and the other 24 terms that most
frequently co-occur with it in a bibliographic database are interrelated in
ways suggesting fruitful combinations for document retrieval. The
interrelationships are produced by two algorithms, pathfinder networks and
Kohonen-style self-organizing maps. PNASLINK displays are themselves
interactive interfaces that can retrieve documents from digital libraries
(e.g., PNAS Online). This style of visualizing knowledge domains is called
"localized" because it does not attempt to map the indexing of literatures
in full but concentrates on the top terms in an "associative thesaurus"
reflecting user interests. It also permits swift remappings, as the user
recognizes terms worth pursuing. PNASLINK is illustrated with maps drawn
from the literature of population genetics. Some comparative and evaluative
comments are added, one from a domain expert indicating that the face
validity of the system may be tempered by insufficient specificity in the
indexing terms being mapped.

Addresses: Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA

Reprint Address: White, HD, Drexel Univ, Coll Informat Sci & Technol,
Philadelphia, PA 19104 USA.


Cited References:
BORNER K, 2003, ANNU REV INFORM SCI, V37, P179.
BUZYDLOWSKI JW, 2003, LECT NOTES COMPUTER, V2539, P133.
BUZYDLOWSKI JW, 2003, THESIS DREXEL U PHIL.
CHEN C, 2003, MAPPING SCI FRONTIER.
CHEN CM, 1998, J VISUAL LANG COMPUT, V9, P267.
CHEN CM, 1999, INFORM PROCESS MANAG, V35, P401.
CHEN HC, 1998, J AM SOC INFORM SCI, V49, P582.
CHEN HC, 2003, J AM SOC INF SCI TEC, V54, P683.
DING Y, 2000, INFORM PROCESS MANAG, V37, P817.
DODGE M, 2001, ATLAS CYBERSPACE.
FOWLER RH, 1990, PATHFINDER ASS NETWO, P165.
FOWLER RH, 1991, P 14 ANN INT ACM SIG, P142.
FOWLER RH, 1992, NAG9551921 U TEX PAN.
HEARST MA, 1999, MODERN INFORMATION R, P257.
IBEKWESANJUAN F, 2002, KNOWL ORGAN, V29, P181.
JANSEN BJ, 2000, INFORM PROCESS MANAG, V36, P207.
KAMADA T, 1989, INFORM PROCESS LETT, V31, P7.
KOHONEN T, 1997, SELFORGANIZING MAPS.
LIN X, 1991, P 14 ANN INT ACM SIG, P262.
LIN X, 2003, INFORM PROCESS MANAG, V39, P689.
MCGREEVY MW, 1995, RELATIONAL METRIC IT.
ROUSSINOV D, 1998, COMMUN COGNITION, V15, P81.
SCHATZ B, 1996, P 1 ACM INT C DIG LI, P126.
SCHVANEVELDT RW, 1990, PATHFINDER ASS NETWO.
TVERSKY A, 1982, PSYCHOL REV, V89, P123.
WHITE HD, 1990, SCHOLARLY COMMUNICAT, P84.
WHITE HD, 1997, ANNU REV INFORM SCI, V32, P99.
Times Cited: 0
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 6
ISI Document Delivery No.: 812EO

_______________________________________

FULL TEXT AVAILABLE AT :
http://www.pnas.org/cgi/content/full/101/suppl_1/5303

E-mail Address: chaomei.chen at cis.drexel.edu

Author(s): Chen, CM

Title: Searching for intellectual turning points: Progressive knowledge
domain visualization

Source: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES
OF AMERICA, 101: 5303-5310 Suppl. 1 APR 6 2004

Language: English
Document Type: Article

Abstract: This article introduces a previously undescribed method for
progressively visualizing the evolution of a knowledge domain's cocitation
network. The method first derives a sequence of cocitation networks from a
series of equal-length time interval slices. These time-registered networks
are merged and visualized in a panoramic view in such a way that
intellectually significant articles can be identified based on their
visually salient features. The method is applied to a cocitation study of
the superstring field in theoretical physics. The study focuses on the
search for articles that triggered two superstring revolutions. Visually
salient nodes in the panoramic view are identified, and the nature of their
intellectual contributions is validated by leading scientists in the field.
The analysis has demonstrated that a search for intellectual turning points
can be narrowed down to visually salient nodes in the visualized network.
The method provides a promising way to simplify otherwise cognitively
demanding tasks to a search for landmarks, pivots, and hubs.
Addresses: Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA

Reprint Address: Chen, CM, Drexel Univ, Coll Informat Sci & Technol, 3141
Chestnut St, Philadelphia, PA 19104 USA.


Cited References:
AHLGREN P, 2003, J AM SOC INF SCI TEC, V54, P550.
ALBERT R, 2002, REV MOD PHYS, V74, P47.
BARABASI AL, 2002, PHYSICA A, V311, P590.
BATAGELJ V, 1998, CONNECTIONS, V21, P47.
BOOKSTEIN FL, 1989, IEEE T PATTERN ANAL, V11, P567.
BRANDES U, 2003, INF VISUAL, V2, P40.
CARROLL JD, 1970, PSYCHOMETRIKA, V35, P283.
CHEN C, 2003, IEEE S INF VIS INF V, P67.
CHEN C, 2003, MAPPING SCI FRONTIER.
CHEN CM, 1999, INFORM PROCESS MANAG, V35, P401.
CHEN CM, 2001, COMPUTER, V34, P65.
CHEN CM, 2001, IEEE T SYST MAN CY C, V31, P518.
CHEN CM, 2002, J AM SOC INF SCI TEC, V53, P678.
CHEN CM, 2003, J AM SOC INF SCI TEC, V54, P435.
DOROGOVTSEV SN, 2000, PHYS REV LETT, V85, P4633.
ERTEN C, 2003, TR0304 U AR.
GARFIELD E, 1989, INNOVATION CROSSROAD, P51.
GOWER JC, 1975, PSYCHOMETRIKA, V40, P33.
HAVRE S, 2002, IEEE T VIS COMPUT GR, V8, P9.
KAMADA T, 1989, INFORM PROCESS LETT, V31, P7.
KLEINBERG J, 2002, P 8 ACM SIGKDD INT C, P91.
KUHN TS, 1962, STRUCTURE SCI REVOLU.
MISUE K, 1995, J VISUAL LANG COMPUT, V6, P183.
NEWMAN M, 2001, PHYS REV E, V64.
NEWMAN MEJ, 2001, P NATL ACAD SCI USA, V98, P404.
NORTH S, 1995, P S GRAPH DRAW GD 95, P409.
PRICE DJD, 1965, SCIENCE, V149, P510.
SCHVANEVELDT RW, 1990, PATHFINDER ASS NETWO.
SCHWARZ JH, 1996, ARXIVHEPTH9607067.
SMALL H, 1989, COMMUN RES, V16, P642.
SMALL HG, 1977, SOC STUD SCI, V7, P139.
VANRAAN AFJ, 2000, SCIENTOMETRICS, V47, P347.
WHITE HD, 2003, J AM SOC INF SCI TEC, V54, P1250.
WHITE HD, 2003, J AM SOC INF SCI TEC, V54, P423.

Times Cited: 1
Publisher: NATL ACAD SCIENCES
Publisher Address: 2101 CONSTITUTION AVE NW, WASHINGTON, DC 20418 USA
ISSN: 0027-8424
Source Item Page Count: 8
ISI Document Delivery No.: 812EO


