Kandylas, V; Upham, SP; Ungar, LH Finding cohesive clusters for analyzing knowledge communities ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING 203-212, 2007
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Fri Mar 28 12:06:23 EDT 2008
Email address: kandylas at seas.upenn.edu
Author(s): Kandylas, V (Kandylas, Vasileios); Upham, SP (Upham, S.
Phineas); Ungar, LH (Ungar, Lyle H.)
Title: Finding cohesive clusters for analyzing knowledge communities
Editor(s): Ramakrishnan, N; Zaiane, OR; Shi, Y; Clifton, CW; Wu, XD
Source: ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL
CONFERENCE ON DATA MINING 203-212, 2007
Book Series: IEEE International Conference on Data Mining
Language: English
Document Type: Article
Conference Title: 7th IEEE International Conference on Data Mining
Conference Date: OCT 28-31, 2007
Conference Location: Omaha, NE
Conference Sponsors: IEEE, Microsoft adCenter Labs, Univ Nebraska Med Ctr,
Univ Nebraska Omaha, Thomson, Web Splashes, In The Details Events, IBM,
IEEE Comp Soc, Henry Doorly Zoo, Mutual Omaha, CAS Res Ctr Fictitious Econ
& Data Sci, First Natl Bank Omaha, Peter Kiewit Inst
KeyWords Plus: SCIENCE
Abstract: Documents and authors can be clustered into "knowledge
communities" based on the overlap in the papers they cite. We introduce a
new clustering algorithm, Streemer which finds cohesive foreground
clusters embedded in a diffuse background, and use it to identify
knowledge communities as foreground clusters of papers which share common
citations. To analyze the evolution of these communities over time, we
build predictive models with features based on the citation structure, the
vocabulary of the papers, and the affiliations and prestige of the
authors. Findings include that scientific knowledge communities tend to
grow more rapidly if their publications build on diverse information and
if they use a narrow vocabulary.
Addresses: Univ Penn, CIS Dept, Philadelphia, PA 19104 USA.
Reprint Address: Kandylas, V, Univ Penn, CIS Dept, Philadelphia, PA 19104
USA.
Cited Reference Count: 26
Publisher Name: IEEE COMPUTER SOC
Publisher Address: 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS,
CA 90720-1264 USA
ISSN: 1550-4786
ISBN: 978-0-7695-3018-5
BLEI D
23 ICML 2006 113
CRANE D
INVISIBLE COLL DIFFU : 1972
DHILLON I
ICDM : 517 2003
DHILLON IS
KNOWLEDGE DISCOVERY : 269 2001
ESTER M
KDD : 226 1996
FERN XZ
ICML : 186 2003
FLAKE GW
KDD C : 150 2000
GIBSON D
INFERRING WEB COMMUN : 1998
GRIFFITH BC
STRUCTURE OF SCIENTIFIC LITERATURES .2. TOWARD A MACROSTRUCTURE AND
MICROSTRUCTURE FOR SCIENCE
SCI STUD 4 : 339 1974
GUHA S
Clustering data streams: Theory and practice
IEEE T KNOWL DATA EN 15 : 515 2003
HOPCROFT J
KDD : 541 2003
HUANG Q
INT C IM PROC 1 : 246 1995
KEARNS M
UNCERTAINTY ARTIFICI : 282 1997
MCGANN AJ
The advantages of ideological cohesion - A model of constituency
representation and electoral competition in multi-party democracies
J THEOR POLIT 14 : 37 2002
MCGOVERN A
SIGKDD EXPLOR NEWSL 5 : 165 2003
PANTEL P
SIGIR : 199 2002
POPESCUL A
ADV DIGITAL LIB ADL : 173 2000
SAVAKIS A
P INT C IM PROC ICIP : 1998
SMALL H
Paradigms, citations, and maps of science: A personal history
J AM SOC INF SCI TEC 54 : 394 2003
SMALL HG
SPECIALTIES AND DISCIPLINES IN SCIENCE AND SOCIAL-SCIENCE - EXAMINATION OF
THEIR STRUCTURE USING CITATION INDEXES
SCIENTOMETRICS 1 : 445 1979
STEINBACH M
KDD WORKSH TEXT MIN 34 : 35 2000
STREHL A
J MACHINE LEARNING R 3 : 583 2002
SULLIVAN D
CO-CITATION ANALYSES OF SCIENCE - EVALUATION
SOC STUD SCI 7 : 223 1977
UPHAM SP
THESIS U PENNSYLVANI : 2006
WANG X
KDD 2006 : 424 2006
ZHANG T
P 1996 ACM SIGMOD IN : 103 1996
More information about the SIGMETRICS
mailing list