Kandylas, V; Upham, SP; Ungar, LH Finding cohesive clusters for analyzing knowledge communities ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING 203-212, 2007

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Fri Mar 28 12:06:23 EDT 2008


Email address: kandylas at seas.upenn.edu

Author(s): Kandylas, V (Kandylas, Vasileios); Upham, SP (Upham, S. 
Phineas); Ungar, LH (Ungar, Lyle H.)
 
Title: Finding cohesive clusters for analyzing knowledge communities 

Editor(s): Ramakrishnan, N; Zaiane, OR; Shi, Y; Clifton, CW; Wu, XD 

Source: ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL 
CONFERENCE ON DATA MINING 203-212, 2007 

Book Series: IEEE International Conference on Data Mining 

Language: English 

Document Type: Article 

Conference Title: 7th IEEE International Conference on Data Mining 

Conference Date: OCT 28-31, 2007 

Conference Location: Omaha, NE 

Conference Sponsors: IEEE, Microsoft adCenter Labs, Univ Nebraska Med Ctr, 
Univ Nebraska Omaha, Thomson, Web Splashes, In The Details Events, IBM, 
IEEE Comp Soc, Henry Doorly Zoo, Mutual Omaha, CAS Res Ctr Fictitious Econ 
& Data Sci, First Natl Bank Omaha, Peter Kiewit Inst 

KeyWords Plus: SCIENCE 

Abstract: Documents and authors can be clustered into "knowledge 
communities" based on the overlap in the papers they cite. We introduce a 
new clustering algorithm, Streemer, which finds cohesive foreground 
clusters embedded in a diffuse background, and use it to identify 
knowledge communities as foreground clusters of papers which share common 
citations. To analyze the evolution of these communities over time, we 
build predictive models with features based on the citation structure, the 
vocabulary of the papers, and the affiliations and prestige of the 
authors. Findings include that scientific knowledge communities tend to 
grow more rapidly if their publications build on diverse information and 
if they use a narrow vocabulary. 
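
The idea of "overlap in the papers they cite" can be illustrated with a small sketch. This is not the Streemer algorithm, whose details are not given in this record; it only shows one common way to score citation overlap between two papers, the Jaccard index over their reference sets. The function name, the paper labels and the reference labels are all hypothetical.

```python
def citation_overlap(refs_a, refs_b):
    """Jaccard index of two reference lists: shared refs / all distinct refs.

    Assumption for illustration only: the record above does not say which
    overlap measure the authors use.
    """
    a, b = set(refs_a), set(refs_b)
    union = a | b
    # Two papers with no references at all are treated as having no overlap.
    return len(a & b) / len(union) if union else 0.0


# Hypothetical toy data: each paper maps to the set of references it cites.
papers = {
    "P1": {"R1", "R2", "R3"},
    "P2": {"R2", "R3", "R4"},
    "P3": {"R9"},
}

# Score every unordered pair of papers.
names = sorted(papers)
pair_scores = {
    (x, y): citation_overlap(papers[x], papers[y])
    for i, x in enumerate(names)
    for y in names[i + 1:]
}
```

Pairs with a high score, such as P1 and P2 here, are candidates for the same community; a paper like P3 that shares no references with the others would sit in the diffuse background.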

Addresses: Univ Penn, CIS Dept, Philadelphia, PA 19104 USA. 

Reprint Address: Kandylas, V, Univ Penn, CIS Dept, Philadelphia, PA 19104 
USA. 

Cited Reference Count: 26 

Publisher Name: IEEE COMPUTER SOC 

Publisher Address: 10662 LOS VAQUEROS CIRCLE, PO BOX 3014, LOS ALAMITOS, 
CA 90720-1264 USA 

ISSN: 1550-4786 

ISBN: 978-0-7695-3018-5 

BLEI D
23 ICML 2006 113 

CRANE D
INVISIBLE COLL DIFFU : 1972

DHILLON I
ICDM : 517 2003 

DHILLON IS
KNOWLEDGE DISCOVERY : 269 2001 

ESTER M
KDD : 226 1996 

FERN XZ
ICML : 186 2003 

FLAKE GW
KDD C : 150 2000 

GIBSON D
INFERRING WEB COMMUN : 1998 

GRIFFITH BC
STRUCTURE OF SCIENTIFIC LITERATURES .2. TOWARD A MACROSTRUCTURE AND 
MICROSTRUCTURE FOR SCIENCE 
SCI STUD 4 : 339 1974 

GUHA S
Clustering data streams: Theory and practice 
IEEE T KNOWL DATA EN 15 : 515 2003 

HOPCROFT J
KDD : 541 2003 

HUANG Q
INT C IM PROC 1 : 246 1995 

KEARNS M
UNCERTAINTY ARTIFICI : 282 1997 

MCGANN AJ
The advantages of ideological cohesion - A model of constituency 
representation and electoral competition in multi-party democracies
J THEOR POLIT 14 : 37 2002 

MCGOVERN A
SIGKDD EXPLOR NEWSL 5 : 165 2003 

PANTEL P
SIGIR : 199 2002 

POPESCUL A
ADV DIGITAL LIB ADL : 173 2000 

SAVAKIS A
P INT C IM PROC ICIP : 1998 

SMALL H
Paradigms, citations, and maps of science: A personal history 
J AM SOC INF SCI TEC 54 : 394 2003 

SMALL HG
SPECIALTIES AND DISCIPLINES IN SCIENCE AND SOCIAL-SCIENCE - EXAMINATION OF 
THEIR STRUCTURE USING CITATION INDEXES
SCIENTOMETRICS 1 : 445 1979 

STEINBACH M
KDD WORKSH TEXT MIN 34 : 35 2000 

STREHL A
J MACHINE LEARNING R 3 : 583 2002 

SULLIVAN D
CO-CITATION ANALYSES OF SCIENCE - EVALUATION 
SOC STUD SCI 7 : 223 1977 

UPHAM SP
THESIS U PENNSYLVANI : 2006 

WANG X
KDD 2006 : 424 2006 

ZHANG T
P 1996 ACM SIGMOD IN : 103 1996 


