He Y, Hui SC, Fong ACM "Mining a Web citation database for document clustering" APPLIED ARTIFICIAL INTELLIGENCE 16 (4): 283-302 APR 2002
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Thu Jun 20 10:23:49 EDT 2002
TITLE Mining a Web citation database for document clustering
AUTHOR He Y, Hui SC, Fong ACM
JOURNAL APPLIED ARTIFICIAL INTELLIGENCE 16 (4): 283-302 APR 2002
Document type: Article Language: English
Cited References: 37 Times Cited: 0
Abstract:
The World Wide Web has become an important medium for disseminating
scientific publications. Many publications are now made available over the
Web. However, existing search engines are ineffective in searching these
publications, as they do not index Web publications that normally appear in
PDF (Portable Document Format) or PostScript formats. One way to index Web
publications is through citation indices, which contain the references that
the publications cite. Web citation Database is a data warehouse to store
the citation indices. In this paper, we propose a mining process to extract
document cluster knowledge from the Web Citation Database to support the
retrieval of Web publications. The mining techniques used for document
cluster generation are based on Kohonen's Self-Organizing Map (KSOM) and
Fuzzy Adaptive Resonance Theory (Fuzzy ART). The proposed techniques have
been incorporated into a citation-based retrieval system known as PubSearch
for Web scientific publications.
KeyWords Plus:
RETRIEVAL
Addresses:
He Y, Nanyang Technol Univ, Sch Comp Engn, Nanyang Ave, Singapore 639798,
Singapore
Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
Publisher:
TAYLOR & FRANCIS INC, PHILADELPHIA
IDS Number:
547NK
ISSN:
0883-9514
Clear the checkbox to the left of an item if you do not want to search for
articles that cite the item when looking at Related Records.
Cited Author Cited Work Volume Page
Year
*DIG EQ CORP DEC VIRT PAP PROJ 2000
*WORDNET WORDNETA LEX DAT ENG 2000
AGGARWAL C P 5 ACM SIGKDD INT C 352 1999
BOLLACKER KD IEEE INTELL SYST APP 15 42 2000
BOLLACKER KD P 2 INT C AUT AG MIN 116 1998
CALLON M SCIENTOMETRICS 22 153 1991
CARPENTER GA NEURAL NETWORKS 4 759 1991
DEERWESTER S J AM SOC INFORM SCI 41 391 1990
FAYYAD UM ADV KNOWLEDGE DISCOV 1 1996
GARFIELD B CITATION INDEXING IT 1979
HARTER SP J AM SOC INFORM SCI 43 602 1992
HARTIGAN JA CLUSTERING ALGORITHM 1975
HE Y THESIS NANYANG TU SI 2000
HONKELA T CLASSIFICATION DATA 245 1998
JAIN AK ACM COMPUT SURV 31 264 1999
KASKI S COMPUTING SCI STAT 29 281 1998
KASKI S P IJCNN 98 INT JOINT 1 413 1998
KAUFMAN L FINDING GROUPS DATA 1990
KOHONEN T IEEE T NEURAL NETWOR 11 574 2000
KOHONEN T P INT C ART NEUR NET 65 1998
KOHONEN T SELF ORGANIZING MAPS 1995
LIN X J AM SOC INFORM SCI 48 40 1997
MITCHELL TM COMMUN ACM 42 31 1999
PAO ML INFORM PROCESS MANAG 29 95 1993
RAUBER A P 4 ACM C DIG LIB 240 1999
ROCCHIO J THESIS DIFF HARVARD 1966
SALTON G INTRO MODERN INFORMA 1983
SALTON G SCIENCE 253 974 1991
SARACEVIC T INFORMATION SCI INTE 210 1996
SCHATZ B IEEE COMPUT 29 22 1996
SLONIM N P 23 ANN INT ACM SIG 208 2000
TISHBY N P 37 ANN ALL C COMM 368 1999
TURTLE H ACM T INFORM SYST 9 187 1991
VANRIJSBERGEN C INFORMATION RETRIEVA 1979
WHITE HD J AM SOC INFORM SCI 32 163 1981
YANG Y P 22 INT C RES DEV I 42 1999
ZAMIR O P 19 INT ACM SIGIR C 46 1998
More information about the SIGMETRICS
mailing list