He Y, Hui SC, Fong ACM "Mining a Web citation database for document clustering" APPLIED ARTIFICIAL INTELLIGENCE 16 (4): 283-302 APR 2002

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Thu Jun 20 10:23:49 EDT 2002

TITLE Mining a Web citation database for document clustering

 Document type: Article    Language: English
Cited References: 37       Times Cited: 0

The World Wide Web has become an important medium for disseminating
scientific publications. Many publications are now made available over the
Web. However, existing search engines are ineffective in searching these
publications, as they do not index Web publications that normally appear in
PDF (Portable Document Format) or PostScript formats. One way to index Web
publications is through citation indices, which contain the references that
the publications cite. Web citation Database is a data warehouse to store
the citation indices. In this paper, we propose a mining process to extract
document cluster knowledge from the Web Citation Database to support the
retrieval of Web publications. The mining techniques used for document
cluster generation are based on Kohonen's Self-Organizing Map (KSOM) and
Fuzzy Adaptive Resonance Theory (Fuzzy ART). The proposed techniques have
been incorporated into a citation-based retrieval system known as PubSearch
for Web scientific publications.

KeyWords Plus:

He Y, Nanyang Technol Univ, Sch Comp Engn, Nanyang Ave, Singapore 639798,
Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore


IDS Number:

Clear the checkbox to the left of an item if you do not want to search for
articles that cite the item when looking at Related Records.

 Cited Author            Cited Work                Volume      Page

 *DIG EQ CORP DEC      VIRT PAP PROJ                                   2000
 *WORDNET              WORDNETA LEX DAT ENG                            2000
 AGGARWAL C            P 5 ACM SIGKDD INT C                   352      1999
 BOLLACKER KD          IEEE INTELL SYST APP          15        42      2000
 BOLLACKER KD          P 2 INT C AUT AG MIN                   116      1998
 CALLON M              SCIENTOMETRICS                22       153      1991
 CARPENTER GA          NEURAL NETWORKS                4       759      1991
 DEERWESTER S          J AM SOC INFORM SCI           41       391      1990
 FAYYAD UM             ADV KNOWLEDGE DISCOV                     1      1996
 GARFIELD B            CITATION INDEXING IT                            1979
 HARTER SP             J AM SOC INFORM SCI           43       602      1992
 HARTIGAN JA           CLUSTERING ALGORITHM                            1975
 HE Y                  THESIS NANYANG TU SI                            2000
 HONKELA T             CLASSIFICATION DATA                    245      1998
 JAIN AK               ACM COMPUT SURV               31       264      1999
 KASKI S               COMPUTING SCI STAT            29       281      1998
 KASKI S               P IJCNN 98 INT JOINT           1       413      1998
 KAUFMAN L             FINDING GROUPS DATA                             1990
 KOHONEN T             IEEE T NEURAL NETWOR          11       574      2000
 KOHONEN T             P INT C ART NEUR NET                    65      1998
 KOHONEN T             SELF ORGANIZING MAPS                            1995
 LIN X                 J AM SOC INFORM SCI           48        40      1997
 MITCHELL TM           COMMUN ACM                    42        31      1999
 PAO ML                INFORM PROCESS MANAG          29        95      1993
 RAUBER A              P 4 ACM C DIG LIB                      240      1999
 ROCCHIO J             THESIS DIFF HARVARD                             1966
 SALTON G              INTRO MODERN INFORMA                            1983
 SALTON G              SCIENCE                      253       974      1991
 SARACEVIC T           INFORMATION SCI INTE                   210      1996
 SCHATZ B              IEEE COMPUT                   29        22      1996
 SLONIM N              P 23 ANN INT ACM SIG                   208      2000
 TISHBY N              P 37 ANN ALL C COMM                    368      1999
 TURTLE H              ACM T INFORM SYST              9       187      1991
 VANRIJSBERGEN C       INFORMATION RETRIEVA                            1979
 WHITE HD              J AM SOC INFORM SCI           32       163      1981
 YANG Y                P 22 INT C RES DEV I                    42      1999
 ZAMIR O               P 19 INT ACM SIGIR C                    46      1998

More information about the SIGMETRICS mailing list