Jiang HF, Lou WW, Wang W "Three-tier clustering: An online citation clustering system" ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS LECTURE NOTES IN COMPUTER SCIENCE 2118: 237-248 2001

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Fri May 3 14:50:52 EDT 2002


Haifeng Jiang : {jianghf,wwlou,fervvac}@cs.ust.hk
Full Text Available At : http://www.cs.ust.hk/~fervvac/files/waim2001.pdf

TITLE Three-tier clustering: An online citation clustering system
AUTHOR Jiang HF, Lou WW, Wang W
JOURNAL ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS
            LECTURE NOTES IN COMPUTER SCIENCE 2118: 237-248 2001

Document type: Article    Language: English
Cited References: 19      Times Cited: 0


Abstract:
In this paper, we present a three-tier clustering method in which data
objects are described by a number of feature dimensions. Using the approach,
the similarity between objects along each feature dimension is first
computed. The inter-object similarity is then computed from the
feature-dimension similarities using a Bayesian multi-causal model. Objects
are finally clustered based on the computed similarity. An online citation
entry clustering system was built using the approach. It accepts user
queries in the form of author names. Such queries are sent to
citation/bibliography search engines. The returned entries are clustered
based on feature dimensions such as authors, title, place of publication,
etc. After clustering, entries from different authors with similar names
form different clusters, which are presented to the user. Preliminary
experimental results indicate the effectiveness of the proposed clustering
approach. The architecture of the three-tier clustering framework, the
feature representation of a citation entry, a brief network model for
inter-object similarity computation, and a special cluster evaluation
technique are discussed in detail.
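
The three tiers described in the abstract can be illustrated with a rough
sketch. It is not the authors' implementation: the abstract does not specify
the per-feature similarity measure, the form of the Bayesian multi-causal
model, or the clustering algorithm. The sketch below assumes token-set
(Jaccard) overlap for each feature, a noisy-OR combination as one common
multi-causal model, and threshold-based single-link clustering. The feature
names, weights, and threshold are hypothetical values chosen for
illustration.

```python
from itertools import combinations

# Hypothetical weights: how strongly a match on each feature dimension
# is taken to suggest that two entries belong to the same author.
WEIGHTS = {"authors": 0.6, "title": 0.3, "venue": 0.5}


def jaccard(a, b):
    """Tier 1 (assumed measure): token-set overlap in [0, 1]."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def noisy_or(sims, weights=WEIGHTS):
    """Tier 2 (assumed model): noisy-OR, 1 - prod(1 - w_f * s_f)."""
    p_none = 1.0
    for feature, s in sims.items():
        p_none *= 1.0 - weights[feature] * s
    return 1.0 - p_none


def entry_similarity(x, y):
    """Inter-object similarity from the per-feature similarities."""
    return noisy_or({f: jaccard(x[f], y[f]) for f in WEIGHTS})


def cluster(entries, threshold=0.5):
    """Tier 3 (assumed algorithm): single-link clustering via union-find."""
    parent = list(range(len(entries)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(entries)), 2):
        if entry_similarity(entries[i], entries[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(entries)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

Calling cluster() on a list of dictionaries with "authors", "title", and
"venue" keys returns lists of entry indices, one list per cluster. With this
kind of combination, two entries that share only a common author name stay
below the threshold unless other feature dimensions also agree.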

Addresses:
Jiang HF, Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Peoples R China
Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Peoples R China

Publisher:
SPRINGER-VERLAG BERLIN, BERLIN

IDS Number:
BT99G

ISSN:
0302-9743

 Cited Author            Cited Work                Volume      Page   Year

                       ACC SAMPLE FUNCTION
                       COLLECTION COMPUTER
                       COMPUTER SCI BIBLIO
 *NECI SCI LIT DIG     RES IND
 BOTAFOGO RA           ACM SIGIR 93 6 93 PI
 BRIN S                P 7 INT WORLD WID WE                            1998
 CUTTING DR            15 ANN INT SIGIR 92
 CUTTING DR            16 ANN INT SIGIR 93
 DUDA RO               PATTERN CLASSIFICATI                            1973
 GILES L               P ACM C DIG LIB PITT                    89      1998
 JAIN AK               ACM COMPUTING SURVEY          31                1999
 LAWRENCE S            NATURE                       400       107      1999
 MODHA DS              P ACM HYP C MAY 30 J                            2000
 PORTER MF             PROGRAM                       14       130      1980
 RASMUSSEN E           CLUSTERING ALGORITHM                   419      1992
 RUSSELL SJ            ARTIFICIAL INTELLIGE                  CH15      1995
 WILLET P              INFORMATION PROCESSI                   577      1988
 ZHANG NL              IJCAI 99                              1288      1999
 ZHANG NL              J ARTIF INTELL RES             5       301      1996


