Wen JR, Nie JY, Zhang HJ "Query clustering using user logs" ACM TRANSACTIONS ON INFORMATION SYSTEMS 20 (1): 59-81 JAN 2002

Eugene Garfield garfield at CODEX.CIS.UPENN.EDU
Fri May 3 16:40:31 EDT 2002


J.R. Wen: jrwen at microsoft.com
J.Y. Nie: nie at iro.umontreal.ca
H.J. Zhang: hjzhang at microsoft.com



TITLE Query clustering using user logs
AUTHOR Wen JR, Nie JY, Zhang HJ
JOURNAL ACM TRANSACTIONS ON INFORMATION SYSTEMS  20 (1): 59-81 JAN 2002

 Document type: Article    Language: English    Cited References: 21
Times Cited: 0


Abstract:
Query clustering is a process used to discover frequently asked questions or
most popular topics on a search engine. This process is crucial for search
engines based on question-answering. Because of the short lengths of
queries, approaches based on keywords are not suitable for query clustering.
This paper describes a new query clustering method that makes use of user
logs, which allow us to identify the documents the users have selected for a
query. The similarity between two queries may be deduced from the common
documents the users selected for them. Our experiments show that a
combination of both keywords and user logs is better than using either
method alone.
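
As a rough illustration of the idea in the abstract, the sketch below scores
two queries by the documents users selected for both, and blends that with a
keyword overlap score. The abstract does not give the paper's formulas, so
everything here is an assumption made for illustration: the overlap ratio
(shared items divided by the size of the larger set), the weight `alpha`, the
names `overlap` and `combined_similarity`, and the toy `clicks` log.

```python
def overlap(a, b):
    """Shared items divided by the size of the larger set (0.0 if both are empty)."""
    a, b = set(a), set(b)
    largest = max(len(a), len(b))
    return len(a & b) / largest if largest else 0.0


def combined_similarity(q1, q2, clicks, alpha=0.5):
    """Blend keyword overlap with clicked-document overlap.

    `clicks` maps a query string to the set of documents users selected for it.
    `alpha` is a hypothetical weight on the keyword score, not a value from the paper.
    """
    keyword_score = overlap(q1.lower().split(), q2.lower().split())
    click_score = overlap(clicks.get(q1, ()), clicks.get(q2, ()))
    return alpha * keyword_score + (1 - alpha) * click_score


# Toy user log (invented data): the two queries share no keywords,
# but users selected mostly the same documents for them.
clicks = {
    "cheap flights": {"doc1", "doc2"},
    "low cost airfare": {"doc1", "doc2", "doc3"},
}

score = combined_similarity("cheap flights", "low cost airfare", clicks)
```

In this toy log the keyword score is zero while the click score is not, which
is the situation the abstract has in mind when it says keyword approaches
alone are not suitable for short queries.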

Author Keywords:
algorithms, experimentation, performance, query clustering, web data mining,
user log, search engine

Addresses:
Wen JR, Microsoft Res, Beijing Sigma Ctr 49, Asia 5F,Zhichun Rd, Beijing,
Peoples R China
Microsoft Res, Beijing Sigma Ctr 49, Beijing, Peoples R China
Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada

Publisher:
ASSOC COMPUTING MACHINERY, NEW YORK

IDS Number:
517AE

ISSN:
1046-8188

Cited Author            Cited Work                Volume      Page      Year

 BEEFERMAN D           P 6 ACM SIGKDD INT C                   407      2000
 DELIMA E              P 22 ANN INT ACM SIG                   145      1999
 DUBES RC              ALGORITHMS CLUSTERIN                            1988
 ESTER M               P 2 INT C KNOWL DISC                   226      1996
 ESTER M               P 24 INT C VER LARG                    323      1998
 FITZPATRICK L         P 20 ACM SIGIR INT C                   306      1997
 GARFIELD E            CITATION INDEXING IT                            1983
 GUSFIELD D            ALGORITHMS STRINGS T                            1997
 KESSLER MM            AM DOC                        14        10      1963
 KLEINBERG J           P ACM SIAM S DISCR A                   668      1998
 KULYUKIN VA           P AAAI 98                     98       532      1998
 LEWIS DD              P 13 ANN INT ACM SIG                   385      1990
 LU Z                  P 23 ANN INT ACM SIG                   248      2000
 MILLER G              INT J LEXICOGR                 3         4      1990
 NG RT                 P 20 INT C VER LARG                    144      1994
 PORTER MF             PROGRAM                       14       130      1980
 SALTON G              INTRO MODERN INFORMA                            1983
 SRIHARI R             P TREC 8                                75      1999
 VANRIJSBERGEN CJ      INFORMATION RETRIEVA                            1979
 VOORHEES EM           P 18 ACM SIGIR C RES                   172      1995
 XU J                  P 19 ANN INT ACM SIG                     4      1996


