Wen JR, Nie JY, Zhang HJ "Query clustering using user logs" ACM TRANSACTIONS ON INFORMATION SYSTEMS 20 (1): 59-81 JAN 2002
Eugene Garfield
garfield at CODEX.CIS.UPENN.EDU
Fri May 3 16:40:31 EDT 2002
J.R. Wen : jrwen at microsoft.com
J.Y. Nie : nie at iro.umontreal.ca
H.J.Zhang : hjzhang at microsoft.com
TITLE Query clustering using user logs
AUTHOR Wen JR, Nie JY, Zhang HJ
JOURNAL ACM TRANSACTIONS ON INFORMATION SYSTEMS 20 (1): 59-81 JAN 2002
Document type: Article Language: English Cited References: 21
Times Cited: 0
Abstract:
Query clustering is a process used to discover frequently asked questions or
most popular topics on a search engine. This process is crucial for search
engines based on question-answering. Because of the short lengths of
Queries, approaches based on keywords are not suitable for query clustering.
This paper describes a new query clustering method that makes use of user
logs which allow us to identify the documents the users have selected for a
query. The similarity between two queries may be deduced from the common
documents the users selected for them. Our experiments show that a
combination of both keywords and user logs is better than using either
method alone.
Author Keywords:
algorithms, experimentation, performance, query clustering, web data mining,
user log, search engine
Addresses:
Wen JR, Microsoft Res, Beijing Sigma Ctr 49, Asia 5F,Zhichun Rd, Beijing,
Peoples R China
Microsoft Res, Beijing Sigma Ctr 49, Beijing, Peoples R China
Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
Publisher:
ASSOC COMPUTING MACHINERY, NEW YORK
IDS Number:
517AE
ISSN:
1046-8188
Cited Author Cited Work Volume Page Year
BEEFERMAN D P 6 ACM SIGKDD INT C 407 2000
DELIMA E P 22 ANN INT ACM SIG 145 1999
DUBES RC ALGORITHMS CLUSTERIN 1988
ESTER M P 2 INT C KNOWL DISC 226 1996
ESTER M P 24 INT C VER LARG 323 1998
FITZPATRICK L P 20 ACM SIGIR INT C 306 1997
GARFIELD E CITATION INDEXING IT 1983
GUSFIELD D ALGORITHMS STRINGS T 1997
KESSLER MM AM DOC 14 10 1963
KLEINBERG J P ACM SIAM S DISCR A 668 1998
KULYUKIN VA P AAAI 98 98 532 1998
LEWIS DD P 13 ANN INT ACM SIG 385 1990
LU Z P 23 ANN INT ACM SIG 248 2000
MILLER G INT J LEXICOGR 3 4 1990
NG RT P 20 INT C VER LARG 144 1994
PORTER MF PROGRAM 14 130 1980
SALTON G INTRO MODERN INFORMA 1983
SRIHARI R P TREC 8 75 1999
VANRIJSBERGEN CJ INFORMATION RETRIEVA 1979
VOORHEES EM P 18 ACM SIGIR C RES 172 1995
XU J P 19 ANN INT ACM SIG 4 1996
More information about the SIGMETRICS
mailing list