[Sigia-l] search results and thesauri

Avi Rappoport analyst at searchtools.com
Wed May 22 15:36:03 EDT 2002


These are great discussions -- if anyone has good (or bad) examples 
of implementations, please post.  I love to use real examples when 
I'm giving talks and writing articles.

I think best practices require a number of different approaches, all 
depending on how much vocabulary control and keyword metatagging 
you're doing.

I do not recommend whisking someone to the theoretical applicable 
page without going through search results first.  It breaks the 
expectation and may be wrong.  For example, I'm analyzing a search 
spellchecker (paper to come soon) and someone misspelled "New England 
Journal of Medicine".  The site actually has a record for that, but 
because they misspelled it, the automatic system took them directly 
to the listings for -- Great Britain (because of "England").  Whoops.

My main rule is to explain any automatic conversions (and wish people 
would do that for stemming).  For true synonyms (doctor => 
physician), I think it's legit to just issue the search for the 
preferred term, put a note on the results page, and consider 
highlighting hits in a different form for the original vs. preferred 
term.  In this example, you could do italic for :doctor" and bold for 
"physician".

It does get trickier when you're talking about less solid agreement. 
On a health site, it turned out that they used "Primary Care 
Provider" or "PCP".  PCP is a more inclusive term, because it covers 
Nurse Practitioners and such, but it's not what people really expect 
when they type in a very general term like "doctor".  So I would 
recommend that the site actually provide an informative response to 
general questions about doctors, rather than just using a synonym 
system in this case.

The only search engine I know that allows search admin control over 
whether synonyms are automatically searched or whether they are 
presented as an option is Inktomi Search Software: the synonym file 
has flags for behavior.

Looking forward to learning more,

Avi


At 12:03 PM -0700 5/22/02, Chris Farnum wrote:
(snip)
>So here's a related question... does your search
>engine handle both synonyms AND preferred terms.
>Often 3rd party solutions don't include both and you
>are forced to decide how far to stretch the simple
>equivalence (synonym ring) feature they've given you.
>Your answer will depend partly on your content, partly
>on your indexing guidelines, and partly on how you've
>designed your CV.  For example, if you've got a
>collection of carefully edited and authored content in
>which term usage is very consistent you might need to
>concentrate on the alternate terms that point to your
>preferred terms (so users are less likely to get null
>results).  On the other hand if you have a more
>diverse set of content and users that are concerned
>with high recall, you will likely spend more effort on
>synonyms.  These issues may also impact how you answer
>the highlighting question.
>
>Regards,
>Chris

-- 
Search Server Industry Analysis from Search Tools Consulting
    (510) 845-2551  -- <mailto: analyst at searchtools.com>
Complete Guide to Search Engines for Web Sites and Intranets
    <http://www.searchtools.com>



More information about the Sigia-l mailing list