[Sigia-l] Findability (long)

Rashmi Sinha rashmi_sinha at yahoo.com
Tue Jan 28 17:05:50 EST 2003


I have been following this discussion about Findability and have a few 
comments to add about Recommender Systems (that have been praised for 
helping people find things directly, without categories), and metadata 
search as compared to Google type search.

If you look closely at collaborative filtering algorithms (that form the 
basis of many recommender systems), then you will find that they are not as 
category-free as you think. The first step with many Recommender Systems 
algorithms is dimensionality reduction, which often means some sort of a 
clustering (you start getting close to categorization here, though of a 
different kind). Another way is to take genres into account (Note. I am 
more familiar with book, movie and music recommender systems). Recommending 
within genres (which is what Amazon mostly does) is a conservative 
technique designed to increase revenues as opposed to helping users 
discover more about their tastes (look at MediaUnbound for the best Music 
Recommnder system, ever!). Better Recommender Systems , which truly help 
users find new items, and expand their tastes, find much less of a market. 
I cannot repeat names of companies here, but I recently launched on a 
mission to find out what happened to many of the great Recommender Systems 
that were developed in the last few years. The story is the same, many 
Recommender System companies are finding it hard to sell their best 
products that really helped users explore their tastes. The Recommender 
Systems that sell, are ones that help companies make the sale, and make it 
fast. (We wrote about this contrast between systems that help users explore 
tastes and systems that help make the sale at 
http://www.rashmisinha.com/articles/musicDIS.pdf (Self link))

Also, it is incorrect to think that Recommender Systems cannot have an 
agenda, or less of an agenda than categorization. Recommender Systems are 
explicitly designed to encourage people to buy. Often, they are the 
technique that helps the telemarketer suggest another product to you in a 
late evening phone call. In contrast browse, or search systems are much 
more self-directed. Recommender System algorithms are fine tuned for 
marketing and sales purposes not for helping you discover information.

One quote that comes to mind about Recommender Systems: "When they are 
good, they are very very good, when they are bad, they are very very bad". 
Read this article in the Wall Street Journal Opinion Page to know what I 
mean: "If TiVo Thinks You Are Gay, Here's How to Set It Straight" 
http://online.wsj.com/article_email/0,,SB1038261936872356908,00.html

Don't think Recommender Systems do not use classification. Apart from other 
things, they also classify YOU. And they classify you without any knowledge 
or choice on your part. The privacy issues are huge. Although I am a big 
advocate of Recommender Systems, I understand that such systems can be used 
(and probably are being used) in a downright insidious manner. IMO, using a 
good Metadata browse and search seems like a walk in the park, where I can 
stop and smell the roses when I feel like it, as opposed to using a 
recommender system which, in the wrong hands, can be like being followed 
around by a detective. I like the openness of categorization techniques. It 
is possible for a user to see the scheme, the explore the biases of the 
scheme if interested. With a Recommender System, you have little access to 
what it thinks of you.

A different issue: Is good keyword search always better than metadata 
search? Read the latest paper from the Flamenco project which shows that at 
least for some cases, metadata search is preferred. 
http://bailando.sims.berkeley.edu/papers/flamenco-chi03.pdf

At the end: Why use categorization schemes to display information? For the 
very reason that human beings use categorical lenses to view the world (By 
the way, there is an inordinate amount of evidence from cognitive 
psychology, anthropology, neuroscience, neuropsychology that human beings 
indeed view the world through categorical lenses, and this ability emerges 
very early in life). It enables a more efficient filtering of information. 
It also provides a social glue by, and allows for collective 
representations, and distributed cognition. The key for the information 
architect is to find the correct categorical scheme to use. I would tend to 
agree with those who point out that a bad categorical scheme is worse than 
other options (because it actively misleads the user).

Research also shows that there is great deal of consensus in how people 
categorize within a culture (see research by Romney and Weller). 
Categorization allows us to tap into the domain knowledge of another person 
without needing to learn everything they know. Read this article on 
categories used by bartenders: http://www.lehigh.edu/~jbg1/persknow.htm

IMO, there is a great deal of promise in using the correct categorization 
schemes as and when it is appropriate.

TO end, here's an incident that illustrates what I have been trying to 
convey: One of the Recommender Systems I referred to above has helped me 
tremendously in expanding my musical tastes. It recommended a number of 
songs that I thoroughly enjoyed, but had not heard before. They all seemed 
to have something in common, but I was not sure what. It was only when it 
also identified the genre as Latin Jazz, that I knew how to tie this new 
found taste to other music genres I like, how to follow that interest out 
of the confines of the Recommender System.

Hope this is not too confusing!
-rashmi 





More information about the Sigia-l mailing list