[Sigia-l] K Mel Faceted Classification
Richard Hill
rhill at asis.org
Thu Jul 11 13:56:29 EDT 2002
[Reposted for Meltsner, Kenneth" <Kenneth.Meltsner at ca.com> to avoid
HTML. Dick Hill]
Facets need to be orthogonal (and hierarchical?)
The other big distinction between faceted classification and traditional
"query-by-example" search is that the facets should be as independent or
orthogonal to each other as possible.
If I were classifying insurance-related news, I might choose:
* State
* Line of business
as two (relatively) independent facets.
Orthogonality allows you to specify values (or ranges of values) for
multiple facets with a reasonable expectation that there will be some
results. If the categories aren't independent, then it's too easy to
specify searches with no results, I think.
The other issues with facets is that they need to store more than simple,
single values. As the facets become more numerous, the more likely it is
that they will be quite sparse (that for many items, they will not have any
values). You need to ensure that facets have values most of the time
without limiting your ability to describe very specific cases. The
simplest way to do this involves hierarchical facet values, I think -- an
attribute value is broadened by going up the tree.
For example, a genre facet for novels might have values organized as a
hierarchy:
* Genre
-- Science Fiction and Fantasy
-- Space opera
-- New wave
-- Near Future
-- Historical fantasy
-- Alternate History (SF)
-- Alternate History (fantasy)
-- Mystery
-- Police procedural
-- Cozy
-- Hardboiled
This probably works, although exact classification is always a
problem: how do you find a book like _Point of Dreams_, which is an
alternative history (fantasy) with strong mystery (police procedural)
elements? Or a SF near-future hardboiled novel? Within in a facet, it
appears that you need to be able to handle multiple values as well, just as
it's possible that a California insurance news item would apply to both
auto and home insurance.
Ken Meltsner
Senior Architect, Computer Associates
Executive Director
American Society for Information Science and Technology
1320 Fenwick Lane, Suite 510
Silver Spring, MD 20910
FAX: (301) 495-0810
PHONE: (301) 495-0900
http://www.asis.org
More information about the Sigia-l
mailing list