[Sigia-l] Social + Semantic = Twine
Bill Quimby
wquimby at ecr.net
Sat Oct 20 12:23:39 EDT 2007
More on Twine...
-----------------------------------------------------------------------------------
Radar’s Twine: A semantic complement to Google
via VentureBeat by Chris Morrison on 10/1
More than a year of secrecy spawned rumors about Radar Networks. The most popular: It’s a
“Google killer.”
Tomorrow morning, San Francisco’s Radar will surprise a few people by launching Twine, a
tool for collecting and organizing information that’s very different from Google. But it’s
potentially just as ambitious.
An example of how Twine works: A user uploads a text document to their Twine account.
Twine then parses the document to find the words with meaning — names, places, concepts
and so forth. Those terms become tags, which the person can use to access related information.
Twine’s underlying technology gives the computer a measure of intelligence. Using tags, a
computer can distinguish between, say, a reference to the kind of bird that flies and the
kind that flips people off. Once it has, it can give users a wealth of other information,
drawn from their own accumulated knowledge base, other users and the outside internet.
Where Google crawls the entire web and ultimately pollutes your search results with
different kinds of “birds,” Radar picks from a smaller universe of sources and tries to
automatically discard the ones you don’t want.
Let’s dumb this down to a very concrete example. In Twine, I might be identified as “Chris
Morrison,” and then labeled with the markers “writer,” “venturebeat,” “male,”
“technology,” “charming” and “good-looking” (all true, of course). Twine would set me
apart from the many other Chris Morrisons running around.
That could help a headhunter narrow their focus, a marketer collect all the information
about a particular product, or a group of analysts to aggregate information on a subject.
The “documents” gathered will include, among many others, text, PDFs, or even videos on
YouTube (Twine simply draws on pre-existing tags and descriptions of visual media to do
its own tagging work).
The information that helps Twine make decisions on its own about what content to collect
for you comes both from a users’ accumulated information as well as their actions, which
means that, as the user pulls more info into the account on their own, Twine will begin to
work cooperatively, providing more content where it’s needed and even assisting groups or
teams of people with collaborative research and knowledge-building.
Young companies with a limited ability to do similar selection tricks — for instance,
Jiglu, which we posted about a few days ago — are increasingly common, and tend to obscure
the potential of companies that truly have a chance of becoming market leaders. That’s too
bad, because there’s no question that intelligent computer handling of data — a first step
toward artificial intelligence — will be an important part of the internet in coming years.
nova.jpgHelping Radar is the breadth of its underlying technology and the strong
scientific and engineering team, now 30 strong, that has been working on the platform for
years. It is founded by Nova Spivack (left), an entrepreneur who worked on artificial
intelligence with futurist Ray Kurzweil. He later co-founded the early internet company
EarthWeb, which went public in 1999, and helped other large companies get online.
So how does Twine plan to make money? If it’s busy reading the minds of its users, there
should be some significant advertising mojo possible. Twine also plans a paid version with
no advertising.
For now, though, the company is venture backed. As we reported last year, the company
raised $5 million for Paul Allen’s Vulcan Capital, Leapfrog Ventures and angel investors.
We’re hearing the company is looking to raise about $15-20 million for its next round.
Radar does, however, have competitors. The winning bet will boil down to which company
will be able to throw enough scientific brilliance at the difficult problem of teaching
computers to understand human information. The winner will likely dominate, as Google does
with search.
To explain the differences between these competing startups, it’s easiest to separate them
by the particular types of technology they utilize.
Broadly speaking, those technologies fall into three categories.
1) The first is statistical analysis, where Google reigns supreme. Terms are examined for
their frequency, placement and outside links to determine their apparent relevancy, and
then ranked. Google’s algorithms have gotten better over the years, and it has
incrementally added on other technologies and services.
2) Natural language search is the second category. Teaching computers to understand human
language is a complex process which involves breaking sentences down to their component
parts — nouns, verbs, adjectives and so forth — which can then take on symbolic meaning
for computers. Powerset (previous coverage), which is dribbling out its technology in
stages, is a prime example of this approach.
3) The third, semantic search, is much-hyped, but little understood. Simply put, people
attach markers to human-generated content, whether a paragraph of text or a picture, to
outright tell computers in a special machine language what’s meaningful (we mentioned the
“Chris Morrison” = “charming” example, above). If applied to the entire internet, the
result could be thought of as a giant, interrelated Wikipedia. Metaweb, which recently
launched Freebase, is attempting to create just that. It should be noted that Powerset, of
category #2 above, has partnered with Metaweb, thus placing Powerset in this third
category as well in some ways.
For the most part, each company is betting on its own core technology to win the race.
Radar, however, works with all three. That combination, it hopes, will take the day. It is
similar in its ambitions to another secretive startup, Franz Inc, which VentureBeat plans
to write about soon.
To be fair, there’s also a fourth, less glamorous approach which relies almost entirely on
humans. ChaCha and a forthcoming (much delayed) startup from Wikipedia founder Jimmy Wales
are two examples.
First, though, the viability of any technology must be proven. What matters is how well
Twine can perform at helping humans organize the avalanche of information that is modern
life. So while there are other features we could mention, from organizing content through
an innovative “bookmarklet” to finding related content through a “social graph” of similar
users, it’s more useful to give our reaction to Twine.
Having sat through a limited demo by Radar founder Nova Spivack, we can say we’re looking
forward to kicking the tires of Twine more extensively. The interface is simple, yet
powerful. While the use of tagging resembles tag-lists that have been around for years,
the Twine application is more powerful. See the example of a search for “clean-tech”
below. See the various tags that are related to it.
Caveat: The site is still in development. A wealth of other features would be helpful to
have, from a more extensive array of choices for communicating with other users (Spivack
does say that instant messaging is coming) to adding more possibilities for linking
information.
However, the Twine team won’t have to do the work alone. Sometime after the current beta
launch, which will be limited to a few thousand people, Twine plans to open up several
APIs to allow outside developers to build on the platform.
For now, the site is geared toward people who use the internet heavily — primarily
“knowledge professionals,” like the marketers and analysts mentioned above. Students,
“prosumers” (people with a strong interest in a particular thing) and companies will also
likely find uses for Twine.
------------------------------------------------------------------------------
- Bill
More information about the Sigia-l
mailing list