[Sigia-l] automated site mapping tools?

Cathy Caron ccaron at vianow.com
Sun Jun 23 08:46:00 EDT 2002


What you'll be able to use for this will depend partly on how this site is coded
and how "all" you need your "all pages" to be.  When searching for a link
checker over the last few months, I found that there are lots of programs,
including open source that will find most links, but very, very few that can
properly spider complex Javascript and DHTML links.  If this site has links in
arrays in js files, DHTML menus etc., it is very unlikely you will get a
complete list from any of the open source, or in fact most of the commercial
programs.  If you need to be certain you have all the pages, and if you only
need a single shot at the contents of the site, and not ongoing link checking, I
would download a trial version of Watchfire (www.watchfire.com) which is very
aggressive about searching complex links and allows the use of regular
expressions to define what a link is.  I am not connected with this company in
any way of course, well, except for having bought their product.

Cathy Caron
The VIA Group

> I'm presented with a site with 7,000+ static HTML files which has grown,
> um, organically over the years.
>
> What are the recommendations for software which will crawl the site and
> produce a list of all pages and all links on those pages? It doesn't
> necessarily need to produce pretty hierarchical diagrams (we're not even
> certain if the site is truly hierarchical).





More information about the Sigia-l mailing list