[Sigia-l] how are you making wireframes for Flash projects?

Karl Fast karl.fast at pobox.com
Wed Apr 7 14:53:15 EDT 2004


> Google attempts to defeat this workaround, by retrying pages using a
> non-spider UA string 

Last fall I wrote a spider to automatically query Altavista,
AlltheWeb, and Google and parse the results. It would send a query
and then screenscape the resulting HTML for the linkcount figure.

Google detected this and returned a special page saying, in effect,
"you can't run a spider against us because it's against our usage
policy, but you can use the Google API." Which I did. Altavista and
AlltheWeb didn't prevent me from doing this.

Now my script was simple. The Google query was just a few lines of
python using the standard library. You could be more sophisticated
in how you emulate a 'real' browser, but Google does automatically
detect the simple case. I never bothered to figure out exactly how
it was doing this.


--karl
http://www.livingskies.com/




More information about the Sigia-l mailing list