Saturday, January 31, 2009

Iterasi Archives Sites Without Permission

Guess what boys and girls?

There's another wonderful new site that allows people to copy your shit without your permission!

Iterasi allows their members to "archive" individual web pages.

The pages on my site have a meta tag "NOARCHIVE" which tells everyone DO NOT ARCHIVE this page yet they archived it anyway. They also stripped out my frame busting javascript so they are seriously thwarting sites at every turn that don't want to participate in their tool.

Being that Iterasi is in Beta maybe I'll cut them a little slack, very little, but just a bit.

On their web site it says:

At iterasi, we love the Web. So much so, that we want to keep it. Forever.
If you really love the web you would follow standard web protocols and if the webmaster gives you permission, fine, do whatever you want.

For those of us that don't allow it, back the fuck off.

Here's the IP and user agent details:
198.145.117.78

"Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0; WOW64; SLCC1; .NET CLR 2.0.50727)"
They operate out of this IP range:
OrgName: Infinity Internet, Inc.
NetRange: 198.145.0.0 - 198.145.255.255
CIDR: 198.145.0.0/16
Infinity Internet is a mixed service with both hosting and business/residential DSL services so blocking the whole range probably isn't safe.

The reverse DNS shows:
pointer ip78.117.colo.iinet.com.
For the time being, you can opt-out of Iterasi by blocking anything with an RDNS containing ".colo.iinet.com" which seems to stop them dead in their archiving tracks.

Here's a few things Iterasi could do so webmasters don't get hostile:
  • Honor robots.txt
  • Honor meta tags like NOARCHIVE
  • Provide a user agent string that identifies Iterasi accessing a site
  • Provide reverse DNS so we can tell it's your company and not a spoof
Until that time, I have you permanently blocked and I'm sure others will soon .