The downside of scraping the wrong webmaster is that your websites now contain breadcrumbs that let that webmaster unravel a big chunk of the network of sites you've been using to scrape and spam.
I'm not going to even go into the list of domains I found my scrapings on as it's a huge list and the specific sites I found were all hosted on theplanet.com and 800hosting.net.
Besides, if I expose the list this MFA scraper spammer might figure out how I unraveled his system and we wouldn't want that, now would we?
I'm not even going to bother with the IP they were scraping from or the user agent since it was a spoofed browser UA of course, and the IPs doing the scraping were all from the same hosting companies listed below.
Instead, let's start at the tip of the iceberg with their statistics pages, which list 400-500 sites per page and together link to roughly 6,500 individual scraper sites, and I'm sure we're just scratching the surface here.
http://www.badhood.info/
So where do these sites host?
badhood.info 22.214.171.124 -> 2.89.5746.static.theplanet.com
browserbytes.com 126.96.36.199 -> c2.1a.344a.static.theplanet.com
csprovisions.com 188.8.131.52 -> 2.1d.344a.static.theplanet.com
inbounders.com 184.108.40.206 -> evolution.cia.sk
jewelrydns.info 220.127.116.11 -> (800hosting.net)
landingdns.info 18.104.22.168 -> ev1s-66-98-132-73.ev1servers.net
link-magic.com 22.214.171.124 -> rs-64-246-60-95.ev1.net
link-pros.com 126.96.36.199 -> ns1.s810.net
multithreedns.info 188.8.131.52 -> c2.e1.344a.static.theplanet.com
multitwodns.info 184.108.40.206 -> 82.7e.344a.static.theplanet.com
sfte.info 220.127.116.11 -> c2.d8.5746.static.theplanet.com
terrificdns.com 18.104.22.168 -> damon.screaminghost.com
trafficsupply.com 22.214.171.124 -> (Everyones Internet)
virtual-domains.com 126.96.36.199 -> ev1s-66-98-198-44.ev1servers.net
There you go, it could've been a long spew of data but there's really nothing you need to know except BLOCK access from data centers and you'll be a bit more secure, which I've been preaching for quite some time.
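For anyone wondering what "BLOCK access from data centers" might look like in practice, here is a minimal sketch, not the setup actually used on this site. It assumes you decide based on the visitor's reverse DNS name, and the suffix list is a hypothetical starting point seeded from hostnames in the listing above. The names `DATACENTER_SUFFIXES`, `is_datacenter_host`, `reverse_dns` and `should_block` are made up for illustration.

```python
import socket

# Hypothetical suffix list, seeded from reverse DNS names in the listing above.
# A real deployment would maintain its own, much longer list.
DATACENTER_SUFFIXES = ("theplanet.com", "ev1servers.net", "ev1.net")


def is_datacenter_host(hostname, suffixes=DATACENTER_SUFFIXES):
    """Return True if hostname equals, or is a subdomain of, a listed suffix."""
    host = hostname.strip().rstrip(".").lower()
    return any(host == s or host.endswith("." + s) for s in suffixes)


def reverse_dns(ip):
    """Return the reverse DNS name for ip, or None if the lookup fails."""
    try:
        return socket.gethostbyaddr(ip)[0]
    except OSError:
        return None


def should_block(ip, lookup=reverse_dns):
    """Block only when the IP resolves to a name under a listed suffix."""
    hostname = lookup(ip)
    return hostname is not None and is_datacenter_host(hostname)
```

Calling `should_block(ip)` on a visitor's address does a live lookup, so you would probably want to cache the answers. Reverse DNS can be missing or set to anything the IP's owner likes, so treat this as one signal; matching against the hosting companies' published IP ranges is likely the sturdier option.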
Now, let's look at a specific site like fashionmenclothingjackets.info and you'll see how they really spam the search engines with 3-digit subdomains. All of their sites are like this and there are literally hundreds of thousands, if not millions, of junk pages associated with this one group of domains.
And we'll take a peek at another of these sites, like fiftiesteenagefashion.info, to see how they promote themselves with blog and forum spam for traffic.
There you have it: scraping, search engine spam, and blog and forum spam all tied up in one neat little package.
P.S. Did we piss on someone's cornflakes?
Getting a ton of hits to this post via a forum on http://www.pginsider.com/ which makes you go Hmmmm... It's amazing how they out themselves once you post something.