Monetize
Affiliate Marketing, Search Engine Obfuscation, and Internet Profiteering

Step-by-Step: How to Get BILLIONS of Pages Indexed by Google
Saturday June 17th 2006, 9:44 am Edit This
Filed under: Ramblings and Random Things

As most SEOs know, MSN loves the subdomains. You can make hundreds of keyworded subdomains and MSN will think quite highly of the pages. Same goes for blogspot and other blogs– they do very well on MSN and sometimes on Yahoo. Now Google and the new BigDaddy crawler is showing an even more idiotic preference when indexing and ranking subdomains.

Check out this site: search of eiqz2q.org — depending which datacentre you hit, you will see between 3.8 and 5.5 BILLION RESULTS. Even worse… the domain is EIGHTEEN DAYS OLD. That’s right, in under 3 weeks, one person has managed to get one domain 5 billion pages indexed in Google. And they are ranking, too. That particular domain has an Alexa ranking of under 7,000. Another domain owned by the same person, t1ps2see.com, has between 1.7 and 2.4 billion indexed pages and an Alexa ranking of under 2,000… after 4 weeks. Coincidentally, the sites also have 3 blocks of Adsense ads on each page. I wonder how much that one person is earning per day with billions and billions of pages indexed and ranking?

 5 billion indexed pages
Inspired by his work, I present, The Step-by-Step Guide to Getting Billions of Pages Indexed by Google:

Register a meaningless domain consisting of numbers, letters, and secret symbols. Heck, register a hundred of them.

Setup a server to manage all of your domains and subdomains. It will need to be beefy as you will be serving a lot of traffic in a few days.

Buy as many article databases as you can. Topic doesn’t matter. You might want to search and replace some i’s and 0’s for corresponding ASCII codes to help you avoid duplicate content.

Create or buy a common scraper script. You’ll need it to respond with different articles based on what keyword is hit, effectively serving up new content for each subdomain. It should respond to any subdomain query. Your server should be setup to allow all subdomains to be redirected to your main page; there your script sorts out what content to serve. Effectively allowing you to create an infinite number of subdomains with unique content. Now the trick this guy is using involves subdomains of subdomains. So you create a “topical” subdomain such as music.3hid9gw.org and then on that subdomain you create your actual pages of additional sub-subdomains, like: 2152.music.3hid9gw.org. Because each subdomain and each sub-subdomain is considered a new site by Google, you can get past the “1 page indexed per site” delay for new domains. If you don’t get this part, hire someone… according to the Alexa traceback, it looks like Argentina has the right people for the job.

Launch your blog comment spam attack. Link to some of your subdomains which are also interlinked.

Wait a few weeks… then sit back and enjoy your billions of indexed pages. Be sure to put 3 Adsense or YPN blocks on each page.

Want proof? The spammer with billions of indexed pages definitely moves traffic and now that some domains have been de-indexed, watch the Alexa ranking fall like a rock.

Thanks to Nintendo at DP for pointing out how extensive this network of subdomain spam has become. If there is enough interest shown, I’ll work up a script that would do this and post it for free so we can all enjoy the benefits of subdomain indexing.

UPDATE: This outing made the front page of Digg and del.ici.ous, and is currently the top story at Techmeme and Reddit. Due to the publicity, you can watch the indexed pages of these domains shrink right before your eyes. The current site: count for eiqz2q.org is 700,000,000 down from 5,550,000,000. I assume it will be 0 in a matter of hours. While a hand job will work for now, when will the indexing algorithm be repaired?


After 60,000+ readers, thanks:
 Primehost
148 Comments so far
Leave a comment

Amazing … lets see how long it lasts before getting deindexed.

Comment by Sayles 06.17.06 @ 10:31 am |Edit This

Don’t forget to put up Link Vault :)

Comment by Sascha 06.17.06 @ 11:00 am |Edit This

Bigdaddy should be named Idiotdaddy.

Comment by Mong 06.17.06 @ 11:11 am |Edit This

Nice explanation. This is just one of several problems on Google that will be definitely webmasters will take advantage. Nice Blog gonna bookmark this page :)

Comment by IsulongSEOph 06.17.06 @ 12:25 pm |Edit This

I thank both you and Nintendo for the article. May I raise my hand to be a recipient of the free script you write for this?

Comment by Claudia 06.17.06 @ 12:34 pm |Edit This

Report them!

We can’t have sites like these compromising Google’s marvelous quality of search results now, can we?

Comment by Caroline 06.17.06 @ 1:07 pm |Edit This

wow thats ridiculous now anyone with a decent looking site and a few hours of free time get get rich off of adds and pagerank exploitation. I suppose this is a good/bad thing. Not only that this page is on digg >.>

Comment by Nathan 06.17.06 @ 1:45 pm |Edit This

Does the Host provide - Server-side Stats - what search queries are they coming up for - because without appearing on SERPs with keywords that are likely to be searched for - there will not be traffic

Comment by Search Engines WEB 06.17.06 @ 1:45 pm |Edit This

Could they not then create 301’s on all these pages and link all that content to one page creating a pr9 page and boosting right up the top of the page for a couple of keywords, garnering them even more earnings.

Comment by Michael Scott 06.17.06 @ 1:58 pm |Edit This

I’d rather keep the billions of pages active rather than 301 them. I believe you make much more money off of the “long tail” rather than a few big keywords. Meaning– it is better to rank pretty well for thousands of keyphrases rather than very well for just one or two keywords.

Comment by Alex 06.17.06 @ 2:02 pm |Edit This

Fuckedgoogle.com has lots of articles about fake adsense ads. Hopefully more people see how big of a problem google is when it comes to advertising.

Comment by test 06.17.06 @ 2:05 pm |Edit This

*** blog comment spam attack ***
http://www.spinstartshere.com

Comment by caz 06.17.06 @ 2:26 pm |Edit This

Yes, make and release a script that does this. If Google ends up with half it’s database being SPAM…then they might dump CrapDaddy.

Comment by Nintendo 06.17.06 @ 2:30 pm |Edit This

I desperately need your free script to uplift my brothers and a sister. We are in desperate straits after the death of our father last year.

Comment by Ester S. Nansamba 06.17.06 @ 2:32 pm |Edit This

But Ester, I just transfered $30,000 to your sealed Swiss account! Isn’t that enough? By the way, when are you going to deposit the 20 million you promised me?

Comment by Blockhead 06.17.06 @ 2:45 pm |Edit This

A free script would be the right thing so that this thing can be stopped for GOOD!

Go for it!

Comment by mark 06.17.06 @ 2:48 pm |Edit This

That’s retarded. Adsense/YPN will disable your account before you even get your check.

Comment by Jack 06.17.06 @ 2:48 pm |Edit This

Um, AdSense and Yahoo don’t have to be the only way to make money from this. There’s this thing called PPC affiliate programs!!!!

Comment by Nintendo 06.17.06 @ 2:57 pm |Edit This

There is more to it than just comment spam and one server . . . but you’re on the right track.

Comment by Quadszilla 06.17.06 @ 3:12 pm |Edit This

For people that don’t know what this means:
“Launch your blog comment spam attack. Link to some of your subdomains which are also interlinked.”
It’s basicly seting up a link like this: http://www.ecuadors.org/galapagos/video.html
This website will probably go down after a few minutes! It’s amazing what digg can do. Repeat in another thousand blogs pointing to the same address and Boom!

Comment by Esteban 06.17.06 @ 3:15 pm |Edit This

you guys should write the scripts yourself, after this scam stops working, you’ll have some employable skills.

scripting is about the easiest kind of programming, use Python. it’s got clean syntax, good libraries, and runs damn fast for a scripting language.

Comment by carter 06.17.06 @ 3:33 pm |Edit This

But the content on those sites is so good!

Comment by garraeth 06.17.06 @ 4:39 pm |Edit This

Wow, this really works, I’ve made over $73,000 in the last three hours. At last, a way for me to escape from food coupons!!

Comment by Donny 06.17.06 @ 4:40 pm |Edit This

Long before this guy showed up the whole idea of using links for serps was dead. Link exchanging and buying links are really spaming Google, only on a smaller scale. If Google wants to provide searchers original relevant content they are going to have to find another way to do it.

Comment by Accent 06.17.06 @ 4:44 pm |Edit This

Anyone remember planetsearch.com? Wonder where it went.

Comment by Grakkks 06.17.06 @ 5:52 pm |Edit This

What a load of rubbish. They may have indexed the pages, but none of them come up in key word searches. This article is a lot of nonsense.

Comment by fluxy 06.17.06 @ 5:52 pm |Edit This

If you are gonna use articles, what do you need the scraper for?

And yeah this really works.

Comment by countzero 06.17.06 @ 5:55 pm |Edit This

>>What a load of rubbish. They may have indexed the pages, but none of them come up in key word searches. This article is a lot of nonsense.

Bullshit. Try any of these phrases:

“war on terror pro cons”
“best roast pork loin”
“steak cooking perfectly”

No, they aren’t competitive phrases– but there are billions of them. For many searches, he has more than 1 spot in the top 10 and more than 20 spots in the top 100.

>>If you are gonna use articles, what do you need the scraper for?

You’re right, I’m not much of a black hatter. I meant to describe the sort of scraper-like navigation and interlinking that scrapers typically use. You don’t need to scrape for data if you use articles.

Comment by Alex 06.17.06 @ 6:16 pm |Edit This

Erm, are you so sure about this? That search only results in ~ 1,000 matches. Are you convinced Google has indexed more than 5bn pages from this site when it could just be an index error, and in fact only a couple of thousand pages have been indexed? Might want to check with Google before suppositioning something that is in fact incorrect.

Comment by neuro 06.17.06 @ 6:17 pm |Edit This

Just do a search for “pizza sauce recipe” and go to page 3 and beyond. He has pages and pages of results just for that one phrase. Yea, I’m pretty sure he really is indexed — look at the Alexa numbers! Below 2,000 is quite a feat in under a month.

Comment by Alex 06.17.06 @ 6:19 pm |Edit This

Ah, the fantastic allure of easy money. Nothing like gaming a sometimes deeply flawed system for personal gain! Time to cross-link to my own blog as subtlely as possible… ;p

Comment by Sam Jackson 06.17.06 @ 6:20 pm |Edit This

IM REPORTING THIS TO GOOGLE

I’m not going to let you spammers do this and screw up the search results and profit off of Google

NO WAY, JOSE

Comment by Siddhartha Gandhi 06.17.06 @ 6:23 pm |Edit This

Where are the Adsense ads on these sites everyone talks about, I don’t see them???

Comment by Anthony Cea 06.17.06 @ 6:31 pm |Edit This

Well… It’s nice to see this getting some airtime. I’m really getting sick of these bastards scraping links and other content from my site, and then having those links point to more of their own crap instead of the places they were pointing when they were on my site.

Now, I don’t know what effect, if any, this duplicate content is having on my own ranking, but I’m not taking any chances, and I have, therefore, been seeing to it that these sites are removed from the Google index as I find them. You can expect both of the domains you listed above to be removed from Google’s index within the next week.

Yes, I know they will just do it again with more and more and more and more and more domains, as they have continued to do every time I disable a few of them, but I’m pissed, and I will continue my assault. If any of you know how to do what I’m talking about, I would encourage you to do it as well whenever you find one of these sites. If not, well… ahem… maybe you could just report them as spam to Google (if they’ve got a way to do that…I think they do).

Don’t bother asking how I remove them. There’s exactly 0 possibility that I’ll be talking about that, but if you do know, then please help me get these fux0rz delisted.

If you own a fairly well-trafficked domain, then you might have experienced this yourself. They scrape tons of content from certain sites and place self-serving links using the exact content and anchor text from your site, showing the URL that your site links to, but not actually linking to it. I can’t imagine this has any positive effect on the sites from which this content is scraped, so I’m doing as much as I can to fix it when I see it. I encourage you to do the same.

Peace.

Comment by ahahahaaa 06.17.06 @ 6:31 pm |Edit This

>>Where are the Adsense ads on these sites

He changed the sites around while we’ve been discussing them. The eiqz2q.org domain used to have 3 blocks of Adsense per page. Now it redirects to the other domain, which leads to more MFA’s except this time showing YPN.

Comment by Alex 06.17.06 @ 6:37 pm |Edit This

Where are the Adsense ads on these sites everyone talks about, I don’t see them???

Right after people started to discuss the issue, they moved the adsense one level deeper Anthony.

Click on any of the ads on t1ps2see.com, and you’ll see the Adsense.

Comment by Cristian Mezei 06.17.06 @ 6:38 pm |Edit This

Today’s billion is tomorrow’s million.

Comment by PJ 06.17.06 @ 6:42 pm |Edit This

So are these pages simply indexed or are they actually coming up in any keyword searches?

Comment by Anthony Cea 06.17.06 @ 6:46 pm |Edit This

LOL. Pot. Kettle. Black. Take a look at Nintendo’s sites - he’s one of the biggest search engine spammers around. And that thread at DP is a showcase of the lowest IQ’s of the Net.

Comment by Joe 06.17.06 @ 6:47 pm |Edit This


Suck an egg Joe, tell us your sites coward???

Comment by Anthony Cea 06.17.06 @ 6:51 pm |Edit This

http://71370.c.geku8h.org/

Is this his site Alex that I am picking out of the results you posted?

Comment by Anthony Cea 06.17.06 @ 6:55 pm |Edit This

Here’s the link on Google to submit a spam report: http://www.google.com/contact/spamreport.html

Please use it to report these ass-hats. Maybe use the links Alex posted (above this post) to show them that these spam pages are coming up for regular searches, and aren’t just buried somewhere deep in the Google indexes.

Comment by ahahahaaa 06.17.06 @ 6:56 pm |Edit This

Yup, that is it. Pretty much anything with two subdomains and a number up front.

Comment by Alex 06.17.06 @ 6:56 pm |Edit This

>>Alexa ranking of under 7,000

I just checked the Alexa ranking and it says 633,948. Did Alexa recalculate its ranking for the domain name eiqz2q.org?

Comment by Ian 06.17.06 @ 6:57 pm |Edit This

re: Alexa

They are on such a sharp upward spike that the overall ranking hasn’t changed to reflect the 1-day and 1-week ranking yet. 1-week is 13,000 and 1-day is 6,500 for that particular domain. Well into the realm of not-able-to-be-easily-faked.

Comment by Alex 06.17.06 @ 7:02 pm |Edit This

http://internetnamebank.com

Now THERE’S a spam site. I have just reported it to Google and others. Everybody else follow suit.

Comment by Joe 06.17.06 @ 7:03 pm |Edit This

No wonder I see alot of these in my google result recently. Lots of weird subdomain name!

Comment by John 06.17.06 @ 7:58 pm |Edit This

Well, it seems there are only 3.8 billion pages now, so Google’s probably getting rid of them quite rapidly.

Comment by Jimmy 06.17.06 @ 8:03 pm |Edit This

I’d love to see a free script up to do this. More information would be nice too, your article was a bit simple in the sense it didn’t explain in great detail.

Comment by Me 06.17.06 @ 8:40 pm |Edit This

Hell yea, please do the free scrip. I will donate some money on it (i am sure other will to) and make it worth your wild;).

Comment by thomas 06.17.06 @ 8:54 pm |Edit This

Hey people, Adam (Google employee) just sent us the breaktrough and “normal” resolve :

We have noticed that some site: queries are showing bizarre results and it’s turned out to be tied to a bad data push. We’re fixing it now.

Weird right ?

I commented that on my blog.

Comment by Cristian Mezei 06.17.06 @ 9:26 pm |Edit This

I wonder how long this will stay in the SERPs. Cute idea though - sort of a pity its been “outed”

Comment by frak 06.17.06 @ 10:25 pm |Edit This

I’m pretty sure I know what happened (and no, we didn’t crawl nearly that many pages from these sites). I’ll ask someone to check and make sure it’s what I think it is.

The “site:” estimates may look large, but if it’s what I think it is (an interaction between some faulty data and results estimates), then it’s not causing much harm, but we’ll still get the results estimates corrected.

Comment by Matt Cutts 06.17.06 @ 11:34 pm |Edit This

U fukers! Leave me alone! U R takin $$ out of my pocket!

Comment by Nunya 06.18.06 @ 12:52 am |Edit This

can i have the script as well plz

Comment by Ahmad 06.18.06 @ 2:07 am |Edit This

That is such mis-information you are spreading! And what is SOOOOOOOOOO anoying is that you encourging dumb newbies to try out spam sites.

It is actually very difficult to make money with grey hat or scraper sites. It is only the people who done it for over a year that have some idea.

Now, the one site you refer to ….I am very eager to see what he did. That was something especially tric-full.

Wow you just got 100s of new spammers thanks to your post.

Comment by joe 06.18.06 @ 2:48 am |Edit This

Aay eiqz2q.org joined the co-op. With 3,780,000,000 URLs indexed in Google, how much weight do you think the site would get?

Comment by Mr Brown 06.18.06 @ 3:58 am |Edit This

Matt Cutts wrote:

>>I’m pretty sure I know what happened (and no, we didn’t crawl nearly that many pages from these sites). I’ll ask someone to check and make sure it’s what I think it is.

Hi Matt, I’m assuming that is really you even though you’re on vacation. Sounds like you and the IP is from CA, so we’ll play along.

Sorry, but I believe you have indexed at least millions, if not billions, of pages from these domains. Check out the searches linked above like “pizza sauce recipe” — go to page 3+. C’mon now, it sure looks like you’ve indexed a lot of that site! How else do you explain the Alexa rankings of 2,000 and 7,000 after a month? That is a a LOT of Google traffic they are receiving. Whether it is billions or millions, these sites are well indexed and ranking after only a few weeks… a definite problem in the spam filtering of BigDaddy’s crawler.

Comment by Alex 06.18.06 @ 4:52 am |Edit This

And it’s out of the Google index. I guess now they do it all over with another domain?

Comment by Spamhuntress 06.18.06 @ 5:06 am |Edit This

That comment is (I think) not from the real Matt Cutts. Would the real Matt Cutts put a dead link in the comment URI field: http://www.google.com/blog/ ?

Comment by Percept 06.18.06 @ 5:48 am |Edit This

re: Matt Cutts

Yeah, dead link. The IP traced back to Santa Clara so I went with it.

Comment by Alex 06.18.06 @ 5:55 am |Edit This

hmmm still shows up here, but with less indexed pages (about 500 mio). i guess this whole story will “create” an army of new spammers. will keep google busy…

Comment by Daniela 06.18.06 @ 5:56 am |Edit This

Is that how About.com keeps getting to the top of Google?

Comment by Robin 06.18.06 @ 5:56 am |Edit This

Joe, has anyone told you you were an asshole of late, if not let me help out a bit.

A man with no arms could post more logic than you ya stupid fool.

Comment by Anthony Cea 06.18.06 @ 5:57 am |Edit This

Interested in the script of course. When will it be available?

Comment by Bruno 06.18.06 @ 6:31 am |Edit This

Hee-hee, you said ‘hand job’.

This is the most hilarious PR disaster to happen to Google since it started losing our pages - serves the buggers right.

Pete

Comment by GADOOD 06.18.06 @ 7:54 am |Edit This

There is one advantage to hand-jobs; they always work!

Comment by Alex 06.18.06 @ 8:10 am |Edit This

Thank you for this article. For 2 months I try to index 50 pages with images. This guy was able to pump 500,000,000 pages of garbage on Google.

Comment by AlexBroker 06.18.06 @ 9:10 am |Edit This

I mentioned this before, since google isnt crawling as deep for some people the are just goign to setup one page sites, looks like people are already taking advantage of bigcrappy.

Comment by John 06.18.06 @ 9:56 am |Edit This

Well, the key point is this method provides unique content for each page, something that all my work with my client, will only 11 million pages, did not do, even though we thought we were following Google’s instructions, they have other bias that are not so easy to understand, such as a threshold of duplicate content where they don’t index or drop out the pages after they index them - which is probably what is happening to my client.

No doubt, this technique, that your talking about here, will be filtered out of the index eventually when Google figures out what happened.

Comment by Webmetricsguru 06.18.06 @ 10:32 am |Edit This

I can totally atest to a growing number of spam sites indexed by google. it is soooo annoying!

Comment by hacker not cracker 06.18.06 @ 10:43 am |Edit This

The only way google will cut back on spamdexing is if they stop doing updates. Everytime they roll out something new it only helps the bh sites get higher rankings and more pages indexed while everybody who plays by the rules is left wondering why google has dropped 80% of their website’s pages. This is just 1 domain but I have come across many other domains with over 200,000 pages indexed and all of them were pure spam.

Comment by beandip 06.18.06 @ 11:26 am |Edit This

Banned! :D

Comment by GodSigmA 06.18.06 @ 12:30 pm |Edit This

Kicked!

Comment by Steven 06.18.06 @ 12:35 pm |Edit This

Ok that site is no longer indexed on google :D Hot daddy deindexed it! :D

Comment by angelo 06.18.06 @ 1:07 pm |Edit This

all his steps are also explained here:
http://anaaman.blogspot.com/2006/06/moldavian-blackhat.html

Comment by Bachante 06.18.06 @ 1:41 pm |Edit This

Anaaman has a good explanation; perhaps a bit clearer than mine but he has had the benefit of time. Would’ve been nice if he linked back to any of the original sources.

Comment by Alex 06.18.06 @ 1:43 pm |Edit This

You knew it wouldn’t take to long before the goo banned him; they probably got a ton of abuse reports. This person wasted a good amount of time (even if it’s little) and some good backlinks for whatever it is he made.

Comment by LiquidChain 06.18.06 @ 2:05 pm |Edit This

The site: search doesn’t mean anything for numbers - I have a site with 3076 pages (exactly) (and, no, its good stuff, the result of much hard work) and Google currently shows 19,300 pages in a site search.
Hows that for a factor of 6 out? And its been worse!

(Not that I mean these guys aren’t/weren’t spammers!)

Comment by Lea 06.18.06 @ 3:50 pm |Edit This

The script is already out. However, it is not free.

Comment by Mike 06.18.06 @ 4:59 pm |Edit This

block by GOOGLE soon,i think!

Comment by 虚拟主机 06.18.06 @ 7:13 pm |Edit This

Great stuff, keep challenging Google to make sense of the Monster they’ve created :o )

Comment by Hempathy 06.18.06 @ 7:51 pm |Edit This

hmmm…. something seems wrong, I thought search engine spiders don’t do javascript. Adsense are embedded with javascript.

Comment by Anonymous 06.18.06 @ 11:01 pm |Edit This

Wow, cool., hope you wil release the script soon :)

Comment by prasath 06.18.06 @ 11:28 pm |Edit This

OMG! un cialtrone bello e buono! that’s really funny. (idiotDaddy… LOL!)

Comment by cialtrone 06.19.06 @ 12:35 am |Edit This

Was that Cutts or not ?

Comment by Cristian Mezei 06.19.06 @ 12:45 am |Edit This

its nothing new. insiprals.co.uk have been deploying the smae techinique for over 4 years which is why they are at the top of the competitive search term ’silver jewellery’

Comment by andy 06.19.06 @ 1:18 am |Edit This

Hi

Does anyone know why this affects Google more than other search engines ? Is this a flaw which is unique in the pagerank algorithm ?

Comment by David 06.19.06 @ 1:45 am |Edit This

“We have noticed that some site: queries are showing bizarre results
and it’s turned out to be tied to a bad data push. We’re fixing it
now.”
“Adam Lasnik at Sun, 2006-06-18″
http://www.threadwatch.org/node/6999

Comment by Robin 06.19.06 @ 3:08 am |Edit This

Here’s a list of other domains that this spammer has on their server

05s9.info
16jk.info
1er6.info
207f.info
2cxa.info
37rs4.info
7r31.info
d7n0s.info
eiqz2q.org
ffd7.info
gsw0.info
h13lu.info
hdtl0k.org
ivmcqo.org
j153.info
mcl17.info
mik8.info
mro196.org
n774.info
odyzfe.org
rfni70.org
s358v.info

Mybe someone can make sure these domains are shut down as well.

see http://www.domaintools.com/reverse-ip/?hostname=eiqz2q.org

Comment by Muzza 06.19.06 @ 3:25 am |Edit This

Amazing goDaddy? Where does he goes after?

Comment by Steve 06.19.06 @ 6:03 am |Edit This

Looks like its not in google any more..

Comment by josh 06.19.06 @ 6:54 am |Edit This

the domains are already banned from google!

Comment by alex 06.19.06 @ 8:01 am |Edit This

As Muzza pointed out, there are thousands more to go.

Not to mention legitimate sites with subdomains are also clogging up the results. (Thanks to CrankyDave for the heads up on that one.)

Comment by Alex 06.19.06 @ 8:06 am |Edit This

This is just the tip of the iceberg. Big Dud does not know how to handle subdomains. If theres not a whole lot of folks at the plex scrambling to figure out how to fix this, the “bad data” is going to continue to “push” legitimate sites out of the index.

Dave

Comment by CrankyDave 06.19.06 @ 8:16 am |Edit This

I bet this guy will get his adsense account terminated as well. Dont think they look too kindly upon spam.

Comment by Anders 06.19.06 @ 8:45 am |Edit This

Google and the other big SEs need to attack it from both sides.
From the index (not allowing spam to be indexed) and also from the side of the profitability.

MSN, Yahoo and Google along with all the other PPC services need to be sure that they are not paying to have sites like this indexed. Maybe they need to hold the payments on new domains or channels and hand review payouts for large $amounts to audit the quality of the traffic they they are in effect paying for.

Comment by Arius 06.19.06 @ 9:45 am |Edit This

I can't verify this as true or false, but it was posted on an underground SEO forum that the Adsense ID used on this round of domains is the same Adsense ID this person was using in 2004. If that is the case, it would seem the only penalty involved is de-indexing and not Adsense cancellation.

Does anyone think this was the first round of sites from this person? Absolutely not, it looked like a well-refined scripting masterpiece (if you consider gaming Google to be praiseworthy). It is a fair bet that this is not the first time they have monetized their auto-generated junk with Adsense.

Comment by Alex 06.19.06 @ 9:52 am |Edit This

Does anyone think this was the first round of sites from this person? Absolutely not, it looked like a well-refined scripting masterpiece (if you consider gaming Google to be praiseworthy). It is a fair bet that this is not the first time they have monetized their auto-generated junk with Adsense.

Exactly. This was no fly-by-nighter. It was a well thought out and constructed gameplan. They knew exactly what they were doing. They saw the chink in the armour and exploited it.

This is by no means the first or last exploitation of this means/method. Until (if) Google is able to handle this type of onslaught algorithmically it will surely continue.

Stop and think how “adept” they are at handling the spam reports they recieve now. Does anyone really think they’re anymore able, or willing, to handle this sort of thing any better?

Comment by CrankyDave 06.19.06 @ 10:59 am |Edit This

Looks like it isn’t in google any more, but nice idea :)

Comment by Hife 06.19.06 @ 2:12 pm |Edit This

Well spotted Alex and Nin. It’s incredible how full of wholes BigD is. I have seen a similar thing for WordPress done by Hammer at http://www.biggnuts.com/. But that one only generates the first sub-domain. It should not be too hard to modify to support sub-sub domains. Interesting thought.

Comment by Bjorn Solstad 06.19.06 @ 5:27 pm |Edit This

Sites like simplyhired are doing the same thing.

Comment by Will 06.19.06 @ 5:27 pm |Edit This

every domain this spammer uses is now banned foreven from Google. appart from that his Adsense account is also banned.
so is it worth it?

Comment by Anonymous 06.19.06 @ 8:45 pm |Edit This

A great scraper script to use for this method is http://www.webcommand.us

Comment by Valheru 06.19.06 @ 10:03 pm |Edit This

Google is bugged!

Comment by Paolo 06.19.06 @ 11:14 pm |Edit This

some time back alexa search of googlesyndication.com was giving a whole lot of chinese sites… which means that most of the adsense ads are clicked by th e chinese… heh heh.

Comment by Anonymous 06.20.06 @ 6:23 am |Edit This

Is it just me or are they now not deindexed?

Comment by Rose DesRochers 06.20.06 @ 7:50 am |Edit This

google shows no result for that domain….

Comment by trentino 06.20.06 @ 8:33 am |Edit This

Your search - site:eiqz2q.org - did not match any documents.

Comment by brian 06.20.06 @ 10:53 am |Edit This

>>google shows no result for that domain….

>>Your search - site:eiqz2q.org - did not match any documents.

GOOGLE IS REMOVING HIS DOMAIN’S MANUALLY. The hole is still open and there are hundreds, if not thousands, of the same person’s domains still indexed. They have not fixed anything about the algorithm, Google has simply applied penalties to these domains so far known to remove them from the results.

Comment by Alex 06.20.06 @ 10:56 am |Edit This

Wicked! So theres an advantage for search engines not indexing sites as much/often. heh

Comment by Jason 06.20.06 @ 7:14 pm |Edit This

Amazing. Got to try.

Comment by Adrian 06.20.06 @ 9:54 pm |Edit This

doesnt anyone read the comments above?? At least 3 or 4 people said its not in the index anymore? Why are you letting us know?

Comment by readtheabovecommetns 06.21.06 @ 1:16 am |Edit This

wow that’s insane. I don’t anything about scripting so i wouldn’t know the first thing to do lol

Comment by wwe divas 06.21.06 @ 3:37 am |Edit This

And there is a TEAM of programmers in Argentina working on another domain, and anotehr, and another????

Makes me wish for the good old days when the most sophisticated thing people could do is add keywords in little white letters!

WE’VE COME A LOOOONG WAY, BABY!

Andrew Larder
http://www.MarketingSharks.com

Comment by Andrew Larder 06.21.06 @ 8:10 am |Edit This

“If there is no doorz - Google SERP will be empty” (c)

How the cool Google algorithm can make it possible? Just couple of symbols changed and there is “unique” content.
BS, I’ll never pay 1c to adsense!

Comment by Andrey 06.21.06 @ 9:57 am |Edit This

And to think I spend all this time doing “legit” marketing and I struggle, like most of us, to get ahead in the rankings. Spam should stay in a can.

Comment by Nat 06.21.06 @ 5:06 pm |Edit This

Now google has a new scam going on under the table. Looks to me to be an inside job. I can’t see how one site can be relevant for every US city. Either google is seriously broken or the employees are playing god with the results with their own or friends websites. There is no logical explanation on all the suspicious crap in google.

For me any search for “city name here” car insurance and “city name here” auto insurance comes up with kanetix.com on top as #1 or very search or close to the top. The pages are not at all relevant except for a mass replace for the city.

Im talking ALL cities for every state I checked.

rockport car insurance
leander car insurance
santa fe car insurance
los alamos auto insurance

You name it this kanetix.com site is on top. I don’t think it pulling in massive traffic but it isn’t right either way. This is how the scammers usually operate - under the radar.

Comment by bob 06.21.06 @ 9:13 pm |Edit This

Impressive! Hardly a spammer but they have a master of SEO. Those listings might not generate huge amounts of traffic numbers at the moment, but they are quite valuable. If you look back in their Alexa, they did have a big spike a few months ago where they probably surfaced for a more generic term without the city name. The site has been online since 1999:

http://whois.domaintools.com/kanetix.com

The backlinks look solid:

http://search.yahoo.com/search?p=linkdomain%3Akanetix.com

I haven’t gone through the site entirely but it looks like they sell insurance leads to proper brokers after collecting them from the web. No harm in that, a good business model with excellent search engine listings. Why do you consider that to be a spammer?

Comment by Alex 06.22.06 @ 5:19 am |Edit This

This is great! I’m going to buy domains now.

Comment by clazywade 06.22.06 @ 10:14 am |Edit This

Alex, it was me commenting before. We certainly didn’t index millions of pages from that domain. The results estimates were broken (by like six orders of magnitude) because some faulty data interacted poorly with site: results estimates.

Looks like the results estimates are working again now though.

Comment by Matt Cutts 06.22.06 @ 5:41 pm |Edit This

Thanks for the confirmation, Matt.

I think we’d all agree that indexing over 7 billion pages in under 4 weeks would be a feat for any crawler. If that were the case, you’d have something to be proud of simply because of the speed of the bot. Regardless of how many it was, the site: operator is partly to blame.

The traffic from his listings was significant. I’m glad to see it (mostly) cut off now and I hope we see a solution to the subdomain indexing algorithimcally soon.

Comment by Alex 06.22.06 @ 5:49 pm |Edit This

Alex - I don’t consider a website that games a SE a spammer. Spam is unsolicited information in my book. Since Google provides the results then they are to blame if a used tire site comes up on a search for bras. It’s not spam - it’s a shitty algo at work. It’s lack of quality control.

I’m saying it’s a scam because kanetix.com can’t possibly be relevant for every city in the usa - especially as an affiliate of comparisonmarket.com. I see no difference between this site and the ones that were banned. COmpare the content. They both work to send the user to another website for revenue and have no relevant info whatsoever. There is no SEO that can do that unless it’s black hat or a google insider making it happen. Now if those results are legit then google is seriously broken. May as well replace the results with dmoz. I see this happening alot in google. A site ranks high for a top keyword so google gives the site universal top rank for any related variations of that keyword.

Comment by bob 06.22.06 @ 7:49 pm |Edit This

Happy to confirm it, Alex. Yeah, 7B pages would be a feat if it really were that much, but it was much much fewer actual pages crawled–just bad “site:” estimates made it look like more.

And now I’m off to digg to see the other post about it. sigh.. ;)

Comment by Matt Cutts 06.22.06 @ 7:50 pm |Edit This

>>A site ranks high for a top keyword so google gives the site universal top rank for any related variations of that keyword.

Bob, I won’t argue that at all. Sorry I mistook your word scam for spam. As an affiliate, I’m impressed, but as a web searcher, those are disappointing results. I couldn’t personally find anything that jedi about their SEO. It stinks a bit like doorway pages and it is thin on value-added content.

Matt, welcome back.

Comment by Alex 06.22.06 @ 7:57 pm |Edit This

Officially I’m still on vacation. :)

Comment by Matt Cutts 06.22.06 @ 7:58 pm |Edit This

Pondering the implications of this subdomain technique, for bloggers, tagging with WordPress, etc…

http://tracking.knowingart.com/116/subdomains-for-google-tag-youre-it/

Comment by PJ at Knowing Art 06.22.06 @ 8:10 pm |Edit This

PJ, thanks, I read your article earlier. It is a clever story about conforming to what Google thinks your topic is simply because they are sending the traffic anyway. I think you can break out of the MySpace tag but it would be more trouble than it is worth since you are probably overwhelmed with the viral MySpace backlinks.

Comment by Alex 06.22.06 @ 8:17 pm |Edit This

If anyone is looking for a legit way to target the long tail of search, check out http://www.hittail.com — the first site to help you zero in on the correct terms for genuine non-automated writing and targeting. It doesn’t always have to be black hat to have the advantage of the inside track.

Comment by Mike Levin 06.23.06 @ 6:50 am |Edit This

There is a comment by test referring to a fuckedgoggle.com website, and I see many pages indexed for it (through yahoo, mind you), but they seem to be gone now. Crushed under the big G’s boot?

Comment by Zeal 06.23.06 @ 7:07 am |Edit This

Great post.
Emilia Coche

Comment by Maria Coches 06.23.06 @ 8:54 am |Edit This

Man this is crazy. I just wish doing something like this would last. You know bigdaddy already took care of this little problem. :)

Comment by Anonymous 06.23.06 @ 10:15 am |Edit This

I think your retarded. Its not that easy. I would sure like to see “billions” of anything. Your full of so much deficant, your skin pigment is darkening.

Comment by mike smith 06.23.06 @ 12:17 pm |Edit This



Leave a comment
Line and paragraph breaks automatic, e-mail address never displayed, HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>