20 Jan

One thing I deal with all the time is duplicate content penalties. If you do even basic forms of blackhat, you’ve heard of blog automation, content scraping, RSS feeds and the like. Everyone with half a brain cell knows that there are duplicate content penalties in Goog’s algorithm that keep scraper sites from ranking too well. But the real question is: how strictly does Google decide that your content is “duplicate”?

A while ago (we’re talking about 2 years back), when I was just getting started in Blackhat SEO, Brad101 from WF introduced me to something that I like to call the 30% rule. He taught me that if 30% of the content on a page is unique, you should pass most of the duplicate content penalties. At first this made a little sense, but not enough to try, so I started to dig a little deeper. How exactly do news sites get away with it? Most of what news sites publish is duplicate content anyway. Is it a per-domain uniqueness factor? If so, the 30% rule would work great! What if it’s a per-page factor (and there’s already enough proof that this is most likely the case)? Then the 30% rule would work great as well!
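To make the 30% rule concrete, here is a minimal sketch of one way to measure how much of a page is “unique” compared to the source it was pulled from. Everything in it is an assumption for illustration: the function names, the 3-word shingle size and the idea of counting word shingles at all. Nobody outside Google knows how the real check works, so treat this as a rough yardstick, not a replica of the algorithm.

```python
import re


def shingles(text, size=3):
    """Split text into overlapping word n-grams (shingles)."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return [tuple(words[i:i + size]) for i in range(len(words) - size + 1)]


def unique_fraction(page, source, size=3):
    """Fraction of the page's shingles that do not appear in the source."""
    page_shingles = shingles(page, size)
    if not page_shingles:
        # Too little text to measure; treat it as having nothing unique.
        return 0.0
    source_set = set(shingles(source, size))
    unique = sum(1 for s in page_shingles if s not in source_set)
    return unique / len(page_shingles)


def passes_30_percent_rule(page, source, threshold=0.30):
    """Hypothetical check: is at least `threshold` of the page unique?"""
    return unique_fraction(page, source) >= threshold
```

A page copied word for word scores 0.0 and fails the check, while a page that shares no 3-word run with the source scores 1.0. Anything in between depends on how much original text is mixed in with the copied part.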

Once again, if duplicate content detection was so harsh that it worked per paragraph or something, then news sites would be penalized all over. A lot of people would read that and say “but usually the backlink to the author clears penalties”. Ta-da! Think about that. Could that possibly be why autoblogs work so well *gasp*? :) If you post duplicate content and link back to the author, most of the time it’s a win/win. The author is -usually- happy because of the backlink, you’re happy because you may actually rank above the original site thanks to your authority, and you can do this in mass quantities.

There are a few tips and tricks you’ll find as you mess with these more and more. One of the things that I’ve found quite interesting is that links themselves count as content, yet don’t trigger duplicate content penalties. Obviously I’m not 100% sure on that one, but my sites wouldn’t be doing so well otherwise. If you read my EPN experiment, I was able to send around 55,000 total users from Google to eBay Partner Network via my splogs. Those were actual click-throughs to EPN, not just visits to my sites. On my sites alone I probably pulled a little over 100,000 uniques/month, which equates to almost 3500 uniques/day. Not bad for a total setup time of a little under an hour, huh?

Don’t always assume that Google is going to fuck you. You shouldn’t be afraid of the algorithm, you should be afraid of the manual reviews. This is just my opinion, of course, and Google is getting smarter by the day. Have fun, and try to stay ahead of the curve!

Categories : Affiliate Marketing, Blackhat Explained, Blog Automation, SEO/SEM
Comments
January 20, 2009

I don’t think Google is nearly as good at detecting duplicate content as people think.

Posted by xentech
January 20, 2009

That’s the main reason I say try it, most people will be surprised ;)

Posted by Contempt
January 20, 2009

One of my blogs, the one with the highest PR (of 4), has ALL duplicate content…just copied and pasted from other sites.

January 21, 2009

The word penalty kills a lot of people.

I think it’s more like “content being the same, which site looks like more of an authority and will give the better user experience? Let’s show it first”, than it is “oh, let’s slap this loser, his stuff is just like BillyBlogBoy over there”.

It’s back to links, etc and everything else Google loves.

Great read, O Most Contemptuous One.

Posted by Eric
January 23, 2009

Agreed – Google’s in this position where it’s not able to auto-penalise folks that republish content because this is basically all that news sites do. Until they figure out a way to sort that (and I don’t think they ever will), we get to make hay while the sun shines.

Posted by Shark SEO
January 30, 2009

Awesome post. Thanks

Posted by Perma
January 30, 2009

Now, I’m wondering, can the linkback be to the originating domain/subdomain to be ok? Or does it have to be the specific page?

Posted by Perma
January 31, 2009

Awesome post. When linking back to the original author, do you think it’s enough to link back to the originating domain/subdomain or must the original article be linked?

Posted by Perma
February 3, 2009

HELO PERMAHZ. I LIEK TEH MULTI QUESTION ASKZ TOO.

Usually I do the full URL.

Posted by Contempt
February 5, 2009

I agree.. great post. People look to Google as this ub3r l33t algorithm that can ub3r detect any type of spam.. so very untrue. Google just detects the obvious and the rest, well, that’s the beauty of SHPAM. w00t.

Posted by sgtryan
February 18, 2009

@monty, PR is nothing to do with duplicate content

Posted by xentech
May 13, 2009

I also post some duplicate content on my sites, but I’ve got some original content as well. I’m still at PR 0. How many pages/links should I have to get to PR1? If I just get a lot of duplicate content, would my PR go higher or lower? My website is at http://hellosixpackabs.com if you guys could help a brother out with some advice. Thanks.

Posted by Bertus
May 14, 2009

Don’t worry about anything but results, is it getting traffic or not? PR is nothing but a number. :)

Posted by Contempt
