21
Feb

Automated link building is obviously one of the key tools of a blackhat SEO. Most people don’t define the different types, so I figured I’d do a little cheat sheet on the types and then go into a little more detail on how they work and how serious they can be. There are the basics like referral spam, comment spam and trackback spam. One that not many people talk about is creation spam. This probably goes by a different name for each person, but that’s what I’ve usually called it – even back when I was running the blog Rephraser.

Referral Spam is the art of getting a link from a site that lists its most recent referrers. This could be from sites that just put them on the sidebar, or even AWStats / Webalizer stat pages. Here are a few fun ones for you all to look at: some .edu links (you have to sort through them, obviously; 226 at the time of this post) and some .gov links (once again, manually look at each one; 11 at the time of this post). I highly suggest you think about these footprints before actually using them. I am giving you these for research only (obviously), and I won’t be held responsible for your actions if you do use them. Referral spam for these types of links simply means hitting the URL over and over again with your URL as the referrer. Most of them will give you a link back if you’re in the top ~10. Not too shabby, huh? This method usually does -not- require multiple proxies to work.

Comment Spam is the next method and is obviously a little better known. It’s leaving comments on blogs, guestbooks, etc. with your anchor and URL in there somewhere, creating a link. Usually this is great on WordPress blogs that DO NOT have Akismet (props to you guys – wish I met you in Vegas! :) ). This is all of those beautiful comments talking about Cialis and online dating in the same comment, conveniently linking. Also, a lot of comment spammers nowadays use a method called seeding (much like email seeding), in which they post with specific keywords and strings, scrape the Goog results to see which sites indexed them, and then blast those all at once because it’s almost a guaranteed link drop. This is great for the 10,000 links in an hour type of blast. The two factors that people need to think about with link blasts are not only IP addresses, but destination URLs as well. Most people don’t think that it’s throttled by URL, but sometimes it is. Keep an eye out.

Trackback Spam is my personal favorite. TB spam usually gets through a lot more, and I’ve never seen a trackback request be captcha’d. The one thing I’ve seen to stop it is the “is the site actually linking to us” plugin, which is easily circumvented via a closed hidden div with every URL of every link drop. After the drops are in, remove the URLs – easy circumvention, and it won’t affect the user’s experience. This is also a nice little thing to do with Splog Ping Crawl if you’re up for the modification: modify it so the links sit in a drop-down, so to the user they’re hidden, but in Google’s eyes it’s not blackhat because it’s not cloaked, and it’s not getting in the way of your user’s experience. Great way to help keep it whitehat but still get blackhat results. Zing!

Creation Spam is one of my up-and-coming techniques that actually works a lot better with trackback spam filtered in, but it’s basically a way to fast-track Eli’s Keyword Real Estate. My definition of creation spam is signing up for a blog, free hosting, whatever – and posting a full site with content and links to another one of your sites. The goal is to make this look as whitehat as possible. We’re talking about a YACG-type site with just a Wikipedia scrape, copied content, and links to your sites. The reason behind this is that when you combine it with Trackback Spam, you have a great way to filter and cycle your link farms. Throw 1000 links at one of these bad boys, then have your site linked from it with 2-3 keyword-optimized links, and you’ll be passing juice “legitly” in no time. The best example of this is a whitehat ecommerce site; we’re talking about 600K product pages on a 2-3 year old aged domain. Use creation spam for around 100 sites, drop 1000 links to each (100,000 links’ worth of juice flowing toward the site in total – theoretically, obviously) and then maybe throw in 100 social bookmarks and 2000 directory submissions on the main domain as well for some cover. This results in a good flow of juice to the site, easily covered by other sites, and should keep you afloat for a while. For me, I usually use these 4-5 levels deep (1000 creation spam sites to 100 to 10 to 1, etc.).
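For anyone who wants the numbers in that example spelled out, here is a quick Python sketch of the arithmetic only. The function names are made up for illustration, and the 10-to-1 ratio between levels is an assumption read off the "1000 to 100 to 10 to 1" example rather than a fixed rule.

```python
def total_links(sites: int, links_per_site: int) -> int:
    """Theoretical link count when every site receives the same number of links."""
    return sites * links_per_site


def funnel(top_level_sites: int, ratio: int = 10) -> list[int]:
    """Site counts per level, shrinking by `ratio` (assumed 10) until one site is left."""
    levels = [top_level_sites]
    while levels[-1] > 1:
        levels.append(max(1, levels[-1] // ratio))
    return levels


if __name__ == "__main__":
    # 100 sites with 1000 links each, as in the example above.
    print(total_links(100, 1000))  # 100000
    # Cover links on the main domain: 100 bookmarks plus 2000 directory submissions.
    print(100 + 2000)  # 2100
    # 1000 sites at the top level narrowing down to the one main site.
    print(funnel(1000))  # [1000, 100, 10, 1]
```

Note that the funnel from 1000 down to 1 comes out at four levels, which matches the low end of the "4-5 levels deep" figure.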

Now comes the real fun part. Let’s talk about scaling out. Usually personal link spam programs do one site at a time, maybe a list of a few hundred sites. My personal arsenal runs at a rate of around 3000-4000 sites/minute if the list is pre-determined. This means – best case scenario – 4000 links per minute. This doesn’t mean 4000 to one site; I can split them however I want. The way I have this coded is that I optimized the web classes, threw in multi-threading, and added database caching on high-RPM drives. I try to keep it as fast as possible. Then I went ahead and made it work with multiple boxes. I can run 10 dedicated servers, each doing 4000 a minute – that’s 40,000 sites/minute combined. Talk about some nice links, huh?
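The combined figure is just boxes times per-box rate. A tiny Python sketch of that arithmetic, using the 3000-4000 range quoted above (the function name is hypothetical, and it assumes every box sustains the same rate, which is the best case):

```python
def combined_rate(boxes: int, per_box_per_minute: int) -> int:
    """Best-case combined throughput, assuming every box sustains the same rate."""
    return boxes * per_box_per_minute


if __name__ == "__main__":
    low = combined_rate(10, 3000)
    high = combined_rate(10, 4000)
    print(f"{low}-{high} sites/minute across 10 boxes")  # 30000-40000 sites/minute across 10 boxes
    print(f"{high * 60} sites/hour at the top end")  # 2400000 sites/hour at the top end
```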

Another scale that most people don’t consider much anymore is forums. This is obviously due to the XRumer spam (which I must admit, I like :) ) and a lot of speculation that forum backlinks are de-valued as a result of it. Don’t count them out completely though – they can still be used to cover up link spam in the backlink overview tools like linkdomain on Yahoo. This should keep prying eyes away from your actual links that help you do some damage. Just because they’re penalized doesn’t mean they won’t help. Keep that in mind. After all – a link is a link.

Now most people will tell you that I go overboard with my operation – but as most of you already know, I like to be very effusive. My current setup allows me to scale almost infinitely. Currently I run a network of spider boxes and hub boxes, that just keep looking for new links day in and day out. As I said above the 3000-4000 a minute is a true number – but what I didn’t mention is that’s per box. Just to give you an idea of what can be achieved, yes – I do have a network like this running and mainly I use it for research and of course because I find it fun. Don’t think that all blackhats are limited by out of box applications. This is completely coded from scratch across a programming language, a scripting language and of course I use MySQL for the database portion.

I just wanted to throw that in there to help put in perspective what’s actually possible. I hope you all enjoyed the read, and I know a lot of you were on my ass about a new post – I hope you think this was worth the wait! Feel free to use the Request Post link to request any upcoming posts and/or give me ideas for future posts, maybe I’ll get a few more out soon. :)

Categories: Blackhat Explained, Blog Automation, SEO/SEM
Comments
February 21, 2009

Thanks for making this!

Posted by MrRESpond
February 21, 2009

You’re truly scary.

Posted by _Tuff_
February 21, 2009

What do you code in if you don’t mind me asking? I use PHP and ran into trouble ‘multi-threading’ if that’s what it’s called, or just trying to run the same script 10 times accessing the same mysql table.

Posted by Georgie
February 21, 2009

@ _Tuff_

At least I don’t do anything too scary with it. I just like automation! :)

@Georgie

C# with Multi-Thread (about 350ish threads) – PHP yeah you’d have to run it over and over, haha.

Posted by Contempt
February 21, 2009

so now i know who spam comment my splogs everyday…:-P

Posted by martin91
February 21, 2009

oops ;)

Posted by Contempt
February 21, 2009

Awesome post. What book/resource would you recommend for a php coding newbie?

Posted by Josh Smith
February 21, 2009

I’d probably say http://www.tizag.com/phpT/ is great for beginners …

Posted by Contempt
February 21, 2009

@Contempt
Thank you.

Posted by Josh Smith
February 21, 2009

oldies but still goodies

Posted by QuietAffiliate
February 21, 2009

You truly are hard core.

A friend of mine once said that the game was to make twice as many pages, and get twice as many links, as Google found each day. So make 10,000 pages, 5,000 will be found, and the net result is 5k of joy.

You need to blog more, I learn a lot when you do.

Posted by Jim Banks
February 21, 2009

Another excellent post by the man, the myth, the legend…

Posted by Tangy
February 22, 2009

Excellent post, great information for mass sploggers and mass spammers. I am currently developing a framework written entirely in PHP (w/ MySQL) for scraping content from Google. I was thinking of utilizing ‘shared’ hosts + the power of cURL + a main control server control panel so I can cheaply (well, free) accomplish 100s of active cURL threads without having to bog down my server. 4000 links per minute is truly amazing, but I don’t know C#, so hopefully I will be able to accomplish a fraction of that through my idea… you stinkin’ blackhat sunnuvagun you.

Posted by sgtryan
February 23, 2009

Best post I’ve read in a while, Bofu.

I’m going to steal your “request post” idea for my blog if you don’t mind. :D

Also, though.. re: PHP, while it doesn’t have true “threads” that I’m aware of (everything I do with it is web-based) it does have process control and other features that allow you to implement things that would give you the same end result.

I’m always an advocate of using the right tool for the job, just saying that if you already know PHP fluently I’d consider looking into those options before embarking on learning C sharp. :)

Posted by Plunge
February 23, 2009

No worries Plunge, take it. And yeah, I’ve looked into most of the stuff in PHP that allows a simulated fork or multi-thread, but the performance isn’t even close from my testing. :)

Posted by Contempt
February 24, 2009

A little php/curl multi-threading for your readers

http://www.developertutorials.com/blog/php/parallel-web-scraping-in-php-curl-multi-functions-375/

Not that curl will support 4k links per minute. It still might get someone started that doesn’t want to deal with c+ / python but does want to play with multi-threading.

Nice setup bofu, well done & grassy ass for sharing

Posted by erect
February 24, 2009

My jaw has dropped. Beautiful work and way to think outside the box.

Posted by jewin
February 24, 2009

Thanks! :)

Posted by Contempt
February 24, 2009

Great post my dude, keep up the good work.

Posted by Lorne Fade
March 10, 2009

Hello,

First of all, I would like to thank you for your interesting blog. Although multi-threading is easily achievable in C (I am using the pthread_* functions myself), I wonder how you bypass the two main difficulties I encounter:
- the captchas (not related to XRumer features)
- the research process: you and I both know how hard it is to have a reliable proxy network; furthermore, most anonymous nodes (like Tor) are blacklisted by G.

By the way, great job.

Cheers.

Posted by Kilroy
March 10, 2009

Captchas I’m currently still learning how to crack. I’m on my way, but just not there yet.

For the research process I have a few IP blocks that I use, and for higher-end drops I utilize a lot of SOCKS5 proxies.

Posted by Contempt
May 9, 2009

The creation spam is what caught my eyes, do you expect the truly clean Ecommerce site to get “caught” at some point or can it run like this for ever?

Posted by newibe
May 10, 2009

I don’t know about for ever, but the life expectancy is definitely higher.

Posted by Contempt
June 16, 2009

One thing about Referral Spam.
If you take a quick look at robots.txt you will notice that some of them if not all use
…..
Disallow: /webalizer/
…..

Posted by maozet
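maozet's point about robots.txt is easy to check for any given site. Below is a minimal Python sketch using the standard library's `urllib.robotparser`. The `User-agent: *` line and the example.edu URLs are assumptions added for illustration, since the comment only quotes the `Disallow` line, and `is_crawlable` is a made-up helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumed minimal robots.txt: only the Disallow line comes from the comment above;
# the User-agent line is added here so the rule applies to every crawler.
ROBOTS_TXT = """\
User-agent: *
Disallow: /webalizer/
"""


def is_crawlable(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Return True if a well-behaved crawler identifying as `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    # Hypothetical URLs, used only to show the rule in action.
    print(is_crawlable(ROBOTS_TXT, "http://example.edu/webalizer/usage.html"))  # False
    print(is_crawlable(ROBOTS_TXT, "http://example.edu/index.html"))  # True
```

If the stats directory is disallowed like this, a compliant crawler never fetches the referrer list, so whatever links appear on it are unlikely to be seen by search engines at all.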
September 14, 2009

Great stuff, I learn more here than….anywhere. The part I can’t put my finger on is how you roll in your Splog Ping Crawl plugin when you do creation spam. What are you using to automate the blog build (I know YACG, but methinks you splog with WP as well?), and is it automagically installing the plugin with your WP install?

I probably missed it, but if you don’t mind repeating I appreciate it.

Posted by Noober
