Spamdexing: how does it work and how to make it doable ?

stallion45d

Power Member
Joined
Oct 11, 2011
Messages
735
Reaction score
464
Hi, while doing some keyword research for a specific niche, I accidentally found a huge "network" of spamdexing hitting google pretty hard for a big amount of keywords. Some details about its structure:

- thousands of expired domains . Each domain contains 500 - 2500 internal php pages. Everything is redirected with 302 temporary redirect ( used redirectdetective.com to check this) to another domain which kinda acts as a doorway for a second , and then finally all the traffic ends up on a popular affiliate network. You can´t really see the content of any of the internal php pages. But the text appearing on the snippet is totally junk, unrelated and unreadable. So no way they are ranking because of quality content.

- All the domains are hosted in cloudfare ( I guess they are using the free plan ,otherwise hosting all those domains will be costly) . Used hostingchecker.com to get this info

- They target low competition keywords on this niche, even for some query searches, they rank with 8 or 9 different domains.
- using webrate.org says that they get 25.000 UVs / day in total.
- I tried to download / scrape internal links / check html or php code of a few of these domains and nothing comes in. Not using scrapebox or similar software.

first I would like to congratulate the people behind this , for laughing at google this way . At least, they have been doing this method for the last 3 years , by checking the registration date of their main domain collecting all the traffic at the end.

now, here comes the big question for the SEO experts on the forum:

how to do Spamdexing , this way , without being screwed by google ?
 
I'm interested also on this analysis.
 
Well, at beginning I would reverse engineer your competitors, and do your research.

Spamdexing or however you call it is numbers a game, you just keep build out sites and see what sticks.

But basics are:
Scrape your content, mix it up (spin it etc.)
Add your keywords for each post
And do basic on-page seo stuff

End of the day Google still indexes content, sometimes they have filters, but this will work forever.
 
Hi, while doing some keyword research for a specific niche, I accidentally found a huge "network" of spamdexing hitting google pretty hard for a big amount of keywords. Some details about its structure:

- thousands of expired domains . Each domain contains 500 - 2500 internal php pages. Everything is redirected with 302 temporary redirect ( used redirectdetective.com to check this) to another domain which kinda acts as a doorway for a second , and then finally all the traffic ends up on a popular affiliate network. You can´t really see the content of any of the internal php pages. But the text appearing on the snippet is totally junk, unrelated and unreadable. So no way they are ranking because of quality content.

- All the domains are hosted in cloudfare ( I guess they are using the free plan ,otherwise hosting all those domains will be costly) . Used hostingchecker.com to get this info

- They target low competition keywords on this niche, even for some query searches, they rank with 8 or 9 different domains.
- using webrate.org says that they get 25.000 UVs / day in total.
- I tried to download / scrape internal links / check html or php code of a few of these domains and nothing comes in. Not using scrapebox or similar software.

first I would like to congratulate the people behind this , for laughing at google this way . At least, they have been doing this method for the last 3 years , by checking the registration date of their main domain collecting all the traffic at the end.

now, here comes the big question for the SEO experts on the forum:

how to do Spamdexing , this way , without being screwed by google ?
Can I send you a test URL to see if we are talking about the same network of sites?
 
its not that complicated really with the right tools.
you create sites automatcally and then cloak them to affiliate landers.
its been done since ages and it still works fine ;)
 
you can check the pages content with google cache.. I wonder how they distinguish you from google when they do the cloaking
 
Sure, send me a PM. By respect to creator , i Will not disclose any URL here. Please, understand this point
Completely understand. I just want to know if we both are tracking a similar network since the MO seems the same.

I can't seem to send you a PM; Can you start a conversation with me?
 
Completely understand. I just want to know if we both are tracking a similar network since the MO seems the same.

I can't seem to send you a PM; Can you start a conversation with me?
Check your profile, please
 
its not that complicated really with the right tools.
you create sites automatcally and then cloak them to affiliate landers.
its been done since ages and it still works fine ;)
Could you please indicate the tools to use?

How is It possible that When I try to download or scrape one of those sites/ domains , I get nothing like if they did not exist , but in fact they are ranking there for many keywords?
 
Could you please indicate the tools to use?

How is It possible that When I try to download or scrape one of those sites/ domains , I get nothing like if they did not exist , but in fact they are ranking there for many keywords?
i use my own tools that i've created.
when you try to scrape a cloaked site it wont let you as you are being redirected to a different site.
some simple cloakers you might get away with changing your user agent to googlebot, but with proper cloaking that wont work as they do IP/rDNS checks.
 
It really looks like a double win,
They seem to be cloaking users and using the google shown content to index their links,
So they do take advantage of both of the visits,
Seems pretty clever, I must say :D

You can do the same with some autoblogging plugin + spinning service along with some cloaking script,
A good expired domain will make things easier and warming it up will make it even better
 
Hi, using a google bot simulator, I get this:

https://www.dnsqueries.com/en/googlebot_simulator.php
HTTP CODE​
=
HTTP/1.1 302 Found
Date​
=
Thu, 08 Jul 2021 01:08:32 GMT
Content-Type​
=
text/html;charset=UTF-8
Transfer-Encoding​
=
chunked
Connection​
=
keep-alive
Location​
=
XXXXXXXXXXX ( deleted by me , just to not show the landing page url )
CF-Cache-Status​
=
DYNAMIC
Expect-CT​
=
max-age=604800, report-uri="https://XXXXXXX.cloudflare.com/cdn-cgi/beacon/expect-ct"
Report-To​
=
{"endpoints":[{"url":"https:\/\/a.nel.cloudflare.com\/report\/v2?s=% LONG URLXXXXXXXX "}],"group":"cf-nel","max_age":604800}
NEL​
=
{"report_to":"cf-nel","max_age":604800}
Server​
=
cloudflare
CF-RAY​
=
6XXXXXXXXd9e4181-HAM
alt-svc​
=
h3-27=":443"; ma=86400, h3-28=":443"; ma=86400, h3-29=":443"; ma=86400, h3=":443"; ma=86400

<html>
<head><title>302 Moved Temporary</title></head>
<body bgcolor="white">
<center><h1>302 Moved Temporary</h1></center>
<hr><center>nginx</center>
</body>
</html>
-------------------------------------------------------------

All this data came from and internal PHP page like http:/domain.com/skin-care-kit.php

Does this clarify bit more how these guys are drilling google ?
 
Try with google cache cache:http:/domain.com/skin-care-kit.php
 
its just showing a (302) redirect as it should be to hide the cloaking.
if they didnt disable caching then as suggested in the last post, you can try to see if google has a version of the site in its cache.
 
By respect to creator , i Will not disclose any URL here
Anyway, this is quite weird that you think you are "respecting the creator"

If you have found this through public sources (searching in Google), it's technically public.

This is not something that someone in your local area, or in this forum has shown you for learning purposes out of a private conversation.

So as you found it public, you can share it public. No respect fault is being inflicted by sharing in a forum.

Does this clarify bit more how these guys are drilling google ?

Leaving this apart, I'm not 100% sure if I have understood you but what I've read is a little strange and not sure that ther answers so far, are on the right direction so far.

The thing here is that, as you mention, ranking pages are being redirected with 302, which means that Google currently have cached them, and indexed with superb and worthy ranking content that satisfy the intent you are querying.

There are two possibilities I figure out in case I've understood well:

1. They used an excellent expired domain with high reputation for spam purposes.
2. They also may have blasted that domain the right way (have you check AHrefs). Maybe not, and the domain was awesome (check domain rating in Ahrefs, Majestic and linking profile overall)
3. After a while, when they observed that it was already ranking good for a big couple of keywords, they start moving everything to a 3rd party affiliate site, by hideously redirecting through a cloak or a doorway (since I have not seen the pages I cannot tell if you are talking about cloak 302s or doorways, or both at the same time)

The only caveat, is that as you say, this seems to have been working for 3 years. But what you don't know if they had these domains purchased for the first year, built the content in the second, and started the cloaking/doorways in the 3rd.

Can you check Archive.org or even the Google cache for those sites to see what was being indexed?
 
Anyway, this is quite weird that you think you are "respecting the creator"

If you have found this through public sources (searching in Google), it's technically public.

This is not something that someone in your local area, or in this forum has shown you for learning purposes out of a private conversation.

So as you found it public, you can share it public. No respect fault is being inflicted by sharing in a forum.



Leaving this apart, I'm not 100% sure if I have understood you but what I've read is a little strange and not sure that ther answers so far, are on the right direction so far.

The thing here is that, as you mention, ranking pages are being redirected with 302, which means that Google currently have cached them, and indexed with superb and worthy ranking content that satisfy the intent you are querying.

There are two possibilities I figure out in case I've understood well:

1. They used an excellent expired domain with high reputation for spam purposes.
2. They also may have blasted that domain the right way (have you check AHrefs). Maybe not, and the domain was awesome (check domain rating in Ahrefs, Majestic and linking profile overall)
3. After a while, when they observed that it was already ranking good for a big couple of keywords, they start moving everything to a 3rd party affiliate site, by hideously redirecting through a cloak or a doorway (since I have not seen the pages I cannot tell if you are talking about cloak 302s or doorways, or both at the same time)

The only caveat, is that as you say, this seems to have been working for 3 years. But what you don't know if they had these domains purchased for the first year, built the content in the second, and started the cloaking/doorways in the 3rd.

Can you check Archive.org or even the Google cache for those sites to see what was being indexed?
I truly believe the owner is a reputable member of BHW who has helped other people a lot. If I get this to work ,I Will not compete on his niche, even If I know he is making a killing.
I want to learn this spam technique , not taking advatage of the Network I found

I Will check ahrefs and post some results .
 
I truly believe the owner is a reputable member of BHW who has helped other people a lot. If I get this to work ,I Will not compete on his niche, even If I know he is making a killing.
I want to learn this spam technique , not taking advatage of the Network I found

I Will check ahrefs and post some results .
PM me the urls please sir
 
Back
Top