He is one of the guys who make PAA question/answer Spam sites. And here is his Process.

Something like this:
  1. Split the sentences into tokens.
  2. Paraphrase words and phrases.
  3. Whenever possible, change the order of words.
  4. Whenever possible, change the order of sentences.


like how? rss? stuff like zapier and the like?

so true. This is why I'm not planning to sell my script. Also it's constantly evolving and I would have to provide customer support. If you can make money without hiring almost any people it's probably the best way. Little overhead and noone bothering you :)

  1. Generate a few thousand “seed” keywords for our sub-niche.
  2. Scrape ~200 PAAs for each seed. Now you are left with around 100k PAAs after removing the duplicates.
  3. Check the US volume for each of these 100k PAA keywords. Use positive US volume keywords as titles of articles because US == $$.
  4. Check the SERPs competition for each positive volume keyword. I created an algorithm that checks the relevancy, and authority of the pages and also looks for pages with user-generated content.
  5. Choose ~1k most promising keywords out of our pool.
  6. Generate articles by clustering semantically relevant keywords together.
  7. When generating articles it's paraphrasing the snippets using a custom algorithm.
  8. Generate custom images.
  9. Post to WordPress using the REST API.
  10. Rinse and repeat until I have the number of articles I want.
How do you generate images? I'm unable to use Nider as its not compatible with the latest version of Python. I use pillow it's good but sometimes the background color matches with the text color and makes it unreadable. How do you get right images?
 
  1. Generate a few thousand “seed” keywords for our sub-niche.
  2. Scrape ~200 PAAs for each seed. Now you are left with around 100k PAAs after removing the duplicates.
  3. Check the US volume for each of these 100k PAA keywords. Use positive US volume keywords as titles of articles because US == $$.
  4. Check the SERPs competition for each positive volume keyword. I created an algorithm that checks the relevancy, and authority of the pages and also looks for pages with user-generated content.
  5. Choose ~1k most promising keywords out of our pool.
  6. Generate articles by clustering semantically relevant keywords together.
  7. When generating articles it's paraphrasing the snippets using a custom algorithm.
  8. Generate custom images.
  9. Post to WordPress using the REST API.
  10. Rinse and repeat until I have the number of articles I want.
Woah!!! I am considering doing something of this nature next year, do you create PAA sites?
 
There is a big opportunity here. The only thing that makes this method succeed is the ability to scrape millions of paa per day.
 
How do you generate images? I'm unable to use Nider as its not compatible with the latest version of Python. I use pillow it's good but sometimes the background color matches with the text color and makes it unreadable. How do you get right images?
Nider works with Python 3.9.X if you handle dependencies:

Code:
* Works on Python 3.9.10 on Xubuntu and Windows 10
**Xubuntu:**
* `sudo apt-get install python-dev libjpeg-dev libfreetype6-dev zlib1g-dev`
* `sudo apt install pip`
* `python3 -m pip install --no-deps -c constraints.txt nider==0.5.0`
* `python3 -m pip install -r requirements.txt`
You might have problems with installing Nider/Pillow on Ubuntu Python 3.7+:
* `sudo apt-get install python-dev libjpeg-dev libfreetype6-dev zlib1g-dev`
* do the above before installing pillow. if you already installed then uninstall and `pip install --no-cache-dir -r requirements.txt`

**Windows:**
* `python -m pip download nider==0.5.0`
* `python -m pip install --no-deps -c constraints.txt nider==0.5.0`
* `python -m pip install colorthief`
* `python -m pip install -r requirements.txt`
* Problems? pip uninstall pillow, do these steps again.

constraints.txt contains 1 line:
pillow>=9.0.0

Woah!!! I am considering doing something of this nature next year, do you create PAA sites?
Yes, I'm the guy.
Come-on guys :D Break the news
LOL
There is a big opportunity here. The only thing that makes this method succeed is the ability to scrape millions of paa per day.
it's not such a big deal, just a strong PC and a lot of proxies.
 
This shows that programming knowledge always benefits, however I stay away from code as my brain freezes as soon as I see those long lines of mumbo jumbo. But I think there are some no code tools coming up and maybe one day AI will write code for me taking simple english as input similar to dall e or imagen.
Maybe "one day" is already here with Cheatlayer app.
 
Can you give a detailed process of how setting up a PAA site would look like?

Let's say you are speaking to a coder, what would you tell him or explain you need him to do for you?

I don't know, that's why that comment.
 
Any skill that's monetizable help. This thread is based around coding so it may seem like the best thing. I know a lot of programmers who are horrible at marketing, and they want to become like us. The grass is always greener on the other side!
Codder + marketing team can't be beat.
 
Something like this:
  1. Split the sentences into tokens.
  2. Paraphrase words and phrases.
  3. Whenever possible, change the order of words.
  4. Whenever possible, change the order of sentences.
after these actions, can a readable text be obtained?
 
Last edited:
it's meh and expensive to scale. Not a fan of GPT-3 articles in general.
Why are you “meh” on gpt-3? Is it a value thing, or something else?

it sounds like the whole game here is Adsense? Does this work at all for d2c product sales? Filling the affiliate funnel?
 
Why are you “meh” on gpt-3? Is it a value thing, or something else?

it sounds like the whole game here is Adsense? Does this work at all for d2c product sales? Filling the affiliate funnel?
gpt3 has no factual accuracy.It has good English but my solution does to.

Yes, especially stuff like local SEO. "plumber in [10k US cities".
So you basically need to scrape content from 100s of sites to make it viable?
1 article = 10 sites. 5000 articles might be 5-10k different sites. That's why I'm a running a 24 core CPU to make 1 site and it still takes 2 weeks.
 
Back
Top
AdBlock Detected

We get it, advertisements are annoying!

Sure, ad-blocking software does a great job at blocking ads, but it also blocks useful features and essential functions on BlackHatWorld and other forums. These functions are unrelated to ads, such as internal links and images. For the best site experience please disable your AdBlocker.

I've Disabled AdBlock