My new PAA Journey

SharonHe

Feb 7, 2020
For the last two weeks I have been working on a Google PAA (People Also Ask) scraper built as a [Python <-> MySQL <-> PHP <-> WordPress] pipeline.
- For scraping I am using Python with a headless Chromium WebDriver.
- The data is stored in MySQL.
- A PHP script processes and parses the data and publishes each entry as a separate post on the WordPress site.
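
A minimal sketch of the storage step that sits between the scraper and the posting script, assuming a single table of question/answer rows. It uses Python's built-in sqlite3 as a stand-in for MySQL so it runs without a server. The table name `paa`, its columns and the `posted` flag are hypothetical and not the schema used on the site.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open the database and create the (hypothetical) paa table if needed."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS paa (
               id       INTEGER PRIMARY KEY AUTOINCREMENT,
               keyword  TEXT NOT NULL,
               question TEXT NOT NULL UNIQUE,
               answer   TEXT NOT NULL,
               posted   INTEGER NOT NULL DEFAULT 0
           )"""
    )
    return conn


def save_pair(conn, keyword, question, answer):
    """Store one scraped pair; return True only if a new row was inserted."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO paa (keyword, question, answer) VALUES (?, ?, ?)",
        (keyword, question.strip(), answer.strip()),
    )
    conn.commit()
    return cur.rowcount == 1


def unposted(conn):
    """Rows the posting script has not published yet, oldest first."""
    return conn.execute(
        "SELECT id, question, answer FROM paa WHERE posted = 0 ORDER BY id"
    ).fetchall()
```

The UNIQUE constraint on `question` means the scraper can be re-run on overlapping keyword lists without creating duplicate rows, and the `posted` flag lets a separate script pick up only the new ones.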

Yesterday I finally finished testing, and the scraper is now running at full power. At this moment my website has almost 10K articles.
They are all connected by a related posts plugin that I heavily modified, and every post has a table of contents at the top.

To be honest, the website looks very premium and not spammy at all. I applied for AdSense verification of the domain and I am waiting for the result.

Visitors today: 0. Pages indexed by Google: 30.

I will update this thread regularly and show once and for all whether this niche really works.
If this really picks up, I am planning on scaling. I have VPS servers, so scaling won't be a problem.

My PAA site is a niche website. I am not making a general PAA site covering all sorts of topics.

You can ask me any questions.
 
You can also do it with raw HTTP requests and keyword lists. That way it's faster and you can multithread it better :)
I know, I tried that method too, but I kept running into Google's spam protection, so that approach needs proxies (mainly because Google can detect that you are not running JavaScript).
The headless browser method lets me go very fast without any proxies.
 
Can you explain why you modified the related posts plugin?
To make the related posts part of the main content, each with a header (the question) and a body (the answer). Originally the plugin showed the other posts in the usual related-posts style.
After the modification you can't tell whether a section is the article body or a related post.
 
How did you choose the domain name?
Is it brand new or an expired/used one?

When you bought the domain, did you already know that you were going to make a PAA website?
 
1. I chose the domain based on the niche I was going to work in.
2. The domain was brand new; it is now 6 months old.
3. No, the domain was meant for blogging, but I found out (like 100 times before) that I am not good at writing blog posts, so I moved to PAA.
 
Yeah, but with rotating proxies it's no issue, and you get much more data faster because you don't need the delays required when scraping Google from one IP. It was just a suggestion, though.
If your setup works for you, then that's fine too. There are always multiple ways to do things :)
 
All the best for your journey. Let's see how it holds up against the latest Google AI update.
 
Are you bundling the PAAs into groups of 10-15 questions (+ answers) per post, or just using 1 PAA as 1 post?
Good luck with your project.
 
All the PAAs are stored as separate posts in WordPress, but when you open one post, its content gets extended with 20 more posts (PAAs) that look like they are all part of the main content.
 
Love to follow your project.

1. How many PAAs are in your DB now?
2. How do you create the title and content of a post?
3. Do you add anything else to the content of each post?
 
Right now I have 17K posts in the WordPress database.
Every post is a separate question with its answer.
I generate tags for every post based on the title.
Then I use the related posts plugin, which I modified to include the related posts (matched on tags, title and main body) in the main body of the primary post, each as an H3 title followed by the answer. So the whole post looks like it has 20 subheadings with content, but those 20 subheadings are actually real WordPress posts: you can open every single one of them and get a different set of related posts for that post too.
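
A rough sketch of the idea in Python, for anyone who wants to see the shape of it. This is not the modified plugin: it only scores related posts by tag overlap (the description above also matches on title and body), and the stopword list, the post dictionaries and the function names are all assumptions made up for the example.

```python
import re
from html import escape

# Hypothetical stopword list; a real one would be longer and niche-specific.
STOPWORDS = {
    "a", "an", "the", "is", "are", "do", "does", "how", "what", "why",
    "when", "where", "which", "who", "can", "to", "of", "in", "for",
    "on", "and", "or", "my", "i", "you", "it",
}


def make_tags(title):
    """Turn a question title into a set of lowercase keyword tags."""
    words = re.findall(r"[a-z0-9]+", title.lower())
    return {w for w in words if w not in STOPWORDS and len(w) > 2}


def related_posts(post, all_posts, limit=20):
    """Other posts sharing at least one tag, best overlap first."""
    tags = make_tags(post["title"])
    scored = []
    for other in all_posts:
        if other["id"] == post["id"]:
            continue
        overlap = len(tags & make_tags(other["title"]))
        if overlap:
            scored.append((overlap, other))
    # Highest overlap first; ties broken by post id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]["id"]))
    return [other for _, other in scored[:limit]]


def render_post(post, all_posts, limit=20):
    """Primary answer followed by related posts as H3 question + answer."""
    parts = [f"<p>{escape(post['answer'])}</p>"]
    for other in related_posts(post, all_posts, limit):
        parts.append(f"<h3>{escape(other['title'])}</h3>")
        parts.append(f"<p>{escape(other['answer'])}</p>")
    return "\n".join(parts)
```

Because every related entry is itself a full post, calling `render_post` on any of them produces a different page with its own set of related sections.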
Also, before adding a new post to the database, my PHP script compares its content against all existing posts and checks whether there is already a post with the same content. If there is, I do not add it as a new post; I only take the question and attach it to the existing post as metadata.
Then in my template I use a function to extract that metadata and show it at the top of the post as "Similar questions".
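
The duplicate check can be sketched like this. It is written in Python rather than PHP and keeps everything in memory, so treat it as an illustration only. Comparing by a hash of the normalized answer is an assumption (the actual script may compare content differently), and `PostStore` and `similar_questions` are hypothetical names.

```python
import hashlib
import re


def content_key(answer):
    """Hash of the answer with whitespace collapsed and case ignored."""
    normalized = re.sub(r"\s+", " ", answer).strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


class PostStore:
    """In-memory stand-in for the posts table and its post metadata."""

    def __init__(self):
        self.posts = {}  # content key -> post dict

    def add(self, question, answer):
        """Add a post, or attach the question to an existing duplicate.

        Returns True if a new post was created, False if the answer
        already existed and the question was stored as metadata.
        """
        key = content_key(answer)
        existing = self.posts.get(key)
        if existing is None:
            self.posts[key] = {
                "title": question,
                "answer": answer,
                "similar_questions": [],
            }
            return True
        if question != existing["title"] and question not in existing["similar_questions"]:
            existing["similar_questions"].append(question)
        return False
```

The template side then only has to read `similar_questions` for the current post and print the list above the answer.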

Then I use a table of contents plugin at the top of the primary post to list all the subheadings and link to them for quick access.
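
For illustration, here is roughly what a table of contents step does to the rendered HTML. This is a simplified Python sketch, not the WordPress plugin: it assumes plain `<h3>` tags without attributes, and the `slugify` and `add_toc` helpers are made up for the example.

```python
import re
from html import unescape


def slugify(text):
    """Lowercase the text and replace runs of non-alphanumerics with '-'."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def add_toc(html):
    """Give every <h3> an id and prepend a linked list of the headings."""
    toc_items = []
    used = {}

    def tag_heading(match):
        title = match.group(1)
        base = slugify(unescape(title)) or "section"
        # Number repeated headings so every anchor stays unique.
        used[base] = used.get(base, 0) + 1
        anchor = base if used[base] == 1 else f"{base}-{used[base]}"
        toc_items.append(f'<li><a href="#{anchor}">{title}</a></li>')
        return f'<h3 id="{anchor}">{title}</h3>'

    body = re.sub(r"<h3>(.*?)</h3>", tag_heading, html, flags=re.S)
    if not toc_items:
        return html  # nothing to link to, leave the post untouched
    toc = "<ul>\n" + "\n".join(toc_items) + "\n</ul>"
    return toc + "\n" + body
```

Running it over a post body that contains the appended H3 sections yields a list of jump links followed by the same body with anchored headings.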

PS: Google AdSense is still reviewing my page.
PPS: I use Google Keyword Planner to generate the keywords that I later feed to my Python scraper.
 