1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

[Scrapebox] Filering out porn and gambling results

Discussion in 'Black Hat SEO Tools' started by cash202, Mar 27, 2011.

  1. cash202

    cash202 Elite Member Premium Member

    Joined:
    Mar 12, 2011
    Messages:
    1,803
    Likes Received:
    2,821
    Location:
    Sydney, Australia
    Home Page:
    I am currently harvesting for a big list of AA social bookmarking
    sites for BMD and the thing is a lot of Scuttle site are porn or
    gambling related and I don't really want to post any of my links
    there.

    After an hour of searching on forums I couldn't find a decent way
    to do it so I come up with the following.

    1. First of all, I harvest a list of sites. I'll skip details on this part.
    It is pretty easy. Just use and Scuttle/Pligg/PHPDug footprints
    list.

    2. Then, I create a footprint file with the following content:

    Code:
    site:%KW% -porn -xxx -sex -f.uck -casino -gambling -gamble -inurl:about
    It is self explanatory. I use -inurl:about because some of the
    porn Scuttle sites have pretty decent and clean about page.

    3. Next, I go to the Harvester section and paste all the URLs as
    keywords and merge it with the footprint.

    4. Next, in Select Engines & Proxies I only select Google and set
    it to return 10 results only (because it is the minimum, at least
    I couldn't set it to one)

    5. Then I start harvesting and after it finished I do Remove/Filter,
    Remove Duplicate Domains.

    All that's left in a harvested list is clean.

    If anyone knows a better way of doing so, please post it here.

    Thank you.
     
    • Thanks Thanks x 4