1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Easily remove all spam domains from a list.

Discussion in 'Black Hat SEO' started by myopic1, Apr 4, 2016.

  1. myopic1

    myopic1 Regular Member

    Joined:
    Mar 24, 2014
    Messages:
    408
    Likes Received:
    402
    Nifty little share here that should save some people a great deal of time. If you've ever found yourself with a list of domains, whether you're looking for PBNs, outreach targets for white hat SEO (yuck), sites to email etc etc, you may have wound up with a large percentage of, to put it kindly, shit domains within that list...well no more!

    I've decided to open source my spam filter for anyone to use as they see fit. This isn't exactly how I do it, I tend to run a python script that does it for me but here's a command that should save a lot of time and resources for people who don't know how to code this for themselves:

    Step 1: Have a text file full of domains that you need to analyze, e.g mymixedfile.txt (the name isn't important).

    Step 2 (Mac/Linux): Enter the below command into the terminal or 'command line', if you're not sure what that is, have a quick Google.

    Code:
    grep -v 'casino\|tiffany\|michael\|karen\|jimmy\|payday\|loan\|ugg\|vuitton\|mulberry\|burberry\|poker\|roulette\|money\|cash\|replica\|adult\|sex\|escorts\|cheap\|chanel\|moncler|\barbour\|hollister\|rayban\|cialis\|ketone\|handbag\|abercrombie\|discount\|cigarette\|sunglasses\|viagra\|debt\|nike\|pandora\|binary\|coupons\|loubout\|northface\|watches\|mortgage\|lacost\|shoe\|dating\|debt\|rolex\|karen\|pussy\|asian\|vagina\|xx\|fuck\|tokyo\|teens\|girl\|japan\|kinky\|bondage\|wives\|private\|fetish\|chick\|shoes' mymixedfile.txt > mycleanfile.txt
    If you scroll to the end of that command you'll see the part where your file name gets inserted, you can also use folder paths here if your list is located somewhere else on your computer. Please note, you'll need to navigate to the folder where your domain list is located on your computer from within the terminal (command line) first in order for this to work.

    If there's enough interest in the above, then I'll do an alternate command for Windows users but...well...please don't use Windows.

    Of course add any words into the above that you want to filter out. The above command churns through files that are gigabytes in size in a matter of seconds, though that depends on your hardware setup. In my experience, the above will filter out 99% of spam domains that are obviously spam but you may get a small percentage of sneaky domains with creative names getting through the net.
     
    • Thanks Thanks x 1
    Last edited: Apr 4, 2016
  2. Furious Man

    Furious Man Jr. VIP Jr. VIP

    Joined:
    Aug 4, 2015
    Messages:
    1,533
    Likes Received:
    235
    thanks for the info let i ill try
     
    • Thanks Thanks x 1
  3. Take Action

    Take Action Jr. VIP Jr. VIP

    Joined:
    Aug 3, 2014
    Messages:
    1,135
    Likes Received:
    241
    Gender:
    Male
    Home Page:
    Need to check this out.. Thanks for the share.
     
    • Thanks Thanks x 1
  4. Crazy Monkey

    Crazy Monkey Jr. VIP Jr. VIP

    Joined:
    Aug 4, 2015
    Messages:
    1,956
    Likes Received:
    240
    Gender:
    Male
    Location:
    In Jungle
    Thanks, it solves my big probes
     
    • Thanks Thanks x 1
  5. Devil Rider

    Devil Rider BANNED BANNED

    Joined:
    Jul 24, 2015
    Messages:
    554
    Likes Received:
    59
    Awesome share man! You did great work on this info...
     
    • Thanks Thanks x 1
  6. Cha Wong

    Cha Wong Regular Member

    Joined:
    Feb 1, 2016
    Messages:
    230
    Likes Received:
    10
    Occupation:
    SEO Expert & Social Media Marketer (Beginner)
    Location:
    Singapore
    Thanks for the tip :)