1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Scrapebox Filtering Issue

Discussion in 'Black Hat SEO Tools' started by delta_squadron, Jun 3, 2014.

  1. delta_squadron

    delta_squadron Registered Member

    Joined:
    Mar 29, 2013
    Messages:
    55
    Likes Received:
    18
    Hi guys,

    I feel like a complete noob asking this but here goes:

    I have list of targets I want to filter using the 'Remove URLs not containing entries from' option in Scrapebox. The file I want to use as a filter has a bunch of URL paths in it (not full URLs), for example /path/profile/account.php. However, when I choose the option and select the file it doesn't remove any URLs from the list.

    Now, you'd think this means all of the URLs contain strings from the filter but by looking through the list I can see that lots of them don't. Why aren't they removed?

    Thanks in advance.
     
  2. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,572
    Likes Received:
    1,904
    Gender:
    Male
    Home Page:
    It should work fine, but it would have to be exact matches, if 1 character, period, slash etc.. is off its not going to get removed.

    If you want to post up a few urls and the exact strings that are not working with those urls, then I can give you more accurate info.
     
  3. delta_squadron

    delta_squadron Registered Member

    Joined:
    Mar 29, 2013
    Messages:
    55
    Likes Received:
    18
    Ah, so I guess I had misunderstood how this option works. I hoped it would keep all the urls that have even partial match with the entries from the 'filter' list. Makes more sense now.
     
  4. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,572
    Likes Received:
    1,904
    Gender:
    Male
    Home Page:
    If you want to keep those urls, then just choose remove urls not containing entries from and select your file.

    When I say exact match, I mean it will work on partial match so if you have

    car/page.htm

    in your file

    and a url with

    http://www.doman.com/page/car/page.htm

    it will still match.