1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Quick Scrapebox Question

Discussion in 'Black Hat SEO Tools' started by devs101, Oct 19, 2011.

  1. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Hi guys, I've got a bunch of URLs that I would like to filter.

    Is there a way to use the "Remove URLs not containing" feature in scrapebox so that I can remove by more than one keyword?

    Any help would be great.

    Cheers,

    Devs ;)
     
  2. blackHat_BR

    blackHat_BR Registered Member

    Joined:
    Jan 5, 2011
    Messages:
    77
    Likes Received:
    10
    Occupation:
    SEO
    Location:
    Brasil
    i would tell you to try to use excel to filter that kind of thing
     
  3. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Thats a good idea, I was hoping that there would be a way to do it directly in SB but that doesn't seem posable.

    How would this be done in excel? I don't use this app very much.

    Cheers!
     
  4. totallybaked

    totallybaked Newbie

    Joined:
    Mar 31, 2011
    Messages:
    26
    Likes Received:
    1
    Location:
    UK
    yeah use excel, look for the Sort & filter options. Then add a filter for the keywords you want to exclude using the "does not include" option
     
  5. mindmaster

    mindmaster Jr. VIP Jr. VIP Premium Member

    Joined:
    Sep 16, 2010
    Messages:
    2,501
    Likes Received:
    1,136
    Location:
    at my new office
    Check the remove filter tab, up-right corner. ;)

    Cheers
     
  6. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    I've had a look in Excel and under the Advanced Filter options. I only get the following options:

    List Range:
    Criteria range:
    Copy to:

    Hmmmm?
     
  7. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    In SB or Excel? If SB how do you get it to filter different KWs?

    Cheers ;)
     
  8. totallybaked

    totallybaked Newbie

    Joined:
    Mar 31, 2011
    Messages:
    26
    Likes Received:
    1
    Location:
    UK
    Hi Again,

    what version of excel do you have?
     
  9. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Hi its v12.1.3 2008 on a Mac.
     
  10. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,936
    Occupation:
    Design director
    Location:
    Paris (France)
    If you want to filter you URLs on the page content (and not URL text) for more accurate results, you can use NicheURLFilter :
    Code:
    http://codecanyon.net/item/nicheurlfilter-version-10/681808
    It's what I use ; maybe there are other tools like that.

    Beny
     
  11. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Hi and thanks for the suggestion. its not what I'm wanting to do though. i need to filter the urls looking for contact pages. So I'm looking for words like: 'contact', 'contactus', 'hello' etc.
     
  12. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,936
    Occupation:
    Design director
    Location:
    Paris (France)
    You can try that, but a bit limited... :
    Code:
    http://textmechanic.com/Remove-Lines-of-Text-Containing.html
    I'm sure there are others more sophisticated.

    Beny
     
  13. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Thanks Beny, thats what I'm after but ideally a non web based version as I have loads of urls that I need to process and it just kills the page :(
     
  14. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,936
    Occupation:
    Design director
    Location:
    Paris (France)
    Yes, web service are not perfect for large tasks...
    It's strange how it's difficult to find software to do this, it seems very simple.

    Beny
     
  15. rody00

    rody00 Regular Member

    Joined:
    Feb 25, 2010
    Messages:
    354
    Likes Received:
    80
    Location:
    The Machine City
    yes you can just use "Remove URLs containing entries from" and load up a txt file with those keywords you want to filter out, it's very simple.
     
    Last edited: Oct 19, 2011
  16. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    i know it seems like an easy thing to do.

    Rody, how do you filter via multiple keywords via the remove tool in SB? It doesn't give you the option to load a text file, as far as I can see?
     
    Last edited: Oct 19, 2011
  17. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,936
    Occupation:
    Design director
    Location:
    Paris (France)
    You can't...
    So, if you choose this method you need to make keyword per keyword, and exporting each time.
    Finally merge all lists in one.

    Beny
     
  18. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    Yeah thats what I thought Beny, looks like thats the route I'll have to take.

    Thanks for the input mate ;)
     
  19. mindmaster

    mindmaster Jr. VIP Jr. VIP Premium Member

    Joined:
    Sep 16, 2010
    Messages:
    2,501
    Likes Received:
    1,136
    Location:
    at my new office
    Devs101, correct me if I misunderstood:

    You have a list of urls in scrapebox.
    You want to remove certain urls containing specific keywords. Let's say you want to remove words like: "adult", "sexy", etc.

    If this is the situation..
    Put all your keywords in a txt file.
    Load your list in scrapebox.
    Hit the "Remove/Filter" button.
    Go to "Remove urls containing entries from ..". Here you load your txt file with all those words.
    Let it reap.

    Now you have your list cleaned from those keywords. Save it.

    If you want only urls containing those keywords, you go a step further.
    Save the cleaned list.
    Load the full list again.
    Go to "Import URL list" --> "Select the url list to compare". Here you load your cleaned list.
    This will remove the cleaned urls, and keep all your urls with your wanted keywords in it.
     
  20. devs101

    devs101 Registered Member

    Joined:
    Oct 13, 2011
    Messages:
    79
    Likes Received:
    1
    mindmaster you're 100% correct mate. The problem that I have is that my version of SB does not give you the option to load keywords via the cleaner? What version are you using? Im on 1.15.29.

    I've just run into another little problem that you may be able to help with as you seem to know your stuff ;)

    I need to run the urls through the email grabber. however some of the urls error with fault-0 and fault-400. I can't see a way to retry all the failed urls or export them to a list? Can this be done or do you know what would cause the errors as the pages work via the browser.

    Cheers! ;)