1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Can Scrapebox scan the whole web ?

Discussion in 'Black Hat SEO Tools' started by vjbeng, Jun 9, 2015.

Tags:
  1. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Hello all,

    I discovered this forum by reading this excellent thread about Scrapebox :
    [Tutorial] How I Generate Niche Targeted Email Lists/Leads With Scrapebox


    Actually, I consider buying this software, would you please take a look to see whether Scrapebox can do this :

    1) Scan all domains in *.fr,
    2) looking for pages whose url include a specific KEYWORD in it
    3) and sorting the results with a 2 columns table : URL / email found

    Thanx guys for your skilled advice !
    Vjbeng
     
  2. Aske_senteria

    Aske_senteria Junior Member

    Joined:
    Sep 14, 2010
    Messages:
    146
    Likes Received:
    23
    1. yes
    2. yes
    3. what do you mean by url / email
     
    • Thanks Thanks x 1
  3. ZennoBlaster

    ZennoBlaster Senior Member

    Joined:
    Jan 17, 2014
    Messages:
    1,029
    Likes Received:
    310
    No, not the whole web. There are vast sections that aren't indexed, i.e. the dark web, or the Warrior forum.
     
    • Thanks Thanks x 5
  4. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Thanks, good news :)
    I mean an excel spreadsheet to organize the data, with column A : "Site's domain" and Column B : "email scraped"

    At ZennoBlaster : thanks for the precision.
     
  5. scoobyrobert

    scoobyrobert Junior Member

    Joined:
    Aug 19, 2013
    Messages:
    100
    Likes Received:
    164
    Scrapebox can only scrape was has been indexed by search engines... Majority of the web but not all.
     
    • Thanks Thanks x 1
  6. eyashwant

    eyashwant Power Member

    Joined:
    Feb 11, 2009
    Messages:
    583
    Likes Received:
    57
    Occupation:
    Delivering Results & Success through Content Marke
    Location:
    Everywhere...want to join me?
    Home Page:
    Okay. Scrapebox CAN scrape the whole web IF you get it the LINKS to start it from
    It can scrape everything indexed by search engine.
    It has scrapping ability. But it needs a base/data to start from, and that's google and hence it's limited to what google sees
     
    • Thanks Thanks x 1
  7. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,875
    Likes Received:
    2,058
    Gender:
    Male
    Home Page:
    Yes to all 3, except number 3. It wouldn't be in excel by default, its a text file, but you could put it in excel and separate the data easily.

    However its not going to be a push button solution, your going to have to be very involved and it will take time, lots of time.

    You can harvest for .fr domains
    use the page scanner to find the urls with the keyword
    use the email grabber to grab the emails and save them to file, so long as they are not in javascript etc..., the mails have to just be in the html of the page.
     
    • Thanks Thanks x 1
  8. ConvertiVid

    ConvertiVid Junior Member

    Joined:
    Apr 14, 2015
    Messages:
    109
    Likes Received:
    11
    Location:
    Philippines
    What in the world is the dark web?
     
  9. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0

    Thanks a lot guys, I think I am going to read the documentation on their site.

    There used to be a discount coupon for people from BHW, no ?

    Goog to know, thanks! Do you know if in the text file, you can gather different fields of data, like say, "domain harvested" and "email found" ?
     
  10. simply

    simply Junior Member

    Joined:
    Jul 1, 2012
    Messages:
    169
    Likes Received:
    85
    Gender:
    Male
  11. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Hi guys,

    I recall a discount coupon to get ScrapeBox at 57$, nobody remembers it maybe ?
     
  12. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,875
    Likes Received:
    2,058
    Gender:
    Male
    Home Page:
    http://www.scrapebox.com/bhw
     
    • Thanks Thanks x 1
  13. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Amazing, thanks !
     
  14. sxtcsctc

    sxtcsctc Newbie

    Joined:
    Jun 14, 2015
    Messages:
    14
    Likes Received:
    0
    yes it can but you have to feed it the links
     
  15. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    I got my licence and spent a few hours on this great soft. Now getting back to this answer, would you enlighten me a little ?

    with a search in scrapebox like
    inurl:keyword and inurl:fr

    questions 1 and 2 are solved (except I still have to find how to get more than 1000 urls)

    but I can't figure out #3 : How to sort harvested emails with the corresponding domains ?

    Any emails grabbed so far is just plain email without any other data.


    and thanks for the awesome tuts btw Loopline ;)
     
    Last edited: Jun 17, 2015
  16. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Ok, why do I need that ?

    So far, the SB email grabber only gives back emails. And it's quite hard to cold email someone (or a mass) without any other info.

    If I get results like :

    domain1|email1
    domain2|email2
    etc

    then, I'll be able to send somehow personalized emails to John and Georgio :

    I visited your site domain1 and blabla bla

    So please, how to get the harvested domain with the grabbed email ?
     
  17. GoTRooT

    GoTRooT Jr. VIP Jr. VIP

    Joined:
    Jun 21, 2010
    Messages:
    554
    Likes Received:
    262
    Gender:
    Male
    Occupation:
    MD
    Location:
    Prague
    Home Page:
    A tool that will do what you are looking for is GSA email spider.
     
  18. Sweetfunny

    Sweetfunny Jr. VIP Jr. VIP

    Joined:
    Jul 13, 2008
    Messages:
    1,798
    Likes Received:
    5,076
    Location:
    ScrapeBox v2.0
    Home Page:
    If you are using ScrapeBox v1 then go to Options >> Email grabber: Save urls with email, and it will save it exactly like email|domain

    Or if you are using ScrapeBox v2, which you should because it's got a native 64-bit version which can handle hundreds of millions of urls, and it also has a site spider email grabber which can crawl a site not just scrape emails from the loaded urls. The option to save the urls with the emails is right on the email scrapers.
     
  19. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,875
    Likes Received:
    2,058
    Gender:
    Male
    Home Page:
    Glad you like the tutorials. :)

    if you are searching for .fr domains you can try site:.fr

    The site operator seems to get banned less quickly then inurl, although it may not make much difference as you already have an inurl operator in there too.

    However to get more then 1000 results you can tack keywords or letters. The keyword scraper has the option to append a-z to the end and then google will return different sets of results from its database of thousands and then you can just remove duplicate urls when you are done.
     
  20. vjbeng

    vjbeng Newbie

    Joined:
    Jun 9, 2015
    Messages:
    8
    Likes Received:
    0
    Thx Stipen for the alternative. Good to know, even if not free.

    Ok, thanks SweetFunny, sorry of being such a newbie. I found my way with this option, and using this tool to trim URLs down to domain names : ninjaseotools


    A thank you is still free ! got it for inurl, I'll keep this in mind.

    thanks 1 more time ! nice way to overcome this 1000 results limit

    PS : this "no url no emails" policy for newcomers is quite a pain... 15 minutes to find out that SMILEYS are forbidden....