1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

using scrapebox to grab emails from craigslist

Discussion in 'Black Hat SEO Tools' started by nairb, Feb 1, 2012.

  1. nairb

    nairb Registered Member

    Joined:
    Jan 26, 2012
    Messages:
    54
    Likes Received:
    3
    Location:
    NYC
    So I'm looking to grab emails that are used to post Resume postings on craigslist (region. craigslist. org /res/) with ScrapeBox's Grab Emails function.

    I'm using the footprint site:craigslist. org/ res/ to harvest URLs with the "region." different in each URL. The problem is, the region. craigslist. org/ res/ URLs list all the resume postings but emails are not present on this page. It looks like I need to enter each posting individually to get a glance at posters' emails (please refresh my memory - is it even possible to see posters' emails within their posts? Last I remember, it was an option that they could choose to display their email in the post or make it hidden).

    How could I use additional footprint syntax to tell SB to enter individual posts for ALL the "region." domains' inner pages?

    SB is software I haven't had much exposure to but I can already tell how AMAZING it is haha. Really looking forward to using it to its full potential.
     
  2. marketer14

    marketer14 Regular Member

    Joined:
    Nov 2, 2010
    Messages:
    310
    Likes Received:
    41
    Home Page:
    Here's some directions I found online. I tried it with another section of Craigslist and it did scrape the emails. Not sure about the Resume section, but try it out. Follow directions below.

    You can grab emails with the email grabber in the harvested urls section. It will let you harvest emails from a url or a local file.
    Say you wanted to harvest emails from the Jobs category on Craigslist.
    In a regular web browser open up Craigslist. Find the category you want to harvest from, in the case of the jobs category, most major cities it looks like this:
    http://losangeles.craigslist.org/jjj/
    I got this by selecting the city I wanted, and then clicking the "jobs" link at the top of the category.
    Then you would copy down that url, which is what is above. Note: make sure that if it gives you a spam warning you follow thru to get the actual url of the page that lists the ads.
    If you like you can also copy down the urls of the "Next 100 results".
    Then save off all of the urls from the categories you want.
    Then import them into the Link Extractor addon.
    Choose Internal only.
    Then let it harvest all the urls from those pages. This will give you all the current craigslist ads for each category from all the pages you choose.
    Then export the results to a txt file.
    Then import that txt file into the urls harvester section.
    Then use the email grabber to get the emails from those urls. Thus you have scraped all the emails from Craigslist for the current ads from the categories you have chosen.
    The best part is the category urls are static, but the urls that you harvest from them change daily, so you can repeat this process over and over.
     
    • Thanks Thanks x 6
  3. nairb

    nairb Registered Member

    Joined:
    Jan 26, 2012
    Messages:
    54
    Likes Received:
    3
    Location:
    NYC
    Wow that's a great solution! I didn't realize SB has all these free Addons - this software really is incredible.

    But in order to get the URLs for each region's Resume listing pages (including Next 100... pages), will I need to manually work my way through each region and its subsequent "Next 100 results" pages? There are a whole lot of regions and cities in the U.S. alone, so doing this manually would certainly take a really long time. If there is a way to automate this, it would be awesome.

    Nevertheless, thanks for the instructions. I'll start working on this right away
     
    • Thanks Thanks x 1
  4. ego_whore

    ego_whore Junior Member

    Joined:
    Nov 10, 2010
    Messages:
    172
    Likes Received:
    63
    You get index100.html page if you'd like to view the next 100 resumes. Why not use this page (and, index200.html if there are such) for all the locations you're scraping from (by adding the mentioned string to the base url of a particular city)? You don't need to visit every page; just use the created algorithm.
     
  5. kokoloko75

    kokoloko75 Elite Member

    Joined:
    Jan 1, 2011
    Messages:
    1,628
    Likes Received:
    1,935
    Occupation:
    Design director
    Location:
    Paris (France)
    This thread is 5 months old.

    Beny
     
  6. mafiaclan

    mafiaclan BANNED BANNED

    Joined:
    Aug 28, 2012
    Messages:
    35
    Likes Received:
    3
    What do you guys enter in the Harvest section? I'm guessing you guys are doing "inurl:craigslist.com" and have your keywords being your niche?