1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How to Scrape all pages of tumblr

Discussion in 'Black Hat SEO' started by anil190185, Feb 27, 2012.

  1. anil190185

    anil190185 Junior Member

    Joined:
    Jan 9, 2011
    Messages:
    112
    Likes Received:
    44
    Occupation:
    SEO
    Location:
    India
    I already have a list of 2K pages which I have scraped using ScrapeBox. Now I would like to get more pages excluding the 2k pages which I already have.

    I own SB for more than 4-5 months but I rarely use it so I am not very familiar.

    Thanks!!
     
  2. reb0rn

    reb0rn Newbie

    Joined:
    Feb 17, 2011
    Messages:
    45
    Likes Received:
    139
    get a bigger list of keywords, i scraped 20k unique with mine, but this is still low
     
  3. anil190185

    anil190185 Junior Member

    Joined:
    Jan 9, 2011
    Messages:
    112
    Likes Received:
    44
    Occupation:
    SEO
    Location:
    India
    So I just need to insert random keywords and scrape? Also there is a limit to scrape only 1K. Can this be increased? Can I put 10K there?
     
  4. nik-0

    nik-0 BANNED BANNED

    Joined:
    Jan 19, 2012
    Messages:
    510
    Likes Received:
    96
    Its 1000 searchresults for each keword. So in the footprint box on top you put site:tumblr.com

    And in the keyword box you put a ton of keywords. Start harvesting and you should be able to get like 100k tumblr url's or more depending on how divers your keyword list is. Then trim to root and remove duplicate domains, that is possible with tumblr cause it are subdomains.
     
    • Thanks Thanks x 1
  5. anil190185

    anil190185 Junior Member

    Joined:
    Jan 9, 2011
    Messages:
    112
    Likes Received:
    44
    Occupation:
    SEO
    Location:
    India

    Thanks for clarifying that its 1k per KW. Will definitely give it a go.
     
  6. dirtbag

    dirtbag Senior Member

    Joined:
    Jul 24, 2008
    Messages:
    990
    Likes Received:
    527
    Google intext:wordlist.txt, find yourself a nice word list of common nouns, cut/paste, do what nik-0 said to add in your tumblr footprint, scrape like crazy...
     
    • Thanks Thanks x 1