1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

tumblr scraping went bad

Discussion in 'Tumblr' started by abcva, May 13, 2016.

  1. abcva

    abcva Junior Member

    Joined:
    Dec 15, 2013
    Messages:
    120
    Likes Received:
    2
    so basicly I harvested kws then scraped tumblr urls and then gone through them with vanity checker or whatever is name of scrapebox plugin. results 352 urls - 352 errors. mostly error 500/403, 'completed', timeout.
    What is problem? I am using scrapebox with 380 proxies I scraped with scrapebox
     
  2. everythingred

    everythingred Jr. VIP Jr. VIP Premium Member

    Joined:
    Jul 21, 2010
    Messages:
    2,168
    Likes Received:
    1,366
    Public proxies go to shit after a while. That may be the case. I only use private proxies and it seems to work fine.
     
  3. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,873
    Likes Received:
    2,057
    Gender:
    Male
    Home Page:
    As noted above the first thing to do is use private proxies. 500 is a server error and 403 is a forbidden error.

    You might wind up with a 403 from Tumblr but its more likely a proxy issue and its unlikely tumblr would give a 500 so again probably the proxy.

    I think tumblr also changed their formatting, if you haven't updated it yourself, you can always do so. I know in the next version of the vanity addon they will update it, but Scrapebox built the vanity addon so users can customize it. So you can always just check it yourself.

    But if you want to edit the defintion this works:

    Code:
    [Available1]
    resultcode=404
    mustcontain= <h1 class="error-message-title" data-localization="heading">
                        There's nothing here.
                    </h1>
    mustnotcontain=
     
    • Thanks Thanks x 1
  4. mindmaster

    mindmaster Jr. VIP Jr. VIP

    Joined:
    Sep 16, 2010
    Messages:
    2,873
    Likes Received:
    1,304
    Home Page: