1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Scrapebox reporting error 404 during harvesting

Discussion in 'Black Hat SEO' started by ljuba973, Jul 23, 2011.

  1. ljuba973

    ljuba973 Newbie

    Joined:
    Jul 23, 2011
    Messages:
    15
    Likes Received:
    0
    Hi all,

    I am using ScrapeBox together with Auto Hide IP and that combination worked perfectly until yesterday when I tried to harvest some blogs from Google and Yahoo and Yahoo reported error 404. I tried different country's IP and run again just Yahoo and same error I got. I left it until now and still after 24 hours I am getting same error - as on attached image.

    I couldn't find what exactly Error 404 means - source unavailable, blocked, ...? How I can fix this issue and be able to harvest Yahoo again?
     

    Attached Files:

  2. Knoxgates

    Knoxgates Supreme Member

    Joined:
    Aug 9, 2008
    Messages:
    1,266
    Likes Received:
    918
    As far as i know Scrapebox is not harvesting links from yahoo since last few days. Don't know the issue is with proxies or yahoo's end.
     
    • Thanks Thanks x 1
  3. ljuba973

    ljuba973 Newbie

    Joined:
    Jul 23, 2011
    Messages:
    15
    Likes Received:
    0
    Thanks a lot ... so 404 IS "source unavailable". Probably Yahoo changed structure of pages. Hopefully new update of SB will be issued with new harvesting algorithm
     
  4. Knoxgates

    Knoxgates Supreme Member

    Joined:
    Aug 9, 2008
    Messages:
    1,266
    Likes Received:
    918
    Yes i think Sweetfunny is releasing a new update for this.
     
  5. TheMatrix

    TheMatrix BANNED BANNED

    Joined:
    Dec 20, 2008
    Messages:
    3,444
    Likes Received:
    7,279
    It's nothing to do with SB. Yahoo is doing an API update or something and should be up very soon.
     
  6. muchacho

    muchacho Supreme Member

    Joined:
    May 14, 2009
    Messages:
    1,293
    Likes Received:
    187
    Location:
    Lancashire, England.
    Not sure what the latest is, but it still aint working properly.

    What was once an easy way to harvest over 2-3 million URLs now gets you around 3-4,000 URLs before you have to run it again.
     
  7. Frogserv

    Frogserv Regular Member

    Joined:
    Jun 21, 2011
    Messages:
    376
    Likes Received:
    180
    Occupation:
    Entrepreneur
    Location:
    Paris, FR
    How can you harvest 2/3 million urls? Scrapebox seems to limit to 1 million.:06:
     
  8. muchacho

    muchacho Supreme Member

    Joined:
    May 14, 2009
    Messages:
    1,293
    Likes Received:
    187
    Location:
    Lancashire, England.
    1 million is only the amount it can store in the harvested results section.

    If you go to the harvested sessions inside the Scrapebox folder it stores them all there.

    I'm getting "error 502" when I try harvesting with Yahoo and a quick bit of Googling says that's a Yahoo API gateway error. So it sounds like Yahoo have API problems, or they have made it so that Scrapebox won't function correctly with it.
     
    • Thanks Thanks x 1
    Last edited: Aug 2, 2011
  9. Frogserv

    Frogserv Regular Member

    Joined:
    Jun 21, 2011
    Messages:
    376
    Likes Received:
    180
    Occupation:
    Entrepreneur
    Location:
    Paris, FR
    Ty for the millions :)

    Well, I have the same problem for yahoo since this night.
    Edit : Now Yahoo is good for me
     
    Last edited: Aug 2, 2011
  10. sylvanas

    sylvanas Newbie

    Joined:
    Dec 22, 2010
    Messages:
    5
    Likes Received:
    1
    I just tested auto hide ip, and it changes the IE proxy settings, does scrapebox use the IE setting if you dont set proxy in it?
    how have you done that?