ScrapeBox harvesting irrelevant URLs

Discussion in 'Black Hat SEO' started by nonai, Oct 13, 2014.

  1. nonai

    nonai Power Member

    I have been having this problem with SB for a long time. I enter a phrase and try to harvest, and it comes up with all these URLs that don't even contain my phrase.

    For example, today I was trying to harvest some Elgg-based websites, so I entered this in the keyword section:
    "Display name" "Password (again for verification)"

    That's an Elgg footprint. I harvested for a few minutes and it gave me a bunch of bullshit, like:
    Code:
    http://www.zdnet.com/dropbox-gets-hacked-again-7000001928//RK=0
    http://www.att.com/esupport/article.jsp?sid=KB408822&cv=807/RK=0
    http://www.researchgate.net/publication/4179340_Compact_frequency-agile_absorptive_bandstop_filters/RK=0
    http://www.theaustralian.com.au/nocookies/RK=0
    http://www.socialsecurity.gov/employer/ssnv.htm/RK=0
    http://www.reddit.com/r/technology/comments/ch42m/california_drivers_may_soon_have_led_adblaring//RK=0
    http://www.ebay.com/sch/i.html?_nkw=antique+doll/RK=0
    http://httpd.apache.org/docs/2.2/programs/htpasswd.html/RK=0
    http://www.wired.com/2012/08/apple-amazon-mat-honan-hacking//RK=0
    http://www.amazon.com/gp/help/customer/display.html?nodeId=10412241/RK=0
    http://wenku.baidu.com/view/280acdc0524de518964b7df6.html?re=view
    First off, I don't understand why the fuck it keeps putting /RK=0 at the end of the URLs.
    Secondly, these are not even Elgg-based websites. If I go to these pages, my phrase isn't even there.

    Any idea how I can fix this?
     
  2. ziplack

    ziplack Supreme Member

    Your footprint is wrong.
     
  3. ziplack

    ziplack Supreme Member

    Use these:

    "Powered By Elgg"
    inurl:"/social/register" "powered by Elgg"
    inurl:"/groups/profile/" "powered by Elgg"
    inurl:"/discussion/view/" "powered by Elgg"
     
  4. hnmfvnxe

    hnmfvnxe Regular Member

    The /RK=0 comes from scraping Yahoo/Bing. This has been an issue for months now and they don't bother to fix it.

    On the other hand, when I told Sven from GSA SER about the "/RK=0", he fixed it the same day.
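
    Until it gets fixed, you can clean an exported list yourself. Here is a rough Python sketch, not anything built into SB. The function names `strip_rk_suffix` and `clean_harvest` are made up for illustration, and it assumes the junk is always the exact trailing "/RK=0" seen in the list above.

```python
def strip_rk_suffix(url: str) -> str:
    """Remove one trailing '/RK=0' from a harvested URL, if present."""
    suffix = "/RK=0"  # assumption: the junk is exactly this suffix
    if url.endswith(suffix):
        url = url[: -len(suffix)]
    return url


def clean_harvest(urls):
    """Strip the suffix from every URL and drop duplicates, keeping order."""
    seen = set()
    cleaned = []
    for raw in urls:
        url = strip_rk_suffix(raw.strip())
        if url and url not in seen:
            seen.add(url)
            cleaned.append(url)
    return cleaned
```

    Feed it the lines of your exported list and write the result back out. It only fixes the URL suffix; it does nothing about irrelevant results.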
     
  5. hnmfvnxe

    hnmfvnxe Regular Member

    OP, using a different footprint variation like "Display name" "Password (again for verification)" works fine. "inurl"-type search operators get you banned fast.
     
  6. hatblackguy

    hatblackguy Regular Member

    Did you update to the latest version? Try asking support; they are fast to answer.