1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How can you parse harvested wiki links via scrapebox ?

Discussion in 'Black Hat SEO' started by ComputerEngineer, Feb 20, 2013.

  1. ComputerEngineer

    ComputerEngineer Senior Member

    Joined:
    Apr 25, 2012
    Messages:
    833
    Likes Received:
    70
    How can you parse harvested wiki links via scrapebox ?

    I harvested a lot of links and need to parse them. I mean get the real wiki links out of them

    Any freeway available ? i own scrapebox atm. I also can code a software.
     
  2. rkwebs

    rkwebs Power Member

    Joined:
    Sep 23, 2010
    Messages:
    602
    Likes Received:
    87
    Occupation:
    IT
    Location:
    India
    Home Page:
    Use this footprint and in coustum footprints and Put some keyword and hite Harvest

    site:.edu inurl:wiki
    site:.edu inurl:MediaWiki_talk
    site:.edu "Log in / create account"
    site:.edu wiki
     
  3. ComputerEngineer

    ComputerEngineer Senior Member

    Joined:
    Apr 25, 2012
    Messages:
    833
    Likes Received:
    70
    did you really read the question :)

    i already harvested. need to way for parse found links :)
     
  4. dennica

    dennica Jr. VIP Jr. VIP Premium Member

    Joined:
    Dec 17, 2012
    Messages:
    820
    Likes Received:
    197
    Home Page:
    what do you mean by parsE?
    do you mean on identifying the real wiki results?

    I guess the only way is to crawl the site and check if its a wiki template..
     
  5. trustedfire9

    trustedfire9 Jr. VIP Jr. VIP Premium Member

    Joined:
    Jun 15, 2010
    Messages:
    2,116
    Likes Received:
    1,786
    • Thanks Thanks x 2
  6. ComputerEngineer

    ComputerEngineer Senior Member

    Joined:
    Apr 25, 2012
    Messages:
    833
    Likes Received:
    70
    can we say all wikis have index.php ?
     
  7. welie

    welie Junior Member

    Joined:
    May 16, 2010
    Messages:
    137
    Likes Received:
    41
    The best way will be to get your wiki software to post on the wikis with some garbage link. Then use Scrapebox to find the live links.
    The downside is you'll probably burn a lot of captchas.

    And I think only MediaWiki sites have index.php and not others like WikkaWiki etc.