1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

ScrapeBox Forum Profile Sorting (Need to get the proper url for the register pages)

Discussion in 'Black Hat SEO Tools' started by lewi, Sep 7, 2010.

  1. lewi

    lewi Jr. VIP Jr. VIP Premium Member

    Joined:
    Aug 5, 2008
    Messages:
    2,309
    Likes Received:
    818
    Hey,

    I have gathered a list of around 10k forums so far (smallish list i know but just wanted to do some tests before i ended up with a huge list that i couldn't do anything with)

    However the list has a mix of url's all pointing to different pages on different forums (some to profiles, others to the home pages and others to actual topics).

    For my software what i need is to 'convert' these lists so that they give the direct url's to the sites register page.

    How could i do this with scrapebox?

    I thought about just using find and replace and trying /forum/register.php but not all sites are coded like that and some have changed the pages and names which i don't really want to miss out.

    Plus im sure SB could find the pages somehow right?

    Thanks

    Lewi
     
  2. virtualc08

    virtualc08 Supreme Member

    Joined:
    Mar 23, 2010
    Messages:
    1,379
    Likes Received:
    951
    I think this might help :)

    Code:
    http://www.blackhatworld.com/blackhat-seo/black-hat-seo-tools/230815-scrapebox-footprints.html
     
  3. crazyflx

    crazyflx Elite Member

    Joined:
    Nov 9, 2009
    Messages:
    1,674
    Likes Received:
    4,825
    Location:
    http://CRAZYFLX.COM
    Home Page:
    What kind of forums are they?
     
  4. lewi

    lewi Jr. VIP Jr. VIP Premium Member

    Joined:
    Aug 5, 2008
    Messages:
    2,309
    Likes Received:
    818
    Well i have a mix list which i have been compiling manually for the last month from what people have shared around the net!

    But the other lists i have are smf and vbulletin!

    I suppose then each forum type has a different footprint for like 99% of their sites right? So then would a search and replace work?

    Lewi
     
  5. mtime88

    mtime88 Regular Member

    Joined:
    Mar 10, 2008
    Messages:
    329
    Likes Received:
    79
    Occupation:
    Student
    Location:
    Near Philadelphia
    uhh use the footprints in scrapebox to only scrape register pages...

    along the lines of inurl:register.php
     
  6. lewi

    lewi Jr. VIP Jr. VIP Premium Member

    Joined:
    Aug 5, 2008
    Messages:
    2,309
    Likes Received:
    818
    So your saying use the site operator first to get all the links of the site and then filter looking just for the register.php pages?

    Or are you saying just trim to the root domain and then use inurl on the root domains to find what should be the forum register url?

    Probably a silly question but i am only running on a gold rush mind atm due to lack of sleep and loads of technical knowledge over the past 24 hours.

    Lewi
     
  7. lewi

    lewi Jr. VIP Jr. VIP Premium Member

    Joined:
    Aug 5, 2008
    Messages:
    2,309
    Likes Received:
    818
    Sorry for the bump but this seems to have got buried!

    Lewi
     
  8. jb2008

    jb2008 Senior Member

    Joined:
    Jul 15, 2010
    Messages:
    1,158
    Likes Received:
    972
    Occupation:
    Scraping, Harvesting in the Corn Fields
    Location:
    On my VPS servers
    As above, you need to find the part of the register page url that is common to all register pages specifically, then use the inurl: operator to reflect that.
     
  9. cody41

    cody41 Power Member

    Joined:
    Jun 18, 2009
    Messages:
    682
    Likes Received:
    274
    Location:
    Texas
    Do this: post up the list of forums, and i'll do my best to recompile the list with their registration pages.
     
  10. quadratic

    quadratic Registered Member

    Joined:
    Oct 26, 2009
    Messages:
    69
    Likes Received:
    46
    I assume that the list you have is composed of links from many different types of forums. The issue this presents is that there is not a common footprint for a registration page across all these as some may be on register.php, some may be /register/ etc.

    Suggest you first separate out each type of forum. You can use Scrapebox for this - SweetFunny mentioned before that " The Link Checker in ScrapeBox is essentially a string checker, so it can check for anything in a pages source. "

    So you could put something like: "Powered by vbulletin" etc into the text file which would normally hold the list of websites to check, use Scraprbox to check these 'links' on each of the urls you have recorded, and then export the list of url where the 'link' = the search string was found.

    Once you have each forum type in a separate file you can use the registration url footprint for the specific forum type. Use Scrapebox to strip the urls down to the domain and then append the registration url onto the end of each domain.
     
    • Thanks Thanks x 4
  11. dandan

    dandan Regular Member

    Joined:
    Jan 7, 2009
    Messages:
    240
    Likes Received:
    50
    NVM... sorted it out...
     
    Last edited: Sep 28, 2010