1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

[SCRAPEBOX] Tip for harvesting pages in specific language

Discussion in 'Black Hat SEO Tools' started by Salamone, Aug 8, 2012.

  1. Salamone

    Salamone Newbie

    Joined:
    Jul 5, 2012
    Messages:
    12
    Likes Received:
    6
    I was having some trouble in harvesting urls in a specific language, and was not getting good results...but then I came up with a good solution to this problem that I want to share.

    This method works only with G searches.

    From the Scrapebox "Select Engines & Proxies" tab select the drop down list near G.
    Choose "add more Gxxx" and in the pop up fill in like this:

    g****e.de+&lr=lang_de (example for targeting german pages)

    Then when scraping make sure you select that engine from the drop down list.

    So basically for every language/country you want to target create one entry:
    g****e.fr+&lr=lang_fr (french)
    g****e.es+&lr=lang_es (spanish)
    g****e.it+&lr=lang_it (italian)

    and so on...

    In this method you can "append" any query request parameters to your searches.

    You can get more ideas on how to use this with the G API reference from the developerìs site.

    PS: Thanks to Loopline's unofficial FAQ, question 9 in the Scraping section was my inspiration.
     
    • Thanks Thanks x 6
  2. ija61

    ija61 Senior Member

    Joined:
    Mar 2, 2011
    Messages:
    960
    Likes Received:
    634
    Gender:
    Male
    Occupation:
    The first SEO economist:)
    Location:
    Romania
    Home Page:
    Nice... You came on the right moment... I just start to search for a solution to that problem...:)

    And since we got here this solution may be used also for rank tracking, it must be adapt but as an idea is great.

    Thank you
     
  3. rabbit2

    rabbit2 Newbie

    Joined:
    Aug 2, 2012
    Messages:
    38
    Likes Received:
    3
    why don't you share working version of scrapebox?
     
  4. wpbacklinks

    wpbacklinks Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 27, 2010
    Messages:
    3,399
    Likes Received:
    1,339
    Gender:
    Male
    Occupation:
    Affiliate Marketer
    Location:
    Everywhere
  5. nipester

    nipester Regular Member

    Joined:
    Feb 1, 2009
    Messages:
    256
    Likes Received:
    28
    Doesn't sb itself say you can add google.de+de ? Where did the lr come from and the longer stuff after that?
     
  6. loopline

    loopline Jr. VIP Jr. VIP

    Joined:
    Jan 25, 2009
    Messages:
    3,383
    Likes Received:
    1,801
    Gender:
    Male
    Home Page:
    Yes, and that is the easier method, I was going to just post it up, but you beat me to it. :)

    As in Opps example he made it to be

    g****e.fr+&lr=lang_fr (french)
    g****e.es+&lr=lang_es (spanish)
    g****e.it+&lr=lang_it (italian)

    Scrapebox supports the simpler version.

    g****e.fr+fr (french)
    g****e.es+es (spanish)
    g****e.it+it (italian)

    Says it right in scrapebox on the section the OP is talking about.

    [​IMG]


    Great effort OP, but if you read just a tad, prob could have saved yourself some extra work. ;)
     
    • Thanks Thanks x 2