1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

scrapebox lazy loading + external link issue

Discussion in 'Black Hat SEO' started by staypositive, Aug 9, 2016.

  1. staypositive

    staypositive Jr. VIP Jr. VIP

    Joined:
    Jul 28, 2015
    Messages:
    984
    Likes Received:
    130
    Occupation:
    Hiring content writer & VA for links building, PM
    Hi all guru and pros,

    I learnt about using SB from BHW and got the low pricing purchase.

    simply put, I wanna scrape emails, but i encounter two website that I just dunno why it doesnt work properly.

    1. For below link, it has to be scrolled down to enable all of the exhibitors name to be shown, the problem now is that there's just close to unlimited number of pages, is there a way it can scrape all of the links?

    Link:
    http://www.prowein.com/vis/v1/en/se...=2&_query=&f_type=profile&f_tag=prowein2016.6


    2. Second question is, i used the addon for link extractor, the internal one worked fine,but for external, it just doesnt work, e.g. , for link:
    http://www.worldbulkwine.com/ing/expositores2015_detalle.php?cod=20157141

    when i click external link extract, it just extract 4 links, which is twitter, facebook, youtube link,

    however, it just wont extract www.grupopenaflor dot com dot ar

    i just dunno what happen.

    can anyone advise please? Thanks in advance!!!!


    ( admin, sorry if i posted to wrong page, I am new to BHW , if i posted here wrongly or posted the above link breaking the rules of bhw, i am more than happy to amend it. )
     
  2. staypositive

    staypositive Jr. VIP Jr. VIP

    Joined:
    Jul 28, 2015
    Messages:
    984
    Likes Received:
    130
    Occupation:
    Hiring content writer & VA for links building, PM
  3. Stephen Kurt

    Stephen Kurt BANNED BANNED

    Joined:
    Jan 15, 2016
    Messages:
    50
    Likes Received:
    10
    Code:
    <a target="_blank" href=http://www.grupopenaflor.com.ar><span class="down">www.grupopenaflor.com.ar</span></a>
    There're more than 80 links on worldbulkwine; Only grupopenaflor's attribute href's value hasn't been quoted by ", this can be improved.
    jQuery's sizzle engine's chunker has taken consideration of all those situations.
     
  4. staypositive

    staypositive Jr. VIP Jr. VIP

    Joined:
    Jul 28, 2015
    Messages:
    984
    Likes Received:
    130
    Occupation:
    Hiring content writer & VA for links building, PM
    thanks a lot for your reply

    i apologize if i am so newbie.

    so may i know how can i make them work?
     
  5. Winston_

    Winston_ BANNED BANNED

    Joined:
    Jul 9, 2016
    Messages:
    41
    Likes Received:
    11
    Gender:
    Male
    Here's prowein.txt, save it as PROWEIN.dat for example; and in the Havester Engine Configuration, click import and select "add non-existing engines" and import it. I tested it's working fine but you need some additional improvements.
    start.jpg 360截图20160811200518377.jpg
    360截图20160811200548848.jpg
    After have all the urls, then extract their emails.

    Custom Data Grabber:
    Custom Data Grabber.png
    Grab.jpg
    Before the link is href=
    after the link is a white space

    Grab2.jpg
    Or you can extract between <a and </a.
     

    Attached Files:

    • Thanks Thanks x 1
  6. staypositive

    staypositive Jr. VIP Jr. VIP

    Joined:
    Jul 28, 2015
    Messages:
    984
    Likes Received:
    130
    Occupation:
    Hiring content writer & VA for links building, PM
    BIG BIG THANKS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
    BIG BIG THANKS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
    BIG BIG THANKS!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!