1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How to scrape non-clickable links with Scrabebox??

Discussion in 'Black Hat SEO' started by frankweerasinghe, May 1, 2017.

  1. frankweerasinghe

    frankweerasinghe Regular Member

    Joined:
    Jun 6, 2011
    Messages:
    467
    Likes Received:
    395
    Location:
    Colombo, Sri Lanka
    I have a list of links that contains URL's in non-clickable format in their page source. I need to extract them. I tried with Scrapebox link extractor plugin but it doesn't extract those non-clickable (non a href) urls.

    Is their a way to do this?

    Thanks!
     
  2. Unavailable

    Unavailable Junior Member

    Joined:
    Mar 24, 2015
    Messages:
    116
    Likes Received:
    16
    It can be done via custom ways
     
  3. frankweerasinghe

    frankweerasinghe Regular Member

    Joined:
    Jun 6, 2011
    Messages:
    467
    Likes Received:
    395
    Location:
    Colombo, Sri Lanka
    What did you mean? With or without scrapebox? :rolleyes:
     
  4. alexel

    alexel Jr. VIP Jr. VIP

    Joined:
    Feb 20, 2015
    Messages:
    239
    Likes Received:
    101
    Location:
    New York
    Well, are you talking about links generated by JavaScript?
     
    • Thanks Thanks x 1
  5. Brian Alexander

    Brian Alexander Regular Member UnGagged Attendee

    Joined:
    Aug 12, 2016
    Messages:
    200
    Likes Received:
    110
    Gender:
    Male
    A link is by definition clickable (originates from hyperlink) so scrapebox is already able to extract links.
    If what you are looking for is a way to extract url's from a text then the easiest way would probably be to code a custom tool that uses regex.
     
    • Thanks Thanks x 1
  6. bartosimpsonio

    bartosimpsonio Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 21, 2013
    Messages:
    12,748
    Likes Received:
    11,412
    Occupation:
    COINZ
    Location:
    BUYAH
    Home Page:
    My guess is he's talking about plaintext urls.
     
    • Thanks Thanks x 2
  7. alexel

    alexel Jr. VIP Jr. VIP

    Joined:
    Feb 20, 2015
    Messages:
    239
    Likes Received:
    101
    Location:
    New York
    Lol, just hit Command + A and copy all texts.
     
  8. bartosimpsonio

    bartosimpsonio Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 21, 2013
    Messages:
    12,748
    Likes Received:
    11,412
    Occupation:
    COINZ
    Location:
    BUYAH
    Home Page:
    The idea is to not do that by hand though.
     
    • Thanks Thanks x 1
  9. alexel

    alexel Jr. VIP Jr. VIP

    Joined:
    Feb 20, 2015
    Messages:
    239
    Likes Received:
    101
    Location:
    New York
    He said a list of links, probably meaning one page. By the time you try and figure out an automated solution, you can paste all the links to a file editor go from there!
     
    • Thanks Thanks x 1
  10. bartosimpsonio

    bartosimpsonio Jr. VIP Jr. VIP Premium Member

    Joined:
    Mar 21, 2013
    Messages:
    12,748
    Likes Received:
    11,412
    Occupation:
    COINZ
    Location:
    BUYAH
    Home Page:
    If it's a single page there's probably a Perl one liner regex that'll do.
     
  11. redarrow

    redarrow Elite Member

    Joined:
    Apr 1, 2013
    Messages:
    5,940
    Likes Received:
    1,430
    This works

    $string = "this is my friend's website http://example.com I think it is cool, but this is cooler http://www.memelpower.com :)";
    $regex = '/\b(https?|ftp|file):\/\/[-A-Z0-9+&@#\/%?=~_|$!:,.;]*[A-Z0-9+&@#\/%=~_|$]/i';
    preg_match_all($regex, $string, $matches);
    $urls = $matches[0];
    // go over all links
    foreach($urls as $url)
    {
    echo $url.'<br />';
    }



    Try this http://www.web-max.ca/PHP/misc_23.php

    Read what it says very easy to use..
     
    • Thanks Thanks x 1
  12. redarrow

    redarrow Elite Member

    Joined:
    Apr 1, 2013
    Messages:
    5,940
    Likes Received:
    1,430
    • Thanks Thanks x 1
  13. frankweerasinghe

    frankweerasinghe Regular Member

    Joined:
    Jun 6, 2011
    Messages:
    467
    Likes Received:
    395
    Location:
    Colombo, Sri Lanka
    Hey guys! Thanks for the replies.. but what I mean is I have a list of links that contain plantext links in each of their page sources. So I have to load each of them and extract the urls in their page sources.
     
  14. frankweerasinghe

    frankweerasinghe Regular Member

    Joined:
    Jun 6, 2011
    Messages:
    467
    Likes Received:
    395
    Location:
    Colombo, Sri Lanka
    Anyone?
     
  15. Brian Alexander

    Brian Alexander Regular Member UnGagged Attendee

    Joined:
    Aug 12, 2016
    Messages:
    200
    Likes Received:
    110
    Gender:
    Male
    Like I previously said, you probably need a custom solution for this.
    It won't take a coder more than 15 minutes to do though.
     
  16. frankweerasinghe

    frankweerasinghe Regular Member

    Joined:
    Jun 6, 2011
    Messages:
    467
    Likes Received:
    395
    Location:
    Colombo, Sri Lanka
    Mmmmm