How to extract website data with the Firefox iMacros add-on?

Discussion in 'Black Hat SEO Tools' started by chocolatemonkey, Jun 11, 2010.

  1. chocolatemonkey

    chocolatemonkey Junior Member

    Joined:
    Dec 20, 2009
    Messages:
    197
    Likes Received:
    29
    In the iMacros video tutorials, they use an "extract data" button to extract data from a website. However, when I open iMacros, I can't find the "extract data" button under the "record" tab as shown in the video. It appears the demo videos use an older version of iMacros, so where is the "extract data" button in the newest version?
     
  2. thelock

    thelock Newbie

    Joined:
    May 14, 2010
    Messages:
    9
    Likes Received:
    1
    Location:
    Durham, UK
    Not sure what page you are trying to scrape, but this is what I use for scraping the summary sections of ezine articles (it's easy to hand-code once you have a basic understanding):

    TAG POS=1 TYPE=TEXTAREA FORM=NAME:NoFormName ATTR=ID:article-summary EXTRACT=TXT

    You should be able to get some working code running with that. If not, let me know and I will spoon-feed you some more.
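For readers who want to see the same idea outside iMacros, here is a minimal sketch in Python, not part of the original macro. It picks the textarea by its ID and returns its text, which is roughly what the TAG line does with ATTR=ID:article-summary and EXTRACT=TXT. The class name, the function name, and the sample HTML are made up for illustration; only the article-summary ID comes from the macro above.

```python
from html.parser import HTMLParser


class TextareaById(HTMLParser):
    """Collect the text inside the <textarea> whose id matches target_id."""

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.inside = False
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; look up the id attribute.
        if tag == "textarea" and dict(attrs).get("id") == self.target_id:
            self.inside = True

    def handle_endtag(self, tag):
        if tag == "textarea":
            self.inside = False

    def handle_data(self, data):
        if self.inside:
            self.chunks.append(data)


def extract_textarea(html, target_id):
    """Return the stripped text of the matching textarea, or "" if none."""
    parser = TextareaById(target_id)
    parser.feed(html)
    parser.close()
    return "".join(parser.chunks).strip()


# Hypothetical page fragment, only for demonstration.
sample = '<form><textarea id="article-summary">A short summary.</textarea></form>'
print(extract_textarea(sample, "article-summary"))
```

This only covers the "find the element and take its text" step. Loading the page and saving the result are left out.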
     
  3. chocolatemonkey

    chocolatemonkey Junior Member

    PM'd you, lock. Thanks for the spoon-feeding! ;)
     
  4. purewealthinc

    purewealthinc Regular Member

    Joined:
    May 3, 2010
    Messages:
    427
    Likes Received:
    383
    Occupation:
    Web Fishing
    Location:
    World Wide Web City

    Hi thelock, I know this is an old post, but I have a question about iMacros, mate. Please help!
    I really want to learn more about iMacros.

    I started using iMacros a month ago and have learned a lot, especially about data extraction. For example, I can extract every member of a website if the user IDs are numeric only.

    Now I want to learn how to extract alphabetic values,
    like Twitter usernames. Can we extract Twitter usernames using iMacros? I can extract only one username, and I don't know how to loop over alphabetic values or how to make iMacros crawl the members' usernames only. This is what I really want to learn: how to extract usernames from any website.

    For example: twitter dot com/Mandru28

    The username is Mandru28, and there are millions of Twitter members. I want to learn how to make iMacros crawl and extract all of those usernames.

    I need help, mate. I will hit the thanks button for you more soon! :)
     
  5. roknimam

    roknimam Regular Member

    Joined:
    May 13, 2010
    Messages:
    202
    Likes Received:
    217
    Occupation:
    I enjoy good company and good times however I'm no
    Location:
    {Mind|Desk,Bead}
  6. purewealthinc

    purewealthinc Regular Member

     
  7. indiarocks

    indiarocks Newbie

    Joined:
    Jun 17, 2012
    Messages:
    8
    Likes Received:
    0
    Occupation:
    Student
    Location:
    West Bengal
    How can I get the highlighted value from a webpage in iMacros?


    <font color="maroon" size="2"><b>Timer</b>: <span id="count1820380num">50</span>/133</font>


    The span ID is dynamic, and the numbers 50 and 133 are dynamic too. Any help with this is appreciated!
     
  8. HoNeYBiRD

    HoNeYBiRD Jr. VIP

    Joined:
    May 1, 2009
    Messages:
    5,913
    Likes Received:
    7,150
    Gender:
    Male
    Occupation:
    Geographer, Tourism Manager
    Location:
    Ghosted
    Can't you use the asterisk (*) wildcard?

    If you can't, you can extract the whole HTML of every page you want and have the macro save it into a .csv file. Open the file with Notepad++ and bookmark the lines that contain a common string (e.g. <b>Timer</b>: <span id="). Under Search/Bookmarks you can remove the non-marked lines, which gets rid of all the redundant lines and leaves only the marked ones. Once you have those lines, you can use MS Excel to split them into columns on a specific character with the Convert Text to Columns Wizard. If you want the number 133 as the result, set the delimiter to / and complete the wizard. Then highlight the column that holds your numbers and run the Convert Text to Columns Wizard again, this time with the delimiter set to <. All the desired numbers will end up in the same column, one per line, with the redundant data split off into other columns.
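For anyone who would rather script this cleanup than do it by hand in Notepad++ and Excel, here is a minimal sketch in Python (a swapped-in approach, not iMacros). It assumes the saved HTML has the same shape as the snippet posted above, with <b>Timer</b>: followed by a span whose ID can be anything. The pattern and function names are made up for illustration.

```python
import re

# Assumes the markup shape from the post:
#   <b>Timer</b>: <span id="ANYTHING">N</span>/M
# The span id is matched loosely because it changes from page to page.
TIMER_RE = re.compile(r'<b>Timer</b>:\s*<span id="[^"]*">(\d+)</span>/(\d+)')


def extract_timers(html):
    """Return a list of (current, total) integer pairs found in the HTML."""
    return [(int(current), int(total)) for current, total in TIMER_RE.findall(html)]


# The sample line is the one quoted earlier in the thread.
sample = (
    '<font color="maroon" size="2"><b>Timer</b>: '
    '<span id="count1820380num">50</span>/133</font>'
)
print(extract_timers(sample))  # [(50, 133)]
```

The regular expression does the job of both steps in the manual route: it keeps only the lines with the common string and pulls the two numbers out without needing the / and < delimiters.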
     
    Last edited: Oct 31, 2016