1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

How to make functional scraping of images from Amazon?

Discussion in 'General Programming Chat' started by ThreadKiller, Dec 15, 2013.

  1. ThreadKiller

    ThreadKiller Power Member

    Joined:
    Jan 31, 2012
    Messages:
    626
    Likes Received:
    304
    Location:
    Never too long in one place.
    Hi all.

    So here's an Amazon example page http://www.amazon.com/gp/product/B0...pf_rd_t=101&pf_rd_p=1630083462&pf_rd_i=507846

    I want to scrape all the expanded images, preferably just with one click.

    Been trying it out with iMacros but I have failed to make it work.

    Also tried a lot of other tools on the market, webharvy etc, but they can not handle flash it seems.
    I have had the most success with the ZIG image grabber plugin, but I can not get consistent results with that.

    Any ideas?
     
  2. sohom

    sohom Senior Member

    Joined:
    May 26, 2013
    Messages:
    990
    Likes Received:
    175
    Location:
    not in Past
    I can make a bot for,as per your requirement
    check PM
     
  3. n0money

    n0money Newbie

    Joined:
    Dec 19, 2013
    Messages:
    20
    Likes Received:
    6
    1. Crack open the resources section in firebug or whatever developer tools of your browser you have. Look for images on that page that are big.

    2. Open up the page source.

    3. Search for the big image names

    - For example, the scope image is named: bla
    - Look for it in the page source.

    4. Now look at the source of the rest of the big images. Identify a pattern that distinguishes them from the smaller ones. Depending on the complexity of the pattern, you might be able to get away with something like wget to scrape them. Worst case scenario you have to code your own scraper.

    5. Good luck.