How to scrape content

Discussion in 'White Hat SEO' started by cucurucu, Apr 4, 2012.

  1. cucurucu

    cucurucu Regular Member

    Joined:
    Dec 6, 2008
    Messages:
    232
    Likes Received:
    152
    So basically I want to load a list of URLs, scrape only the prices, and send the results to a txt/csv file. Can this be done with ScrapeBox or any other software?
     
  2. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    URLs from the same domain or from different domains?

    A simple HTML grabber with a little regex can do it easily if you want product prices from a specific website.
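
    As a rough illustration of the "little regex" idea, here is a minimal Python sketch. It assumes prices appear as a currency symbol followed by digits (for example "$1,299.99"); the pattern, the `find_prices` name, and the sample markup are all hypothetical and would need adjusting to the real site.

```python
import re

# Hypothetical price format: a currency symbol, optional space, digits with
# optional thousands separators, and an optional two-digit decimal part.
PRICE_RE = re.compile(r"[$€£]\s?\d+(?:,\d{3})*(?:\.\d{2})?")


def find_prices(html: str) -> list[str]:
    """Return every substring of `html` that looks like a price."""
    return PRICE_RE.findall(html)


# Made-up markup standing in for a downloaded product page.
sample = '<span class="price">$1,299.99</span> <span class="old">$1,499.00</span>'
print(find_prices(sample))
```

    A pattern like this picks up every price-shaped string on the page, including old or crossed-out prices, so it works best when the pages share one predictable layout.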
     
  3. cucurucu

    cucurucu Regular Member

    Joined:
    Dec 6, 2008
    Messages:
    232
    Likes Received:
    152
    URLs from the same domain.

    Can you please elaborate on the regex stuff a little more?
     
    Last edited: Apr 4, 2012
  4. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    You need to extract product prices from a single website; that should be really easy to do.

    For regex, just see the wiki article, which explains it clearly. (Sorry, I can't post a URL link.)

    But since you said it's only for one website, I assume you don't really need any regex, just HTML parsing.

    If you PM me the website URL, I can see which way would be best for you to extract product prices from this website.
     
    Last edited: Apr 4, 2012
  5. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    PM read, but I can't answer by PM.

    Your site is really simple, so extracting the price is no problem.

    I'd suggest a basic C# application which:
    - gets the URL data through WebRequest
    - uses the HtmlAgilityPack lib to parse the DOM and extract the name and price of the product
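
    The two steps above can be sketched as follows. This is not the C# application described in the post: it swaps in Python's standard library (`urllib.request` for the download, `html.parser` in place of HtmlAgilityPack). The class names `product-name` and `price` are assumptions about the page markup, and `fetch` and `parse_product` are hypothetical helper names.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


def fetch(url: str) -> str:
    """Download a page and return its HTML as text (assumes UTF-8)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


class ProductParser(HTMLParser):
    """Collect the text of elements whose class marks a name or a price."""

    # Hypothetical CSS class names; the real site will use its own.
    FIELDS = {"product-name": "name", "price": "price"}

    def __init__(self):
        super().__init__()
        self.result = {"name": "", "price": ""}
        self._current = None  # field being read right now, if any

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        for cls in classes:
            if cls in self.FIELDS:
                self._current = self.FIELDS[cls]

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current field, so
        # markup nested inside a name or price element is not handled.
        self._current = None

    def handle_data(self, data):
        if self._current:
            self.result[self._current] += data.strip()


def parse_product(html: str) -> dict:
    """Return {'name': ..., 'price': ...} extracted from one product page."""
    parser = ProductParser()
    parser.feed(html)
    parser.close()
    return parser.result


# Made-up page; with a real site this would be parse_product(fetch(url)).
page = '<h1 class="product-name">Blue Widget</h1><span class="price">$19.99</span>'
print(parse_product(page))
```

    Parsing the document structure instead of pattern-matching the raw text means the price is only read from the element that is supposed to hold it.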

    If you don't understand what I'm talking about, someone could compile it for you.

    I could make it for you for free if you are interested; it shouldn't take me much time.
     
  6. cucurucu

    cucurucu Regular Member

    Joined:
    Dec 6, 2008
    Messages:
    232
    Likes Received:
    152
    Well, I am not a developer, so I really can't do this by myself.
     
  7. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    No Problem!

    I'm at work; I'll finish something and afterwards I will compile a little application.
     
  8. rodrigax

    rodrigax Newbie

    Joined:
    Mar 5, 2012
    Messages:
    27
    Likes Received:
    3
    Occupation:
    Reputation Management Specialist and Conversion Ra
    I also need help scraping a site for its content. This is a dictionary-type site with thousands of terms defined. Any help is appreciated. I am also not a developer and I am willing to pay for this service.
     
  9. cucurucu

    cucurucu Regular Member

    Joined:
    Dec 6, 2008
    Messages:
    232
    Likes Received:
    152
    thanks a lot
     
  10. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    OK, I've finished.
    It extracts the product name and price on each page.
    You can add as many pages as you wish, but it's a single-threaded application, so I suggest you start with no more than ten pages at first.
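
    For readers curious what a single-threaded run over a list of pages might look like, here is a hedged Python sketch. It is not the application mentioned in this post. The `scrape_to_csv` name, the `limit` parameter, and the stand-in `fetch` and `extract` callables are all hypothetical; in practice they would download a page and pull out the name and price.

```python
import csv
import io


def scrape_to_csv(urls, fetch, extract, out, limit=10):
    """Process up to `limit` URLs one after another and write CSV rows.

    `fetch(url)` returns the page HTML, `extract(html)` returns a
    (name, price) pair, and `out` is any writable text file object.
    Returns the number of pages written.
    """
    writer = csv.writer(out)
    writer.writerow(["url", "name", "price"])
    done = 0
    for url in urls[:limit]:
        try:
            name, price = extract(fetch(url))
        except Exception as exc:  # one bad page should not stop the run
            print(f"skipped {url}: {exc}")
            continue
        writer.writerow([url, name, price])
        done += 1
    return done


# In-memory stand-ins so the sketch runs without touching the network.
fake_pages = {"http://example.com/a": "Widget|$1.00"}
buffer = io.StringIO()
count = scrape_to_csv(
    ["http://example.com/a", "http://example.com/missing"],
    fetch=fake_pages.__getitem__,
    extract=lambda html: tuple(html.split("|")),
    out=buffer,
)
print(count)
print(buffer.getvalue())
```

    Because the pages are fetched one at a time, the run time grows with the length of the list, which is one reason to begin with a small batch.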

    Which version of Windows are you using?
    I need to know for compilation.

    PS: send me your email by PM so I can send you the application.
     
    Last edited: Apr 4, 2012
  11. roster67

    roster67 Registered Member

    Joined:
    Mar 27, 2012
    Messages:
    69
    Likes Received:
    10
    Of course, I can help.
    I need more info about your project before giving you any advice or help.

    PM me the URL of your site.