Fun On Rails: Hpricot scraping in ruby

Wednesday, December 2, 2009

Hpricot scraping in ruby

by Sandip Ransing 1 comments

Include gems/library required before getting started

require 'hpricot'
require 'net/http'
require 'rio'

# Pass website url to be scraped
url = "www.funonrails.com"

# Define filename to store file locally
file = "temp.html"
# Save page locally
rio(url) < rio (file)

# Open page through hpricot
doc = Hpricot(open(file))

Apply hpricot library to get right contents

doc.at("div.pageTitle")
doc/"div.pageTitle"
doc.search("div.entry")
doc//"div.pageTitle"

Hpricot API Reference click here

Include gems/library required before getting started

require 'hpricot'
require 'net/http'
require 'rio'

# Pass website url to be scraped
url = "www.funonrails.com"

# Define filename to store file locally
file = "temp.html"
# Save page locally
rio(url) < rio (file)

# Open page through hpricot
doc = Hpricot(open(file))

Apply hpricot library to get right contents

doc.at("div.pageTitle")
doc/"div.pageTitle"
doc.search("div.entry")
doc//"div.pageTitle"

Hpricot API Reference click here

Fun On Rails

A Ruby and Rails Blog

Wednesday, December 2, 2009

Hpricot scraping in ruby

About The Author

Connect With Me...

Labels

Github Projects

@sandipransing Twitter

Blog Links

Recent Posts

Followers

FACEBOOK

Fun On Rails

A Ruby and Rails Blog

Wednesday, December 2, 2009

Hpricot scraping in ruby

About The Author

Connect With Me...

Labels

Github Projects

Blog Archive

@sandipransing Twitter

Blog Links

Recent Posts

Followers

FACEBOOK