HTML parsing

1096

Watchers: 816
Forks: 56

1. nokogiri

Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser with XPath and CSS selector support.
Last commit: about 7 hours ago

On the web

GitHub: tenderlove/nokogiri

Home: nokogiri.org/

As a Ruby Gem

Rubyforge: nokogiri (Current version: 1.4.1)

gem install nokogiri

GitHub: tenderlove-nokogiri (Current version: 0.0.0.20081001111445)

gem install tenderlove-nokogiri --source "http://gems.github.com"

In the news

Final Status Update (or How to get Nokogiri in JRuby without(...) 6 months ago

Web Scraping.. Learn the Basic 6 months ago

DigitalNZ client library for ruby 7 months ago

491

Watchers: 336
Forks: 31

2. hpricot

A swift, liberal HTML parser with a fantastic library
Last commit: about 1 month ago

Documentation

RDoc: rdoc.info/projects/why/hpricot

As a Ruby Gem

Rubyforge: hpricot (Current version: 0.8.2)

gem install hpricot

GitHub: why-hpricot (Current version: 0.7.229)

gem install why-hpricot --source "http://gems.github.com"

359

Watchers: 234
Forks: 25

3. scrubyt

A simple to learn and use, yet powerful web scraping toolkit!
Last commit: about 1 month ago

On the web

GitHub: scrubber/scrubyt

Home: scrubyt.org

Documentation

GitHub Wiki: wiki.github.com/scrubber/scrubyt (2 pages)

As a Ruby Gem

Rubyforge: scrubyt (Current version: 0.5.1)

gem install scrubyt

GitHub: scrubber-scrubyt (Current version: 0.4.30)

gem install scrubber-scrubyt --source "http://gems.github.com"

In the news

Well, it seems there are no news about scrubber/scrubyt yet...

42

Watchers: 61
Forks: 1
35% Penalty

4. scrapi

scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.
Last commit: about 1 year ago

On the web

GitHub: assaf/scrapi

Documentation

RDoc: rdoc.info/projects/assaf/scrapi

As a Ruby Gem

Rubyforge: scrapi (Current version: 1.2.0)

gem install scrapi

GitHub: assaf-scrapi (Current version: 1.2.1)

gem install assaf-scrapi --source "http://gems.github.com"

In the news

Well, it seems there are no news about assaf/scrapi yet...

1

Watchers: 3
Forks: 0
35% Penalty

5. libxml-ruby

Libxml bindings for Ruby
Last commit: about 1 year ago

On the web

GitHub: cfis/libxml-ruby

Home: libxml.rubyforge.org/

As a Ruby Gem

Rubyforge: libxml-ruby (Current version: 1.1.3)

gem install libxml-ruby

In the news

Well, it seems there are no news about cfis/libxml-ruby yet...
Category_20