Nokogiri

Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.

Rubygem nokogiri

Total Downloads
21659050
Releases
69
Current Version
1.6.2.rc2
Released
2014-04-10 00:00:00 UTC
First Release
2008-10-30 07:00:00 UTC

Github sparklemotion/nokogiri

Watchers
2546
Forks
341
Development activity
Less active
Last commit
2014-02-11 00:22:40 UTC
Contributors
96
Issues
Wiki pages

Hpricot

a swift, liberal HTML parser with a fantastic library

Rubygem hpricot

Total Downloads
3252315
Releases
13
Current Version
0.8.6
Released
2012-01-17 00:00:00 UTC
First Release
2006-08-11 09:00:00 UTC
Depends on following gems
Depending Gems
458

Github hpricot/hpricot

Watchers
478
Forks
99
Development activity
Inactive
Last commit
2013-12-13 16:43:01 UTC
Contributors
18
Wiki pages

Libxml-ruby

The Libxml-Ruby project provides Ruby language bindings for the GNOME Libxml2 XML toolkit. It is free software, released under the MIT License. Libxml-ruby's primary advantage over REXML is performance - if speed is your need, these are good libraries to consider, as demonstrated by the informal benchmark below.

Rubygem libxml-ruby

Total Downloads
1919486
Releases
52
Current Version
2.7.0
Released
2013-08-03 00:00:00 UTC
First Release
2006-02-23 00:00:00 UTC

Github cfis/libxml-ruby

Watchers
17
Forks
12
Development activity
Inactive
Last commit
2013-01-21 09:35:20 UTC
Contributors
19
Issues

Scrubyt

scRUBYt! is an easy to learn and use, yet powerful and effective web scraping framework. It's most interesting part is a Web-scraping DSL built on HPricot and WWW::Mechanize, which allows to navigate to the page of interest, then extract and query data records with a few lines of code. It is hard to describe scRUBYt! in a few sentences - you have to see it for yourself!

Rubygem scrubyt

Total Downloads
20307
Releases
11
Current Version
0.4.06
Released
2008-11-23 23:00:00 UTC
First Release
2007-01-14 23:00:00 UTC
Depends on following gems
Depending Gems
0

Github scrubber/scrubyt

Watchers
296
Forks
62
Development activity
Inactive
Last commit
2010-01-19 23:11:03 UTC
Top contributors
Contributors
3
Issues
Wiki pages

Scrapi

scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.

Rubygem scrapi

Total Downloads
18750
Releases
3
Current Version
2.0.0
Released
2010-11-10 08:00:00 UTC
First Release
2006-08-15 07:00:00 UTC
Depends on following gems
Depending Gems
6

Github assaf/scrapi

Watchers
147
Forks
25
Development activity
Inactive
Last commit
2010-11-10 19:11:27 UTC
Top contributors
Contributors
2
Issues

nikkou

Extract useful data from HTML and XML with ease!

Rubygem nikkou

Total Downloads
1158
Releases
2
Current Version
0.0.3
Released
2013-06-02 00:00:00 UTC
First Release
2013-04-23 00:00:00 UTC
Depends on following gems
Depending Gems
1

Github tombenner/nikkou

Watchers
15
Forks
0
Development activity
Less active
Last commit
2013-07-03 01:05:14 UTC
First commit
Top contributors
Contributors
1
Issues

xml-motor

A new short XML Parsing Algorithm implemented directly in less-than-500 lines. An easy-to-use XML Parser without any Native Dependencies. Its under continuous improvement as being used/tested under my other xml-parsing required projects. [What, Why, HowTo]: http://justfewtuts.blogspot.in/2012/03/xml-motor-what-it-is-how-why-should-you.html

Rubygem xml-motor

Total Downloads
10098
Releases
14
Current Version
0.1.6
Released
2012-08-20 00:00:00 UTC
First Release
2011-11-06 00:00:00 UTC
Depends on following gems
Depending Gems
7

Github abhishekkr/rubygem_xml_motor

Watchers
3
Forks
0
Development activity
Inactive
Last commit
2013-03-22 10:59:30 UTC
Top contributors
Contributors
1
Issues
Wiki pages
×

In order to continue, you must be signed in using your Github account.

If you're signing in using this account for the first time Github will ask for your permission to give access to your public user data to the Ruby Toolbox.

Although the Github Authorization page does not mention it, the request includes read-only access to your verified email address (user:email OAuth scope). This is neccessary so there's a way to notify you about comments, information about your accepted project edits and the like. You can review your notification settings on your account page once you're signed in.