Nokogiri

Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.

Rubygem nokogiri

Total Downloads
36614578
Releases
82
Current Version
1.6.6.2
Released
2015-01-23 00:00:00 UTC
First Release
2008-10-30 07:00:00 UTC

Github sparklemotion/nokogiri

Watchers
3175
Forks
441
Development activity
Active
Last commit
2015-03-23 12:55:54 UTC

Hpricot

a swift, liberal HTML parser with a fantastic library

Rubygem hpricot

Total Downloads
3917158
Releases
13
Current Version
0.8.6
Released
2012-01-17 00:00:00 UTC
First Release
2006-08-11 09:00:00 UTC
Depends on following gems
Depending Gems
677

Github hpricot/hpricot

Watchers
485
Forks
105
Development activity
Inactive
Last commit
2013-12-13 16:43:01 UTC

Libxml-ruby

The Libxml-Ruby project provides Ruby language bindings for the GNOME Libxml2 XML toolkit. It is free software, released under the MIT License. Libxml-ruby's primary advantage over REXML is performance - if speed is your need, these are good libraries to consider, as demonstrated by the informal benchmark below.

Rubygem libxml-ruby

Total Downloads
2906040
Releases
53
Current Version
2.8.0
Released
2015-01-09 00:00:00 UTC
First Release
2006-02-23 00:00:00 UTC

Github cfis/libxml-ruby

Watchers
16
Forks
12
Development activity
Inactive
Last commit
2013-01-21 09:35:20 UTC
Contributors
19
Issues

Scrubyt

scRUBYt! is an easy to learn and use, yet powerful and effective web scraping framework. It's most interesting part is a Web-scraping DSL built on HPricot and WWW::Mechanize, which allows to navigate to the page of interest, then extract and query data records with a few lines of code. It is hard to describe scRUBYt! in a few sentences - you have to see it for yourself!

Rubygem scrubyt

Total Downloads
23411
Releases
11
Current Version
0.4.06
Released
2008-11-23 23:00:00 UTC
First Release
2007-01-14 23:00:00 UTC
Depends on following gems
Depending Gems
0

Github scrubber/scrubyt

Watchers
299
Forks
63
Development activity
Inactive
Last commit
2010-01-19 23:11:03 UTC
Top contributors
Contributors
3
Issues

Scrapi

scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.

Rubygem scrapi

Total Downloads
30568
Releases
3
Current Version
2.0.0
Released
2010-11-10 08:00:00 UTC
First Release
2006-08-15 07:00:00 UTC
Depends on following gems
Depending Gems
6

Github assaf/scrapi

Watchers
150
Forks
26
Development activity
Inactive
Last commit
2010-11-10 19:11:27 UTC
Top contributors
Contributors
2
Issues

nikkou

Extract useful data from HTML and XML with ease!

Rubygem nikkou

Total Downloads
2689
Releases
3
Current Version
0.0.4
Released
2014-07-10 00:00:00 UTC
First Release
2013-04-23 00:00:00 UTC
Depends on following gems
Depending Gems
2

Github tombenner/nikkou

Watchers
24
Forks
4
Development activity
Inactive
Last commit
2014-07-10 04:56:05 UTC
First commit
Top contributors
Contributors
1
Issues

xml-motor

A new short XML Parsing Algorithm implemented directly in less-than-500 lines. An easy-to-use XML Parser without any Native Dependencies. Its under continuous improvement as being used/tested under my other xml-parsing required projects. [What, Why, HowTo]: http://justfewtuts.blogspot.in/2012/03/xml-motor-what-it-is-how-why-should-you.html

Rubygem xml-motor

Total Downloads
13659
Releases
14
Current Version
0.1.6
Released
2012-08-20 00:00:00 UTC
First Release
2011-11-06 00:00:00 UTC
Depends on following gems
Depending Gems
7

Github abhishekkr/rubygem_xml_motor

Watchers
3
Forks
1
Development activity
Inactive
Last commit
2013-03-22 10:59:30 UTC
Top contributors
Contributors
1
Issues
×

In order to continue, you must be signed in using your Github account.

If you're signing in using this account for the first time Github will ask for your permission to give access to your public user data to the Ruby Toolbox.

Although the Github Authorization page does not mention it, the request includes read-only access to your verified email address (user:email OAuth scope). This is neccessary so there's a way to notify you about comments, information about your accepted project edits and the like. You can review your notification settings on your account page once you're signed in.