Category

HTML parsing

This category does not have a description yet. You can add one on github!

nokogiri

30.25
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors.
 Popularity
Downloads
197,813,892
Stars
5,003
Forks
676
Watchers
147
 Releases
Current version
1.8.5
Total releases
377
First release
Latest release
 Activity
Issue Closure Rate
88%
Pull Request Acceptance Rate
53%
Average date of last 50 commits
within last 3 months
Reverse Dependencies
6,553

hpricot

1.95
a swift, liberal HTML parser with a fantastic library
 Popularity
Downloads
6,774,785
 Releases
Current version
0.8.6
Total releases
33
First release
Latest release
 Activity
Reverse Dependencies
677

libxml-ruby

1.15
The Libxml-Ruby project provides Ruby language bindings for the GNOME Libxml2 XML toolkit. It is free software, released under the MIT License. Libxml-ruby's primary advantage over REXML is performance - if speed is your need, these are good libraries to consider, as demonstrated by the informal benchmark below.
 Popularity
Downloads
7,469,416
Stars
104
Forks
50
Watchers
9
 Releases
Current version
3.1.0
Total releases
100
First release
Latest release
 Activity
Issue Closure Rate
97%
Pull Request Acceptance Rate
89%
Average date of last 50 commits
within last year
Reverse Dependencies
265

scrapi

0.07
scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.
 Popularity
Downloads
44,972
Stars
153
Forks
25
Watchers
8
 Releases
Current version
2.0.0
Total releases
3
First release
Latest release
 Activity
Issue Closure Rate
0%
Pull Request Acceptance Rate
50%
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
6

nikkou

0.02
Extract useful data from HTML and XML with ease!
 Popularity
Downloads
15,682
Stars
51
Forks
8
Watchers
2
 Releases
Current version
0.0.5
Total releases
4
First release
Latest release
 Activity
Issue Closure Rate
66%
Pull Request Acceptance Rate
50%
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
2

scrubyt

0.01
scRUBYt! is an easy to learn and use, yet powerful and effective web scraping framework. It's most interesting part is a Web-scraping DSL built on HPricot and WWW::Mechanize, which allows to navigate to the page of interest, then extract and query data records with a few lines of code. It is hard to describe scRUBYt! in a few sentences - you have to see it for yourself!
 Popularity
Downloads
32,282
 Releases
Current version
0.4.06
Total releases
11
First release
Latest release
 Activity
Reverse Dependencies
0

xml-motor

0.01
A new short XML Parsing Algorithm implemented directly in less-than-500 lines. An easy-to-use XML Parser without any Native Dependencies. Its under continuous improvement as being used/tested under my other xml-parsing required projects. [What, Why, HowTo]: http://justfewtuts.blogspot.in/2012/03/xml-motor-what-it-is-how-why-should-you.html
 Popularity
Downloads
23,956
Stars
3
Forks
2
Watchers
1
 Releases
Current version
0.1.6
Total releases
14
First release
Latest release
 Activity
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
7