Category

HTML parsing

This category does not have a description yet. You can add one on github!

nokogiri

32.79
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors.
 Popularity
Downloads
184,942,919
Stars
4,937
Forks
661
Watchers
148
 Releases
Current version
1.8.4
Total releases
369
First release
Latest release
 Activity
Issue Closure Rate
88%
Pull Request Acceptance Rate
51%
Average date of last 50 commits
within last year
Reverse Dependencies
6,487

hpricot

2.21
a swift, liberal HTML parser with a fantastic library
 Popularity
Downloads
6,622,415
 Releases
Current version
0.8.6
Total releases
33
First release
Latest release
 Activity
Reverse Dependencies
678

libxml-ruby

1.27
The Libxml-Ruby project provides Ruby language bindings for the GNOME Libxml2 XML toolkit. It is free software, released under the MIT License. Libxml-ruby's primary advantage over REXML is performance - if speed is your need, these are good libraries to consider, as demonstrated by the informal benchmark below.
 Popularity
Downloads
7,101,274
Stars
104
Forks
50
Watchers
9
 Releases
Current version
3.1.0
Total releases
100
First release
Latest release
 Activity
Issue Closure Rate
98%
Pull Request Acceptance Rate
87%
Average date of last 50 commits
within last year
Reverse Dependencies
265

scrapi

0.07
scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.
 Popularity
Downloads
44,576
Stars
153
Forks
25
Watchers
8
 Releases
Current version
2.0.0
Total releases
3
First release
Latest release
 Activity
Issue Closure Rate
0%
Pull Request Acceptance Rate
50%
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
6

nikkou

0.02
Extract useful data from HTML and XML with ease!
 Popularity
Downloads
14,293
Stars
51
Forks
8
Watchers
2
 Releases
Current version
0.0.5
Total releases
4
First release
Latest release
 Activity
Issue Closure Rate
66%
Pull Request Acceptance Rate
50%
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
2

scrubyt

0.01
scRUBYt! is an easy to learn and use, yet powerful and effective web scraping framework. It's most interesting part is a Web-scraping DSL built on HPricot and WWW::Mechanize, which allows to navigate to the page of interest, then extract and query data records with a few lines of code. It is hard to describe scRUBYt! in a few sentences - you have to see it for yourself!
 Popularity
Downloads
32,109
 Releases
Current version
0.4.06
Total releases
11
First release
Latest release
 Activity
Reverse Dependencies
0

xml-motor

0.01
A new short XML Parsing Algorithm implemented directly in less-than-500 lines. An easy-to-use XML Parser without any Native Dependencies. Its under continuous improvement as being used/tested under my other xml-parsing required projects. [What, Why, HowTo]: http://justfewtuts.blogspot.in/2012/03/xml-motor-what-it-is-how-why-should-you.html
 Popularity
Downloads
23,727
Stars
3
Forks
2
Watchers
1
 Releases
Current version
0.1.6
Total releases
14
First release
Latest release
 Activity
Average date of last 50 commits
more than 2 years ago
Reverse Dependencies
7