Category

HTML parsing

This category does not have a description yet. You can add one on github!

30.81
A long-lived project that still receives updates
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors.
2014
2015
2016
2017
2018
2019
1.57
No release in over 3 years
a swift, liberal HTML parser with a fantastic library
2014
2015
2016
2017
2018
2019
1.04
No release in over a year
The Libxml-Ruby project provides Ruby language bindings for the GNOME Libxml2 XML toolkit. It is free software, released under the MIT License. Libxml-ruby's primary advantage over REXML is performance - if speed is your need, these are good libraries to consider, as demonstrated ...
2014
2015
2016
2017
2018
2019
0.83
A long-lived project that still receives updates
A fast XML parser and object serializer that uses only standard C lib. Optimized XML (Ox), as the name implies was written to provide speed optimized XML handling. It was designed to be an alternative to Nokogiri and other Ruby XML parsers for generic XML parsing and as an alternativ...
2014
2015
2016
2017
2018
2019
0.11
No release in over 3 years
Low commit activity in last 3 years
scrAPI is an HTML scraping toolkit for Ruby. It uses CSS selectors to write easy, maintainable scraping rules to select, extract and store data from HTML content.
2014
2015
2016
2017
2018
2019
0.04
No release in over 3 years
Low commit activity in last 3 years
Extract useful data from HTML and XML with ease!
2014
2015
2016
2017
2018
2019
0.01
No release in over 3 years
scRUBYt! is an easy to learn and use, yet powerful and effective web scraping framework. It's most interesting part is a Web-scraping DSL built on HPricot and WWW::Mechanize, which allows to navigate to the page of interest, then extract and query data records with a few lines of code. It is hard...
2014
2015
2016
2017
2018
2019
0.01
No commit activity in last 3 years
No release in over 3 years
A new short XML Parsing Algorithm implemented directly in less-than-500 lines. An easy-to-use XML Parser without any Native Dependencies. Its under continuous improvement as being used/tested under my other xml-parsing required projects. [What, Why, HowTo]: http://justfewtuts.blogspot.in/2012/03/...
2014
2015
2016
2017
2018
2019