Category: HTML parsing - The Ruby Toolbox

96%

74%

2026-06-04

7,831

libxml-ruby

0.81

A long-lived project that still receives updates

libxml-ruby provides Ruby language bindings for libxml2. It is free software, released under the MIT License. It offers DOM, SAX, Reader, and Writer APIs, along with XPath support and validation via DTD, RelaxNG, and XML Schema.

Popularity

29,241,625

Releases

6.0.0

133

2009-07-25

2026-04-08

Activity

276

ox (ohler55/ox)

0.77

A long-lived project that still receives updates

A fast XML parser and object serializer that uses only the standard C library. Optimized XML (Ox), as the name implies, was written to provide speed-optimized XML handling. It was designed as an alternative to Nokogiri and other Ruby XML parsers for generic XML parsing, and as an alternative to Marshal for object serialization.
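
For context on the serialization claim, here is a minimal sketch of the plain-Ruby Marshal round trip that Ox is positioned against. It uses only the standard library and does not show Ox's own API; the `Point` struct and the sample data are hypothetical names introduced purely for illustration.

```ruby
# Baseline only: round-trip an object graph through Marshal (standard library).
# Ox's own API is deliberately not shown here.

# Hypothetical value object used only for this illustration.
Point = Struct.new(:x, :y)

original = { name: "sample", points: [Point.new(1, 2), Point.new(3, 4)] }

# Marshal.dump returns a binary String; Marshal.load rebuilds an equal object graph.
bytes    = Marshal.dump(original)
restored = Marshal.load(bytes)

puts restored == original       # compares the round-tripped data structurally
puts restored.equal?(original)  # checks identity: the result is a separate copy
```

Marshal output is a Ruby-specific binary format and should only be loaded from trusted sources, which is one reason a text-based serializer can be attractive.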

Popularity

39,313,071

911

Releases

2.14.28

163

2011-07-01

2026-06-28

Activity

98%

84%

2026-03-12

176

oga (yorickpeterse/oga)

0.63

Low commit activity in last 3 years

A long-lived project that still receives updates

Oga is an XML/HTML parser written in Ruby.

Popularity

28,378,501

1,171

Releases

3.5

2014-09-12

2026-06-10

Activity

99%

54%

2020-08-19

122

hpricot

0.43

No release in over 3 years

A swift, liberal HTML parser with a fantastic library.

Popularity

15,390,614

Releases

0.8.6

2009-07-25

2012-01-17

Activity

657

nikkou (tombenner/nikkou)

0.02

No commit activity in last 3 years

No release in over 3 years

Extract useful data from HTML and XML with ease!

Popularity

636,040

Releases

0.0.5

2013-04-23

2016-08-27

Activity

50%

50%

2014-05-07

rubyful_soup

0.0

No release in over 3 years

Rubyful Soup is a *ML parser that makes screen-scraping easy. It won't choke on bad markup, and it's easy to locate the part of a document you want.

Popularity

22,323

Releases

1.0.4

2009-07-25

2009-07-25

Activity

scrubyt

0.0

No release in over 3 years

scRUBYt! is an easy-to-learn, yet powerful and effective, web scraping framework. Its most interesting part is a web-scraping DSL built on Hpricot and WWW::Mechanize, which lets you navigate to the page of interest and then extract and query data records with a few lines of code. It is hard to describe scRUBYt! in a few sentences; you have to see it for yourself!

Popularity

53,008

Releases

0.4.06

2009-07-25

2009-07-25

Activity