Bad news. The server hosting The Ruby Toolbox went bust on the evening of June 7th. While I do have backups, the
original source code is in a very outdated state so I currently don't feel it makes sense to try and get it running again.
For the time being, here is a very stripped down version of the Ruby Toolbox's contents.
Pismo extracts and retrieves content-related metadata from HTML pages - you can use the resulting data in an organized way, such as a summary/first paragraph, body text, keywords, RSS feed URL, favicon, etc.
Cobweb is a web crawler that can use resque to cluster crawls to quickly crawl extremely large sites which is much more performant than multi-threaded crawlers. It is also a standalone crawler that has a sophisticated statistics monitoring interface to monitor the progress of the crawls.