0.0
No commit activity in last 3 years
No release in over 3 years
A client for the PageMunch web crawler API
0.0
No release in over 3 years
Pantopoda is a fast and effective web crawler that visits all links on a given domain.
0.0
No commit activity in last 3 years
No release in over 3 years
Bulbasaur is a helper for crawler operations used in Pread.ly
0.0
No commit activity in last 3 years
No release in over 3 years
Ruby Cheerio is a jQuery-style HTML parser that takes selectors as input. It is a Ruby version of the NodeJS package named 'Cheerio', which is extensively used by crawlers. Please visit the home page for usage details.
0.0
Repository is gone
No release in over 3 years
Samao is a web crawler written in Ruby.
0.0
No release in over 3 years
The SimpleCrawler module is a library for crawling web sites. The crawler provides comprehensive data from each crawled page, which can be used for page analysis, indexing, accessibility checks, etc. Restrictions can be specified to limit crawling of binary files.
0.0
Repository is gone
No release in over 3 years
An easy-to-use DSL that helps scrape data from websites. It makes writing web crawlers fast and intuitive. You can traverse HTML nodes and fetch all of their HTML attributes. Just like in jQuery, you will find methods like parent, children, first, find, si...
0.0
No release in over 3 years
Allows your Rails application to be spiderable by crawlers.
0.0
No commit activity in last 3 years
No release in over 3 years
SuperCrawler allows you to easily crawl full web sites or web pages (extracting internal links and assets) in a few seconds.
0.0
No release in over 3 years
The application crawls a URL and extracts links, tags, and sequences. These features are written to an output file.
0.0
No release in over 3 years
This gem offers: classes to subclass and create a manga site crawler; a downloader to use with these classes; some site-specific scripts.
0.0
No release in over 3 years
Iudex is a general-purpose web crawler and feed processor in Ruby/Java. The iudex-worker gem provides a worker daemon for feed/page processing.
0.0
Repository is gone
No release in over 3 years
s7o crawler optimized for programmer happiness and sustainable productivity.
0.0
The project is in a healthy, maintained state
Hushes worthless Rails exceptions & logs, such as those caused by bots and crawlers.
0.0
No commit activity in last 3 years
No release in over 3 years
A simple Ruby gem to recursively traverse all URLs under a root URL. It returns all the URLs it encountered.
0.0
No commit activity in last 3 years
No release in over 3 years
Botch is a DSL for quickly creating web crawlers. Inspired by Sinatra.