0.01
No commit activity in last 3 years
No release in over 3 years
render_static allows you to make your single-page apps (Backbone, Angular, etc) built on Rails SEO-friendly. It works by injecting a small rack middleware that will render pages as plain html, when the requester is one of the most common crawlers/bots out there (Google, Yahoo Baidu and Bing)
2020
2021
2022
2023
2024
2025
0.01
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Cosmicrawler is crawler library for Ruby. It provides scalable asynchronous crawling by (http|file|etc) using EventMachine.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
SemanticCrawler is a ruby library that encapsulates data gathering from different sources. Currently microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGe...
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
A simple directory crawler DSL.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
Driller is a command line Ruby based web crawler based on Anemone. Driller can crawl website and reports error pages and slow pages and generates HTML reports.
2020
2021
2022
2023
2024
2025
0.01
Repository is archived
No commit activity in last 3 years
No release in over 3 years
This is a crawler framework.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
JavaScript enabled web crawler kit
2020
2021
2022
2023
2024
2025
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Dead simple yet powerful Ruby crawler for easy parallel crawling with support for an anonymity.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
check how many links are available inside the website
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
livedoor-feeddiscover performs feed autodiscovery using the livedoor Feed Discover API. livedoor Feed Discover API find a Atom/RSS feed(s) from the livedoor Reader crawler database. So, livedoor-feeddiscover do not access the target URL.
2020
2021
2022
2023
2024
2025
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Simple Twitter crawler
2020
2021
2022
2023
2024
2025
0.0
Repository is gone
No release in over 3 years
Ruby web crawler to access omelete informations
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Multithreaded web crawler with transparent DSL and requests caching.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Simple async HTTP crawler based on em-synchrony
2020
2021
2022
2023
2024
2025
0.0
No commit activity in last 3 years
No release in over 3 years
Bulbasaur is a helper for crawler operations used in Pread.ly
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
The SimpleCrawler module is a library for crawling web sites. The crawler provides comprehensive data from the page crawled which can be used for page analysis, indexing, accessibility checks etc. Restrictions can be specified to limit crawling of binary files.
2020
2021
2022
2023
2024
2025
0.0
Repository is gone
No release in over 3 years
Easy to use DSL that helps scraping data from websites. Thanks to it, writing web crawlers would be very fast and intuitive. Traversing through html nodes and fetching all of the HTML attributes, would be possible. Just like in jQuery - you will find methods like parent, children, first, find, si...
2020
2021
2022
2023
2024
2025
0.0
No commit activity in last 3 years
No release in over 3 years
SuperCrawler allows you to easily crawl full web sites or web pages (extracting internal links and assets) in few seconds.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
The application crawls a URL and extracts links, tags and sequences. These features are written to an output file
2020
2021
2022
2023
2024
2025