0.01
render_static allows you to make your single-page apps (Backbone, Angular, etc) built on Rails SEO-friendly. It works by injecting a small rack middleware that will render pages as plain html, when the requester is one of the most common crawlers/bots out there (Google, Yahoo Baidu and Bing)
2020
2021
2022
2023
2024
2025
0.01
Cosmicrawler is crawler library for Ruby. It provides scalable asynchronous crawling by (http|file|etc) using EventMachine.
2020
2021
2022
2023
2024
2025
0.01
SemanticCrawler is a ruby library that encapsulates data gathering from different sources. Currently microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGe...
2020
2021
2022
2023
2024
2025
0.01
A simple directory crawler DSL.
2020
2021
2022
2023
2024
2025
0.01
Driller is a command line Ruby based web crawler based on Anemone. Driller can crawl website and reports error pages and slow pages and generates HTML reports.
2020
2021
2022
2023
2024
2025
0.01
This is a crawler framework.
2020
2021
2022
2023
2024
2025
0.01
JavaScript enabled web crawler kit
2020
2021
2022
2023
2024
2025
0.0
Dead simple yet powerful Ruby crawler for easy parallel crawling with support for an anonymity.
2020
2021
2022
2023
2024
2025
0.0
check how many links are available inside the website
2020
2021
2022
2023
2024
2025
0.0
livedoor-feeddiscover performs feed autodiscovery using the livedoor Feed Discover API. livedoor Feed Discover API find a Atom/RSS feed(s) from the livedoor Reader crawler database. So, livedoor-feeddiscover do not access the target URL.
2020
2021
2022
2023
2024
2025
0.0
Simple Twitter crawler
2020
2021
2022
2023
2024
2025
0.0
image download from instagram
2020
2021
2022
2023
2024
2025
0.0
Ruby web crawler to access omelete informations
2020
2021
2022
2023
2024
2025
0.0
Multithreaded web crawler with transparent DSL and requests caching.
2020
2021
2022
2023
2024
2025
0.0
Simple async HTTP crawler based on em-synchrony
2020
2021
2022
2023
2024
2025
0.0
Bulbasaur is a helper for crawler operations used in Pread.ly
2020
2021
2022
2023
2024
2025
0.0
The SimpleCrawler module is a library for crawling web
sites. The crawler provides comprehensive data from the page crawled which
can be used for page analysis, indexing, accessibility checks etc.
Restrictions can be specified to limit crawling of binary files.
2020
2021
2022
2023
2024
2025
0.0
Easy to use DSL that helps scraping data from websites. Thanks to it, writing web crawlers would be very fast and intuitive. Traversing through html nodes and fetching all of the HTML attributes, would be possible. Just like in jQuery - you will find methods like parent, children, first, find, si...
2020
2021
2022
2023
2024
2025
0.0
SuperCrawler allows you to easily crawl full web sites or web pages (extracting internal links and assets) in few seconds.
2020
2021
2022
2023
2024
2025
0.0
The application crawls a URL and extracts links, tags and sequences. These features are written to an output file
2020
2021
2022
2023
2024
2025