0.0
A web crawler using Ruby and Redis.
2020
2021
2022
2023
2024
2025
0.0
Website crawler harvesting e-mails. Uses Sidekiq and Typhoeus.
2020
2021
2022
2023
2024
2025
0.0
Another Web crawler running with Amazon SQS and ElastiCache(Redis)
2020
2021
2022
2023
2024
2025
0.0
Rack middleware that executes javascript before serving pages to crawlers.
2020
2021
2022
2023
2024
2025
0.0
Checks a user agent for a web crawler
2020
2021
2022
2023
2024
2025
0.0
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-barc gem contains support for the BARC Basic ARChive format.
2020
2021
2022
2023
2024
2025
0.0
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-simhash gem contains support for generation and searching over simhash fingerprints
2020
2021
2022
2023
2024
2025
0.0
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-html gem contains filters for HTML parsing, filtering, exracting text and links.
2020
2021
2022
2023
2024
2025
0.0
Simple Web Crawler
2020
2021
2022
2023
2024
2025
0.0
Simple Twitter crawler
2020
2021
2022
2023
2024
2025
0.0
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-core gem contains core facilities and notably, does not contain such facilities as database-backed state management.
2020
2021
2022
2023
2024
2025
0.0
Dead simple yet powerful Ruby crawler for easy parallel crawling with support for an anonymity.
2020
2021
2022
2023
2024
2025
0.0
check how many links are available inside the website
2020
2021
2022
2023
2024
2025
0.0
livedoor-feeddiscover performs feed autodiscovery using the livedoor Feed Discover API. livedoor Feed Discover API find a Atom/RSS feed(s) from the livedoor Reader crawler database. So, livedoor-feeddiscover do not access the target URL.
2020
2021
2022
2023
2024
2025
0.0
image download from instagram
2020
2021
2022
2023
2024
2025
0.0
A set of classes for dealing with options. It includes a crawler for Yahoo!Finance.
2020
2021
2022
2023
2024
2025
0.0
Multithreaded web crawler with transparent DSL and requests caching.
2020
2021
2022
2023
2024
2025
0.0
A client for the PageMunch web crawler API
2020
2021
2022
2023
2024
2025
0.0
Pantopoda is a web crawler that visits all links on a given domain that's fast and effective.
2020
2021
2022
2023
2024
2025
0.0
Bulbasaur is a helper for crawler operations used in Pread.ly
2020
2021
2022
2023
2024
2025