0.0
Repository is gone
No release in over 3 years
A crawler for a single domain web application
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Another Web crawler running with Amazon SQS and ElastiCache(Redis)
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-barc gem contains support for the BARC Basic ARChive format.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-simhash gem contains support for generation and searching over simhash fingerprints
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-html gem contains filters for HTML parsing, filtering, exracting text and links.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-char-detector gem provides charset detection support.
2021
2022
2023
2024
2025
2026
0.0
No commit activity in last 3 years
No release in over 3 years
ITunesCrawler provides an easy way to download the requested iTunes data through Apple's Search API.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-da gem provides a PostgreSQL-based content meta-data store and work priority queue.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-service provides a fuzzy simhash lookup index as a distributed service.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. This gem is an rjack-httpclient-3 based implementation of the iudex-http interfaces.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-protobuf gem contains the protocol buffer generated java classes for the iudex-brutefuzzy-service.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-rome gems is an adaption of rjack-rome for feed parsing in Iudex.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-core gem contains core facilities and notably, does not contain such facilities as database-backed state management.
2021
2022
2023
2024
2025
2026
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Dead simple yet powerful Ruby crawler for easy parallel crawling with support for an anonymity.
2021
2022
2023
2024
2025
2026
0.0
No commit activity in last 3 years
No release in over 3 years
DSL to build crawlers easily
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
check how many links are available inside the website
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
livedoor-feeddiscover performs feed autodiscovery using the livedoor Feed Discover API. livedoor Feed Discover API find a Atom/RSS feed(s) from the livedoor Reader crawler database. So, livedoor-feeddiscover do not access the target URL.
2021
2022
2023
2024
2025
2026
0.0
Repository is gone
No release in over 3 years
Ruby web crawler to access omelete informations
2021
2022
2023
2024
2025
2026