0.01
Repository is archived
No commit activity in last 3 years
No release in over 3 years
This is a crawler framework.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
Driller is a command line Ruby based web crawler based on Anemone. Driller can crawl website and reports error pages and slow pages and generates HTML reports.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
Gem for crawling data from external sources
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
Website crawler and fulltext indexer.
2020
2021
2022
2023
2024
2025
0.01
No release in over 3 years
Low commit activity in last 3 years
your friendly neighborhood web crawler
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
A simple directory crawler DSL.
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
JavaScript enabled web crawler kit
2020
2021
2022
2023
2024
2025
0.01
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
SemanticCrawler is a ruby library that encapsulates data gathering from different sources. Currently microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGe...
2020
2021
2022
2023
2024
2025
0.0
No commit activity in last 3 years
No release in over 3 years
ITunesCrawler provides an easy way to download the requested iTunes data through Apple's Search API.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-da gem provides a PostgreSQL-based content meta-data store and work priority queue.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-service provides a fuzzy simhash lookup index as a distributed service.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. This gem is an rjack-httpclient-3 based implementation of the iudex-http interfaces.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-protobuf gem contains the protocol buffer generated java classes for the iudex-brutefuzzy-service.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-rome gems is an adaption of rjack-rome for feed parsing in Iudex.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-core gem contains core facilities and notably, does not contain such facilities as database-backed state management.
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
check how many links are available inside the website
2020
2021
2022
2023
2024
2025
0.0
No release in over 3 years
livedoor-feeddiscover performs feed autodiscovery using the livedoor Feed Discover API. livedoor Feed Discover API find a Atom/RSS feed(s) from the livedoor Reader crawler database. So, livedoor-feeddiscover do not access the target URL.
2020
2021
2022
2023
2024
2025
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Simple Twitter crawler
2020
2021
2022
2023
2024
2025
0.0
Repository is gone
No release in over 3 years
Ruby web crawler to access omelete informations
2020
2021
2022
2023
2024
2025