0.0
No commit activity in last 3 years
No release in over 3 years
Website crawler harvesting e-mails. Uses Sidekiq and Typhoeus.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Simple web crawler to crawl a domain and generate sitemap
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Empower World Travel Information Technology
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Arachnidish is a web crawler that relies on Bloom Filters to efficiently store visited urls and Typhoeus to avoid the overhead of Mechanize when crawling every page on a domain.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Discovery Mission is an easy-to-use website crawler. Use it for generating sitemaps.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
The Baidu Crawler is to crawl data with your demmand
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
株価情報を取得してあれこれするライブラリ
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Fake User-Agents of about %80 of real devices to use in headers of web crawlers. It keeps your script away from being nested by many UA strings.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Another Web crawler running with Amazon SQS and ElastiCache(Redis)
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
A ruby based zhihu content crawler.
2019
2020
2021
2022
2023
2024
0.02
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Cosmicrawler is crawler library for Ruby. It provides scalable asynchronous crawling by (http|file|etc) using EventMachine.
2019
2020
2021
2022
2023
2024
0.0
Repository is gone
No release in over 3 years
s7o crawler optimized for programmer happiness and sustainable productivity.
2019
2020
2021
2022
2023
2024
0.0
Repository is gone
No release in over 3 years
Generic Web crawler with a DSL that parses event-related data from web pages
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Crawler for http://legendas.tv to see the most dowloaded subtitles
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
check how many links are available inside the website
2019
2020
2021
2022
2023
2024
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
A web crawler using Ruby and Redis.
2019
2020
2021
2022
2023
2024