0.0
No commit activity in last 3 years
No release in over 3 years
Medusa: a Ruby crawler framework. Medusa is a framework for the Ruby language to crawl and collect usefu...
0.08
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
Cobweb is a web crawler that can use Resque to cluster crawls, so it can crawl extremely large sites quickly and with much better performance than multi-threaded crawlers. It is also a standalone crawler with a sophisticated statistics interface for monitoring the progress of crawls.
0.08
Low commit activity in last 3 years
A long-lived project that still receives updates
CrawlerDetect is a library to detect bots/crawlers via the user agent
0.0
No release in over 3 years
Crawler Engine provides a function to crawl all news from a customized website
0.0
No release in over 3 years
Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.
0.0
No release in over 3 years
Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.
0.38
No release in over 3 years
Low commit activity in last 3 years
There's a lot of open issues
Generic Web crawler with a DSL that parses structured data from web pages
0.22
No release in over 3 years
Low commit activity in last 3 years
Voight-Kampff detects bots, spiders, crawlers and replicants
0.01
No commit activity in last 3 years
No release in over 3 years
is_crawler does exactly what you might think it does: determine if the supplied string matches a known crawler or bot.
0.0
No commit activity in last 3 years
No release in over 3 years
Web crawler with JSON-based DSL and EventMachine-powered page fetching
0.0
Repository is archived
No commit activity in last 3 years
No release in over 3 years
A periodic crawler that fetches the latest CVE additions, parses them, and filters them
0.0
No commit activity in last 3 years
No release in over 3 years
An easy way to let the AdSense crawler log in and see private or custom pages in your Rails application. Basically one custom login filter. The gem lets you slightly increase revenue from Google AdSense/AdWords. It makes it easy to enable crawling on private pages and so get better target...
0.0
No commit activity in last 3 years
No release in over 3 years
The Baidu Crawler is for crawling data according to your demand
0.0
No commit activity in last 3 years
No release in over 3 years
This gem is web crawler sample code, so I don't recommend that you use it.
0.0
No commit activity in last 3 years
No release in over 3 years
A web crawler that generates a sitemap in a Neo4j database. It also stores broken_links and the total number of pages on the site