Categories

No matching categories were found
0.0
No commit activity in last 3 years
No release in over 3 years
== Medusa: a ruby crawler framework {rdoc-image:https://badge.fury.io/rb/medusa-crawler.svg}[https://rubygems.org/gems/medusa-crawler] rdoc-image:https://github.com/brutuscat/medusa-crawler/workflows/Ruby/badge.svg?event=push Medusa is a framework for the ruby language to crawl and collect usefu...
2021
2022
2023
2024
2025
2026
0.09
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
Cobweb is a web crawler that can use resque to cluster crawls to quickly crawl extremely large sites which is much more performant than multi-threaded crawlers. It is also a standalone crawler that has a sophisticated statistics monitoring interface to monitor the progress of the crawls.
2021
2022
2023
2024
2025
2026
0.08
Low commit activity in last 3 years
A long-lived project that still receives updates
CrawlerDetect is a library to detect bots/crawlers via the user agent
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Crawler Engine provides function of crawl all news from the customized website
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.
2021
2022
2023
2024
2025
2026
0.0
No release in over 3 years
Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.
2021
2022
2023
2024
2025
2026
0.39
No release in over 3 years
Low commit activity in last 3 years
There's a lot of open issues
Generic Web crawler with a DSL that parses structured data from web pages
2021
2022
2023
2024
2025
2026
0.22
Low commit activity in last 3 years
No release in over a year
Voight-Kampff detects bots, spiders, crawlers and replicants
2021
2022
2023
2024
2025
2026
0.01
No commit activity in last 3 years
No release in over 3 years
is_crawler does exactly what you might think it does: determine if the supplied string matches a known crawler or bot.
2021
2022
2023
2024
2025
2026