0.0
No release in over 3 years
Multithreaded web crawler with a transparent DSL and request caching.
0.0
No commit activity in last 3 years
No release in over 3 years
A client for the PageMunch web crawler API
0.0
No release in over 3 years
Pantopoda is a fast and effective web crawler that visits all links on a given domain.
0.0
No release in over 3 years
Rails Analyzer Tools contains Bench, a simple web page benchmarker, Crawler, a tool for beating up on web sites, RailsStat, a tool for monitoring Rails web sites, and IOTail, a tail(1) method for Ruby IOs.
0.0
Repository is gone
No release in over 3 years
Samao is a web crawler written in Ruby.
0.0
No release in over 3 years
The SimpleCrawler module is a library for crawling web sites. The crawler provides comprehensive data from the page crawled which can be used for page analysis, indexing, accessibility checks etc. Restrictions can be specified to limit crawling of binary files.
0.0
Repository is gone
No release in over 3 years
An easy-to-use DSL for scraping data from websites that makes writing web crawlers fast and intuitive. You can traverse HTML nodes and fetch all of their attributes. Just like in jQuery - you will find methods like parent, children, first, find, si...
0.0
No release in over 3 years
This gem offers: classes to subclass to create a manga site crawler; a downloader to use with these classes; some site-specific scripts.
0.0
No release in over a year
A crawler extension package built on http, written by a junior developer. Please do not download it casually; it has many pitfalls.
0.0
No release in over 3 years
Iudex is a general-purpose web crawler and feed processor in Ruby/Java. The iudex-worker gem provides a worker daemon for feed/page processing.
0.0
No commit activity in last 3 years
No release in over 3 years
A crawler for Taiwan's VSCinema that fetches the latest film list.
0.0
Low commit activity in last 3 years
No release in over a year
webget gem - a web (go get) crawler, including a web cache
0.0
Repository is gone
No release in over 3 years
s7o crawler optimized for programmer happiness and sustainable productivity.
0.0
No commit activity in last 3 years
No release in over 3 years
This gem fetches the latest CircleCI artifact file you specify. For example, you can get the result JSON of simplecov.gem.
0.0
No commit activity in last 3 years
No release in over 3 years
There are a lot of open issues
SemanticCrawler is a Ruby library that encapsulates data gathering from different sources. Currently it supports microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGe...
0.0
No release in over a year
With just a few lines of code, developers can effortlessly integrate this gem into their projects, enabling seamless retrieval of page titles from HTML documents. Whether you're building web scrapers, crawlers, or any application that requires fetching webpage titles, WebTitle streamlines the pro...
0.0
No commit activity in last 3 years
No release in over 3 years
A simple Ruby gem that recursively traverses all URLs under a root URL and returns every URL it encounters.