0.0
No release in over 3 years
Low commit activity in last 3 years
webget gem - a web (go get) crawler incl. web cache
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Fake User-Agents of about %80 of real devices to use in headers of web crawlers. It keeps your script away from being nested by many UA strings.
2019
2020
2021
2022
2023
2024
0.0
Repository is gone
No release in over 3 years
s7o crawler optimized for programmer happiness and sustainable productivity.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. This gem is a Jetty HTTP Client based implementation of the iudex-http interfaces.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Crawl the senegalese web, looking for jobs using the excellent wombat gem
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Botch is a DSL for quickly creating web crawlers. Inspired by Sinatra.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Arachnidish is a web crawler that relies on Bloom Filters to efficiently store visited urls and Typhoeus to avoid the overhead of Mechanize when crawling every page on a domain.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
A simple ruby gem to recursively traverse all URLs on a Root URL. It returns all the URLs it encountered
2019
2020
2021
2022
2023
2024