0.01
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
SemanticCrawler is a ruby library that encapsulates data gathering from different sources. Currently microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGe...
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Automatically protects your staging app from web crawlers and casual visitors.
2019
2020
2021
2022
2023
2024
0.01
No commit activity in last 3 years
No release in over 3 years
BFS webcrawler that implements Observable
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Ruby Cheerio is a jQuery style HTML parser, which take selectors as input. This is a Ruby version NodeJS package named 'Cheerio', which is extensively used by crawlers. Please visit the home page for usage details.
2019
2020
2021
2022
2023
2024
0.02
Repository is archived
No commit activity in last 3 years
No release in over 3 years
Cosmicrawler is crawler library for Ruby. It provides scalable asynchronous crawling by (http|file|etc) using EventMachine.
2019
2020
2021
2022
2023
2024
0.0
Repository is archived
No release in over a year
Retrieves a list of URLs to seed the crawler by publishing them to a RabbitMQ exchange.
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Bulbasaur is a helper for crawler operations used in Pread.ly
2019
2020
2021
2022
2023
2024
0.01
No commit activity in last 3 years
No release in over 3 years
JavaScript enabled web crawler kit
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
rails_angular_seo allows you to make your single-page apps (Backbone, Angular, etc) built on Rails SEO-friendly. It works by injecting a small rack middleware that will render pages as plain html, when the requester is one of the most common crawlers/bots out there (Google, Yahoo Baidu and Bing)
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Low commit activity in last 3 years
webget gem - a web (go get) crawler incl. web cache
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
A ruby social media stat crawler
2019
2020
2021
2022
2023
2024
0.01
No release in over 3 years
Low commit activity in last 3 years
your friendly neighborhood web crawler
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
Simple site crawler using Capybara
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
The Baidu Crawler is to crawl data with your demmand
2019
2020
2021
2022
2023
2024
0.0
No commit activity in last 3 years
No release in over 3 years
Botch is a DSL for quickly creating web crawlers. Inspired by Sinatra.
2019
2020
2021
2022
2023
2024