Project

rcrawler

0.0
No commit activity in last 3 years
No release in over 3 years
A Capybara wrapper for writing crawlers
 Project Readme

RCrawler


A Capybara wrapper for writing crawlers.

Dependencies

Installation

Add this line to your application's Gemfile:

gem 'rcrawler'

And then execute:

$ bundle

Or install it yourself as:

$ gem install rcrawler

Usage

Crawl

require "rcrawler"

RCrawler.crawl do
  # Some capybara dsl
  visit("https://example.com/login")
  page.fill_in("name", with: "user")
  page.fill_in("password", with: "secret")
  page.click_button("send")
  page.save_screenshot("/tmp/example.png")

  # Screenshot shortcut
  # Equivalent to visit(url) followed by page.save_screenshot(path)
  screenshot("http://example.com", "/tmp/example.png")

  # Nokogiri
  # doc returns Nokogiri::HTML(page.html) for the current page
  visit("http://example.com")
  doc.css("a.some_link").each {|a| puts a.attr("href")}
end
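
To make the block-style DSL and the screenshot shortcut more concrete, here is a minimal sketch of how such a block could be evaluated. It is not the gem's implementation: `FakePage`, `CrawlContext`, and `crawl_sketch` are hypothetical names, and the fake page only records calls instead of driving a browser through Capybara.

```ruby
# Hypothetical stand-in for a Capybara session; it only records calls.
class FakePage
  attr_reader :calls

  def initialize
    @calls = []
  end

  def visit(url)
    @calls << [:visit, url]
  end

  def save_screenshot(path)
    @calls << [:save_screenshot, path]
  end
end

# Hypothetical DSL context: the crawl block is evaluated against this object,
# so bare calls such as visit(...) and screenshot(...) resolve to its methods.
class CrawlContext
  attr_reader :page

  def initialize(page)
    @page = page
  end

  def visit(url)
    page.visit(url)
  end

  # The shortcut described above: visit the URL, then save a screenshot.
  def screenshot(url, path)
    visit(url)
    page.save_screenshot(path)
  end
end

# Evaluates the block in a fresh context and returns the page for inspection.
def crawl_sketch(page = FakePage.new, &block)
  CrawlContext.new(page).instance_eval(&block)
  page
end

page = crawl_sketch do
  screenshot("http://example.com", "/tmp/example.png")
end
p page.calls
```

The recorded calls show the shortcut expanding into a visit followed by a screenshot, which is the behavior the usage comment describes.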

Configuration

RCrawler.configure do |c|
  c.threads = 10 # => default is 8
  c.timeout = 20 # => default is 10
  c.timeout_proc = :ignore # => default is :raise
end
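
The README does not spell out what `:ignore` and `:raise` do when a timeout fires. The sketch below shows one plausible reading, using only Ruby's standard `Timeout` module. `SketchCrawler` and `with_timeout` are hypothetical names, and the assumption that `:ignore` swallows the error and returns `nil` is mine, not confirmed by the source.

```ruby
require "timeout"

# Hypothetical stand-in for the gem's configuration. The option names and
# defaults mirror the README; the behavior is an assumption.
module SketchCrawler
  Config = Struct.new(:threads, :timeout, :timeout_proc)

  def self.config
    @config ||= Config.new(8, 10, :raise) # defaults quoted in the README
  end

  def self.configure
    yield config
    config
  end

  # Runs the block under the configured timeout (in seconds).
  # :raise re-raises Timeout::Error; any other value swallows it and returns nil.
  def self.with_timeout
    Timeout.timeout(config.timeout) { yield }
  rescue Timeout::Error
    raise if config.timeout_proc == :raise
    nil
  end
end

SketchCrawler.configure do |c|
  c.threads = 10
  c.timeout = 0.05 # kept tiny so the example finishes quickly
  c.timeout_proc = :ignore
end

slow = SketchCrawler.with_timeout { sleep 1; :done } # times out, error ignored
fast = SketchCrawler.with_timeout { :done }          # finishes in time
p [slow, fast]
```

Under this reading, `:ignore` suits long batch crawls where one slow page should not abort the run, while the default `:raise` surfaces the problem immediately.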

Async processing

RCrawler.async do
  crawl do
    # do something
  end

  crawl do
    # do something
  end

  crawl do
    # do something
  end
end
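
How the `threads` setting interacts with `async` is not documented here. A common approach, sketched below with the standard library only, is to collect each `crawl` block as a job and run the jobs on a bounded pool of worker threads. `AsyncSketch` is a hypothetical class, not the gem's code, and the jobs are plain blocks rather than real crawls.

```ruby
# Hypothetical sketch: collect crawl blocks, then run them on at most
# `threads` worker threads.
class AsyncSketch
  def initialize(threads: 8)
    @threads = threads
    @jobs = []
  end

  # Registers a job instead of running it immediately.
  def crawl(&block)
    @jobs << block
  end

  # Runs all registered jobs and returns their results in registration order.
  def run
    queue = Queue.new
    @jobs.each_with_index { |job, index| queue << [index, job] }
    queue.close # once drained, pop returns nil and the workers stop

    results = Array.new(@jobs.size)
    workers = Array.new([@threads, @jobs.size].min) do
      Thread.new do
        while (item = queue.pop)
          index, job = item
          results[index] = job.call
        end
      end
    end
    workers.each(&:join)
    results
  end

  # Evaluates the block against a new runner so bare `crawl` calls register jobs.
  def self.async(threads: 8, &block)
    runner = new(threads: threads)
    runner.instance_eval(&block)
    runner.run
  end
end

results = AsyncSketch.async(threads: 2) do
  crawl { sleep 0.05; :first }
  crawl { sleep 0.01; :second }
  crawl { :third }
end
p results
```

Storing each result at its job's index keeps the output order stable even though the jobs may finish in a different order.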

Command

% rcrawler help
Commands:
  rcrawler help [COMMAND]         # Describe available commands or one specific command
  rcrawler sc http://example.com  # Get screen shot

Contributing

  1. Fork it
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create new Pull Request