Search results for 'crawler' - The Ruby Toolbox

18,051

Releases

0.0.5

2012-01-06

2012-04-11

Activity

vrowser kimoto/vrowser Homepage Documentation Source Code Bug Tracker Wiki

vrowser

0.0

No commit activity in last 3 years

No release in over 3 years

Server browser and Crawler for many games (L4D2, TF2, CS:S, KZMOD, The Ship)

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

45,158

Releases

0.1.5

2012-02-05

2012-03-12

Activity

2012-02-09

pioneer Documentation Source Code

pioneer

0.0

No release in over 3 years

Simple async HTTP crawler based on em-synchrony

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

23,988

Releases

0.0.9

2012-02-21

2012-04-11

Activity

iudex-brutefuzzy-service Homepage Documentation

iudex-brutefuzzy-service

0.0

No release in over 3 years

Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-service provides a fuzzy simhash lookup index as a distributed service.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

11,889

Releases

1.3.0

2012-03-05

2013-10-30

Activity

iudex-brutefuzzy-protobuf Homepage Documentation

iudex-brutefuzzy-protobuf

0.0

No release in over 3 years

Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-brutefuzzy-protobuf gem contains the protocol buffer generated java classes for the iudex-brutefuzzy-service.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

11,628

Releases

1.3.0

2012-03-05

2013-10-30

Activity

semantic-crawler obale/semantic_crawler Homepage Documentation Source Code Bug Tracker Wiki

semantic-crawler

0.01

No commit activity in last 3 years

No release in over 3 years

There's a lot of open issues

SemanticCrawler is a ruby library that encapsulates data gathering from different sources. Currently microdata from websites, country information from Freebase, Factbook and FAO (Food and Agriculture Organization of the United Nations), crisis information from GDACS.org and geo data from LinkedGeoData are supported. Additional the GeoNames module allows to get Factbook and FAO country information from GPS coordinates.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

38,416

Releases

0.7.1

2012-03-25

2013-04-07

Activity

Issue Closure Rate

64%

2012-07-30

app-reviews Documentation

app-reviews

0.0

No release in over 3 years

Mobile App Review Crawler

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

9,608

Releases

0.0.2

2012-03-29

2012-03-29

Activity

murmuring_spider Homepage Documentation

murmuring_spider

0.0

Repository is gone

No release in over 3 years

MurmuringSpider is a concise Twitter crawler. When we write a data-mining / text-mining application based on twitter timeline, we have to collect and store tweets first. I am irritated with writing such crawler repeatedly, so I wrote this. What you have to do is only to add query and to run them periodically. Thanks to consistent Twitter API and twitter gem (http://twitter.rubyforge.org/), it is quite easy to track various types of timelines (such as user_timeline, home_timeline, search...)

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

4,189

Releases

0.0.2

2012-04-13

2012-04-13

Activity

caule rafaelss/caule Homepage Documentation Source Code Bug Tracker Wiki

caule

0.0

No commit activity in last 3 years

No release in over 3 years

DSL to build crawlers easily

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

4,671

Releases

0.0.1

2012-04-14

2012-04-14

Activity

2012-04-20

omelete Homepage Documentation

omelete

0.0

Repository is gone

No release in over 3 years

Ruby web crawler to access omelete informations

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

50,427

Releases

2.0.7

2012-05-06

2013-01-25

Activity

krawler Homepage Documentation

krawler

0.0

Repository is gone

No release in over 3 years

Simple little website crawler.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

60,626

Releases

1.0.14

2012-05-10

2013-03-19

Activity

skyscraper Homepage Documentation Wiki

skyscraper

0.0

Repository is gone

No release in over 3 years

Easy to use DSL that helps scraping data from websites. Thanks to it, writing web crawlers would be very fast and intuitive. Traversing through html nodes and fetching all of the HTML attributes, would be possible. Just like in jQuery - you will find methods like parent, children, first, find, siblings etc. Furthermore, you are able to download images, web pages, and store all content in the database. Please visit my Github account for more details.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

15,207

Releases

0.1.0

2012-05-17

2012-05-30

Activity

baidu_crawler debbbbie/baidu_crawler Homepage Documentation Source Code Bug Tracker Wiki

baidu_crawler

0.0

No commit activity in last 3 years

No release in over 3 years

The Baidu Crawler is to crawl data with your demmand

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

5,973

Releases

0.0.1

2012-09-01

2012-09-01

Activity

2012-08-21

attribute_imagifiable zealot128/attribute_imagifiable Homepage Documentation Source Code Bug Tracker Wiki

attribute_imagifiable

0.0

Repository is archived

No commit activity in last 3 years

No release in over 3 years

Using paperclip to generate images from sensible attributes like e-mails and telephone numbers, in order to reduce crawler's success

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

27,846

Releases

0.0.8

2012-10-08

2013-07-31

Activity

2012-11-23

masque uu59/masque Homepage Documentation Source Code Bug Tracker Wiki

masque

0.01

No commit activity in last 3 years

No release in over 3 years

JavaScript enabled web crawler kit

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

45,322

Releases

0.4.3

2012-10-19

2014-10-19

Activity

100%

2013-05-23

the_country_identity p1nox/the_country_identity Homepage Documentation Source Code Bug Tracker

the_country_identity

0.0

No commit activity in last 3 years

No release in over 3 years

CIA World Factbook crawler

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

8,334

Releases

0.0.3

2012-11-23

2014-10-19

Activity

Issue Closure Rate

66%

100%

2014-02-23

apollo-crawler korczis/apollo-crawler Homepage Documentation Source Code Bug Tracker Wiki

apollo-crawler

0.01

No commit activity in last 3 years

No release in over 3 years

Gem for crawling data from external sources

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

267,612

Releases

0.1.31

2013-02-23

2013-03-27

Activity

2014-08-30

is_crawler ccashwell/is_crawler Homepage Documentation Source Code Bug Tracker Wiki

is_crawler

0.02

No commit activity in last 3 years

No release in over 3 years

is_crawler does exactly what you might think it does: determine if the supplied string matches a known crawler or bot.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

161,294

Releases

0.1.5

2013-02-27

2013-05-23

Activity

Issue Closure Rate

60%

2013-12-05

cosmicrawler bash0c7/cosmicrawler Homepage Documentation Source Code Bug Tracker Wiki

cosmicrawler

0.02

Repository is archived

No commit activity in last 3 years

No release in over 3 years

Cosmicrawler is crawler library for Ruby. It provides scalable asynchronous crawling by (http|file|etc) using EventMachine.

2005

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

Popularity

5,174

Releases

0.0.1

2013-03-11

2013-03-11

Activity

100%

2013-03-17