Search results for 'crawler' - The Ruby Toolbox

80%

2020-05-23

cobweb (stewartmckee/cobweb)

Web Content Scrapers

0.09

No commit activity in last 3 years

No release in over 3 years

There are a lot of open issues

Cobweb is a web crawler that can use Resque to cluster crawls, so it can crawl extremely large sites quickly, which is much more performant than multi-threaded crawlers. It is also a standalone crawler with a sophisticated statistics interface for monitoring crawl progress.

Popularity

349,150

225

Releases

1.2.1

2010-11-10

2021-01-09

Activity

50%

57%

2016-04-07

crawler_detect (loadkpi/crawler_detect)

User Agent Detection

0.08

Low commit activity in last 3 years

A long-lived project that still receives updates

CrawlerDetect is a library to detect bots/crawlers via the user agent

Popularity

2,189,871

143

Releases

1.2.9

1980-01-02

2025-11-06

Activity

100%

76%

2022-03-09

crawler-engine

0.0

No release in over 3 years

Crawler Engine crawls all news from a customized website

Popularity

5,304

Releases

0.1.0

2011-11-22

2011-11-22

Activity

bank-crawlers-hapoalim (joaomilho/bank-crawlers-hapoalim)

0.0

No commit activity in last 3 years

No release in over 3 years

A crappy crawler for a crappy bank interface

Popularity

6,851

Releases

0.0.7

2015-04-03

2015-04-03

Activity

2014-04-11

embulk-filter-crawler (toyama0919/embulk-filter-crawler)

0.0

No commit activity in last 3 years

No release in over 3 years

Crawler4J filter plugin for Embulk

Popularity

11,243

Releases

0.1.3

2016-03-25

2016-04-06

Activity

2016-03-28

senthor_rails_legacy

0.0

No release in over 3 years

Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.

Popularity

695

Releases

1.1.0

1980-01-02

1980-01-02

Activity

senthor_rails

0.0

No release in over 3 years

Protect your content from AI crawlers and monetize every request with Senthor. Real-time detection, crawler control, and detailed analytics.

Popularity

485

Releases

1.1.0

1980-01-02

1980-01-02

Activity

wombat (felipecsl/wombat)

Web Content Scrapers

0.39

No release in over 3 years

Low commit activity in last 3 years

There are a lot of open issues

Generic Web crawler with a DSL that parses structured data from web pages

Popularity

233,713

1,355

129

Releases

3.2.0

1980-01-02

2022-08-23

Activity

59%

80%

2020-10-09

voight_kampff (biola/voight-kampff)

0.22

Low commit activity in last 3 years

No release in over a year

Voight-Kampff detects bots, spiders, crawlers and replicants

Popularity

9,215,652

192

Releases

2.0.0

2011-05-11

2023-03-12

Activity

94%

58%

2018-09-03

is_crawler (ccashwell/is_crawler)

0.01

No commit activity in last 3 years

No release in over 3 years

is_crawler does exactly what you might think it does: it determines whether the supplied string matches a known crawler or bot.

Popularity

175,349

Releases

0.1.5

2013-02-27

2013-05-23

Activity

60%

2013-12-05

zy_crawler (uuensky/zycrawler)

0.0

No commit activity in last 3 years

No release in over 3 years

A simple demo crawler

Popularity

1,575

Releases

0.0.1

2022-03-08

2022-03-08

Activity

2022-03-08

nicoquery-crawler

0.0

No release in over 3 years

Crawler for niconico

Popularity

3,923

Releases

0.0.1.4

2013-08-09

2013-08-09

Activity

baidu_crawler (debbbbie/baidu_crawler)

0.0

No commit activity in last 3 years

No release in over 3 years

The Baidu Crawler crawls data according to your demand

Popularity

6,417

Releases

0.0.1

2012-09-01

2012-09-01

Activity

2012-08-21

web_crawler (webgago/web_crawler)

0.0

No commit activity in last 3 years

No release in over 3 years

Web crawler helps you parse and collect data from the web

Popularity

20,691

Releases

0.5.4

2011-05-30

2011-06-24

Activity

100%

100%

2011-09-08

arb-bs (arybin-cn/arb-bs)

0.0

No commit activity in last 3 years

No release in over 3 years

A demo of a web crawler using arb-crawler

Popularity

22,599

Releases

1.1.4

2017-02-13

2018-04-12

Activity

2017-09-11

crawler_rocks

0.0

No release in over 3 years

a crawler toolkit

Popularity

17,162

Releases

0.0.6

2015-05-29

2016-07-21

Activity

yz_crawler

0.0

No release in over 3 years

A simple web crawler gem

Popularity

4,534

Releases

0.0.1

2015-03-22

2015-03-22

Activity

arb-crawler (arybin-cn/arb-crawler)

0.0

No commit activity in last 3 years

No release in over 3 years

Web page crawler.

Popularity

10,112

Releases

1.0.3

2017-02-12

2017-08-06

Activity

2017-03-24

czj_crawler

0.0

No release in over 3 years

A simple web crawler gem

Popularity

2,913

Releases

0.0.1

2019-02-10

2019-02-10

Activity