0.0
Repository is gone
No release in over 3 years
Easy to use DSL that helps scraping data from websites. Thanks to it, writing web crawlers would be very fast and intuitive. Traversing through html nodes and fetching all of the HTML attributes, would be possible. Just like in jQuery - you will find methods like parent, children, first, find, si...
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
The application crawls a URL and extracts links, tags and sequences. These features are written to an output file
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
RegexpCrawler is a Ruby library for crawl data from website using regular expression.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
This gem offers: classes to subclass and create a manga site crawler; a dowloader to use with these classes; some site-specific scripts.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-http-test gem contains a HTTP test server for testing HTTP client implementations.
2019
2020
2021
2022
2023
2024
0.0
Repository is gone
No release in over a year
Crawler Guru provides all basic functionalities to extract data from web pages
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. This gem is an rjack-async-httpclient based implementation of the iudex-http interfaces.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Crawler Engine provides function of crawl all news from the customized website
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
A simple news crawler. You can specify the structure of your xml or rss feeds.
2019
2020
2021
2022
2023
2024
0.0
No release in over 3 years
Iudex is a general purpose web crawler and feed processor in ruby/java. The iudex-worker gem provides a worker deamon for feed/page processing.
2019
2020
2021
2022
2023
2024