Project

zhSieve

0.0
No commit activity in last 3 years
No release in over 3 years
A Ruby-based Zhihu content crawler.

 Project Readme

zhSieve

A Ruby-based Zhihu content crawler

Status: work in progress 🚧

Prerequisites

Create a cookies.txt file in the Mozilla cookies.txt format and place it in the project's root folder.
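
For reference, here is a minimal sketch of how a Mozilla-style cookies.txt file could be read in Ruby. It is not taken from zhSieve's source: `ParsedCookie`, `parse_cookies_txt`, and `cookie_header` are hypothetical names, and the sketch assumes the common layout of seven tab-separated fields per line (domain, subdomain flag, path, secure flag, expiry, name, value), with `#` starting a comment and `#HttpOnly_` used as a prefix by some tools.

```ruby
# Hypothetical helpers for illustration only; zhSieve may read cookies differently.
ParsedCookie = Struct.new(:domain, :path, :secure, :expires, :name, :value)

# Parse the text of a Mozilla/Netscape-style cookies.txt file into ParsedCookie structs.
def parse_cookies_txt(text)
  text.each_line.map do |line|
    line = line.chomp
    # Some tools mark HttpOnly cookies with this prefix; strip it before the comment check.
    line = line.sub(/\A#HttpOnly_/, "")
    next if line.strip.empty? || line.start_with?("#")

    fields = line.split("\t", -1) # -1 keeps a trailing empty value field
    next unless fields.size == 7

    domain, _include_subdomains, path, secure, expires, name, value = fields
    ParsedCookie.new(domain, path, secure == "TRUE", expires.to_i, name, value)
  end.compact
end

# Build the value of an HTTP Cookie request header from parsed cookies.
def cookie_header(cookies)
  cookies.map { |c| "#{c.name}=#{c.value}" }.join("; ")
end

# Example usage (assumes cookies.txt exists in the current directory):
#   cookies = parse_cookies_txt(File.read("cookies.txt"))
#   puts cookie_header(cookies)
```

The sketch skips malformed lines rather than raising, on the assumption that a crawler should tolerate hand-edited cookie files.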

Usage

Crawl a specific answer and output it in Markdown format.

$ zhSieve answer -q 'Your Question Id' -a 'Your Answer Id'

Crawl a specific Zhuanlan article and output it in Markdown format.

$ zhSieve article -z 'Your Article Id'

TO-DO List

  • Question 🚨
  • Answer 🏁
    • Use question_id and answer_id to crawl the page information and output a Markdown file with ["question title and link", "author's avatar, name, biography", "answer content"].
    • Testing ⚠️
  • People 🚧
  • ZhuanLan, which contains two components:
      1. Crawl a single article 🏁
      2. Crawl someone's Zhuanlan category 🚨
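
The Answer item above lists the fields that go into the Markdown file. As a rough illustration of that step only, the sketch below assembles those fields into a Markdown string. The function name `render_answer_markdown`, its keyword arguments, and the exact layout are assumptions made for this example and may not match what zhSieve actually emits.

```ruby
# Hypothetical sketch: the layout below is an assumption, not zhSieve's real output format.
def render_answer_markdown(question_title:, question_url:, author_name:,
                           author_avatar_url:, author_bio:, content:)
  [
    "# [#{question_title}](#{question_url})",  # question title and link
    "",
    "![#{author_name}](#{author_avatar_url})", # author's avatar
    "",
    "**#{author_name}**: #{author_bio}",       # author's name and biography
    "",
    content                                    # answer content
  ].join("\n") + "\n"
end

# Example usage with placeholder values:
puts render_answer_markdown(
  question_title: "Example question",
  question_url: "https://example.com/question/1",
  author_name: "Example Author",
  author_avatar_url: "https://example.com/avatar.png",
  author_bio: "Example biography",
  content: "Example answer body."
)
```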

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/gwzz/zhSieve. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the Contributor Covenant code of conduct.

License

The gem is available as open source under the terms of the MIT License.