The project is in a healthy, maintained state
pikuri-extractors plugs additional document formats into pikuri-core's +Pikuri::Extractor+ registry. The bundled +Pikuri::Extractors::DOCUMENTS+ extractor converts office documents (DOCX, ODT, XLSX, legacy XLS, PPTX, EPUB, RTF) to Markdown by piping the bytes through pandoc / markitdown — preferably inside a one-shot, networkless, locally-built docker container (the untrusted bytes never touch the host filesystem or network), falling back to a host-installed pandoc / markitdown CLI when docker is absent. Registration is explicit — +Pikuri::Extractors::DOCUMENTS.register+ — so requiring the gem changes nothing by itself; the host script picks which extractors it wires in.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
 Dependencies

Runtime