A command line utility for extracting annotation and field metadata from a PDF in JSON format.

PDF::Extract

This gem provides a command line interface to extract field and annotation metadata from a PDF.

$ pdf-extract fields spec/data/field-examples/text.pdf
[{"name":"Sample Text Field","value":"Hello"},{"name":"Sample Text Field (required)","value":null}]
$ pdf-extract annotations spec/data/annotation-examples/note.pdf
[{"name":null,"contents":"Hello"},{"name":null,"contents":"Hello"}]
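Because the output is plain JSON, it can be consumed with Ruby's standard library. The following is a minimal sketch, not part of the gem: it parses the sample `fields` output shown above and indexes it by field name. The variable names (`values_by_name`, `unfilled`) are illustrative assumptions, and `Array#to_h` with a block needs Ruby 2.6 or later.

```ruby
require 'json'

# Sample output copied from the `pdf-extract fields` example above.
output = '[{"name":"Sample Text Field","value":"Hello"},' \
         '{"name":"Sample Text Field (required)","value":null}]'

# Each entry is a hash with "name" and "value" keys.
fields = JSON.parse(output)

# Index the values by field name for easy lookup (illustrative helper data).
values_by_name = fields.to_h { |field| [field['name'], field['value']] }

# Names of fields whose value is null in the PDF.
unfilled = values_by_name.select { |_name, value| value.nil? }.keys

puts values_by_name['Sample Text Field'] # prints Hello
puts unfilled.inspect                    # prints ["Sample Text Field (required)"]
```

The same approach should apply to the `annotations` output, which uses `name` and `contents` keys instead.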

Installation

Add this line to your application's Gemfile:

gem 'pdf-extract-meta'

And then execute:

$ bundle

Or install it yourself as:

$ gem install pdf-extract-meta

Usage

Run pdf-extract --help for usage.

From within Ruby:

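require 'json'

# Run the CLI with the pre-Bundler environment so it does not inherit this app's bundle.
Bundler.with_clean_env do
  JSON.parse(`pdf-extract fields '#{pdf_path}'`)
end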
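The backtick form above passes the path through a shell, so a path containing a single quote would break the command. As a hedged alternative, the sketch below calls the executable through `Open3.capture2` with separate arguments, which bypasses the shell entirely. The helper name `extract_pdf_metadata` and its `executable:` keyword are hypothetical and not part of the gem; only the `pdf-extract fields <path>` and `pdf-extract annotations <path>` invocations come from this README.

```ruby
require 'json'
require 'open3'

# Hypothetical helper (not provided by the gem): runs
# `pdf-extract <kind> <pdf_path>` and returns the parsed JSON.
# `kind` is expected to be "fields" or "annotations".
def extract_pdf_metadata(kind, pdf_path, executable: 'pdf-extract')
  # Arguments are passed as separate strings, so no shell is involved and
  # spaces or quotes in pdf_path need no escaping.
  stdout, status = Open3.capture2(executable, kind, pdf_path)

  # Fail loudly instead of trying to parse the output of a failed run.
  unless status.success?
    raise "#{executable} #{kind} failed with exit status #{status.exitstatus}"
  end

  JSON.parse(stdout)
end
```

A call would look like `extract_pdf_metadata('fields', pdf_path)`. Inside a Bundler-managed application the call can still be wrapped in `Bundler.with_clean_env`, or in `Bundler.with_unbundled_env` on Bundler 2.1 and later, where `with_clean_env` is deprecated.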

Development

After checking out the repo, run bin/setup to install dependencies. Then, run rake spec to run the tests. You can also run bin/console for an interactive prompt that will allow you to experiment.

To install this gem onto your local machine, run bundle exec rake install. To release a new version, update the version number in version.rb, and then run bundle exec rake release, which will create a git tag for the version and push git commits and tags.