Gaspar PDF
Parses PDF tables into HTML, JSON, XML, or CSV files without losing data. This gem uses pdf-table-extract.
Installation
Add this line to your application's Gemfile:
gem 'gaspar-pdf'
And then execute:
$ bundle
Or install it yourself as:
$ gem install gaspar-pdf
Usage
You need to install pdf-table-extract on your system to use this gem.
require 'gaspar'
# Parse document.pdf to document.html
# This requires that the pdf-table-extract command is present in your PATH.
content = Gaspar.parse(source: 'document.pdf',
                       target: 'document.html',
                       as: :html)
# target is optional
# Available types: :html, :json, :xml, :csv
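Because parsing depends on the external pdf-table-extract command, it can help to check for it before calling Gaspar.parse. The following is a minimal sketch, not part of the gem: command_available? and parse_tables are hypothetical helper names, and the sketch assumes only the Gaspar.parse(source:, as:) call shown above.

```ruby
# Hypothetical helper (not part of gaspar-pdf): returns true when an
# executable file called `name` exists in one of the directories in `path`.
def command_available?(name, path = ENV.fetch('PATH', ''))
  path.split(File::PATH_SEPARATOR).reject(&:empty?).any? do |dir|
    candidate = File.join(dir, name)
    File.file?(candidate) && File.executable?(candidate)
  end
end

# Hypothetical wrapper: calls Gaspar.parse only when pdf-table-extract is
# on the PATH, and returns nil (after printing a warning) otherwise.
def parse_tables(source, as: :html)
  unless command_available?('pdf-table-extract')
    warn 'pdf-table-extract not found in PATH; install it before parsing'
    return nil
  end

  require 'gaspar' # loaded lazily so the check above runs without the gem
  Gaspar.parse(source: source, as: as)
end
```

With these helpers, parse_tables('document.pdf', as: :json) would return whatever Gaspar.parse returns when the tool is installed, and nil when it is missing.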
Inspired by Kristin
Contributing
- Fork it
- Create your feature branch (git checkout -b my-new-feature)
- Commit your changes (git commit -am 'Add some feature')
- Push to the branch (git push origin my-new-feature)
- Create new Pull Request