Awesome
Pdf2htmlex
Elixir library to convert PDF documents to HTML without losing text or format.
Requirements
The command line tool pdf2htmlex must be installed on your system and the binary must be on your PATH.
Installation
Add this to your mix.exs
file, then run mix do deps.get, deps.compile
:
{:pdf2htmlex, "~> 0.1"}
Examples
import Pdf2htmlex
# Simplest conversion possible.
open("/Users/ricn/pdfs/sample.pdf") |> convert_to!("/Users/ricn/html")
# Set zoom ratio to 150 % and set horizontal and vertical dpi to 96 for images.
open("/Users/ricn/pdfs/sample.pdf")
|> zoom(1.5)
|> hdpi(96)
|> vdpi(96)
|> convert_to!("/Users/ricn/html")
# Set the maximum width to 640 pixels and maximum height to 480 pixels
open("/Users/ricn/pdfs/sample.pdf")
|> fit_width(640)
|> fit_height(480)
|> convert_to!("/Users/ricn/html")
# Converts only page 1 to 10
open("/Users/ricn/pdfs/sample.pdf")
|> first_page(1)
|> last_page(10)
|> convert_to!("/Users/ricn/html")
# For more options see the documentation
Credits
The following people have contributed ideas, documentation, or code to Pdf2htmlex:
- Richard Nyström
Contributing
- Fork it
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request