Input
The most common document file types are supported as input:
pdf
images of various kinds (jpg, jpeg, png, tiff, …)
common Microsoft Office documents (docx, xlsx, ppt)
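As an illustration only, a client could check a file against the list above before submitting it. The sketch below is not part of the product: the names `LISTED_EXTENSIONS` and `classify_input` are hypothetical, the category labels are made up for this example, and the mapping covers only the extensions named explicitly above (the "…" in the image entry means more formats are accepted than are shown here).

```python
from pathlib import Path
from typing import Optional

# Hypothetical mapping built only from the extensions named in the list above.
# The image entry ends with "…", so this table is deliberately incomplete.
LISTED_EXTENSIONS = {
    "pdf": "pdf",
    "jpg": "image",
    "jpeg": "image",
    "png": "image",
    "tiff": "image",
    "docx": "office",
    "xlsx": "office",
    "ppt": "office",
}


def classify_input(filename: str) -> Optional[str]:
    """Return the input category for a filename, or None if its extension
    is not one of the explicitly listed ones."""
    # Normalize: take the last suffix, lowercase it, drop the leading dot.
    ext = Path(filename).suffix.lower().lstrip(".")
    return LISTED_EXTENSIONS.get(ext)
```

A result of `None` only means the extension is not named in the list above; it does not necessarily mean the file would be rejected.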
Output types
JSON
Excel-compatible spreadsheets (xlsx, csv)
original file (and a generated pdf)
images for every page (jpeg)