Skip to end of metadata
Go to start of metadata

You are viewing an old version of this content. View the current version.

Compare with Current View Version History

« Previous Version 4 Current »

Input

Most common input document file types are supported:

  • general: pdf, txt, zip

  • images of various kinds (jpg, jpeg, png, tiff, gif, bmp)

  • common Microsoft office documents (doc, docx, xls, xlsx, ppt)

  • emails (eml and msg)

Output types

The default output type is JSON, since the default output method is by API.

Additionally, we provide excel download to download results for multiple files (xlsx, csv)

Finally, it is possible to obtain (via API):

  • the original file (and a generated pdf)

  • an image for every page of the file (jpeg)

  • No labels