Welcome-to-pdfminer-six-s-documentation-pdfmine
pdfminer.six 20191020 documentation
Created: January 28, 2020 9:01 PM
Tags: Tools
URL: https://pdfminersix.readthedocs.io/en/latest/index.html
Features #
- Parse all objects from a PDF document into Python objects.
- Analyze and group text in a human-readable way.
- Extract text, images (JPG, JBIG2 and Bitmaps), table-of-contents, tagged contents and more.
- Support for (almost all) features from the PDF-1.7 specification.
- Support for Chinese, Japanese and Korean (CJK) languages as well as vertical writing.
- Support for various font types (Type1, TrueType, Type3, and CID).
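As a rough illustration of the text-extraction feature listed above, here is a minimal sketch. It assumes an installed pdfminer.six release that provides `pdfminer.high_level.extract_text`; that helper is available in recent releases and may not exist in the 20191020 release this page describes. The function name `pdf_to_text` and the file name `example.pdf` are hypothetical.

```python
import os


def pdf_to_text(path):
    """Return the text of the PDF at `path` as a single string."""
    # Fail early with a clear error before touching the library.
    if not os.path.isfile(path):
        raise FileNotFoundError(path)
    # Imported here so this module still loads when pdfminer.six is absent.
    # Assumption: the installed release provides high_level.extract_text.
    from pdfminer.high_level import extract_text
    return extract_text(path)


# Hypothetical usage; "example.pdf" is a placeholder file name.
# print(pdf_to_text("example.pdf"))
```

The existence check runs before the import so that a wrong path is reported the same way whether or not the library is installed.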
Installation instructions #
Install pdfminer.six before using it. It requires Python 3.4 or newer.
$ pip install pdfminer.six
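To confirm the installation, a small check along these lines may help. It is a sketch using only the standard library; the helper names `python_is_supported` and `pdfminer_is_installed` are illustrative and not part of pdfminer.six. It relies on the fact that the `pdfminer.six` distribution installs a package that is imported as `pdfminer`.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 4)  # minimum Python version stated above


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


def pdfminer_is_installed():
    """Return True when the `pdfminer` package is found on the import path."""
    return importlib.util.find_spec("pdfminer") is not None


if __name__ == "__main__":
    print("Python version supported:", python_is_supported())
    print("pdfminer.six importable:", pdfminer_is_installed())
```

`find_spec` only locates the package and does not import it, so the check stays cheap and has no side effects.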
Contributing #
We welcome any contributors to pdfminer.six!