Skip to main content
  1. Posts/

Welcome-to-pdfminer-six-s-documentation-pdfmine

107 words·1 min

Welcome-to-pdfminer-six-s-documentation-pdfmine #

— pdfminer.six 20191020 documentation Created: January 28, 2020 9:01 PM Tags: Tools URL: https://pdfminersix.readthedocs.io/en/latest/index.html

Features #

  • Parse all objects from a PDF document into Python objects.
  • Analyze and group text in a human-readable way.
  • Extract text, images (JPG, JBIG2 and Bitmaps), table-of-contents, tagged contents and more.
  • Support for (almost all) features from the PDF-1.7 specification
  • Support for Chinese, Japanese and Korean CJK) languages as well as vertical writing.
  • Support for various font types (Type1, TrueType, Type3, and CID).

Installation instructions #

Before using it, you must install it using Python 3.4 or newer.

$ pip install pdfminer.six

Contributing #

We welcome any contributors to pdfminer.six!