Frequently-Asked Questions

Which Python versions are supported?

PyPDF2 2.0+ supports Python 3.6 and later. PyPDF2 1.27.10 supported Python 2.7 to 3.10.

Who uses PyPDF2?

pyPdf is vendored into several projects. That means the code of pyPdf was copied into that project.

Projects that depend on PyPDF2:

  • Camelot: A Python library to extract tabular data from PDFs

  • edi: Electronic Data Interchange modules

  • amazon-textract-textractor: Analyze documents with Amazon Textract and generate output in multiple formats.

  • maigret: Collect a dossier on a person by username from thousands of sites

  • deda: tracking Dots Extraction, Decoding and Anonymisation toolkit

  • opencanary

  • Document Conversions

  • pdfalyzer: A PDF analysis tool for visualizing the inner tree-like data structure of a PDF in spectacularly large and colorful diagrams as well as scanning the binary streams embedded in the PDF for hidden potentially malicious content.

How do I cite PyPDF2?

In BibTeX format:

@misc{pypdf2,
 title         = {The {PyPDF2} library},
 author        = {Mathieu Fenniak and
                  Matthew Stamy and
                  pubpub-zz and
                  Martin Thoma and
                  Matthew Peveler and
                  exiledkingcc and {PyPDF2 Contributors}},
 year          = {2022},
 url           = {https://pypi.org/project/PyPDF2/}
 note          = {See https://pypdf2.readthedocs.io/en/latest/meta/CONTRIBUTORS.html for all contributors}
}