The PyPDF2
package is quite useful and is usually pretty fast. You can use PyPDF2
to automate large jobs and leverage its capabilities to help you do your job better!
In this course, you learned how to do the following:
- Extract metadata from a PDF
- Rotate pages
- Merge and split PDFs
- Add watermarks
- Add encryption
Also keep an eye on the newer PyPDF4
package as it will likely replace PyPDF2
soon. You might also want to check out pdfrw
, which can do many of the same things that PyPDF2
can do.
If you’d like to learn more about working with PDFs in Python, then you should check out some of the following resources for more information:
- The
PyPDF2
website - The Github page for
PyPDF4
- The Github page for
pdfrw
- The ReportLab website
- The Github page for
PDFMiner
- Camelot: PDF Table Extraction for Humans
Congratulations, you made it to the end of the course! What’s your #1 takeaway or favorite thing you learned? How are you going to put your newfound skills to use? Leave a comment in the discussion section and let us know.
mikesult on March 1, 2020
Thank you Andrew for a great and very useful tutorial. I learned a lot about working with PDFs. I use pdf files as music charts quite a bit and these techniques will be very useful to split, merge and organize charts from pdf books. I appreciate your links to additional resources too.