使用pypdf模块提取PDF中的全部图片,效果不次于pymupdf。代码如下:
from pypdf import PdfReader, PdfWriter
reader = PdfReader("example.pdf")
writer = PdfWriter()
count = 0
for i in range(len(reader.pages)):
page = reader.pages[i]
for img_fiel in page.images:
with open(f"{count}-{img_fiel.name}",'wb') as fp:
fp.write(img_fiel.data)
count += 1