Merging PDF files with Python PyPDF2 - Tencent Cloud Developer Community

python-2.7, pypdf

Hi, I'm trying to watermark a PDF file using PyPDF2, but I get the error below and I don't know what went wrong: Traceback (most recent call last): File "test.py", line 13, in <module> page.mergePage(watermark.getPage(0)) File "C:\Python27\site-packages\PyPDF2\pdf.py", line 1594, in mergePage self._mergePage(page2) File &

Views: 2, asked 2013-11-26, votes: 0

Answer accepted

1 answer

Combining PDFs in Python: errno 2, no such file or directory, even though the file exists

python, pdf

I'm merging many PDF files in Python and get the error "errno 2 no such file or directory" even though the file exists. I tried listing the PDF files just to show that they exist. import os from PyPDF2 import PdfFileMerger merger = PdfFileMerger() source_dir = os.getcwd() + '/Combined PDF' for items in os.listdir(source_dir): if items.endswith('.pdf'):

Views: 7, asked 2022-03-03, votes: 0
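
One frequent cause of this symptom, which the truncated excerpt above can neither confirm nor rule out, is that `os.listdir` returns bare file names rather than full paths, so opening a name only works when the script happens to run inside that folder. The sketch below uses only the standard library; the helper name `list_pdf_paths` is made up for illustration.

```python
import os


def list_pdf_paths(source_dir):
    """Return full paths of the PDF files directly inside source_dir, sorted by name."""
    paths = []
    for name in sorted(os.listdir(source_dir)):
        # os.listdir yields bare names such as "a.pdf"; joining them with the
        # directory makes the path valid from any working directory.
        if name.lower().endswith(".pdf"):
            paths.append(os.path.join(source_dir, name))
    return paths
```

Each returned path could then be handed to `merger.append(...)` in place of the bare name.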

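2 answers

'OSError: [Errno 22] Invalid argument' when merging files with PyPDF2

python, pdf, pypdf2, oserror

I just want to merge some PDF files with Python, more specifically with PyPDF2. It should be simple, but for some reason I get an error that I don't understand at all. While looking for a solution I found that other people have this problem too, but no satisfactory solution has been posted. My code for merging the files: from PyPDF2 import PdfFileMerger def merge(self, work_files, destination_file): pdf_merger = PdfFileMerger() for pdf in work_files: pdf_merger.append(pdf)

Views: 0, asked 2020-05-28, votes: 0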

1 answer

Error when using PyPDF2 PdfFileMerger in Python

python, python-3.x, exception, pypdf2, pypdf

I've been writing a Python program that merges multiple PDF files using PyPDF2. Here is the code: import os from PyPDF2 import PdfFileMerger source_dir = os.getcwd() merger = PdfFileMerger() for item in os.listdir(source_dir): if item.endswith('pdf'): merger.append(item) merger.write('completed_file.pdf') merger.close() When running

Views: 3, asked 2020-11-17, votes: 2

Answer accepted

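1 answer

Python OSError [Errno 22] in a PyPDF2 script

python

I'm making a script that watermarks all the PDFs in a folder. It worked, but then I tweaked it so that the watermarked PDFs are moved to a destination folder, and suddenly I can't get it to work anymore. The error: Traceback (most recent call last): File "pdf_watermarker_v2.py", line 25, in <module> source_read = PyPDF2.PdfFileReader(source_open) File "C:\Users\niels\AppData\Local\Programs\Python\Python38-32\lib\site-packages\PyPDF2\pdf.py", line 1084, in __init__ self.re

Views: 2, asked 2020-04-09, votes: 0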

3 answers

PyPDF2: Stream has ended unexpectedly

python, python-3.x, pdf, pypdf, pypdf2

I have a Python script that uses PyPDF2 to reverse the page order of a PDF. from PyPDF2 import PdfFileWriter, PdfFileReader output = PdfFileWriter() rpage = [] name = input("What's the file called?") filename = name.split('.', 1) input1 = PdfFileReader(open(name,'rb'), strict = False) pages = list(range(1,i

Views: 5, asked 2017-03-03, votes: 0

1 answer

AssertionError in Python when saving PDF files generated with PyPDF2

python, python-3.x, pdf, pypdf2

I want to split the pages of a given PDF into separate PDFs. Below is the code I wrote, but when saving the files with the open() and .write() functions I get the error: AssertionError from PyPDF2 import PdfFileReader, PdfFileWriter pdf = PdfFileReader("input.pdf") # this is the source pdf for page in range(pdf.getNumPages()): pdf_writer = PdfFileWriter() pdf_writer.addPage(p

Views: 16, asked 2021-07-07, votes: 0

3 answers

Is there a faster way to merge two files than page by page?

python, pypdf2

I'm using Python 3, and to add page numbers to a newly generated PDF I merge two PDF files page by page as follows: from PyPDF2 import PdfFileWriter, PdfFileReader def merge_pdf_files(first_pdf_fp, second_pdf_fp, target_fp): """ Merges two PDF files into a target final PDF file. Args: first_pdf_fp: the first PDF file path.

Views: 5, asked 2020-04-29, votes: 0

Answer accepted

1 answer

Replacing at least 100 pages of a PDF with pages from another PDF

python, pdf

Here is an example of the code: import PyPDF2 import numpy as np # creating a pdf file object pdfFileObj = open('original.pdf' , 'rb') pdfFileObj_1 = open('tutorial.pdf', 'rb') # creating a pdf reader object pdfReader = PyPDF2.PdfFileReader(pdfFileObj) pdfReader_1 = PyPDF2.PdfFileReader(

Views: 0, asked 2018-09-18, votes: 1

Answer accepted

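2 answers

Combining PDFs with specific names from two different folders using PyPDF2

python, python-3.x, pypdf2

I have two folders, each with a different set of PDFs. A PDF with a specific name in the first folder needs to be combined with a PDF with a specific name in the second folder. For example, "PID-01.pdf" from the first folder needs to be combined with "FNN-PID-01.pdf" from the second folder, "PID-02.pdf" from the first folder needs to be combined with "FNN-PID-02.pdf" from the second folder, and so on. I'm using the Python module PyPDF2. Can someone give an example using PyPDF2?

Views: 3, asked 2021-08-03, votes: 0

Answer accepted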
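
The pairing rule in the question above ("PID-01.pdf" goes with "FNN-PID-01.pdf") can be worked out before any PDF library is involved. This is a minimal standard-library sketch, not the accepted answer; the function name `pair_pdfs` and the `prefix` parameter are illustrative assumptions.

```python
import os


def pair_pdfs(first_dir, second_dir, prefix="FNN-"):
    """Pair each PDF in first_dir with the prefixed PDF of the same name in second_dir."""
    second_names = set(os.listdir(second_dir))
    pairs = []
    for name in sorted(os.listdir(first_dir)):
        if not name.lower().endswith(".pdf"):
            continue
        partner = prefix + name  # e.g. "PID-01.pdf" -> "FNN-PID-01.pdf"
        if partner in second_names:
            # Full paths, so the result does not depend on the working directory.
            pairs.append((os.path.join(first_dir, name),
                          os.path.join(second_dir, partner)))
    return pairs
```

Files without a partner are skipped silently here; a real script would probably want to report them.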

1 answer

Getting TypeError: ord() expected string of length 1, but int found

python, python-3.x, pypdf2

The code is: from PyPDF2 import PdfFileReader with open('HTTP_Book.pdf','rb') as file: pdf=PdfFileReader(file) pagedd=pdf.getPage(0) print(pagedd.extractText()) This code raises the error shown below: TypeError: ord() expected string of length 1, but int found I searched online and found this, but it didn't help much. I know what this error generally means, but I'm not sure what it

Views: 0, asked 2019-05-05, votes: 6

Answer accepted

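1 answer

Adding information to a PDF: PyPDF2 merging is too slow

python, python-2.x, pypdf

I want to write text on every page of a PDF. The text is HTML code that looks like <p style="color: #ff0000">blabla</p> and appears red in the final document. I convert it to PDF (html2pdf library) and then merge it (PyPDF2 library) onto each page of my PDF... but the merging is very slow! My question: is there a faster way to merge PDFs than PyPDF2's page.mergePage method? (Or is there a faster way to add my text to this PDF?) Thanks! (Using Python 2.7.5 on Windows 8)

Views: 5, asked 2013-08-07, votes: 5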

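1 answer

Encrypting many PDFs with Python using PyPDF2

python, pdf, encryption, permissions, pypdf2

I'm trying to write a Python program that loops over all files in a folder, selects those with the '.pdf' extension, and encrypts them with restricted permissions. I'm using this version of the PyPDF2 library: https://github.com/vchatterji/PyPDF2 (a modification of the original PyPDF2 that also allows setting permissions). I've tested it with a single PDF file and it works fine. I want the original PDF file to be deleted and the encrypted file to keep the same name. Here is my code: import os import PyPDF2 directory = './' for filename in os.listdir(directory)

Views: 17, asked 2019-03-21, votes: 1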

1 answer

NotImplementedError when using the PyPDF2 module in Python 3

python, python-3.x, pypdf2, pypdf

I've been writing a Python program that merges 2 PDF files into one. Here is the code: import os from PyPDF2 import PdfFileMerger source_dir = os.getcwd() merger = PdfFileMerger() for item in os.listdir(source_dir): if item.endswith('pdf'): merger.append(item) merger.write('completed_file.pdf') merger.close() When run

Views: 2, asked 2020-11-16, votes: 0

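1 answer

Merging PDF files with the same prefix using PyPDF2

python, python-3.x, python-requests

I have multiple PDF files with different prefixes. I want to merge these PDF files based on the third prefix (the third underscore-separated value). I want to use the Python library PyPDF2 for this. For example: 0_2021_1_123.pdf 0_2021_1_1234.pdf 0_2021_1_12345.pdf 0_2021_2_123.pdf 0_2021_2_1234.pdf 0_2021_2_12345.pdf Expected result: 1_merged.pdf 2_merged.pdf This is what I tried, but I get an error and it doesn't work. Any help is much appreciated. from PyPDF2 import PdfFileMerger import io import

Views: 49, asked 2021-11-03, votes: 1

Answer accepted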
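
The grouping step in the question above does not depend on PyPDF2 at all, so it can be sketched and checked separately. The following is a rough illustration under the question's naming pattern (`a_b_c_d.pdf`); the function name `group_by_third_field` is an assumption made for this example.

```python
from collections import defaultdict


def group_by_third_field(filenames):
    """Map '<key>_merged.pdf' to the input names whose third '_' field equals key."""
    groups = defaultdict(list)
    for name in sorted(filenames):
        stem = name.rsplit(".", 1)[0]   # drop the extension
        fields = stem.split("_")
        if len(fields) < 3:
            continue  # name does not follow the a_b_c_d.pdf pattern
        groups[fields[2] + "_merged.pdf"].append(name)
    return dict(groups)
```

With the six example names from the question, this yields two groups keyed `1_merged.pdf` and `2_merged.pdf`, each of which could then be fed to one merger.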

4 answers

PyPDF2 write doesn't work on some PDF files (Python 3.5.1)

python, python-3.x, pdf, reportlab, pypdf2

First of all, I'm using Python 3.5.1 (32-bit). I wrote the following program to add page numbers to all pages of my PDF files using PyPDF2 and reportlab: #import modules from os import listdir from PyPDF2 import PdfFileWriter, PdfFileReader import io from reportlab.pdfgen import canvas from reportlab.lib.pagesizes import A4 #initial values of variable declarations PDFlist=[] X

Views: 1, asked 2017-08-31, votes: 12

2 answers

Can't merge PDFs with py2pdf - ValueError

python, pdf

I'm trying to merge PDF files that I downloaded from Google, and I get the following error: ValueError: invalid literal for int() with base 10: b'F-1.4' This doesn't happen when I merge PDFs that I generated with Keynote. The full error is below: Traceback (most recent call last): File "weekly_meeting.py", line 36, in <module> file_path = sort_pdf(path) File "weekly_meeting.py"

Views: 2, asked 2019-01-13, votes: 1

1 answer

Merging PDF files with Python and PyPDF2 throws a TypeError

python, pdf, pypdf2

I'm using Python 3.6.5 to merge PDFs but ran into a problem. The code below raises a 'TypeError: 'NumberObject' object is not subscriptable' error. What am I doing wrong? When I comment out the line with merger.append, it prints the file paths correctly. import webbrowser import os from PyPDF2 import PdfFileMerger, PdfFileReader path = 'C:/test/pdfs' merger = PdfFileMerger() for pd

Views: 0, asked 2018-04-06, votes: 4

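1 answer

PyPDF2 PdfFileWriter has no attribute stream

python, pdf, pypdf2

I'm trying to split a PDF into its pages and save each page as a new PDF. I tried the approach from a previous question without success, and I also tried the PyPDF2 splitting example, also without success. Edit: I can see in my files that it successfully writes the first page and then creates the PDF for the second page, but it is empty. Here is the code I'm trying to run: from PyPDF2 import PdfFileWriter, PdfFileReader inputpdf = PdfFileReader(open("my_pdf.pdf", "rb")) for i in range(inputpdf.numPages): output = Pd

Views: 0, asked 2016-10-21, votes: 4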

1 answer

"import pyPDF2" results in "ModuleNotFoundError"

python, python-3.x, windows, pypdf2, modulenotfounderror

Summary of the problem: in the Python interpreter I typed import pyPDF2 and got a ModuleNotFoundError, even though I have installed the pyPDF2 module: >>> import pyPDF2 Traceback (most recent call last): File "<stdin>", line 1, in <module> ModuleNotFoundError: No module named 'pyPDF2' What I've tried: I'm on Windows 10. I'm new to Python. I have installed Python 3.8.3

Views: 468, asked 2020-05-28, votes: 0
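
A plausible reason for the error above, though the truncated excerpt cannot confirm it, is capitalization: the package is imported as `PyPDF2` with a capital `P`, and Python matches import names case-sensitively. The sketch below demonstrates that behaviour with a standard-library module so it runs without any extra package; `is_importable` is a hypothetical helper name.

```python
import importlib.util


def is_importable(module_name):
    """Return True if Python can locate a top-level module with exactly this name."""
    return importlib.util.find_spec(module_name) is not None


# "json" exists in the standard library; "JSON" differs only in case.
print(is_importable("json"))
print(is_importable("JSON"))
```

On a default setup the two checks disagree, which is the same situation as `pyPDF2` versus `PyPDF2`.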

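1 answer

Where is the merged PDF file?

python, pypdf2

I have a problem and need your help. I'm learning Python through "Automate the Boring Stuff with Python". I'm currently on chapter 13, working with PDF and Word files. I have this code from the book. It basically merges PDF files without their first pages. But after running the program, I don't see any PDF file appear. I tried to find it in the directory, but it isn't there either. So please help me find that file, thanks! Here is the code: import PyPDF2 import os pdfFiles = [] for filename in os.listdir('.'): if filename.endswith('.pdf'): pdfFiles.append

Views: 1, asked 2019-04-08, votes: 1

Answer accepted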

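3 answers

Recursively merging PDFs in subfolders using the pyPDF2 module in Python

python, pypdf2

I'm a novice developer learning Python, and I'm trying to recursively walk folders and subfolders and merge multiple PDFs into one PDF named after the subfolder. I have the following folder and subfolder structure. Folder before merging: dummy ball ball_baseball.pdf ball_basketball.pdf ball_volleyball.pdf ice ice_skating.pdf ice_curling.pdf

Views: 8, asked 2017-05-02, votes: 1

Answer accepted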

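1 answer

Merging PDFs from a Python list of WindowsPath paths

python, pdf

I have an Excel file containing data in rows and columns. I take the file names from each row and merge them into one PDF file (simply put, one PDF file per row). Here is an example of such a list: ['1', '112238', '112239', '112240', '112337', '112338']. The first element of the Python list is the name of the output PDF, and the other elements are the names of files that should exist in a directory named Files. from pathlib import Path import pandas as pd from PyPDF2 import PdfF

Views: 4, asked 2022-04-12, votes: 0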
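
The row layout described above (first element is the output name, the rest are input names looked up in `Files`) can be turned into concrete paths with `pathlib` alone. This is only a sketch: the function name `plan_row_merge` is invented here, and it assumes the files on disk carry a `.pdf` extension that the row values omit, which the excerpt does not state.

```python
from pathlib import Path


def plan_row_merge(row, files_dir="Files"):
    """Split one row into (output_path, existing_input_paths, missing_names)."""
    output_name, *input_names = row
    files_dir = Path(files_dir)
    inputs, missing = [], []
    for name in input_names:
        candidate = files_dir / f"{name}.pdf"  # assumed extension, see lead-in
        if candidate.is_file():
            inputs.append(candidate)
        else:
            missing.append(name)  # report instead of failing mid-merge
    return Path(f"{output_name}.pdf"), inputs, missing
```

Converting each path with `str()` before passing it to the merger is a cautious choice, since older PyPDF2 releases may not accept `pathlib` objects directly.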

1 answer

Assertion error when reading a PDF file in Python with PyPDF2

python, pdf, python-3.6, pypdf2

I get the following error when I try to read a PDF file. Code: from PyPDF2 import PdfFileReader import os os.chdir("Path to dir") pdf_document = 'sample.pdf' pdf = PdfFileReader(pdf_document,'rb') #Error here Error: Traceback (most recent call last): File "/home/krishna/PycharmProjects/sample/sample.py", l

Views: 45, asked 2020-05-21, votes: 0

2 answers

Error when merging two PDF files with PyPDF2

python, python-2.7, pypdf2

I searched for this problem many times but couldn't find an exact solution, which is why I'm asking this question... Here is my code for merging two PDF files in Python with PyPDF2: import os from PyPDF2 import PdfFileReader, PdfFileMerger files_dir = "/Users/ajayvictor/" pdf_files = [f for f in os.listdir(files_dir) if f.endswith("pdf")] merger = PdfFileMerger() for filename in pd

Views: 0, asked 2017-04-22, votes: 1

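5 answers

How to append PDF pages using PyPDF2

python, pdf, pdf-generation, pypdf

Does anyone have experience merging two PDF pages into a single page with the Python library PyPDF2? When I try page1.mergePage(page2), page2 is overlaid on page1. How can I add page2 to the bottom of page1?

Views: 2, asked 2014-04-02, votes: 11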
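
Placing page2 beneath page1, as asked above, comes down to a small piece of geometry: the combined canvas is as wide as the wider page and as tall as both together, and page1 is shifted up by the height of page2. The sketch below covers only that arithmetic, assuming both pages are measured in PDF points with their origin at the bottom-left corner; `stack_vertically` is a made-up name.

```python
def stack_vertically(size1, size2):
    """Return (canvas_size, offset1, offset2) for page 1 placed above page 2.

    Sizes are (width, height) tuples in PDF points, origin at bottom-left.
    """
    w1, h1 = size1
    w2, h2 = size2
    canvas = (max(w1, w2), h1 + h2)
    offset1 = (0, h2)  # page 1 moves up by the height of page 2
    offset2 = (0, 0)   # page 2 stays at the bottom of the canvas
    return canvas, offset1, offset2
```

In PyPDF2 releases before 3.0, these offsets could plausibly be applied with `PageObject.createBlankPage(width=..., height=...)` and `mergeTranslatedPage(page, tx, ty)`; pages whose media box does not start at (0, 0) would need an extra correction.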

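1 answer

How do I set up the path to use PyPDF2 with Python 2.7?

python, python-2.7

How do I set up Python's path to use PyPDF2? Regarding PyPDF2, do I need to download it and add it to Python? I'm a Python beginner and need to learn how to read text from PDF files. (Help me :)

Views: 1, asked 2016-03-23, votes: 1

Answer accepted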

1 answer

PyPDF2 append function can't find the file

python-3.x, pypdf2

I want to merge PDFs using the PyPDF2 module. The following code works fine: import PyPDF2 import sys import os input_path = r'\Users\XXXXX\OneDrive\Desktop\PDF_File_Input' merger = PyPDF2.PdfFileMerger() for file in os.listdir(input_path): if file.endswith(".pdf"): print(file) As soon as I add the append function, I get a traceback error from line 10. FileNotFound

Views: 5, asked 2022-07-31, votes: 0

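2 answers

Working with a PDF from the web directly in Python?

python, pdf, urllib, pypdf

I'm trying to read .pdf files directly from the web with Python instead of saving them all to my computer. All I need is the text from the .pdf files, and I'll be reading a lot of them (~60k), so I'd rather not save them all. I know how to save a .pdf from the internet with urllib and open it with PyPDF2. I want to skip the step of saving to a file. import urllib, PyPDF2 urllib.urlopen('https://bitcoin.org/bitcoin.pdf') wFile = urllib.urlopen('https://bitcoin.org/bitcoin.pdf') lFile

Views: 0, asked 2014-04-18, votes: 2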

3 answers

Only algorithm code 1 and 2 are supported

python, pypdf2

I want to read a PDF file. It is book.pdf, protected with a password (256-bit AES encryption). I know the password. I'm using Jupyter Notebook. I get an error: import PyPDF2 pdfReader = PyPDF2.PdfFileReader(open('book.pdf', 'rb')) pdfReader.decrypt('333') pdfReader.getPage(0) --------------------------------------------------------------------------- N

Views: 3, asked 2018-06-08, votes: 8

1 answer

Why doesn't "import pyPDF2" work after installation?

python, pypdf2

Here is my code, and I get this error: ModuleNotFoundError: No module named 'pyPDF2'. I have already run pip install pyPDF2. If I try again, it says: C:\Users\nicks\Desktop\Coding Projects\Python\Pdf to Audio>pip install PyPDF2 Requirement already satisfied: PyPDF2 in c:\users\nicks\appdata\local\packages\pythonsoftwarefoundation.python.

Views: 20, asked 2022-03-13, votes: 0

2 answers

Number of pages after using PdfFileMerger() in pypdf2

python, pypdf, pypdf2

I'm trying to merge PDF files using PdfFileMerger() from PyPDF2 (see code). from PyPDF2 import PdfFileMerger, PdfFileReader [...] merger = PdfFileMerger() if (some condition): merger.append(PdfFileReader(file(filename1, 'rb'))) merger.append(PdfFileReader(file(filename2, 'rb'))) if (test for non-zero f

Views: 5, asked 2016-08-31, votes: 1

Answer accepted

1 answer

PyPDF2 gives me an invalid argument error

pypdf2

I'm trying to parse the text in a PDF file. While following a how-to tutorial for PyPDF2, I got the following error. I searched but couldn't find anything. Any help would be appreciated. Traceback (most recent call last): File "D:/text_recognizer/main.py", line 4, in <module> inputStream = PyPDF2.PdfFileReader(input) File "D:\KimKanna's Class\python27\lib\site-packages\PyPDF2\p

Views: 0, asked 2017-09-12, votes: 2

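1 answer

EOF marker not found - how to fix it in PyPDF and PyPDF2?

python, pdf, pypdf

I'm trying to combine several PDF files into one PDF file with Python. I've tried both PyPDF and PyPDF2; on some files they both throw the same error: PdfReadError: EOF marker not found Here is my code (page_files is a list of the PDF file paths to combine): # use pypdf to combine pdf pages output = PdfFileWriter() for pf in page_files: filestream = file(pf, "rb") pdf = PdfFileReader(filestream)

Views: 6, asked 2013-04-23, votes: 12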

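1 answer

How to get the PDF file metadata "page size" with Python?

python, scanning, pypdf2, page-size

I tried using the PyPDF2 module in Python 3 but couldn't display the "page size" property. I want to know what the paper size was before it was scanned to a PDF file. Something like this: import PyPDF2 pdf=PdfFileReader("sample.pdf","rb") print(pdf.getNumPages()) But I'm looking for a different Python function than, for example, getNumPages(). The following commands output some metadata, but not the page size: pdf_info=pdf.getDocumentInfo() print(pdf_info)

Views: 7, asked 2017-09-15, votes: 3

Answer accepted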

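2 answers

PDFKit & PyPDF2 - cannot read malformed PDF file

python, pdf, pdfkit, pypdf

I'm having a problem with PDF files generated by pdfkit.from_file(filename, 'w+'), where filename is an HTML file. After the PDF files are generated from the HTML files, they are merged with the following code: merger = PdfFileMerger() for pdf in input_files: merger.append(pdf) merger.write(output_stream) merger.close() This is where I run into the error: File "/home/finrpt/finrpt/finrpt_py/htm_gen.py", line 193, in pdf

Views: 73, asked 2020-08-28, votes: 0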

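1 answer

PyPDF2 ignores the content and only gets the watermark

python, pypdf2

I have thousands of PDF files. I'm trying to convert them to plain text with PyPDF2 (code below). But PyPDF2 apparently only "sees" the watermark, not the content itself. What can I do here? import os import PyPDF2 path_to_pdfs = '/path/to/pdf/files/' for filename in os.listdir(path_to_pdfs): if '.pdf' in filename.lower(): with open(path_to_pdfs + filename, mode = 'rb')

Views: 0, asked 2018-06-14, votes: 1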

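1 answer

pyPDF2 "Stream has ended unexpectedly"

python, visual-studio-code, error-handling, pypdf2

This is my first Python code. The writer throws an error. It seems to happen randomly while looping through the PDFs. try: except: pass won't work, because it would just skip the problem file and not generate an output for it. strict=False doesn't seem to apply to the writer. Error: PdfReadWarning: Multiple definitions in dictionary at byte 0x6eb54 for key /PageMode [generic.py:587] PdfReadWarning: Multiple definitions in dictionary at byte 0x75740 for key

Views: 11, asked 2022-03-31, votes: 0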

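1 answer

Python - PyPDF2 merge doesn't keep the PDF size

python, pypdf2

I have a size problem when I merge PDFs using PyPDF2. I have the following code to merge PDF files: merger = PyPDF2.PdfFileMerger() for pdf in fileSorted: merger.append(pdf[1]) os.remove(pdf[1]) merger.write(tmpPath + '/result.pdf') The problem is that the size of the resulting PDF is much larger than the original. How can I specify the PDF size? The input files are 210*297 mm (A4) and the output is 900x1273. Many thanks

Views: 82, asked 2019-12-16, votes: 1

Answer accepted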

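1 answer

Finding whether text is highlighted or not

python, pdf-generation, pypdf2

I'm currently trying to read PDF files in Python using PyPDF2. I want to know whether text in the PDF is highlighted. Context: we use different colors to highlight text in the PDF files. Is there any way to find out which text is highlighted, in Python, using any library? If so, please point me to the right source. I've looked in many places for this problem. From what I found, PyPDF2 can't solve this problem?

Views: 4, asked 2016-08-09, votes: 3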

1 answer

FileNotFound error when reading files with PyPDF2 and os.listdir()

python, pypdf2

I have the following script to merge several PDFs together: import PyPDF2 import sys import os inputs = sys.argv[1] list = os.listdir(inputs) merger = PyPDF2.PdfFileMerger() for pdf in list: merger.append(pdf) merger.write('merged.pdf') print('All done') The folder containing these files is in a different directory than the script being run, so I pass in the full path. From the terminal python3 pdf-merger

Views: 1, asked 2020-06-03, votes: 0

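1 answer

PyPDF2 IndexError: index out of range

python, pypdf2

First of all, I'm very new to Python and PyPDF. I'm trying to gather all the fields in a PDF into one data set. Eventually I want to gather thousands of PDFs, all with the same baseline structure (forms), and combine them. I was able to get this code to work fine on PDFs without a digital certificate/signature. But when I run the code on a PDF with a digital certificate/signature, I get an error. I don't really need the document's digital signature/certificate fields, so I think the easiest approach would be to skip those PDF fields. However, I don't know how to do that, since the PyPDF2 package looks at every field. Code: import os import PyPDF2 as pypdf import pandas as pd directory

Views: 9, asked 2022-08-15, votes: 0

Answer accepted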

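1 answer

PyPDF2 PdfReadError: Could not read Boolean object

python, pypdf2

I get the following error when reading some PDF files with PyPDF2. Because the files are confidential I can't share them, but I can try to provide information that helps solve the problem. Stack trace - inputpdf = PdfFileReader(open(pdfpath, "rb"), strict=False) File "/home/tata/.virtualenvs/obu/local/lib/python2.7/site-packages/PyPDF2/pdf.py", line 1084, in __init__ self.read(stream) File "/

Views: 7, asked 2017-11-14, votes: 0

Answer accepted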

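1 answer

How to convert PDF to XML/JSON using Python code

python, pdf

Can anyone help me with converting a PDF file to an XML file using Python code? My PDF contains: unstructured data, images, mathematical equations, chemical equations, tabular data, logos, labels, etc. I tried PDFMiner, but my PDF data wasn't converted to .xml/JSON format. Are there other libraries besides PDFMiner? PyPDF2, Tabula-py, PDFQuery, comelot, PyMuPDF, pdf to dox, pandas - none of these other libraries/utilities fit my needs. Please suggest other options. Thanks.

Views: 12, asked 2022-06-06, votes: -1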

1 answer

pyPDF2 TypeError when trying to run the example from the library

python, python-3.x, pypdf

I downloaded the pyPDF2 library, and when I try to run its "Example 1:" script I get: PyPDF2 python versions (2.5 - 3.3) compatibility branch Traceback (most recent call last): File "1.py", line 6, in <module> input1 = PdfFileReader(open("document1.pdf", "rb")) File "C:\Python33\lib\site-packages\PyPDF2

Views: 3, asked 2013-10-04, votes: 0

Answer accepted

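1 answer

How to convert a unicode-encoded PDF file to text using Python 3 and PyPDF2

python, pdf, text, data-conversion

I'm trying to convert a PDF to a text file using Python 3 and the PyPDF2 library. But the PDF is written mostly in Korean, so it seems it should be encoded as 'utf-8' before the PDF text is processed. However, whether I read the PDF file with the "open" function or with the "codecs" function, I can't seem to extract the 'utf-8' encoded text correctly. Do you have any idea how to extract the text from the PDF file using Python 3 and other related Python libraries? Thanks in advance! (A sample file is available for download.) import PyPDF2 import codecs pdf_file = open('6060273.pdf','rb'

Views: 0, asked 2018-12-17, votes: 1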

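1 answer

Merging PDF files with Python PyPDF2

python, pdf, pypdf2

I watched a video to learn how to merge PDF files into one PDF file. I tried to make some modifications to the code so that it works on a folder containing the PDF files. The main folder (Spyder) contains Demo.py. Here is the code: import os from PyPDF2 import PdfFileMerger source_dir = os.getcwd() + './PDF Files' merger = PdfFileMerger() for item in os.listdir(source_dir): if item.endswith('pdf'): merger.append

Views: 16, asked 2020-10-25, votes: 1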

1 answer

PdfReadWarning: impossible to decode XFormObject /SPIPa1

python, pypdf2

I use PyPDF2 to read multiple PDF files. My script is as follows: from PyPDF2 import PdfFileReader flist = os.listdir(pdfFolder) for f in flist: pdfFileObj = open(os.path.join(pdfFolder, f), 'rb') pdfReader = PyPDF2.PdfFileReader(pdfFileObj, strict=False) for i in range(0,pdfReader.numPages): pageObj = pdfReader

Views: 7, asked 2022-06-09, votes: 1

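1 answer

Crop PDF pages based on content

python, pdf

Using Python, is it possible to crop a PDF page to its content, as shown in the image below, where the task was done in Inkscape? The bounding area of the content should be found automatically. With PyPDF2 I can crop the page, but it requires finding the coordinates manually, which is tedious for a large number of files. In Inkscape the coordinates are found automatically. The code I'm using is shown below. # Python 3.7.0 import PyPDF2 # version 1.26.0 with open('document-1.pdf','rb') as fin: pdf = PyPDF2.PdfFileReader(fin) pa

Views: 8, asked 2018-11-28, votes: 1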

2 answers

PyPDF2 - merging pages from two different PDF files doesn't work

python, pdf, pdf-generation, pypdf, pypdf2

I'm trying to merge pages from two PDF files into one PDF file. So I tried the code below using PyPDF2: from PyPDF2 import PdfFileReader,PdfFileWriter import sys f = sys.argv[1] k = sys.argv[2] print f,k file1 = PdfFileReader(file(f, "rb")) file2 = PdfFileReader(file(k, "rb")) output = PdfFileWriter() page = file1.getPage(0) page.mergePa

Views: 5, asked 2016-12-17, votes: 2

Answer accepted