from pdf2docx import Converter

pdf_file = r'C:\Users\ABCD\Desktop\XYZ/Document1.pdf'  # source file
docx_file = r'C:\Users\ABCD\Desktop\XYZ/sample.docx'  # destination file

# convert pdf to docx
cv = Converter(pdf_file)
cv.convert(docx_file, start=0, end=None)
cv.close()

# Output
# Parsing Page 53: 53/53...
# Creating Page 53: 53/53...
# Terminated in 6.258919400000195s.

Option 2

from pdf2docx import parse

pdf_file = r'C:\Users\ABCD\Desktop\XYZ/Document2.pdf'  # source file
docx_file = r'C:\Users\ABCD\Desktop\XYZ/sample_2.docx'  # destination file

# convert pdf to docx
parse(pdf_file, docx_file, start=0, end=None)

# Output
# Parsing Page 53: 53/53...
# Creating Page 53: 53/53...
# Terminated in 5.883666100000482s.

Votes: 2

Stack Overflow user

Posted on 2019-06-12 18:23:54

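You can try pdftohtml and then use Pandoc to convert the resulting HTML to docx.

Bear in mind that PDF is not really a document format but a page layout format, so the conversion can be problematic.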
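The two-step route above could be scripted roughly as follows. This is only a sketch: it assumes the `pdftohtml` (Poppler) and `pandoc` executables are on the PATH, and that `pdftohtml -noframes` writes a single HTML file at the path given. The helper names `build_commands` and `pdf_to_docx` are made up for illustration.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_path, docx_path):
    """Return the two command lines: PDF -> HTML, then HTML -> DOCX."""
    pdf = Path(pdf_path)
    # Intermediate HTML file placed next to the PDF (an arbitrary choice).
    html = pdf.with_suffix(".html")
    return [
        # -noframes asks pdftohtml for one HTML file instead of a frameset.
        ["pdftohtml", "-noframes", str(pdf), str(html)],
        # pandoc infers input and output formats from the file extensions.
        ["pandoc", str(html), "-o", str(docx_path)],
    ]


def pdf_to_docx(pdf_path, docx_path):
    """Run both steps in order; raises CalledProcessError if either tool fails."""
    for cmd in build_commands(pdf_path, docx_path):
        subprocess.run(cmd, check=True)
```

Calling `pdf_to_docx("report.pdf", "report.docx")` would run both tools in sequence. Since PDF only records page layout, expect headings, tables, and multi-column text to need manual cleanup afterwards.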

Votes: 0

Stack Overflow user

Posted on 2019-06-13 18:39:27

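I'm the CTO of Zamzar, and we have an API at https://developers.zamzar.com/ that can do this.

We have a test account that you can use for free to try out the service, and our docs include Python code samples that make it very simple to convert a PDF file to DOCX: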

import requests
from requests.auth import HTTPBasicAuth

api_key = 'YOUR_API_KEY'
endpoint = "https://sandbox.zamzar.com/v1/jobs"
source_file = "/tmp/my.pdf"
target_format = "docx"

data_content = {'target_format': target_format}

# Open the source file in binary mode; the with block closes it after the upload
with open(source_file, 'rb') as f:
    res = requests.post(endpoint, data=data_content, files={'source_file': f}, auth=HTTPBasicAuth(api_key, ''))
print(res.json())

You can then poll the job to see when it has finished before downloading your converted file.
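A polling loop for that step might look like the sketch below. It rests on assumptions not stated in the answer: that a job can be fetched with `GET /v1/jobs/<id>` on the same sandbox host, and that the returned JSON has a `status` field whose terminal values are `"successful"` and `"failed"`. Check the Zamzar docs before relying on any of this. The names `fetch_job` and `wait_for_job` are hypothetical, and the sketch uses only the standard library rather than `requests`.

```python
import base64
import json
import time
import urllib.request

API_BASE = "https://sandbox.zamzar.com/v1"  # sandbox host used in the answer


def fetch_job(job_id, api_key):
    """GET the job record using HTTP Basic auth (API key as user, empty password)."""
    token = base64.b64encode(f"{api_key}:".encode()).decode()
    req = urllib.request.Request(
        f"{API_BASE}/jobs/{job_id}",  # assumed endpoint path
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_job(job_id, api_key, fetch=fetch_job, interval=2.0, max_attempts=30):
    """Poll until the job reports a terminal status, then return its JSON."""
    for _ in range(max_attempts):
        job = fetch(job_id, api_key)
        # Assumed status values: "successful" and "failed" end the polling.
        if job.get("status") in ("successful", "failed"):
            return job
        time.sleep(interval)
    raise TimeoutError(f"job {job_id} did not finish after {max_attempts} polls")
```

The `fetch` parameter exists so the loop can be exercised with a stub instead of the network. Once the job has finished, the converted file would be downloaded with a separate request described in the API docs.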

Votes: 0

Original page content provided by Stack Overflow.

Original link: https://stackoverflow.com/questions/56559796
