blocks|key|2926325|text|可以使用pdf2image库。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|2926326|你可以简单地用，|2926327|pip+install+pdf2image|code-block|syntax|javascript|2926328|安装完毕后，您可以使用下面的代码获取图像。|2926329|from+pdf2image+import+convert_from_path
pages+=+convert_from_path('pdf_file',+500)|2926330|以jpeg格式保存页面|2926331|for+page+in+pages:
++++page.save('out.jpg',+'JPEG')|2926332|编辑:+Github+pdf2image还提到它使用pdftoppm，并且需要其他安装：|offset|length|style|CODE|2926333|pdftoppm是一款具有实际魔力的软件。它作为一个更大的包(称为波普尔+)的一部分分发。Windows用户将不得不安装Windows应用程序。Mac用户将不得不安装苹果机波普尔。Linux用户将预先安装pdftoppm和发行版(在Ubuntu和Archlinux上测试)，如果没有，请运行sudo+apt+install+poppler-utils。|blockquote|2926334|您可以使用anaconda安装Windows下的最新版本，方法是：|2926335|conda+install+-c+conda-forge+poppler|2926336|注意：http://blog.alivate.com.au/poppler-windows/提供的Windows版本可达0.67，但请注意，0.68为2018年8月发布，因此您将无法获得最新的特性或bug修复。|2926337|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/Belval/pdf2image|1|https://poppler.freedesktop.org/|2|https://sourceforge.net/projects/poppler-win32/|3|http://macappstore.org/poppler/|4|http://blog.alivate.com.au/poppler-windows/|5|https://poppler.freedesktop.org/releases.html^0|0|0|0|0|0|0|0|Q|8|B|9|0|0|41|U|X|3|1|1O|B|2|2B|6|3|0|0|0|3|17|4|23|9|5|0^^$0|@$1|2|3|4|5|6|7|1N|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|1O|8|@]|9|@]|A|$]]|$1|D|3|E|5|F|7|1P|8|@]|9|@]|A|$G|H]]|$1|I|3|J|5|6|7|1Q|8|@]|9|@]|A|$]]|$1|K|3|L|5|F|7|1R|8|@]|9|@]|A|$G|H]]|$1|M|3|N|5|6|7|1S|8|@]|9|@]|A|$]]|$1|O|3|P|5|F|7|1T|8|@]|9|@]|A|$G|H]]|$1|Q|3|R|5|6|7|1U|8|@$S|1V|T|1W|U|V]]|9|@$S|1X|T|1Y|1|1Z]]|A|$]]|$1|W|3|X|5|Y|7|20|8|@$S|21|T|22|U|V]]|9|@$S|23|T|24|1|25]|$S|26|T|27|1|28]|$S|29|T|2A|1|2B]]|A|$]]|$1|Z|3|10|5|6|7|2C|8|@]|9|@]|A|$]]|$1|11|3|12|5|F|7|2D|8|@]|9|@]|A|$G|H]]|$1|13|3|14|5|6|7|2E|8|@]|9|@$S|2F|T|2G|1|2H]|$S|2I|T|2J|1|2K]]|A|$]]|$1|15|3|-4|5|6|7|2L|8|@]|9|@]|A|$]]]|16|$17|$5|18|19|1A|A|$1B|1C]]|1D|$5|18|19|1A|A|$1B|1E]]|1F|$5|18|19|1A|A|$1B|1G]]|1H|$5|18|19|1A|A|$1B|1I]]|1J|$5|18|19|1A|A|$1B|1K]]|1L|$5|18|19|1A|A|$1B|1M]]]]

The pdf2image library can be used.

You can install it simply using, 

<pre><code>pip install pdf2image
</code></pre>

Once installed you can use following code to get images.

<pre><code>from pdf2image import convert_from_path
pages = convert_from_path('pdf_file', 500)
</code></pre>

Saving pages in jpeg format

<pre><code>for page in pages:
 page.save('out.jpg', 'JPEG')
</code></pre>

<hr>

Edit: the Github repo <a href="https://github.com/Belval/pdf2image" rel="noreferrer">pdf2image</a> also mentions that it uses <code>pdftoppm</code> and that it requires other installations:

<blockquote>
 pdftoppm is the piece of software that does the actual magic. It is distributed as part of a greater package called <a href="https://poppler.freedesktop.org/" rel="noreferrer">poppler</a>.
 Windows users will have to install <a href="https://sourceforge.net/projects/poppler-win32/" rel="noreferrer">poppler for Windows</a>.
 Mac users will have to install <a href="http://macappstore.org/poppler/" rel="noreferrer">poppler for Mac</a>.
 Linux users will have pdftoppm pre-installed with the distro (Tested on Ubuntu and Archlinux) if it's not, run <code>sudo apt install poppler-utils</code>.
</blockquote>

You can install the latest version under Windows using anaconda by doing:

<pre><code>conda install -c conda-forge poppler
</code></pre>

note: Windows versions upto 0.67 are available at <a href="http://blog.alivate.com.au/poppler-windows/" rel="noreferrer">http://blog.alivate.com.au/poppler-windows/</a> but note that 0.68 was <a href="https://poppler.freedesktop.org/releases.html" rel="noreferrer">released in Aug 2018</a> so you'll not be getting the latest features or bug fixes.

blocks|key|2926391|text|实际上，Python库pdf2image+(在另一个答案中使用)并不对subprocess.Popen执行不仅仅是发射+pdttoppm操作，因此下面是一个直接执行该操作的简短版本：|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|2926392|PDFTOPPMPATH+=+r"D:\Documents\software\____PORTABLE\poppler-0.51\bin\pdftoppm.exe"
PDFFILE+=+"SKM_28718052212190.pdf"

import+subprocess
subprocess.Popen('"%25s"+-png+"%25s"+out'+%25+(PDFTOPPMPATH,+PDFFILE))|code-block|syntax|javascript|2926393|下面是pdftoppm的(包含在一个名为poppler的包中)：http://blog.alivate.com.au/poppler-windows/。|2926394|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/Belval/pdf2image/blob/master/pdf2image/pdf2image.py#L203|1|http://blog.alivate.com.au/poppler-windows/^0|B|9|Z|G|1O|8|1H|6|0|0|0|3|8|W|17|1|0^^$0|@$1|2|3|4|5|6|7|W|8|@$9|X|A|Y|B|C]|$9|Z|A|10|B|C]|$9|11|A|12|B|C]]|D|@$9|13|A|14|1|15]]|E|$]]|$1|F|3|G|5|H|7|16|8|@]|D|@]|E|$I|J]]|$1|K|3|L|5|6|7|17|8|@$9|18|A|19|B|C]]|D|@$9|1A|A|1B|1|1C]]|E|$]]|$1|M|3|-4|5|6|7|1D|8|@]|D|@]|E|$]]]|N|$O|$5|P|Q|R|E|$S|T]]|U|$5|P|Q|R|E|$S|V]]]]

The Python library <code>pdf2image</code> (used in the other answer) in fact doesn't do <a href="https://github.com/Belval/pdf2image/blob/master/pdf2image/pdf2image.py#L203" rel="noreferrer">much more than just launching</a> <code>pdttoppm</code> with <code>subprocess.Popen</code>, so here is a short version doing it directly:
<pre><code>PDFTOPPMPATH = r&quot;D:\Documents\software\____PORTABLE\poppler-0.51\bin\pdftoppm.exe&quot;
PDFFILE = &quot;SKM_28718052212190.pdf&quot;

import subprocess
subprocess.Popen('&quot;%s&quot; -png &quot;%s&quot; out' % (PDFTOPPMPATH, PDFFILE))
</code></pre>
Here is the Windows installation link for <code>pdftoppm</code> (contained in a package named poppler): <a href="http://blog.alivate.com.au/poppler-windows/" rel="noreferrer">http://blog.alivate.com.au/poppler-windows/</a>.

blocks|key|2926405|text|它们是一个名为pdftojpg的实用工具，可用于将pdf转换为img。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|2926406|您可以在这里找到代码，https://github.com/pankajr141/pdf2jpg|offset|length|2926407|from+pdf2jpg+import+pdf2jpg
inputpath+=+r"D:\inputdir\pdf1.pdf"
outputpath+=+r"D:\outputdir"
#+To+convert+single+page
result+=+pdf2jpg.convert_pdf2jpg(inputpath,+outputpath,+pages="1")
print(result)

#+To+convert+multiple+pages
result+=+pdf2jpg.convert_pdf2jpg(inputpath,+outputpath,+pages="1,0,3")
print(result)

#+to+convert+all+pages
result+=+pdf2jpg.convert_pdf2jpg(inputpath,+outputpath,+pages="ALL")
print(result)|code-block|syntax|javascript|2926408|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/pankajr141/pdf2jpg^0|0|B|11|0|0|0^^$0|@$1|2|3|4|5|6|7|S|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|T|8|@]|9|@$D|U|E|V|1|W]]|A|$]]|$1|F|3|G|5|H|7|X|8|@]|9|@]|A|$I|J]]|$1|K|3|-4|5|6|7|Y|8|@]|9|@]|A|$]]]|L|$M|$5|N|O|P|A|$Q|R]]]]

Their is a utility called pdftojpg which can be used to convert the pdf to img

You can found the code here <a href="https://github.com/pankajr141/pdf2jpg" rel="noreferrer">https://github.com/pankajr141/pdf2jpg</a>

<pre><code>from pdf2jpg import pdf2jpg
inputpath = r"D:\inputdir\pdf1.pdf"
outputpath = r"D:\outputdir"
# To convert single page
result = pdf2jpg.convert_pdf2jpg(inputpath, outputpath, pages="1")
print(result)

# To convert multiple pages
result = pdf2jpg.convert_pdf2jpg(inputpath, outputpath, pages="1,0,3")
print(result)

# to convert all pages
result = pdf2jpg.convert_pdf2jpg(inputpath, outputpath, pages="ALL")
print(result)
</code></pre>

blocks|key|2888210|text|@gaurwraith，安装poppler并使用pdftoppm.exe，如下所示：|type|unstyled|depth|inlineStyleRanges|offset|length|style|BOLD|entityRanges|data|2888211|从http://blog.alivate.com.au/poppler-windows/下载带有Poppler最新二进制文件/dll的zip文件，然后解压缩到程序文件文件夹中的一个新文件夹。例如："C:\Program+(x86)\Poppler“。|ordered-list-item|2888212|在系统路径环境变量中添加"C:\Program+(X86)\Poppler-0.68.0\bin“。|2888213|从cmd行安装pdf2image模块->+"pip+pdf2image“。|2888214|或者，按照用户Basj的解释，使用Python的子流程模块直接从代码中执行pdftoppm.exe。|2888215|@vishvAs+vAsuki，这段代码应该通过子处理模块为给定文件夹中一个或多个pdfs的所有页面生成您想要的jpgs：|2888216|import+os,+subprocess

pdf_dir+=+r"C:\yourPDFfolder"
os.chdir(pdf_dir)

pdftoppm_path+=+r"C:\Program+Files+(x86)\Poppler\poppler-0.68.0\bin\pdftoppm.exe"

for+pdf_file+in+os.listdir(pdf_dir):

++++if+pdf_file.endswith(".pdf"):

++++++++subprocess.Popen('"%25s"+-jpeg+%25s+out'+%25+(pdftoppm_path,+pdf_file))|code-block|syntax|javascript|2888217|或者使用pdf2image模块：|2888218|import+os
from+pdf2image+import+convert_from_path

pdf_dir+=+r"C:\yourPDFfolder"
os.chdir(pdf_dir)

++++for+pdf_file+in+os.listdir(pdf_dir):

++++++++if+pdf_file.endswith(".pdf"):

++++++++++++pages+=+convert_from_path(pdf_file,+300)
++++++++++++pdf_file+=+pdf_file[:-4]

++++++++++++for+page+in+pages:

+++++++++++++++page.save("%25s-page%25d.jpg"+%25+(pdf_file,pages.index(page)),+"JPEG")|2888219|entityMap|0|LINK|mutability|MUTABLE|url|http://blog.alivate.com.au/poppler-windows/^0|C|U|0|0|3H|1|17|0|0|0|1E|0|0|11|0|0|1E|0|0|1P|0|0|0|G|0|0^^$0|@$1|2|3|4|5|6|7|17|8|@$9|18|A|19|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|1A|8|@$9|1B|A|1C|B|C]]|D|@$9|1D|A|1E|1|1F]]|E|$]]|$1|I|3|J|5|H|7|1G|8|@$9|1H|A|1I|B|C]]|D|@]|E|$]]|$1|K|3|L|5|H|7|1J|8|@$9|1K|A|1L|B|C]]|D|@]|E|$]]|$1|M|3|N|5|H|7|1M|8|@$9|1N|A|1O|B|C]]|D|@]|E|$]]|$1|O|3|P|5|6|7|1P|8|@$9|1Q|A|1R|B|C]]|D|@]|E|$]]|$1|Q|3|R|5|S|7|1S|8|@]|D|@]|E|$T|U]]|$1|V|3|W|5|6|7|1T|8|@$9|1U|A|1V|B|C]]|D|@]|E|$]]|$1|X|3|Y|5|S|7|1W|8|@]|D|@]|E|$T|U]]|$1|Z|3|-4|5|6|7|1X|8|@]|D|@]|E|$]]]|10|$11|$5|12|13|14|E|$15|16]]]]

@gaurwraith, install poppler for Windows and use pdftoppm.exe as follows:

<ol>
<li>Download zip file with Poppler's latest binaries/dlls from <a href="http://blog.alivate.com.au/poppler-windows/" rel="noreferrer">http://blog.alivate.com.au/poppler-windows/</a> and unzip to a new folder in your program files folder. For example: "C:\Program Files (x86)\Poppler".</li>
<li>Add "C:\Program Files (x86)\Poppler\poppler-0.68.0\bin" to your SYSTEM PATH environment variable.</li>
<li>From cmd line install pdf2image module -> "pip install pdf2image".</li>
<li>Or alternatively, directly execute pdftoppm.exe from your code using Python's subprocess module as explained by user Basj.</li>
</ol>

@vishvAs vAsuki, this code should generate the jpgs you want through the subprocess module for all pages of one or more pdfs in a given folder:

<pre><code>import os, subprocess

pdf_dir = r"C:\yourPDFfolder"
os.chdir(pdf_dir)

pdftoppm_path = r"C:\Program Files (x86)\Poppler\poppler-0.68.0\bin\pdftoppm.exe"

for pdf_file in os.listdir(pdf_dir):

 if pdf_file.endswith(".pdf"):

 subprocess.Popen('"%s" -jpeg %s out' % (pdftoppm_path, pdf_file))
</code></pre>

Or using the pdf2image module:

<pre><code>import os
from pdf2image import convert_from_path

pdf_dir = r"C:\yourPDFfolder"
os.chdir(pdf_dir)

 for pdf_file in os.listdir(pdf_dir):

 if pdf_file.endswith(".pdf"):

 pages = convert_from_path(pdf_file, 300)
 pdf_file = pdf_file[:-4]

 for page in pages:

 page.save("%s-page%d.jpg" % (pdf_file,pages.index(page)), "JPEG")
</code></pre>

blocks|key|1596552|text|没有必要在您的操作系统上安装Poppler。这将起作用：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1596553|pip安装棒|1596554|from+wand.image+import+Image

f+=+"somefile.pdf"
with(Image(filename=f,+resolution=120))+as+source:+
++++for+i,+image+in+enumerate(source.sequence):
++++++++newfilename+=+f[:-4]+%2B+str(i+%2B+1)+%2B+'.jpeg'
++++++++Image(image).save(filename=newfilename)|code-block|syntax|javascript|1596555|entityMap^0|0|0|0^^$0|@$1|2|3|4|5|6|7|K|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|L|8|@]|9|@]|A|$]]|$1|D|3|E|5|F|7|M|8|@]|9|@]|A|$G|H]]|$1|I|3|-4|5|6|7|N|8|@]|9|@]|A|$]]]|J|$]]

There is no need to install Poppler on your OS. This will work:

pip install Wand

<pre><code>from wand.image import Image

f = "somefile.pdf"
with(Image(filename=f, resolution=120)) as source: 
 for i, image in enumerate(source.sequence):
 newfilename = f[:-4] + str(i + 1) + '.jpeg'
 Image(image).save(filename=newfilename)
</code></pre>

blocks|key|2888365|text|我找到了这个简单的解决方案，PyMuPDF，输出到png文件。注意，库导入为"fitz"，这是它使用的呈现引擎的历史名称。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|2888366|import+fitz

pdffile+=+"infile.pdf"
doc+=+fitz.open(pdffile)
page+=+doc.load_page(0)++#+number+of+page
pix+=+page.get_pixmap()
output+=+"outfile.png"
pix.save(output)|code-block|syntax|javascript|2888367|注意:库从使用"camelCase“改为"snake_cased”。如果您遇到一个函数不存在的错误，请在弃用名称下面查看。上述示例中的功能已相应更新。|2888368|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/pymupdf/PyMuPDF|1|https://pymupdf.readthedocs.io/en/latest/znames.html^0|E|7|0|0|0|1G|4|1|0^^$0|@$1|2|3|4|5|6|7|U|8|@]|9|@$A|V|B|W|1|X]]|C|$]]|$1|D|3|E|5|F|7|Y|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|Z|8|@]|9|@$A|10|B|11|1|12]]|C|$]]|$1|K|3|-4|5|6|7|13|8|@]|9|@]|C|$]]]|L|$M|$5|N|O|P|C|$Q|R]]|S|$5|N|O|P|C|$Q|T]]]]

I found this simple solution, <a href="https://github.com/pymupdf/PyMuPDF" rel="nofollow noreferrer">PyMuPDF</a>, output to png file. Note the library is imported as &quot;fitz&quot;, a historical name for the rendering engine it uses.
<pre><code>import fitz

pdffile = &quot;infile.pdf&quot;
doc = fitz.open(pdffile)
page = doc.load_page(0) # number of page
pix = page.get_pixmap()
output = &quot;outfile.png&quot;
pix.save(output)
</code></pre>
Note: The library changed from using &quot;camelCase&quot; to &quot;snake_cased&quot;. If you run into an error that a function does not exist, have a look under <a href="https://pymupdf.readthedocs.io/en/latest/znames.html" rel="nofollow noreferrer">deprecated names</a>. The functions in the example above have been updated accordingly.

blocks|key|2888370|text|from+pdf2image+import+convert_from_path
import+glob

pdf_dir+=+glob.glob(r'G:\personal\pdf\*')++#your+pdf+folder+path
img_dir+=+"G:\\personal\\img\\"+++++++++++#your+dest+img+path

for+pdf_+in+pdf_dir:
++++pages+=+convert_from_path(pdf_,+500)
++++for+page+in+pages:
++++++++page.save(img_dir%2Bpdf_.split("\\")[-1][:-3]%2B"jpg",+'JPEG')|type|code-block|depth|inlineStyleRanges|entityRanges|data|syntax|javascript|2888371|unstyled|entityMap^0|0^^$0|@$1|2|3|4|5|6|7|G|8|@]|9|@]|A|$B|C]]|$1|D|3|-4|5|E|7|H|8|@]|9|@]|A|$]]]|F|$]]

<pre><code>from pdf2image import convert_from_path
import glob

pdf_dir = glob.glob(r'G:\personal\pdf\*') #your pdf folder path
img_dir = "G:\\personal\\img\\" #your dest img path

for pdf_ in pdf_dir:
 pages = convert_from_path(pdf_, 500)
 for page in pages:
 page.save(img_dir+pdf_.split("\\")[-1][:-3]+"jpg", 'JPEG')
</code></pre>

blocks|key|1596667|text|我使用了(可能)更简单的pdf2image选项：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1596668|cd+$dir
for+f+in+*.pdf
do
++if+[+-f+"${f}"+];+then
++++n=$(echo+"$f"+%7C+cut+-f1+-d'.')
++++pdftoppm+-scale-to+1440+-png+$f+$conv/$n
++++rm+$f
++++mv++$conv/*.png+$dir
++fi
done|code-block|syntax|javascript|1596669|这是循环中bash脚本的一小部分，用于使用狭窄的铸造设备。每5秒检查一次增加的pdf文件(全部)并处理它们。这是一个演示设备，最后转换将在远程服务器上进行。现在转换为.PNG，但是.JPG也是可能的。|1596670|这种转换，加上A4格式的转换，显示一个视频、两个平滑的滚动文本和一个徽标(三个版本的转换)将Pi3设置为所有4x100%25的cpu负载;-)|1596671|entityMap^0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|M|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|N|8|@]|9|@]|A|$E|F]]|$1|G|3|H|5|6|7|O|8|@]|9|@]|A|$]]|$1|I|3|J|5|6|7|P|8|@]|9|@]|A|$]]|$1|K|3|-4|5|6|7|Q|8|@]|9|@]|A|$]]]|L|$]]

I use a (maybe) much simpler option of pdf2image:

<pre><code>cd $dir
for f in *.pdf
do
 if [ -f "${f}" ]; then
 n=$(echo "$f" | cut -f1 -d'.')
 pdftoppm -scale-to 1440 -png $f $conv/$n
 rm $f
 mv $conv/*.png $dir
 fi
done
</code></pre>

This is a small part of a bash script in a loop for the use of a narrow casting device.
Checks every 5 seconds on added pdf files (all) and processes them.
This is for a demo device, at the end converting will be done at a remote server. Converting to .PNG now, but .JPG is possible too.

This converting, together with transitions on A4 format, displaying a video, two smooth scrolling texts and a logo (with transition in three versions) sets the Pi3 to allmost 4x 100% cpu-load ;-)

blocks|key|1596701|text|对于基于Linux的系统，GhostScript的执行速度要快得多。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1596702|以下是pdf到图像转换的代码。|1596703|def+get_image_page(pdf_file,+out_file,+page_num):
++++page+=+str(page_num+%2B+1)
++++command+=+["gs",+"-q",+"-dNOPAUSE",+"-dBATCH",+"-sDEVICE=png16m",+"-r"+%2B+str(RESOLUTION),+"-dPDFFitPage",
+++++++++++++++"-sOutputFile="+%2B+out_file,+"-dFirstPage="+%2B+page,+"-dLastPage="+%2B+page,
+++++++++++++++pdf_file]
++++f_null+=+open(os.devnull,+'w')
++++subprocess.call(command,+stdout=f_null,+stderr=subprocess.STDOUT)|code-block|syntax|javascript|1596704|可以使用GhostScript在macOS上安装brew+install+ghostscript|offset|length|style|CODE|1596705|其他平台的安装信息可以找到这里。如果它尚未安装在您的系统上。|1596706|entityMap|0|LINK|mutability|MUTABLE|url|https://www.ghostscript.com/doc/9.23/Install.htm^0|0|0|0|O|O|0|D|2|0|0^^$0|@$1|2|3|4|5|6|7|Y|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|Z|8|@]|9|@]|A|$]]|$1|D|3|E|5|F|7|10|8|@]|9|@]|A|$G|H]]|$1|I|3|J|5|6|7|11|8|@$K|12|L|13|M|N]]|9|@]|A|$]]|$1|O|3|P|5|6|7|14|8|@]|9|@$K|15|L|16|1|17]]|A|$]]|$1|Q|3|-4|5|6|7|18|8|@]|9|@]|A|$]]]|R|$S|$5|T|U|V|A|$W|X]]]]

GhostScript performs much faster than Poppler for a Linux based system. 

Following is the code for pdf to image conversion.

<pre><code>def get_image_page(pdf_file, out_file, page_num):
 page = str(page_num + 1)
 command = ["gs", "-q", "-dNOPAUSE", "-dBATCH", "-sDEVICE=png16m", "-r" + str(RESOLUTION), "-dPDFFitPage",
 "-sOutputFile=" + out_file, "-dFirstPage=" + page, "-dLastPage=" + page,
 pdf_file]
 f_null = open(os.devnull, 'w')
 subprocess.call(command, stdout=f_null, stderr=subprocess.STDOUT)
</code></pre>

GhostScript can be installed on macOS using <code>brew install ghostscript</code>

Installation information for other platforms can be found <a href="https://www.ghostscript.com/doc/9.23/Install.htm" rel="noreferrer">here</a>. If it is not already installed on your system.

blocks|key|2926695|text|这里有一个解决方案，它不需要额外的库，而且非常快速。这是从：pdfs.html#中找到的，我在一个函数中添加了代码，以使它更方便。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|2926696|def+convert(filepath):
++++with+open(filepath,+"rb")+as+file:
++++++++pdf+=+file.read()

++++startmark+=+b"\xff\xd8"
++++startfix+=+0
++++endmark+=+b"\xff\xd9"
++++endfix+=+2
++++i+=+0

++++njpg+=+0
++++while+True:
++++++++istream+=+pdf.find(b"stream",+i)
++++++++if+istream+<+0:
++++++++++++break
++++++++istart+=+pdf.find(startmark,+istream,+istream+%2B+20)
++++++++if+istart+<+0:
++++++++++++i+=+istream+%2B+20
++++++++++++continue
++++++++iend+=+pdf.find(b"endstream",+istart)
++++++++if+iend+<+0:
++++++++++++raise+Exception("Didn't+find+end+of+stream!")
++++++++iend+=+pdf.find(endmark,+iend+-+20)
++++++++if+iend+<+0:
++++++++++++raise+Exception("Didn't+find+end+of+JPG!")

++++++++istart+%2B=+startfix
++++++++iend+%2B=+endfix
++++++++jpg+=+pdf[istart:iend]
++++++++newfile+=+"{}jpg".format(filepath[:-3])
++++++++with+open(newfile,+"wb")+as+jpgfile:
++++++++++++jpgfile.write(jpg)

++++++++njpg+%2B=+1
++++++++i+=+iend

++++++++return+newfile|code-block|syntax|javascript|2926697|以pdf路径作为参数进行调用，函数将在同一个目录中创建一个.jpg文件。|2926698|entityMap|0|LINK|mutability|MUTABLE|url|https://nedbatchelder.com/blog/200712/extracting_jpgs_from_pdfs.html#^0|U|A|0|0|0|0^^$0|@$1|2|3|4|5|6|7|S|8|@]|9|@$A|T|B|U|1|V]]|C|$]]|$1|D|3|E|5|F|7|W|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|X|8|@]|9|@]|C|$]]|$1|K|3|-4|5|6|7|Y|8|@]|9|@]|C|$]]]|L|$M|$5|N|O|P|C|$Q|R]]]]

Here is a solution which requires no additional libraries and is very fast. This was found from: <a href="https://nedbatchelder.com/blog/200712/extracting_jpgs_from_pdfs.html#" rel="nofollow noreferrer">https://nedbatchelder.com/blog/200712/extracting_jpgs_from_pdfs.html#</a>
I have added the code in a function to make it more convenient.

<pre><code>def convert(filepath):
 with open(filepath, "rb") as file:
 pdf = file.read()

 startmark = b"\xff\xd8"
 startfix = 0
 endmark = b"\xff\xd9"
 endfix = 2
 i = 0

 njpg = 0
 while True:
 istream = pdf.find(b"stream", i)
 if istream &lt; 0:
 break
 istart = pdf.find(startmark, istream, istream + 20)
 if istart &lt; 0:
 i = istream + 20
 continue
 iend = pdf.find(b"endstream", istart)
 if iend &lt; 0:
 raise Exception("Didn't find end of stream!")
 iend = pdf.find(endmark, iend - 20)
 if iend &lt; 0:
 raise Exception("Didn't find end of JPG!")

 istart += startfix
 iend += endfix
 jpg = pdf[istart:iend]
 newfile = "{}jpg".format(filepath[:-3])
 with open(newfile, "wb") as jpgfile:
 jpgfile.write(jpg)

 njpg += 1
 i = iend

 return newfile
</code></pre>

Call convert with the pdf path as the argument and the function will create a .jpg file in the same directory

blocks|key|2888509|text|一个问题是，每个人都将面临的一个问题是，安装.My是一种棘手的方法，但它的工作效率很高。第一，下载Poppler+这里.Then解压缩它，在代码部分添加它，只需添加poppler_path=r'C:\Program+\poppler-0.68.0\bin‘(例如)。就像下面|type|unstyled|depth|inlineStyleRanges|offset|length|style|BOLD|entityRanges|data|2888510|from+pdf2image+import+convert_from_path
images+=+convert_from_path("mypdf.pdf",+500,poppler_path=r'C:\Program+Files\poppler-0.68.0\bin')
for+i,+image+in+enumerate(images):
++++fname+=+'image'%2Bstr(i)%2B'.png'
++++image.save(fname,+"PNG")|code-block|syntax|javascript|2888511|entityMap|0|LINK|mutability|MUTABLE|url|http://blog.alivate.com.au/poppler-windows/^0|K|2|1D|7|2A|1A|1L|2|0|0|0^^$0|@$1|2|3|4|5|6|7|S|8|@$9|T|A|U|B|C]|$9|V|A|W|B|C]|$9|X|A|Y|B|C]]|D|@$9|Z|A|10|1|11]]|E|$]]|$1|F|3|G|5|H|7|12|8|@]|D|@]|E|$I|J]]|$1|K|3|-4|5|6|7|13|8|@]|D|@]|E|$]]]|L|$M|$5|N|O|P|E|$Q|R]]]]

One problem,everyone will face that is to Install Poppler.My way is a tricky way,but will work efficiently.1st download Poppler <a href="http://blog.alivate.com.au/poppler-windows/" rel="nofollow noreferrer">here</a>.Then Extract it add In the code section just add poppler_path=r'C:\Program Files\poppler-0.68.0\bin'(for eg.) like below
<pre><code>from pdf2image import convert_from_path
images = convert_from_path(&quot;mypdf.pdf&quot;, 500,poppler_path=r'C:\Program Files\poppler-0.68.0\bin')
for i, image in enumerate(images):
 fname = 'image'+str(i)+'.png'
 image.save(fname, &quot;PNG&quot;)
</code></pre>

blocks|key|2888517|text|下面是一个函数，用于将PDF文件与一页或多页转换为单一合并的JPEG图像。|type|unstyled|depth|inlineStyleRanges|offset|length|style|BOLD|entityRanges|data|2888518|import+os
import+tempfile
from+pdf2image+import+convert_from_path
from+PIL+import+Image

def+convert_pdf_to_image(file_path,+output_path):
++++#+save+temp+image+files+in+temp+dir,+delete+them+after+we+are+finished
++++with+tempfile.TemporaryDirectory()+as+temp_dir:
++++++++#+convert+pdf+to+multiple+image
++++++++images+=+convert_from_path(file_path,+output_folder=temp_dir)
++++++++#+save+images+to+temporary+directory
++++++++temp_images+=+[]
++++++++for+i+in+range(len(images)):
++++++++++++image_path+=+f'{temp_dir}/{i}.jpg'
++++++++++++images[i].save(image_path,+'JPEG')
++++++++++++temp_images.append(image_path)
++++++++#+read+images+into+pillow.Image
++++++++imgs+=+list(map(Image.open,+temp_images))
++++#+find+minimum+width+of+images
++++min_img_width+=+min(i.width+for+i+in+imgs)
++++#+find+total+height+of+all+images
++++total_height+=+0
++++for+i,+img+in+enumerate(imgs):
++++++++total_height+%2B=+imgs[i].height
++++#+create+new+image+object+with+width+and+total+height
++++merged_image+=+Image.new(imgs[0].mode,+(min_img_width,+total_height))
++++#+paste+images+together+one+by+one
++++y+=+0
++++for+img+in+imgs:
++++++++merged_image.paste(img,+(0,+y))
++++++++y+%2B=+img.height
++++#+save+merged+image
++++merged_image.save(output_path)
++++return+output_path|code-block|syntax|javascript|2888519|示例用法：-|2888520|convert_pdf_to_image("path_to_Pdf/1.pdf",+"output_path/output.jpeg")|CODE|2888521|entityMap^0|B|5|H|5|P|B|0|0|0|6|0|0|1W|0^^$0|@$1|2|3|4|5|6|7|R|8|@$9|S|A|T|B|C]|$9|U|A|V|B|C]|$9|W|A|X|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|Y|8|@]|D|@]|E|$I|J]]|$1|K|3|L|5|6|7|Z|8|@$9|10|A|11|B|C]]|D|@]|E|$]]|$1|M|3|N|5|6|7|12|8|@$9|13|A|14|B|O]]|D|@]|E|$]]|$1|P|3|-4|5|6|7|15|8|@]|D|@]|E|$]]]|Q|$]]

Here is a function that does the conversion of a PDF file with one or multiple pages to a single merged JPEG image.
<pre><code>import os
import tempfile
from pdf2image import convert_from_path
from PIL import Image

def convert_pdf_to_image(file_path, output_path):
 # save temp image files in temp dir, delete them after we are finished
 with tempfile.TemporaryDirectory() as temp_dir:
 # convert pdf to multiple image
 images = convert_from_path(file_path, output_folder=temp_dir)
 # save images to temporary directory
 temp_images = []
 for i in range(len(images)):
 image_path = f'{temp_dir}/{i}.jpg'
 images[i].save(image_path, 'JPEG')
 temp_images.append(image_path)
 # read images into pillow.Image
 imgs = list(map(Image.open, temp_images))
 # find minimum width of images
 min_img_width = min(i.width for i in imgs)
 # find total height of all images
 total_height = 0
 for i, img in enumerate(imgs):
 total_height += imgs[i].height
 # create new image object with width and total height
 merged_image = Image.new(imgs[0].mode, (min_img_width, total_height))
 # paste images together one by one
 y = 0
 for img in imgs:
 merged_image.paste(img, (0, y))
 y += img.height
 # save merged image
 merged_image.save(output_path)
 return output_path
</code></pre>
Example usage: -
<code>convert_pdf_to_image(&quot;path_to_Pdf/1.pdf&quot;, &quot;output_path/output.jpeg&quot;)</code>

blocks|key|1596812|text|对于一个包含多页的pdf文件，以下是最好的和最简单的(我使用了pdf2image-1.14.0)：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1596813|from+pdf2image+import+convert_from_path
from+pdf2image.exceptions+import+(
+++++PDFInfoNotInstalledError,
+++++PDFPageCountError,
+++++PDFSyntaxError
+++++)
++++++++
images+=+convert_from_path(r"path/to/input/pdf/file",+output_folder=r"path/to/output/folder",+fmt="jpg",)+#dpi=200,+grayscale=True,+size=(300,400),+first_page=0,+last_page=3)
++++++++
images.clear()|code-block|syntax|javascript|1596814|注意：|1596815|“图像”是PIL图像的列表。|ordered-list-item|1596816|在输出文件夹中保存的图像将具有系统生成的名称；如果需要，稍后可以更改它们。|1596817|entityMap^0|0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|P|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|Q|8|@]|9|@]|A|$E|F]]|$1|G|3|H|5|6|7|R|8|@]|9|@]|A|$]]|$1|I|3|J|5|K|7|S|8|@]|9|@]|A|$]]|$1|L|3|M|5|K|7|T|8|@]|9|@]|A|$]]|$1|N|3|-4|5|6|7|U|8|@]|9|@]|A|$]]]|O|$]]

For a pdf file with multiple pages, the following is the best &amp; simplest (I used pdf2image-1.14.0):
<pre><code>from pdf2image import convert_from_path
from pdf2image.exceptions import (
 PDFInfoNotInstalledError,
 PDFPageCountError,
 PDFSyntaxError
 )
 
images = convert_from_path(r&quot;path/to/input/pdf/file&quot;, output_folder=r&quot;path/to/output/folder&quot;, fmt=&quot;jpg&quot;,) #dpi=200, grayscale=True, size=(300,400), first_page=0, last_page=3)
 
images.clear()
</code></pre>
Note:
<ol>
<li>&quot;images&quot; is a list of PIL images.</li>
<li>The saved images in the output folder will have system generated names; one can later change them, if required.</li>
</ol>

blocks|key|2926761|text|from+pdf2image+import+convert_from_path

PDF_file+=+'Statement.pdf'
pages+=+convert_from_path(PDF_file,+500,userpw='XXX')

image_counter+=+1

for+page+in+pages:

++++filename+=+"foldername/page_"+%2B+str(image_counter)+%2B+".jpg"
++++page.save(filename,+'JPEG')
++++image_counter+=+image_counter+%2B+1|type|code-block|depth|inlineStyleRanges|entityRanges|data|syntax|javascript|2926762|unstyled|entityMap^0|0^^$0|@$1|2|3|4|5|6|7|G|8|@]|9|@]|A|$B|C]]|$1|D|3|-4|5|E|7|H|8|@]|9|@]|A|$]]]|F|$]]

<pre><code>from pdf2image import convert_from_path

PDF_file = 'Statement.pdf'
pages = convert_from_path(PDF_file, 500,userpw='XXX')

image_counter = 1

for page in pages:

 filename = &quot;foldername/page_&quot; + str(image_counter) + &quot;.jpg&quot;
 page.save(filename, 'JPEG')
 image_counter = image_counter + 1
</code></pre>

blocks|key|2926780|text|我编写这个脚本是为了很容易地将包含PDF(单页)的文件夹目录转换为PNG。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|2926781|import+os
from+pathlib+import+PurePath
import+glob
#+from+PIL+import+Image
from+pdf2image+import+convert_from_path
import+pdb

#+In[file+list]

wd+=+os.getcwd()

#+filter+images
fileListpdf+=+glob.glob(f'{wd}//*.pdf')

#+In[Convert+pdf+to+images]

for+i+in+fileListpdf:
++++
++++images+=+convert_from_path(i,+dpi=300)
++++
++++path_split+=+PurePath(i).parts
++++fileName,+ext+=+os.path.splitext(path_split[-1])
++++
++++images[0].save(f'{fileName}.png',+'PNG')|code-block|syntax|javascript|2926782|希望，如果您需要将PDF转换为PNG，这将有所帮助！|2926783|entityMap^0|0|0|0^^$0|@$1|2|3|4|5|6|7|K|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|L|8|@]|9|@]|A|$E|F]]|$1|G|3|H|5|6|7|M|8|@]|9|@]|A|$]]|$1|I|3|-4|5|6|7|N|8|@]|9|@]|A|$]]]|J|$]]

I wrote this script to easily convert a folder directory that contains PDFs (single page) to PNGs really nicely.
<pre class="lang-py prettyprint-override"><code>import os
from pathlib import PurePath
import glob
# from PIL import Image
from pdf2image import convert_from_path
import pdb

# In[file list]

wd = os.getcwd()

# filter images
fileListpdf = glob.glob(f'{wd}//*.pdf')

# In[Convert pdf to images]

for i in fileListpdf:
 
 images = convert_from_path(i, dpi=300)
 
 path_split = PurePath(i).parts
 fileName, ext = os.path.splitext(path_split[-1])
 
 images[0].save(f'{fileName}.png', 'PNG')
</code></pre>
Hopefully, this helps if you need to convert PDFs to PNGs!

blocks|key|2926815|text|使用pypdfium2|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|2926816|python3+-m+pip+install+pypdfium2|code-block|syntax|javascript|2926817|import+pypdfium2+as+pdfium

#+Load+a+document
filepath+=+"tests/resources/multipage.pdf"
pdf+=+pdfium.PdfDocument(filepath)

#+render+a+single+page+(in+this+case:+the+first+one)
page+=+pdf.get_page(0)
pil_image+=+page.render_to(
++++pdfium.BitmapConv.pil_image,
)
pil_image.save("output.jpg")

#+render+multiple+pages+concurrently+(in+this+case:+all)
page_indices+=+[i+for+i+in+range(len(pdf))]
renderer+=+pdf.render_to(
++++pdfium.BitmapConv.pil_image,
++++page_indices+=+page_indices,
)
for+image,+index+in+zip(renderer,+page_indices):
++++image.save("output_%2502d.jpg"+%25+index)|2926818|优势：|2926819|PDFium是自由许可的(根据您的选择，BSD3-Clause或Apache2.0)|unordered-list-item|2926820|它速度快，表现好于波普尔。就速度而言，pypdfium2几乎可以达到PyMuPDF。|2926821|根据需要返回PIL.Image.Image、numpy.ndarray、字节或ctype数组。|2926822|能够处理加密的(密码保护的)PDF。|2926823|没有强制的运行时依赖项|2926824|支持Python3.6+>=|2926825|安装基础设施符合PEP+517/518，而遗留安装仍然有效。|2926826|车轮目前可用于|2926827|Windows+amd64，win32，arm64|2926828|macOS+x86_64，arm64|2926829|Linux+(glibc)+x86_64，i686，aarch64，armv7l|2926830|Linux+(musl)+x86_64，i686|2926831|也有一个从源代码构建的脚本。|2926832|(免责声明:我是作者)|2926833|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/pypdfium2-team/pypdfium2|1|https://github.com/pymupdf/PyMuPDF|2|https://pillow.readthedocs.io/en/stable/reference/Image.html#PIL.Image.Image|3|https://numpy.org/doc/stable/reference/generated/numpy.ndarray.html#numpy.ndarray^0|2|9|2|9|0|0|0|0|0|0|Y|7|Y|7|1|0|6|F|M|D|6|F|2|M|D|3|0|0|0|0|0|0|0|7|0|0|5|0|0|D|0|0|C|0|0|0^^$0|@$1|2|3|4|5|6|7|1V|8|@$9|1W|A|1X|B|C]]|D|@$9|1Y|A|1Z|1|20]]|E|$]]|$1|F|3|G|5|H|7|21|8|@]|D|@]|E|$I|J]]|$1|K|3|L|5|H|7|22|8|@]|D|@]|E|$I|J]]|$1|M|3|N|5|6|7|23|8|@]|D|@]|E|$]]|$1|O|3|P|5|Q|7|24|8|@]|D|@]|E|$]]|$1|R|3|S|5|Q|7|25|8|@$9|26|A|27|B|C]]|D|@$9|28|A|29|1|2A]]|E|$]]|$1|T|3|U|5|Q|7|2B|8|@$9|2C|A|2D|B|C]|$9|2E|A|2F|B|C]]|D|@$9|2G|A|2H|1|2I]|$9|2J|A|2K|1|2L]]|E|$]]|$1|V|3|W|5|Q|7|2M|8|@]|D|@]|E|$]]|$1|X|3|Y|5|Q|7|2N|8|@]|D|@]|E|$]]|$1|Z|3|10|5|Q|7|2O|8|@]|D|@]|E|$]]|$1|11|3|12|5|Q|7|2P|8|@]|D|@]|E|$]]|$1|13|3|14|5|6|7|2Q|8|@]|D|@]|E|$]]|$1|15|3|16|5|Q|7|2R|8|@$9|2S|A|2T|B|C]]|D|@]|E|$]]|$1|17|3|18|5|Q|7|2U|8|@$9|2V|A|2W|B|C]]|D|@]|E|$]]|$1|19|3|1A|5|Q|7|2X|8|@$9|2Y|A|2Z|B|C]]|D|@]|E|$]]|$1|1B|3|1C|5|Q|7|30|8|@$9|31|A|32|B|C]]|D|@]|E|$]]|$1|1D|3|1E|5|6|7|33|8|@]|D|@]|E|$]]|$1|1F|3|1G|5|6|7|34|8|@]|D|@]|E|$]]|$1|1H|3|-4|5|6|7|35|8|@]|D|@]|E|$]]]|1I|$1J|$5|1K|1L|1M|E|$1N|1O]]|1P|$5|1K|1L|1M|E|$1N|1Q]]|1R|$5|1K|1L|1M|E|$1N|1S]]|1T|$5|1K|1L|1M|E|$1N|1U]]]]

Using <a href="https://github.com/pypdfium2-team/pypdfium2" rel="nofollow noreferrer"><code>pypdfium2</code></a>:
<pre class="lang-bash prettyprint-override"><code>python3 -m pip install pypdfium2
</code></pre>
<pre class="lang-py prettyprint-override"><code>import pypdfium2 as pdfium

# Load a document
filepath = &quot;tests/resources/multipage.pdf&quot;
pdf = pdfium.PdfDocument(filepath)

# render a single page (in this case: the first one)
page = pdf.get_page(0)
pil_image = page.render_to(
 pdfium.BitmapConv.pil_image,
)
pil_image.save(&quot;output.jpg&quot;)

# render multiple pages concurrently (in this case: all)
page_indices = [i for i in range(len(pdf))]
renderer = pdf.render_to(
 pdfium.BitmapConv.pil_image,
 page_indices = page_indices,
)
for image, index in zip(renderer, page_indices):
 image.save(&quot;output_%02d.jpg&quot; % index)
</code></pre>
Advantages:
<ul>
<li>PDFium is liberal-licensed (BSD 3-Clause or Apache 2.0, at your choice)</li>
<li>It is fast, outperforming Poppler. In terms of speed, pypdfium2 can almost reach <a href="https://github.com/pymupdf/PyMuPDF" rel="nofollow noreferrer"><code>PyMuPDF</code></a></li>
<li>Returns <a href="https://pillow.readthedocs.io/en/stable/reference/Image.html#PIL.Image.Image" rel="nofollow noreferrer"><code>PIL.Image.Image</code></a>, <a href="https://numpy.org/doc/stable/reference/generated/numpy.ndarray.html#numpy.ndarray" rel="nofollow noreferrer"><code>numpy.ndarray</code></a>, bytes, or a ctypes array, depending on your needs</li>
<li>Is capable of processing encrypted (password-protected) PDFs</li>
<li>No mandatory runtime dependencies</li>
<li>Supports Python &gt;= 3.6</li>
<li>Setup infrastructure complies with PEP 517/518, while legacy setup still works as well</li>
</ul>
Wheels are currently available for
<ul>
<li><code>Windows</code> amd64, win32, arm64</li>
<li><code>macOS</code> x86_64, arm64</li>
<li><code>Linux (glibc)</code> x86_64, i686, aarch64, armv7l</li>
<li><code>Linux (musl)</code> x86_64, i686</li>
</ul>
There is a script to build from source, too.
(Disclaimer: I'm the author)

blocks|key|2888625|text|这个简单的脚本可以将包含PDF(单页/多页)的文件夹目录转换为jpeg。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|2888626|from+PIL+import+Image
import+pytesseract
import+sys
from+pdf2image+import+convert_from_path
import+os
from+os+import+listdir
from+os+import+system
from+os.path+import+isfile,+join,+basename,+dirname
import+shutil

def+move_processed_file(file,+doc_path,+download_processed):
++++try:
++++++++shutil.move(doc_path+%2B+'/'+%2B+file,+download_processed+%2B+'/'+%2B+file)
++++++++pass
++++except+Exception+as+e:
++++++++print(e.errno)
++++++++raise
++++else:
++++++++pass
++++finally:
++++++++pass
++++pass


def+run_conversion():
++++root_dir+=+os.path.abspath(os.curdir)

++++doc_path+=+root_dir+%2B+r"\data\download"
++++pdf_processed+=+root_dir+%2B+r"\data\download\pdf_processed"
++++results_folder+=+doc_path

++++files+=+[f+for+f+in+listdir(doc_path)+if+isfile(join(doc_path,+f))]

++++pdf_files+=+[f+for+f+in+listdir(doc_path)+if+isfile(join(doc_path,+f))+and+f.lower().endswith('.pdf')]

++++#+check+OS+type
++++if+os.name+==+'nt':
++++++++#+if+is+windows+or+a+graphical+OS,+change+this+poppler+path+with+your+own+path
++++++++poppler_path+=+r"C:\poppler-0.68.0\bin"
++++else:
++++++++poppler_path+=+root_dir+%2B+r"\usr\bin"

++++for+file+in+pdf_files:

++++++++'''+
++++++++#+Converting+PDF+to+images+
++++++++'''

++++++++#+Store+all+the+pages+of+the+PDF+in+a+variable
++++++++pages+=+convert_from_path(doc_path+%2B+'/'+%2B+file,+500,+poppler_path=poppler_path)

++++++++#+Counter+to+store+images+of+each+page+of+PDF+to+image
++++++++image_counter+=+1

++++++++filename,+file_extension+=+os.path.splitext(file)

++++++++#+Iterate+through+all+the+pages+stored+above
++++++++for+page+in+pages:
++++++++++++#+Declaring+filename+for+each+page+of+PDF+as+JPG
++++++++++++#+PDF+page+n+->+page_n.jpg
++++++++++++filename+=+filename+%2B+'_'+%2B+str(image_counter)+%2B+".jpg"

++++++++++++#+Save+the+image+of+the+page+in+system
++++++++++++page.save(results_folder+%2B+'/'+%2B+filename,+'JPEG')

++++++++++++#+Increment+the+counter+to+update+filename
++++++++++++image_counter+%2B=+1

++++++++move_processed_file(file,+doc_path,+pdf_processed)|code-block|syntax|javascript|2888627|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|I|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|J|8|@]|9|@]|A|$E|F]]|$1|G|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|H|$]]

This easy script can convert a folder directory that contains PDFs (single/multiple pages) to jpeg.
<pre><code>from PIL import Image
import pytesseract
import sys
from pdf2image import convert_from_path
import os
from os import listdir
from os import system
from os.path import isfile, join, basename, dirname
import shutil

def move_processed_file(file, doc_path, download_processed):
 try:
 shutil.move(doc_path + '/' + file, download_processed + '/' + file)
 pass
 except Exception as e:
 print(e.errno)
 raise
 else:
 pass
 finally:
 pass
 pass


def run_conversion():
 root_dir = os.path.abspath(os.curdir)

 doc_path = root_dir + r&quot;\data\download&quot;
 pdf_processed = root_dir + r&quot;\data\download\pdf_processed&quot;
 results_folder = doc_path

 files = [f for f in listdir(doc_path) if isfile(join(doc_path, f))]

 pdf_files = [f for f in listdir(doc_path) if isfile(join(doc_path, f)) and f.lower().endswith('.pdf')]

 # check OS type
 if os.name == 'nt':
 # if is windows or a graphical OS, change this poppler path with your own path
 poppler_path = r&quot;C:\poppler-0.68.0\bin&quot;
 else:
 poppler_path = root_dir + r&quot;\usr\bin&quot;

 for file in pdf_files:

 ''' 
 # Converting PDF to images 
 '''

 # Store all the pages of the PDF in a variable
 pages = convert_from_path(doc_path + '/' + file, 500, poppler_path=poppler_path)

 # Counter to store images of each page of PDF to image
 image_counter = 1

 filename, file_extension = os.path.splitext(file)

 # Iterate through all the pages stored above
 for page in pages:
 # Declaring filename for each page of PDF as JPG
 # PDF page n -&gt; page_n.jpg
 filename = filename + '_' + str(image_counter) + &quot;.jpg&quot;

 # Save the image of the page in system
 page.save(results_folder + '/' + filename, 'JPEG')

 # Increment the counter to update filename
 image_counter += 1

 move_processed_file(file, doc_path, pdf_processed)


</code></pre>

In python code, how to efficiently save a certain page in a pdf as a jpeg file? (Use case: I've a python flask web server where pdf-s will be uploaded and jpeg-s corresponding to each page is stores.)

<a href="https://stackoverflow.com/a/34116472">This solution</a> is close, but the problem is that it does not convert the entire page to jpeg.

Extract a page from a pdf as a jpeg

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

在python代码中，如何有效地将某个页面保存在pdf中作为jpeg文件？(用例:我有一个python烧瓶web服务器，其中pdf-s将被上传，与每个页面对应的jpeg-s是存储的。)是接近的，但问题是它没有将整个页面转换为jpeg。

问从pdf中提取页面作为jpeg
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问从pdf中提取页面作为jpegEN