blocks|key|1001449|text|试试pdfminer|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1001450|from+pdfminer.pdfparser+import+PDFParser
from+pdfminer.pdfdocument+import+PDFDocument

fp+=+open('diveintopython.pdf',+'rb')
parser+=+PDFParser(fp)
doc+=+PDFDocument(parser)

print(doc.info)++#+The+"Info"+metadata|code-block|syntax|javascript|1001451|下面是输出：|1001452|>>>+[{'CreationDate':+'D:20040520151901-0500',
++'Creator':+'DocBook+XSL+Stylesheets+V1.52.2',
++'Keywords':+'Python,+Dive+Into+Python,+tutorial,+object-oriented,+programming,+documentation,+book,+free',
++'Producer':+'htmldoc+1.8.23+Copyright+1997-2002+Easy+Software+Products,+All+Rights+Reserved.',
++'Title':+'Dive+Into+Python'}]|1001453|有关更多信息，请参阅本教程：A+lightweight+XMP+parser+for+extracting+PDF+metadata+in+Python。|1001454|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/euske/pdfminer/|1|http://blog.matt-swain.com/post/25650072381/a-lightweight-xmp-parser-for-extracting-pdf-metadata-in^0|2|8|0|0|0|0|0|E|1Q|1|0^^$0|@$1|2|3|4|5|6|7|Y|8|@]|9|@$A|Z|B|10|1|11]]|C|$]]|$1|D|3|E|5|F|7|12|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|13|8|@]|9|@]|C|$]]|$1|K|3|L|5|F|7|14|8|@]|9|@]|C|$G|H]]|$1|M|3|N|5|6|7|15|8|@]|9|@$A|16|B|17|1|18]]|C|$]]|$1|O|3|-4|5|6|7|19|8|@]|9|@]|C|$]]]|P|$Q|$5|R|S|T|C|$U|V]]|W|$5|R|S|T|C|$U|X]]]]

Try <a href="https://github.com/euske/pdfminer/" rel="noreferrer">pdfminer</a>:

<pre><code>from pdfminer.pdfparser import PDFParser
from pdfminer.pdfdocument import PDFDocument

fp = open('diveintopython.pdf', 'rb')
parser = PDFParser(fp)
doc = PDFDocument(parser)

print(doc.info) # The "Info" metadata
</code></pre>

Here's the output:

<pre><code>&gt;&gt;&gt; [{'CreationDate': 'D:20040520151901-0500',
 'Creator': 'DocBook XSL Stylesheets V1.52.2',
 'Keywords': 'Python, Dive Into Python, tutorial, object-oriented, programming, documentation, book, free',
 'Producer': 'htmldoc 1.8.23 Copyright 1997-2002 Easy Software Products, All Rights Reserved.',
 'Title': 'Dive Into Python'}]
</code></pre>

For more info, look at this tutorial: <a href="http://blog.matt-swain.com/post/25650072381/a-lightweight-xmp-parser-for-extracting-pdf-metadata-in" rel="noreferrer">A lightweight XMP parser for extracting PDF metadata in Python</a>.

blocks|key|4834570|text|Morten+Zilmer指出:+pyPdf+homepage说它不再被维护。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|4834571|我已经使用pyPdf实现了这一点。请看下面的示例代码。|4834572|from+pyPdf+import+PdfFileReader
pdf_toread+=+PdfFileReader(open("doc2.pdf",+"rb"))
pdf_info+=+pdf_toread.getDocumentInfo()
print(str(pdf_info))|code-block|syntax|javascript|4834573|输出：|4834574|{'/Title':+u'Microsoft+Word+-+Agnico-Eagle+-+Complaint+(00040197-2)',+'/CreationDate':+u"D:20111108111228-05'00'",+'/Producer':+u'Acrobat+Distiller+10.0.0+(Windows)',+'/ModDate':+u"D:20111108112409-05'00'",+'/Creator':+u'PScript5.dll+Version+5.2.2',+'/Author':+u'LdelPino'}|4834575|entityMap|0|LINK|mutability|MUTABLE|url|http://pybrary.net/pyPdf/|1^0|N|8|0|0|5|5|1|0|0|0|0^^$0|@$1|2|3|4|5|6|7|X|8|@]|9|@$A|Y|B|Z|1|10]]|C|$]]|$1|D|3|E|5|6|7|11|8|@]|9|@$A|12|B|13|1|14]]|C|$]]|$1|F|3|G|5|H|7|15|8|@]|9|@]|C|$I|J]]|$1|K|3|L|5|6|7|16|8|@]|9|@]|C|$]]|$1|M|3|N|5|H|7|17|8|@]|9|@]|C|$I|J]]|$1|O|3|-4|5|6|7|18|8|@]|9|@]|C|$]]]|P|$Q|$5|R|S|T|C|$U|V]]|W|$5|R|S|T|C|$U|V]]]]

Pointed out by Morten Zilmer: pyPdf <a href="http://pybrary.net/pyPdf/" rel="nofollow noreferrer">homepage</a> says it is no longer maintained.

I have implemented this using <a href="http://pybrary.net/pyPdf/" rel="nofollow noreferrer">pyPdf</a>. Please see the sample code below.

<pre><code>from pyPdf import PdfFileReader
pdf_toread = PdfFileReader(open("doc2.pdf", "rb"))
pdf_info = pdf_toread.getDocumentInfo()
print(str(pdf_info))
</code></pre>

Output:

<pre><code>{'/Title': u'Microsoft Word - Agnico-Eagle - Complaint (00040197-2)', '/CreationDate': u"D:20111108111228-05'00'", '/Producer': u'Acrobat Distiller 10.0.0 (Windows)', '/ModDate': u"D:20111108112409-05'00'", '/Creator': u'PScript5.dll Version 5.2.2', '/Author': u'LdelPino'}
</code></pre>

blocks|key|4351410|text|有关Python3，请参阅@Khaleel中的PyPDF2示例代码，更新为：|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|4351411|from+PyPDF2+import+PdfFileReader
pdf_toread+=+PdfFileReader(open("test.pdf",+"rb"))
pdf_info+=+pdf_toread.getDocumentInfo()
print(str(pdf_info))|code-block|syntax|javascript|4351412|使用pip+install+PyPDF2进行安装。|style|CODE|4351413|entityMap|0|LINK|mutability|MUTABLE|url|https://github.com/mstamy2/PyPDF2^0|N|6|0|0|0|2|I|0^^$0|@$1|2|3|4|5|6|7|U|8|@]|9|@$A|V|B|W|1|X]]|C|$]]|$1|D|3|E|5|F|7|Y|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|Z|8|@$A|10|B|11|K|L]]|9|@]|C|$]]|$1|M|3|-4|5|6|7|12|8|@]|9|@]|C|$]]]|N|$O|$5|P|Q|R|C|$S|T]]]]

For Python 3 see <a href="https://github.com/mstamy2/PyPDF2" rel="noreferrer">PyPDF2</a> with example code from @Khaleel updated to:

<pre><code>from PyPDF2 import PdfFileReader
pdf_toread = PdfFileReader(open("test.pdf", "rb"))
pdf_info = pdf_toread.getDocumentInfo()
print(str(pdf_info))
</code></pre>

Install using <code>pip install PyPDF2</code>.

blocks|key|4351441|text|对于Python3和新的pdfminer+(pip+install+pdfminer3k)：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|4351442|import+os
from+pdfminer.pdfparser+import+PDFParser
from+pdfminer.pdfparser+import+PDFDocument

fp+=+open("foo.pdf",+'rb')
parser+=+PDFParser(fp)
doc+=+PDFDocument(parser)
parser.set_document(doc)
doc.set_parser(parser)
if+len(doc.info)+>+0:
++++info+=+doc.info[0]
++++print(info)|code-block|syntax|javascript|4351443|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|I|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|J|8|@]|9|@]|A|$E|F]]|$1|G|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|H|$]]

For Python 3 and new pdfminer (pip install pdfminer3k):

<pre><code>import os
from pdfminer.pdfparser import PDFParser
from pdfminer.pdfparser import PDFDocument

fp = open("foo.pdf", 'rb')
parser = PDFParser(fp)
doc = PDFDocument(parser)
parser.set_document(doc)
doc.set_parser(parser)
if len(doc.info) &gt; 0:
 info = doc.info[0]
 print(info)
</code></pre>

blocks|key|264697|text|尝试pdfreader，您可以访问文档目录元数据，如下所示：|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|264698|+++from+pdfreader+import+PDFDocument++++
+++f+=+open("foo.pdf",+'rb')
+++doc+=+PDFDocument(f)
+++metadata+=+doc.root.Metadata|code-block|syntax|javascript|264699|entityMap|0|LINK|mutability|MUTABLE|url|http://pdfreader.readthedocs.io/^0|2|9|0|0|0^^$0|@$1|2|3|4|5|6|7|Q|8|@]|9|@$A|R|B|S|1|T]]|C|$]]|$1|D|3|E|5|F|7|U|8|@]|9|@]|C|$G|H]]|$1|I|3|-4|5|6|7|V|8|@]|9|@]|C|$]]]|J|$K|$5|L|M|N|C|$O|P]]]]

Try <a href="http://pdfreader.readthedocs.io/" rel="nofollow noreferrer">pdfreader</a>
You can access document catalog Metadata like below:

<pre><code> from pdfreader import PDFDocument 
 f = open("foo.pdf", 'rb')
 doc = PDFDocument(f)
 metadata = doc.root.Metadata
</code></pre>

blocks|key|1001574|text|pikepdf提供了一种简单可靠的方法来实现这一点。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1001575|我用一堆pdf文件对此进行了测试，似乎有两种不同的方法可以在创建PDF时插入元数据。一些人正在插入NUL字节和其他胡言乱语。Pikepdf很好地处理了这两个问题。|style|CODE|1001576|import+pikepdf
p+=+pikepdf.Pdf.open(r'path/to/file.pdf')
str(p.docinfo['/Author'])++#+mind+the+slash|code-block|syntax|javascript|1001577|这将返回一个字符串-如果用str包装它的话。示例：|1001578|1001579|'Normal+person'|unordered-list-item|1001580|'ABC'|1001581|1001582|与其他选项相比：|1001583|1001584|pdfminer+-+Not+active+maintained|1001585|pdfminer.six+-+active|1001586|pdfreader+-Active(但仍建议您使用easy_install、a.o.)|1001587|pyPdf+-而不是maintained|1001588|PyPDF2+-+Not+maintained+)+(Phaseit，Inc.退出了radar)|1001589|Borb+-Active。|1001590|1001591|1001592|Pdfminer.six：|1001593|pip+install+pdfminer.six|1001594|import+pdfminer.pdfparser
import+pdfminer.pdfdocument
h+=+open('path/to/file.pdf',+'rb')
p+=+pdfminer.pdfparser.PDFParser(h)
d+=+pdfminer.pdfparser.PDFDocument(p)
d.info[0]['Author']|1001595|这将返回一个二进制字符串，其中包括不可解码的字符(如果存在)。示例：|1001596|美国广播公司(+|1001597|b'Normal+person'|1001598|b'\xfe\xff\x00A\x00B\x00C'+)|1001599|1001600|要转换为字符串，请执行以下操作：|1001601|1001602|b'Normal+person'.decode()生成字符串'Normal+person'|1001603|b'\xfe\xff\x00A\x00B\x00C'.decode(encoding='utf-8',+errors='ignore').replace('\x00',+'')生成字符串'ABC'|1001604|1001605|1001606|pdfreader|1001607|pip+install+pdfreader|1001608|import+pdfreader
h+=+open(r'path/to/file.pdf',+'rb')
d+=+pdfreader.PDFDocument(h)
d.metadata['Author']|1001609|这将返回包含所请求信息的字符串，或包含所找到数据的十六进制表示形式的字符串。然后，这还包括相同的不可解码字符。示例：|1001610|美国广播公司(+|1001611|'Normal+person'|1001612|'FEFF004100420043'+)|1001613|1001614|然后，您首先需要检测这是否仍然是“编码的”，我认为这是一个相当麻烦的问题。通过调用这段丑陋的代码，可以使第二个字符串成为合理的字符串：|1001615|s+=+'FEFF004100420043'
''.join([c+for+c+in+(chr(int(s[i:i%2B2],+16))+for+i+in+range(0,+len(s),+2))+if+c.isascii()]).replace('\x00',+'')
>>>+'ABC'|1001616|1001617|Borb|1001618|pip+install+borb|1001619|import+borb.pdf.pdf
h+=+open(r'path/to/file.pdf',+'rb')
d:+borb.pdf.document.Document+=+borb.pdf.pdf.PDF.loads(h)
str(d.get_document_info().get_author())|1001620|这将返回一个字符串-如果用str包装它的话。加载一个相当大的PDF需要很长时间。我有一个PDF，borb因为TypeError异常而卡住了。另请参阅borb's+dedicated+example+repo上的示例。|1001621|entityMap|0|LINK|mutability|MUTABLE|url|https://pypi.org/project/pikepdf/|1|https://github.com/euske/pdfminer|2|https://github.com/pdfminer/pdfminer.six|3|https://pypi.org/project/pdfreader/|4|http://pybrary.net/pyPdf/|5|https://pypi.org/project/PyPDF2/|6|https://borbpdf.com/|7|https://github.com/jorisschellekens/borb-examples#31-extracting-meta-information^0|0|7|0|0|1D|3|0|0|D|3|0|0|0|F|0|0|5|0|0|0|0|0|8|1|0|0|C|2|0|P|C|0|9|3|0|0|5|4|0|0|6|5|0|0|4|6|0|0|0|0|0|O|0|0|0|0|0|G|0|0|Q|0|0|0|0|0|P|U|F|0|0|2G|2L|5|0|0|0|0|0|L|0|0|0|0|0|F|0|0|I|0|0|0|0|0|0|0|G|0|0|D|3|22|T|7|0^^$0|@$1|2|3|4|5|6|7|3E|8|@]|9|@$A|3F|B|3G|1|3H]]|C|$]]|$1|D|3|E|5|6|7|3I|8|@$A|3J|B|3K|F|G]]|9|@]|C|$]]|$1|H|3|I|5|J|7|3L|8|@]|9|@]|C|$K|L]]|$1|M|3|N|5|6|7|3M|8|@$A|3N|B|3O|F|G]]|9|@]|C|$]]|$1|O|3|-4|5|6|7|3P|8|@]|9|@]|C|$]]|$1|P|3|Q|5|R|7|3Q|8|@$A|3R|B|3S|F|G]]|9|@]|C|$]]|$1|S|3|T|5|R|7|3T|8|@$A|3U|B|3V|F|G]]|9|@]|C|$]]|$1|U|3|-4|5|6|7|3W|8|@]|9|@]|C|$]]|$1|V|3|W|5|6|7|3X|8|@]|9|@]|C|$]]|$1|X|3|-4|5|6|7|3Y|8|@]|9|@]|C|$]]|$1|Y|3|Z|5|R|7|3Z|8|@]|9|@$A|40|B|41|1|42]]|C|$]]|$1|10|3|11|5|R|7|43|8|@]|9|@$A|44|B|45|1|46]]|C|$]]|$1|12|3|13|5|R|7|47|8|@$A|48|B|49|F|G]]|9|@$A|4A|B|4B|1|4C]]|C|$]]|$1|14|3|15|5|R|7|4D|8|@]|9|@$A|4E|B|4F|1|4G]]|C|$]]|$1|16|3|17|5|R|7|4H|8|@]|9|@$A|4I|B|4J|1|4K]]|C|$]]|$1|18|3|19|5|R|7|4L|8|@]|9|@$A|4M|B|4N|1|4O]]|C|$]]|$1|1A|3|-4|5|6|7|4P|8|@]|9|@]|C|$]]|$1|1B|3|-4|5|6|7|4Q|8|@]|9|@]|C|$]]|$1|1C|3|1D|5|6|7|4R|8|@]|9|@]|C|$]]|$1|1E|3|1F|5|6|7|4S|8|@$A|4T|B|4U|F|G]]|9|@]|C|$]]|$1|1G|3|1H|5|J|7|4V|8|@]|9|@]|C|$K|L]]|$1|1I|3|1J|5|6|7|4W|8|@]|9|@]|C|$]]|$1|1K|3|1L|5|6|7|4X|8|@]|9|@]|C|$]]|$1|1M|3|1N|5|R|7|4Y|8|@$A|4Z|B|50|F|G]]|9|@]|C|$]]|$1|1O|3|1P|5|R|7|51|8|@$A|52|B|53|F|G]]|9|@]|C|$]]|$1|1Q|3|-4|5|6|7|54|8|@]|9|@]|C|$]]|$1|1R|3|1S|5|6|7|55|8|@]|9|@]|C|$]]|$1|1T|3|-4|5|6|7|56|8|@]|9|@]|C|$]]|$1|1U|3|1V|5|R|7|57|8|@$A|58|B|59|F|G]|$A|5A|B|5B|F|G]]|9|@]|C|$]]|$1|1W|3|1X|5|R|7|5C|8|@$A|5D|B|5E|F|G]|$A|5F|B|5G|F|G]]|9|@]|C|$]]|$1|1Y|3|-4|5|6|7|5H|8|@]|9|@]|C|$]]|$1|1Z|3|-4|5|6|7|5I|8|@]|9|@]|C|$]]|$1|20|3|21|5|6|7|5J|8|@]|9|@]|C|$]]|$1|22|3|23|5|6|7|5K|8|@$A|5L|B|5M|F|G]]|9|@]|C|$]]|$1|24|3|25|5|J|7|5N|8|@]|9|@]|C|$K|L]]|$1|26|3|27|5|6|7|5O|8|@]|9|@]|C|$]]|$1|28|3|29|5|6|7|5P|8|@]|9|@]|C|$]]|$1|2A|3|2B|5|R|7|5Q|8|@$A|5R|B|5S|F|G]]|9|@]|C|$]]|$1|2C|3|2D|5|R|7|5T|8|@$A|5U|B|5V|F|G]]|9|@]|C|$]]|$1|2E|3|-4|5|6|7|5W|8|@]|9|@]|C|$]]|$1|2F|3|2G|5|6|7|5X|8|@]|9|@]|C|$]]|$1|2H|3|2I|5|J|7|5Y|8|@]|9|@]|C|$K|L]]|$1|2J|3|-4|5|6|7|5Z|8|@]|9|@]|C|$]]|$1|2K|3|2L|5|6|7|60|8|@]|9|@]|C|$]]|$1|2M|3|2N|5|6|7|61|8|@$A|62|B|63|F|G]]|9|@]|C|$]]|$1|2O|3|2P|5|J|7|64|8|@]|9|@]|C|$K|L]]|$1|2Q|3|2R|5|6|7|65|8|@$A|66|B|67|F|G]]|9|@$A|68|B|69|1|6A]]|C|$]]|$1|2S|3|-4|5|6|7|6B|8|@]|9|@]|C|$]]]|2T|$2U|$5|2V|2W|2X|C|$2Y|2Z]]|30|$5|2V|2W|2X|C|$2Y|31]]|32|$5|2V|2W|2X|C|$2Y|33]]|34|$5|2V|2W|2X|C|$2Y|35]]|36|$5|2V|2W|2X|C|$2Y|37]]|38|$5|2V|2W|2X|C|$2Y|39]]|3A|$5|2V|2W|2X|C|$2Y|3B]]|3C|$5|2V|2W|2X|C|$2Y|3D]]]]

<a href="https://pypi.org/project/pikepdf/" rel="nofollow noreferrer">pikepdf</a> provides an easy and reliable way to do this.
I tested this with a bunch of pdf files, and it seems there are two distinct ways to insert metadata when the PDF is created. Some are inserting <code>NUL</code> bytes and other gibberish. Pikepdf handles both well.
<pre><code>import pikepdf
p = pikepdf.Pdf.open(r'path/to/file.pdf')
str(p.docinfo['/Author']) # mind the slash
</code></pre>
This returns a string - if you wrapped it with <code>str</code>. Examples:
<ul>
<li><code>'Normal person'</code></li>
<li><code>'ABC'</code></li>
</ul>
Comparing with other options:
<ul>
<li><a href="https://github.com/euske/pdfminer" rel="nofollow noreferrer">pdfminer</a> - Not actively maintained</li>
<li><a href="https://github.com/pdfminer/pdfminer.six" rel="nofollow noreferrer">pdfminer.six</a> - active</li>
<li><a href="https://pypi.org/project/pdfreader/" rel="nofollow noreferrer">pdfreader</a> - active (but still suggest you to use <code>easy_install</code>, a.o.)</li>
<li><a href="http://pybrary.net/pyPdf/" rel="nofollow noreferrer">pyPdf</a> - Not maintained</li>
<li><a href="https://pypi.org/project/PyPDF2/" rel="nofollow noreferrer">PyPDF2</a> - Not maintained (Phaseit, Inc. went off the radar)</li>
<li><a href="https://borbpdf.com" rel="nofollow noreferrer">Borb</a> - Active.</li>
</ul>
<hr />
<h2>Pdfminer.six:</h2>
<code>pip install pdfminer.six</code>
<pre><code>import pdfminer.pdfparser
import pdfminer.pdfdocument
h = open('path/to/file.pdf', 'rb')
p = pdfminer.pdfparser.PDFParser(h)
d = pdfminer.pdfparser.PDFDocument(p)
d.info[0]['Author']
</code></pre>
This returns a binary string, including the non-decodable characters if they are present. Examples:
<ul>
<li><code>b'Normal person'</code></li>
<li><code>b'\xfe\xff\x00A\x00B\x00C'</code> (ABC)</li>
</ul>
To convert to a string:
<ul>
<li><code>b'Normal person'.decode()</code> yields the string <code>'Normal person'</code></li>
<li><code>b'\xfe\xff\x00A\x00B\x00C'.decode(encoding='utf-8', errors='ignore').replace('\x00', '')</code> yields the string <code>'ABC'</code></li>
</ul>
<hr />
<h2>pdfreader</h2>
<code>pip install pdfreader</code>
<pre><code>import pdfreader
h = open(r'path/to/file.pdf', 'rb')
d = pdfreader.PDFDocument(h)
d.metadata['Author']
</code></pre>
This returns either the string with the requested information, or a string containing the hex representation of the data it found. This then also includes the same non-decodable characters. Examples:
<ul>
<li><code>'Normal person'</code></li>
<li><code>'FEFF004100420043'</code> (ABC)</li>
</ul>
You would then first need to detect whether this is still 'encoded', which I think is quite a nuisance. The second can be made a sensible string by calling this ugly piece of code:
<pre><code>s = 'FEFF004100420043'
''.join([c for c in (chr(int(s[i:i+2], 16)) for i in range(0, len(s), 2)) if c.isascii()]).replace('\x00', '')
&gt;&gt;&gt; 'ABC'
</code></pre>
<hr />
<h2>Borb</h2>
<code>pip install borb</code>
<pre><code>import borb.pdf.pdf
h = open(r'path/to/file.pdf', 'rb')
d: borb.pdf.document.Document = borb.pdf.pdf.PDF.loads(h)
str(d.get_document_info().get_author())
</code></pre>
This returns a string - if you wrapped it with <code>str</code>. Loading a sizeable PDF takes a long time. I had one PDF on which borb choked with a TypeError exception. See also the examples on <a href="https://github.com/jorisschellekens/borb-examples#31-extracting-meta-information" rel="nofollow noreferrer">borb's dedicated example repo</a>.

How can I read the properties/metadata like Title, Author, Subject and Keywords stored on a PDF file using Python?

Reading the PDF properties/metadata in Python

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

如何使用Python读取存储在PDF文件中的属性/元数据，如标题、作者、主题和关键字？

问在Python中读取PDF属性/元数据
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在Python中读取PDF属性/元数据EN