我尝试了两种方法来下载XML文件:
import requests
from tqdm import tqdm
url = "http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/6.2/msigdb_v6.2.xml"
response = requests.get(url, stream=True)
with open("lol.xml", "wb") as handle:
for data in tqdm(response.iter_content()):
handle.write(data)
第二个是:
import urllib2
response = urllib2.urlopen(url)
data = response.read()
print(data)
该URL被重定向至:
response.url
u'https://software.broadinstitute.org/gsea/login.jsp;jsessionid=2544FF431CB094FBBA80451EDD3A0411'
结果,我只下载了html文件,而没有下载xml文件,这是输出的代码片段:
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<base href="http://software.broadinstitute.org/gsea/" />
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1" />
<meta name="verify-v1" content="/23Jlayki9tnRqU7DcCYrbFI7zPmHJ3HfeZltM6mK5Q=" />
<title>GSEA | Login</title>
<link href="css/style.css" rel="stylesheet" type="text/css" />
</head>
如何下载XML文件?
发布于 2018-07-26 01:52:02
试一试
file.write(response.content)
取而代之的是第一个方法的最后两行。问题可能是您无法直接从此链接下载文件(需要登录):
url = "http://software.broadinstitute.org/gsea/msigdb/download_file.jsp?filePath=/resources/msigdb/6.2/msigdb_v6.2.xml"
https://stackoverflow.com/questions/51524366
复制相似问题