bs4)每次解析html会产生多少个请求？

bs4是Python中的一个库，用于解析HTML和XML文档。它并不会产生请求，而是通过解析已经获取到的HTML文档来提取所需的信息。

在使用bs4解析HTML时，通常需要先获取HTML文档，可以通过网络请求、本地文件读取等方式获取。获取HTML文档的过程中可能会产生请求，但这与bs4本身无关。

因此，使用bs4解析HTML不会产生请求，而是通过解析已经获取到的HTML文档来提取信息。

页面内容是否对你有帮助？

有帮助

没帮助

BeautifulSoup (bs4)解析错误

、、、

使用bs4解析此示例文档，来自python2.7.6： <html> <body> <p>HTML allows omitting P end-tags. <p>Like that and this. <p>And this, too. <p>What happened?</p> <p>And can we <p>nest a paragraph, too?</p></p> </body> </html> 使用： from bs4

浏览 1提问于2015-04-29得票数 3

回答已采纳

3回答

python urllib2 -在所有脚本运行后读取页面

、、

为了从页面中提取数据，我尝试用urllib2读取一个页面。页面的一部分是每次加载生成的，当我使用urllib2读取url时，这个部分不在我要得到的html中。 url是，我正在尝试获取为图生成的表。例如： <div aria-label="A tabular representation of the data in the chart." style="position: absolute; left: -10000px; top: auto; width: 1px; height: 1px; overflow: hidden;">

浏览 7提问于2015-01-23得票数 2

回答已采纳

1回答

如何从使用javascript生成的工具提示中刮取文本

、、、

我编写了下面的代码来获取地图中所有蓝色标记的位置。 from bs4 import BeautifulSoup from requests_html import HTMLSession session = HTMLSession() url="https://emf2.bundesnetzagentur.de/karte/Default.aspx?lat=52.4107723&lon=14.2930953&zoom=14" r = session.get(url) r.html.render(sleep = 3) data = r.html.html so

浏览 2提问于2020-02-03得票数 0

回答已采纳

1回答

beautifulsoup4:得到href但返回"#“

、、

我正在使用bs4从一个站点获得一些href。 <a class="aaa" target="12345" href="someURL" data-track="HOT:SR:HotelModule" tabindex="0"> <span class="visuallyhidden"> some text here </span> </a> HTML类似于上面的内容。我可以使用以下代码获得大部分URL

浏览 5提问于2017-03-22得票数 0

回答已采纳

2回答

当网络抓取时，我们把"html.parser“的论点放在哪里？

、、、

请看下面的代码片段 import requests from bs4 import BeautifulSoup url = #Insert url here # Method 1 html = requests.get(url, "html.parser") soup = BeautifulSoup( html.text ) #Method 2 html2 = requests.get(url) soup2 = BeautifulSoup( html.text, "html.parser") 哪种方法是正确的？方法1还是方法2？我们应该将"html.

浏览 2提问于2020-08-11得票数 1

回答已采纳

2回答

使用BS4和Request Python3抓取产品数据时不返回任何提示

、、、

希望你们一切都好。我正在尝试从中抓取特定的产品，以便获得产品的数据，例如可用尺寸。问题是，每次我尝试运行我的脚本时，都没有返回任何结果。提前谢谢你！ import requests from bs4 import BeautifulSoup url = 'https://www.footlocker.fr/fr/p/jordan-1-mid-bebe-chaussures-69677?v=316161155904' page = requests.get(url) soup = BeautifulSoup(page.content, 'html.parser'

浏览 0提问于2020-05-11得票数 0

1回答

BeautifulSoup4缺失标签

、、、

我在Anaconda的发行版中使用BeautifulSoup 4作为bs4。如果我错了，请纠正我--我理解BeautifulSoup是用来将格式不正确的HTML转换成格式良好的HTML的库。但是，当我将HTML赋值给它的构造函数时，我损失了一半以上的字符。它不应该只是修复HTML而不是清理它吗？在中，它不是很好的描述。这是代码： from bs4 import BeautifulSoup soup = BeautifulSoup(html) 其中html是谷歌主页的HTML。编辑：可能是因为我通过str(soup)检索HTML字符串的方式

浏览 2提问于2015-03-12得票数 3

回答已采纳

2回答

BeautifulSoup返回空方括号

、、、

我试图用python的bs4库在谷歌中搜索到多少个结果，但当我这样做的时候，它返回了空括号。下面是我的代码： import requests from bs4 import BeautifulSoup url_page = 'https://www.google.com/search?q=covid&oq=covid&aqs=chrome.0.0i433l2j0i131i433j0i433j0i131i433l2j0j0i131i433j0i433j0i131i433.691j0j7&sourceid=chrome&ie=UTF-8' p

浏览 0提问于2021-04-28得票数 1

1回答

Python:无法从网站中提取tbody信息

、、、

我想提取这个网站的所有链接：我想要的信息存储在tbody：中。每次我试图提取数据，我都没有得到任何结果。 from bs4 import BeautifulSoup import requests from requests_html import HTMLSession url = "https://pflegefinder.bkk-dachverband.de/pflegeheime/searchresult.php?required=1&statistics=1&searchdata%5BmaxDistance%5D=0&searchdata%5

浏览 14提问于2022-01-20得票数 -1

回答已采纳

1回答

bs4)每次解析html会产生多少个请求？

、、

我通过requests得到了html源代码，我想将它们解析为blow(sudo代码)： import requests from bs4 import BeautifulSoup response = requests.get('https://www.example.com', headers=headers, params=params) html_doc = response.text soup = BeautifulSoup(html_doc, 'html.parser') item_ls = [] for elem in soup.select

浏览 20提问于2019-10-15得票数 0

回答已采纳

2回答

如何解析漂亮汤中的cookie文件

、

我的组织需要我认证一个双因素认证来刮取一个内部网站。每次打开浏览器时，它都会要求进行身份验证。身份验证cookie存储在c://users//.way//cookie.bat中。我想使用这个曲奇文件刮一个内部网站。有人能帮我吗？样本程序 from bs4 import BeautifulSoup import requests header={'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.11

浏览 9提问于2022-10-27得票数 0

5回答

获得输出0，即使有25个相同的类

、、、

我想看看这个页面上有多少个类，但是输出是0。我使用BeautifulSoup已经有一段时间了，但从未见过这样的错误。 from bs4 import BeautifulSoup import requests result = requests.get("https://www.holonis.com/motivationquotes") c = result.content soup = BeautifulSoup(c) samples = soup.findAll("div", {"class": "ng-scope"

浏览 0提问于2018-04-04得票数 0

回答已采纳

2回答

如何防止BeautifulSoup4向汤中添加额外的<html><body>标记？

、

在3之前的BeautifulSoup版本中，我可以获取任意块的HTML并以这种方式获得字符串表示： from BeautifulSoup import BeautifulSoup soup3 = BeautifulSoup('<div><b>soup 3</b></div>') print unicode(soup3) '<div><b>soup</b></div>' 但是，对于BeautifulSoup4，相同的操作会创建额外的标记： from bs4 im

浏览 6提问于2013-04-12得票数 17

回答已采纳

1回答

配置自动关闭标签

、、、

让我们以例子来解释我的问题： from bs4 import BeautifulSoup txt = """

浏览 1提问于2014-04-09得票数 0

回答已采纳

1回答

不要从汤中获取数据

、、

我用python创建了bs4网络抓取应用程序。我的程序返回空列表以供审阅。因为汤程序正常运行。 from bs4 import BeautifulSoup import requests import pandas as pd data = [] usernames = [] titles = [] comments = [] result = requests.get('https://www.kupujemprodajem.com/review.php?action=list') soup = BeautifulSoup(result.text, 'html.

浏览 7提问于2021-12-10得票数 -1

2回答

使用find时BeautifulSoup挂起

、、、、

我对bs4包有一个问题。我有一个html文档，如下所示： data = """<html><head></head><body> <p> this is tab </p> <img src="image.jpg"> </body></html> """ 这是我的代码： from bs4 import BeautifulSoup soup = BeautifulSoup(data, 'html5lib') s

浏览 0提问于2016-03-18得票数 3

1回答

在iframe内刮上美汤

、、、

我正在使用Beautifulsoup进行抓取。但是由于目标是价格是在iframe中，所以无法获得目标。目标如下。 <span class="last">1,025.5</span> 你能告诉我怎样才能找到目标吗？我的代码如下。 stock = "" import requests from bs4 import BeautifulSoup url = 'https://www.xxxxxx.com/jp/ir/' html = requests.get(url) soup = BeautifulSoup(html.

浏览 0提问于2018-09-25得票数 2

1回答

正在阅读网站中的内容，无法打开

、

我使用的是Python 2.7.9 我试着打开和阅读一个网站，但我得到的错误如下: 11001 getaddrsinfo或没有连接...机器主动拒绝它事实上，当我试图打开一个网站，想要阅读它的时候，我永远无法打开它。我认为问题出在系统配置上。使用webdriver，我可以打开一个网站，但我不知道如何阅读这些内容。你能帮帮忙吗？这是我使用的代码，有不同的可能性，但总是有相同的错误。 import socket import os os.environ['http_proxy'] = '127.0.0.1:8080' import requests, r

浏览 0提问于2020-08-04得票数 0

1回答

属性错误：“NoneType”对象没有属性“父”

、、、、

from urllib.request import urlopen from bs4 import BeautifulSoup html= urlopen("http://www.pythonscraping.com/pages/page3.html") soup= BeautifulSoup(html.read()) print(soup.find("img",{"src":"../img/gifts/img1.jpg" }).parent.previous_sibling.get_text()) 上面的代码工作得很好，但是b

浏览 2提问于2017-04-18得票数 3

回答已采纳

2回答

BeautifulSoup返回vs来自Chrome的视频源(Zillow)

、、、

我一直在尝试从Zillow中抓取代码，但是漂亮的汤给出的代码比来自chrome的view-source要少得多。下面是我的代码： from bs4 import BeautifulSoup import requests from bs4 import BeautifulSoup import requests url='https://www.zillow.com/homedetails/49-Mountain-St-Hartford-CT-06106/58139903_zpid/' html=requests.get(url) bs = BeautifulSoup(ht

浏览 6提问于2021-11-27得票数 0

3回答

使用请求Python登录网站

、、、

解决方案：这个特定站点的action是action="user/ajax/login"，因此为了实现有效负载，必须将其附加到主站点的url中。(action可以通过在ctrl + f中搜索action在Page Source中找到)。url是将要刮掉的东西。with requests.Session() as s:是在站点内部维护cookies的内容，这就是允许一致刮取的内容。res变量是将有效负载发布到登录url中的响应，允许用户从特定的帐户页面中刮取。在发布之后，请求将达到指定的url。在此基础上，BeautifulSoup现在可以从accounts站点中获取和解析HTML

浏览 6提问于2020-05-24得票数 0

回答已采纳

1回答

列出网页上所有带扩展名的文件的路径

、

在python中是否有一个命令或方式请求库从网页上下载具有特定扩展名的所有文件？或者至少列出它们的完整路径，如ftp库中的nest命令？这是页面：，我想要扩展名为.grib的所有文件 import re from bs4 import BeautifulSoup as soup data_html = soup(r'https://gimms.gsfc.nasa.gov/SMOS/jbolten/FAS/L03/', 'lxml') # making soap links = data_html.findAll(href=re.compile("/.g

浏览 0提问于2018-07-13得票数 0

1回答

使用BeautifulSoup获取不可见的网页信息

、、、

我试图从网站"“获得一些信息，更准确地说，所有的个人估计，这是在页面的底部。但它只显示前30，然后您应该手动按下按钮“显示所有”，以获得另一个30等等。到目前为止，我的代码如下： from urllib import urlopen from bs4 import BeautifulSoup html = urlopen("https://www.estimize.com/jpm/fq3-2016#chart=table") soup = BeautifulSoup(html.read(), "html.parser") print(soup) 我看到印

浏览 3提问于2017-03-28得票数 1

回答已采纳

1回答

Div中的div破坏了整个div的提取- Python/BS4

、

这是我正在使用的HTML： <div id="post_message_64012736" class=" post"> <br> Just testing something, please ignore this :D<br> <br> <br> <br> <br> <div style="margin:20px; margin-top:5px; "> <div class="smallfont" style="

浏览 1提问于2013-06-17得票数 0

1回答

通过网络抓取我的成绩

、、、

我正在尝试创建一个程序，每天从一个网站上获取我的学校成绩。然后存储这些值并为我的成绩创建一个图表，但是当我尝试抓取页面时，我收到的HTML与使用inspect元素得到的HTML不同。 from urllib.request import urlopen from bs4 import BeautifulSoup html = urlopen("https://ames.usoe-dcs.org/Students/2567") bsObj = BeautifulSoup(html.read(), 'lxml'); print(bsObj) 检查元素给了我：而py

浏览 7提问于2017-02-20得票数 0

1回答

BeautifulSoup4:缺少分析过的表数据

、、

我试图通过BeautifulSoup 4从中提取每股收益数据。当我解析数据时，使用默认的lxml和HTML5解析器丢失表信息。我相信这与Javascript有关，我一直在尝试实现PyV8，将脚本转换为可读的BS4 HTML。问题是我不知道从这里往哪里走。你知道这是不是我的问题吗？我读了很多帖子，今天我很头疼。下面是一个简单的例子。financeWrap包含表信息，但是beautifulSoup显示它是空的。 import requests from bs4 import BeautifulSoup url = "http://financials.morningstar.com/

浏览 0提问于2014-10-21得票数 1

回答已采纳

1回答

具有无效路由的Web API返回HTML

、

我有一个使用属性路由的HTML2项目，如果请求了无效的路由，我会收到一个正文中包含WebAPI的404。它甚至没有命中我的初始DelegatingHandler。我需要做什么才能确保所有请求都通过WebAPI处理。此项目没有MVC。

浏览 0提问于2015-12-10得票数 1

2回答

从网页的图像中获取源代码

、、

所以我想从这个网站获取图片来源：但每次我尝试使用bs4时，我总是失败，我也尝试过其他帖子，但都不能正常工作。它一直返回None import requests import bs4 from bs4 import BeautifulSoup url = 'https://www.pixiv.net/en/artworks/77564597' r = requests.get(url) soup = BeautifulSoup(r.content, 'html.parser') x = soup.find("img") print(x)

浏览 0提问于2019-12-01得票数 0

2回答

BeautifulSoup lxml解析器结束标记

、、

我使用BeautifulSoup的lxml解析器来解析一些html。然而，它并没有像它写的那样被解析。例如，以下代码： import bs4 my_html = ''' <html> <body> <B> <P> Hello, I am some bolded text </P> </B> </body> </html> ''' soup = bs4.BeautifulSoup(my_html, 'lxml') print soup

浏览 4提问于2016-07-20得票数 1

回答已采纳

1回答

如何使用bs4从网站获取表格数据

、

我试图用bs4抓取一个网站，里面有一个表，但我得到的内容元素并不像我从inspect得到的那样完整。我在里面找不到标签<tr>和<td>。如何获取该站点的完整内容，尤其是表格的标记？下面是我的代码： from bs4 import BeautifulSoup import requests link = requests.get("https://pemilu2019.kpu.go.id/#/ppwp/hitung-suara/", verify = False) src = link.content soup = BeautifulSoup(sr

浏览 62提问于2019-04-24得票数 1

回答已采纳

2回答

使用BeautifulSoup的抓取范围

、、、、

我试着用BeautifulSoup抓取"span“标签。这是我的代码.. import urllib from bs4 import BeautifulSoup url="someurl" res=urllib.urlopen(url) html=res.read() soup=BeautifulSoup(html,"html.parser") soup.findAll("span") 但是当我这样做的时候，对于一些特定的网页。它没有列出所有的跨度。它只显示有限的否。跨度。但当我这么做的时候 soup.prettify() 它包含所有的跨

浏览 1提问于2015-12-19得票数 0

2回答

中的Web刮刀错误：“NoneType”对象不可调用

、、

目前正在使用漂亮的shop构建一个网络刮刀，以便通过提交ZIP代码获得车身商店位置的列表。() 当我指定一个元素的id (“dl”)并试图查看它是否工作时，它总是在下面说一个错误： search_box =soup.find_element_by_id(“dl”) TypeError：“NoneType”对象不可调用我已经找了好几个小时的答案，并决定问，因为我找不到解决办法。下面是我的代码： import requests from bs4 import BeautifulSoup URL = 'https://owners.honda.com/collision/pro

浏览 3提问于2019-11-25得票数 0

3回答

urlopen('http.....').read()中的read()做了什么？[urllib]

、、

嗨，我正在读"Web Scraping with Python (2015)“。我看到了以下两种打开url的方法，分别使用和不使用.read()。请参阅bs1和bs2 from urllib.request import urlopen from bs4 import BeautifulSoup html = urlopen('http://web.stanford.edu/~zlotnick/TextAsData/Web_Scraping_with_Beautiful_Soup.html') bs1 = BeautifulSoup(html.read(), '

浏览 3提问于2016-03-08得票数 8

回答已采纳

1回答

一个网站中的两个字符集，如何解析

、、

我最近正在学习python的知识，我想要废除一个网站。我从美丽的汤中得到警告：一些字符无法解码，并被替换字符替换。我谷歌的问题，我认为这可能是解码问题，我的代码可以顺利地废弃其他网站。那我该怎么办？这是我的密码： from urllib.request import urlopen from bs4 import BeautifulSoup code_type = 'utf-8' html = urlopen("http://news.sina.com.cn/") print(html) bsObj = BeautifulSoup(ht

浏览 3提问于2016-09-06得票数 0

1回答

ASP.NET Web.Config ConfigurationManager.AppSettings文件缓存

、、、

我使用ConfigurationManager.AppSettings集合从ASP.NET应用程序的Web.config文件中检索配置值。是否有人知道AppSettings中的值是否以某种方式缓存在内存中，或者是否每次检索设置时都会发生对Web.config的文件读取？ string someValue = ConfigurationManager.AppSettings["SomeSetting"]; 谢谢

浏览 0提问于2012-09-30得票数 6

回答已采纳

2回答

Python漂亮汤-获取输入值

、

我的计划是能够通过使用_AntiCsrfToken获取Bs4。我的HTML来自于这个HTML 我在代码中写的是 token = soup.find('input', {'name':'_AntiCsrfToken'})['value']) print(token) 但这让我说错了 Traceback (most recent call last): File "C:\Users\HelloWorld.py", line 67, in <module> print(soup.f

浏览 5提问于2017-09-03得票数 0

回答已采纳

1回答

Python: urllib urlopen卡住，超时错误

、、、、

正如标题所述，urlopen get卡在URL的打开过程中。 “守则”： from bs4 import BeautifulSoup as soup # HTML data structure from urllib.request import urlopen as uReq # Web client page_url = "https://store.hp.com/us/en/pdp/hp-laserjet-pro-m404n?jumpid=ma_weekly-deals_product-tile_printers_3_w1a52a_hp-laserjet-pro-m404&

浏览 1提问于2020-03-01得票数 0

1回答

bs4第二个注释<！->丢失了

、、

我正在使用BeautifulSoup进行python挑战级别-9。url = "“。Bs4.版本 == '4.3.2‘。它的页面源中有两个注释。汤的产量应如下。但是，在应用BeautifulSoup时，缺少第二个注释。听起来有点奇怪。有什么暗示吗？谢谢! import requests from bs4 import BeautifulSoup url = "http://www.pythonchallenge.com/pc/return/good.html" page = requests.get(url, auth = ("huge",

浏览 1提问于2014-10-25得票数 0

回答已采纳

1回答

用id网络抓取python <span>

、、、、

我想要在<span/>属性中使用BeautifulSoup为给定的网站报废数据。你可以在屏幕截图中看到它所在的位置。但是，我使用的代码只是返回一个空列表。我找不到我想要的名单上的数据。我做错了什么？ from bs4 import BeautifulSoup from urllib import request url = "http://144.122.167.229" opener = urllib.request.build_opener() opener.addheaders = [('User-agent', 'Mozilla/5

浏览 2提问于2018-02-22得票数 0

回答已采纳

1回答

如何避免Python BeautifulSoup中的错误

这是我的程序 from bs4 import BeautifulSoup import urllib2 url="http://www.moneycontrol.com/commodity/gold-price.html#05oct2013" content = urllib2.urlopen(url).read() soup = BeautifulSoup(content) 它给出以下错误 Traceback (most recent call last): File "<interactive input>", line 1, in <

浏览 2提问于2013-09-10得票数 1

1回答

如何通过编写python脚本从许多不同的html链接中提取电子邮件、电话、传真号码和地址？

、、、、

我试过这段代码，但它不能正常工作(不能从所有站点提取，等等，还有许多其他问题)。需要帮助！ from bs4 import BeautifulSoup import re import requests allsite = ["https://www.ionixxtech.com/", "https://sumatosoft.com", "https://4irelabs.com/", "https://www.leewayhertz.com/", "https://stackoverflow.

浏览 0提问于2020-05-26得票数 0

1回答

我在刮擦上做错了什么。不返回代码的值。

、、

我的代码适用于一个站点，而不是另一个站点。有人能帮我吗。 import requests from bs4 import BeautifulSoup URL = "https://www.homedepot.com/s/311256393" page = requests.get(URL) soup = BeautifulSoup(page.content, "html.parser") results = soup.find(id="root") print(results.prettify()) 下面的代码显示输出的地方，网站上

浏览 3提问于2021-11-15得票数 1

回答已采纳

1回答

发送一个post服务器并获取发送的页面和服务器，然后在会话中进行解析

try { org.jsoup.Connection.Response res = (org.jsoup.Connection.Response) Jsoup.connect(url) .data(postParams) .header(cookies1) .header("Cache-Control", "private") .header("Content-Leng

浏览 1提问于2013-08-15得票数 0

1回答

在带有Python请求的post请求中发送cookie的正确格式是什么？

、、

在用BS4解析cookie之前，我想将cookie设置为URL。首先，我不确定，我是否对cookie使用了正确的格式。以下是它们在chrome DevTool中的样子：名称: aep_usuc_f 价值: site=rus&c_tp=USD®ion=IE&b_locale=ru_RU 这是我的代码： url = 'https://example.com/item/123.html' cookies = {'aep_usuc_f': 'site=rus&c_tp=USD&region=IE&b_lo

浏览 0提问于2019-04-09得票数 1

回答已采纳

1回答

获取已擦伤的数据表Python时遇到的问题

import requests from bs4 import BeautifulSoup url = 'https://crypto.com/price' response = requests.get(url).text soup = BeautifulSoup(response,"html.parser") row = soup.find("tbody") for x in row : a = str(x.text) print(a) 如何正确地抓取数据？

浏览 4提问于2022-05-02得票数 -1

1回答

BeautifulSoup在Try/Except循环中无法正确解析HTML

、

我在使用BS4解析文档时遇到了问题，我不确定发生了什么。响应代码是OK的，url是好的，代理可以工作，一切都很好，代理洗牌按预期工作，但是使用除html5lib之外的任何解析器，soup都是空白的。html5lib返回的汤在<body>标签处停止。我在Colab工作，我已经能够在另一个笔记本上成功地运行这个功能的一部分，并且已经能够循环通过一组搜索结果，从链接中获取我想要的数据，但我的目标网站最终会阻止我，所以我切换到使用代理。 check(proxy)是一个帮助器函数，它会在尝试向目标站点发出请求之前检查代理列表。当我将它包含在try/except中时，问题似乎已经开始了。我推

浏览 7提问于2019-10-26得票数 0

回答已采纳

1回答

在Scrapy中利用Beautifulsoup

、、、

我已经用Scrapy创建了一个简单的爬虫程序，它从给定的链接开始，跟踪给定DEPTH_LIMIT中的所有链接，由于项目参数的原因，每次运行爬行器时都会对其进行调整。为了简单起见，该脚本打印响应URL。 import scrapy from scrapy.spiders import CrawlSpider, Rule from scrapy.linkextractors import LinkExtractor from NONPROF.items import NonprofItem from scrapy.http import Request import re class Nonpr

浏览 12提问于2018-01-04得票数 2

回答已采纳

1回答

我正在尝试用Python抓取QS世界大学排名

、

我试图从QS排名网站中提取大学名称，排名和学术声誉。(地址如下)“学术声誉”数据在“排名指标”选项卡中。 "“ 首先，我尝试用Python获取大学名称，但没有成功。这段代码似乎给出了很多'a‘标签数据，但我无法获得带有"uni-link“类的大学名称。有人能帮我改进我的代码吗？ from bs4 import BeautifulSoup import requests url="https://www.topuniversities.com/university-rankings/world-university-rankings/2022" res

浏览 3提问于2021-11-25得票数 0

1回答

无法从web获得下载链接

、、

我试着从这个网站下载所有的报告：，但我无法自动找到链接与美丽的汤和请求。有人能帮我吗？到目前为止，我已经尝试了以下代码： from bs4 import BeautifulSoup from urllib.request import Request, urlopen import re req = Request("https://www.opec.org/opec_web/static_files_project/media") html_page = urlopen(req) soup = BeautifulSoup(html_page, "lxml

浏览 0提问于2019-03-16得票数 1

回答已采纳

2回答

如何从udemy网站找到价格与网络抓取？

、、

我正在使用python的精美汤包来找到课程的价格。用漂亮的汤，我得到的价格是美元，当我把它换算成卢比时，它是不同的。 price in udemy website : 700 price by beautiful soup : 13.99$ 我试图通过计算不同的课程比率来寻找逻辑，但它不起作用。下面是我的代码： from bs4 import BeautifulSoup import requests page = requests.get('https://www.udemy.com/course/python-data-science-machine-learning-bootca

浏览 0提问于2020-05-20得票数 1