无法使用BeautifulSoup Python找到HTML元素_无法使用Python BeautifulSoup找到表_无法使用BeautifulSoup找到特定表 - 腾讯云开发者社区

python、python-3.x、beautifulsoup、web-crawler、html-parsing

我正在用Python3.5开发一个网络爬虫。使用请求和Beautifulsoup4。我正在尝试获得所有主题的链接在论坛的第一页。并将它们添加到列表中。我有两个问题： 1)不确定如何使用beautifulsoup获得链接，我无法进入链接本身，只有div 2) Beautifulsoup似乎只返回了几个主题，而不是所有的主题。 def getTopics(): topics = [] url = 'http://forum.jogos.uol.com.br/pc_f_40' source_code = requests.get(url) plain_text = source_c

浏览 1提问于2015-10-28得票数 1

1回答

在漂亮的汤网里一丝不挂

python、html、web-scraping、beautifulsoup

我的意愿我想要刮从提交用户使用漂亮的汤与python。我的问题获得none作为我的脚本的结果。我的代码 from bs4 import BeautifulSoup import requests html = requests.get('https://github.com/pnp/cli-microsoft365').text soup = BeautifulSoup(html, 'html.parser') commits = soup.select_one('svg.octicon.octicon-history + span stron

浏览 1提问于2021-10-16得票数 1

回答已采纳

1回答

如何使用python访问onclick href？

python-3.x、web-scraping

如何使用python访问onclick href？我想要删除的网站的html代码和我的代码如下： HTML代码： < a class="" onclick href="http://moodle/mod/resource/view.php?id=394" id="yui_3_17_2_1_1505961565869_139"> <span class="instancename" id="yui_3_17_2_1_1505961565869_138">Lecture 1- 4

浏览 1提问于2017-09-21得票数 0

2回答

Spyder3崩溃后，安装jupyter-记事本

ubuntu-16.04、spyder

在笔记本电脑中，我使用的是Spyder3，在安装Jupyter-记事本之前没有任何问题。当从命令行运行spyder3时，将出现下一条消息：文件"/usr/lib/python2.7/dist-packages/bs4/builder/_html5lib.py"，第70行，在TreeBuilderForHtml5lib(html5lib.treebuilders._base.TreeBuilder)：类中 AttributeError：“模块”对象没有属性“_base” 经过一些搜索后，尝试以下建议的解决方案：尝试： sudo pip安装-升级beautifulsou

浏览 0提问于2018-09-22得票数 1

回答已采纳

1回答

基于h3日期和相关列表项修改HTML

python、html、beautifulsoup

我对Python非常陌生，无法理解这一点。我希望有一个脚本来完成以下工作： earlierRemoves 读取文件在h3标记中查找昨天的日期，或查找与无关的所有日期。任何洞察力都会受到极大的赞赏--我已经和BeautifulSoup混在一起了，但我不确定我是否有足够的经验或知识把它整合在一起。下面是我的尝试，它成功地删除了昨天h3标记之间的日期，但我不知道如何处理与前面的h3标记相关联的不同长度的列表项。 from datetime import datetime, timedelta from bs4 import BeautifulSoup # parse html h =

浏览 6提问于2022-05-21得票数 -1

回答已采纳

1回答

如何在python中输入html

python、beautifulsoup

我想将html文档输入到python中。我知道这个错误： UnicodeDecodeError：'cp950‘编解码器无法在位置解码字节0 0xbb 362:非法多字节序列使用此代码时： from bs4 import BeautifulSoup soup = BeautifulSoup(open(xxx.html)) print(soup) 我做错了什么？

浏览 5提问于2017-09-23得票数 0

1回答

HTML文本上的BeautifulSoup解析数字

python、parsing、beautifulsoup

我正在学习Python，并试图使用BeautifulSoup解析数据。我希望它打印的IPv4，而不是IPv6的地址，从一个网站。当第一次出现在html标记中的IPv6地址时，我似乎不明白为什么它会在IPv4上解析IPv4地址。感谢你在这方面的任何帮助。 import urllib2 from bs4 import BeautifulSoup page = urllib2.urlopen("http://www.whatsmyip.net") pagehtml = page.read() page.close() soup = BeautifulSoup(pagehtml)

浏览 1提问于2014-04-25得票数 1

回答已采纳

1回答

如何使用BeautifulSoup匹配嵌入了<a></a>的<div></div>中的文本？

python、html、beautifulsoup

我在test.py中有以下BeautifulSoup代码。 #!/usr/bin/env python # vim: set noexpandtab tabstop=2 shiftwidth=2 softtabstop=-1: from bs4 import BeautifulSoup import sys soup = BeautifulSoup(sys.stdin.read(), 'html.parser', from_encoding='utf-8') import re from pprint import pprint pprint(soup.f

浏览 1提问于2016-01-03得票数 1

2回答

如何查找具有特定值的文本BeautifulSoup python2.7

python、html、beautifulsoup

我有下面的html:我正在尝试将下面的数字保存为变量，7,148.49，HatchBack，Good。我遇到的问题是，我无法独立地拔出它们，因为它们没有附加类。我在想怎么解决这个问题。下面是html，然后是我解决这个问题的徒劳无功的代码。 </div> <div class="car-profile-info"> <div class="col-md-12 no-padding"> <div class="col-md-6 no-padding"> <strong>Status:<

浏览 2提问于2016-03-05得票数 1

回答已采纳

3回答

在replaceWith()不起作用后查找(使用BeautifulSoup)

python、find、beautifulsoup

请考虑以下python会话： >>> from BeautifulSoup import BeautifulSoup >>> s = BeautifulSoup("<p>This <i>is</i> a <i>test</i>.</p>"); myi = s.find("i") >>> myi.replaceWith(BeautifulSoup("was")) >>> s.find("i"

浏览 0提问于2013-03-17得票数 6

回答已采纳

2回答

如何在python脚本中导入.py

python、beautifulsoup

我试图在python脚本中直接导入BeautifulSoup库，但我无法安装它，因为我在语法DS213+中使用它，所以我尝试这样做： from BeautifulSoup import BeautifulSoup import urllib, urllib2 opener = urllib2.build_opener(urllib2.HTTPHandler(debuglevel=0)) opener.addheaders = [('User-agent', 'Mozilla/5.0')] ins = open( "str.txt", "

浏览 3提问于2014-02-24得票数 0

回答已采纳

3回答

无法使用BeautifulSoup从span元素中收集属性

python、html、beautifulsoup

是我希望使用BeautifulSoup从下面的站点()解析的源代码的映像。我希望提取< span class=‘print’>属性中的属性: htm链接. 我的python代码如下所示： import urllib.request try:

浏览 10提问于2017-08-01得票数 0

回答已采纳

2回答

无法在web上抓取所有数据，而不是获取所有<td>值

python、web-scraping

我试图为这个站点抓取html表，但无法获取chhange(24h)列。 from requests import get from urllib.request import urlopen from bs4 import BeautifulSoup import pandas as pd import matplotlib.pyplot as plt content = urlopen("https://coinmarketcap.com/") soup = BeautifulSoup(content, 'html.parser') rows = soup.

浏览 0提问于2018-08-14得票数 0

回答已采纳

1回答

Python BeautifulSoup无法解析每一项

python、html、parsing、beautifulsoup

我正在解析来自一个网站的数据，我已经将其保存为本地文件。我可以毫无问题地解析一些文本，然而，下一个问题是我遇到困难的地方。我要解析的html被注释掉了，所以我将数据保存到本地文件并转换为html。我可以导航到tbody，但无法获取每个tr。for循环似乎在第一次迭代时就卡住了。 import requests from bs4 import BeautifulSoup from bs4 import Comment from csv import writer response = requests.get('https://www.pro-football-reference.c

浏览 0提问于2018-08-02得票数 0

回答已采纳

1回答

网络抓取-使用BeautifulSoup

python、python-3.x、web-scraping、beautifulsoup

我刚接触漂亮的汤，在篮球参考中使用它也有困难。我正在尝试将高级统计数据的整个数据帧存储到pandas数据帧中，但我甚至无法选择它。到目前为止，我的代码如下： from urllib.request import urlopen from bs4 import BeautifulSoup import pandas as pd url='http://www.basketball-reference.com/teams/ATL/2016.html' html = urlopen(url) soup = BeautifulSoup(html) soup.findAll(

浏览 0提问于2016-01-14得票数 0

2回答

如何使用selenium在网页上执行所有javascript内容，以便在已满载的网页上查找和发送登录表单信息

python、selenium、selenium-webdriver、frame、webdriverwait

我一直试图制作一个Python脚本来登录到某个网站，浏览菜单，填写表单并将它生成的文件保存到文件夹中。我一直在使用Selenium来使网站完全加载，这样我就可以找到登录的元素，但我并不成功，可能是因为该网站在完全加载之前做了大量的JavaScript内容，但我无法让它完全加载并显示我想要的数据。我尝试了Robobrowser、Selenium、Request和BeautifulSoup来完成它。 import requests from bs4 import BeautifulSoup from selenium import webdriver url = "https://d

浏览 0提问于2019-04-16得票数 2

回答已采纳

1回答

如何在指定的类中找到包含“美丽汤”的链接

python、beautifulsoup

我使用“美丽汤4”解析一个新闻站点，以获得正文文本中包含的链接。我找到了包含链接的所有段落，但是paragraph.get('href')返回了每个链接的none类型。我正在使用Python3.5.1。任何帮助都是非常感谢的。 from bs4 import BeautifulSoup import urllib.request import re soup = BeautifulSoup("http://www.cnn.com/2016/11/18/opinions/how-do-you-deal-with-donald-trump-dantonio/index.h

浏览 1提问于2016-11-19得票数 1

回答已采纳

1回答

从Understat.com中抓取特定元素

python、web-scraping

我想从此站点上的多个匹配中检索特定的统计数据(PPDA)： https//understat.com/match/xxxx 我已经创建了以下代码来解析HTML并使用Python遍历每个匹配项，但是我正在努力解决如何提取特定的统计数据并将其加载到csv和图形中的问题。我是一个初学者，任何帮助都将不胜感激！代码： import pandas as pd import re import random import requests from bs4 import BeautifulSoup from selenium import webdriver import datetime impor

浏览 18提问于2019-02-15得票数 0

回答已采纳

1回答

元素依次返回None。

python、beautifulsoup

我只是想简单地获得subredit上的用户数。当我打开HTML时，我可以看到它。 <div class="_3XFx6CfPlg-4Usgxm0gK8R">55.3k</div> 我编写了一些python代码来尝试获取数字： import requests from bs4 import BeautifulSoup url = "https://www.reddit.com/r/TowerofGod/" response = requests.get(url) soup = BeautifulSoup(response.text,

浏览 2提问于2020-06-18得票数 2

回答已采纳

2回答

如何找到reddit帖子上的点击数

python、html、python-3.x、web-scraping、beautifulsoup

import results as results import soup as soup from bs4 import BeautifulSoup import requests import os, os.path, csv from sqlalchemy.sql.operators import div page = requests.get(URL) soup = BeautifulSoup(page.content, 'html.parser') UpvoteCount = resul

浏览 28提问于2020-09-18得票数 1

1回答

为什么漂亮汤没有正确解析元素名为"area"？

python、xml、parsing、beautifulsoup

我正在编写一个使用beautiful soup解析xml文档的python脚本。有些文档包含名为"area“的元素。由于某些原因，我无法正确地解析这些元素。它们总是作为空的<area/>元素出现。下面是正在发生的事情的一个极小的例子： #!/usr/bin/python3.5 from bs4 import BeautifulSoup xml = """"" <?xml version = '1.0' encoding = 'UTF-8' standalone = 'yes'?

浏览 4提问于2017-11-23得票数 2

回答已采纳

3回答

一个快速的python HTML解析器

python、html、xml、beautifulsoup

我写了一个python脚本，处理大量下载的网页HTML(120K页面)。我需要解析它们并从中提取一些信息。我试过使用BeautifulSoup，它简单直观，但运行起来似乎超级慢。因为这是必须在弱机器(在amazon上)上例行运行的东西，所以速度很重要。在python中有没有比BeautifulSoup快得多的HTML/XML解析器？或者我必须求助于正则表达式解析..

浏览 0提问于2012-03-13得票数 14

回答已采纳

4回答

ImportError:没有名为html.entities的模块

python

我正在尝试让这个模块在服务器上工作，但我在标题中得到了错误：我的脚本： from bs4 import BeautifulSoup 当我运行它时： aclark@tycho ~ % python test.py Traceback (most recent call last): File "test.py", line 1, in <module> from bs4 import BeautifulSoup File "/usr/lib/python2.7/site-packages/bs4/__init__.py", line

浏览 0提问于2014-12-09得票数 7

5回答

美丽的汤寻找隐藏风格的元素

python、html、beautifulsoup

我的简单需求。如何查找当前在网页上不可见的元素？我猜style="visibility:hidden"或style="display:none"是隐藏元素的简单方法，但BeautifulSoup不知道它是否隐藏。例如，HTML为： Textbox_Invisible1: <input id="tbi1" type="text" style="visibility:hidden"> Textbox_Invisible2: <input id="tbi2" type="tex

浏览 3提问于2011-12-21得票数 5

1回答

Python数据刮刀

python、web-scraping

我编写了以下代码 #!/usr/bin/python #weather.scraper from bs4 import BeautifulSoup import urllib def main(): """weather scraper""" r = urllib.urlopen("https://www.wunderground.com/history/airport/KPHL/2016/1/1/MonthlyHistory.html?&reqdb.zip=&reqdb.magic=&reqd

浏览 0提问于2016-04-04得票数 0

回答已采纳

2回答

无法从python中的html页面提取文本

python、beautifulsoup、html-parsing

我对网络抓取非常陌生。我读到了关于BeautifulSoup的文章，并试图使用它。但我无法提取具有给定类名“company-desc-and-排序容器”的文本。我甚至不能从html页面中提取标题。这是我尝试过的代码： from BeautifulSoup import BeautifulSoup import requests url= 'http://fortune.com/best-companies/' r = requests.get(url) soup = BeautifulSoup(r.text) #print soup.prettify()[0:10

浏览 5提问于2016-12-20得票数 1

回答已采纳

1回答

网络抓取:没有使用BeautifulSoup(page.content，'html.parser')返回正确的内容

python、html、web-scraping

我试图从AJIO网站上进行抓取，但Python获取的内容似乎与我在检查确切网页的元素时看到的内容不完全相同。在后端创建HTML页面的页面上似乎存在某种java代码，但是当我尝试用Python获取页面内容时，它会向我展示java代码，而不是确切的HTML页面。有人能对此提出解决方案吗？下面是我正在使用的代码。在下面的代码中，我在最后一行后得到错误"TypeError：'NoneType‘object是不可迭代的“，这是因为页面没有通过"soup=BeautifulSoup(page.text，’html.parser‘)被正确地获取。”我可以在检查HTML页面时看到“预

浏览 8提问于2021-12-28得票数 0

回答已采纳

1回答

如何选择所有的'a‘标签

python、html、beautifulsoup

我是BeautifulSoup和Python的新手。这是我的HTML： <html> <head></head> <body> <a href="https://google.com">Google</a> <a href="https://yahoo.com">Yahoo</a> </body> </html> 现在我的代码是： from bs4 import BeautifulSoup # Getting page souped ins

浏览 12提问于2020-11-06得票数 0

回答已采纳

1回答

Python3.6- BeautifulSoup4，解析表AttributeError: ResultSet对象没有属性“findAll”

python、python-3.x、beautifulsoup、html-parsing

我试图使用bs4解析一个包含加利福尼亚所有城市的表，但是我得到了下面的错误 AttributeError: ResultSet object has no attribute 'findAll'. You're probably treating a list of items like a single item. Did you call find_all() when you meant to call find()? 我尝试过使用find_all，findAll (就像这个论坛上其他帖子所建议的那样)，但是它也抛出了同样的错误。据我所知，我不能这样做，因为我的程

浏览 3提问于2017-10-20得票数 0

回答已采纳

1回答

如何存储解析后的html结果？

python、url、html-parsing、beautifulsoup、finance

我正在使用Python的HTMLParser和BeautifulSoup来解析雅虎的财务数据。已经有一个非常好的软件包可以做到这一点，但它没有得到“有形价格/账面价值”，也就是说，它在计算账面价值时包括了商誉和其他无形资产。因此，我不得不推出自己的解决方案。这并不是很好。下面是代码 from BeautifulSoup import BeautifulSoup import urllib2 from HTMLParser import HTMLParse class data(HTMLParser): def handle_data(self, data): pri

浏览 0提问于2012-08-05得票数 1

回答已采纳

1回答

在Python中将HTML表格转换为Pandas数据框

html、python-3.x、dataframe、web-scraping、beautifulsoup

在这里，我试图从Python代码中指定的网站中提取一个表。我能够得到HTML表，而且我无法使用Python转换为数据帧。以下是代码 # import libraries import requests from bs4 import BeautifulSoup # specify url url = 'http://my-trade.in/' # request html page = requests.get(url) # Parse html using BeautifulSoup, you can use a different parser like lxml

浏览 10提问于2019-07-10得票数 7

回答已采纳

1回答

无法解析python中响应中元素的值？

python、asp.net、beautifulsoup、python-requests、python-3.4

import requests from bs4 import BeautifulSoup s = requests.Session() content = s.get('https://nucleus.niituniversity.in/Default.aspx').content soup = BeautifulSoup(content,"html5lib") print("viewState = " + str(soup.select_one("#__VIEWSTATE")["value"])) print(

浏览 5提问于2016-08-11得票数 0

19回答

如何按类查找元素

python、html、web-scraping、beautifulsoup

我在使用Beautifulsoup解析带有"class“属性的HTML元素时遇到了问题。代码如下所示 soup = BeautifulSoup(sdata) mydivs = soup.findAll('div') for div in mydivs: if (div["class"] == "stylelistrow"): print div 在脚本结束后，我在同一行得到了一个错误。 File "./beautifulcoding.py", line 130, in getlanguage

浏览 6提问于2011-02-18得票数 532

回答已采纳

2回答

用漂亮汤刮掉雅虎财务的标准偏差

python、web-scraping、beautifulsoup

我试图使用BeautifulSoup和Python2.7：从雅虎财务网页上的风险统计表中提取一些数字到目前为止，我已经使用查看了html #!/usr/bin/python from bs4 import BeautifulSoup, Comment import urllib riskURL = "https://finance.yahoo.com/quote/SHSAX/risk" page = urllib.urlopen(riskURL) content = page.read().decode('utf-8') soup = B

浏览 0提问于2018-09-21得票数 2

回答已采纳

1回答

BeautifulSoup4找不到文章的深度

python、web、web-scraping、beautifulsoup、python-requests

我刚开始用python和BeautifulSoup做实验。我想获得与特定城市相关的文章的链接。下面是当前的代码 import requests from bs4 import BeautifulSoup city = "london" result = requests.get('https://www.origo.hu/kereses/index.html?q=' + city) def main_loop(): soup = BeautifulSoup(result.content, features="lxml")

浏览 2提问于2020-09-23得票数 0

回答已采纳

1回答

Python -使用web抓取下载视频

python

我正在尝试编写一个下载视频的函数，它使用网页的作为练习的参数。我基本上有两个问题。首先:我无法使用以下代码找到iframe源代码，以便在Python中切换到它。有没有什么原因或者是我遗漏了什么： import requests from bs4 import BeautifulSoup url = 'https://fmovies.wtf/film/adventures-of-rufus-the-fantastic-pet.72o71' r = requests.get(url) soup = BeautifulSoup(r.content,'html.parse

浏览 3提问于2020-05-29得票数 0

2回答

Python bs4如何从find_all()中找到内联样式

python、web、beautifulsoup

我正在尝试使用python和BeautifulSoup获取内联样式元素的高度值，我设法获得了带有特定类的所有div，但无法知道如何获得下面输出的内联style=height值，这是我的代码。 import requests from bs4 import BeautifulSoup URL = "https://exampleonly.org/" page = requests.get(URL) soup = BeautifulSoup(page.content, "html.parser") samples = soup.find_all("div

浏览 3提问于2022-05-28得票数 0

回答已采纳

2回答

复制python中嵌套的html列表？

python、html

我是一个初级程序员，所以这可能是一个很小的问题:我有一个.html文件，其中有一个嵌套很深的无序列表。例如，我如何在Python中将前4个嵌套级别复制到一个新的空.html文件中？我需要BeautifulSoup吗？为了更好地说明，这里是Javascript中显示效果的代码： function nestless(root, selector, level) { var use = root; for (var i = 0; i <= level; i++) { use += ' ' + selector; } $(use).

浏览 3提问于2012-07-20得票数 1

2回答

美丽的汤:获取子节点的内容

python、beautifulsoup

我有以下python代码： def scrapeSite(urlToCheck): html = urllib2.urlopen(urlToCheck).read() from BeautifulSoup import BeautifulSoup soup = BeautifulSoup(html) tdtags = soup.findAll('td', { "class" : "c" }) for t in tdtags: print t.encode('latin1

浏览 1提问于2010-10-21得票数 1

回答已采纳

1回答

为什么BeautifulSoup .children包含无名元素和预期的标记？

python、html-parsing、beautifulsoup

代码 #!/usr/bin/env python3 from bs4 import BeautifulSoup test="""<!DOCTYPE html> <html> <head> <meta content="text/html; charset=UTF-8" http-equiv="Content-Type"/> <title>Test</title> </head> <body> <table> <tbody

浏览 0提问于2013-08-17得票数 2

回答已采纳

1回答

从html中抓取一对标记。

python、html、python-3.x、web-scraping、beautifulsoup

我使用python3.6和Pycharm 2016.2作为编辑器。我想爬行"th“："td”标签中的内容对，如果"td“标签有一个子标记，它是带有”check=‘checked’“的输入标记。我尝试了regEx、BeautifulSoup和其他方面的find_all，但是仍然有错误消息。请帮帮忙。这是网站地址：下面是我的代码： from bs4 import BeautifulSoup import urllib.request from urllib.parse import urlparse import re popup_inspection =

浏览 2提问于2017-01-02得票数 0

回答已采纳

2回答

无法在漂亮的汤中解析html文件

python、html、beautifulsoup

代码： from bs4 import BeautifulSoup # Opening the html file HTMLFile = open("index.html", "r") # Reading the file contents = HTMLFile.read() # Creating a BeautifulSoup object and specifying the parser S = BeautifulSoup(contents, 'html.parser') print (S.find_all("

浏览 31提问于2021-11-11得票数 1

3回答

TypeError :在带有BeautifulSoup的Python中使用split时，'NoneType‘对象不可调用

python、beautifulsoup、python-requests

我今天尝试了一下BeautifulSoup和Requests API。所以我想我应该写一个简单的抓取器来跟踪深度为2的链接(如果这有意义的话)。我正在抓取的网页中的所有链接都是相对的。(例如：<a href="/free-man-aman-sethi/books/9788184001341.htm" title="A Free Man">)所以为了让它们成为绝对的，我想我应该使用urljoin将页面url与相对链接连接起来。为此，我必须首先从<a>标记中提取href值，为此，我想我应该使用split #!/bin/python #cra

浏览 2提问于2013-03-14得票数 2

回答已采纳

1回答

从URL抓取文本并在pi上显示

python、raspberry-pi

我正在从web服务器上获取文本，并试图在python上的raspberry pi屏幕上显示当前的歌曲。使用LCD 16x2 #!/usr/bin/python # Example using a character LCD connected to a Raspberry Pi or BeagleBone Black. import math import time import urllib2 from BeautifulSoup import BeautifulSoup import Adafruit_CharLCD as LCD page = urllib2.urlopen(&#

浏览 5提问于2014-07-24得票数 0

回答已采纳

4回答

用BeautifulSoup摘录标题

python-3.x、beautifulsoup

我有这个 from urllib import request url = "http://www.bbc.co.uk/news/election-us-2016-35791008" html = request.urlopen(url).read().decode('utf8') html[:60] from bs4 import BeautifulSoup raw = BeautifulSoup(html, 'html.parser').get_text() raw.find_all('title', limit=1) pr

浏览 5提问于2016-03-12得票数 20

回答已采纳

2回答

AttributeError: web爬取器中的“”NoneType“”对象没有属性“”findAll“”

python

我正在制作一个网页抓取程序，但这是我第一次。我使用的教程是为python 2.7构建的，但我使用的是3.8.2。我大部分时间都在编辑我的代码，使其适合python 3，但是弹出一个错误，我无法修复它。 import requests import csv from bs4 import BeautifulSoup url = 'http://www.showmeboone.com/sheriff/JailResidents/JailResidents.asp' response = requests.get(url) html = response.content so

浏览 85提问于2020-04-05得票数 1

回答已采纳

2回答

find_all在混合内容中找不到文本

python、regex、beautifulsoup

使用BeautifulSoup，我在Python中有一点点屏幕刮擦代码，这让我头疼。对html的小改动使我的代码中断，但我不明白为什么它不能工作。这基本上是一个html解析时的演示： soup=BeautifulSoup(""" <td> <a href="https://alink.com"> Foo Some text Bar </a> </td> """) links = soup.find_all('a',text=re.com

浏览 3提问于2014-12-20得票数 2

回答已采纳

1回答

Python漂亮汤find_all找不到<div class=“”>

python、html、python-3.x、beautifulsoup

我试着用漂亮的汤来找到HTML标签中的内容。但是，当标记为/div class=“"/时，它就不工作了。如果有双引号中的空间，则无法正确识别。这是我的密码： from bs4 import BeautifulSoup if __name__ == "__main__": soup = BeautifulSoup(open("1946.html", encoding='utf-8'), 'lxml') for k in (soup.find_all('div', class_=" ")):

浏览 2提问于2022-03-14得票数 -1

2回答

Bs4和requests

python、web-scraping、beautifulsoup

我试图制作一个，我想从那里得到口袋妖怪的描述，pokename的意思是口袋妖怪的名字。例如：有一个标签包含描述，而p标记的类是: version-xactive。当我打印描述的时候，我什么也得不到，有时什么也没有。下面是代码： import requests from bs4 import BeautifulSoup # Assign URL url = "https://www.pokemon.com/us/pokedex/"+text_id_name.get(1.0, "end-1c") # Fetch raw HTML content h

浏览 3提问于2021-04-28得票数 0

回答已采纳

1回答

PyDictionary/BeautifulSoup中的问题

python、macos、beautifulsoup、macos-sierra

我正在macOS塞拉利昂上运行Python3，需要创建由特定单词的同义词组成的句子。为此，我使用PyDictionary。但是，在运行我的代码(如下所示)时，我会得到一个错误(Python解释器)和一个警告(BeautifulSoup)。输出： /Library/Frameworks/Python.framework/Versions/3.5/lib/python3.5/site-packages/beautifulsoup4-4.5.3-py3.5.egg/bs4/__init__.py:181: UserWarning: No parser was e xplicitly specif

浏览 0提问于2017-02-20得票数 0