beautifulsoup decode_decode_linux decode - Tencent Cloud Developer Community

I am trying to parse a very long HTML file with BeautifulSoup using lxml. I know the file's character encoding is UTF-8 with BOM, but whenever I run contents = f.read() I get the following error: 'charmap' codec can't decode byte 0x8d in position 33222: character maps to <undefined> This is my first (and problematic) code: from bs4 import BeautifulSoup with open("doc.html", "

Views: 1 | Asked: 2019-12-22 | Votes: 2

Answer accepted
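
One common cause of this 'charmap' error is that open() without an encoding argument falls back to the platform's locale codec (often cp1252 on Windows) instead of UTF-8. Below is a minimal sketch, assuming the file really is UTF-8 with a BOM as the question states: the utf-8-sig codec decodes the text and strips the BOM. The helper name read_html_text and the demo file contents are illustrative and do not come from the question.

```python
import os
import tempfile


def read_html_text(path, encoding="utf-8-sig"):
    """Read an HTML file as text, naming the codec explicitly.

    "utf-8-sig" decodes UTF-8 and drops a leading BOM if one is present.
    """
    with open(path, encoding=encoding) as f:
        return f.read()


# Demo on a throwaway file (hypothetical content): BOM + UTF-8 text that
# contains the byte 0x8d as part of a multi-byte character.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "doc.html")
    with open(demo_path, "wb") as f:
        f.write(b"\xef\xbb\xbf<p>caf\xc3\xa9 \xc4\x8d</p>")
    contents = read_html_text(demo_path)
```

The resulting str could then be handed to BeautifulSoup(contents, "lxml") as in the question.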

1 answer

Python and BeautifulSoup encoding problem

I am scraping this URL and hit this error: movTitle = str(link['title']) UnicodeEncodeError: 'ascii' codec can't encode character u'\u2013' in position 41: ordinal not in range(128) Here is my code snippet: rajTamilurl='http://www.rajtamil.com/category/vijay-tv-shows/' req = urllib2.

Views: 4 | Asked: 2014-02-22 | Votes: 0

2 answers

Python Beautiful Soup with aiohttp

Does anyone know how to do: import html5lib import urllib from bs4 import BeautifulSoup soup = BeautifulSoup(urllib.request.urlopen('http://someWebSite.com').read().decode('utf-8'), 'html5lib') using aiohttp instead of urllib? Thanks ^^

Views: 6 | Asked: 2017-05-25 | Votes: 10

Answer accepted

1 answer

Problem reading an HTML text file into BeautifulSoup4

I am trying to read a saved copy of a world coronavirus HTML page with BeautifulSoup4 and Python 3 (Anaconda Jupyter Notebook). Here is my code: from bs4 import BeautifulSoup with open(r"c:\data\test.html") as fp: soup = BeautifulSoup(fp.read(), "html.parser") When I run it, I get the following error: ------------------------------------------------------------

Views: 9 | Asked: 2020-03-24 | Votes: 0

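1 answer

How do I open an HTML file as UTF-8 for parsing?

I am trying to parse an HTML file with BeautifulSoup in Python 3, but I get a UTF-8 decoding error. I tried adding the option to decode the file as UTF-8 when opening it, but the error still appears. How can I fix this? This is what I have so far. from bs4 import BeautifulSoup with open("file.html") as fp: unicode_html = fp.read().decode('utf-8', 'ignore') soup = BeautifulSoup( unico

Views: 3 | Asked: 2020-02-26 | Votes: 0

Answer accepted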
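
In Python 3, fp.read() on a file opened in text mode already returns str, and str has no decode method, so the .decode('utf-8', 'ignore') call in the snippet cannot work as written. Here is a minimal sketch of one alternative, assuming the goal is to tolerate undecodable bytes: obtain raw bytes and decode them exactly once. The helper name decode_html_bytes and the sample bytes are hypothetical.

```python
def decode_html_bytes(raw, encoding="utf-8", errors="ignore"):
    """Decode raw HTML bytes, applying the given error policy to invalid bytes."""
    return raw.decode(encoding, errors=errors)


# In Python 3 only bytes objects have decode(); str objects do not.
str_has_decode = hasattr("", "decode")

# 0xff can never occur in valid UTF-8, so it exercises the error handler.
sample = b"<p>ok \xff bad</p>"
ignored = decode_html_bytes(sample)                     # invalid byte dropped
replaced = decode_html_bytes(sample, errors="replace")  # invalid byte -> U+FFFD
```

For a file on disk, open(path, "rb").read() yields the bytes to decode; alternatively, open(path, encoding="utf-8", errors="ignore") applies the same policy while reading in text mode.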

3 answers

Correct Python encoding for a website (Beautiful Soup)

I am trying to load an HTML page and output its text. Although I get the right page, BeautifulSoup somehow breaks the encoding. Source: # -*- coding: utf-8 -*- import requests from BeautifulSoup import BeautifulSoup url = "http://www.columbia.edu/~fdc/utf8/" r = requests.get(url) encodedText = r.text.encode("utf-8") soup = BeautifulSoup(encodedText) tex

Views: 8 | Asked: 2016-04-25 | Votes: 13

Answer accepted

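5 answers

Python and BeautifulSoup encoding problem

I am writing a crawler with Python and BeautifulSoup, and everything went smoothly until I ran into this site. I fetch the content with the requests library: r = requests.get('http://www.elnorte.ec/') content = r.content If I print the content variable at this point, all the Spanish special characters appear fine. However, as soon as I feed the content variable to BeautifulSoup, everything gets garbled: soup = BeautifulSoup(content) print(soup) ... <a class="blogCalendarToday" href=&

Views: 0 | Asked: 2011-08-28 | Votes: 29

Answer accepted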
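
Garbling of this kind is often mojibake: bytes written in one encoding get decoded with another. The truncated snippet does not show which encoding was guessed for this site, so the following is only an illustrative sketch, assuming UTF-8 bytes were misread as Latin-1. The function names and the sample sentence are hypothetical.

```python
def simulate_mojibake(text, wrong_encoding="latin-1"):
    """Encode text as UTF-8, then decode the bytes with the wrong codec."""
    return text.encode("utf-8").decode(wrong_encoding)


def repair_mojibake(garbled, wrong_encoding="latin-1"):
    """Undo the mistake: recover the original bytes, then decode as UTF-8."""
    return garbled.encode(wrong_encoding).decode("utf-8")


original = "Se\u00f1or, \u00bfqu\u00e9 pas\u00f3?"  # Spanish sample text
garbled = simulate_mojibake(original)   # each accented letter becomes two characters
repaired = repair_mojibake(garbled)     # round-trips back to the original
```

When the markup is passed to bs4's BeautifulSoup as bytes, the from_encoding argument can be used to state the encoding instead of relying on detection.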

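1 answer

UnicodeDecodeError: 'ascii' codec can't decode byte 0xef in position 118374: ordinal not in range(128)

I am experimenting with some NLP algorithms, and my current focus is sentiment analysis. For that reason I downloaded some files in .review format containing positive and negative reviews. I am using BeautifulSoup to parse these XML files, and for now I only want to read them by running the following source code: from bs4 import BeautifulSoup positive_reviews = BeautifulSoup(open('*******/electronics/positive.review').read()) positive_reviews = positive_reviews.findAll('review_text'

Views: 0 | Asked: 2018-06-12 | Votes: 0

Answer accepted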

2 answers

BeautifulSoup with request or requests?

I have a problem. When I use BeautifulSoup with request: page = urlopen(url).read().decode('utf8') soup = BeautifulSoup(page) text = ' '.join(map(lambda p: p.text, soup.find_all('p'))) return soup.title.text, text I get nice output like this: Coronavirus: Johnson sets out 'ambitious' economic recover

Views: 27 | Asked: 2020-06-30 | Votes: 1

1 answer

BeautifulSoup code works in the IPython notebook but not in IPython

When run from a Jupyter IPython notebook, the following code works fine: from bs4 import BeautifulSoup xml_file_path = "<Path to XML file>" s = BeautifulSoup(open(xml_file_path), "xml") However, when run from Eclipse/PyDev (which uses the same Python interpreter), it fails while creating the soup: Traceback (most recent call last): File "~/parser/scratch.py", line 3, in <

Views: 0 | Asked: 2017-04-11 | Votes: 0

3 answers

Python 3 Beautiful Soup web scraping

I am currently using BeautifulSoup and seem to have some encoding-related problems. Here is my code: import requests from bs4 import BeautifulSoup req = requests.get('https://pythonprogramming.net/parsememcparseface/') soup = BeautifulSoup(req.content.decode('utf-8','ignore')) print(soup.find_all('p')) Here is my error: UnicodeEnc

Views: 2 | Asked: 2017-04-24 | Votes: 0

Answer accepted

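2 answers

How do I parse HTML with non-ASCII characters using BeautifulSoup?

While trying to parse some HTML with BeautifulSoup, I keep getting the following error: UnicodeDecodeError: 'ascii' codec can't decode byte 0xae in position 0: ordinal not in range(128) I have tried decoding the HTML using the solutions from the questions below, but still get the same error. I have tried every solution from the questions below and none of them worked (I am posting them so that I do not get duplicate answers, and in case looking at related approaches helps someone find a solution). Does anyone know where I went wrong? Is this a bug in BeautifulSoup? Should I install an earlier

Views: 0 | Asked: 2011-07-18 | Votes: 0

Answer accepted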

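1 answer

Python - capture all tables from an HTML page

I have emails with embedded HTML tables, and code that uses BeautifulSoup to extract the tables and the data in them. My problem is that it sometimes captures only one table when there are more. The code I usually run on these tables is: with open(file_path) as in_f: msg = email.message_from_file(in_f) html_msg = msg.get_payload(1) body = html_msg.get_payload(decode=True) html = body.decode() table = bs4.BeautifulSoup(html).find(

Views: 3 | Asked: 2017-06-06 | Votes: 0

Answer accepted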

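1 answer

Decoding HTML entities in Python 2

I have a string of escaped HTML markup, 'í', and I want it to have the correct accented character 'í'. After a lot of reading, this is my attempt: messy = 'í' print type(messy) >>> <type 'str'> decoded=messy.decode('utf-8') print decoded >>> í Drats. After some more reading, I tried this: from BeautifulSoup import * soup = B

Views: 3 | Asked: 2013-11-06 | Votes: 2

Answer accepted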
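
For reference, Python 3's standard library can turn HTML character references into characters directly. This is a minimal sketch, assuming the escaped string was a named or numeric reference such as &iacute; or &#237; (the exact entity did not survive in the snippet above). The question itself targets Python 2, where html.unescape is not available, so this shows the Python 3 equivalent rather than the asker's solution.

```python
import html

# Named and numeric character references both resolve to the same character.
named = html.unescape("&iacute;")
numeric = html.unescape("&#237;")

# References embedded in longer text are replaced in place.
mixed = html.unescape("caf&eacute; &amp; t&#233;")
```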

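1 answer

Encoding problem when reading .htm files with Python

I am trying to read a large number of .htm files with Python. For this I use the following code: HtmlFile = codecs.open(file, 'r') text = BeautifulSoup(HtmlFile.read()).text However, this results in the following error: UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 411: character maps to <undefined> So I tried specifying utf-8 as the encoding, like this: HtmlFile = codecs

Views: 35 | Asked: 2018-12-18 | Votes: 2

Answer accepted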

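2 answers

"illegal multibyte sequence" error from BeautifulSoup in Python 3

The .html file is saved on the local disk and I parse it with BeautifulSoup (bs4). It worked until the setup was recently changed to Python 3. I tested the same .html file on another machine with Python 2, where it works and returns the page content. soup = BeautifulSoup(open('page.html'), "lxml") The Python 3 machine does not work and says: UnicodeDecodeError: 'gbk' codec can't decode byte 0x92 in position 298670: illegal multibyte sequence

Views: 4 | Asked: 2019-10-09 | Votes: 2

Answer accepted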
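
The 'gbk' in the message suggests that open('page.html') decoded the file with the machine's locale codec, which is what text-mode open() does in many Python 3 versions when no encoding is given. Below is a minimal sketch of one way around that, assuming the parser should see the raw bytes: read the file in binary mode, which never raises a decode error. The helper name read_markup_bytes and the demo content are hypothetical, and treating byte 0x92 as a cp1252 right single quote is an assumption about the file.

```python
import locale
import os
import tempfile

# The codec that text-mode open() typically falls back to on this machine.
default_encoding = locale.getpreferredencoding(False)


def read_markup_bytes(path):
    """Return the file's raw bytes; binary mode performs no decoding."""
    with open(path, "rb") as f:
        return f.read()


with tempfile.TemporaryDirectory() as tmp:
    page_path = os.path.join(tmp, "page.html")
    with open(page_path, "wb") as f:
        f.write(b"<p>don\x92t</p>")  # 0x92 is a right single quote in cp1252
    markup = read_markup_bytes(page_path)

# Decoding explicitly once the real encoding is known (cp1252 assumed here).
decoded = markup.decode("cp1252")
```

bs4's BeautifulSoup accepts bytes and attempts its own encoding detection; when the encoding is known, it can be stated with the from_encoding argument.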

1 answer

Error using Python 3, Sublime, BeautifulSoup

In short, this is my code: import requests from bs4 import BeautifulSoup import re request = requests.get("http://www.yellowpages.com/") soup = BeautifulSoup(request.content) print(soup.find_all("a")) With the code above, I get the following error: [Decode error - output not utf-8] Does anyone know what is going on? How do I fix it? Cheers!

Views: 3 | Asked: 2015-04-29 | Votes: 0

Answer accepted

1 answer

$ sign and calculation in Python

I am trying to take a value that carries a $ sign and then subtract from it. I am not sure exactly how... sorry if this is a basic question, and thank you very much. import urllib.request from bs4 import BeautifulSoup import time urleth = "https://coinmarketcap.com/currencies/ethereum/" page = urllib.request.urlopen(urleth) content = page.read().decode('utf-8') soup = BeautifulSoup(conte

Views: 0 | Asked: 2017-08-20 | Votes: 0

Answer accepted

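2 answers

Accented-character errors using Beautiful Soup in Python on a local HTML file

I am very familiar with Beautiful Soup in Python and have always used it to scrape websites. Now I am scraping a local HTML file (in case you want to test the code), and the only problem is that accented characters are not represented correctly (this never happened when I scraped live sites). This is a simplified version of the code. import requests, urllib.request, time, unicodedata, csv from bs4 import BeautifulSoup soup = BeautifulSoup(open('AH.html'), "html.parser") tables = soup.find_all('t

Views: 7 | Asked: 2020-03-18 | Votes: 0

Answer accepted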

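1 answer

pandas shows "..." instead; how can I fix it?

I am trying to build a scraper, but it gives me problems because the URLs are not shown in full; only "..." is shown, and it also does not let me scrape what it should. This is the code: from bs4 import BeautifulSoup import requests import urllib.request import pandas as pd import numpy as np url = input("Url a scrapear: ") #https://www.plasticosur.com/hosteler%C3%ADa#/pageSize=36&viewMode=grid&orderB

Views: 7 | Asked: 2022-02-16 | Votes: 0

Answer accepted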

2 answers

How to read the HTML body from a website using Python version 3.x

I want to connect to a specific website link and receive the HTTP response. I have some Python code: import urllib.request import os,sys,re,datetime fp = urllib.request.urlopen("http://www.python.org") mybytes = fp.read() mystr = mybytes.decode(encoding=sys.stdout.encoding) fp.close() When I pass the response as an argument to: BeautifulSoup(str(mystr), 'html.parser') to get the cleaned

Views: 0 | Asked: 2015-08-14 | Votes: 0
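
The snippet decodes the response with sys.stdout.encoding, which describes the console rather than the page. Here is a minimal sketch of an alternative, assuming the server declares a charset in its Content-Type header: read the charset from the headers and fall back to UTF-8 otherwise. The helper name decode_body is hypothetical, and the headers are built by hand so the example runs offline. The headers attribute of a urllib.request.urlopen response is an email.message.Message subclass and offers the same get_content_charset() method.

```python
from email.message import Message


def decode_body(body, headers, fallback="utf-8"):
    """Decode response bytes using the charset from Content-Type, if any."""
    charset = headers.get_content_charset() or fallback
    return body.decode(charset, errors="replace")


# Hand-built headers standing in for a real response's headers.
headers = Message()
headers["Content-Type"] = "text/html; charset=ISO-8859-1"
text = decode_body(b"<p>caf\xe9</p>", headers)

# No charset declared: the fallback (UTF-8) is used instead.
no_charset = Message()
no_charset["Content-Type"] = "text/html"
fallback_text = decode_body("<p>caf\u00e9</p>".encode("utf-8"), no_charset)
```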

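1 answer

BeautifulSoup, Python 3, encoding error

I have an encoding problem with BeautifulSoup. In my development environment everything works (Ubuntu, Python 3.4, Django development server). On the production server (Ubuntu, Python 3.4, the same versions of Django and BeautifulSoup; the only difference is the use of gunicorn and Nginx) I get: 'ascii' codec can't decode byte 0xc3 in position 301: ordinal not in range(128) The traceback shows that the problem is in the "BeautifulSoup(data)" statement. data = open(os.path.jo

Views: 2 | Asked: 2014-12-01 | Votes: 3

Answer accepted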

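1 answer

Scraping data for company details

I am trying to scrape company names, postcodes, phone numbers and web addresses. I find it difficult because the information can only be retrieved after clicking an area on the page. I would be very grateful if someone could help. I am very new to Python, and especially to scraping! !pip install beautifulsoup4 !pip install urllib3 from bs4 import BeautifulSoup from urllib.request import urlopen url = "https://www.matki.co.uk/matki-dealers/" page = urlopen(url) html = page.re

Views: 6 | Asked: 2022-08-22 | Votes: 0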

1 answer

Scraping with BeautifulSoup returns empty results

I am scraping data with BeautifulSoup, but I get an empty result when selecting any tag. Here is my code. # -*- coding: utf-8 -*- import requests import sys from bs4 import BeautifulSoup headers = { 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.76 Safari/537.36'} boardgam

Views: 3 | Asked: 2020-02-21 | Votes: 0

Answer accepted

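2 answers

Cannot decode BeautifulSoup output in Python

I have been trying to write a small scraper in Python with BeautifulSoup. Everything went well until I tried to print (or write to a file) the strings contained in various HTML elements. The site I am scraping contains various French characters. For some reason, when I print the content to the terminal or to a file, I get raw unicode output instead of the decoded strings I expected. Here is the script: from BeautifulSoup import BeautifulSoup as bs import urllib as ul ##import re base_url = 'http://www.yellowpages.ca' data_file =

Views: 4 | Asked: 2011-12-03 | Votes: 0

Answer accepted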

3 answers

How do I extract latitude, longitude and place names from a website with Beautiful Soup and Python?

from bs4 import BeautifulSoup import requests import pandas as pd import re import json source = requests.get('https://www.cellcard.com.kh/en/detail/cellcard-shops/').text.encode('utf8').decode('ascii', 'ignore') response = BeautifulSoup(source, 'lxml') After that, I do not know how to extract the places' long

Views: 1 | Asked: 2020-10-08 | Votes: 1

Answer accepted

1 answer

Extracting text from HTML files with bs4

I want to extract text from my HTML files. If I use the following commands for a specific file: import bs4, sys from urllib import urlopen #filin = open(sys.argv[1], 'r') filin = '/home/iykeln/Desktop/R_work/file1.html' webpage = urlopen(filin).read().decode('utf-8') soup = bs4.BeautifulSoup(webpage) for node in soup.findAll('html&#

Views: 1 | Asked: 2013-08-04 | Votes: 4

Answer accepted

2 answers

Problem with encoding and urllib

I am using urllib to load a web page. It contains Russian characters, but the page encoding is 'utf-8'. 1. pageData = unicode(requestHandler.read()).decode('utf-8') UnicodeDecodeError: 'ascii' codec can't decode byte 0xd0 in position 262: ordinal not in range(128) 2. pageData = requestHandler.read() soupHandler = BeautifulSoup(pageData) print

Views: 0 | Asked: 2010-05-14 | Votes: 1

Answer accepted

2 answers

replace() on encoded BeautifulSoup output results in TypeError

I am trying to encode the text output received after parsing HTML data with the BeautifulSoup library in Python 3. gmtext.encode('ascii', errors='replace').replace("?", "") TypeError: a bytes-like object is required, not 'str'. Here is the code: import urllib.request as urllib2 from bs4 import BeautifulSoup articleURL = "http://digimon.wikia.com/wiki/Guilmon" page = url

Views: 0 | Asked: 2018-03-09 | Votes: 0

Answer accepted
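
The TypeError follows from the types involved: str.encode() returns bytes, and bytes.replace() only accepts bytes-like arguments, whereas "?" is a str. The sketch below shows two ways to get the intended result; the sample value of gmtext is hypothetical, since the real text comes from the parsed page.

```python
gmtext = "Guilmon \u2013 Digimon"  # hypothetical sample containing an en dash

# Option 1: stay in bytes and pass bytes arguments to replace().
ascii_bytes = gmtext.encode("ascii", errors="replace").replace(b"?", b"")

# Option 2: drop non-ASCII characters while encoding, then return to str.
ascii_text = gmtext.encode("ascii", errors="ignore").decode("ascii")
```

Option 1 also strips any literal question marks that were already in the text, which Option 2 avoids.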

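1 answer

Python: download multiple files from links on a page

I am trying to download all of the PGNs from this page. I think I have to use urlopen to open each URL and then use urlretrieve to download each PGN by accessing it from the download button near the bottom of each game. Do I have to create a new BeautifulSoup object for every game? I am also not sure how urlretrieve works. import urllib from urllib.request import urlopen, urlretrieve, quote from bs4 import BeautifulSoup url = 'http://www.chessgames.com/perl/chesscollection?cid=101449

Views: 0 | Asked: 2017-09-17 | Votes: 6

Answer accepted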

1 answer

Using BeautifulSoup 4 (lxml parser), how do I extract the inner HTML from a tag (decode_contents does not work)?

I am using BeautifulSoup 4 and Python 3.7. I want to extract the inner HTML from an article I found. I have this: soup = BeautifulSoup(html, features="lxml") ... article_elt = top_article_elt.select('div[class*="outer"]')[0] article = article_elt.decode_contents() ... print("article: " + str(article) + " score:" + str(

Views: 6 | Asked: 2019-12-08 | Votes: 0

Answer accepted

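1 answer

urllib redirect error

I am trying to use urllib and BeautifulSoup to scrape a table and get the error: "urllib.error.HTTPError: HTTP Error 302: The HTTP server returned a redirect error that would lead to an infinite loop. The last 30x error message was: Found" I have heard this is related to sites that require cookies, but after my second attempt I still get this error: import urllib.request from bs4 import BeautifulSoup import re opener = urllib.request.build_opener() opener.addheaders = [('User-agent

Views: 1 | Asked: 2017-08-22 | Votes: 0

Answer accepted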

1 answer

BeautifulSoup decoding error

I am trying to parse an HTML file generated by Evernote using Beautiful Soup. The code is: html = open('D:/page.html', 'r') soup = BeautifulSoup(html) It produces the following error: File "C:\Python33\lib\site-packages\bs4\__init__.py", line 161, in __init__ markup = markup.read() File "C:\Python33\lib\encodings\cp1252.py", line 23,

Views: 3 | Asked: 2014-06-23 | Votes: 9

Answer accepted

2 answers

Scraping incidents from a website table

I am trying to pull a table from a website that updates automatically at regular intervals into pandas. I tried: from urllib.request import urlopen, Request from bs4 import BeautifulSoup website = 'http://www.dallasfirerescue.com/active_incidents.html' req = Request(website) abc = urlopen(req) raw = abc.read().decode("utf-8") page = raw.replace('<!--&g

Views: 10 | Asked: 2018-02-18 | Votes: 0

1 answer

Cannot output in Russian, Python 2.7.9

I cannot get output in Russian, only Unicode output =( I use Python v2.7.9, Microsoft 8. How can I do this with a list? #! /usr/bin/env python # -*- coding: utf-8 -*- import requests from bs4 import BeautifulSoup r = requests.get("http://fs.to/video/films/group/film_genre/") response = r.content.decode('utf-8') page = Beautiful

Views: 2 | Asked: 2015-02-22 | Votes: 0

1 answer

Get the PDF in iframe id="swGoogleDrive"

How do I get the PDF found in the iframe at this URL? (1) The code below throws an error. import requests, re from bs4 import BeautifulSoup url = r'https://www.d88a.org/domain/102' headers = {'User-Agent': 'C19SchoolsWebscrape'} s = requests.Session() r = s.get(url, headers=headers) soup = BeautifulSoup(r.content,

Views: 18 | Asked: 2020-07-22 | Votes: 1

Answer accepted

1 answer

How do I encode an HTML file after reading it with ZipFile?

I am reading a zip file from a URL. Inside the zip file there is an HTML file. After I read the file, everything works. But when I print the text, I run into a Unicode problem. Python version: 3.8 from zipfile import ZipFile from io import BytesIO from bs4 import BeautifulSoup from lxml import html content = requests.get("www.url.com") zf = ZipFile(BytesIO(content.content)) file_name = zf.namel

Views: 13 | Asked: 2021-05-05 | Votes: 2

Answer accepted

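1 answer

Parsing HTML table tags with Beautiful Soup

I have the following task: use BeautifulSoup to find a specific table in an HTML page with the tag "<table" and the attribute 'BeautifulSoup collapsible collapsed' (only the second table from the start). When I organize the attribute structure like a dictionary, the program, for no apparent reason, reads all the attributes as a single item. I need them separate, like a dictionary object, and I want to extract only the second item. This is the code: from urllib.request import urlopen from bs4 import BeautifulSoup response = urlopen('file:///C:/Users/User/Documents/Visual%20Studio%202017

Views: 1 | Asked: 2020-07-21 | Votes: 1

Answer accepted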

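1 answer

Is the Stanza library slow?

I have two sets of code that count the number of sentences in a text file. The two options produce different results, and option 2 (Stanza) is very slow. Is option 2 (Stanza) more accurate? How can I speed up option 2 (Stanza)? Thank you very much! Option 1 (regex): the following code takes 2 seconds and outputs 1444. import requests from bs4 import BeautifulSoup import re sentence_regex = re.compile(r"\b[A-Z](?:[^\.!?]|\.\d)*[\.!?]") def identify_sentences(input_text:str): """R

Views: 3 | Asked: 2022-02-24 | Votes: 0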

4 answers

Extracting the title with BeautifulSoup

I have this: from urllib import request url = "http://www.bbc.co.uk/news/election-us-2016-35791008" html = request.urlopen(url).read().decode('utf8') html[:60] from bs4 import BeautifulSoup raw = BeautifulSoup(html, 'html.parser').get_text() raw.find_all('title', limit=1) pr

Views: 5 | Asked: 2016-03-12 | Votes: 20

Answer accepted
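
In the snippet, get_text() returns a plain string, so find_all is no longer available on raw; with bs4 the title is normally looked up on the soup object itself. To keep the example free of third-party dependencies, the sketch below swaps in the standard library's html.parser instead of BeautifulSoup. The class TitleParser, the function extract_title and the sample markup are all hypothetical.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collect the text found inside <title> elements."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Text may arrive in several chunks, so append to the current title.
        if self._in_title:
            self.titles[-1] += data


def extract_title(markup):
    """Return the first <title> text, stripped, or None if there is none."""
    parser = TitleParser()
    parser.feed(markup)
    parser.close()
    return parser.titles[0].strip() if parser.titles else None


sample_html = "<html><head><title> Election news </title></head><body><p>Body</p></body></html>"
page_title = extract_title(sample_html)
missing_title = extract_title("<p>no title here</p>")
```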

1 answer

Cannot read a wiki page with BeautifulSoup

I am trying to read a wiki page with urllib and Beautiful Soup, as shown below. I tried to follow this. import urllib.parse as parse, urllib.request as request from bs4 import BeautifulSoup name = "メインページ" root = 'https://ja.wikipedia.org/wiki/' url = root + parse.quote_plus(name) response = request.urlopen(url) html = response.read() print (h

Views: 14 | Asked: 2019-10-02 | Votes: 0

Answer accepted

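1 answer

Importing Yahoo Finance stock prices with Beautiful Soup and requests

So I have a script that checks stock prices. Yahoo changed something, and now I get the % change instead of the stock price. Below is the original script. When I run it, I get "+0.70 (+0.03%)" instead of 2,477.83. The only difference I really see is: data-reactid="36" versus data-reactid="35". When I change it to 35, it fails. 36 works, but only shows the % change. I want the stock price, not the % change. Thanks for your help! import urllib.request from bs4 import BeautifulSoup # S&P 500 page = urllib.req

Views: 3 | Asked: 2017-07-27 | Votes: 0

Answer accepted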

1 answer

UnicodeDecodeError Python error

I am trying to write a Python Google API and ran into some Unicode problems. My really basic PoC so far is: #!/usr/bin/env python import urllib2 from bs4 import BeautifulSoup query = "filetype%3Apdf" url = "http://www.google.com/search?sclient=psy-ab&hl=en&site=&source=hp&q="+query+"&btnG=Search"

Views: 1 | Asked: 2012-09-15 | Votes: 0

Answer accepted

2 answers

BeautifulSoup Chinese character encoding error

I am trying to identify and save all the headings on a particular site, and keep getting what I believe is an encoding error. The current code is: holder = {} url = urllib.urlopen('http://paper.people.com.cn/rmrb/html/2016-05/06/nw.D110000renmrb_20160506_2-01.htm').read() soup = BeautifulSoup(url, 'lxml') head1 = soup.find_all(['h1','h2','h3']) prin

Views: 6 | Asked: 2016-05-08 | Votes: 4

Answer accepted

2 answers

BeautifulSoup returns too much content for <meta> tags

Basically, I am trying to get all the meta tags from a website with bs4. import urllib.request from bs4 import BeautifulSoup response = urllib.request.urlopen("https://grab.careers/").read() response_decode = response.decode('utf-8') soup = BeautifulSoup(response_decode,"html.parser") metatags = soup.find_all('meta

Views: 30 | Asked: 2020-02-25 | Votes: 1

Answer accepted

1 answer

How do I convert an mbox to a JSON structure?

I am trying to convert an mbox into a JSON structure suitable for import into MongoDB. I am using the mailbox chapter of Mining the Social Web, 2nd Edition, but it does not work properly. import sys import mailbox import email import quopri import json import time from BeautifulSoup import BeautifulSoup from dateutil.parser import parse MBOX = 're

Views: 1 | Asked: 2014-02-25 | Votes: 3

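1 answer

Counting substrings in an HTML page with BeautifulSoup

I need to find and count all occurrences of the words "python" and "c++" as substrings in the HTML code, using the BeautifulSoup module. In Wikipedia these words appear 1 and 9 times respectively. Why does my code print 0 and 0? from urllib.request import urlopen, urlretrieve from bs4 import BeautifulSoup resp = urlopen("https://stepik.org/media/attachments/lesson/209717/1.html") html = resp.read().decode(

Views: 18 | Asked: 2020-07-15 | Votes: 0

Answer accepted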
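
The truncated snippet does not show why the counts come out as 0, so the following is only a sketch of plain substring counting on the decoded HTML string, not a diagnosis. It assumes case-sensitive, non-overlapping matches are wanted; the function name count_substrings and the sample markup are hypothetical. When a regular expression is used instead, "c++" has to be escaped because "+" is a regex metacharacter.

```python
import re


def count_substrings(text, needles):
    """Count case-sensitive, non-overlapping occurrences of each needle."""
    return {needle: text.count(needle) for needle in needles}


sample_html = "<p>python and c++</p><p>Python, C++, python3</p>"
counts = count_substrings(sample_html, ["python", "c++"])

# Regex equivalent for "c++": re.escape() neutralises the "+" characters.
regex_count = len(re.findall(re.escape("c++"), sample_html))
```

Note that "Python" and "C++" with capital letters are not counted here; lowercasing the text first with text.lower() would include them.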

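1 answer

BeautifulSoup cannot parse HTML with "html5lib"

BeautifulSoup fails to parse an HTML page with the html5lib option, but works normally with the html.parser option. html5lib is supposed to be more lenient than html.parser, so why do I get garbled text when I use it to parse the HTML page? Below is a small executable example. (After replacing html5lib with html.parser, the Chinese output is normal.) #_*_coding:utf-8_*_ import requests from bs4 import BeautifulSoup ss = requests.Session() res = ss.get("http://tech.qq.com/a/2015

Views: 5 | Asked: 2015-12-25 | Votes: 1

Answer accepted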

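1 answer

How do I access the place type (widget-pane-link) in Google from Python using Beautiful Soup?

I am trying to get the label American restaurant, the name El Merendero and the rating 4.1 using Python and Beautiful Soup, but I cannot get to them. Any suggestions on how to find this Google Maps category? What I get is this text with a JavaScript error, I guess: Enable JavaScript to view Google. My code: import requests from bs4 import BeautifulSoup Query = " Restaurant el merendero " durl= "https://www.google.com/maps/search/?api=1&query=%s"

Views: 0 | Asked: 2019-04-07 | Votes: 0

Answer accepted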

2 answers

Error with the Python Beautiful Soup module

I am using the code below to try web scraping. import sys , os import requests, webbrowser,bs4 from PIL import Image import pyautogui p = requests.get('http://www.goal.com/en-ie/news/ozil-agent-eviscerates-jealous-keown-over-stupid-comments/1javhtwzz72q113dnonn24mnr1') n = open("exml.txt" , 'wb') for i

Views: 34 | Asked: 2018-06-05 | Votes: -1