
Q: Python read_html: No tables found

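Stack Overflow user

Asked on 2022-02-23 02:45:05

Answers 1 · Views 1.2K · Followers 0 · Votes 0

I am very new to Python and am trying to do my own data analysis.

I am trying to parse the data from this website: https://www.tsn.ca/nhl/statistics

I would like the table in DataFrame format.

I tried this: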

import pandas as pd

players_list_unclean = pd.read_html('https://www.sportsnet.ca/hockey/nhl/players/?season=2021&?seasonType=reg&tab=Skaters')

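I get the following error:

raise ValueError("No tables found") ValueError: No tables found

I can see that there is a table on the page, but for some reason it is not being read.

I found another Stack Overflow solution that suggests using selenium:

pandas read_html ValueError: No tables found

However, when I tried to implement that code, I could not find a table ID in the HTML page source. Does anyone know of another approach? I have tried other websites, but I end up with the same problem.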

from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Firefox()
driver.get("https://www.wunderground.com/personal-weather-station/dashboard?ID=KMAHADLE7#history/tdata/s20170201/e20170201/mcustom.html")
# Locate the table by its id, then its header and body sections
elem = driver.find_element(By.ID, "history_table")

head = elem.find_element(By.TAG_NAME, "thead")
body = elem.find_element(By.TAG_NAME, "tbody")

list_rows = []

# find_elements (plural) returns a list of rows that can be iterated
for items in body.find_elements(By.TAG_NAME, "tr"):
    list_cells = []
    for item in items.find_elements(By.TAG_NAME, "td"):
        list_cells.append(item.text)
    list_rows.append(list_cells)
driver.close()

pandas

selenium

python

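1 Answer

Stack Overflow user

Accepted answer

Posted on 2022-02-23 04:08:07

There is no table in the page, but you are in luck, because the data comes from a fetch request to this endpoint: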

https://datacrunch.9c9media.ca/statsapi/sports/hockey/leagues/nhl/sortablePlayerSeasonStats/skater?brand=tsn&type=json&seasonType=regularSeason&season=2021
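
The endpoint above returns JSON, so it can be loaded without `read_html` or selenium. What follows is only a minimal sketch: the helper names `fetch_json` and `to_frame` are hypothetical, the `User-Agent` header is a precaution rather than a documented requirement, and the shape of the response (a list of records, or a dict wrapping a single list of records) is an assumption that should be checked against the actual payload.

```python
import json
from urllib.request import Request, urlopen

import pandas as pd

# URL taken from the answer above
STATS_URL = (
    "https://datacrunch.9c9media.ca/statsapi/sports/hockey/leagues/nhl/"
    "sortablePlayerSeasonStats/skater"
    "?brand=tsn&type=json&seasonType=regularSeason&season=2021"
)


def fetch_json(url, timeout=30):
    """Download a URL and decode its body as JSON (hypothetical helper)."""
    # Assumption: some servers reject requests without a browser-like User-Agent
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)


def to_frame(payload):
    """Flatten decoded JSON into a DataFrame (hypothetical helper).

    Assumption: the payload is either a list of records, or a dict that
    wraps exactly one list of records. Anything else becomes a single row.
    """
    if isinstance(payload, dict):
        lists = [value for value in payload.values() if isinstance(value, list)]
        payload = lists[0] if len(lists) == 1 else [payload]
    # json_normalize expands nested dicts into dotted column names
    return pd.json_normalize(payload)
```

Calling `to_frame(fetch_json(STATS_URL))` should then give a DataFrame with one row per player, provided the response matches the assumed shape; inspecting the raw JSON first is the safest way to confirm that.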

Votes 0

Original page content provided by Stack Overflow.

Original link:

https://stackoverflow.com/questions/71230788
