首页
学习
活动
专区
圈层
工具
发布
首页
学习
活动
专区
圈层
工具
MCP广场
社区首页 >问答首页 >从网页中抓取数据

从网页中抓取数据
EN

Stack Overflow用户
提问于 2019-11-01 21:03:24
回答 1查看 117关注 0票数 0

我试图从以下网页https://www.cricbuzz.com/live-cricket-scorecard/10711/aus-vs-ind-1st-test-india-in-australia-test-series-2011-12抓取数据,我需要一个表格格式的记分板。有人能帮我吗?我正在使用python3。我对网络抓取是个新手,对网页的内部原理不是很熟悉。提前感谢!

我试着在urllib2中使用BeautifulSoup,等等,但是什么地方都没找到。

EN

回答 1

Stack Overflow用户

发布于 2019-11-01 23:00:23

你可以使用熊猫的read_html()。这将返回一个数据帧列表。你在那里做什么由你决定。您可能需要对数据进行一些整理,但我只是将它们放入一个大表中,以便向您展示。

代码语言:javascript
运行
复制
import pandas as pd

url = 'https://m.cricbuzz.com/live-cricket-scorecard/10711/aus-vs-ind-1st-test-india-in-australia-test-series-2011-12'
dfs = pd.read_html(url)

result =  pd.concat( [ df for df in dfs ] )

输出:

代码语言:javascript
运行
复制
print (result.to_string())
                      0                     1                     2                     3                     4
0               Batting                     R                     B                    4s                    6s
0              Ed Cowan                    68                   177                     7                     0
1  c M Dhoni b R Ashwin  c M Dhoni b R Ashwin  c M Dhoni b R Ashwin  c M Dhoni b R Ashwin  c M Dhoni b R Ashwin
0          David Warner                    37                    49                     4                     1
1   c M Dhoni b U Yadav   c M Dhoni b U Yadav   c M Dhoni b U Yadav   c M Dhoni b U Yadav   c M Dhoni b U Yadav
0           Shaun Marsh                     0                     6                     0                     0
1   c V Kohli b U Yadav   c V Kohli b U Yadav   c V Kohli b U Yadav   c V Kohli b U Yadav   c V Kohli b U Yadav
0         Ricky Ponting                    62                    94                     6                     0
1  c V Laxman b U Yadav  c V Laxman b U Yadav  c V Laxman b U Yadav  c V Laxman b U Yadav  c V Laxman b U Yadav
0        Michael Clarke                    31                    68                     5                     0
1              b Z Khan              b Z Khan              b Z Khan              b Z Khan              b Z Khan
0        Michael Hussey                     0                     1                     0                     0
1    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan
0           Brad Haddin                    27                    69                     1                     0
1   c V Sehwag b Z Khan   c V Sehwag b Z Khan   c V Sehwag b Z Khan   c V Sehwag b Z Khan   c V Sehwag b Z Khan
0          Peter Siddle                    41                   100                     4                     0
1    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan    c M Dhoni b Z Khan
0       James Pattinson                    18                    54                     2                     0
1               not out               not out               not out               not out               not out
0        Ben Hilfenhaus                    19                    32                     3                     0
1  c V Kohli b R Ashwin  c V Kohli b R Ashwin  c V Kohli b R Ashwin  c V Kohli b R Ashwin  c V Kohli b R Ashwin
0           Nathan Lyon                     6                    11                     1                     0
1            b R Ashwin            b R Ashwin            b R Ashwin            b R Ashwin            b R Ashwin
0                Bowler                     O                     M                     R                     W
1           Zaheer Khan                    31                     6                    77                     4
2         Ishant Sharma                    24                     7                    48                     0
3           Umesh Yadav                    26                     5                   106                     3
4   Ravichandran Ashwin                    29                     3                    81                     3
0                  Home           Live Scores                   NaN                   NaN                   NaN
1              Schedule                  News                   NaN                   NaN                   NaN
2            Editorials                Photos                   NaN                   NaN                   NaN
3              Archives               Players                   NaN                   NaN                   NaN
4              Rankings                Series                   NaN                   NaN                   NaN
5                  Poll                Videos                   NaN                   NaN                   NaN
6          Points Table            Contact Us                   NaN                   NaN                   NaN
7       Cricbuzz TV Ads    Careers @ Cricbuzz                   NaN                   NaN                   NaN
8           Mobile Apps    This day that year                   NaN                   NaN                   NaN
9          Wickets Zone                   NaN                   NaN                   NaN                   NaN
0           Mobile Apps       Social Channels                   NaN                   NaN                   NaN
1                iPhone              facebook                   NaN                   NaN                   NaN
2               Android               twitter                   NaN                   NaN                   NaN
票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/58659692

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档