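I have searched through many scraping posts but cannot find an example like mine. I am following the R vignette example for SelectorGadget (https://blog.rstudio.com/2014/11/24/rvest-easy-web-scraping-with-r/), substituting my own use case where necessary. None of the selectors SelectorGadget suggests gets me what I want. I need to extract the name attached to each review on the page. Under the hood, a name looks like this: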
<span itemprop="name" class="sg_selected">This Name</span>
Here is my code. Ideally, it should give me the individual names on this web page.
library(rvest)
library(dplyr)
dsa_reviews <-
read_html("https://www.directsalesaid.com/companies/traveling-vineyard#reviews")
review_names <- html_nodes(dsa_reviews,'#reviews span')
df <- bind_rows(lapply(xml_attrs(review_names), function(x)
  data.frame(as.list(x), stringsAsFactors=FALSE)))
Apologies if this is a duplicate question or if it is formatted incorrectly. Please feel free to request any necessary edits.
Posted on 2017-10-10 20:53:15
Here it is:
library(rvest)
library(dplyr)
dsa_reviews <-
read_html("https://www.directsalesaid.com/companies/traveling-vineyard#reviews")
html_nodes(dsa_reviews,'[itemprop=name]') %>%
html_text()
[1] "Traveling Vineyard" ""
[3] "Kiersten Ray-kuhn" "Miley Sama"
[5] " Nancy Shawtone " "Amanda Moore"
[7] "Matt" "Kathy Barzal"
[9] "Lesa Brinker" "Lori Stryker"
[11] "Jeanette Holtman" "Penny Notarnicola"
[13] "Laura Ann" "Nicole Lafave"
[15] "Gretchen Hess Miller" "Gina Devine"
[17] "Ashley Lawton Converse" "Morgan Williams"
[19] "Angela Baston Mckeone" "Traci Feshler"
[21] "Kisha Marshall Dlugos" "Jody Cole Dvorak"
Colin
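The difference between the two approaches can be seen on a small inline document. The sketch below is only an illustration: the HTML is a hypothetical stand-in for the review markup (the real page is not fetched), and scoping the selector to `#reviews` assumes the reviewer names sit inside that container. It shows that `xml_attrs()` returns attribute values rather than the visible text, and that `html_text(trim = TRUE)` plus a filter removes the padded and empty entries seen in the output above.

```r
library(rvest)

# Hypothetical stand-in for the page markup; not the real site.
page <- read_html('
<html><body>
  <h1 itemprop="name">Traveling Vineyard</h1>
  <div id="reviews">
    <span itemprop="name"> Nancy Shawtone </span>
    <span itemprop="name"></span>
    <span itemprop="name">Matt</span>
    <span class="rating">5</span>
  </div>
</body></html>')

# xml_attrs() yields each node's attributes (itemprop, class),
# not the text between the tags.
attrs <- xml2::xml_attrs(html_nodes(page, "#reviews span"))

# Assumption: names live inside #reviews, so the attribute selector
# is scoped there; trim = TRUE strips surrounding whitespace.
review_names <- page %>%
  html_nodes("#reviews [itemprop=name]") %>%
  html_text(trim = TRUE)

# Drop entries that are empty after trimming.
review_names <- review_names[review_names != ""]
print(review_names)
# [1] "Nancy Shawtone" "Matt"
```

If the names on the real page turn out not to be nested under `#reviews`, the unscoped selector `[itemprop=name]` from the answer still applies, and the company name can be dropped afterwards by position.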
https://stackoverflow.com/questions/46675642